Ankita Wankhede edited this page Jul 26, 2017 · 3 revisions

TASK
-------------
For this assignment, we are going to summarize a text file using the techniques learned. The input of the project is one text file. Use the simple approach below to summarize a text file:

1. Read the file.
2. Remove all the words like “a”, “the”, “!”, “?”, ... that do not carry meaning, using the stopwords in NLTK.
3. Apply lemmatization to the remaining words.
4. Using POS tagging, remove all the verbs.
5. Calculate the word frequency of the remaining words.
6. Choose the top five words that are repeated most.
7. Go through the original text that you had in the file.
8. Find all the sentences that contain those most repeated words.
9. Extract those sentences and concatenate them.
10. Enjoy the summarization.
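The steps above can be sketched roughly as follows. This is a minimal, standard-library-only outline and not a reference solution: the names `summarize`, `summarize_file`, `FALLBACK_STOPWORDS`, `split_sentences`, and `tokenize` are hypothetical, and the stopword list, lemmatizer, and verb filter are simple stand-ins. For the actual assignment they would presumably be replaced with NLTK's `stopwords.words("english")`, `WordNetLemmatizer().lemmatize`, and a filter built on `nltk.pos_tag`, as noted in the comments.

```python
import re
from collections import Counter
from pathlib import Path

# Stand-in stopword list (an assumption for this sketch); the task asks for
# nltk.corpus.stopwords.words("english") instead.
FALLBACK_STOPWORDS = frozenset(
    "a an the and or of to in on is are was it this that for with as by at".split()
)


def split_sentences(text):
    # Naive sentence splitter; nltk.sent_tokenize is the more robust choice.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(text):
    # Lowercased alphabetic tokens only, so punctuation such as "!" and "?" is dropped.
    return re.findall(r"[a-z]+", text.lower())


def summarize(text, stopwords=FALLBACK_STOPWORDS, lemmatize=lambda w: w,
              drop_verbs=lambda words: words, top_n=5):
    """Return (top_words, summary) following steps 2-9 of the task.

    lemmatize: word -> lemma, e.g. nltk.stem.WordNetLemmatizer().lemmatize
    drop_verbs: list of words -> list without verbs, e.g.
        lambda ws: [w for w, tag in nltk.pos_tag(ws) if not tag.startswith("VB")]
    The defaults do nothing, so this sketch runs without NLTK data installed.
    """
    words = [w for w in tokenize(text) if w not in stopwords]  # step 2
    words = [lemmatize(w) for w in words]                      # step 3
    words = drop_verbs(words)                                  # step 4
    freq = Counter(words)                                      # step 5
    top_words = {w for w, _ in freq.most_common(top_n)}        # step 6

    # Steps 7-9: walk the original sentences and keep those that mention
    # at least one of the top words, then join them in their original order.
    picked = [
        s for s in split_sentences(text)
        if top_words & {lemmatize(t) for t in tokenize(s)}
    ]
    return top_words, " ".join(picked)


def summarize_file(path, **kwargs):
    # Step 1: read the whole input file as UTF-8 text.
    return summarize(Path(path).read_text(encoding="utf-8"), **kwargs)
```

Calling `summarize_file("input.txt")` (the file name is only a placeholder) returns the set of most frequent words and the concatenated sentences. Passing the lemmatizer and verb filter in as parameters keeps the pipeline order from the task visible while leaving the choice of NLTK components open.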
