Keyword Detection in Text Summarization

Thomas, Justine Raju (2015) Keyword Detection in Text Summarization. MTech thesis.



Summarization is the process of reducing a text document in order to create a summary that retains the most important points of the original document. As the problem of information overload has grown, and as the quantity of data has increased, so has interest in automatic summarization. Extractive summary works on the given text to extract sentences that best convey the message hidden in the text. Most extractive summarization techniques revolve around the concept of indexing keywords and extracting sentences that have more keywords than the rest. Keyword extraction usually is done by extracting important words having a higher frequency than others, with stress on important. However the current techniques to handle this importance include a stop list which might include words that are critically important to the text. In this thesis, I present a work in progress to define an algorithm to extract truly significant keywords which might have lost its significance if subjected to the current keyword extraction algorithms.

Item Type:Thesis (MTech)
Uncontrolled Keywords:Keyword detection, Natural language processing, Single document summarization, Text summarization, Keyword extraction
Subjects:Engineering and Technology > Computer and Information Science > Data Mining
Divisions: Engineering and Technology > Department of Computer Science
ID Code:7964
Deposited By:Mr. Sanat Kumar Behera
Deposited On:23 Jun 2016 20:37
Last Modified:23 Jun 2016 20:37
Supervisor(s):Sathyababu, K

Repository Staff Only: item control page