Parts of speech tagging using hidden Markov model, maximum entropy model and conditional random field

Anand, A (2014) Parts of speech tagging using hidden Markov model, maximum entropy model and conditional random field. BTech thesis.



Parts of Speech tagging assigns the suitable part of speech or in other words, the lexical category to every word in the sentence in Natural language. It is one of the essential tasks of Natural Language Processing. Parts of Speech tagging is the very first step following which various other processes as in chunking, parsing, named entity recognition etc. are performed. An adaptation of various machine learning methods are applied namely Hidden Markov Model (HMM), Maximum Entropy Model(MEM) and Conditional Random Field(CRF) . For HMM models, we have used the suffix information for smoothing of the emission probabilities, while for ME model, the suffix information is used as features. Similar case for the CRF as that used by ME model. The significant points brought about by thesis can be highlighted below: • Use of Hidden Markov Model for Parts Of Speech tagging purpose. To create a sophisticated tagger using small set of training corpus , resources like a Dictionary is used that improves the overall accuracy of the tagger. • Machine learning techniques have been introduced for acquiring discriminative approach. The Maximum Entropy Model and Conditional Random Field has been used for this task. Keywords: Hidden Markov Model, Maximum Entropy Model, Conditional Random Field, POS tagger.

Item Type:Thesis (BTech)
Uncontrolled Keywords:Machine Learning, Hidden Markov Model, POS Tagger, Conditional Random Field, Maximum Entropy Model
Subjects:Engineering and Technology > Computer and Information Science
Divisions: Engineering and Technology > Department of Computer Science
ID Code:6171
Deposited By:Hemanta Biswal
Deposited On:28 Aug 2014 13:58
Last Modified:28 Aug 2014 13:58
Supervisor(s):Rath, S K

Repository Staff Only: item control page