[PDF][PDF] An Accurate Persian Part-of-Speech Tagger.
Comput. Syst. Sci. Eng., 2020•academia.edu
The processing of any natural language requires that the grammatical properties of every
word in that language are tagged by a part of speech (POS) tagger. To present a more
accurate POS tagger for the Persian language, we propose an improved and accurate
tagger called IAoM that supports properties of text to speech systems such as Lexical Stress
Search, Homograph words Disambiguation, Break Phrase Detection, and main aspects of
Persian morphology. IAoM uses Maximum Likelihood Estimation (MLE) to determine the …
word in that language are tagged by a part of speech (POS) tagger. To present a more
accurate POS tagger for the Persian language, we propose an improved and accurate
tagger called IAoM that supports properties of text to speech systems such as Lexical Stress
Search, Homograph words Disambiguation, Break Phrase Detection, and main aspects of
Persian morphology. IAoM uses Maximum Likelihood Estimation (MLE) to determine the …
The processing of any natural language requires that the grammatical properties of every word in that language are tagged by a part of speech (POS) tagger. To present a more accurate POS tagger for the Persian language, we propose an improved and accurate tagger called IAoM that supports properties of text to speech systems such as Lexical Stress Search, Homograph words Disambiguation, Break Phrase Detection, and main aspects of Persian morphology. IAoM uses Maximum Likelihood Estimation (MLE) to determine the tags of unknown words. In addition, it uses a few defined rules for the sake of achieving high accuracy. For tagging the input corpus, IAoM uses a Hidden Markov Model (HMM) alongside the Viterbi algorithm. To present a fair evaluation, we have performed various experiments on both homogeneous and heterogeneous Persian corpora and studied the effect of the size of training set on the accuracy of IAoM. Experimental results demonstrate the merit of the proposed tagger in achieving an overall accuracy of 97.6%.
academia.edu
Showing the best result for this search. See all results