Title :
Dagger: The Slovak morphological classifier
Author :
Daniel Hládek;Ján Staš;Jozef Juhár
Author_Institution :
Department of Electronics and Multimedia Communications, Technical University of Koš
Abstract :
This paper proposes a classifier, based on hidden Markov model that can be used for solving the problem of part-of-speech tagging of the Slavic languages, such as Slovak, Czech or Polish. These languages are highly inflectional and morphologically rich and have a very large vocabulary. The probability matrices of the classical hidden Markov model are linearly interpolated with additional probability matrices that are calculated using a suffix-based word clustering function. The search space is restricted by a morphological dictionary.
Keywords :
"Hidden Markov models","Probability","Tagging","Training","Dictionaries","Mathematical model","Equations"
Conference_Titel :
ELMAR, 2012 Proceedings
Print_ISBN :
978-1-4673-1243-1