Title :
What we need is word, not morpheme; constructing word analyzer for Japanese
Author :
Kazuhide Yamamoto;Yuki Miyanishi;Kanji Takahashi;Yoshiki Inomata;Yuki Mikami;Yuta Sudo
Author_Institution :
Nagaoka University of Technology, Japan
Abstract :
This paper presents our work on building SNOWMAN, a Japanese word analyzer that is not a so-called “morphological analyzer.” Although several morphological analyzers are available, they all output morphemes, not words. As a result, they cannot recognize words that consist of multiple morphemes, such as idioms. Moreover, reducing orthographic variants is important in Japanese processing, yet current analyzers address it only partially. Our analyzer strives to solve both problems. We have produced an analyzer in which 320 thousand morphemes are merged into 287 thousand words, and 28 thousand words consisting of multiple morphemes are recognized as single words.
Keywords :
"Merging","Joining processes","Databases","Dictionaries","Manuals","Resource management","Uniform resource locators"
Conference_Title :
Asian Language Processing (IALP), 2015 International Conference on
Print_ISBN :
978-1-4673-9595-3
DOI :
10.1109/IALP.2015.7451529