Title :
Word Segmentation Using Domain Knowledge Based on Conditional Random Fields
Author :
Fukuda, Takuya ; Izumi, Masataka ; Miura, Takao
Author_Institution :
Hosei Univ., Tokyo
Abstract :
We propose an experimental approach to Japanese word segmentation in domain-dependent settings. We apply Conditional Random Fields (CRFs) to this problem. A CRF learns several probabilistic parameters from training data through feature functions, which here are chosen to depend on the domain. We describe how to define such domain-specific feature functions.
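As an illustration of what a domain-specific feature function might look like, the following minimal sketch computes per-character features for a CRF-style sequence labeler, including two features that fire when a character begins or lies inside a term from a domain lexicon. It is not the authors' method: the lexicon `DOMAIN_TERMS`, the function names, the feature names, and the `max_len` window are all hypothetical choices made for this example.

```python
from typing import Dict, List, Set

# Hypothetical domain lexicon (illustrative only; not taken from the paper).
DOMAIN_TERMS: Set[str] = {"形態素", "解析"}


def char_features(text: str, i: int, domain_terms: Set[str],
                  max_len: int = 4) -> Dict[str, object]:
    """Features for the character at position i.

    Generic features: the character itself and its neighbours.
    Domain features: whether position i starts, or lies inside,
    a substring of at most max_len characters found in domain_terms.
    """
    feats: Dict[str, object] = {
        "char": text[i],
        "prev_char": text[i - 1] if i > 0 else "<BOS>",
        "next_char": text[i + 1] if i + 1 < len(text) else "<EOS>",
        "domain_begin": False,
        "domain_inside": False,
    }
    # Enumerate every substring of length <= max_len that covers position i.
    for start in range(max(0, i - max_len + 1), i + 1):
        for end in range(i + 1, min(len(text), start + max_len) + 1):
            if text[start:end] in domain_terms:
                if start == i:
                    feats["domain_begin"] = True
                else:
                    feats["domain_inside"] = True
    return feats


def sentence_features(text: str,
                      domain_terms: Set[str]) -> List[Dict[str, object]]:
    """One feature dictionary per character of the sentence."""
    return [char_features(text, i, domain_terms) for i in range(len(text))]


if __name__ == "__main__":
    for ch, f in zip("形態素解析を行う",
                     sentence_features("形態素解析を行う", DOMAIN_TERMS)):
        print(ch, f["domain_begin"], f["domain_inside"])
```

Feature dictionaries of this shape could be paired with per-character boundary labels (for example, "B" for word-initial and "I" for word-internal characters) and passed to a CRF trainer, which would then learn a weight for each feature; swapping the lexicon is one simple way to adapt the same model to a different domain.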
Keywords :
learning (artificial intelligence); natural language processing; probability; random processes; text analysis; Japanese word segmentation; conditional random field; domain knowledge; domain specific feature function; probabilistic parameter; text processing; training data; Artificial intelligence; Dictionaries; Natural languages; Pattern analysis; Speech; Statistics; Stochastic processes; Training data
Conference_Title :
Tools with Artificial Intelligence, 2007. ICTAI 2007. 19th IEEE International Conference on
Conference_Location :
Patras
Print_ISBN :
978-0-7695-3015-4
DOI :
10.1109/ICTAI.2007.93