Title :
An unsupervised method for lexical acquisition based on Bootstrapping
Author :
Zhang, Yuhan ; Zhou, Yanquan
Author_Institution :
Res. Center of Intell. Sci. & Technol., Beijing Univ. of Posts & Telecommun., Beijing, China
Abstract :
In this paper, we present an unsupervised method for lexical acquisition called the Mutual Screening Graph Algorithm based on Bootstrapping (MSGA-Bootstrapping). Bootstrapping is a weakly supervised algorithm that has been the focus of attention in many Natural Language Processing (NLP) and Information Extraction (IE) fields, especially in learning semantic lexicons. Our approach needs only unannotated corpora to learn new words for each semantic category. MSGA-Bootstrapping hypothesizes the semantic class of a word based on collective information over a large body of extraction pattern contexts, and the extraction patterns and words mutually reinforce each other. Although earlier algorithms exist for this task, their precision and stability can be improved. We improve on the earlier bootstrapping algorithms by taking into account both the quality and the quantity information of words and patterns when scoring the words and patterns they produce. We also make MSGA-Bootstrapping run as an unsupervised method by changing the order of its processing. Experiments show that MSGA can outperform the previous bootstrapping algorithms Basilisk and GMR (Graph Mutual Reinforcement based Bootstrapping), and that the results of using MSGA-Bootstrapping as an unsupervised method are acceptable.
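The mutual reinforcement between extraction patterns and words described in the abstract can be illustrated with a minimal Python sketch. This is not the MSGA-Bootstrapping scoring itself, which the abstract does not specify. It is a generic pattern/word bootstrapping loop under assumed scoring rules: a pattern's score combines the fraction of its extractions already in the lexicon (a stand-in for "quality") with a log of how many lexicon words it extracts (a stand-in for "quantity"), and candidate words are scored the same way from the patterns that extract them. The function name `bootstrap_lexicon`, its parameters, and the toy data are all hypothetical.

```python
import math
from collections import defaultdict


def bootstrap_lexicon(pairs, seeds, rounds=2, words_per_round=1):
    """Grow a category lexicon from (pattern, word) co-occurrences.

    Assumed scoring (illustrative only, not the paper's formulas):
      score = (quality ratio or mean) * log2(1 + supporting count)
    """
    words_by_pattern = defaultdict(set)
    patterns_by_word = defaultdict(set)
    for pattern, word in pairs:
        words_by_pattern[pattern].add(word)
        patterns_by_word[word].add(pattern)

    lexicon = set(seeds)
    for _ in range(rounds):
        # Score each pattern from the lexicon words it extracts.
        pattern_score = {}
        for pattern, words in words_by_pattern.items():
            hits = len(words & lexicon)
            if hits:
                pattern_score[pattern] = (hits / len(words)) * math.log2(1 + hits)

        # Score each candidate word from the scored patterns that extract it.
        word_score = {}
        for word, patterns in patterns_by_word.items():
            if word in lexicon:
                continue
            scored = [pattern_score[p] for p in patterns if p in pattern_score]
            if scored:
                mean = sum(scored) / len(scored)
                word_score[word] = mean * math.log2(1 + len(scored))

        if not word_score:
            break
        # Add the best candidates; ties are broken alphabetically.
        ranked = sorted(word_score, key=lambda w: (-word_score[w], w))
        lexicon.update(ranked[:words_per_round])
    return lexicon


# Toy co-occurrence data (hypothetical), with "<X>" marking the extracted slot.
PAIRS = [
    ("drove a <X>", "car"), ("drove a <X>", "truck"), ("drove a <X>", "bus"),
    ("parked the <X>", "car"), ("parked the <X>", "truck"), ("parked the <X>", "van"),
    ("ate the <X>", "apple"), ("ate the <X>", "bread"),
    ("bought a <X>", "car"), ("bought a <X>", "apple"),
    ("bought a <X>", "van"), ("bought a <X>", "bread"),
]

if __name__ == "__main__":
    print(sorted(bootstrap_lexicon(PAIRS, {"car", "truck"})))
```

In this toy run each newly accepted word raises the scores of the patterns that extract it, which in turn changes the ranking of the remaining candidates in the next round. That feedback loop is the only part of the method the sketch is meant to convey.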
Keywords :
computer bootstrapping; natural language processing; unsupervised learning; bootstrapping; extraction patterns; information extraction; lexical acquisition method; mutual screening graph algorithm; unsupervised method; Automobiles; Data mining; Dictionaries; Learning systems; Manuals; Natural languages; Stability; Vehicle dynamics; Lexical acquisition
Conference_Title :
Natural Language Processing and Knowledge Engineering, 2009. NLP-KE 2009. International Conference on
Conference_Location :
Dalian
Print_ISBN :
978-1-4244-4538-7
Electronic_ISBN :
978-1-4244-4540-0
DOI :
10.1109/NLPKE.2009.5313737