DocumentCode
2259009
Title
An unsupervised method for lexical acquisition based on Bootstrapping
Author
Zhang, Yuhan ; Yanquan Zhou
Author_Institution
Res. Center of Intell. Sci. & Technol., Beijing Univ. of Posts & Telecommun., Beijing, China
fYear
2009
fDate
24-27 Sept. 2009
Firstpage
1
Lastpage
7
Abstract
In this paper, we present an unsupervised method called Mutual Screening Graph Algorithm based on Bootstrapping (MSGA-Bootstrapping) for lexical acquisition. Bootstrapping is a weakly supervised algorithm that has been the focus of attention in many Natural Language Processing(NLP) and Information Extraction(IE) fields, especially in learning semantic lexicons. Our approach only needs unannotated corpuses to learn new words for each semantic category. MSGA-Bootstrapping hypothesizes the semantic class of a word based on collective information over a large body of extraction pattern contexts and the extraction patterns and words can mutual reinforced. Although there are some former algorithms on this task, their precision and stability can be enhanced. By counting on the impact of both the quality information and quantity information of words and patterns when scoring the words and patterns created by them, we improve the former bootstrapping algorithm. We also make MSGA-Bootstrapping run as an unsupervised method by changing the order of its processing. Experiments have shown that MSGA can outperform previous bootstrapping algorithm Basilisk and GMR (Graph Mutual Reinforcement based Bootstrapping). And the result of using MSGA-Bootstrapping as an unsupervised method is acceptable.
Keywords
computer bootstrapping; natural language processing; unsupervised learning; bootstrapping; extraction patterns; information extraction; lexical acquisition method; mutual screening graph algorithm; natural language processing; unsupervised method; Automobiles; Data mining; Dictionaries; Learning systems; Manuals; Natural language processing; Natural languages; Stability; Unsupervised learning; Vehicle dynamics; Bootstrapping; Lexical acquisition; unsupervised method;
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Language Processing and Knowledge Engineering, 2009. NLP-KE 2009. International Conference on
Conference_Location
Dalian
Print_ISBN
978-1-4244-4538-7
Electronic_ISBN
978-1-4244-4540-0
Type
conf
DOI
10.1109/NLPKE.2009.5313737
Filename
5313737
Link To Document