• DocumentCode
    2259009
  • Title

    An unsupervised method for lexical acquisition based on Bootstrapping

  • Author

    Zhang, Yuhan ; Yanquan Zhou

  • Author_Institution
    Res. Center of Intell. Sci. & Technol., Beijing Univ. of Posts & Telecommun., Beijing, China
  • fYear
    2009
  • fDate
    24-27 Sept. 2009
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    In this paper, we present an unsupervised method called Mutual Screening Graph Algorithm based on Bootstrapping (MSGA-Bootstrapping) for lexical acquisition. Bootstrapping is a weakly supervised algorithm that has been the focus of attention in many Natural Language Processing(NLP) and Information Extraction(IE) fields, especially in learning semantic lexicons. Our approach only needs unannotated corpuses to learn new words for each semantic category. MSGA-Bootstrapping hypothesizes the semantic class of a word based on collective information over a large body of extraction pattern contexts and the extraction patterns and words can mutual reinforced. Although there are some former algorithms on this task, their precision and stability can be enhanced. By counting on the impact of both the quality information and quantity information of words and patterns when scoring the words and patterns created by them, we improve the former bootstrapping algorithm. We also make MSGA-Bootstrapping run as an unsupervised method by changing the order of its processing. Experiments have shown that MSGA can outperform previous bootstrapping algorithm Basilisk and GMR (Graph Mutual Reinforcement based Bootstrapping). And the result of using MSGA-Bootstrapping as an unsupervised method is acceptable.
  • Keywords
    computer bootstrapping; natural language processing; unsupervised learning; bootstrapping; extraction patterns; information extraction; lexical acquisition method; mutual screening graph algorithm; natural language processing; unsupervised method; Automobiles; Data mining; Dictionaries; Learning systems; Manuals; Natural language processing; Natural languages; Stability; Unsupervised learning; Vehicle dynamics; Bootstrapping; Lexical acquisition; unsupervised method;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2009. NLP-KE 2009. International Conference on
  • Conference_Location
    Dalian
  • Print_ISBN
    978-1-4244-4538-7
  • Electronic_ISBN
    978-1-4244-4540-0
  • Type

    conf

  • DOI
    10.1109/NLPKE.2009.5313737
  • Filename
    5313737