• DocumentCode
    538053
  • Title

    Semi-automatic extension of morphological lexica

  • Author

    Kaufmann, Tobias ; Pfister, Beat

  • Author_Institution
    Speech Process. Group, ETH Zurich, Zurich, Switzerland
  • fYear
    2010
  • fDate
    18-20 Oct. 2010
  • Firstpage
    403
  • Lastpage
    409
  • Abstract
    We present a tool that facilitates the efficient extension of morphological lexica. The tool exploits information from a morphological lexicon, a morphological grammar and a text corpus to guide the acquisition process. In particular, it employs statistical models to analyze out-of-vocabulary words and predict lexical information. These models do not require any additional labeled data for training. Furthermore, they are based on generic features that are not specific to any particular language. This paper describes the general design of the tool and evaluates the accuracy of its machine learning components.
  • Keywords
    computational linguistics; learning (artificial intelligence); natural language processing; statistical analysis; text analysis; machine learning; morphological grammar; morphological lexicon; statistical model; text corpus; words analysis; Accuracy; Compounds; Context; Grammar; Joints; Pragmatics; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Information Technology (IMCSIT), Proceedings of the 2010 International Multiconference on
  • Conference_Location
    Wisla
  • ISSN
    2157-5525
  • Print_ISBN
    978-1-4244-6432-6
  • Type

    conf

  • DOI
    10.1109/IMCSIT.2010.5679738
  • Filename
    5679738