• DocumentCode
    2019161
  • Title

    Combining dependency and constituent-based resources for structure disambiguation

  • Author

    Galicia-haro, Soffa N. ; Gelbukh, Alexander ; Bolshakov, IgorA

  • Author_Institution
    Center for Comput. Res., Instituto Politecnico Nacional, Mexico City, Mexico
  • Volume
    5
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    2873
  • Abstract
    Unrestricted text analysis requires an accurate syntactic analysis but structural ambiguity is one of the most difficult problems to resolve. Researchers have tried different approaches to obtain the correct syntactic structure from analyzed sentences but no successful results have been obtained. Two different approaches have traditionally applied to syntactic analysis: constituent grammars and dependency grammars. We propose a model for syntactic analysis and disambiguation combining lexical dependencies and semantic proximity. Lexical dependencies are applied by means of a government pattern dictionary following the dependency approach. The semantic proximity is introduced by means of semantic closeness among constituents. Examples are given to illustrate the method´s contributions
  • Keywords
    computational linguistics; context-free grammars; government data processing; natural languages; text analysis; constituent grammars; constituent-based resources; dependency approach; dependency grammars; government pattern dictionary; lexical dependencies; semantic closeness; semantic proximity; structural ambiguity; structure disambiguation; syntactic analysis; syntactic structure; unrestricted text analysis; Dictionaries; Government; Information analysis; Mutual information; Natural language processing; Natural languages; Samarium; Statistical analysis; Tagging; Text analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man, and Cybernetics, 2001 IEEE International Conference on
  • Conference_Location
    Tucson, AZ
  • ISSN
    1062-922X
  • Print_ISBN
    0-7803-7087-2
  • Type

    conf

  • DOI
    10.1109/ICSMC.2001.971945
  • Filename
    971945