• DocumentCode
    711106
  • Title

    An approach to automated thesaurus construction using clusterization-based dictionary analysis

  • Author

    Lagutina, Nadezhda ; Paramonov, Ilya ; Vorontsova, Inna ; Kasatkina, Natalia

  • Author_Institution
    P.G. Demidov Yaroslavl State Univ., Yaroslavl, Russia
  • fYear
    2015
  • fDate
    20-24 April 2015
  • Firstpage
    104
  • Lastpage
    109
  • Abstract
    In the paper an automated approach for construction of the terminological thesaurus for a specific domain is proposed. It uses an explanatory dictionary as the initial text corpus and a controlled vocabulary related to the target lexicon to initiate extraction of the terms for the thesaurus. Subdivision of the terms into semantic clusters is based on the CLOPE clustering algorithm. The approach diminishes the cost of the thesaurus creation by involving the expert only once during the whole construction process, and only for analysis of a small subset of the initial dictionary. To validate the performance of the proposed approach the authors successfully constructed a thesaurus in the cardiology domain.
  • Keywords
    computational linguistics; dictionaries; thesauri; CLOPE clustering algorithm; automated thesaurus construction; clusterization-based dictionary analysis; explanatory dictionary; semantic cluster; terminological thesaurus; Arteries; Clustering algorithms; Dictionaries; Heart; Semantics; Thesauri; Veins;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Open Innovations Association (FRUCT), 2015 17TH Conference of
  • Conference_Location
    Yaroslavl
  • ISSN
    2305-7254
  • Type

    conf

  • DOI
    10.1109/FRUCT.2015.7117979
  • Filename
    7117979