• DocumentCode
    445539
  • Title

    A genetic word clustering algorithm

  • Author

    Hernandez, German ; Bobadilla, Leonard ; Sanchez, Oscar

  • Author_Institution
    Comput. & Syst. Eng., Colombia Nat. Univ., Bogota, Colombia
  • Volume
    2
  • fYear
    2005
  • fDate
    2-5 Sept. 2005
  • Firstpage
    1075
  • Abstract
    In this work, a genetic word clustering algorithm, that classifies words present in the phrases of a linguistic corpus, is proposed. The underlying goal of word classification is to build a good probabilistic model of the language defined by the phrases in the corpus. Some experiments comparing the performance of the proposed algorithm with a classical word clustering algorithm were carried out.
  • Keywords
    classification; genetic algorithms; natural languages; pattern clustering; probability; text analysis; word processing; genetic word clustering; linguistic corpus; probabilistic model; word classification; Bioinformatics; Biomedical optical imaging; Character recognition; Clustering algorithms; Genetic engineering; Natural languages; Optical character recognition software; Speech recognition; Systems engineering and theory; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Evolutionary Computation, 2005. The 2005 IEEE Congress on
  • Print_ISBN
    0-7803-9363-5
  • Type

    conf

  • DOI
    10.1109/CEC.2005.1554810
  • Filename
    1554810