Title :
A genetic word clustering algorithm
Author :
Hernandez, German ; Bobadilla, Leonard ; Sanchez, Oscar
Author_Institution :
Comput. & Syst. Eng., Colombia Nat. Univ., Bogota, Colombia
Abstract :
In this work, a genetic word clustering algorithm, that classifies words present in the phrases of a linguistic corpus, is proposed. The underlying goal of word classification is to build a good probabilistic model of the language defined by the phrases in the corpus. Some experiments comparing the performance of the proposed algorithm with a classical word clustering algorithm were carried out.
Keywords :
classification; genetic algorithms; natural languages; pattern clustering; probability; text analysis; word processing; genetic word clustering; linguistic corpus; probabilistic model; word classification; Bioinformatics; Biomedical optical imaging; Character recognition; Clustering algorithms; Genetic engineering; Natural languages; Optical character recognition software; Speech recognition; Systems engineering and theory; Vocabulary;
Conference_Titel :
Evolutionary Computation, 2005. The 2005 IEEE Congress on
Print_ISBN :
0-7803-9363-5
DOI :
10.1109/CEC.2005.1554810