DocumentCode
445539
Title
A genetic word clustering algorithm
Author
Hernandez, German ; Bobadilla, Leonard ; Sanchez, Oscar
Author_Institution
Comput. & Syst. Eng., Colombia Nat. Univ., Bogota, Colombia
Volume
2
fYear
2005
fDate
2-5 Sept. 2005
Firstpage
1075
Abstract
In this work, a genetic word clustering algorithm that classifies the words present in the phrases of a linguistic corpus is proposed. The underlying goal of word classification is to build a good probabilistic model of the language defined by the phrases in the corpus. Experiments comparing the performance of the proposed algorithm with that of a classical word clustering algorithm were carried out.
Keywords
classification; genetic algorithms; natural languages; pattern clustering; probability; text analysis; word processing; genetic word clustering; linguistic corpus; probabilistic model; word classification; Bioinformatics; Biomedical optical imaging; Character recognition; Clustering algorithms; Genetic engineering; Natural languages; Optical character recognition software; Speech recognition; Systems engineering and theory; Vocabulary
fLanguage
English
Publisher
IEEE
Conference_Titel
Evolutionary Computation, 2005. The 2005 IEEE Congress on
Print_ISBN
0-7803-9363-5
Type
conf
DOI
10.1109/CEC.2005.1554810
Filename
1554810