DocumentCode :
376247
Title :
Does natural selection apply to natural language processing? an experiment for multiword unit extraction
Author :
Dias, Gaël ; Nunes, Sérgio
Author_Institution :
Center of Math., Beira Interior Univ., Covilha, Portugal
Volume :
1
fYear :
2001
fDate :
2001
Firstpage :
205
Abstract :
In this paper, we focus on the suitability of natural selection for the extraction of Multiword Units (i.e. complex lexical units such as compound nouns, idiomatic expressions or phrase templates). For that purpose, a fitness function is defined whose maximization serves as a basis for the identification of pertinent word N-grams together with a similarity measure. In order to propose a suitable platform for evaluation, a software application called GALEMU (Genetic ALgorithm for the Extraction of Multiword Units) has been implemented. Finally, we will provide an experiment realized over an unnnotated text corpus extracted from the database collection of the European Commission that evidences results with high precision rate
Keywords :
genetic algorithms; natural languages; GALEMU; fitness function; natural language processing; natural selection; similarity measures; Application software; Biological cells; Content based retrieval; Databases; Genetic algorithms; Humans; Indexing; Information retrieval; Mathematics; Natural language processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man, and Cybernetics, 2001 IEEE International Conference on
Conference_Location :
Tucson, AZ
ISSN :
1062-922X
Print_ISBN :
0-7803-7087-2
Type :
conf
DOI :
10.1109/ICSMC.2001.969813
Filename :
969813
Link To Document :
بازگشت