Title :
Applying transductive learning for automatic term extraction: The case of the ecology domain
Author :
Conrado, Merley S. ; Rossi, Rafael G. ; Pardo, Thiago A. S. ; Rezende, Solange O.
Author_Institution :
Inst. de Cienc. Mat. e de Comput., Univ. de Sao Paulo, Sao Paulo, Brazil
Abstract :
Terms are the basis for general text mining and natural language processing applications. However, the manual term extraction is unfeasible due to the huge number of words presented in a domain corpus and also the human effort required to do the extraction. For the term extraction task, machine learning techniques have been used to perform automatic term extraction (ATE). Inductive learning is commonly used, but it requires a large number of words labeled as terms and non-terms to build a classification model and classify unseen words. A better solution is the use of transductive learning, since it requires a small number of labeled examples to classify words as terms and non-terms. In this paper, we propose the use of transductive learning to ATE. The obtained results demonstrate that the application of transductive learning to ATE produces better results than the results obtained by inductive learning.
Keywords :
data mining; learning (artificial intelligence); natural language processing; text analysis; ATE; automatic term extraction; domain corpus; ecology domain; human effort; inductive learning; machine learning techniques; manual term extraction; natural language processing applications; text mining; transductive learning; Accuracy; Environmental factors; Feature extraction; Frequency modulation; Frequency-domain analysis; Pragmatics; Term Extraction; Transductive Learning;
Conference_Titel :
Informatics and Applications (ICIA),2013 Second International Conference on
Conference_Location :
Lodz
Print_ISBN :
978-1-4673-5255-0
DOI :
10.1109/ICoIA.2013.6650267