• DocumentCode
    1981487
  • Title

    Applying transductive learning for automatic term extraction: The case of the ecology domain

  • Author

    Conrado, Merley S. ; Rossi, Rafael G. ; Pardo, Thiago A. S. ; Rezende, Solange O.

  • Author_Institution
    Inst. de Cienc. Mat. e de Comput., Univ. de Sao Paulo, Sao Paulo, Brazil
  • fYear
    2013
  • fDate
    23-25 Sept. 2013
  • Firstpage
    264
  • Lastpage
    269
  • Abstract
    Terms are the basis for general text mining and natural language processing applications. However, the manual term extraction is unfeasible due to the huge number of words presented in a domain corpus and also the human effort required to do the extraction. For the term extraction task, machine learning techniques have been used to perform automatic term extraction (ATE). Inductive learning is commonly used, but it requires a large number of words labeled as terms and non-terms to build a classification model and classify unseen words. A better solution is the use of transductive learning, since it requires a small number of labeled examples to classify words as terms and non-terms. In this paper, we propose the use of transductive learning to ATE. The obtained results demonstrate that the application of transductive learning to ATE produces better results than the results obtained by inductive learning.
  • Keywords
    data mining; learning (artificial intelligence); natural language processing; text analysis; ATE; automatic term extraction; domain corpus; ecology domain; human effort; inductive learning; machine learning techniques; manual term extraction; natural language processing applications; text mining; transductive learning; Accuracy; Environmental factors; Feature extraction; Frequency modulation; Frequency-domain analysis; Pragmatics; Term Extraction; Transductive Learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Informatics and Applications (ICIA),2013 Second International Conference on
  • Conference_Location
    Lodz
  • Print_ISBN
    978-1-4673-5255-0
  • Type

    conf

  • DOI
    10.1109/ICoIA.2013.6650267
  • Filename
    6650267