Title :
Using SVM with uneven margins to extract acronym-expansions
Author :
Gao, Yong-mei ; Huang, Ya-lou
Author_Institution :
Coll. of Inf. Tech. Sci., Nankai Univ., Tianjin, China
Abstract :
Extracting acronyms and their expansions from plain text is an important problem in text mining. Previous research work shows that it can be solved using machine learning approaches. That is, converting the problem of acronym extraction into a binary classification one. We investigated the classification problem and found that the classes are highly unbalanced. So we try to tackle the problem using SVM with uneven margins. Experimental results showed that our approach can get better results than baseline methods of using rules and conventional SVM models. Experimental results also showed how uneven margins classifier making the tradeoff between the extracting precision and recall.
Keywords :
learning (artificial intelligence); pattern classification; support vector machines; text analysis; acronym-expansions; binary classification; classification problem; machine learning; support vector machines; text mining; uneven margins; Cybernetics; Data mining; Machine learning; Natural languages; Read only memory; Statistical learning; Support vector machine classification; Support vector machines; Text mining; Text recognition; Acronym Extraction; Information Extraction; SVM;
Conference_Titel :
Machine Learning and Cybernetics, 2009 International Conference on
Conference_Location :
Baoding
Print_ISBN :
978-1-4244-3702-3
Electronic_ISBN :
978-1-4244-3703-0
DOI :
10.1109/ICMLC.2009.5212276