Title :
A Hybrid Abbreviation Extraction Technique for Biomedical Literature
Author :
Song, Min ; Yoo, Illhoi
Author_Institution :
New Jersey Inst. of Technol. Univ., Newark
Abstract :
In this paper, we propose a novel technique to extract abbreviation combining natural language processing techniques and the Support Vector Machine (SVM) in biomedical literature. The proposed technique gives us the comparative advantages over others in the following aspects: 1) It incorporates lexical analysis techniques to supervised learning for extracting abbreviations. 2) It makes use of text chunking techniques to identify long forms of abbreviations. 3) It significantly improves Recall compared to other techniques. The experimental results show that our approach outperforms the leading abbreviation algorithms, Extract Abbrev, ALICE, and Acrophile, at least by 6% 13.9%, and 13.2% respectively, in both Precision and Recall on the Gold Standard Development corpus.
Keywords :
information retrieval; medical administrative data processing; medical computing; natural language processing; support vector machines; text analysis; vocabulary; Gold Standard Development corpus; biomedical literature; hybrid abbreviation extraction technique; lexical analysis; natural language processing; supervised learning; support vector machine; text chunking; Abstracts; Bioinformatics; Biomembranes; Conference management; Data mining; Natural language processing; Proteins; Supervised learning; Support vector machines; Technology management;
Conference_Titel :
Bioinformatics and Biomedicine, 2007. BIBM 2007. IEEE International Conference on
Conference_Location :
Fremont, CA
Print_ISBN :
978-0-7695-3031-4
DOI :
10.1109/BIBM.2007.33