Title :
An improved method for extracting acronym-definition pairs from biomedical Literature
Author :
Saneesh Mohammed, N. ; Nazeer, K. A. Abdul
Author_Institution :
Dept. of Comput. Sci. & Eng., Heera Coll. of Eng. & Technol., Trivandrum, India
Abstract :
This paper deals with the problem of extracting acronym-definition pairs from biomedical text. We propose an improved Text mining system based on pattern matching method and space reduction heuristics which increases both recall and precision. Three metrics were used for evaluating the system - recall (measure of how much relevant data the system has extracted from text), precision (measure of how much data returned by the system is actually correct) and f-factor (combined value of recall and precision). Experimental results achieved 98.68% recall and 98.68% precision.
Keywords :
data mining; information retrieval; medical computing; pattern matching; text analysis; acronym-definition pair extraction; biomedical literature; biomedical text; f-factor metric; information retrieval methods; pattern matching method; precision metric; recall metric; space reduction heuristics; system evaluation; text mining system; Bioinformatics; Databases; Measurement; Pattern matching; Terminology; Text mining;
Conference_Titel :
Control Communication and Computing (ICCC), 2013 International Conference on
Conference_Location :
Thiruvananthapuram
Print_ISBN :
978-1-4799-0573-7
DOI :
10.1109/ICCC.2013.6731649