Title :
A literature search tool for identifying disease-associated genes using Hidden Markov model
Author :
Sreekala, S. ; Nazeer, K. A. Abdul
Author_Institution :
Dept. of Comput. Sci. & Eng., Nat. Inst. of Technol., Calicut, India
Abstract :
Understanding the role of genetics is very important for the in-depth study of a disease. Even though lots of information about gene-disease association is available, it is difficult even for an expert user to manually extract it from the huge volume of literature. Therefore, this work introduces a novel extraction tool that can identify disease associated genes from the literature using text-mining algorithm. Here, Hidden Markov Model is combined with a rule-based Named Entity Recognition approach to identify gene symbols from the literature. This will predict the good candidate genes for the disease which will help in the further analysis of the disease.
Keywords :
bioinformatics; data mining; diseases; genetics; hidden Markov models; search problems; text analysis; disease-associated gene identification; extraction tool; gene-disease association; genetics; hidden Markov model; literature search tool; rule-based named entity recognition approach; text-mining algorithm; Abstracts; Bioinformatics; Diseases; Genetics; Hidden Markov models; Navigation; Probability;
Conference_Titel :
Computational Systems and Communications (ICCSC), 2014 First International Conference on
Conference_Location :
Trivandrum
Print_ISBN :
978-1-4799-6012-5
DOI :
10.1109/COMPSC.2014.7032627