Title :
BioSumm: A novel summarizer oriented to biological information
Author :
Baralis, Elena ; Fiori, Alessandro ; Montrucchio, Lorenzo
Author_Institution :
Politec. di Torino, Torino
Abstract :
The availability of increasingly wider repositories of biomedical and biological texts requires effective techniques to manage the huge mass of unstructured information there contained. The availability of ad-hoc document summaries, targeted to specific topics, may assist researchers in inferring previously undisclosed knowledge and in performing the biological validation of the results of data mining analysis. This paper presents BioSumm, a flexible framework which analyzes large collections of unclassified biomedical texts and produces ad-hoc summaries oriented to inferring knowledge of gene/protein relationships. Summary generation is driven by a novel grading function, which biases sentence selection by means of an appropriate domain dictionary.
Keywords :
bioinformatics; data mining; database management systems; dictionaries; document handling; BioSumm; biological information summarizer; biological text repository; biomedical text repository; data mining analysis; document summaries; domain dictionary; gene-protein relationships; grading function; unstructured information management; Availability; Data analysis; Data mining; Dictionaries; Indexing; Information retrieval; Navigation; Performance analysis; Petroleum; Proteins;
Conference_Titel :
BioInformatics and BioEngineering, 2008. BIBE 2008. 8th IEEE International Conference on
Conference_Location :
Athens
Print_ISBN :
978-1-4244-2844-1
Electronic_ISBN :
978-1-4244-2845-8
DOI :
10.1109/BIBE.2008.4696750