DocumentCode
3640097
Title
A Textual Representation Scheme for Identifying Clinical Relationships in Patient Records
Author
Rezarta Islamaj Doan;Aurelie Neveol;Zhiyong Lu
Author_Institution
Nat. Center for Biotechnol. Inf., Nat. Institutes of Health, Bethesda, MD, USA
fYear
2010
Firstpage
995
Lastpage
998
Abstract
The identification of relationships between clinical concepts in patient records is a preliminary step for many important applications in medical informatics, ranging from quality of care to hypothesis generation. In this work we describe an approach that facilitates the automatic recognition of relationships defined between two different concepts in text. Unlike the traditional bag-of-words representation, in this work, a relationship is represented with a scheme of five distinct context-blocks based on the position of concepts in the text. This scheme was applied to eight different relationships, between medical problems, treatments and tests, on a set of 349 patient records from the 4th i2b2 challenge. Results show that the context-block representation was very successful (F-Measure = 0.775) compared to the bag-of-words model (F-Measure = 0.402). The advantage of this representation scheme was the correct management of word position information, which may be critical in identifying certain relationships.
Keywords
"Context modeling","Machine learning","Semantics","Feature extraction","Bioinformatics","Diseases","Support vector machines"
Publisher
ieee
Conference_Titel
Machine Learning and Applications (ICMLA), 2010 Ninth International Conference on
Print_ISBN
978-1-4244-9211-4
Type
conf
DOI
10.1109/ICMLA.2010.164
Filename
5708983
Link To Document