Title :
Tactical Clinical Text Mining for Improved Patient Characterization
Author :
Champion, Heather ; Pizzi, Nick ; Krishnamoorthy, R.
Author_Institution :
IMT, Winnipeg, MB, Canada
fDate :
June 27 2014-July 2 2014
Abstract :
Clinical sources of information are markedly increasing in both volume and variety. A significant portion of the valuable data resides in the unstructured or semi-structured clinical text of documents stored in disparate repositories or embedded in HL7 messages. Clinical documents such as discharge summaries, prescriptions, lab reports, and free-form physician notes are filled with abbreviations, acronyms, misspellings, and ungrammatical phrases. However, synoptic reporting methods are restrictive for health care practitioners who wish to express critical and comprehensive patient information in electronic medical records. Furthermore, they have been superseded by systems that use natural language processing (NLP) to extract clinical concepts from free-form text. To address the growing need for efficient NLP solutions that can handle the volume and variety of clinical text, we have developed an optimized rules-based clinical concept extractor called TRACE (Tactical Rules-based AQL Clinical Extractor) using the Annotation Query Language (AQL). We present the experience we have gained applying text mining tools to this challenging domain, as well as a comparison of our solution to cTAKES (clinical Text Analysis and Knowledge Extraction System), an open-source clinical text miner, on a set of prescription documents. We also describe how efficient and scalable clinical text mining techniques will improve several of our company´s offerings.
Keywords :
data mining; electronic health records; natural language processing; public domain software; query languages; text analysis; HL7 messages; NLP; TRACE; abbreviations; acronyms; annotation query language; discharge summaries; electronic medical records; free-form physician notes; free-form text; information clinical sources; lab reports; misspellings; natural language processing; open-source clinical text miner; optimized rules-based clinical concept extractor; patient characterization improvement; prescriptions; semi structured clinical document text; synoptic reporting methods; tactical clinical text mining technique; tactical rules-based AQL clinical extractor; ungrammatical phrases; unstructured clinical document text; valuable data; Dictionaries; Drugs; Gold; Pipelines; Standards; Text mining; Annotation Query Language; Clinical Data; Natural Language Processing; SystemT; Text Mining; UIMA; cTAKES;
Conference_Titel :
Big Data (BigData Congress), 2014 IEEE International Congress on
Conference_Location :
Anchorage, AK
Print_ISBN :
978-1-4799-5056-0
DOI :
10.1109/BigData.Congress.2014.101