Title :
A Field Theoretical Approach to Medical Natural Language Processing
Author :
Taira, Ricky K. ; Bashyam, Vijayaraghavan ; Kangarloo, Hooshang
Author_Institution :
California Univ, Los Angeles
fDate :
7/1/2007 12:00:00 AM
Abstract :
A parser for medical free text reports has been developed that is based on a chemistry/physics inspired ldquofield theoryrdquo for word-word sentence-level dependencies. The transition from the linguistic world to the world of interacting particles with potential energies is guided by a psycholinguistics thought experiment related to the amount of ldquoworkrdquo required to bring a reference word into an anchored configuration of words. Calibration experiments involving four and five grams were conducted. Data from these experiments were used as a knowledge source for estimating field conditions for words in sentences sampled from a corpus of medical reports. The result of the parser is a dependency tree that represents the global minimum energy state of the system of words for a given sentence. The system was trained and tested on a corpus of radiology reports. Preliminary performance, as quantified by link recall and precision statistics, is 84.9% and 89.9%, respectively.
Keywords :
grammars; knowledge representation; medical computing; natural language processing; word processing; calibration experiments; dependency tree; field theoretical approach; field theory; global minimum energy state; linguistic world; medical free text reports; medical natural language processing; medical reports; psycholinguistics thought experiment; radiology reports; word-word sentence-level dependency; Calibration; Chemistry; Energy states; Natural language processing; Physics; Potential energy; Psychology; Radiology; Statistics; System testing; Knowledge representation; natural language processing (NLP); structured medical reporting; Information Storage and Retrieval; Information Theory; Medical Informatics; Medical Records Systems, Computerized; Natural Language Processing; Terminology as Topic; Vocabulary, Controlled;
Journal_Title :
Information Technology in Biomedicine, IEEE Transactions on
DOI :
10.1109/TITB.2006.884368