• DocumentCode
    1047391
  • Title

    A Field Theoretical Approach to Medical Natural Language Processing

  • Author

    Taira, Ricky K. ; Bashyam, Vijayaraghavan ; Kangarloo, Hooshang

  • Author_Institution
    California Univ, Los Angeles
  • Volume
    11
  • Issue
    4
  • fYear
    2007
  • fDate
    7/1/2007 12:00:00 AM
  • Firstpage
    364
  • Lastpage
    375
  • Abstract
    A parser for medical free text reports has been developed that is based on a chemistry/physics inspired ldquofield theoryrdquo for word-word sentence-level dependencies. The transition from the linguistic world to the world of interacting particles with potential energies is guided by a psycholinguistics thought experiment related to the amount of ldquoworkrdquo required to bring a reference word into an anchored configuration of words. Calibration experiments involving four and five grams were conducted. Data from these experiments were used as a knowledge source for estimating field conditions for words in sentences sampled from a corpus of medical reports. The result of the parser is a dependency tree that represents the global minimum energy state of the system of words for a given sentence. The system was trained and tested on a corpus of radiology reports. Preliminary performance, as quantified by link recall and precision statistics, is 84.9% and 89.9%, respectively.
  • Keywords
    grammars; knowledge representation; medical computing; natural language processing; word processing; calibration experiments; dependency tree; field theoretical approach; field theory; global minimum energy state; linguistic world; medical free text reports; medical natural language processing; medical reports; psycholinguistics thought experiment; radiology reports; word-word sentence-level dependency; Calibration; Chemistry; Energy states; Natural language processing; Physics; Potential energy; Psychology; Radiology; Statistics; System testing; Knowledge representation; natural language processing (NLP); structured medical reporting; Information Storage and Retrieval; Information Theory; Medical Informatics; Medical Records Systems, Computerized; Natural Language Processing; Terminology as Topic; Vocabulary, Controlled;
  • fLanguage
    English
  • Journal_Title
    Information Technology in Biomedicine, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1089-7771
  • Type

    jour

  • DOI
    10.1109/TITB.2006.884368
  • Filename
    4267693