• DocumentCode
    2369403
  • Title

    Automated classification of radiology reports for acute lung injury: Comparison of keyword and machine learning based natural language processing approaches

  • Author

    Solti, Imre ; Cooke, Colin R. ; Xia, Fei ; Wurfel, Mark M.

  • Author_Institution
    Dept. of Med. Educ. & Biomed. Inf., Univ. of Washington, Seattle, WA, USA
  • fYear
    2009
  • fDate
    1-4 Nov. 2009
  • Firstpage
    314
  • Lastpage
    319
  • Abstract
    This paper compares the performance of keyword and machine learning-based chest x-ray report classification for Acute Lung Injury (ALI). ALI mortality is approximately 30 percent. High mortality is, in part, a consequence of delayed manual chest x-ray classification. An automated system could reduce the time to recognize ALI and lead to reductions in mortality. For our study, 96 and 857 chest x-ray reports in two corpora were labeled by domain experts for ALI. We developed a keyword and a Maximum Entropy-based classification system. Word unigram and character n-grams provided the features for the machine learning system. The Maximum Entropy algorithm with character 6-gram achieved the highest performance (Recall=0.91, Precision=0.90 and F-measure=0.91) on the 857-report corpus. This study has shown that for the classification of ALI chest x-ray reports, the machine learning approach is superior to the keyword based system and achieves comparable results to highest performing physician annotators.
  • Keywords
    diseases; learning (artificial intelligence); medical diagnostic computing; natural language processing; pattern classification; radiology; acute lung injury; character n-grams; chest X-ray report classification; machine learning system; maximum entropy algorithm; maximum entropy-based classification system; natural language processing; radiology reports automated classification; word unigram; Biomedical informatics; Delay; Gold; Injuries; Learning systems; Lungs; Machine learning; Medical diagnostic imaging; Natural language processing; Radiology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine Workshop, 2009. BIBMW 2009. IEEE International Conference on
  • Conference_Location
    Washington, DC
  • Print_ISBN
    978-1-4244-5121-0
  • Type

    conf

  • DOI
    10.1109/BIBMW.2009.5332081
  • Filename
    5332081