• DocumentCode
    652145
  • Title

    Using Structured EHR Data and SVM to Support ICD-9-CM Coding

  • Author

    Ferrao, J.C. ; Janela, Filipe ; Oliveira, M.D. ; Martins, H.M.G.

  • Author_Institution
    Healthcare Sector, Siemens SA, Amadora, Portugal
  • fYear
    2013
  • fDate
    9-11 Sept. 2013
  • Firstpage
    511
  • Lastpage
    516
  • Abstract
    This study proposes a methodology to support coding professionals in assigning ICD-9-CM codes to inpatient episodes. This subject has been predominantly addressed through the use of natural language processing methods, which show limited generalizability. To surpass this issue, this paper proposes a methodology entailing an adaptive data processing method based on structured electronic health record data, whereby raw clinical data is mapped into a feature set, and based on which supervised learning algorithms are trained. After applying a filter method for feature selection, support vector machine (SVM) classifiers are trained to obtain predictions for assigning codes to each episode. This approach is tested using a dataset of inpatient episodes from a department of Internal Medicine. Classifiers exhibited F1-measure values around 52%. Recall was generally higher than precision, which is considered valuable for coding support purposes. Analyzing results on an individual code basis sheds light on some key-issues regarding the use of structured electronic health record data in supporting clinical coding.
  • Keywords
    data structures; learning (artificial intelligence); medical information systems; natural language processing; pattern classification; support vector machines; Department of Internal Medicine; F1-measure values; ICD-9-CM Coding; SVM; adaptive data processing method; feature selection; feature set; filter method; inpatient episodes; limited generalizability; natural language processing methods; structured EHR data; structured electronic health record data; supervised learning algorithms; support vector machine classifiers; Data models; Encoding; Medical diagnostic imaging; Predictive models; Supervised learning; Support vector machines; Training; ICD-9-CM coding; electronic health record; feature selection; filter method; support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Healthcare Informatics (ICHI), 2013 IEEE International Conference on
  • Conference_Location
    Philadelphia, PA
  • Type

    conf

  • DOI
    10.1109/ICHI.2013.79
  • Filename
    6680527