• DocumentCode
    2109889
  • Title

    Predicting atrial fibrillation and flutter using Electronic Health Records

  • Author

    Karnik, S. ; Sin Lam Tan ; Berg, B. ; Glurich, I. ; Jinfeng Zhang ; Vidaillet, H.J. ; Page, C.D. ; Chowdhary, R.

  • Author_Institution
    Biomed. Inf. Res. Center, Marshfield Clinic Res. Found., Marshfield, WI, USA
  • fYear
    2012
  • fDate
    Aug. 28 2012-Sept. 1 2012
  • Firstpage
    5562
  • Lastpage
    5565
  • Abstract
    Electronic Health Records (EHR) contain large amounts of useful information that could potentially be used for building models for predicting onset of diseases. In this study, we have investigated the use of free-text and coded data in Marshfield Clinic´s EHR, individually and in combination for building machine learning based models to predict the first ever episode of atrial fibrillation and/or atrial flutter (AFF). We trained and evaluated our AFF models on the EHR data across different time intervals (1, 3, 5 and all years) prior to first documented onset of AFF. We applied several machine learning methods, including naïve bayes, support vector machines (SVM), logistic regression and random forests for building AFF prediction models and evaluated these using 10-fold cross-validation approach. On text-based datasets, the best model achieved an F-measure of 60.1%, when applied exclusively to coded data. The combination of textual and coded data achieved comparable performance. The study results attest to the relative merit of utilizing textual data to complement the use of coded data for disease onset prediction modeling.
  • Keywords
    Bayes methods; diseases; learning (artificial intelligence); medical information systems; support vector machines; text analysis; AFF prediction models; EHR data; Marshfield clinic; SVM; atrial fibrillation; atrial flutter; coded data; disease onset prediction modeling; electronic health records; free-text; logistic regression; machine learning based models; naïve Bayes; random forests; support vector machines; text-based datasets; useful information; Atrial fibrillation; Data models; Diseases; Medical diagnostic imaging; Predictive models; Support vector machines; Unified modeling language; Atrial Fibrillation; Atrial Flutter; Electronic Health Records; Humans;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Engineering in Medicine and Biology Society (EMBC), 2012 Annual International Conference of the IEEE
  • Conference_Location
    San Diego, CA
  • ISSN
    1557-170X
  • Print_ISBN
    978-1-4244-4119-8
  • Electronic_ISBN
    1557-170X
  • Type

    conf

  • DOI
    10.1109/EMBC.2012.6347254
  • Filename
    6347254