  • DocumentCode
    541661
  • Title
    Matching data fragments with imperfect identifiers from disparate sources
  • Author
    Craig, Michael B.; Moody, Benjamin E.; Jia, Sherman; Villarroel, Mauricio C.; Mark, Roger G.
  • Author_Institution
    Div. of Health Sci. & Technol., Harvard-MIT, Cambridge, MA, USA
  • fYear
    2010
  • fDate
    26-29 Sept. 2010
  • Firstpage
    793
  • Lastpage
    796
  • Abstract
    The Multiparameter Intelligent Monitoring in Intensive Care (MIMIC-II) Database includes waveforms and derived parameters from bedside monitors, clinical data from an ICU information system, and data from other hospital laboratories and archives, for thousands of patients. These data come from devices under separate domains that often do not retain detailed information regarding relationships between parameters. We developed software for matching data fragments with incomplete and sometimes incorrect identifiers. We found that names, medical record numbers, waveform times and durations, and ICU admission and discharge records were most helpful when available; however, physiological data can also be used in some circumstances. Rule-based normalization and text edit-distance metrics are used in addition to a visual verification tool for patients whose records cannot be assembled automatically. Thus, a majority of the available waveform recordings are matched to patients in the clinical database.
  • Keywords
    medical information systems; medical signal processing; patient monitoring; waveform analysis; ICU information system; bedside monitors; clinical database; discharge records; hospital laboratories; imperfect identifiers; intensive care database; matching data fragments; medical record; multiparameter intelligent monitoring; physiological data; rule-based normalization; text edit-distance metrics; visual verification tool; waveform recordings; Biomedical monitoring; Databases; Heart rate; Hospitals; Monitoring; Real time systems; Servers
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computing in Cardiology, 2010
  • Conference_Location
    Belfast
  • ISSN
    0276-6547
  • Print_ISBN
    978-1-4244-7318-2
  • Type
    conf
  • Filename
    5738092