• DocumentCode
    3423626
  • Title

    Automatic phonetics-driven reconstruction of medical dictations on multiple levels of segmentation

  • Author

    Petrik, Stefan ; Pernkopf, Franz

  • Author_Institution
    Signal Process. & Speech Commun. Lab., Graz Univ. of Technol., Graz
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    4317
  • Lastpage
    4320
  • Abstract
    Automatic phonetic reconstruction of medical dictations from non- literal and automatically recognized speech transcripts leads to closer-to-literal transcripts for training. In this paper, we introduce an extended alignment method assessing multiple levels of text segmentation and show how open issues like wrong segmentation in the recognized transcript can be resolved. Furthermore, the effect of context-dependent reconstruction and the phonetic similarity threshold on the quality of the reconstructed transcription is measured. Experiments show an increase in precision between 0.7% and 4.7% absolute without loss in recall for the combined system incorporating all of these techniques in comparison to the system in the previous work.
  • Keywords
    speech recognition; speech synthesis; automatic phonetics reconstruction; medical dictations; speech recognition; text segmentation; Automatic speech recognition; Biomedical acoustics; Error correction; Laboratories; Natural languages; Oral communication; Signal processing; Speech processing; Speech recognition; Text recognition; Automatic transcription; dictation; phonetic similarity; syllabification; text alignment;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518610
  • Filename
    4518610