• DocumentCode
    2812615
  • Title

    Automatic Synchronization of live speech and its Transcripts based on a frame-synchronous likelihood ratio test

  • Author

    Gao, Jie ; Zhao, Qingwei ; Yan, Yonghong

  • Author_Institution
    ThinkIT Speech Lab., Chinese Acad. of Sci., Beijing, China
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    1622
  • Lastpage
    1625
  • Abstract
    In this paper, we present our initial efforts in the task of Automatically Synchronizing live spoken Utterances with their Transcripts (textual contents) (ASUT) when the texts are known. We treat it as a online speech-text alignment problem. And it is further simplified into the problem of on-the-fly detecting of the end time of a spoken utterance given its textual content. A general framework called frame-synchronous likelihood ratio test (FS-LRT) procedure is proposed for this end time detection task and explored with the hidden Markov models (HMMs). The property of FS-LRT is studied empirically. Extensive experiments indicate that our proposed approach shows satisfying performance. In addition, FS-LRT has been successfully applied in a subtitling system for live broadcast news.
  • Keywords
    hidden Markov models; speech processing; text analysis; FS-LRT procedure; automatically synchronizing spoken utterances with their transcripts; end time detection task; frame-synchronous likelihood ratio test; hidden Markov model; live broadcast news; live speech; live spoken utterance; online speech-text alignment problem; textual content; Acoustic testing; Automatic speech recognition; Automatic testing; Delay; Digital multimedia broadcasting; Error correction; Hidden Markov models; Multimedia communication; Research and development; TV broadcasting; Automatically Synchronizing spoken Utterances with their Transcripts; frame-synchronous likelihood ratio test;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5496295
  • Filename
    5496295