• DocumentCode
    857172
  • Title

    Automatic speech recognition performance on a voicemail transcription task

  • Author

    Padmanabhan, Mukund ; Saon, George ; Huang, Jing ; Kingsbury, Brian ; Mangu, Lidia

  • Author_Institution
    Renaissance Technol. Corp., East Setauket, NY, USA
  • Volume
    10
  • Issue
    7
  • fYear
    2002
  • fDate
    10/1/2002 12:00:00 AM
  • Firstpage
    433
  • Lastpage
    442
  • Abstract
    We report on the performance of automatic speech recognition (ASR) systems on voicemail transcription. Voicemail is spontaneous telephone speech recorded over a variety of channels; consequently, it is representative of many challenging problems in speech recognition. In the course of working on this task, several algorithms were developed that focus on different components of an ASR system, including lexicon design, feature extraction, hypothesis search, and adaptation. We report the improvements provided by these techniques, as well as other standard techniques, on a voicemail test set. Although the techniques are benchmarked on voicemail test data, their scope is not restricted to this domain as they address fundamental aspects of the speech recognition process.
  • Keywords
    feature extraction; speech recognition; voice mail; ASR system; acoustic model adaptation; algorithms; automatic speech recognition performance; hypothesis search; lexicon design; voicemail test data; voicemail transcription; Algorithm design and analysis; Automatic speech recognition; Error analysis; Feature extraction; Hidden Markov models; Information retrieval; Speech recognition; Telephony; Testing; Voice mail
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/TSA.2002.804303
  • Filename
    1045275