• DocumentCode
    290120
  • Title
    A new method for segmenting continuous speech
  • Author
    Pawate, B.I. ; Dowling, Eric
  • Author_Institution
    Tsukuba Res. & Dev. Center, Texas Instrum., Ibaraki, Japan
  • Volume
    i
  • fYear
    1994
  • fDate
    19-22 Apr 1994
  • Abstract
    Speech recognition systems are increasingly used in applications such as telephone services, where a user places a call by speaking the digits or the name of the person to be called. One of the main problems in this application is segmenting the input utterance into speech and nonspeech portions. Current approaches typically suffer from one of two problems: they either incorporate noise as part of the word to be enrolled or falsely classify a portion of a word as noise. As a result, recognition performance suffers. The authors present another approach to automatically segment continuous speech and create speaker-dependent models. To verify the hypothesis, they use a database of 30 speakers whose speech was recorded over the public switched telephone network. With this database, they benchmark their algorithm against a state-of-the-art approach and show a 4× reduction in the error rate of the recognition system.
  • Keywords
    speech processing; speech recognition; telephony; algorithm; error rate; input utterance; nonspeech portion; public switched telephone network; recognition performance; segmentation; speaker dependent models; speech portion; speech recognition systems; telephone services; Databases; Detectors; Error analysis; Instruments; Noise level; Speech enhancement; Speech recognition; Target recognition; Telephony; Vocabulary;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
  • Conference_Location
    Adelaide, SA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-1775-0
  • Type
    conf
  • DOI
    10.1109/ICASSP.1994.389357
  • Filename
    389357