• DocumentCode
    302333
  • Title

    Robust recognition of cellular telephone speech by adaptive vector quantization

  • Author

    Sönmez, M. Kemal ; Rajasekaran, Raja ; Baras, John S.

  • Author_Institution
    Speech Res. Lab., Texas Instrum. Inc., Dallas, TX, USA
  • Volume
    1
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    503
  • Abstract
    Performance degradation caused by acoustical environment mismatch remains an important practical problem in speech recognition. The problem is more significant in applications over telecommunication channels, especially with the wider use of personal communications systems such as cellular phones, which invariably present challenging acoustical conditions. In this work, we introduce a vector quantization (VQ) based compensation technique that both makes use of a priori information about likely acoustical environments and adapts to the test environment to improve recognition. The technique is progressive and requires neither simultaneously recorded speech from the training and testing environments nor EM-type batch iterations. Instead of using simultaneously recorded data, the integrity of the updated VQ codebooks with respect to acoustical classes is maintained by endowing the codebooks with a topology and using transformations that preserve the topology of the reference environment. We report results on the McCaw Cellular Corpus, where the technique decreases the word error for continuous ten-digit recognition of cellular hands-free microphone speech with land-line-trained models from 23.8% to 13.6%, and the speaker-dependent voice calling sentence error from 16.5% to 10.6%.
  • Keywords
    acoustic noise; cellular radio; compensation; error analysis; speech coding; speech recognition; vector quantisation; McCaw Cellular Corpus; acoustical environment mismatch; adaptive vector quantization; cellular hands free microphone speech; cellular telephone speech; compensation technique; land line trained models; performance degradation; personal communications systems; speaker dependent voice calling sentence error; speech recognition; topology; transformations; updated VQ codebooks; word error; Acoustic testing; Cellular phones; Communication channels; Degradation; Microphones; Robustness; Speech recognition; Telephony; Topology; Vector quantization
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1996.541143
  • Filename
    541143