• DocumentCode
    312220
  • Title

    Robust automatic speech recognition using a multi-channel signal separation front-end

  • Author

    Yen, Kuan Chieh ; Zhao, Yunxin

  • Author_Institution
    Beckman Inst. for Adv. Sci. & Technol., Illinois Univ., Urbana, IL, USA
  • Volume
    3
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    1337
  • Abstract
    A multi-channel signal separation front-end for robust automatic speech recognition under time-varying interference conditions is developed. The speech signals acquired by a dual-channel system are restored by adaptive decorrelation filtering, and then examined by a time-domain or frequency-domain source signal detection technique to determine the active regions of each source signal. The front-end is integrated with an HMM-based speaker-independent continuous speech recognition system by providing the restored signals within the active regions for recognition. Under a simulated room acoustic condition, the overall system shows very promising performance. For the conditions with SNR above -10 dB, the achieved word recognition accuracies are very close to that of the interference-free condition
  • Keywords
    acoustic signal detection; adaptive signal processing; frequency-domain analysis; hidden Markov models; pulse height analysers; speech recognition; speech recognition equipment; time-domain analysis; SNR; active regions; adaptive decorrelation filtering; dual-channel system; frequency-domain source signal detection technique; hidden Markov model; interference-free condition; multi-channel signal separation front-end; performance; robust automatic speech recognition; simulated room acoustic condition; speaker-independent continuous speech recognition system; speech signal restoration; time-domain source signal detection technique; time-varying interference conditions; word recognition accuracies; Adaptive filters; Automatic speech recognition; Decorrelation; Filtering; Interference; Robustness; Signal restoration; Source separation; Speech recognition; Time domain analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607860
  • Filename
    607860