• DocumentCode
    3340549
  • Title

    Microphone array sub-band speech recognition

  • Author

    McCowan, Iain A. ; Sridharan, Sridha

  • Author_Institution
    Speech Res. Lab., Queensland Univ. of Technol., Brisbane, Qld., Australia
  • Volume
    1
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    185
  • Abstract
    Proposes the integration of sub-band speech recognition with a microphone array. A broadband beamforming microphone array allows for natural integration with sub-band speech recognition as the beamformer is typically implemented as a combination of band-limited sub-arrays. In the paper, rather than recombining the sub-array outputs to give a single enhanced output, we propose the fusion of separate hidden Markov models trained on each subarray frequency band. In addition, a dynamic sub-band weighting scheme is proposed in which the cross- and auto-spectral densities of the microphone array inputs are used to estimate the reliability of each frequency band. The microphone array sub-band system is evaluated on an isolated digit recognition task and compared to the standard full-band approach. The results of the proposed dynamic weighting scheme are compared to those obtained using both fixed equal sub-band weights, as well as optimal sub-band weights calculated from a priori knowledge of the correct results
  • Keywords
    acoustic arrays; acoustic signal processing; array signal processing; hidden Markov models; microphones; spatial filters; speech enhancement; speech recognition; auto-spectral densities; band-limited sub-arrays; beamformer; broadband beamforming microphone array; cross-spectral densities; dynamic sub-band weighting scheme; dynamic weighting scheme; hidden Markov models; isolated digit recognition; microphone array sub-band speech recognition; reliability; standard full-band approach; Array signal processing; Australia; Frequency estimation; Hidden Markov models; Laboratories; Microphone arrays; Noise reduction; Noise robustness; Speech enhancement; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
  • Conference_Location
    Salt Lake City, UT
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7041-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2001.940798
  • Filename
    940798