• DocumentCode
    1798356
  • Title

    Fast HMM-driven beamforming for robust speech recognition in reverberant environments

  • Author

    Wei-Tyng Hong

  • Author_Institution
    Dept. of Commun. Eng., Yuan Ze Univ., Taoyuan, Taiwan
  • Volume
    2
  • fYear
    2014
  • fDate
    13-16 July 2014
  • Firstpage
    529
  • Lastpage
    532
  • Abstract
    The reverberation is induced from the combination of the direct waveform and multiple reflected waveforms. Therefore, the reverberation is highly corrected with the original speech signal. This leads to dramatically degrade the performance of speech recognition. This paper extends VTS methodology to develop a robust technique for HMM-driven beamformer on the reverberation environments. We approximate the logarithm of Gaussian mixture models of HMM with VTS expansion. This makes it possible to obtain a simpler updating functions of beamformer parameters than the counterparts of original HMM-driven beamformer. The RWCP database is used for the simulation of the multi-channel recorded reverberation speech. A speaker-independent speech query task of Mandarin names was applied to evaluate the performance of the beamformers. Our experimental results show that the proposed algorithm was effective on compoutation reduction for the adaptation process of HMM-driven beamformer. It indicates that the proposed framework benefits the development in robust speech recognition on resource-constrained platforms.
  • Keywords
    hidden Markov models; microphone arrays; reverberation; speech recognition; Gaussian mixture models; Mandarin names; RWCP database; VTS expansion; VTS methodology; beamformer parameters; direct waveform; fast HMM-driven beamforming; multichannel recorded reverberation speech; original HMM-driven beamformer; original speech signal; reflected waveform; resource-constrained platforms; reverberant environments; reverberation environments; robust speech recognition; speaker-independent speech query task; Abstracts; Array signal processing; Field-flow fractionation; Lead; Radiation detectors; Robustness;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics (ICMLC), 2014 International Conference on
  • Conference_Location
    Lanzhou
  • ISSN
    2160-133X
  • Print_ISBN
    978-1-4799-4216-9
  • Type

    conf

  • DOI
    10.1109/ICMLC.2014.7009663
  • Filename
    7009663