• DocumentCode
    3372709
  • Title

    Robust speech recognition using feature-domain multi-channel bayesian estimators

  • Author

    Principi, Emanuele ; Rotili, Rudy ; Cifani, Simone ; Marinelli, Lorenzo ; Squartini, Stefano ; Piazza, Francesco

  • Author_Institution
    3MediaLabs, Univ. Politec. delle Marche, Ancona, Italy
  • fYear
    2010
  • fDate
    May 30 2010-June 2 2010
  • Firstpage
    2670
  • Lastpage
    2673
  • Abstract
    This paper proposes innovative multi-channel bayesian estimators in the feature-domain for robust speech recognition. Both minimum-mean-squared-error (MMSE) and maximum-a-posteriori (MAP) criteria have been explored: the related algorithms extend the multi-channel frequency-domain counterparts and generalize the single-channel feature-domain MMSE solution, recently appeared in the literature. Computer simulations conducted on a modified AURORA2 database show the efficacy of the frequency-domain multi-channel estimators when used as a pre-processing stage of a speech recognition engine, and that the proposed multi-channel MAP approach outperforms single-channel estimators by at least 3% on average.
  • Keywords
    Bayes methods; channel estimation; feature extraction; frequency-domain analysis; least mean squares methods; maximum likelihood estimation; speech recognition; MMSE; feature-domain analysis; frequency-domain analysis; maximum-a-posteriori algorithm; minimum mean square error; multichannel Bayesian estimator; speech recognition; Automatic speech recognition; Bayesian methods; Computer simulation; Engines; Frequency estimation; Microphones; Robustness; Spatial databases; Speech enhancement; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Circuits and Systems (ISCAS), Proceedings of 2010 IEEE International Symposium on
  • Conference_Location
    Paris
  • Print_ISBN
    978-1-4244-5308-5
  • Electronic_ISBN
    978-1-4244-5309-2
  • Type

    conf

  • DOI
    10.1109/ISCAS.2010.5537057
  • Filename
    5537057