• DocumentCode
    1323612
  • Title

    A Statistical Approach to Mel-Domain Mask Estimation for Missing-Feature ASR

  • Author

    Borgström, Bengt J. ; Alwan, Abeer

  • Author_Institution
    Electr. Eng. Dept., Univ. of California, Los Angeles, CA, USA
  • Volume
    17
  • Issue
    11
  • fYear
    2010
  • Firstpage
    941
  • Lastpage
    944
  • Abstract
    In this letter, we present a statistical approach to Mel-domain mask estimation for missing feature (MF)-based automatic speech recognition (ASR). Mel-domain time-frequency masks are of interest, since MF systems have been shown successful in that domain. Time- and channel-specific reliability measures are derived as posterior probabilities of active speech using a 2-state speech model. Since closed form distributions for Mel-domain spectra do not exist, they are instead modeled as χ2 processes with empirically-determined degrees of freedom. Additionally, we present HMM-based decoding to exploit temporal correlation of spectral speech data. The proposed mask estimation algorithm is integrated with an example MF-based ASR front-end from, and is shown to outperform the spectral subtraction (SS)-based method from in terms of word-accuracy, when applied to the Aurora-2 database.
  • Keywords
    hidden Markov models; reliability; spectral analysis; speech recognition; statistical distributions; 2-state speech model; Aurora-2 database; HMM-based decoding; Mel-domain mask estimation; Mel-domain time-frequency masks; automatic speech recognition; channel-specific reliability; closed form distributions; missing-feature ASR; posterior probability; spectral speech data temporal correlation; spectral subtraction based method; statistical approach; Estimation; Noise; Speech; Speech recognition; Time frequency analysis; Training; $chi ^{2}$ random variables; mask estimation; missing features; noise robust ASR; speech presence uncertainty;
  • fLanguage
    English
  • Journal_Title
    Signal Processing Letters, IEEE
  • Publisher
    ieee
  • ISSN
    1070-9908
  • Type

    jour

  • DOI
    10.1109/LSP.2010.2076348
  • Filename
    5570910