• DocumentCode
    395190
  • Title

    Novel robust feature extraction based on spectrally masked channel energy ratio (SMaChER) for speech recognition

  • Author

    Changxue, M.A.

  • Author_Institution
    Motorola Inc., Schaumburg, IL, USA
  • Volume
    2
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    Comparing speech recognition performance between human beings and computer, the latter´s performance degrades dramatically in a noisy mobile environment. Based on the perceptual study of speech sounds in terms of auditory masking and speech perception study, it has been suggested that the contribution of each auditory channel is rather independent as used in articulation index model. From the point of view of the signal process perspective, we can diminish the noise caused variability by using spectral masking so that the noise signal between spectrum gaps is masked and limit the error propagation between frequency (auditory) channels. Based on this observation, we propose a novel robust feature extraction based spectrally masked channel energy ratio (SMaChER). We show the significant error reduction rate on noisy data has been achieved.
  • Keywords
    feature extraction; hidden Markov models; land mobile radio; noise; spectral analysis; speech intelligibility; speech recognition; HMM model training; SMaChER; articulation index model; auditory channel; auditory masking; error propagation; error reduction rate; frequency channels; noisy data; noisy mobile environment; robust feature extraction; spectral masking; spectrally masked channel energy ratio; speech perception study; speech recognition performance; speech sounds; Acoustic noise; Degradation; Feature extraction; Humans; Mobile computing; Robustness; Signal processing; Speech coding; Speech recognition; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1202288
  • Filename
    1202288