• DocumentCode
    290351
  • Title

    Optimization of time-frequency masking filters using the minimum classification error criterion

  • Author

    Bacchiani, Michiel ; Aikawa, Kiyoaki

  • Author_Institution
    ATR Human Inf. Process. Res. Labs, Kyoto, Japan
  • Volume
    ii
  • fYear
    1994
  • fDate
    19-22 Apr 1994
  • Abstract
    The dynamic cepstrum parameter representing a masked spectrum performed extremely well in continuous speech recognition. This paper proposes a new algorithm for optimizing the dynamic cepstrum lifter array. The masking filter is represented by a set of Gaussian-shaped lifters. The standard deviation and the gain of the Gaussians are trained in order to improve the performance of the time-frequency filter. Parameterizing the lifter shape provides robustness against unknown speech samples. Because of the parameterized lifter´s small degree of freedom, it can avoid over-learning. The gradient descent optimizing algorithm is formulated for both a neural network classifier and an HMM classifier. The optimized dynamic cepstrum successfully improved the speech recognition performance for the speech spoken even in a different speaking style
  • Keywords
    cepstral analysis; filtering theory; hidden Markov models; multilayer perceptrons; optimisation; pattern classification; speech recognition; time-frequency analysis; Gaussian-shaped lifters; HMM classifier; continuous speech recognition; dynamic cepstrum lifter array; dynamic cepstrum parameter; four-layered TDNN; gain; gradient descent optimizing algorithm; masked spectrum; minimum classification error; neural network classifier; optimized dynamic cepstrum; speech recognition performance; speech samples; standard deviation; time delay neural network; time-frequency masking filters; Cepstrum; Filters; Gaussian processes; Hidden Markov models; Neural networks; Performance gain; Robustness; Shape; Speech recognition; Time frequency analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
  • Conference_Location
    Adelaide, SA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-1775-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1994.389685
  • Filename
    389685