• DocumentCode
    394280
  • Title

    Optimizing speech/non-speech classifier design using AdaBoost

  • Author

    Kwon, Oh-Wook ; Lee, Te-Won

  • Author_Institution
    Inst. for Neural Comput., Univ. of California, La Jolla, CA, USA
  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    We propose a new method to design speech/non-speech classifiers for voice activity detection and robust endpoint detection using the adaptive boosting (AdaBoost) algorithm. The method uses a combination of simple base classifiers through the AdaBoost algorithm and a set of optimized speech features combined with spectral subtraction. The key benefits of this method are the simple implementation and low computational complexity. The AdaBoost classifier combined with spectral subtraction significantly improved the receiver operating characteristic curves of the G.729 voice activity detector. For speech recognition purpose, the method reduced 20-50% of miss errors for the same false alarm rate by using additional band pass energy and spectral distortion based on mel frequency cepstral coefficients.
  • Keywords
    optimisation; pattern classification; speech recognition; AdaBoost; G.729 voice activity detector; adaptive boosting algorithm; band pass energy; computational complexity; false alarm rate; mel frequency cepstral coefficients; miss errors; optimized speech features; receiver operating characteristic curves; robust endpoint detection; spectral distortion; spectral subtraction; speech recognition purpose; speech/nonspeech classifiers; voice activity detection; Algorithm design and analysis; Boosting; Computational complexity; Design methodology; Design optimization; Detectors; Mel frequency cepstral coefficient; Optimization methods; Robustness; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1198811
  • Filename
    1198811