• DocumentCode
    1437247
  • Title

    An application of discriminative feature extraction to filter-bank-based speech recognition

  • Author

    Biem, Alain ; Katagiri, Shigeru ; McDermott, Erik ; Juang, Biing-hwang

  • Author_Institution
    ATR Human INf. Processing Res. Lab., Kyoto Univ., Japan
  • Volume
    9
  • Issue
    2
  • fYear
    2001
  • fDate
    2/1/2001 12:00:00 AM
  • Firstpage
    96
  • Lastpage
    110
  • Abstract
    A pattern recognizer is usually a modular system which consists of a feature extractor module and a classifier module. Traditionally, these two modules have been designed separately, which may not result in an optimal recognition accuracy. To alleviate this fundamental problem, the authors have developed a design method, named discriminative feature extraction (DFE), that enables one to design the overall recognizer, i.e., both the feature extractor and the classifier, in a manner consistent with the objective of minimizing recognition errors. This paper investigates the application of this method to designing a speech recognizer that consists of a filter-hank feature extractor and a multi-prototype distance classifier. Carefully investigated experiments demonstrate that DFE achieves the design of a better recognizer and provides an innovative recognition-oriented analysis of the filter-bank, as an alternative to conventional analysis based on psychoacoustic expertise or heuristics
  • Keywords
    channel bank filters; feature extraction; pattern classification; speech recognition; classifier module; discriminative feature extraction; feature extractor module; filter-bank-based speech recognition; modular system; multi-prototype distance classifier; pattern recognizer; recognition errors minimisation; recognition-oriented analysis; Character recognition; Decision theory; Design methodology; Feature extraction; Humans; Laboratories; Maximum likelihood estimation; Pattern recognition; Psychology; Speech recognition;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.902277
  • Filename
    902277