Title :
An application of discriminative feature extraction to filter-bank-based speech recognition
Author :
Biem, Alain ; Katagiri, Shigeru ; McDermott, Erik ; Juang, Biing-hwang
Author_Institution :
ATR Human INf. Processing Res. Lab., Kyoto Univ., Japan
fDate :
2/1/2001 12:00:00 AM
Abstract :
A pattern recognizer is usually a modular system which consists of a feature extractor module and a classifier module. Traditionally, these two modules have been designed separately, which may not result in an optimal recognition accuracy. To alleviate this fundamental problem, the authors have developed a design method, named discriminative feature extraction (DFE), that enables one to design the overall recognizer, i.e., both the feature extractor and the classifier, in a manner consistent with the objective of minimizing recognition errors. This paper investigates the application of this method to designing a speech recognizer that consists of a filter-hank feature extractor and a multi-prototype distance classifier. Carefully investigated experiments demonstrate that DFE achieves the design of a better recognizer and provides an innovative recognition-oriented analysis of the filter-bank, as an alternative to conventional analysis based on psychoacoustic expertise or heuristics
Keywords :
channel bank filters; feature extraction; pattern classification; speech recognition; classifier module; discriminative feature extraction; feature extractor module; filter-bank-based speech recognition; modular system; multi-prototype distance classifier; pattern recognizer; recognition errors minimisation; recognition-oriented analysis; Character recognition; Decision theory; Design methodology; Feature extraction; Humans; Laboratories; Maximum likelihood estimation; Pattern recognition; Psychology; Speech recognition;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on