Title :
Generalized likelihood ratio discriminant analysis
Author :
Lee, Hung-Shin ; Chen, Berlin
fDate :
Nov. 13 2009-Dec. 17 2009
Abstract :
In the past several decades, classifier-independent front-end feature extraction, where the derivation of acoustic features is lightly associated with the back-end model training or classification, has been prominently used in various pattern recognition tasks, including automatic speech recognition (ASR). In this paper, we present a novel discriminative feature transformation, named generalized likelihood ratio discriminant analysis (GLRDA), on the basis of the likelihood ratio test (LRT). It attempts to seek a lower dimensional feature subspace by making the most confusing situation, described by the null hypothesis, as unlikely to happen as possible without the homoscedastic assumption on class distributions. We also show that the classical linear discriminant analysis (LDA) and its well-known extension - heteroscedastic linear discriminant analysis (HLDA) can be regarded as two special cases of our proposed method. The empirical class confusion information can be further incorporated into GLRDA for better recognition performance. Experimental results demonstrate that GLRDA and its variant can yield moderate performance improvements over HLDA and LDA for the large vocabulary continuous speech recognition (LVCSR) task.
Keywords :
speech recognition; statistical testing; vocabulary; automatic speech recognition; back end model training; classifier independent front end feature extraction; discriminative feature transformation; generalized likelihood ratio discriminant analysis; heteroscedastic linear discriminant analysis; large vocabulary continuous speech recognition task; likelihood ratio test; pattern recognition tasks; statistical hypothesis testing; Automatic speech recognition; Computer science; Information analysis; Light rail systems; Linear discriminant analysis; Pattern analysis; Pattern recognition; Speech analysis; Speech recognition; Vocabulary;
Conference_Titel :
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location :
Merano
Print_ISBN :
978-1-4244-5478-5
Electronic_ISBN :
978-1-4244-5479-2
DOI :
10.1109/ASRU.2009.5373392