Title :
Articulatory feature detection with Support Vector Machines for integration into ASR and phone recognition
Author :
Chaudhari, Upendra V. ; Picheny, Michael
Author_Institution :
IBM T.J. Watson Res. Center, Yorktown Heights, NY, USA
fDate :
Nov. 13 2009-Dec. 17 2009
Abstract :
We study the use of support vector machines (SVM) for detecting the occurrence of articulatory features in speech audio data and using the information contained in the detector outputs to improve phone and speech recognition. Our expectation is that an SVM should be able to appropriately model the separation of the classes which may have complex distributions in feature space. We show that performance improves markedly when using discriminatively trained speaker dependent parameters for the SVM inputs, and compares quite well to results in the literature using other classifiers, namely artificial neural networks (ANN). Further, we show that the resulting detector outputs can be successfully integrated into a state of the art speech recognition system, with consequent performance gains. Notably, we test our system on English broadcast news data from dev04f.
Keywords :
speech recognition; support vector machines; articulatory feature detection; automatic speech recognition; phone recognition; speaker dependent parameter; speech audio data; support vector machine; Artificial neural networks; Automatic speech recognition; Broadcasting; Computer vision; Detectors; Performance gain; Speech recognition; Support vector machine classification; Support vector machines; System testing;
Conference_Titel :
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location :
Merano
Print_ISBN :
978-1-4244-5478-5
Electronic_ISBN :
978-1-4244-5479-2
DOI :
10.1109/ASRU.2009.5373326