Title :
Minimum detection error training of subword detectors
Author :
Canterla, Alfonso M. ; Johnsen, Magne H.
Author_Institution :
Dept. of Electron. & Telecommun, NTNU, Trondheim, Norway
Abstract :
This paper presents methods and results for optimizing subword detectors in continuous speech. Speech detectors are useful within areas like detection-based ASR, pronunciation training, phonetic analysis, word spotting, etc. We propose a new discriminative training criterion for subword unit detectors that is based on the Minimum Phone Error framework. The criterion can optimize the F-score or any other detection performance metric. The method is applied to the optimization of HMMs and MFCC filterbanks in phone detectors. The resulting filterbanks differ from each other and reflect acoustic properties of the corresponding detection classes. For the experiments in TIMIT, the best optimized detectors had a relative accuracy improvement of 31.3% over baseline and 18.2% over our previous MCE-based method.
Keywords :
acoustic signal processing; filtering theory; hidden Markov models; speech processing; F-score; HMM; MFCC filterbank; acoustic properties; continuous speech; detection performance metric; detection-based ASR; discriminative training criterion; minimum detection error training; minimum phone error framework; optimization; phone detector; phonetic analysis; pronunciation training; speech detector; subword unit detector; word spotting; Accuracy; Detectors; Feature extraction; Hidden Markov models; Optimization; Speech; Training; Detection; MPE; discriminative training; filterbank;
Conference_Titel :
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
Conference_Location :
Waikoloa, HI
Print_ISBN :
978-1-4673-0365-1
Electronic_ISBN :
978-1-4673-0366-8
DOI :
10.1109/ASRU.2011.6163983