DocumentCode :
3486142
Title :
Minimum detection error training of subword detectors
Author :
Canterla, Alfonso M. ; Johnsen, Magne H.
Author_Institution :
Dept. of Electron. & Telecommun, NTNU, Trondheim, Norway
fYear :
2011
fDate :
11-15 Dec. 2011
Firstpage :
506
Lastpage :
511
Abstract :
This paper presents methods and results for optimizing subword detectors in continuous speech. Speech detectors are useful within areas like detection-based ASR, pronunciation training, phonetic analysis, word spotting, etc. We propose a new discriminative training criterion for subword unit detectors that is based on the Minimum Phone Error framework. The criterion can optimize the F-score or any other detection performance metric. The method is applied to the optimization of HMMs and MFCC filterbanks in phone detectors. The resulting filterbanks differ from each other and reflect acoustic properties of the corresponding detection classes. For the experiments in TIMIT, the best optimized detectors had a relative accuracy improvement of 31.3% over baseline and 18.2% over our previous MCE-based method.
Keywords :
acoustic signal processing; filtering theory; hidden Markov models; speech processing; F-score; HMM; MFCC filterbank; acoustic properties; continuous speech; detection performance metric; detection-based ASR; discriminative training criterion; minimum detection error training; minimum phone error framework; optimization; phone detector; phonetic analysis; pronunciation training; speech detector; subword unit detector; word spotting; Accuracy; Detectors; Feature extraction; Hidden Markov models; Optimization; Speech; Training; Detection; MPE; discriminative training; filterbank;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
Conference_Location :
Waikoloa, HI
Print_ISBN :
978-1-4673-0365-1
Electronic_ISBN :
978-1-4673-0366-8
Type :
conf
DOI :
10.1109/ASRU.2011.6163983
Filename :
6163983
Link To Document :
بازگشت