DocumentCode
3527138
Title
Multiple time resolution analysis of speech signal using MCE training with application to speech recognition
Author
Dimopoulos, Spiros ; Potamianos, Alexandros ; Lussier, Eric-Fosler ; Lee, Chin-Hui
Author_Institution
Dept. of Electron. & Comput. Eng., Tech. Univ. of Crete, Chania
fYear
2009
fDate
19-24 April 2009
Firstpage
3801
Lastpage
3804
Abstract
In this paper, we propose two methods of multiple time-resolution analysis of speech and their application to automatic speech recognition (ASR). Constant frame-rate multi-scale analysis is proposed based on a box of multi-scale features. Then a variable rate analysis is proposed based on the selection of the optimal temporal resolution on the fly by a properly trained non-linear classifier unit. The classifier´s parameters are trained using the discriminative method of minimum classification error (MCE) training. We use the recently proposed conditional random fields (CRF) phonetic recognition system that effectively combines highly correlated features. Results are reported on a frame-wise classification task and also on TIMIT phone recognition task. Results show that (i) CRFs can effectively combine multi-scale features and (ii) MCE trained variable rate CRFs are competitive with the ldquoboxrdquo combination method.
Keywords
speech processing; speech recognition; TIMIT phone recognition task; automatic speech recognition; conditional random fields; frame-wise classification task; minimum classification error; multiple time resolution analysis; multiscale features; phonetic recognition; speech signal; Application software; Automatic speech recognition; Computer science; Hidden Markov models; Signal analysis; Signal resolution; Spectral analysis; Speech analysis; Speech processing; Speech recognition; ASR; Conditional Random Fields; MCE; Multiple Frame Rates; Variable Frame Rate;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location
Taipei
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2009.4960455
Filename
4960455
Link To Document