DocumentCode :
1694144
Title :
Confidence index dynamic time warping for language-independent embedded speech recognition
Author :
Xianglilan Zhang ; Jiping Sun ; Zhigang Luo ; Ming Li
Author_Institution :
Sch. of Comput., Nat. Univ. of Defense Technol., Changsha, China
fYear :
2013
Firstpage :
8066
Lastpage :
8070
Abstract :
Language-independent embedded speech recognition is a necessary and important application. Considering personal privacy, collection difficulty of all the reference words, and limited storage space of mobile devices, language-independent (LI) embedded speech recognition should be classified into lightweight speaker-dependent (SD) cases. Dynamic time warping (DTW) is the state-of-the-art algorithm for small foot-print SD automatic speech recognition. To decrease the high computational complexity of DTW, and to avoid constraints-induced coarse approximation and inaccuracy problems, we introduce a novel confidence index dynamic time warping (CIDTW) approach. CIDTW defines a new cost function, called the confidence index cost function (CICF), to measure the similarity between merged speech training and testing data, while follows the same DTW process. With extensive experiments on three representative SD datasets, CIDTW achieves better accuracy and overall six times faster speeds compared with DTW.
Keywords :
computational complexity; speech recognition; CICF; CIDTW; LI embedded speech recognition; SD datasets; computational complexity; confidence index cost function; confidence index dynamic time warping; language-independent embedded speech recognition; limited storage space; merged speech training data; mobile devices; personal privacy; small foot-print SD automatic speech recognition; speaker-dependent datasets; state-of-the-art algorithm; testing data; Accuracy; Indexes; Mel frequency cepstral coefficient; Speech; Speech recognition; Testing; Training data; confidence index DTW; confidence index cost function; language-independent and lightweight speaker-dependent speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6639236
Filename :
6639236
Link To Document :
بازگشت