DocumentCode :
578167
Title :
An improved VQ based algorithm for recognizing speaker-independent isolated words
Author :
Ma, Ding-ding ; Zeng, Xiao-qin
Author_Institution :
Inst. of Intell. Sci. & Technol., Hohai Univ., Nanjing, China
Volume :
2
fYear :
2012
fDate :
15-17 July 2012
Firstpage :
792
Lastpage :
796
Abstract :
In this paper, an improved codebook generation algorithm called SLVQ (Speaker Level Vector quantization) is proposed, which can improve the recognition accuracy of speaker independent isolated words. Linde-Buzo-Gary (LBG) algorithm is the most commonly used codebook design method. The idea behind LBG is to find an optimal codebook that minimizes the distortion between the training words and the codebook. But this does not guarantee that the testing words also have minimum distortion as training words. To address the problem of producing poor codebook for testing words in speaker independent speech recognition, the proposed method makes use of the diversity of different speakers by randomly selecting some speakers and their pronounced words in the codebook design procedure to optimize codebooks. An evaluation experiment has been conducted to compare the speech recognition performance of the codebooks produced by the LBG, the LVQ (learning vector quantization), and the SLVQ. It is clearly shown that the SLVQ method performs better than the other two methods.
Keywords :
learning (artificial intelligence); speaker recognition; vector quantisation; LBG algorithm; Linde-Buzo-Gary algorithm; SLVQ; VQ based algorithm; codebook design method; codebook generation algorithm; learning vector quantization; speaker independent speech recognition; speaker level vector quantization; speaker-independent isolated word recognition; testing words; training words; Abstracts; Convergence; Codebook generation; Speaker Independent; Speech recognition; Vector quantization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics (ICMLC), 2012 International Conference on
Conference_Location :
Xian
ISSN :
2160-133X
Print_ISBN :
978-1-4673-1484-8
Type :
conf
DOI :
10.1109/ICMLC.2012.6359026
Filename :
6359026
Link To Document :
بازگشت