DocumentCode
578167
Title
An improved VQ based algorithm for recognizing speaker-independent isolated words
Author
Ma, Ding-ding ; Zeng, Xiao-qin
Author_Institution
Inst. of Intell. Sci. & Technol., Hohai Univ., Nanjing, China
Volume
2
fYear
2012
fDate
15-17 July 2012
Firstpage
792
Lastpage
796
Abstract
In this paper, an improved codebook generation algorithm called SLVQ (Speaker Level Vector quantization) is proposed, which can improve the recognition accuracy of speaker independent isolated words. Linde-Buzo-Gary (LBG) algorithm is the most commonly used codebook design method. The idea behind LBG is to find an optimal codebook that minimizes the distortion between the training words and the codebook. But this does not guarantee that the testing words also have minimum distortion as training words. To address the problem of producing poor codebook for testing words in speaker independent speech recognition, the proposed method makes use of the diversity of different speakers by randomly selecting some speakers and their pronounced words in the codebook design procedure to optimize codebooks. An evaluation experiment has been conducted to compare the speech recognition performance of the codebooks produced by the LBG, the LVQ (learning vector quantization), and the SLVQ. It is clearly shown that the SLVQ method performs better than the other two methods.
Keywords
learning (artificial intelligence); speaker recognition; vector quantisation; LBG algorithm; Linde-Buzo-Gary algorithm; SLVQ; VQ based algorithm; codebook design method; codebook generation algorithm; learning vector quantization; speaker independent speech recognition; speaker level vector quantization; speaker-independent isolated word recognition; testing words; training words; Abstracts; Convergence; Codebook generation; Speaker Independent; Speech recognition; Vector quantization;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics (ICMLC), 2012 International Conference on
Conference_Location
Xian
ISSN
2160-133X
Print_ISBN
978-1-4673-1484-8
Type
conf
DOI
10.1109/ICMLC.2012.6359026
Filename
6359026
Link To Document