DocumentCode :
3131871
Title :
Dialect identification based on VQ codebook design with GA-LBG algorithm
Author :
He, Yan ; Yu, Feng Qin
Author_Institution :
Sch. of Internet of Things Eng., Jiangnan Univ., Wuxi, China
Volume :
2
fYear :
2011
fDate :
20-21 Aug. 2011
Firstpage :
94
Lastpage :
97
Abstract :
In order to solve the problem of GA´s slow convergence in the VQ codebook design due to the strong global search ability and the complex computation, a hybrid algorithm based on the GA-LBG algorithm is adopted because the LBG algorithm as an iterative algorithm based on the nearest neighbor rule and the centroid rule owns the advantage of fast convergence. In the simulation experiment, MFCC extracted from Mandarin, Shanghainese, Cantonese and Hokkien are employed as the feature vectors to establish codebook models with GA-LBG for the dialect identification, and the recognition performances on different size of the VQ codebooks are studied. And simulation results demonstrate that the running time with GA-LBG reduces to 1066.4s, less than that with GA alone.
Keywords :
feature extraction; genetic algorithms; iterative methods; natural language processing; pattern clustering; search problems; vector quantisation; Cantonese; GA-LBG algorithm; Hokkien; MFCC extraction; Mandarin; Mel-frequency cepstrum coefficients; Shanghainese; VQ codebook design; centroid rule; complex computation; dialect identification; feature vector; genetic algorithm; global search ability; hybrid algorithm; iterative algorithm; nearest neighbor rule; vector quantization; Algorithm design and analysis; Clustering algorithms; Genetic algorithms; Mel frequency cepstral coefficient; Speech; Speech recognition; Training; GA-LBG algorithm; MFCC; dialect identification; vector quantization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computing, Control and Industrial Engineering (CCIE), 2011 IEEE 2nd International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-9599-3
Type :
conf
DOI :
10.1109/CCIENG.2011.6008075
Filename :
6008075
Link To Document :
بازگشت