Title :
Dialect identification based on VQ codebook design with GA-LBG algorithm
Author :
He, Yan ; Yu, Feng Qin
Author_Institution :
Sch. of Internet of Things Eng., Jiangnan Univ., Wuxi, China
Abstract :
In order to solve the problem of GA´s slow convergence in the VQ codebook design due to the strong global search ability and the complex computation, a hybrid algorithm based on the GA-LBG algorithm is adopted because the LBG algorithm as an iterative algorithm based on the nearest neighbor rule and the centroid rule owns the advantage of fast convergence. In the simulation experiment, MFCC extracted from Mandarin, Shanghainese, Cantonese and Hokkien are employed as the feature vectors to establish codebook models with GA-LBG for the dialect identification, and the recognition performances on different size of the VQ codebooks are studied. And simulation results demonstrate that the running time with GA-LBG reduces to 1066.4s, less than that with GA alone.
Keywords :
feature extraction; genetic algorithms; iterative methods; natural language processing; pattern clustering; search problems; vector quantisation; Cantonese; GA-LBG algorithm; Hokkien; MFCC extraction; Mandarin; Mel-frequency cepstrum coefficients; Shanghainese; VQ codebook design; centroid rule; complex computation; dialect identification; feature vector; genetic algorithm; global search ability; hybrid algorithm; iterative algorithm; nearest neighbor rule; vector quantization; Algorithm design and analysis; Clustering algorithms; Genetic algorithms; Mel frequency cepstral coefficient; Speech; Speech recognition; Training; GA-LBG algorithm; MFCC; dialect identification; vector quantization;
Conference_Titel :
Computing, Control and Industrial Engineering (CCIE), 2011 IEEE 2nd International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-9599-3
DOI :
10.1109/CCIENG.2011.6008075