Title :
Speaker identification based on Gammatone cepstral coefficients and general regression neural network
Author :
Penghua Li ; Fangchao Hu ; Yinguo Li ; Baomei Qiu
Author_Institution :
Coll. of Autom., Chongqing Univ. of Posts & Telecommun., Chongqing, China
fDate :
May 31 2014-June 2 2014
Abstract :
This paper presents a speaker identification method using Gammatone cepstral coefficients extracted by Gammatone filters and a group of general regression neural networks. The Gammatone cepstral coefficients are adapted to the characteristics of speech signals through adjusting the Gammatone filter and the filter bank settings. To reduce the training data and time cost of the general regression neural network used as the classifier for speaker identification, the non-linear partition algorithm is employed to divide the Gammatone cepstral coefficients used as the speech features. In this sense, the speaker identification task is partitioned into a number of small tasks which can be operated by a group of general regression neural networks. Each recognition rate of these general regression neural networks is integrated into the final recognition rate of the speech signals. The results indicate that the proposed method has an acceptable recognition rate with high accuracy.
Keywords :
cepstral analysis; neural nets; regression analysis; speaker recognition; Gammatone cepstral coefficient; Gammatone filter; filter bank setting; general regression neural network; nonlinear partition algorithm; recognition rate; speaker identification method; speaker identification task; speech feature; speech signal; training data reduction; Feature extraction; Mel frequency cepstral coefficient; Neural networks; Speech; Speech recognition; Training; Gammatone Filter; General Regression Neural Network; Non-Linear Partition; Speaker Identification;
Conference_Titel :
Control and Decision Conference (2014 CCDC), The 26th Chinese
Conference_Location :
Changsha
Print_ISBN :
978-1-4799-3707-3
DOI :
10.1109/CCDC.2014.6852265