DocumentCode :
2252850
Title :
Speaker identification using FrFT-based spectrogram and RBF neural network
Author :
Li, Penghua ; Li, Yuanyuan ; Luo, Dechao ; Luo, Hongping
Author_Institution :
Automotive Electronics Engineering Research Center, College of Automation, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China
fYear :
2015
fDate :
28-30 July 2015
Firstpage :
3674
Lastpage :
3679
Abstract :
This paper address a speaker identification problem using optimized spectrogram and radial basis function (RBF) neural network. The proposed approach applies fractional Fourier transform (FrFT) to obtain spectrograms with different orders, which gives much more refined description of the speech signals. To reduce the computational complexity, these spectrograms are converted into low-dimensional vectors by local binary patterns (LBP) operator. The LBP vectors compose the searching space of particle swarm optimization (PSO) algorithm which is designed for find the optimal spectrogram. The fitness function of PSO algorithm is designed by between-class distances and within-class distances. Through getting the optimal LBP vectors, the similarity criterion is used to find the fractional orders corresponding to the optimal spectrograms. Then, the optimal speech features are fed to the RBF network for training and testing. The numerical experiments indicate that our approach has an acceptable recognition rate with high accuracy.
Keywords :
Feature extraction; Fourier transforms; Neural networks; Spectrogram; Speech; Testing; Training; Fractional Fourier Transform; Radial Basis Function Neural Network; Speaker Identification; Spectrogram;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Control Conference (CCC), 2015 34th Chinese
Conference_Location :
Hangzhou, China
Type :
conf
DOI :
10.1109/ChiCC.2015.7260207
Filename :
7260207
Link To Document :
بازگشت