Title :
Further feature extraction for speaker recognition
Author :
Ma, Zhiyou ; Yang, Yingcbun ; Wu, Zhaohui
Author_Institution :
Coll. of Comput. Sci. & Technol., Zhejiang Univ., Hangzhou, China
Abstract :
This thesis presents a method of extracting a new speaker´s voice features for the purpose of synthetically using the voice of the donor speaker. In the small speaker set, it seems good to recognize speaker by their voice by means of the traditional feature extraction. Nevertheless, the performance of recognizer usually depressed owning to the limited feature space, it is hard to deal with the increasing of speaker set to be recognized. Accordingly it proposes a novel feature extraction method, further feature extract (FFE), which is based on some measures such as weight, differential, combination and selection, are taken to explore those voice characteristics that can be used to distinguish different speakers. Experiment based on 138-person YOHO database demonstrates that better performance can be achieved by the proposed method.
Keywords :
Gaussian processes; feature extraction; linear predictive coding; principal component analysis; speaker recognition; speech synthesis; Gaussian mixture model; Mel-frequency cepstrum coefficients; feature extraction; linear predictive coding; principal component analysis; small speaker set; speaker recognition; voice characteristics; Additive noise; Automatic speech recognition; Cepstral analysis; Character recognition; Computer science; Educational institutions; Feature extraction; Mel frequency cepstral coefficient; Speaker recognition; Speech recognition;
Conference_Titel :
Systems, Man and Cybernetics, 2003. IEEE International Conference on
Print_ISBN :
0-7803-7952-7
DOI :
10.1109/ICSMC.2003.1245637