DocumentCode
2811411
Title
Automatic speaker identification by means of Mel cepstrum, wavelets and wavelet packets
Author
Rufiner, H.L.
Volume
2
fYear
2000
fDate
2000
Firstpage
978
Abstract
The present work consists on the use of delta cepstra coefficients in Mel scale, wavelet and wavelet packet transforms to feed a system for automatic speaker identification based on neural networks. Different alternatives are tested for the classifier based on neural nets, having achieved very good performance for closed groups of speakers in a text independent form. When a single neural net is used for all the speakers, the results decay abruptly with increasing number of speakers to identify. This takes to implement a system where there is one neural net for each speaker, which provided excellent results, compared with the opposing ones in the bibliography using other methods. This classifier structure possesses other advantages, for example, adding a new speaker to the system only requires to train a net for the speaker in question, in contrast with a system where the classifier is formed by a single great net, which should be in general be trained completely again
Keywords
backpropagation; cepstral analysis; discrete wavelet transforms; neural nets; signal classification; speaker recognition; Hamming window; Mel cepstrum; automatic speaker identification; backpropagation; classifier structure; delta cepstra coefficients; discrete wavelet transforms; neural networks; temporal delays; text independent form; wavelet packets; Artificial neural networks; Automatic speech recognition; Cepstrum; Loudspeakers; Neural networks; Speech analysis; Speech recognition; Testing; Wavelet analysis; Wavelet packets;
fLanguage
English
Publisher
ieee
Conference_Titel
Engineering in Medicine and Biology Society, 2000. Proceedings of the 22nd Annual International Conference of the IEEE
Conference_Location
Chicago, IL
ISSN
1094-687X
Print_ISBN
0-7803-6465-1
Type
conf
DOI
10.1109/IEMBS.2000.897886
Filename
897886
Link To Document