Title :
Automatic speaker identification by means of Mel cepstrum, wavelets and wavelet packets
Abstract :
The present work consists on the use of delta cepstra coefficients in Mel scale, wavelet and wavelet packet transforms to feed a system for automatic speaker identification based on neural networks. Different alternatives are tested for the classifier based on neural nets, having achieved very good performance for closed groups of speakers in a text independent form. When a single neural net is used for all the speakers, the results decay abruptly with increasing number of speakers to identify. This takes to implement a system where there is one neural net for each speaker, which provided excellent results, compared with the opposing ones in the bibliography using other methods. This classifier structure possesses other advantages, for example, adding a new speaker to the system only requires to train a net for the speaker in question, in contrast with a system where the classifier is formed by a single great net, which should be in general be trained completely again
Keywords :
backpropagation; cepstral analysis; discrete wavelet transforms; neural nets; signal classification; speaker recognition; Hamming window; Mel cepstrum; automatic speaker identification; backpropagation; classifier structure; delta cepstra coefficients; discrete wavelet transforms; neural networks; temporal delays; text independent form; wavelet packets; Artificial neural networks; Automatic speech recognition; Cepstrum; Loudspeakers; Neural networks; Speech analysis; Speech recognition; Testing; Wavelet analysis; Wavelet packets;
Conference_Titel :
Engineering in Medicine and Biology Society, 2000. Proceedings of the 22nd Annual International Conference of the IEEE
Conference_Location :
Chicago, IL
Print_ISBN :
0-7803-6465-1
DOI :
10.1109/IEMBS.2000.897886