Title :
A comparison of speaker identification results using features based on cepstrum and Fourier-Bessel expansion
Author :
Gopalan, K. ; Anderson, Timothy R. ; Cupples, Edward J.
Author_Institution :
Purdue Univ., Hammond, IN, USA
fDate :
5/1/1999
Abstract :
A compact representation of speech is possible using Bessel functions because voiced speech resembles them: both exhibit quasiperiodicity and an amplitude that decays with time. This paper presents the results of speaker identification experiments using features obtained from (1) the Fourier-Bessel expansion and (2) the cepstral representation of speech frames. Using features based on the J1(t) expansion, identification scores of 65% and 76% were achieved on air-to-ground speech transmission databases of 143 and 1054 test utterances, respectively. The corresponding scores for the two databases using cepstral coefficients of comparable size were 80% and 88%. A comparison of the two sets of features indicates that J1(t) can be used to model hearing perception much like the mel cepstral coefficients.
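The abstract does not give the expansion formula, so the following is only a minimal sketch of how first-order Fourier-Bessel coefficients of a single frame might be computed. It assumes the standard series x(t) = sum_m C_m J1(lambda_m t / a) on 0 <= t <= a, with lambda_m the m-th positive zero of J1 and C_m = 2 / (a^2 J2(lambda_m)^2) times the integral of t x(t) J1(lambda_m t / a) over [0, a]. The function names (bessel_j, j1_zero, fourier_bessel_coeffs), the normalization a = 1, and the numerical methods are illustrative assumptions, not the authors' implementation.

```python
import math


def bessel_j(n, x, steps=200):
    """Integer-order J_n(x) from Bessel's integral
    J_n(x) = (1/pi) * integral_0^pi cos(n*tau - x*sin(tau)) d tau,
    evaluated with the trapezoidal rule (adequate for moderate x)."""
    h = math.pi / steps
    # Endpoint values: cos(0) = 1 at tau = 0 and cos(n*pi) at tau = pi.
    total = 0.5 * (1.0 + math.cos(n * math.pi))
    for k in range(1, steps):
        tau = k * h
        total += math.cos(n * tau - x * math.sin(tau))
    return total * h / math.pi


def j1_zero(m, tol=1e-12):
    """m-th positive zero of J_1 (m = 1, 2, ...) found by bisection,
    bracketed around the asymptotic estimate (m + 1/4) * pi."""
    guess = (m + 0.25) * math.pi
    lo, hi = guess - 0.5, guess + 0.5
    f_lo = bessel_j(1, lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = bessel_j(1, mid)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def fourier_bessel_coeffs(frame, num_coeffs):
    """First num_coeffs Fourier-Bessel (J_1) coefficients of one frame.
    The frame duration is normalized to a = 1 (an assumption made here)."""
    n = len(frame)
    a = 1.0
    dt = a / (n - 1)
    coeffs = []
    for m in range(1, num_coeffs + 1):
        lam = j1_zero(m)
        integral = 0.0
        for i, x in enumerate(frame):
            t = i * dt
            weight = 0.5 if i in (0, n - 1) else 1.0  # trapezoidal weights
            integral += weight * t * x * bessel_j(1, lam * t / a)
        integral *= dt
        coeffs.append(2.0 * integral / (a * a * bessel_j(2, lam) ** 2))
    return coeffs
```

Calling fourier_bessel_coeffs(frame, num_coeffs) on a list of frame samples returns one coefficient per basis function; a feature vector of the kind the abstract compares against cepstral coefficients could then be formed from the leading coefficients. A production version would more likely take the Bessel functions and their zeros from a numerical library than compute them this way.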
Keywords :
Bessel functions; cepstral analysis; feature extraction; hearing; signal representation; speaker recognition; Fourier-Bessel expansion; air-to-ground speech transmission databases; cepstral coefficients; cepstral features; cepstral representation; cepstrum; compact speech representation; decaying amplitude; experiments; hearing perception; identification scores; mel cepstral coefficients; quasiperiodicity functions; speaker identification results; test utterances; voiced speech; Auditory system; Fourier series; Fourier transforms; Laboratories; Signal representations; Spatial databases; Speech analysis; Testing
Journal_Title :
Speech and Audio Processing, IEEE Transactions on