DocumentCode :
3295716
Title :
A text-independent speaker identification system using PARCOR and AR model
Author :
Liu, Chia-Hsiung ; Chen, Oscal T.-C.
Author_Institution :
Dept. of Electr. Eng., Nat. Chung Cheng Univ., Chia-Yi, Taiwan
Volume :
3
fYear :
2002
fDate :
4-7 Aug. 2002
Abstract :
In this work, we propose the partial-correlation (PARCOR) coefficients scheme to model the cross areas of the several cylinders from the vocal tract. By using the relationship of the acoustic impedance proportional to the reciprocal of cross areas, the ratios of cross areas between each neighboring cylinders are used to model a speaker´s vocal tract. The autoregressive model (AR model) is performed on the speech residual signals, that are produced from the inverse vocal tract transform based on the PARCOR, to generate features. These features with the conventional features from the Mel-Frequency Cepstral Coefficient (MFCC) are used for the identification engine of the Gaussian Mixture Model (GMM). According to our computer analyses in the TIMIT speech database, the proposed system can yield better identification performance than the conventional approach.
Keywords :
acoustic impedance; autoregressive processes; cepstral analysis; speaker recognition; AR model; Gaussian Mixture Model; Mel-Frequency Cepstral Coefficient; PARCOR; TIMIT speech database; acoustic impedance; autoregressive model; cross areas; partial-correlation coefficients scheme; speech residual signals; text-independent speaker identification system; vocal tract; Cepstral analysis; Engine cylinders; Impedance; Laboratories; Linear predictive coding; Signal analysis; Signal generators; Speech analysis; Speech recognition; Timing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Circuits and Systems, 2002. MWSCAS-2002. The 2002 45th Midwest Symposium on
Print_ISBN :
0-7803-7523-8
Type :
conf
DOI :
10.1109/MWSCAS.2002.1187040
Filename :
1187040
Link To Document :
بازگشت