DocumentCode
3295716
Title
A text-independent speaker identification system using PARCOR and AR model
Author
Liu, Chia-Hsiung ; Chen, Oscal T.-C.
Author_Institution
Dept. of Electr. Eng., Nat. Chung Cheng Univ., Chia-Yi, Taiwan
Volume
3
fYear
2002
fDate
4-7 Aug. 2002
Abstract
In this work, we propose a partial-correlation (PARCOR) coefficient scheme to model the cross-sectional areas of the cylinders used to represent the vocal tract. Because acoustic impedance is proportional to the reciprocal of the cross-sectional area, the ratios of the cross-sectional areas of neighboring cylinders are used to model a speaker's vocal tract. An autoregressive (AR) model is applied to the speech residual signals, which are produced by the PARCOR-based inverse vocal tract transform, to generate features. These features, together with conventional Mel-Frequency Cepstral Coefficient (MFCC) features, are fed to a Gaussian Mixture Model (GMM) identification engine. According to our computer analyses on the TIMIT speech database, the proposed system yields better identification performance than the conventional approach.
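The feature pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the model orders (`vt_order`, `res_order`), the autocorrelation-method Levinson-Durbin recursion, and the sign convention in the area-ratio formula are all assumptions chosen for illustration. The MFCC and GMM stages are omitted.

```python
import numpy as np

def autocorr(x, order):
    """Biased autocorrelation r[0..order] of a 1-D frame."""
    x = np.asarray(x, dtype=float)
    return np.array([np.dot(x[:len(x) - lag], x[lag:]) for lag in range(order + 1)])

def levinson_durbin(r, order):
    """Levinson-Durbin recursion.

    Returns (a, k, err): the prediction-error filter a (with a[0] = 1),
    the reflection (PARCOR) coefficients k, and the final error energy.
    """
    a = np.zeros(order + 1)
    a[0] = 1.0
    k = np.zeros(order)
    err = r[0]
    for m in range(1, order + 1):
        # acc = r[m] + sum_{i=1}^{m-1} a[i] * r[m-i]
        acc = r[m] + np.dot(a[1:m], r[m - 1:0:-1])
        km = -acc / err
        k[m - 1] = km
        a_prev = a.copy()
        for i in range(1, m):
            a[i] = a_prev[i] + km * a_prev[m - i]
        a[m] = km
        err *= 1.0 - km * km
    return a, k, err

def frame_features(frame, vt_order=10, res_order=4):
    """Hypothetical per-frame feature vector: area ratios + residual AR coefficients."""
    a, k, _ = levinson_durbin(autocorr(frame, vt_order), vt_order)
    # Assumed convention: ratio of neighboring tube areas = (1 - k) / (1 + k).
    # Other texts use the opposite sign for k, which inverts the ratio.
    area_ratios = (1.0 - k) / (1.0 + k)
    # Inverse vocal tract filtering: e[n] = sum_i a[i] * x[n - i].
    residual = np.convolve(frame, a)[:len(frame)]
    # Second AR model fitted to the residual signal.
    a_res, _, _ = levinson_durbin(autocorr(residual, res_order), res_order)
    return np.concatenate([area_ratios, a_res[1:]])
```

Under these assumptions, `frame_features(frame)` would be called on each windowed speech frame, and the resulting vectors would be concatenated with MFCC features before GMM training.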
Keywords
acoustic impedance; autoregressive processes; cepstral analysis; speaker recognition; AR model; Gaussian Mixture Model; Mel-Frequency Cepstral Coefficient; PARCOR; TIMIT speech database; autoregressive model; cross areas; partial-correlation coefficients scheme; speech residual signals; text-independent speaker identification system; vocal tract; Cepstral analysis; Engine cylinders; Impedance; Laboratories; Linear predictive coding; Signal analysis; Signal generators; Speech analysis; Speech recognition; Timing
fLanguage
English
Publisher
IEEE
Conference_Titel
Circuits and Systems, 2002. MWSCAS-2002. The 2002 45th Midwest Symposium on
Print_ISBN
0-7803-7523-8
Type
conf
DOI
10.1109/MWSCAS.2002.1187040
Filename
1187040