• DocumentCode
    3295716
  • Title
    A text-independent speaker identification system using PARCOR and AR model
  • Author
    Liu, Chia-Hsiung ; Chen, Oscal T.-C.
  • Author_Institution
    Dept. of Electr. Eng., Nat. Chung Cheng Univ., Chia-Yi, Taiwan
  • Volume
    3
  • fYear
    2002
  • fDate
    4-7 Aug. 2002
  • Abstract
    In this work, we propose a partial-correlation (PARCOR) coefficient scheme to model the cross-sectional areas of the cylinders that approximate the vocal tract. Because acoustic impedance is proportional to the reciprocal of the cross-sectional area, the ratios of cross-sectional areas between neighboring cylinders are used to model a speaker's vocal tract. An autoregressive (AR) model is then fitted to the speech residual signals, which are produced by the PARCOR-based inverse vocal tract transform, to generate features. These features, together with conventional Mel-Frequency Cepstral Coefficient (MFCC) features, are fed to a Gaussian Mixture Model (GMM) identification engine. According to our computer analyses on the TIMIT speech database, the proposed system yields better identification performance than the conventional approach.
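    The feature pipeline summarized in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the function names, the model orders (10 for the vocal tract, 4 for the residual), and the sign convention k = (A_next - A_prev) / (A_next + A_prev) for turning PARCOR coefficients into area ratios are all assumptions, since the abstract specifies none of them. Other texts flip the sign or direction of k, which inverts the ratio. The MFCC extraction and GMM scoring stages are omitted.

```python
def autocorr(x, max_lag):
    """Biased autocorrelation r[0..max_lag] of a list of floats."""
    n = len(x)
    return [sum(x[i] * x[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Levinson-Durbin recursion.

    Returns (a, k, err) where a[0..order] is the prediction-error filter
    (a[0] == 1), k holds the reflection (PARCOR) coefficients, and err is
    the final prediction-error energy.
    """
    a = [1.0] + [0.0] * order
    k = []
    err = r[0]
    for m in range(1, order + 1):
        acc = r[m] + sum(a[i] * r[m - i] for i in range(1, m))
        km = -acc / err
        k.append(km)
        prev = a[:]
        for i in range(1, m):
            a[i] = prev[i] + km * prev[m - i]
        a[m] = km
        err *= 1.0 - km * km
    return a, k, err


def area_ratios(k):
    """Area ratio A_next / A_prev per junction (assumed sign convention)."""
    return [(1.0 + ki) / (1.0 - ki) for ki in k]


def residual(x, a):
    """Inverse-filter x with a: e[n] = sum_i a[i] * x[n - i], zero history."""
    return [sum(a[i] * x[n - i] for i in range(len(a)) if n - i >= 0)
            for n in range(len(x))]


def frame_features(frame, vt_order=10, res_order=4):
    """Area ratios of the vocal-tract model plus AR coefficients of the residual.

    vt_order and res_order are hypothetical defaults chosen for illustration.
    """
    a, k, _ = levinson_durbin(autocorr(frame, vt_order), vt_order)
    e = residual(frame, a)
    a_res, _, _ = levinson_durbin(autocorr(e, res_order), res_order)
    return area_ratios(k) + a_res[1:]
```

    In a full system, a vector like the one returned by `frame_features` would be concatenated with the MFCC vector of the same frame before being passed to the per-speaker GMMs.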
  • Keywords
    acoustic impedance; autoregressive processes; cepstral analysis; speaker recognition; AR model; Gaussian Mixture Model; Mel-Frequency Cepstral Coefficient; PARCOR; TIMIT speech database; autoregressive model; cross areas; partial-correlation coefficients scheme; speech residual signals; text-independent speaker identification system; vocal tract; Cepstral analysis; Engine cylinders; Impedance; Laboratories; Linear predictive coding; Signal analysis; Signal generators; Speech analysis; Speech recognition; Timing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Circuits and Systems, 2002. MWSCAS-2002. The 2002 45th Midwest Symposium on
  • Print_ISBN
    0-7803-7523-8
  • Type
    conf
  • DOI
    10.1109/MWSCAS.2002.1187040
  • Filename
    1187040