مرکز منطقه ای اطلاع رساني علوم و فناوري - Continuous probabilistic acoustic map for speaker identification

DocumentCode :

3233769

Title :

Continuous probabilistic acoustic map for speaker identification

Author :

Tseng, Belle L. ; Soong, Frank K. ; Rosenberg, Aaron E.

Author_Institution :

MIT, Cambridge, MA, USA

Volume :

fYear :

1992

fDate :

23-26 Mar 1992

Firstpage :

161

Abstract :

A continuous probabilistic acoustic map (CPAM) approach to speaker recognition is investigated. In the CPAM formulation, the speech input of a speaker is parameterized as a mixture of tied, universal probability density functions (PDFs) with either a CPAM model alone for text-independent operation or a CPAM-based hidden Markov model (HMM) for text-dependent operation. A continuously spoken digit database of 20 speakers (10 M, 10 F) is used to evaluate the CPAM approach in both identification and verification performance. The CPAM approach is shown to perform better than a vector quantization based approach in text-independent speaker recognition, and as well as the text-dependent, conventional, continuous mixture HMM approach with significant representation efficiency. In particular, the CPAM-based HMM achieves an identification error rate of 1.7% and a verification equal-error rate of 4.0% with a CPAM of 128 PDFs while a conventional, continuous mixtures HMM needs 400 PDFs to achieve corresponding error rates of 1.9% and 4.0% using the same combined cepstral features and three-digit test utterances

Keywords :

hidden Markov models; speech recognition; HMM; cepstral features; continuous probabilistic acoustic map; continuously spoken digit database; hidden Markov model; identification error rate; speaker identification; speaker recognition; text-dependent operation; text-independent operation; three-digit test utterances; universal probability density functions; verification equal-error rate; Cepstral analysis; Databases; Error analysis; Hidden Markov models; Loudspeakers; Probability density function; Speaker recognition; Speech; Testing; Vector quantization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on

Conference_Location :

San Francisco, CA

ISSN :

1520-6149

Print_ISBN :

0-7803-0532-9

Type :

conf

DOI :

10.1109/ICASSP.1992.226095

Filename :

226095

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3233769