مرکز منطقه ای اطلاع رساني علوم و فناوري - Speaker indexing in large audio databases using anchor models

DocumentCode :

1749684

Title :

Speaker indexing in large audio databases using anchor models

Author :

Sturim, D.E. ; Reynolds, D.A. ; Singer, E. ; Campbell, J.P.

Author_Institution :

Lincoln Lab., MIT, Lexington, MA, USA

Volume :

fYear :

2001

fDate :

2001

Firstpage :

429

Abstract :

Introduces the technique of anchor modeling in the applications of speaker detection and speaker indexing. The anchor modeling algorithm is refined by pruning the number of models needed. The system is applied to the speaker detection problem where its performance is shown to fall short of the state-of-the-art Gaussian mixture model with universal background model (GMM-UBM) system. However, it is further shown that its computational efficiency lends itself to speaker indexing for searching large audio databases for desired speakers. Here, excessive computation may prohibit the use of the GMM-UBM recognition system. Finally, the paper presents a method for cascading anchor model and GMM-UBM detectors for speaker indexing. This approach benefits from the efficiency of anchor modeling and high accuracy of GMM-UBM recognition

Keywords :

database indexing; database management systems; probability; speaker recognition; Gaussian mixture model with universal background model system; anchor models; computational efficiency; large audio databases; speaker detection; speaker indexing; Audio databases; Computational efficiency; Contracts; Detectors; Embedded computing; Hidden Markov models; Indexing; Laboratories; Speaker recognition; Speech;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on

Conference_Location :

Salt Lake City, UT

ISSN :

1520-6149

Print_ISBN :

0-7803-7041-4

Type :

conf

DOI :

10.1109/ICASSP.2001.940859

Filename :

940859

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1749684