مرکز منطقه ای اطلاع رساني علوم و فناوري - Text-dependent speaker-recognition systems based on one-pass dynamic programming algorithm

DocumentCode :

2697241

Title :

Text-dependent speaker-recognition systems based on one-pass dynamic programming algorithm

Author :

Ramasubramanian, V. ; Kumar, Praveen V. ; Vijaywargiay, Deepak ; Harish, D. ; Thiyagarajan, S. ; Das, Aruneema

Author_Institution :

Siemens Corp. Technol., Bangalore

fYear :

2006

fDate :

28-30 June 2006

Firstpage :

Lastpage :

Abstract :

We propose variable-text text-dependent speaker-recognition systems based on the one-pass dynamic programming (DP) algorithm. The key feature of the proposed algorithm is its ability to use multiple templates for each of the words which form the `password´ text. The use of multiple templates allows the proposed system to capture the idiosyncratic intra-speaker variability of a word, resulting in significant improvement in the performance. Our algorithm also uses inter-word silence templates to handle continuous speech input. We use the proposed one-pass DP algorithm in three speaker-recognition systems, namely, closed-set speaker-identification (CSI), speaker-verification (SV) and open-set speaker-identification (OSI). These systems were evaluated on a 100 speaker and 200 speaker tasks using the TIDIGITS database and with various car noise conditions. The key result of this paper is that the use of multiple templates enhances the performance of all the three systems significantly -the use of multiple templates (in comparison to a single template) enhances the CSI performance from 94% to 100%, the SV EER from 1.6% to 0.09% and the OSI EER from 12.3% to 3.5% on a 100 speaker task. We also use the proposed one-pass DP for automatically extracting the multiple templates from continuous speech training data. The performance of the three systems using such automatically extracted multiple templates is as good as with manually extracted templates. Front-end noise suppression enables our systems to deliver robust performance in up to 0 dB car noise

Keywords :

dynamic programming; speaker recognition; speech processing; TIDIGITS database; continuous speech training data; front-end noise suppression; multiple template extraction; one-pass dynamic programming algorithm; password text; text-dependent speaker-recognition system; Data mining; Databases; Dynamic programming; Heuristic algorithms; Information systems; Open systems; Speech; Testing; Training data; Vocabulary;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Speaker and Language Recognition Workshop, 2006. IEEE Odyssey 2006: The

Conference_Location :

San Juan

Print_ISBN :

1-424400471-1

Electronic_ISBN :

1-4244-0472-X

Type :

conf

DOI :

10.1109/ODYSSEY.2006.248121

Filename :

4013538

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2697241