DocumentCode :
2697241
Title :
Text-dependent speaker-recognition systems based on one-pass dynamic programming algorithm
Author :
Ramasubramanian, V. ; Kumar, Praveen V. ; Vijaywargiay, Deepak ; Harish, D. ; Thiyagarajan, S. ; Das, Aruneema
Author_Institution :
Siemens Corp. Technol., Bangalore
fYear :
2006
fDate :
28-30 June 2006
Firstpage :
1
Lastpage :
8
Abstract :
We propose variable-text text-dependent speaker-recognition systems based on the one-pass dynamic programming (DP) algorithm. The key feature of the proposed algorithm is its ability to use multiple templates for each of the words which form the `password´ text. The use of multiple templates allows the proposed system to capture the idiosyncratic intra-speaker variability of a word, resulting in significant improvement in the performance. Our algorithm also uses inter-word silence templates to handle continuous speech input. We use the proposed one-pass DP algorithm in three speaker-recognition systems, namely, closed-set speaker-identification (CSI), speaker-verification (SV) and open-set speaker-identification (OSI). These systems were evaluated on a 100 speaker and 200 speaker tasks using the TIDIGITS database and with various car noise conditions. The key result of this paper is that the use of multiple templates enhances the performance of all the three systems significantly -the use of multiple templates (in comparison to a single template) enhances the CSI performance from 94% to 100%, the SV EER from 1.6% to 0.09% and the OSI EER from 12.3% to 3.5% on a 100 speaker task. We also use the proposed one-pass DP for automatically extracting the multiple templates from continuous speech training data. The performance of the three systems using such automatically extracted multiple templates is as good as with manually extracted templates. Front-end noise suppression enables our systems to deliver robust performance in up to 0 dB car noise
Keywords :
dynamic programming; speaker recognition; speech processing; TIDIGITS database; continuous speech training data; front-end noise suppression; multiple template extraction; one-pass dynamic programming algorithm; password text; text-dependent speaker-recognition system; Data mining; Databases; Dynamic programming; Heuristic algorithms; Information systems; Open systems; Speech; Testing; Training data; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Speaker and Language Recognition Workshop, 2006. IEEE Odyssey 2006: The
Conference_Location :
San Juan
Print_ISBN :
1-424400471-1
Electronic_ISBN :
1-4244-0472-X
Type :
conf
DOI :
10.1109/ODYSSEY.2006.248121
Filename :
4013538
Link To Document :
بازگشت