مرکز منطقه ای اطلاع رساني علوم و فناوري - Reusable binary-paired partitioned neural networks for text-independent speaker identification

DocumentCode :

3591024

Title :

Reusable binary-paired partitioned neural networks for text-independent speaker identification

Author :

Zahorian, Stephen A.

Author_Institution :

Dept. of Electr. & Comput. Eng., Old Dominion Univ., Norfolk, VA, USA

Volume :

fYear :

1999

Firstpage :

849

Abstract :

A neural network algorithm for speaker identification with large groups of speakers is described. This technique is derived from a technique in which an N-way speaker identification task is partitioned into N*(N-1)/2 two-way classification tasks. Each two-way classification task is performed using a small neural network which is a two-way, or pair-wise, network. The decisions of these two-way networks are then combined to make the N-way speaker identification decision (Rudasi and Zahorian, 1991 and 1992). Although very accurate, this method has the drawback of requiring a very large number of pair-wise networks. In the new approach, two-way neural network classifiers, each of which is trained only to separate two speakers, are also used to separate other pairs of speakers. This method is able to greatly reduce the number of pair-wise classifiers required for making an N-way classification decision, especially when the number of speakers is very large. For 100 speakers extracted from the TIMIT database, the number of pair-wise classifiers can be reduced by approximately a factor of 5, with only minor degradation in performance when 3 seconds or more of speech is used for identification. Using all 630 speakers from the TIMIT database, this method can be used to obtain over 99.7% accuracy. With the telephone version of the same database, an accuracy of 40.2% can be obtained

Keywords :

neural nets; pattern classification; speaker recognition; N-way speaker identification task; pair-wise classifiers; reusable binary-paired partitioned neural networks; text-independent speaker identification; two-way classification tasks; Databases; Degradation; Neural networks; Partitioning algorithms; Pattern recognition; Speech; Telephony; Training data; Vector quantization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on

ISSN :

1520-6149

Print_ISBN :

0-7803-5041-3

Type :

conf

DOI :

10.1109/ICASSP.1999.759804

Filename :

759804

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3591024