Regional Information Center for Science and Technology (RICEST) - Single-channel speaker-pair identification: A new approach based on automatic frame selection

DocumentCode :

3163361

Title :

Single-channel speaker-pair identification: A new approach based on automatic frame selection

Author :

Srinivasan, Ramji ; Ming, Ji ; Crookes, Danny

Author_Institution :

Inst. of Electron., Commun. & Inf. Technol., Queen's Univ. Belfast, Belfast, UK

fYear :

2012

fDate :

25-30 March 2012

Firstpage :

4369

Lastpage :

4372

Abstract :

Given single-channel recordings of simultaneous speakers, we may need to identify the individual speakers in order to separate their voices. In this paper, we consider the problem of identifying two simultaneous speakers based on single-channel data, i.e., speaker-pair identification. We model the problem as identifying speakers using noisy speech with partial temporal corruption, where the corruption corresponds to the heavily mixed speech frames. Including these noisy frames damages the identification accuracy for both speakers. We propose a new approach to automatically and optimally select the single-speaker dominated speech frames for identification. The new algorithm has been evaluated using two databases: 1) the GRID speech separation database and 2) the Wall Street Journal (WSJ0) database. The new approach has shown better performance than other approaches. On the GRID database, for example, the new approach outperformed the state-of-the-art IBM approach in 5 out of 6 test conditions.

Keywords :

speaker recognition; GRID speech separation database; IBM approach; WSJ0 database; automatic frame selection; partial temporal corruption; single-channel data; single-channel speaker-pair identification; Wall Street Journal database; Accuracy; Databases; Gain; Hidden Markov models; Speech; Speech recognition; Tunneling magnetoresistance; speaker-pair identification; speech separation

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

Conference_Location :

Kyoto

ISSN :

1520-6149

Print_ISBN :

978-1-4673-0045-2

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2012.6288887

Filename :

6288887

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3163361