Regional Information Center for Science and Technology - Hands-free speaker identification based on spectral subtraction using a multi-channel least mean square approach

DocumentCode :

1689260

Title :

Hands-free speaker identification based on spectral subtraction using a multi-channel least mean square approach

Author :

Longbiao Wang ; Zhaofeng Zhang ; Atsuhiko Kai

Author_Institution :

Nagaoka Univ. of Technol., Nagaoka, Japan

Year :

2013

Firstpage :

7224

Lastpage :

7228

Abstract :

A dereverberation method based on generalized spectral subtraction (GSS) using a multi-channel least mean square (MCLMS) approach has achieved significantly better results in speech recognition experiments than conventional methods. In this study, we apply this method to hands-free speaker identification. With clean speech models, the GSS-based dereverberation method degrades speaker identification performance, although it is very effective for speech recognition. One reason may be that the GSS-based dereverberation method introduces distortion, so that the characteristics of dereverberant speech differ from those of clean speech. We address this problem by training speaker models on dereverberant speech, which is obtained by suppressing reverberation in arbitrary artificial reverberant speech. We also propose a method that combines various compensation parameter sets to improve speaker identification, and we provide an efficient computational method. The speaker identification experiment was performed on large-scale far-field speech, with reverberant environments different from the training environments. The proposed method achieved a relative error reduction of 87.5% compared with conventional cepstral mean normalization with beamforming using clean speech models, and 44.8% compared with reverberant speech models.
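
As a rough illustration of the spectral subtraction idea named in the abstract, the sketch below applies a generic per-bin generalized spectral subtraction with a spectral floor. It is not the authors' implementation: the record does not give their formulation, and the MCLMS step that would estimate the late-reverberation spectrum is not shown. The function name and the parameters `n`, `alpha`, and `beta` are assumptions chosen for illustration.

```python
def generalized_spectral_subtraction(observed, late_reverb, n=1.0, alpha=1.0, beta=0.01):
    """Per-bin generalized spectral subtraction on magnitude spectra.

    observed, late_reverb: equal-length lists of non-negative magnitudes
        for one frame (late_reverb is assumed to be estimated elsewhere).
    n: exponent; n=1.0 subtracts power spectra, n=0.5 subtracts magnitudes.
    alpha: over-subtraction factor (hypothetical tuning parameter).
    beta: spectral floor factor that keeps the result non-negative.
    """
    if len(observed) != len(late_reverb):
        raise ValueError("observed and late_reverb must have the same length")
    if n <= 0:
        raise ValueError("n must be positive")

    estimate = []
    for x, r in zip(observed, late_reverb):
        x_pow = x ** (2.0 * n)
        # Subtract the (scaled) reverberation estimate in the |.|^(2n) domain.
        diff = x_pow - alpha * r ** (2.0 * n)
        # Floor the result so over-subtraction never yields a negative value.
        floored = max(diff, beta * x_pow)
        # Return to the magnitude domain.
        estimate.append(floored ** (1.0 / (2.0 * n)))
    return estimate


if __name__ == "__main__":
    frame = [2.0, 1.0, 0.5]    # made-up observed magnitudes
    reverb = [1.0, 2.0, 0.1]   # made-up late-reverberation magnitudes
    print(generalized_spectral_subtraction(frame, reverb))
```

The floor matters in bins where the reverberation estimate exceeds the observed energy: without it the subtraction would go negative and the root would be undefined.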

Keywords :

array signal processing; least mean squares methods; reverberation; speaker recognition; arbitrary artificial reverberant speech; clean speech models; computational method; dereverberant speech; dereverberation method; distortion characteristics; generalized spectral subtraction; hands free speaker identification; large scale farfield speech; multichannel least mean square approach; speaker models; speech recognition; Abstracts; Arrays; Indexes; Marine vehicles; Microphones; Speech; Training; dereverberation; hands-free; multi-channel LMS; speaker identification; spectral subtraction;

Language :

English

Publisher :

IEEE

Conference_Title :

Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on

Conference_Location :

Vancouver, BC

ISSN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2013.6639065

Filename :

6639065

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1689260