DocumentCode :
3191576
Title :
An F-Ratio Based Optimization on Noisy Data for Speaker Recognition Application
Author :
Saha, Goutam ; Senapati, Suman ; Chakroborty, Sandipan
Author_Institution :
Department of Electronics and Electrical Communication Engineering Indian Institute of Technology, Kharagpur, India, Kharagpur-721 302, Email: gsaha@ece.iitkgp.ernet.in Telephone: +91-3222-283556/1470, FAX: +91-3222-255303
fYear :
2005
fDate :
11-13 Dec. 2005
Firstpage :
352
Lastpage :
355
Abstract :
Automatic Speaker Recognition (ASR) needs a robust acoustic feature for representation of speaker and an efficient modeling scheme to yield high recognition accuracy even at adverse conditions. This paper presents a noise study of an ASR system using Mel-Frequency Cepstral Coefficients (MFCC) and an Artificial Neural Network (ANN) classifier. Optimization in feature space using Fisher´s F-Ratio score is done in order to develop reduced speaker model in no noise (only ambient room noise is present) as well as in several noisy conditions. A new ranking scheme is also proposed in order to stabilize the rank of features in various noise levels by taking Arithmetic Mean of the F-Ratio scores obtained from various levels of Signal to Noise Ratio (SNR). The result is presented for a Text-Dependent ASR system with 25 speaker database.
Keywords :
ANN; ASR; Average F-Ratio; F-Ratio; MFCC; Acoustic noise; Artificial neural networks; Automatic speech recognition; Cepstral analysis; Loudspeakers; Mel frequency cepstral coefficient; Noise reduction; Noise robustness; Signal to noise ratio; Speaker recognition; ANN; ASR; Average F-Ratio; F-Ratio; MFCC;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
INDICON, 2005 Annual IEEE
Print_ISBN :
0-7803-9503-4
Type :
conf
DOI :
10.1109/INDCON.2005.1590188
Filename :
1590188
Link To Document :
بازگشت