DocumentCode
730675
Title
Weighted training for speech under Lombard Effect for speaker recognition
Author
Saleem, Muhammad Muneeb ; Gang Liu ; Hansen, John H. L.
Author_Institution
Center for Robust Speech Syst. (CRSS), Univ. of Texas at Dallas, Richardson, TX, USA
fYear
2015
fDate
19-24 April 2015
Firstpage
4350
Lastpage
4354
Abstract
The presence of Lombard Effect in speech is proven to have severe effects on the performance of speech systems, especially speaker recognition. Varying kinds of Lombard speech are produced by speakers under influence of varying noise types [1]. This study proposes a high-accuracy classifier using deep neural networks for detecting various kinds of Lombard speech against neutral speech, independent of the noise levels causing the Lombard Effect. Lombard Effect detection accuracies as high as 95.7% are achieved using this novel model. The deep neural network based classification is further exploited by validation based weighted training of robust i-Vector based speaker identification systems. The proposed weighted training achieves a relative EER improvement of 28.4% over an i-Vector baseline system, confirming the effectiveness of deep neural networks in modeling Lombard Effect.
Keywords
acoustic noise; neural nets; speaker recognition; speech intelligibility; Lombard effect; Lombard speech; deep neural networks; neutral speech; noise levels; relative EER improvement; robust i-Vector; speaker identification; speaker recognition; speech; Accuracy; Neural networks; Noise; Speech; Speech recognition; Training; Training data; Lombard Effect; deep neural networks; robust; speaker identification; weighted training;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location
South Brisbane, QLD
Type
conf
DOI
10.1109/ICASSP.2015.7178792
Filename
7178792
Link To Document