• DocumentCode
    730675
  • Title

    Weighted training for speech under Lombard Effect for speaker recognition

  • Author

    Saleem, Muhammad Muneeb ; Gang Liu ; Hansen, John H. L.

  • Author_Institution
    Center for Robust Speech Syst. (CRSS), Univ. of Texas at Dallas, Richardson, TX, USA
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    4350
  • Lastpage
    4354
  • Abstract
    The presence of Lombard Effect in speech is proven to have severe effects on the performance of speech systems, especially speaker recognition. Varying kinds of Lombard speech are produced by speakers under influence of varying noise types [1]. This study proposes a high-accuracy classifier using deep neural networks for detecting various kinds of Lombard speech against neutral speech, independent of the noise levels causing the Lombard Effect. Lombard Effect detection accuracies as high as 95.7% are achieved using this novel model. The deep neural network based classification is further exploited by validation based weighted training of robust i-Vector based speaker identification systems. The proposed weighted training achieves a relative EER improvement of 28.4% over an i-Vector baseline system, confirming the effectiveness of deep neural networks in modeling Lombard Effect.
  • Keywords
    acoustic noise; neural nets; speaker recognition; speech intelligibility; Lombard effect; Lombard speech; deep neural networks; neutral speech; noise levels; relative EER improvement; robust i-Vector; speaker identification; speaker recognition; speech; Accuracy; Neural networks; Noise; Speech; Speech recognition; Training; Training data; Lombard Effect; deep neural networks; robust; speaker identification; weighted training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178792
  • Filename
    7178792