• DocumentCode
    164841
  • Title

    Neural networks for distant speech recognition

  • Author

    Renals, Steve ; Swietojanski, Pawel

  • Author_Institution
    Centre for Speech Technol. Res., Univ. of Edinburgh, Edinburgh, UK
  • fYear
    2014
  • fDate
    12-14 May 2014
  • Firstpage
    172
  • Lastpage
    176
  • Abstract
    Distant conversational speech recognition is challenging owing to the presence of multiple, overlapping talkers, additional non-speech acoustic sources, and the effects of reverberation. In this paper we review work on distant speech recognition, with an emphasis on approaches which combine multichannel signal processing with acoustic modelling, and investigate the use of hybrid neural network / hidden Markov model acoustic models for distant speech recognition of meetings recorded using microphone arrays. In particular we investigate the use of convolutional and fully-connected neural networks with different activation functions (sigmoid, rectified linear, and maxout). We performed experiments on the AMI and ICSI meeting corpora, with results indicating that neural network models are capable of significant improvements in accuracy compared with discriminatively trained Gaussian mixture models.
  • Keywords
    Gaussian processes; hidden Markov models; microphone arrays; mixture models; neural nets; reverberation; speech recognition; Gaussian mixture models; acoustic modelling; activation functions; distant speech recognition; hybrid neural network - hidden Markov model acoustic models; maxout; microphone arrays; multichannel signal processing; nonspeech acoustic sources; overlapping talkers; rectified linear; reverberation effects; sigmoid; Acoustics; Hidden Markov models; Microphone arrays; Neural networks; Speech; Speech recognition; AMI corpus; ICSI corpus; beam-forming; convolutional neural networks; distant speech recognition; maxout networks; meetings; rectifier unit;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Hands-free Speech Communication and Microphone Arrays (HSCMA), 2014 4th Joint Workshop on
  • Conference_Location
    Villers-les-Nancy
  • Type

    conf

  • DOI
    10.1109/HSCMA.2014.6843274
  • Filename
    6843274