• DocumentCode
    3744855
  • Title

    The DIRHA-ENGLISH corpus and related tasks for distant-speech recognition in domestic environments

  • Author

    Mirco Ravanelli;Luca Cristoforetti;Roberto Gretter;Marco Pellin;Alessandro Sosi;Maurizio Omologo

  • Author_Institution
    Fondazione Bruno Kessler (FBK), 38123 Povo, Trento, Italy
  • fYear
    2015
  • Firstpage
    275
  • Lastpage
    282
  • Abstract
    This paper introduces the contents and the possible usage of the DIRHA-ENGLISH multi-microphone corpus, recently realized under the EC DIRHA project. The reference scenario is a domestic environment equipped with a large number of microphones and microphone arrays distributed in space. The corpus is composed of both real and simulated material, and it includes 12 US and 12 UK English native speakers. Each speaker uttered different sets of phonetically-rich sentences, newspaper articles, conversational speech, keywords, and commands. From this material, a large set of 1-minute sequences was generated, which also includes typical domestic background noise as well as inter/intra-room reverberation effects. Dev and test sets were derived, which represent a very precious material for different studies on multi-microphone speech processing and distant-speech recognition. Various tasks and corresponding Kaldi recipes have already been developed. The paper reports a first set of baseline results obtained using different techniques, including Deep Neural Networks (DNN), aligned with the state-of-the-art at international level.
  • Keywords
    "Speech","Speech recognition","Acoustics","Microphone arrays","Noise measurement","Harmonic analysis"
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding (ASRU), 2015 IEEE Workshop on
  • Type

    conf

  • DOI
    10.1109/ASRU.2015.7404805
  • Filename
    7404805