• DocumentCode
    164844
  • Title

    Word boundary agreementto combine multi-microphone hypotheses in distant speech recognition

  • Author

    Guerrero, Cristina ; Omologo, Maurizio

  • Author_Institution
    Fondazione Bruno Kessler-Irst, Trento, Italy
  • fYear
    2014
  • fDate
    12-14 May 2014
  • Firstpage
    187
  • Lastpage
    191
  • Abstract
    In this paper we propose a technique for combining hypotheses generated in a multi-microphone setting, which exploits complementarity and collective agreement among ASR outputs of different channels. The technique draws upon the information encoded in the available set of word lattices. As a first step, we identify word boundaries in which a comprehensive inter-channel agreement is found; then, these boundaries are used to reduce the global hypothesis search space. Global word posterior probabilities are estimated for the candidate words associated to each of the bounded segments. As a result, a single combined confusion network is generated from the multiple lattices. This approach offers a novel perspective to state of the art solutions based on confusion network combination. Promising results were obtained from an experimental evaluation in a simulated domestic environment equipped with a distributed microphone network. The development and test sets were simulated using real impulse responses estimated for a large set of microphone-speaker position pairs.
  • Keywords
    microphones; probability; speech recognition; ASR outputs; automatic speech recognition; bounded segments; candidate words; combined confusion network; distant speech recognition; distributed microphone network; global hypothesis search space; global word posterior probability estimation; information encoding; interchannel agreement; microphone-speaker position pairs; multimicrophone setting; simulated domestic environment; word boundary agreement; word lattices; Computer numerical control; Conferences; Lattices; Microphones; Speech; Speech recognition; Distant speech recognition; confusion networks; hypothesis combination; multi-microphone;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Hands-free Speech Communication and Microphone Arrays (HSCMA), 2014 4th Joint Workshop on
  • Conference_Location
    Villers-les-Nancy
  • Type

    conf

  • DOI
    10.1109/HSCMA.2014.6843277
  • Filename
    6843277