مرکز منطقه ای اطلاع رساني علوم و فناوري - Boosted acoustic model learning and hypotheses rescoring on the CHiME-3 task

DocumentCode :

3744874

Title :

Boosted acoustic model learning and hypotheses rescoring on the CHiME-3 task

Author :

Shahab Jalalvand;Daniele Falavigna;Marco Matassoni;Piergiorgio Svaizer;Maurizio Omologo

Author_Institution :

SHINE research unit, Fondazione Bruno Kessler (FBK), 38123 Povo, Trento, Italy

fYear :

2015

Firstpage :

409

Lastpage :

415

Abstract :

Speech recognition in a realistic noisy environment using multiple microphones is the focal point of the third CHiME challenge. Over the baseline ASR system provided for this challenge, we apply state of the art algorithms for boosting acoustic model learning and hypothesis rescoring to improve the final output. To this aim, we first use the automatic transcription of each channel to re-train the acoustic model for that channel and then we apply linear language model rescoring to find a better solution in the n-best list. LM rescoring is performed using an efficient set of N-gram and Recurrent Neural Network LM (RNNLM) trained on a wisely-selected text set. In the experiments, we show that the proposed approach improves not only the individual channel transcription, but also the enhanced channels produced by MVDR and delay-and-sum beamforming.

Keywords :

"Hidden Markov models","Training","Acoustics","Microphones","Decoding","Array signal processing","Speech recognition"

Publisher :

ieee

Conference_Titel :

Automatic Speech Recognition and Understanding (ASRU), 2015 IEEE Workshop on

Type :

conf

DOI :

10.1109/ASRU.2015.7404824

Filename :

7404824

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3744874