Title :
Experiments on cross-system acoustic model adaptation
Author :
Giuliani, Diego ; Brugnara, Fabio
Author_Institution :
Centro per Ricerca Sci. e Tecnologica, Trento
Abstract :
Most state-of-the-art automatic transcription systems generate word transcriptions of the incoming audio data through two or more decoding passes interleaved by adaptation of acoustic models. It was proved that better results are obtained when the adaptation procedure exploits a supervision generated by a system different than the one under adaptation. In this paper, cross-system adaptation is investigated by using supervisions generated by several systems built varying the phoneme set and the acoustic front-end. Furthermore, an adaptation procedure is presented that makes use of multiple supervisions of the audio data for adapting the acoustic models within the MLLR framework. The gain achieved with cross-system adaptation and by adapting the acoustic models exploiting multiple, intra-site and cross-site, supervisions is demonstrated on the English European parliamentary speeches task.
Keywords :
speech processing; speech recognition; English European parliamentary speeches task; adaptation procedure; cross-site supervisions; cross-system acoustic model adaptation; cross-system adaptation; intra-site supervision; multiple supervisions; state-of-the-art automatic transcription systems; word transcriptions; Adaptation model; Automatic speech recognition; Character recognition; Counting circuits; Decision trees; Error analysis; Maximum likelihood decoding; Maximum likelihood linear regression; Speech analysis; Voting; ASR system combination; automatic speech recognition; cross-system acoustic model adaptation;
Conference_Titel :
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-1746-9
Electronic_ISBN :
978-1-4244-1746-9
DOI :
10.1109/ASRU.2007.4430094