• DocumentCode
    3428006
  • Title

    Analysis-by-synthesis approach for acoustic model adaptation

  • Author

    Kraljevski, Ivan ; Duckhorn, Frank ; Strecha, Guntram ; Gebremedhin, Yitagessu Birhanu ; Wolff, Marcus ; Hoffmann, Raik

  • Author_Institution
    Dept. of Syst. Theor. & Speech Technol., Tech. Univ. Dresden, Dresden, Germany
  • fYear
    2013
  • fDate
    1-4 July 2013
  • Firstpage
    1611
  • Lastpage
    1616
  • Abstract
    This paper presents an analysis-by-synthesis approach for acoustic model adaptation. Using artificial speech data for speech recognition systems adaptation, has the potential to address the problem of data sparseness, to avoid speech recordings in real conditions and to provide the capability of performing large number of development cycles for Automatic Speech Recognition (ASR) systems in shorter time. The proposed adaptation framework uses unified ASR and synthesis system to produce artificial adaptation speech signals. In order to confirm the usability of the proposed approach, several experiments were performed where the artificial speech data was coded-decoded by different speech and waveform coders and the acoustic model used for synthesis was adapted for each coder. The recognition results show that the proposed method could be used successfully in the process of speech recognition systems performance assessment and improvement, not only for coded speech effects evaluation and adaptation, but also for other environment conditions.
  • Keywords
    signal synthesis; speech coding; speech recognition; speech synthesis; ASR systems; acoustic model adaptation; analysis-by-synthesis approach; artificial adaptation data speech signals; automatic speech recognition system adaptation; coded speech effect evaluation; data sparseness problem; speech coders; speech recordings; unified ASR adaptation framework; waveform coders; Acoustics; Adaptation models; Data models; Hidden Markov models; Speech; Speech coding; Speech recognition; acoustic model adaptation; speech codecs; speech recognition; speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    EUROCON, 2013 IEEE
  • Conference_Location
    Zagreb
  • Print_ISBN
    978-1-4673-2230-0
  • Type

    conf

  • DOI
    10.1109/EUROCON.2013.6625192
  • Filename
    6625192