Title :
Adaptation of Morph-Based Speech Recognition for Foreign Names and Acronyms
Author :
Mansikkaniemi, Andre ; Kurimo, Mikko
Author_Institution :
Dept. of Signal Process. & Acoust., Aalto Univ., Aalto, Finland
Abstract :
In this paper, we improve morph-based speech recognition system by focusing adaptation efforts on acronyms (ACRs) and foreign proper names (FPNs). An unsupervised language model (LM) adaptation framework based on two-pass decoding is used. Vocabulary adaptation is applied alongside unsupervised LM adaptation. The aim is to improve both language and pronunciation modeling for FPNs and ACRs. A smart selection algorithm is used to find the most likely topically related foreign words and acronyms from in-domain text. New pronunciation rules are generated for the selected words. Different kinds of morpheme adaptation operations are also evaluated on the ACR and FPN candidate words, to ensure optimal results are gained from pronunciation adaptation. Statistically significant improvements in average word error rate (WER), and term error rate (TER), are achieved using a combination of unsupervised LM adaptation with vocabulary adaptation focused on ACRs and FPNs.
Keywords :
decoding; error statistics; speech recognition; vocabulary; ACR candidate words; FPN candidate words; LM adaptation framework; TER; WER; acronyms; foreign proper names; foreign words; in-domain text; morph-based speech recognition adaptation; pronunciation rules; smart selection algorithm; term error rate; two-pass decoding; unsupervised language model; vocabulary adaptation; word error rate; Adaptation models; Speech; Speech processing; Speech recognition; Terminology; Training; Vocabulary; Foreign word detection; morph-based speech recognition; out-of-vocabulary (OOV) recognition; unsupervised language model (LM) adaptation;
Journal_Title :
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
DOI :
10.1109/TASLP.2015.2414818