DocumentCode :
3432434
Title :
Improved recognition of contact names in voice commands
Author :
Aleksic, Petar ; Allauzen, Cyril ; Elson, David ; Kracun, Aleksandar ; Casado, Diego Melendo ; Moreno, Pedro J.
Author_Institution :
Google Inc., Mountain View, CA, USA
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
5172
Lastpage :
5175
Abstract :
The recognition of contact names in mobile-device voice commands is a challenging problem. The difficulties include potentially infinite vocabularies, low probability of contact tokens in the language model (LM), increased false triggering of contact voice commands when none are spoken, and very large and noisy contact name lists. In this paper we suggest solutions for each of these difficulties. We address the low prior probability and out-of-vocabulary contact name problems by using class-based language models and by creating, on the fly, small user-dependent language models containing only relevant names. These models are compiled dynamically based on analysis of the mobile device state. Since these solutions can increase biasing towards contact names during recognition, it is crucial to monitor false triggering. To properly balance this bias we introduce the concept of a contacts insertion reward. This reward is tuned using both positive and negative test sets. We show significant recognition performance improvements on data sets in three languages, without negatively impacting the overall system performance. The improvements are obtained in both offline evaluations and live-traffic experiments.
Keywords :
speech recognition; challenging problem; class-based language models; contact name recognition; contact token probability; contact voice commands; false triggering; infinite vocabularies; language model; live traffic experiments; mobile-device voice commands; noisy contact name lists; out-of-vocabulary contact name problems; recognition performance; system performance; voice commands; Accuracy; Context; Context modeling; Decoding; Speech recognition; Training; Transducers; FSTs; contact names; speech recognition; voice commands;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7178957
Filename :
7178957