Title :
Multipass strategies for improving accuracy in a voice search application
Author :
Zhang, Tianhe ; Rose, Richard ; Dahan, Jean
Abstract :
This paper describes a set of techniques for improving the performance of automated voice search services intended for mobile users accessing these services over a range of portable devices. Voice search is implemented as a two stage search procedure where string candidates generated by an automatic speech recognition (ASR) system are re-scored in order to identify the best matching entry from a potentially very large application specific database. The work in this paper deals specifically with user utterances that contain spoken letter sequences corresponding to spelled instances of search terms. Methods are investigated for identifying the most likely database entry associated with the decoded utterance. An experimental study is presented describing the characteristics of actual user utterances obtained from a prototype voice search service. The impact of these methods on word error rate is presented.
Keywords :
mobile computing; speech recognition; very large databases; automated voice search services; automatic speech recognition system; mobile users; multipass strategies; portable devices; spoken letter sequences; two stage search procedure; very large application specific database; Application software; Automatic speech recognition; Data analysis; Databases; Decoding; Displays; Humans; Prototypes; Search engines; Speech recognition; Speech recognition; String matching;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5494949