DocumentCode :
2788522
Title :
Multipass strategies for improving accuracy in a voice search application
Author :
Zhang, Tianhe ; Rose, Richard ; Dahan, Jean
fYear :
2010
fDate :
14-19 March 2010
Firstpage :
5354
Lastpage :
5357
Abstract :
This paper describes a set of techniques for improving the performance of automated voice search services intended for mobile users accessing these services over a range of portable devices. Voice search is implemented as a two-stage search procedure where string candidates generated by an automatic speech recognition (ASR) system are re-scored in order to identify the best-matching entry from a potentially very large application-specific database. The work in this paper deals specifically with user utterances that contain spoken letter sequences corresponding to spelled instances of search terms. Methods are investigated for identifying the most likely database entry associated with the decoded utterance. An experimental study is presented describing the characteristics of actual user utterances obtained from a prototype voice search service. The impact of these methods on word error rate is presented.
Keywords :
mobile computing; speech recognition; very large databases; automated voice search services; automatic speech recognition system; mobile users; multipass strategies; portable devices; spoken letter sequences; two stage search procedure; very large application specific database; Application software; Automatic speech recognition; Data analysis; Databases; Decoding; Displays; Humans; Prototypes; Search engines; Speech recognition; String matching;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
ISSN :
1520-6149
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2010.5494949
Filename :
5494949