DocumentCode :
3167423
Title :
End-to-end speech recognition accuracy metric for voice-search tasks
Author :
Levit, Michael ; Chang, Shuangyu ; Buntschuh, Bruce ; Kibre, Nick
Author_Institution :
Speech at Microsoft, Microsoft Corp., Redmond, WA, USA
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
5141
Lastpage :
5144
Abstract :
We introduce a novel metric for speech recognition success in voice search tasks, designed to reflect the impact of speech recognition errors on user´s overall experience with the system. The computation of the metric is seeded using intuitive labels from human subjects and subsequently automated by replacing human annotations with a machine learning algorithm. The results show that search-based recognition accuracy is significantly higher than accuracy based on sentence error rate computation, and that the automated system is very successful in replicating human judgments regarding search quality results.
Keywords :
information retrieval; learning (artificial intelligence); speech recognition; end-to-end speech recognition accuracy metric; human judgment replication; machine learning algorithm; search quality results; search-based recognition accuracy; sentence error rate computation; speech recognition error impact; voice search tasks; Accuracy; Error analysis; Humans; Measurement; Search engines; Speech; Speech recognition; semantic accuracy; voice search;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6289078
Filename :
6289078
Link To Document :
بازگشت