Title :
End-to-end speech recognition accuracy metric for voice-search tasks
Author :
Levit, Michael ; Chang, Shuangyu ; Buntschuh, Bruce ; Kibre, Nick
Author_Institution :
Speech at Microsoft, Microsoft Corp., Redmond, WA, USA
Abstract :
We introduce a novel metric for speech recognition success in voice search tasks, designed to reflect the impact of speech recognition errors on user´s overall experience with the system. The computation of the metric is seeded using intuitive labels from human subjects and subsequently automated by replacing human annotations with a machine learning algorithm. The results show that search-based recognition accuracy is significantly higher than accuracy based on sentence error rate computation, and that the automated system is very successful in replicating human judgments regarding search quality results.
Keywords :
information retrieval; learning (artificial intelligence); speech recognition; end-to-end speech recognition accuracy metric; human judgment replication; machine learning algorithm; search quality results; search-based recognition accuracy; sentence error rate computation; speech recognition error impact; voice search tasks; Accuracy; Error analysis; Humans; Measurement; Search engines; Speech; Speech recognition; semantic accuracy; voice search;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6289078