DocumentCode :
178318
Title :
A phonetic similarity based noisy channel approach to ASR hypothesis re-ranking and error detection
Author :
Hacker, Martin ; Noth, E.
Author_Institution :
Embedded Syst. Initiative (ESI), Univ. of Erlangen-Nuremberg, Erlangen, Germany
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
2317
Lastpage :
2321
Abstract :
We present a new method to augment the correct transcript from automatic speech recognition (ASR) output containing multiple hypotheses. The error-prone ASR process is taken as black box and modeled as a noisy channel on phoneme level. The probabilities of the individual phoneme errors are assigned according to phonetic confusability. We score potential candidate hypotheses by their posterior probability of being the channel input given the competing ASR hypotheses as observed output. The resulting scores provide useful information not included in traditional confidence measures. We investigated the usefulness of the method for rescoring, re-ranking and word error detection. The method alone is not powerful enough to improve the recognition results, but by employing a decision tree classifier it is possible to isolate cases where the method works very well. Our results show that the combination with other knowledge sources and postprocessing techniques can lead to promising improvements.
Keywords :
decision trees; error detection; error statistics; pattern classification; speech processing; speech recognition; word processing; ASR hypothesis re-ranking; automatic speech recognition; decision tree classifier; error prone ASR process; knowledge source; phoneme error probability assignment; phoneme level; phonetic confusability; phonetic similarity based noisy channel approach; post-processing technique; posterior probability; potential candidate hypotheses; rescoring; word error detection; Data models; Google; Lattices; Mathematical model; Noise measurement; Speech; Speech recognition; Automatic speech recognition; confidence; error detection; error modeling; re-ranking;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6854013
Filename :
6854013
Link To Document :
بازگشت