DocumentCode :
3425395
Title :
Template constrained posterior for verifying phone transcriptions
Author :
Wang, Lijuan ; Hu, Tao ; Soong, Frank
Author_Institution :
Microsoft Res. Asia, Beijing
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
4681
Lastpage :
4684
Abstract :
A new statistical confidence measure, template constrained posterior (TCP), is proposed for verifying phone transcriptions of speech databases. Unlike generalized posterior probability (GPP), TCP is computed by considering string hypotheses that bear a focused unit, e.g., a phone, with partially matched left and right contexts. The parameters used for TCP include the context window length, the partial matching ratio, the Kullback-Leibler divergence (KLD) threshold for selecting confusable phones, and the verification threshold. They are determined by minimizing verification errors on a development set. Evaluated on a test set that contains 52.1% sentence errors and 0.62% phone errors, TCP achieves error hit rates of 92% and 88% in rejected sentences when the corresponding acceptance ratios are set at 90% and 80%, respectively.
Keywords :
probability; speech recognition; speech synthesis; KLD threshold; context window length; generalized posterior probability; partial matching ratio; phone transcription verification; speech databases; statistical confidence measure; string hypotheses; template constrained posterior; text-to-speech synthesis; verification threshold; Acoustic measurements; Asia; Databases; Equations; Information security; Probability; Speech analysis; Speech recognition; Speech synthesis; Testing; TCP; confidence measure
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
978-1-4244-1484-0
Type :
conf
DOI :
10.1109/ICASSP.2008.4518701
Filename :
4518701