Title :
Template constrained posterior for verifying phone transcriptions
Author :
Wang, Lijuan ; Hu, Tao ; Soong, Frank
Author_Institution :
Microsoft Res. Asia, Beijing
fDate :
March 31 2008-April 4 2008
Abstract :
A new statistical confidence measure, template constrained posterior (TCP), is proposed for verifying phone transcriptions of speech databases. Different from generalized posterior probability (GPP), TCP is computed by considering string hypotheses that bear a focused unit, e.g., phone with partially matched left and right contexts. Parameters used for TCP include context window length, partial matching ratio, KLD threshold for selecting confusable phones, and verification threshold. They are determined by minimizing verification errors in a development set. Evaluated on a test set which contains 52.1% sentence errors and 0.62% phone errors, TCP achieves 92% and 88% error hit rate in rejected sentences, when the corresponding acceptance ratios are set at 90% and 80%, respectively.
Keywords :
probability; speech recognition; speech synthesis; KLD threshold; context window length; generalized posterior probability; partial matching ratio; phone transcription verification; speech databases; speech recognition; statistical confidence measure; string hypotheses; template constrained posterior; text-to-speech synthesis; verification threshold; Acoustic measurements; Asia; Databases; Equations; Information security; Probability; Speech analysis; Speech recognition; Speech synthesis; Testing; TCP; confidence measure; template constrained posterior;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518701