DocumentCode
3166303
Title
CRF-based confidence measures of recognized candidates for lattice-based audio indexing
Author
Ou, Zhijian ; Luo, Huaqing
Author_Institution
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
fYear
2012
fDate
25-30 March 2012
Firstpage
4933
Lastpage
4936
Abstract
The use of forward-backward (FB) computation based posterior probabilities as confidence measures (CMs) for all recognized candidates in a lattice seems to be common across various lattice-based audio indexing systems. However, a major limitation with this approach is that its performance for CMs cannot be improved easily, since it relies almost entirely on a single information source - the acoustic and language-model probabilities. In this paper, we propose to formulate computing CMs in the lattice case as a multi-class sequential labeling problem, using conditional random fields (CRFs) as the underlying model. In this approach, various relevant features including the FB posterior probabilities could be combined together. Note that CRFs are well suited to label sequence data and some features are defined over a word sequence. This paper presents how we resolve these two issues in the lattice case, beyond others´ previous work in CRF-based CMs for the 1-best case. Once properly implemented, the proposed approach achieves significant performance improvements for both CMs in the lattice case and lattice-based audio indexing.
Keywords
indexing; probability; speech recognition; CM; CRF-based confidence measurement; FB computation; acoustic probability; candidate recognition; conditional random field; forward-backward computation; label sequence data; language-model probability; lattice-based audio indexing system; multiclass sequential labeling problem; posterior probability; single information source; word sequence; Computational modeling; Error analysis; Feature extraction; Indexing; Labeling; Lattices; Speech recognition; CRF; Confidence measure; audio indexing;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location
Kyoto
ISSN
1520-6149
Print_ISBN
978-1-4673-0045-2
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2012.6289026
Filename
6289026
Link To Document