• DocumentCode
    3166303
  • Title

    CRF-based confidence measures of recognized candidates for lattice-based audio indexing

  • Author

    Ou, Zhijian ; Luo, Huaqing

  • Author_Institution
    Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    4933
  • Lastpage
    4936
  • Abstract
    The use of forward-backward (FB) computation based posterior probabilities as confidence measures (CMs) for all recognized candidates in a lattice seems to be common across various lattice-based audio indexing systems. However, a major limitation with this approach is that its performance for CMs cannot be improved easily, since it relies almost entirely on a single information source - the acoustic and language-model probabilities. In this paper, we propose to formulate computing CMs in the lattice case as a multi-class sequential labeling problem, using conditional random fields (CRFs) as the underlying model. In this approach, various relevant features including the FB posterior probabilities could be combined together. Note that CRFs are well suited to label sequence data and some features are defined over a word sequence. This paper presents how we resolve these two issues in the lattice case, beyond others´ previous work in CRF-based CMs for the 1-best case. Once properly implemented, the proposed approach achieves significant performance improvements for both CMs in the lattice case and lattice-based audio indexing.
  • Keywords
    indexing; probability; speech recognition; CM; CRF-based confidence measurement; FB computation; acoustic probability; candidate recognition; conditional random field; forward-backward computation; label sequence data; language-model probability; lattice-based audio indexing system; multiclass sequential labeling problem; posterior probability; single information source; word sequence; Computational modeling; Error analysis; Feature extraction; Indexing; Labeling; Lattices; Speech recognition; CRF; Confidence measure; audio indexing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6289026
  • Filename
    6289026