• DocumentCode
    394314
  • Title

    A general algorithm for word graph matrix decomposition

  • Author

    Hakkani-Tür, Dilek ; Riccardi, Giuseppe

  • Author_Institution
    AT&T Labs.-Res., USA
  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    In automatic speech recognition, word graphs (lattices) are commonly used as an approximate representation of the complete word search space. Usually these word lattices are acyclic and have no a-priori structure. More recently a new class of normalized word lattices have been proposed. These word lattices (a.k.a. sausages) are very efficient (space) and they provide a normalization (chunking) of the lattice, by aligning words from all possible hypotheses. We propose a general framework for lattice chunking, the pivot algorithm. There are four important components of the pivot algorithm. First, the time information is not necessary but is beneficial for the overall performance. Second, the algorithm allows the definition of a predefined chunk structure of the final word lattice. Third, the algorithm operates on both weighted and unweighted lattices. Fourth, the labels on the graph are generic, and could be words as well as part of speech tags or parse tags. While the algorithm has applications to many tasks (e.g. parsing, named entity extraction) we present results on the performance of confidence scores for different large vocabulary speech recognition tasks. We compare the results of our algorithms against off-the-shelf methods and show significant improvements.
  • Keywords
    feature extraction; matrix decomposition; speech recognition; approximate representation; automatic speech recognition; chunk structure; general algorithm; large vocabulary speech recognition; lattice chunking; named entity extraction; normalized word lattices; off-the-shelf methods; parse tags; parsing; sausages; speech tags; unweighted lattices; weighted lattices; word graph matrix decomposition; word graphs; word search space; Automatic speech recognition; Costs; Lattices; Matrix decomposition; Speech recognition; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1198851
  • Filename
    1198851