• DocumentCode
    3428246
  • Title

    Approximateword-lattice indexing with text indexers: Time-Anchored Lattice Expansion

  • Author

    Yu, Peng ; Shi, Yu ; Seide, Frank

  • Author_Institution
    Microsoft Res. Asia, Beijing
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    5248
  • Lastpage
    5251
  • Abstract
    We address the problem of how to represent or approximate speech lattices to be indexed with existing text indexers. We present a method named time-anchored lattice expansion (TALE), which can be implemented by a standard text indexer (STI). On a 170- hour lecture set, we compare TALE with other lattice indexing methods: confusion networks, position-specific posterior lattices (PSPL), and time-based merging for index (TMI). All methods achieve accuracies comparable to searching raw lattices when the corresponding index structures and phrase matching algorithms are used. However, when implemented with an STI, TALE significantly outperforms all other methods. Compared to indexing linear text, TALE improves accuracy by 30-60% for multi-word phrase searches and by 130% for two-term AND queries.
  • Keywords
    indexing; speech processing; approximate word-lattice indexing; position-specific posterior lattices; speech lattices; standard text indexer; time-anchored lattice expansion; time-based merging for index; Asia; Audio compression; Call conference; Indexing; Internet; Lattices; Merging; Search engines; Speech; Videos; Standard Text Indexer; keyword spotting; lattice indexing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518843
  • Filename
    4518843