DocumentCode
3428246
Title
Approximate word-lattice indexing with text indexers: Time-Anchored Lattice Expansion
Author
Yu, Peng ; Shi, Yu ; Seide, Frank
Author_Institution
Microsoft Res. Asia, Beijing
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
5248
Lastpage
5251
Abstract
We address the problem of how to represent or approximate speech lattices to be indexed with existing text indexers. We present a method named time-anchored lattice expansion (TALE), which can be implemented by a standard text indexer (STI). On a 170-hour lecture set, we compare TALE with other lattice indexing methods: confusion networks, position-specific posterior lattices (PSPL), and time-based merging for index (TMI). All methods achieve accuracies comparable to searching raw lattices when the corresponding index structures and phrase matching algorithms are used. However, when implemented with an STI, TALE significantly outperforms all other methods. Compared to indexing linear text, TALE improves accuracy by 30-60% for multi-word phrase searches and by 130% for two-term AND queries.
Keywords
indexing; speech processing; approximate word-lattice indexing; position-specific posterior lattices; speech lattices; standard text indexer; time-anchored lattice expansion; time-based merging for index; Asia; Audio compression; Call conference; Indexing; Internet; Lattices; Merging; Search engines; Speech; Videos; Standard Text Indexer; keyword spotting; lattice indexing;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location
Las Vegas, NV
ISSN
1520-6149
Print_ISBN
978-1-4244-1483-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2008.4518843
Filename
4518843