DocumentCode :
394314
Title :
A general algorithm for word graph matrix decomposition
Author :
Hakkani-Tür, Dilek ; Riccardi, Giuseppe
Author_Institution :
AT&T Labs.-Res., USA
Volume :
1
fYear :
2003
fDate :
6-10 April 2003
Abstract :
In automatic speech recognition, word graphs (lattices) are commonly used as an approximate representation of the complete word search space. Usually these word lattices are acyclic and have no a-priori structure. More recently a new class of normalized word lattices have been proposed. These word lattices (a.k.a. sausages) are very efficient (space) and they provide a normalization (chunking) of the lattice, by aligning words from all possible hypotheses. We propose a general framework for lattice chunking, the pivot algorithm. There are four important components of the pivot algorithm. First, the time information is not necessary but is beneficial for the overall performance. Second, the algorithm allows the definition of a predefined chunk structure of the final word lattice. Third, the algorithm operates on both weighted and unweighted lattices. Fourth, the labels on the graph are generic, and could be words as well as part of speech tags or parse tags. While the algorithm has applications to many tasks (e.g. parsing, named entity extraction) we present results on the performance of confidence scores for different large vocabulary speech recognition tasks. We compare the results of our algorithms against off-the-shelf methods and show significant improvements.
Keywords :
feature extraction; matrix decomposition; speech recognition; approximate representation; automatic speech recognition; chunk structure; general algorithm; large vocabulary speech recognition; lattice chunking; named entity extraction; normalized word lattices; off-the-shelf methods; parse tags; parsing; sausages; speech tags; unweighted lattices; weighted lattices; word graph matrix decomposition; word graphs; word search space; Automatic speech recognition; Costs; Lattices; Matrix decomposition; Speech recognition; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1198851
Filename :
1198851
Link To Document :
بازگشت