DocumentCode
2727285
Title
Topic indexing of spoken documents based on optimized N-best approach
Author
Zhang, Lei ; Chang, Jingxin ; Xiang, Xuezhi ; Feng, Xiaosen
Author_Institution
Inf. & Commun. Eng. Coll., Harbin Eng. Univ., Harbin, China
Volume
4
fYear
2009
fDate
20-22 Nov. 2009
Firstpage
302
Lastpage
305
Abstract
For topic indexing of spoken documents, the word error rate is hopefully decreased instead of the whole sentence error rate, so the center hypothesis among the N-best results is selected as the final output in speech recognition system. Then all spoken documents can be represented as vectors with high dimensions in vector space model, which can be combined with non-negative matrix factorization or singular value decomposition to map the vector space into semantic space. Experiment results show that optimized N-best approach is more suitable to the topic indexing system than one-best method. Combined with the non-negative matrix factorization, the correct topic indexing can achieve 98.1% in optimized N-best approach, which is 0.9% higher than the one-best approach under the same condition. When the semantic space is decreased to 10, there is about 11.1% difference between these two approaches. Furthermore, compared with singular value decomposition method, non-negative matrix factorization has the advantages of better performance, faster computation speed and less storage space.
Keywords
document handling; indexing; optimisation; singular value decomposition; speech recognition; vectors; non-negative matrix factorization; optimized N-best approach; sentence error rate; singular value decomposition; speech recognition system; spoken documents; topic indexing; vector space model; word error rate; Decoding; Educational institutions; Error analysis; Indexing; Information retrieval; Lattices; Matrix decomposition; Signal processing algorithms; Singular value decomposition; Speech recognition; NMF; SVD; Topic indexing; optimaized N-best;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Computing and Intelligent Systems, 2009. ICIS 2009. IEEE International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-4244-4754-1
Electronic_ISBN
978-1-4244-4738-1
Type
conf
DOI
10.1109/ICICISYS.2009.5357691
Filename
5357691
Link To Document