DocumentCode :
2727285
Title :
Topic indexing of spoken documents based on optimized N-best approach
Author :
Zhang, Lei ; Chang, Jingxin ; Xiang, Xuezhi ; Feng, Xiaosen
Author_Institution :
Inf. & Commun. Eng. Coll., Harbin Eng. Univ., Harbin, China
Volume :
4
fYear :
2009
fDate :
20-22 Nov. 2009
Firstpage :
302
Lastpage :
305
Abstract :
For topic indexing of spoken documents, the word error rate is hopefully decreased instead of the whole sentence error rate, so the center hypothesis among the N-best results is selected as the final output in speech recognition system. Then all spoken documents can be represented as vectors with high dimensions in vector space model, which can be combined with non-negative matrix factorization or singular value decomposition to map the vector space into semantic space. Experiment results show that optimized N-best approach is more suitable to the topic indexing system than one-best method. Combined with the non-negative matrix factorization, the correct topic indexing can achieve 98.1% in optimized N-best approach, which is 0.9% higher than the one-best approach under the same condition. When the semantic space is decreased to 10, there is about 11.1% difference between these two approaches. Furthermore, compared with singular value decomposition method, non-negative matrix factorization has the advantages of better performance, faster computation speed and less storage space.
Keywords :
document handling; indexing; optimisation; singular value decomposition; speech recognition; vectors; non-negative matrix factorization; optimized N-best approach; sentence error rate; singular value decomposition; speech recognition system; spoken documents; topic indexing; vector space model; word error rate; Decoding; Educational institutions; Error analysis; Indexing; Information retrieval; Lattices; Matrix decomposition; Signal processing algorithms; Singular value decomposition; Speech recognition; NMF; SVD; Topic indexing; optimaized N-best;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Computing and Intelligent Systems, 2009. ICIS 2009. IEEE International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-4754-1
Electronic_ISBN :
978-1-4244-4738-1
Type :
conf
DOI :
10.1109/ICICISYS.2009.5357691
Filename :
5357691
Link To Document :
بازگشت