DocumentCode
2950362
Title
A new syllable-lattice based approach for Mandarin spoken document retrieval
Author
Zhang, Lei ; Gao, Yunxia ; Xiang, Xuezhi ; Lu, Dong
Author_Institution
Coll. of Inf. & Commun. Eng., Harbin Eng. Univ., Harbin, China
fYear
2009
fDate
13-15 Nov. 2009
Firstpage
1
Lastpage
4
Abstract
In our Mandarin spoken document retrieval system, the effects of both retrieval source and retrieval model are considered. For the retrieval source, the syllable-lattice is adopted which can ameliorate the effect of speech recognition error on document retrieval. For the retrieval model, the document length prior is combined with Jelinek-Mercer smoothing technique, which is widely applied in text document retrieval model. As far as we know, the combination of syllable lattice and retrieval model based on the document length prior is firstly introduced for spoken document retrieval. Experimental results show that the retrieval performance of lattice-based method outperforms that of 1-best method. Further more, in the retrieval model with the document length priors, lattice-based approach can achieve the best performance, which can improve about 30%.
Keywords
information retrieval; speech recognition; Jelinek-Mercer smoothing technique; Mandarin spoken document retrieval; document length prior; retrieval source; speech recognition error; syllable-lattice based approach; text document retrieval model; Broadcasting; Decoding; Educational institutions; Hidden Markov models; Information retrieval; Lattices; Natural languages; Search engines; Smoothing methods; Speech recognition; spoken document retrieval; syllable-lattice; the documen length priors;
fLanguage
English
Publisher
ieee
Conference_Titel
Wireless Communications & Signal Processing, 2009. WCSP 2009. International Conference on
Conference_Location
Nanjing
Print_ISBN
978-1-4244-4856-2
Electronic_ISBN
978-1-4244-5668-0
Type
conf
DOI
10.1109/WCSP.2009.5371545
Filename
5371545
Link To Document