DocumentCode :
3466220
Title :
LDA-Based Retrieval Framework for Semantic News Video Retrieval
Author :
Cao, Juan ; Li, Jintao ; Zhang, Yongdong ; Tang, Sheng
Author_Institution :
Chinese Acad. of Sci., Beijing
fYear :
2007
fDate :
17-19 Sept. 2007
Firstpage :
155
Lastpage :
160
Abstract :
Topic-based language model has attracted much attention as the propounding of semantic retrieval in recent years. Especially for the ASR text with errors, the topic representation is more reasonable than the exact term representation. Among these models, Latent Dirichlet Allocation(LDA) has been noted for its ability to discover the latent topic structure, and is broadly applied in many text-related tasks. But up to now its application in information retrieval(IR) is still limited to be a supplement to the standard document models, and furthermore, it has been pointed out that directly employing the basic LDA model will hurt retrieval performance. In this paper, we propose a lexicon-guided two-level LDA retrieval framework. It uses the HowNet to guide the first-level LDA model´s parameter estimation, and further construct the second-level LDA models based on the first-level´s inference results. We use TRECID 2005 ASR collection to evaluate it, and compare it with the vector space model(VSM) and latent semantic Indexing(LSI). Our experiments show the proposed method is very competitive.
Keywords :
video retrieval; HowNet; LDA-based retrieval framework; Latent Dirichlet Allocation; information retrieval; semantic news video retrieval; text-related tasks; topic-based language model; vector space model; Automatic speech recognition; Computers; Information processing; Information retrieval; Laboratories; Large scale integration; Linear discriminant analysis; Natural languages; Space technology; Videoconference; ASR text; LDA; Semantic video retrieval; Topic-based model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Semantic Computing, 2007. ICSC 2007. International Conference on
Conference_Location :
Irvine, CA
Print_ISBN :
978-0-7695-2997-4
Type :
conf
DOI :
10.1109/ICSC.2007.26
Filename :
4338344
Link To Document :
بازگشت