DocumentCode
598245
Title
Weakly supervised topic grouping of YouTube search results
Author
Liujuan Cao ; Rongrong Ji ; Wei Liu ; Yue Gao ; Ling-Yu Duan ; Chaoguang Men
Author_Institution
Sch. of Comput. Sci., Harbin Eng. Univ., Harbin, China
fYear
2012
fDate
Sept. 30 2012-Oct. 3 2012
Firstpage
2885
Lastpage
2888
Abstract
Recent years have witnessed an explosive growth of user contributed videos on websites like YouTube and Metacafe, which usually provide a query-by-keyword functionality to facilitate the user browsing. For a given query, the returned videos typically contain multiple topics that are mixed up to duplicate the user browsing. Therefore, their diversification and grouping are highly demanded to improve the user experiences. However, the tagging and content qualities of user contributed videos are uncontrolled against their precise grouping. In this paper, we present a weakly supervised topic grouping paradigm to diversify the returned videos of a given keyword query. Our grouping is based on the bag-of-words visual signature quantized over the spatiotemporal STIP descriptor [1] extracted from each returned video. First, we adopt a min-Hashing based visual similarity in combination of the tagging similarity to group the returned videos. Based on the initial grouping configurations, we mine the co-occurred discriminative sub-signatures, based on which we iteratively refine the first step. Such iteration well handles the noise in visual content and tagging, since neither of which is fully trusted during the grouping. We validate our schemes on over 2,000 video clips crawled from a set of YouTube keyword query results. Comparing to alternative approaches, our scheme has shown superior robustness and precision.
Keywords
social networking (online); spatiotemporal phenomena; text analysis; video retrieval; Metacafe; Websites; YouTube keyword query; YouTube search results; bag-of-words visual signature; content qualities; keyword query; min-Hashing based visual similarity; query-by-keyword functionality; returned video; spatiotemporal STIP descriptor; user browsing; user contributed video tagging qualities; user contributed videos; user experiences; visual content; visual tagging; weakly supervised topic grouping; Educational institutions; Robustness; Tagging; Videos; Visualization; Vocabulary; YouTube; Hashing; Pattern Mining; Search Result Diversification; Social Media; Video Retrieval;
fLanguage
English
Publisher
ieee
Conference_Titel
Image Processing (ICIP), 2012 19th IEEE International Conference on
Conference_Location
Orlando, FL
ISSN
1522-4880
Print_ISBN
978-1-4673-2534-9
Electronic_ISBN
1522-4880
Type
conf
DOI
10.1109/ICIP.2012.6467502
Filename
6467502
Link To Document