Title :
Discriminative bag-of-visual phrase learning for landmark recognition
Author :
Chen, Tao ; Yap, Kim-Hui ; Zhang, Dajiang
Author_Institution :
Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
Abstract :
Bag-of-visual phrase (BoP) has been proposed and developed for landmark recognition recently. However, existing BoP methods for landmark recognition have two major shortcomings: (i) they try to construct a universal phrase vocabulary for all object categories, which lacks specific descriptive capabilities for a particular category, and (ii) they often adopt simple criterion such as the frequency information to mine the visual phrases, which may cause the selected phrases to be less discriminative or representative for recognition. In view of this, this paper proposes a new discriminative BoP approach for landmark recognition. First, the candidate visual phrases defined as adjacent pairwise words are selected for each category. A phrase-level similarity measure at the latent space is proposed to evaluate the semantic similarity between pairwise phrases. This is then integrated with the phrase frequency information to shortlist the discriminative phrases for each category through a proposed phrase ranking algorithm. Finally, the BoP and bag-of-words (BoW) histograms are combined through a pyramid matching method for recognition. Experimental results on two different datasets demonstrate that the proposed method is effective in landmark recognition.
Keywords :
image recognition; learning (artificial intelligence); natural language processing; adjacent pairwise words; bag-of-words histograms; descriptive capabilities; discriminative BoP approach; discriminative bag-of-visual phrase learning; landmark recognition; latent space; object categories; phrase frequency information; phrase-level similarity measure; pyramid matching; semantic similarity; simple criterion; universal phrase vocabulary; Computer vision; Conferences; Frequency measurement; Histograms; Semantics; Visualization; Vocabulary; BoP; BoW; discriminative visual phrases; landmark recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288028