Visual-textual joint relevance learning for tag-based social image search

Yue Gao; Meng Wang; Zheng-Jun Zha; Jialie Shen; Xuelong Li; Xindong Wu

doi:10.1109/TIP.2012.2202676

Visual-textual joint relevance learning for tag-based social image search

IEEE Trans Image Process. 2013 Jan;22(1):363-76. doi: 10.1109/TIP.2012.2202676. Epub 2012 Jun 5.

Authors

Yue Gao¹, Meng Wang, Zheng-Jun Zha, Jialie Shen, Xuelong Li, Xindong Wu

Affiliation

¹ Department of Automation, Tsinghua University, Beijing 100084, China.

PMID: 22692911
DOI: 10.1109/TIP.2012.2202676

Abstract

Due to the popularity of social media websites, extensive research efforts have been dedicated to tag-based social image search. Both visual information and tags have been investigated in the research field. However, most existing methods use tags and visual characteristics either separately or sequentially in order to estimate the relevance of images. In this paper, we propose an approach that simultaneously utilizes both visual and textual information to estimate the relevance of user tagged images. The relevance estimation is determined with a hypergraph learning approach. In this method, a social image hypergraph is constructed, where vertices represent images and hyperedges represent visual or textual terms. Learning is achieved with use of a set of pseudo-positive images, where the weights of hyperedges are updated throughout the learning process. In this way, the impact of different tags and visual words can be automatically modulated. Comparative results of the experiments conducted on a dataset including 370+images are presented, which demonstrate the effectiveness of the proposed approach.

Publication types

Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Algorithms*
Animals
Databases, Factual
Humans
Image Processing, Computer-Assisted / methods*
Photography / methods*
Social Media*