skip to main content
10.1145/2072298.2072054acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
short-paper

Tag-based social image search with visual-text joint hypergraph learning

Published: 28 November 2011 Publication History

Abstract

Tag-based social image search has attracted great interest and how to order the search results based on relevance level is a research problem. Visual content of images and tags have both been investigated. However, existing methods usually employ tags and visual content separately or sequentially to learn the image relevance. This paper proposes a tag-based image search with visual-text joint hypergraph learning. We simultaneously investigate the bag-of-words and bag-of-visual-words representations of images and accomplish the relevance estimation with a hypergraph learning approach. Each textual or visual word generates a hyperedge in the constructed hypergraph. We conduct experiments with a real-world data set and experimental results demonstrate the effectiveness of our approach.

References

[1]
J. Bu, S. Tan, C. Chen, C. Wang, H. Wu, L. Zhang, and X. He. Music recommendation by unified hypergraph: combining social media information and music content. In Proceedings of the ACM International Conference on Multimedia, 2010.
[2]
L. Chen, D. Xu, W. Tsang, and J. Luo. Tag-based web photo retrieval improved by batch mode re-tagging. In Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, 2010.
[3]
T. Chen, M.-M. Cheng, P. Tan, A. Shamir, and S.-M. Hu. Sketch2photo: Internet image montage. ACM Trans. Graph, 28, 2009.
[4]
M.-M. Cheng, G.-X. Zhang, N. J. Mitra, X. Huang, and S.-M. Hu. Global contrast based salient region detection. In Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pages 409 416, 2011.
[5]
Y. Gao, M. Wang, Z. Zha, Q. Tian, Q. Dai, and N. Zhang. Less is more: Efficient 3d object retrieval with query view selection. IEEE Transactions on Multimedia, 11, 2011.
[6]
B. Geng, L. Yang, C. Xu, and X.-S. Hua. Ranking model adaptation for domain-specific search. IEEE Transactions on Knowledge and Data Engineering, 2010.
[7]
Y. Huang, Q. Liu, S. Zhang, and D. Metaxas. Video object segmentation by hypergraph cut. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pages 1 6, Miami, USA, 2009.
[8]
Y. Huang, Q. Liu, S. Zhang, and D. Metaxas. Image retrieval via probabilistic hypergraph ranking. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pages 1 6, 2010.
[9]
X. Li, C. G. Snoek, and M. Worring. Unsupervised multi-feature tag relevance learning for social image retrieval. In Proceedings of the ACM International Conference on Image and Video Retrieval, 2010.
[10]
D. Liu, X.-S. Hua, L. Yang, M. Wang, and H.-J. Zhang. Tag ranking. In Proceedings of the International Conference on World Wide Web, 2009.
[11]
D. Liu, M. Wang, X.-S. Hua, and H.-J. Zhang. Semi-automatic tagging of photo albums via exemplar selection and tag inference. IEEE Transaction on Multimedia, 13:82 91, 2011.
[12]
D. Liu, M. Wang, L. Yang, X.-S. Hua, and H. Zhang. Tag quality improvement for social images. In Proceeding of IEEE International Conference on Multimedia, pages 350 353, 2009.
[13]
D. Liu, S. Yan, X.-S. Hua, and H.-J. Zhang. Image tagging via collaborative tag propagation. IEEE Transaction on Multimedia, 13:702 712, 2011.
[14]
J. Shen, J. Shepherd, B. Cui, and K.-L. Tan. A novel framework for efficient automated singer identification in large music databases. ACM Transactions on Information Systems, 27, 2009.
[15]
J. Shen, D. Tao, and X. Li. Modality mixture projections for semantic video event detection. IEEE Transactions on Circuits and Systems for Video Technology, 18:1587 1596, 2008.
[16]
A. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain. Content-based image retrieval at the end of early years. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22:1349 1380, 2000.
[17]
M. Wang and X.-S. Hua. Active learning in multimedia annotation and retrieval: A survey. ACM Transactions on Intelligent Systems and Technology, 2:10 31, 2011.
[18]
M. Wang, X.-S. Hua, J. Tang, and R. Hong. Beyond distance measurement: Constructing neighborhood similarity for video annotation. IEEE Transactions on Multimedia, 11:465 476, 2009.
[19]
M. Wang, K. Yang, X.-S. Hua, and H.-J. Zhang. Towards relevant and diverse search of social images. IEEE Transactions on Multimedia, 12:829 842, 2010.
[20]
Y. Yang, D. Xu, F. Nie, S. Yan, and Y. Zhuang. Image clustering using local discriminant models and global integration. IEEE Transactions on Image Processing, 10:2761 2773, 2010.
[21]
Y. Yang, Y. Zhuang, F. Wu, and Y. Pan. Harmonizing hierarchical manifolds for multimedia document semantics understanding and cross-media retrieval. IEEE Transactions on Multimedia, 10:437 446, 2008.
[22]
R. Zass and A. Shashua. Probabilistic graph and hypergraph matching. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Alaska, USA, 2008.
[23]
D. Zhou, J. Huang, and B. Schokopf. Learning with hypergraphs: Clustering, classification, and embedding. In Proceedings of Advances in Neural Information Processing Systems 19, pages 1601 1608, 2007.
[24]
G. Zhu, S. Yan, and Y. Ma. Image tag refinement towards low-rank, content-tag prior and error sparsity. In Proceedings of the ACM International Conference on Multimedia, 2010.

Cited By

View all

Index Terms

  1. Tag-based social image search with visual-text joint hypergraph learning

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      MM '11: Proceedings of the 19th ACM international conference on Multimedia
      November 2011
      944 pages
      ISBN:9781450306164
      DOI:10.1145/2072298
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 28 November 2011

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. hypergraph learning
      2. tag-based image search
      3. visual-text

      Qualifiers

      • Short-paper

      Conference

      MM '11
      Sponsor:
      MM '11: ACM Multimedia Conference
      November 28 - December 1, 2011
      Arizona, Scottsdale, USA

      Acceptance Rates

      Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)15
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 09 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media