skip to main content
10.1145/2671188.2749319acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
short-paper

Information Gain Study for Visual Vocabulary Construction

Published: 22 June 2015 Publication History

Abstract

Content Based Image Retrieval (CBIR) systems retrieve the most similar images to a query image in a collection. One of the most popular models and widely applied in this task is the Bag of Visual Words model (BoVW). In this paper, we introduce an evaluation study of different information gain models used for the construction of a visual word vocabulary. In the proposed framework, the information gain is used as discriminative information to index image features and select the ones that have the highest values of information gain. The empirical experiments made for this study evaluate the effect of four different information gain models: tf-idf, entropy, bm25, tfc with respect to different descriptors and image databases. The results show that selecting the image features based on at least one of the studied information gain model allows the retrieval process to be more accurate than the classical Bag of Visual Words model.

References

[1]
H. Bay, T. Tuytelaars, and L. Gool. Surf: Speeded up robust features. In A. Leonardis, H. Bischof, and A. Pinz, editors, Computer - Vision - ECCV - 2006, volume 3951 of Lecture Notes in Computer Science, pages 404--417. Springer Berlin Heidelberg, 2006.
[2]
G. Csurka, C. Bray, C. Dance, and L. Fan. Visual categorization with bags of keypoints. Workshop on Statistical Learning in Computer Vision, ECCV, pages 1--22, 2004.
[3]
M. Everingham, L. Van Gool, C. K. I. Williams, J. Winn, and A. Zisserman. The PASCAL Visual Object Classes Challenge 2012 (VOC2012) Results. http://www.pascal-network.org/challenges/-VOC/voc2012/workshop/index.html.
[4]
M. Hancock-Beaulieu, M. Gatford, X. Huang, S. E. Robertson, S. Walker, and P. W. Williams. Okapi at TREC-5. In E. M. Voorhees and D. K. Harman, editors, Proceedings of The Fifth Text REtrieval Conference, TREC 1996, Gaithersburg, Maryland, USA, November 20-22, 1996, volume Special Publication 500--238. National Institute of Standards and Technology (NIST), 1996.
[5]
Z. Harris. Distributional structure. Word, 10(23):146--162, 1954.
[6]
H. Jégou, M. Douze, and C. Schmid. Hamming embedding and weak geometric consistency for large scale image search. In A. Z. David Forsyth, Philip Torr, editor, European Conference on Computer Vision, volume I of LNCS, pages 304--317. Springer, oct 2008.
[7]
H. Jégou, M. Douze, C. Schmid, and P. Pérez. Aggregating local descriptors into a compact image representation. In 23rd IEEE Conference on Computer Vision & Pattern Recognition (CVPR '10), pages 3304--3311, San Francisco, United States, 2010. IEEE Computer Society.
[8]
F. R. López, H. Jiménez-Salazar, and D. Pinto. A competitive term selection method for information retrieval. In A. F. Gelbukh, editor, CICLing, volume 4394 of Lecture Notes in Computer Science, pages 468--475. Springer, 2007.
[9]
D. G. Lowe. Object recognition from local scale-invariant features. International Conference on Computer Vision, 2:1150--1157, 1999.
[10]
F. Mindru, T. Tuytelaars, L. J. V. Gool, and T. Moons. Moment invariants for recognition under changing viewpoint and illumination. Computer Vision and Image Understanding, 94(1-3):3--27, 2004.
[11]
D. Nistér and H. Stewénius. Scalable recognition with a vocabulary tree. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), volume 2, pages 2161--2168, June 2006.
[12]
L. Parsons, E. Haque, and H. Liu. Subspace clustering for high dimensional data: a review. In ACM SIGKDD, volume 6, pages 90--105. Explorations Newsletter, 2004.
[13]
F. Perronnin and C. R. Dance. Fisher kernels on visual vocabularies for image categorization. In Conference on Computer Vision and Pattern Recognition (CVPR 2007), 18-23 June 2007, Minneapolis, Minnesota, USA. IEEE Computer Society, 2007.
[14]
G. Salton and C. Buckley. Term-weighting approaches in automatic text retrieval. Inf. Process. Manage., 24(5):513--523, Aug. 1988.
[15]
J. Sivic and A. Zisserman. Video Google: A text retrieval approach to object matching in videos. In Proceedings of the International Conference on Computer Vision, pages 1470--1477, Oct. 2003.
[16]
T. Urruty, S. Gbèhounou, H. T. Le, J. Martinet, and C. Fernandez-Maloigne. Iterative random visual word selection. In International Conference on Multimedia Retrieval, Glasgow, United Kingdom, page 249, 2014.
[17]
K. E. A. van de Sande, T. Gevers, and C. G. M. Snoek. Evaluating color descriptors for object and scene recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(9):1582--1596, 2010.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ICMR '15: Proceedings of the 5th ACM on International Conference on Multimedia Retrieval
June 2015
700 pages
ISBN:9781450332743
DOI:10.1145/2671188
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 June 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. bag of visual words model
  2. image retrieval
  3. information gain
  4. vocabulary construction

Qualifiers

  • Short-paper

Conference

ICMR '15
Sponsor:

Acceptance Rates

ICMR '15 Paper Acceptance Rate 48 of 127 submissions, 38%;
Overall Acceptance Rate 210 of 694 submissions, 30%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 63
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 16 Jan 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media