short-paper

Information Gain Study for Visual Vocabulary Construction

Authors:

Syntyche Gbèhounou,

Thierry Urruty,

François Lecellier,

Christine Fernandez

ICMR '15: Proceedings of the 5th ACM on International Conference on Multimedia Retrieval

Pages 503 - 506

https://doi.org/10.1145/2671188.2749319

Published: 22 June 2015

Abstract

Content-Based Image Retrieval (CBIR) systems retrieve from a collection the images most similar to a query image. One of the most popular and widely applied models for this task is the Bag of Visual Words (BoVW) model. In this paper, we present an evaluation study of different information gain models used to construct a visual word vocabulary. In the proposed framework, information gain serves as discriminative information to index image features and to select those with the highest information gain values. The experiments evaluate the effect of four information gain models (tf-idf, entropy, bm25, and tfc) across different descriptors and image databases. The results show that selecting image features based on at least one of the studied information gain models makes the retrieval process more accurate than the classical Bag of Visual Words model.
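To make the selection idea concrete, here is a minimal sketch of scoring candidate visual words with tf-idf and keeping the highest-scoring ones. The abstract does not give the authors' formulas, so the textbook tf-idf definition, the mean aggregation over images, the function names `tfidf_scores` and `select_top_words`, and the toy counts are all assumptions made for illustration, not the paper's method.

```python
import math


def tfidf_scores(histograms):
    """Score each visual word by its mean tf-idf weight over the collection.

    histograms: list of per-image count lists, all of the same length
    (one entry per candidate visual word). Returns one score per word.
    Assumption: textbook tf-idf, averaged over images.
    """
    n_images = len(histograms)
    n_words = len(histograms[0])
    # Document frequency: number of images in which each word occurs.
    df = [sum(1 for h in histograms if h[w] > 0) for w in range(n_words)]
    scores = [0.0] * n_words
    for h in histograms:
        total = sum(h)
        if total == 0:
            continue  # an image without features contributes nothing
        for w in range(n_words):
            if h[w] > 0:
                tf = h[w] / total                  # term frequency in this image
                idf = math.log(n_images / df[w])   # inverse document frequency
                scores[w] += tf * idf
    return [s / n_images for s in scores]


def select_top_words(histograms, k):
    """Return the indices of the k visual words with the highest score."""
    scores = tfidf_scores(histograms)
    ranked = sorted(range(len(scores)), key=lambda w: scores[w], reverse=True)
    return sorted(ranked[:k])


# Toy collection: 3 images, 4 candidate visual words (hypothetical counts).
histograms = [
    [5, 0, 1, 2],
    [4, 3, 0, 2],
    [6, 0, 0, 2],
]
selected = select_top_words(histograms, k=2)
```

In this toy collection, words 0 and 3 occur in every image, so their idf is zero and they are not selected; words 1 and 2 each occur in a single image and are kept. Swapping `tfidf_scores` for an entropy, bm25, or tfc weighting would change only the scoring function, not the selection step.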

Published In

ICMR '15: Proceedings of the 5th ACM on International Conference on Multimedia Retrieval

June 2015

700 pages

ISBN:9781450332743

DOI:10.1145/2671188

General Chairs: Alex Hauptmann (Carnegie Mellon University, USA), Chong-Wah Ngo (City University of Hong Kong, China), Xiangyang Xue (Fudan University, China)

Program Chairs: Yu-Gang Jiang (Fudan University, China), Cees Snoek (University of Amsterdam and Qualcomm Research Netherlands), Nuno Vasconcelos (University of California, San Diego, USA)

Copyright © 2015 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Conference

ICMR '15: International Conference on Multimedia Retrieval

June 23 - 26, 2015

Shanghai, China

Acceptance Rates

ICMR '15 Paper Acceptance Rate 48 of 127 submissions, 38%;

Overall Acceptance Rate 210 of 694 submissions, 30%
