research-article

A Statistical Approach to Mining Semantic Similarity for Deep Unsupervised Hashing

Authors:

Jianqiang Huang,

Xian-Sheng HuaAuthors Info & Claims

MM '21: Proceedings of the 29th ACM International Conference on Multimedia

Pages 4306 - 4314

https://doi.org/10.1145/3474085.3475570

Published: 17 October 2021 Publication History

Abstract

The majority of deep unsupervised hashing methods usually first construct pairwise semantic similarity information and then learn to map images into compact hash codes while preserving the similarity structure, which implies that the quality of hash codes highly depends on the constructed semantic similarity structure. However, since the features of images for each kind of semantics usually scatter in high-dimensional space with unknown distribution, previous methods could introduce a large number of false positives and negatives for boundary points of distributions in the local semantic structure based on pairwise cosine distances. Towards this limitation, we propose a general distribution-based metric to depict the pairwise distance between images. Specifically, each image is characterized by its random augmentations that can be viewed as samples from the corresponding latent semantic distribution. Then we estimate the distances between images by calculating the sample distribution divergence of their semantics. By applying this new metric to deep unsupervised hashing, we come up with Distribution-based similArity sTructure rEconstruction (DATE). DATE can generate more accurate semantic similarity information by using non-parametric ball divergence. Moreover, DATE explores both semantic-preserving learning and contrastive learning to obtain high-quality hash codes. Extensive experiments on several widely-used datasets validate the superiority of our DATE.

Supplementary Material

MP4 File (MM21-fp2144.mp4)

Presentation video - short version

Download
8.00 MB

References

[1]

Yue Cao, Mingsheng Long, Bin Liu, and Jianmin Wang. 2018a. Deep cauchy hashing for hamming space retrieval. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[2]

Zhangjie Cao, Mingsheng Long, Jianmin Wang, and Philip S Yu. 2017. Hashnet: Deep learning to hash by continuation. In Proceedings of the IEEE international conference on computer vision.

[3]

Zhangjie Cao, Ziping Sun, Mingsheng Long, Jianmin Wang, and Philip S Yu. 2018b. Deep priority hashing. In Proceedings of the 26th ACM international conference on Multimedia.

Digital Library

[4]

Ting Chen, Simon Kornblith, Mohammad Norouzi, and Geoffrey Hinton. 2020. A simple framework for contrastive learning of visual representations. In Proceedings of the International Conference on Machine Learning.

[5]

Tat-Seng Chua, Jinhui Tang, Richang Hong, Haojie Li, Zhiping Luo, and Yantao Zheng. 2009. NUS-WIDE: a real-world web image database from National University of Singapore. In Proceedings of the ACM international Conference on Image and Video Retrieval.

Digital Library

[6]

Cheng Chang Himanshu Rai Junwei Ma Satya Krishna Gorti Maksims Volkovs Chundi Liu, Guangwei Yu. 2019. Guided Similarity Separation for Image Retrieval. In Proceedings of the International Conference on Neural Information Processing Systems (NeurIPS).

Digital Library

[7]

Bo Dai, Ruiqi Guo, Sanjiv Kumar, Niao He, and Le Song. 2017. Stochastic generative hashing. In Proceedings of the International Conference on Machine Learning.

Digital Library

[8]

Yunchao Gong, Svetlana Lazebnik, Albert Gordo, and Florent Perronnin. 2012. Iterative quantization: A procrustean approach to learning binary codes for large-scale image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 35, 12 (2012), 2916--2929.

Digital Library

[9]

Yifan Gu, Shidong Wang, Haofeng Zhang, Yazhou Yao, and Li Liu. 2019. Clustering-driven unsupervised deep hashing for image retrieval. Neurocomputing, Vol. 368 (2019), 114--123.

Digital Library

[10]

Raia Hadsell, Sumit Chopra, and Yann LeCun. 2006. Dimensionality reduction by learning an invariant mapping. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

Digital Library

[11]

Kaiming He, Haoqi Fan, Yuxin Wu, Saining Xie, and Ross Girshick. 2020. Momentum contrast for unsupervised visual representation learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[12]

Jae-Pil Heo, Youngwoon Lee, Junfeng He, Shih-Fu Chang, and Sung-Eui Yoon. 2012. Spherical hashing. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

Digital Library

[13]

Qinghao Hu, Jiaxiang Wu, Jian Cheng, Lifang Wu, and Hanqing Lu. 2017. Pseudo label based unsupervised deep discriminative hashing for image retrieval. In Proceedings of the 25th ACM international conference on Multimedia.

Digital Library

[14]

Mark J Huiskes and Michael S Lew. 2008. The MIR flickr retrieval evaluation. In Proceedings of the 1st ACM international conference on Multimedia information retrieval.

Digital Library

[15]

Zhongming Jin, Cheng Li, Yue Lin, and Deng Cai. 2013. Density sensitive hashing. IEEE transactions on cybernetics, Vol. 44, 8 (2013), 1362--1371.

[16]

Alex Krizhevsky, Geoffrey Hinton, et al. 2009. Learning multiple layers of features from tiny images. (2009).

[17]

Hanjiang Lai, Yan Pan, Ye Liu, and Shuicheng Yan. 2015. Simultaneous feature learning and hash coding with deep neural networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[18]

Qi Li, Zhenan Sun, Ran He, and Tieniu Tan. 2017. Deep supervised discrete hashing. In Proceedings of the International Conference on Neural Information Processing Systems (NeurIPS). 2482--2491.

Digital Library

[19]

Yunfan Li, Peng Hu, Zitao Liu, Dezhong Peng, Joey Tianyi Zhou, and Xi Peng. 2021. Contrastive Clustering. In Proceedings of the AAAI Conference on Artificial Intelligence.

[20]

Kevin Lin, Jiwen Lu, Chu-Song Chen, and Jie Zhou. 2016. Learning compact binary descriptors with unsupervised deep neural networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[21]

Mingbao Lin, Rongrong Ji, Hong Liu, and Yongjian Wu. 2018. Supervised online hashing via hadamard codebook learning. In Proceedings of the 26th ACM international conference on Multimedia.

Digital Library

[22]

Xiao Luo, Chong Chen, Huasong Zhong, Hao Zhang, Minghua Deng, Jianqiang Huang, and Xiansheng Hua. 2020. A Survey on Deep Hashing Methods. arXiv preprint arXiv:2003.03369 (2020).

[23]

Xiao Luo, Daqing Wu, Chong Chen, Jinwen Ma, and Minghua Deng. 2021 a. Deep Unsupervised Hashing by Global and Local Consistency. In IEEE International Conference on Multimedia and Expo (ICME).

[24]

Xiao Luo, Daqing Wu, Zeyu Ma, Chong Chen, Jinwen Ma, Minghua Deng, Zhongming Jin, Jianqiang Huang, and Xian-sheng Hua. 2021 b. CIMON: Towards High-quality Hash Codes. In Proceedings of the International Joint Conference on Artificial Intelligence.

[25]

Wenliang Pan, Yuan Tian, Xueqin Wang, and Heping Zhang. 2018. Ball divergence: nonparametric two sample test. Annals of statistics, Vol. 46, 3 (2018), 1109.

[26]

Wenliang Pan, Xueqin Wang, Heping Zhang, Hongtu Zhu, and Jin Zhu. 2019. Ball covariance: A generic measure of dependence in banach space. J. Amer. Statist. Assoc. (2019).

[27]

Fumin Shen, Yan Xu, Li Liu, Yang Yang, Zi Huang, and Heng Tao Shen. 2018. Unsupervised deep hashing with similarity-adaptive and discrete optimization. IEEE transactions on pattern analysis and machine intelligence, Vol. 40, 12 (2018), 3034--3044.

Digital Library

[28]

Yuming Shen, Li Liu, and Ling Shao. 2019. Unsupervised binary representation learning with deep variational networks. International Journal of Computer Vision, Vol. 127, 11 (2019), 1614--1628.

Digital Library

[29]

Yuming Shen, Jie Qin, Jiaxin Chen, Mengyang Yu, Li Liu, Fan Zhu, Fumin Shen, and Ling Shao. 2020. Auto-encoding twin-bottleneck hashing. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2818--2827.

[30]

K. Simonyan and A. Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. In Proceedings of the International Conference on Learning Representations.

[31]

Jingkuan Song, Tao He, Lianli Gao, Xing Xu, Alan Hanjalic, and Heng Tao Shen. 2018. Binary generative adversarial networks for image retrieval. In Proceedings of the AAAI Conference on Artificial Intelligence.

[32]

Shupeng Su, Chao Zhang, Kai Han, and Yonghong Tian. 2018. Greedy hash: Towards fast optimization for accurate hash coding in cnn. In Proceedings of the International Conference on Neural Information Processing Systems.

Digital Library

[33]

Rong-Cheng Tu, Xian-Ling Mao, and Wei Wei. 2020. MLS3RDUH: Deep Unsupervised Hashing via Manifold based Local Semantic Similarity Structure Reconstructing. In Proceedings of the International Joint Conference on Artificial Intelligence.

[34]

Laurens Van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of machine learning research, Vol. 9, 11 (2008).

[35]

Jingdong Wang, Heng Tao Shen, Jingkuan Song, and Jianqiu Ji. 2014. Hashing for similarity search: A survey. arXiv preprint arXiv:1408.2927 (2014).

[36]

Tongzhou Wang and Phillip Isola. 2020. Understanding Contrastive Representation Learning through Alignment and Uniformity on the Hypersphere. In Proceedings of the International Conference on Machine Learning.

[37]

Yair Weiss, Antonio Torralba, and Rob Fergus. 2009. Spectral hashing. In Proceedings of the International Conference on Neural Information Processing Systems (NeurIPS).

[38]

Cheng Yan, Guansong Pang, Xiao Bai, Chunhua Shen, Jun Zhou, and Edwin Hancock. 2019. Deep hashing by discriminating hard examples. In Proceedings of the 27th ACM International Conference on Multimedia.

Digital Library

[39]

Erkun Yang, Cheng Deng, Tongliang Liu, Wei Liu, and Dacheng Tao. 2018. Semantic structure-based unsupervised deep hashing. In Proceedings of the International Joint Conference on Artificial Intelligence.

Digital Library

[40]

Erkun Yang, Tongliang Liu, Cheng Deng, Wei Liu, and Dacheng Tao. 2019. Distillhash: Unsupervised deep hashing by distilling data pairs. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[41]

Haofeng Zhang, Li Liu, Yang Long, and Ling Shao. 2017. Unsupervised deep hashing with pseudo labels for scalable image retrieval. IEEE Transactions on Image Processing, Vol. 27, 4 (2017), 1626--1638.

Digital Library

[42]

Wanqian Zhang, Dayan Wu, Yu Zhou, Bo Li, Weiping Wang, and Dan Meng. 2020. Deep Unsupervised Hybrid-similarity Hadamard Hashing. In Proceedings of the 28th ACM International Conference on Multimedia.

Digital Library

[43]

Shu Zhao, Dayan Wu, Wanqian Zhang, Yu Zhou, Bo Li, and Weiping Wang. 2020. Asymmetric Deep Hashing for Efficient Hash Code Compression. In Proceedings of the 28th ACM International Conference on Multimedia.

Digital Library

[44]

Maciej Zieba, Piotr Semberecki, Tarek El-Gaaly, and Tomasz Trzcinski. 2018. Bingan: Learning compact binary descriptors with a regularized gan. In Proceedings of the International Conference on Neural Information Processing Systems (NeurIPS).

Digital Library

Cited By

Sun TJiang BLi BLv JGao YDong WBagchi SZhang Y(2024)SimEncProceedings of the 2024 USENIX Conference on Usenix Annual Technical Conference10.5555/3691992.3692030(615-630)Online publication date: 10-Jul-2024
https://dl.acm.org/doi/10.5555/3691992.3692030
Chen BWu ZLiu YZeng BLu GZhang ZLarson K(2024)Enhancing cross-modal retrieval via visual-textual prompt hashingProceedings of the Thirty-Third International Joint Conference on Artificial Intelligence10.24963/ijcai.2024/69(623-631)Online publication date: 3-Aug-2024
https://dl.acm.org/doi/10.24963/ijcai.2024/69
Kawai VValem LBaldassin ABorin EPedronette DLatecki L(2024)Rank-based Hashing for Effective and Efficient Nearest Neighbor Search for Image RetrievalACM Transactions on Multimedia Computing, Communications, and Applications10.1145/365958020:10(1-19)Online publication date: 12-Sep-2024
https://dl.acm.org/doi/10.1145/3659580
Show More Cited By

Index Terms

A Statistical Approach to Mining Semantic Similarity for Deep Unsupervised Hashing
1. Information systems
  1. Information retrieval
    1. Retrieval models and ranking
      1. Similarity measures

Recommendations

Deep Self-Adaptive Hashing for Image Retrieval
CIKM '21: Proceedings of the 30th ACM International Conference on Information & Knowledge Management

Hashing technology has been widely used in image retrieval due to its computational and storage efficiency. Recently, deep unsupervised hashing methods have attracted increasing attention due to the high cost of human annotations in the real world and ...
Deep Unsupervised Hybrid-similarity Hadamard Hashing
MM '20: Proceedings of the 28th ACM International Conference on Multimedia

Hashing has become increasingly important for large-scale image retrieval. Recently, deep supervised hashing has shown promising performance, yet little work has been done under the more realistic unsupervised setting. The most challenging problem in ...
Unsupervised Hashing with Semantic Concept Mining
PACMMOD

Recently, to improve the unsupervised image retrieval performance, plenty of unsupervised hashing methods have been proposed by designing a semantic similarity matrix, which is based on the similarities between image features extracted by a pre-trained ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

MM '21: Proceedings of the 29th ACM International Conference on Multimedia

October 2021

5796 pages

ISBN:9781450386517

DOI:10.1145/3474085

General Chairs:
Heng Tao Shen
University of Electronic Science&Technology of China, China
,
Yueting Zhuang
Zhejiang University, China
,
John R. Smith
IBM, USA
,
Program Chairs:
Yang Yang
University of Electronic Science and Technology of China, China
,
Pablo Cesar
CWI&TU Delft, The Netherlands
,
Florian Metze
FACEBOOK, Inc., USA
,
Balakrishnan Prabhakaran
University of Texas at Dallas, USA

Copyright © 2021 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 October 2021

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

The National Natural Science Foundation of China
The National Key Research and Development Program of China

Conference

MM '21

Sponsor:

SIGMM

MM '21: ACM Multimedia Conference

October 20 - 24, 2021

Virtual Event, China

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

18
Total Citations
View Citations
447
Total Downloads

Downloads (Last 12 months)80
Downloads (Last 6 weeks)3

Reflects downloads up to 31 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Sun TJiang BLi BLv JGao YDong WBagchi SZhang Y(2024)SimEncProceedings of the 2024 USENIX Conference on Usenix Annual Technical Conference10.5555/3691992.3692030(615-630)Online publication date: 10-Jul-2024
https://dl.acm.org/doi/10.5555/3691992.3692030
Chen BWu ZLiu YZeng BLu GZhang ZLarson K(2024)Enhancing cross-modal retrieval via visual-textual prompt hashingProceedings of the Thirty-Third International Joint Conference on Artificial Intelligence10.24963/ijcai.2024/69(623-631)Online publication date: 3-Aug-2024
https://dl.acm.org/doi/10.24963/ijcai.2024/69
Kawai VValem LBaldassin ABorin EPedronette DLatecki L(2024)Rank-based Hashing for Effective and Efficient Nearest Neighbor Search for Image RetrievalACM Transactions on Multimedia Computing, Communications, and Applications10.1145/365958020:10(1-19)Online publication date: 12-Sep-2024
https://dl.acm.org/doi/10.1145/3659580
Ma ZLi YLuo YLuo XLi JChen CHua XLu G(2024)Discrepancy and Structure-Based Contrast for Test-Time Adaptive RetrievalIEEE Transactions on Multimedia10.1109/TMM.2024.338133726(8665-8677)Online publication date: 25-Mar-2024
https://dl.acm.org/doi/10.1109/TMM.2024.3381337
Zhang FChen CHua XLuo X(2024)FATE: Learning Effective Binary Descriptors With Group FairnessIEEE Transactions on Image Processing10.1109/TIP.2024.340613433(3648-3661)Online publication date: 2024
https://doi.org/10.1109/TIP.2024.3406134
Wei RLiu YSong JXie YZhou K(2024)Exploring Hierarchical Information in Hyperbolic Space for Self-Supervised Image HashingIEEE Transactions on Image Processing10.1109/TIP.2024.337135833(1768-1781)Online publication date: 5-Mar-2024
https://dl.acm.org/doi/10.1109/TIP.2024.3371358
Cao HHuang LNie JWei Z(2024)Unsupervised Deep Hashing With Fine-Grained Similarity-Preserving Contrastive Learning for Image RetrievalIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2023.332044434:5(4095-4108)Online publication date: May-2024
https://doi.org/10.1109/TCSVT.2023.3320444
Wei RLiu YSong JCui HXie YZhou KEl Saddik AMei TCucchiara RBertini MTobon Vallejo DAtrey PHossain M(2023)CHAIN: Exploring Global-Local Spatio-Temporal Information for Improved Self-Supervised Video HashingProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3613440(1677-1688)Online publication date: 26-Oct-2023
https://dl.acm.org/doi/10.1145/3581783.3613440
Song ZSu QChen JEl Saddik AMei TCucchiara RBertini MTobon Vallejo DAtrey PHossain M(2023)Unsupervised Hashing with Contrastive Learning by Exploiting Similarity Knowledge and Hidden Structure of DataProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3612596(6350-6358)Online publication date: 26-Oct-2023
https://dl.acm.org/doi/10.1145/3581783.3612596
Wen JXiang SPan CEl Saddik AMei TCucchiara RBertini MTobon Vallejo DAtrey PHossain M(2023)Exploring Universal Principles for Graph Contrastive Learning: A Statistical PerspectiveProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3612229(3579-3589)Online publication date: 26-Oct-2023
https://dl.acm.org/doi/10.1145/3581783.3612229
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten