skip to main content
10.1145/3206025.3206034acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
research-article

Asymmetric Discrete Cross-Modal Hashing

Published: 05 June 2018 Publication History

Abstract

Recently, cross-modal hashing (CMH) methods have attracted much attention. Many methods have been explored; however, there are still some issues that need to be further considered. 1) How to efficiently construct the correlations among heterogeneous modalities. 2) How to solve the NP-hard optimization problem and avoid the large quantization errors generated by relaxation. 3) How to handle the complex and difficult problem in most CMH methods that simultaneously learning the hash codes and hash functions. To address these challenges, we present a novel cross-modal hashing algorithm, named Asymmetric Discrete Cross-Modal Hashing (ADCH). Specifically, it leverages the collective matrix factorization technique to learn the common latent representations while preserving not only the cross-correlation from different modalities but also the semantic similarity. Instead of relaxing the binary constraints, it generates the hash codes directly using an iterative optimization algorithm proposed in this work. Based the learnt hash codes, ADCH further learns a series of binary classifiers as hash functions, which is flexible and effective. Extensive experiments are conducted on three real-world datasets. The results demonstrate that ADCH outperforms several state-of-the-art cross-modal hashing baselines.

References

[1]
Neyshabur Behnam, Srebro Nati, Salakhutdinov Ruslan, Makarychev Yury, and Yadollahpour Payman . 2013. The power of asymmetry in binary hashing. In Proceedings of the Advances in Neural Information Processing Systems. 2823--2831.
[2]
Zhen-Duo Chen, Wan-Jin Yu, Chuan-Xiang Li, Liqiang Nie, and Xin-Shun Xu . 2018. Dual deep neural networks cross-modal hashing. In Proceedings of the AAAI Conference on Artificial Intelligence.
[3]
Tat-Seng Chua, Jinhui Tang, Richang Hong, Haojie Li, Zhiping Luo, and Yantao Zheng . 2009. NUS-WIDE: A real-world web image database from National University of Singapore Proceedings of the ACM International Conference on Image and Video Retrieval. 48.
[4]
Cheng Da, Shibiao Xu, Kun Ding, Gaofeng Meng, Shiming Xiang, and Chunhong Pan . 2017. AMVH: Asymmetric multi-valued hashing. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 736--744.
[5]
Scott Deerwester, Susan T Dumais, George W Furnas, Thomas K Landauer, and Richard Harshman . 1990. Indexing by latent semantic analysis. J. Am. Soc. Inf. Sci. Vol. 41, 6 (1990), 391.
[6]
Guiguang Ding, Yuchen Guo, and Jile Zhou . 2014. Collective matrix factorization hashing for multimodal data Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2075--2082.
[7]
Fuli Feng, Xiangnan He, Yiqun Liu, Liqiang Nie, and Tat-Seng Chua . 2018. Learning on partial-order hypergraphs. In Proceedings of the International World Wide Web Conferences. 1523--1532.
[8]
Fuli Feng, Liqiang Nie, Xiang Wang, Richang Hong, and Tat-Seng Chua . 2017. Computational social indicators: A case study of chinese university ranking Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. 455--464.
[9]
Aristides Gionis, Piotr Indyk, Rajeev Motwani, et almbox. . 1999. Similarity search in high dimensions via hashing. In Proceedings of the International Conference on Very Large Data Bases. 518--529.
[10]
Albert Gordo, Florent Perronnin, Yunchao Gong, and Svetlana Lazebnik . 2014. Asymmetric distances for binary embeddings. IEEE Trans. Pattern Anal. Mach. Intell. Vol. 36, 1 (2014), 33--47.
[11]
Jie Gui, Tongliang Liu, Zhenan Sun, Dacheng Tao, and Tieniu Tan . 2018. Fast supervised discrete hashing. IEEE Trans. Pattern Anal. Mach. Intell. Vol. 40, 2 (2018), 490--496.
[12]
Xiangnan He and Tat-Seng Chua . 2017. Neural factorization machines for sparse predictive analytics Proceedings of the AAAI Conference on Artificial Intelligence. 355--364.
[13]
Mark J Huiskes and Michael S Lew . 2008. The MIR flickr retrieval evaluation. In Proceedings of the ACM International Conference on Multimedia Information Retrieval. 39--43.
[14]
Qing-Yuan Jiang and Wu-Jun Li . 2017. Deep cross-modal hashing. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3270--3278.
[15]
Yu-Gang Jiang, Jun Wang, Xiangyang Xue, and Shih-Fu Chang . 2013. Query-adaptive image search with hash codes. IEEE Trans. Multimedia Vol. 15, 2 (2013), 442--453.
[16]
Wang-Cheng Kang, Wu-Jun Li, and Zhi-Hua Zhou . 2016. Column sampling based discrete supervised hashing. In Proceedings of the AAAI Conference on Artificial Intelligence. 1230--1236.
[17]
Weihao Kong and Wu-Jun Li . 2012. Isotropic hashing. In Proceedings of the Advances in Neural Information Processing Systems. 1646--1654.
[18]
Shaishav Kumar and Raghavendra Udupa . 2011. Learning hash functions for cross-view similarity search Proceedings of the International Joint Conference on Artificial Intelligence. 1360--1365.
[19]
Guosheng Lin, Chunhua Shen, Qinfeng Shi, Anton Van den Hengel, and David Suter . 2014. Fast supervised hashing with decision trees for high-dimensional data Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1963--1970.
[20]
Guosheng Lin, Chunhua Shen, David Suter, and Anton van den Hengel . 2013. A general two-step approach to learning-based hashing Proceedings of the International Conference on Computer Vision. 2552--2559.
[21]
Zijia Lin, Guiguang Ding, Mingqing Hu, and Jianmin Wang . 2015. Semantics-preserving hashing for cross-view retrieval Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3864--3872.
[22]
Hong Liu, Rongrong Ji, Yongjian Wu, and Gang Hua . 2016 c. Supervised matrix factorization for cross-modality hashing Proceedings of the International Joint Conference on Artificial Intelligence. 1767--1773.
[23]
Hong Liu, Rongrong Ji, Yongjian Wu, Feiyue Huang, and Baochang Zhang . 2017 a. Cross-modality binary code learning via fusion similarity hashing Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 7380--7388.
[24]
Meng Liu, Liqiang Nie, Meng Wang, and Baoquan Chen . 2017 b. Towards micro-video understanding by joint sequential-sparse modeling Proceedings of the ACM International Conference on Multimedia. 970--978.
[25]
Wei Liu, Jun Wang, Rongrong Ji, Yu-Gang Jiang, and Shih-Fu Chang . 2012. Supervised hashing with kernels. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2074--2081.
[26]
Xianglong Liu, Cheng Deng, Bo Lang, Dacheng Tao, and Xuelong Li . 2016 a. Query-adaptive reciprocal hash tables for nearest neighbor search. IEEE Trans. Image Process. Vol. 25, 2 (2016), 907--919.
[27]
Xianglong Liu, Lei Huang, Cheng Deng, Bo Lang, and Dacheng Tao . 2016 b. Query-adaptive hash code ranking for large-scale multi-view visual search. IEEE Trans. Image Process. Vol. 25, 10 (2016), 4514--4524.
[28]
Mingsheng Long, Yue Cao, Jianmin Wang, and Philip S Yu . 2016. Composite correlation quantization for efficient multimodal retrieval Proceedings of the AAAI Conference on Artificial Intelligence. 579--588.
[29]
David G Lowe . 2004. Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. Vol. 60, 2 (2004), 91--110.
[30]
Xin Luo, Ye Wu, and Xin-Shun Xu . 2018. Scalable supervised discrete hashing for large-scale search Proceedings of the International World Wide Web Conferences. 1603--1612.
[31]
Jonathan Masci, Michael M Bronstein, Alexander M Bronstein, and Jürgen Schmidhuber . 2014. Multimodal similarity-preserving hashing. IEEE Trans. Pattern Anal. Mach. Intell. Vol. 36, 4 (2014), 824--830.
[32]
Liqiang Nie, Xuemeng Song, and Tat-Seng Chua . 2016. Learning from multiple social networks. Morgan & Claypool Publishers.
[33]
Liqiang Nie, Meng Wang, Zheng-Jun Zha, and Tat-Seng Chua . 2012 a. Oracle in image search: A content-based approach to performance prediction. ACM Trans. Inf. Syst. Vol. 30, 2 (2012), 13.
[34]
Liqiang Nie, Shuicheng Yan, Meng Wang, Richang Hong, and Tat-Seng Chua . 2012 b. Harvesting visual concepts for image search with complex queries Proceedings of the ACM International Conference on Multimedia. 59--68.
[35]
Nikhil Rasiwasia, Jose Costa Pereira, Emanuele Coviello, Gabriel Doyle, Gert RG Lanckriet, Roger Levy, and Nuno Vasconcelos . 2010. A new approach to cross-modal multimedia retrieval Proceedings of the ACM International Conference on Multimedia. 251--260.
[36]
Fumin Shen, Chunhua Shen, Wei Liu, and Heng Tao Shen . 2015. Supervised discrete hashing. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 37--45.
[37]
Fumin Shen, Chunhua Shen, Qinfeng Shi, Anton Hengel, and Zhenmin Tang . 2013. Inductive hashing on manifolds. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1562--1569.
[38]
Jingkuan Song, Yang Yang, Yi Yang, Zi Huang, and Heng Tao Shen . 2013. Inter-media hashing for large-scale retrieval from heterogeneous data sources Proceedings of the ACM SIGMOD Conference on Management of Data. 785--796.
[39]
Jinhui Tang, Zechao Li, Meng Wang, and Ruizhen Zhao . 2015. Neighborhood discriminant hashing for large-scale image retrieval. IEEE Trans. Image Process. Vol. 24, 9 (2015), 2827--2840.
[40]
Jun Wang, Sanjiv Kumar, and Shih-Fu Chang . 2010. Semi-supervised hashing for scalable image retrieval Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3424--3431.
[41]
Jian Wang, Xin-Shun Xu, Shanqing Guo, Lizhen Cui, and Xiao-Lin Wang . 2016. Linear unsupervised hashing for ANN search in Euclidean space. Neurocomputing Vol. 171 (2016), 283--292.
[42]
Meng Wang, Weisheng Li, Dong Liu, Jialie Shen, and Shuicheng Yan . 2015. Facilitating image search with a scalable and compact semantic mapping. IEEE Trans. Cybern. Vol. 45, 8 (2015), 1561--1574.
[43]
Yair Weiss, Antonio Torralba, and Rob Fergus . 2009. Spectral hashing. In Proceedings of the Advances in Neural Information Processing Systems. 1753--1760.
[44]
Rongkai Xia, Yan Pan, Hanjiang Lai, Cong Liu, and Shuicheng Yan . 2014. Supervised hashing for image retrieval via image representation learning Proceedings of the AAAI Conference on Artificial Intelligence. 2156--2162.
[45]
Xing Xu, Fumin Shen, Yang Yang, Heng Tao Shen, and Xuelong Li . 2017. Learning discriminative binary codes for large-scale cross-modal retrieval. IEEE Trans. Image Process. Vol. 26, 5 (2017), 2494--2507.
[46]
Ting-Kun Yan, Xin-Shun Xu, Shanqing Guo, Zi Huang, and Xiaolin Wang . 2016. Supervised robust discrete multimodal hashing for cross-media retrieval Proceedings of the ACM International Conference on Information and Knowledge Management. 1271--1280.
[47]
Erkun Yang, Cheng Deng, Wei Liu, Xianglong Liu, Dacheng Tao, and Xinbo Gao . 2017. Pairwise relationship guided deep hashing for cross-modal retrieval Proceedings of the AAAI Conference on Artificial Intelligence. 1618--1625.
[48]
Yang Yang, Zheng-Jun Zha, Yue Gao, Xiaofeng Zhu, and Tat-Seng Chua . 2014. Exploiting web images for semantic video indexing via robust sample-specific loss. IEEE Trans. Multimedia Vol. 16, 6 (2014), 1677--1689.
[49]
Dongqing Zhang and Wu-Jun Li . 2014. Large-scale supervised multimodal hashing with semantic correlation maximization Proceedings of the AAAI Conference on Artificial Intelligence. 2177--2183.
[50]
Peng-Fei Zhang, Chuan-Xiang Li, Meng-Yuan Liu, Liqiang Nie, and Xin-Shun Xu . 2017. Semi-relaxation supervised hashing for cross-modal retrieval Proceedings of the ACM International Conference on Multimedia. 1762--1770.
[51]
Jile Zhou, Guiguang Ding, and Yuchen Guo . 2014. Latent semantic sparse hashing for cross-modal similarity search Proceedings of the AAAI Conference on Artificial Intelligence. 415--424.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ICMR '18: Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval
June 2018
550 pages
ISBN:9781450350464
DOI:10.1145/3206025
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 June 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. cross-modal retrieval
  2. discrete optimization.
  3. hashing
  4. learning to hash
  5. multimedia retrieval

Qualifiers

  • Research-article

Funding Sources

  • National Natural Science Foundation of China
  • Key Research and Development Program of Shandong Province

Conference

ICMR '18
Sponsor:

Acceptance Rates

ICMR '18 Paper Acceptance Rate 44 of 136 submissions, 32%;
Overall Acceptance Rate 254 of 830 submissions, 31%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)8
  • Downloads (Last 6 weeks)0
Reflects downloads up to 18 Jan 2025

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media