skip to main content
research-article

Sparse hashing for fast multimedia search

Published: 17 May 2013 Publication History

Abstract

Hash-based methods achieve fast similarity search by representing high-dimensional data with compact binary codes. However, both generating binary codes and encoding unseen data effectively and efficiently remain very challenging tasks. In this article, we focus on these tasks to implement approximate similarity search by proposing a novel hash based method named sparse hashing (SH for short). To generate interpretable (or semantically meaningful) binary codes, the proposed SH first converts original data into low-dimensional data through a novel nonnegative sparse coding method. SH then converts the low-dimensional data into Hamming space (i.e., binary encoding low-dimensional data) by a new binarization rule. After this, training data are represented by generated binary codes. To efficiently and effectively encode unseen data, SH learns hash functions by taking a-priori knowledge into account, such as implicit group effect of the features in training data, and the correlations between original space and the learned Hamming space. SH is able to perform fast approximate similarity search by efficient bit XOR operations in the memory of a modern PC with short binary code representations. Experimental results show that the proposed SH significantly outperforms state-of-the-art techniques.

References

[1]
Andoni, A. and Indyk, P. 2008. Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions. Comm. ACM 51, 1, 117--122.
[2]
Baluja, S. and Covell, M. 2010. Beyond “near-duplicates”: Learning hash codes for efficient similar-image retrieval. In Proceedings of the 20th International Conference on Pattern Recognition.
[3]
Belkin, M., Niyogi, P., and Sindhwani, V. 2006. Manifold regularization: A geometric framework for learning from labeled and unlabeled examples. J. Mach. Learn. Res. 7, 2399--2434.
[4]
Breiman, L. and Friedman, J. H. 1997. Predicting multivariate responses in multiple linear regression. J. R. Statist. Soc. Series B 59, 1, 3--54.
[5]
Charikar, M. S. 2002. Similarity estimation techniques from rounding algorithms. In Proceedings of the ACM Symposium on Theory of Computing. ACM, 380--388.
[6]
Cormen, T. H., Stein, C., Rivest, R. L., and Leiserson, C. E. 2001. Introduction to Algorithms 2nd Ed. McGraw-Hill Higher Education.
[7]
Dai, W., Yang, Q., Rong Xue, G., and Yu, Y. 2008. Self-taught clustering. In Proceedings of the International Conference on Machine Learning. 200--207.
[8]
Datar, M., Immorlica, N., Indyk, P., and Mirrokni, V. S. 2004. Locality-sensitive hashing scheme based on p-stable distributions. In Proceedings of the Annual Symposium on Computational Geometry. 253--262.
[9]
Drineas, P. and Mahoney, M. W. 2005. On the nystrm method for approximating a gram matrix for improved kernel-based learning. J. Mach. Learn. Res. 6, 2153--2175.
[10]
Efron, B., Hastie, T., Johnstone, L., and Tibshirani, R. 2004. Least angle regression. Ann. Statist. 32, 407--499.
[11]
Gao, S., Tsang, I. W.-H., Chia, L.-T., and Zhao, P. 2010. Local features are not lonely - laplacian sparse coding for image classification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3555--3561.
[12]
Ghosh, S. 2011. On the grouped selection and model complexity of the adaptive elastic net. Statist. Comput. 21, 3, 451--462.
[13]
Grauman, K. 2007. Pyramid match hashing: Sub-linear time indexing over partial correspondences. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1--8.
[14]
He, J., Chang, S.-F., Radhakrishnan, R., and Bauer, C. 2011. Compact hashing with joint optimization of search accuracy and time. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 753--760.
[15]
He, X. and Niyogi, P. 2003. Locality preserving projections. In Proceedings of the Annual Conference on Neural Information Processing Systems. 197--204.
[16]
Hotelling, H. 1936. Relations between two sets of variates. Biometrika 28, 3/4, 321--377.
[17]
Hoyer, P. O. 2004. Non-negative matrix factorization with sparseness constraints. J. Mach. Learn. Res. 5, 1457--1469.
[18]
Huang, Z., Shen, H. T., Liu, J., and Zhou, X. 2011. Effective data co-reduction for multimedia similarity search. In Proceedings of the ACM SIGMOD International Conference on Management of Data. 1021--1032.
[19]
Huang, Z., Shen, H. T., Shao, J., Zhou, X., and Cui, B. 2009. Bounded coordinate system indexing for real-time video clip search. ACM Trans. Inf. Syst. 27, 3.
[20]
Jain, P., Kulis, B., and Grauman, K. 2008. Fast image search for learned metrics. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1--8.
[21]
Jégou, H., Douze, M., and Schmid, C. 2011. Product quantization for nearest neighbor search. IEEE Trans. Pattern Anal. Mach. Intell. 33, 1, 117--128.
[22]
Kulis, B. and Darrell, T. 2009. Learning to hash with binary reconstructive embeddings. In Proceedings of the Annual Conference on Neural Information Processing Systems. 1042--1050.
[23]
Lee, H., Battle, A., Raina, R., and Ng, A. Y. 2007. Efficient sparse coding algorithms. In Proceedings of the Annual Conference on Neural Information Processing Systems. 801--808.
[24]
Lee, H., Raina, R., Teichman, A., and Ng, A. Y. 2009. Exponential family sparse coding with applications to self-taught learning. In Proceedings of the International Joint Conference on Artificial Intelligence. 1113--1119.
[25]
Liu, J., Huang, Z., Cai, H., Shen, H. T., Ngo, C.-W., and Wang, W. 2013. Near-duplicate video retrieval: Current research and future trends. ACM Comput Surv. To appear.
[26]
Liu, W., Wang, J., Kumar, S., and Chang, S.-F. 2011. Hashing with graphs. In Proceedings of the International Conference on Machine Learning. 1--8.
[27]
Lykou, A. and Whittaker, J. 2010. Sparse cca using a lasso with positivity constraints. Comput. Stat. Data Anal. 54, 3144--3157.
[28]
Mairal, J., Bach, F., Ponce, J., and Sapiro, G. 2010. Online learning for matrix factorization and sparse coding. J. Mach. Learn. Res. 11, 19--60.
[29]
Mu, Y., Shen, J., and Yan, S. 2010. Weakly-supervised hashing in kernel space. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3344--3351.
[30]
Muja, M. and Lowe, D. 2009. Fast approximate nearest neighbors with automatic algorithm configuration. In Proceedings of the International Conference on Computer Vision Theory and Applications. 331--340.
[31]
Norouzi, M. E. and Fleet, D. J. 2011. Minimal loss hashing for compact binary codes. In Proceedings of the International Conference on Machine Learning. 353--360.
[32]
Olshausen, B. A. and Field, D. J. 1996. Emergence of simple-cell receptive field properties by learning a sparse code for natural images. Nature 381, 6583, 607--609.
[33]
Raginsky, M. and Lazebnik, S. 2009. Locality-sensitive binary codes from shift-invariant kernels. In Proceedings of the Annual Conference on Neural Information Processing Systems. 1509--1517.
[34]
Raina, R., Battle, A., Lee, H., Packer, B., and Ng, A. Y. 2007. Self-taught learning: transfer learning from unlabeled data. In Proceedings of the International Conference on Machine Learning. 759--766.
[35]
Roweis, S. T. and Saul, L. K. 2000. Nonlinear dimensionality reduction by locally linear embedding. Science 290, 2323--2326.
[36]
Salakhutdinov, R. and Hinton, G. 2009. Semantic hashing. Int. J. Approximate Reasoning 50, 969--978.
[37]
Shang, L., Yang, L., Wang, F., Chan, K.-P., and Hua, X.-S. 2010. Real-time large scale near-duplicate web video retrieval. In Proceedings of the ACM International Conference on Multimedia. 531--540.
[38]
Shen, J., Tao, D., and Li, X. 2009. Quc-tree: Integrating query context information for efficient music retrieval. IEEE Trans. Multimedia 11, 2, 313--323.
[39]
Song, J., Yang, Y., Huang, Z., Shen, H. T., and Hong, R. 2011. Multiple feature hashing for real-time large scale near-duplicate video retrieval. In Proceedings of the ACM International Conference on Multimedia. 423--432.
[40]
Stein, B. 2007. Principles of hash-based text retrieval. In Proceedings of the ACM Special Interest Group on Information Retrieval. 527--534.
[41]
Tao, Y., Yi, K., Sheng, C., and Kalnis, P. 2009. Quality and efficiency in high dimensional nearest neighbor search. In Proceedings of the ACM Special Interest Group on Management of Data. 563--576.
[42]
Tibshirani, R. 1994. Regression shrinkage and selection via the lasso. J. R. Statist. Soc. Series B 58, 267--288.
[43]
Torralba, A., Fergus, R., and Weiss, Y. 2008. Small Codes and Large Image Databases for Recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1--8.
[44]
Wang, J., Kumar, S., and Chang, S.-F. 2010a. Semi-supervised hashing for scalable image retrieval. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3424--3431.
[45]
Wang, J., Kumar, S., and Chang, S.-F. 2010b. Sequential projection learning for hashing with compact codes. In Proceedings of the International Conference on Machine Learning. 1127--1134.
[46]
Weiss, Y., Torralba, A., and Fergus, R. 2008. Spectral hashing. In Proceedings of the Annual Conference on Neural Information Processing Systems. 1753--1760.
[47]
Wu, M. and Schölkopf, B. 2007. Transductive classification via local learning regularization. J. Mach. Learn. Res. 2, 628--635.
[48]
Zass, R. and Shashua, A. 2006. Nonnegative sparse PCA. In Proceedings of the Annual Conference on Neural Information Processing Systems. 1561--1568.
[49]
Zhang, D., Wang, F., and Si, L. 2011. Composite hashing with multiple information sources. In Proceedings of the ACM SIGIR Conference on Research and Development in Information Retrieval. 225--234.
[50]
Zhang, D., Wang, J., Cai, D., and Lu, J. 2010a. Laplacian co-hashing of terms and documents. In Proceedings of the European Conference on Information Retrieval. 577--580.
[51]
Zhang, D., Wang, J., Cai, D., and Lu, J. 2010b. Self-taught hashing for fast similarity search. In Proceedings of the ACM Special Interest Group on Information Retrieval. 18--25.
[52]
Zheng, M., Bu, J., Chen, C., Wang, C., Zhang, L., Qiu, G., and Cai, D. 2011. Graph regularized sparse coding for image representation. IEEE Trans. Image Process. 20, 5, 1327--1336.
[53]
Zou, H. and Hastie, T. 2005. Regularization and variable selection via the elastic net. J. R. Statist. Soc. Series B 67, 2, 301--320.

Cited By

View all
  • (2024)Double-layer hybrid-label identification feature selection for multi-view multi-label learningProceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial Intelligence10.1609/aaai.v38i11.29120(12295-12303)Online publication date: 20-Feb-2024
  • (2024)Image Hash Layer Triggered CNN Framework for Wafer Map Failure Pattern Retrieval and ClassificationACM Transactions on Knowledge Discovery from Data10.1145/363805318:4(1-26)Online publication date: 13-Feb-2024
  • (2024)Query Optimization Using Indexation Techniques in Datawarehouse: Survey and Use CasesArtificial Intelligence, Data Science and Applications10.1007/978-3-031-48465-0_53(406-412)Online publication date: 5-Mar-2024
  • Show More Cited By

Index Terms

  1. Sparse hashing for fast multimedia search

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Information Systems
    ACM Transactions on Information Systems  Volume 31, Issue 2
    May 2013
    180 pages
    ISSN:1046-8188
    EISSN:1558-2868
    DOI:10.1145/2457465
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 May 2013
    Accepted: 01 February 2013
    Revised: 01 August 2012
    Received: 01 March 2012
    Published in TOIS Volume 31, Issue 2

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Hashing
    2. indexing
    3. multimedia search
    4. sparse coding

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)17
    • Downloads (Last 6 weeks)4
    Reflects downloads up to 12 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media