skip to main content
10.1145/3652583.3658081acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
research-article

Deep Image Clustering Based on Curriculum Learning and Density Information

Published: 07 June 2024 Publication History

Abstract

Image clustering is one of the crucial techniques in multimedia analytics and knowledge discovery. Recently, the Deep clustering method (DC), characterized by its ability to perform feature learning and cluster assignment jointly, surpasses the performance of traditional ones on image data. However, existing methods rarely consider the role of model learning strategies in improving the robustness and performance of clustering complex image data. Furthermore, most approaches rely solely on point-to-point distances to cluster centers for partitioning the latent representations, resulting in error accumulation throughout the iterative process. In this paper, we propose a robust image clustering method (IDCL) which, to our knowledge for the first time, introduces a model training strategy using density information into image clustering. Specifically, we design a curriculum learning scheme grounded in the density information of input data, with a more reasonable learning pace. Moreover, we employ the density core rather than the individual cluster center to guide the cluster assignment. Finally, extensive comparisons with state-of-the-art clustering approaches on benchmark datasets demonstrate the superiority of the proposed method, including robustness, rapid convergence, and flexibility in terms of data scale, number of clusters, and image context.

References

[1]
2022. Deep embedded median clustering for routing misbehaviour and attacks detection in ad-hoc networks. Ad Hoc Networks 126 (2022), 102757. https: //doi.org/10.1016/j.adhoc.2021.102757
[2]
Yoshua Bengio, Jérôme Louradour, Ronan Collobert, and JasonWeston. 2009. Curriculum Learning. In Proceedings of the 26th Annual International Conference on Machine Learning (Montreal, Quebec, Canada) (ICML '09). Association for Computing Machinery, New York, NY, USA, 41?C48. https://doi.org/10.1145/ 1553374.1553380
[3]
Jinyu Cai, Jicong Fan, Wenzhong Guo, Shiping Wang, Yunhe Zhang, and Zhao Zhang. 2022. Efficient Deep Embedded Subspace Clustering. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 1--10.
[4]
Mathilde Caron, Piotr Bojanowski, Armand Joulin, and Matthijs Douze. 2018. Deep clustering for unsupervised learning of visual features. In Proceedings of the European conference on computer vision (ECCV). 132--149.
[5]
Rui Chen, Yongqiang Tang, Lei Tian, Caixia Zhang, and Wensheng Zhang. 2022. Deep convolutional self-paced clustering. Applied Intelligence (2022), 1--15.
[6]
Rui Chen, Yongqiang Tang, Lei Tian, Caixia Zhang, and Wensheng Zhang. 2022. Deep convolutional self-paced clustering. Applied Intelligence (2022), 1--15.
[7]
Adam Coates, Andrew Ng, and Honglak Lee. 2011. An Analysis of Single- Layer Networks in Unsupervised Feature Learning. In Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics (Proceedings of Machine Learning Research, Vol. 15), Geoffrey Gordon, David Dunson, and Miroslav Dud??k (Eds.). PMLR, Fort Lauderdale, FL, USA, 215--223. https://proceedings.mlr.press/v15/coates11a.html
[8]
Gregory Cohen, Saeed Afshar, Jonathan Tapson, and Andre Van Schaik. 2017. EMNIST: Extending MNIST to handwritten letters. In 2017 international joint conference on neural networks (IJCNN). IEEE, 2921--2926. https://doi.org/10. 1109/IJCNN.2017.7966217
[9]
Yao Ding, Zhili Zhang, Xiaofeng Zhao, Wei Cai, Nengjun Yang, Haojie Hu, Xianxiang Huang, Yuan Cao, and Weiwei Cai. 2022. Unsupervised Self-Correlated Learning Smoothy Enhanced Locality Preserving Graph Convolution Embedding Clustering for Hyperspectral Images. IEEE Transactions on Geoscience and Remote Sensing 60 (2022), 1--16. https://doi.org/10.1109/TGRS.2022.3202865
[10]
Liang Duan, Charu Aggarwal, Shuai Ma, and Saket Sathe. 2019. Improving Spectral Clustering with Deep Embedding and Cluster Estimation. In 2019 IEEE International Conference on Data Mining (ICDM). 170--179. https://doi.org/10. 1109/ICDM.2019.00027
[11]
Matheus Campos Fernandes, Thiago Ferreira Cov?es, and Andr?? Luiz Vizine Pereira. 2020. Improving evolutionary constrained clustering using Active Learning. Knowledge-Based Systems 209 (2020), 106452. https://doi.org/10.1016/j. knosys.2020.106452
[12]
Kamran Ghasedi, Xiaoqian Wang, Cheng Deng, and Heng Huang. 2019. Balanced Self-Paced Learning for Generative Adversarial Clustering Network. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[13]
Xifeng Guo, Long Gao, Xinwang Liu, and Jianping Yin. 2017. Improved Deep Embedded Clustering with Local Structure Preservation. In Proceedings of the 26th International Joint Conference on Artificial Intelligence (Melbourne, Australia) (IJCAI'17). AAAI Press, 1753?C1759.
[14]
Xifeng Guo, Xinwang Liu, En Zhu, Xinzhong Zhu, Miaomiao Li, Xin Xu, and Jianping Yin. 2020. Adaptive Self-Paced Deep Clustering with Data Augmentation. IEEE Transactions on Knowledge and Data Engineering 32, 9 (2020), 1680--1693. https://doi.org/10.1109/TKDE.2019.2911833
[15]
Xifeng Guo, En Zhu, Xinwang Liu, and Jianping Yin. 2018. Deep Embedded Clustering with Data Augmentation. In Proceedings of The 10th Asian Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 95), Jun Zhu and Ichiro Takeuchi (Eds.). PMLR, 550--565. https://proceedings.mlr.press/ v95/guo18b.html
[16]
Kai Han, Andrea Vedaldi, and Andrew Zisserman. 2019. Learning to Discover Novel Visual Categories via Deep Transfer Clustering. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV). 8400--8408. https://doi. org/10.1109/ICCV.2019.00849
[17]
Sebastian Houben, Johannes Stallkamp, Jan Salmen, Marc Schlipsing, and Christian Igel. 2013. Detection of traffic signs in real-world images: The German Traffic Sign Detection Benchmark. In The 2013 international joint conference on neural networks (IJCNN). 1--8. https://doi.org/10.1109/IJCNN.2013.6706807
[18]
Jonathan J. Hull. 1994. A database for handwritten text recognition research. IEEE Transactions on pattern analysis and machine intelligence 16, 5 (1994), 550--554. https://doi.org/10.1109/34.291440
[19]
Wenhao Jiang, Wei Liu, and Fu lai Chung. 2018. Knowledge transfer for spectral clustering. Pattern Recognition 81 (2018), 484--496. https://doi.org/10.1016/j. patcog.2018.04.018
[20]
Yangbangyan Jiang, Zhiyong Yang, Qianqian Xu, Xiaochun Cao, and Qingming Huang. 2018. When to Learn What: Deep Cognitive Subspace Clustering. In Proceedings of the 26th ACM International Conference on Multimedia (Seoul, Republic of Korea) (MM '18). Association for Computing Machinery, New York, NY, USA, 718?C726. https://doi.org/10.1145/3240508.3240582
[21]
Zhuxi Jiang, Yin Zheng, Huachun Tan, Bangsheng Tang, and Hanning Zhou. 2017. Variational Deep Embedding: An Unsupervised and Generative Approach to Clustering. In Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, IJCAI-17. 1965--1972. https://doi.org/10.24963/ijcai. 2017/273
[22]
Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
[23]
Yann LeCun, Léon Bottou, Yoshua Bengio, and Patrick Haffner. 1998. Gradientbased learning applied to document recognition. Proc. IEEE 86, 11 (1998), 2278-- 2324. https://doi.org/10.1109/5.726791
[24]
Collin Leiber, Lena GM Bauer, Benjamin Schelling, Christian Böhm, and Claudia Plant. 2021. Dip-based deep embedded clustering with k-estimation. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining (KDD '21, 11). Association for Computing Machinery, New York, NY, USA, 903--913. https://doi.org/10.1145/3447548.3467316
[25]
David D Lewis, Yiming Yang, Tony Russell-Rose, and Fan Li. 2004. Rcv1: A new benchmark collection for text categorization research. Journal of machine learning research 5, Apr (2004), 361--397.
[26]
Hongyu Li, Lefei Zhang, and Kehua Su. 2023. Dual Mutual Information Constraints for Discriminative Clustering. Proceedings of the AAAI Conference on Artificial Intelligence 37, 7 (Jun. 2023), 8571--8579. https://doi.org/10.1609/aaai. v37i7.26032
[27]
Yunfan Li, Peng Hu, Zitao Liu, Dezhong Peng, Joey Tianyi Zhou, and Xi Peng. 2021. Contrastive Clustering. Proceedings of the AAAI Conference on Artificial Intelligence 35, 10 (May 2021), 8547--8555. https://doi.org/10.1609/aaai.v35i10. 17037
[28]
Xin Ma and Won Hwa Kim. 2022. Locally Normalized Soft Contrastive Clustering for Compact Clusters. In Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, IJCAI-22, Lud De Raedt (Ed.). International Joint Conferences on Artificial Intelligence Organization, 3313--3320. https://doi.org/10.24963/ijcai.2022/460 Main Track.
[29]
Daniel P. M. de Mello, Renato M. Assun??o, and Fabricio Murai. 2022. Top-Down Deep Clustering with Multi-Generator GANs. IEEE Transactions on Neural Networks and Learning Systems 36 (Jun 2022), 7770--7778. https://doi.org/10. 1609/aaai.v36i7.20745
[30]
Hankui Peng and Nicos G. Pavlidis. 2019. Subspace Clustering with Active Learning. In 2019 IEEE International Conference on Big Data (Big Data). 135-- 144. https://doi.org/10.1109/BigData47090.2019.9006361
[31]
Alex Rodriguez and Alessandro Laio. 2014. Clustering by fast search and find of density peaks. Science 344, 6191 (2014), 1492--1496. https://doi.org/10.1126/ science.1242072
[32]
Meitar Ronen, Shahaf E Finder, and Oren Freifeld. 2022. DeepDPM: Deep Clustering With an Unknown Number of Clusters. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 9861--9870.
[33]
Mohammadreza Sadeghi and Narges Armanfard. 2021. IDECF: Improved Deep Embedding Clustering With Deep Fuzzy Supervision. In 2021 IEEE International Conference on Image Processing (ICIP). 1009--1013. https://doi.org/10.1109/ ICIP42928.2021.9506051
[34]
Tian Tian, Jie Zhang, Xiang Lin, Zhi Wei, and Hakon Hakonarson. 2021. Modelbased deep embedding for constrained clustering analysis of single cell RNA-seq data. Nature communications 12, 1 (2021), 1873. https://doi.org/10.1038/s41467- 021--22008--3
[35]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, ? ukasz Kaiser, and Illia Polosukhin. 2017. Attention is All you Need. In Advances in Neural Information Processing Systems, I. Guyon, U. Von Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (Eds.), Vol. 30. Curran Associates, Inc. https://proceedings.neurips.cc/paper/ 2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf
[36]
Xin Wang, Yudong Chen, and Wenwu Zhu. 2022. A Survey on Curriculum Learning. IEEE Transactions on Pattern Analysis and Machine Intelligence 44, 9 (2022), 4555--4576. https://doi.org/10.1109/TPAMI.2021.3069908
[37]
Lior Wolf, Tal Hassner, and Itay Maoz. 2011. Face recognition in unconstrained videos with matched background similarity. In CVPR 2011. IEEE, 529--534. https: //doi.org/10.1109/CVPR.2011.5995566
[38]
Han Xiao, Kashif Rasul, and Roland Vollgraf. 2017. Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms. https://arxiv. org/abs/1708.07747
[39]
Junyuan Xie, Ross Girshick, and Ali Farhadi. 2016. Unsupervised Deep Embedding for Clustering Analysis. In Proceedings of The 33rd International Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 48), Maria Florina Balcan and Kilian Q. Weinberger (Eds.). PMLR, New York, New York, USA, 478--487. https://proceedings.mlr.press/v48/xieb16.html
[40]
Lin Yang, Wentao Fan, and Nizar Bouguila. 2022. Clustering analysis via deep generative models with mixture models. IEEE Transactions on Neural Networks and Learning Systems 33, 1 (2022), 340--350. https://doi.org/10.1109/TNNLS.2020.3027761

Index Terms

  1. Deep Image Clustering Based on Curriculum Learning and Density Information

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    ICMR '24: Proceedings of the 2024 International Conference on Multimedia Retrieval
    May 2024
    1379 pages
    ISBN:9798400706196
    DOI:10.1145/3652583
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 07 June 2024

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. clustering assignment
    2. curriculum learning
    3. deep clustering
    4. density information
    5. learning pace

    Qualifiers

    • Research-article

    Funding Sources

    • Shenzhen Fundamental Research Fund
    • Guangdong Provincial Key Laboratory of Novel Security Intelligence Technologies

    Conference

    ICMR '24
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 254 of 830 submissions, 31%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 98
      Total Downloads
    • Downloads (Last 12 months)98
    • Downloads (Last 6 weeks)8
    Reflects downloads up to 24 Jan 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media