skip to main content
10.1145/3697355.3697405acmotherconferencesArticle/Chapter ViewAbstractPublication PagesbdiotConference Proceedingsconference-collections
research-article

I-WOA: An Optimization of K-Means Clusteringsis

Published: 12 December 2024 Publication History

Abstract

As the big data technology is widely used in various research fields, cluster analysis has been generally regarded as one of the most effective methods for processing different types of data. In response to the shortcomings of the K-means algorithm, such as sensitivity to the initial clustering points and a propensity to fall into local optima, an Improved Whale Optimization Algorithm (I-WOA) is proposed to optimize the K-means clustering algorithm. In the proposed I-WOA algorithm, the Sobol sequence is introduced to initialize the whale population, thus enriching the diversity of the whale population. An elite reverse learning strategy is also employed to improve the position updating method of the optimal whale in the whale population, hence enhancing the quality of the solution. Additionally, the convergence factor(\(a\)) and contraction probability (\(p\)) in the whale algorithm are adjusted to balance global and local searches, therefore improving convergence speed and optimization precision. Finally, the I-WOA algorithm is evaluated by six famous benchmark functions. The experimental results show that the proposed algorithm has better cluster quality and global search capability compared to traditional clustering algorithms, and outperforms some popular algorithms.

References

[1]
Qiao S, Han N, Zhang K, et al. Algorithm for detecting overlapping communities from complex network big data[J]. Journal of Software, 2017, 28(3): 631-647.
[2]
Swarndeep Saket, J., and Sharnil Pandya. "An overview of partitioning algorithms in clustering techniques." International Journal of Advanced Research in Computer Engineering & Technology (IJARCET) 5.6 (2016): 1943-1946.
[3]
Murtagh, Fionn, and Pedro Contreras. "Algorithms for hierarchical clustering: an overview." Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 2.1 (2012): 86-97.
[4]
Bhattacharjee, Panthadeep, and Pinaki Mitra. "A survey of density based clustering algorithms." Frontiers of Computer Science 15 (2021): 1-27.
[5]
Cheng, Wei, Wei Wang, and Sandra Batista. "Grid-based clustering." Data clustering. Chapman and Hall/CRC, 2018. 128-148.
[6]
Gormley, Isobel Claire, Thomas Brendan Murphy, and Adrian E. Raftery. "Model-based clustering." Annual Review of Statistics and Its Application 10 (2023): 573-595.
[7]
MacQueen J. Some methods for classification and analysis of multivariate observations[C]//Proceedings of the fifth Berkeley symposium on mathematical statistics and probability. 1967, 1(14): 281-297.
[8]
Hassan Z F, Al-Shareefi F, Gheni H Q. A coloured image watermarking based on genetic k-means clustering methodology[J]. Journal of Advances in Information Technology, 2023, 14(2).
[9]
Huang, Cheng, et al. "A hybrid Aquila optimizer and its K-means clustering optimization." Transactions of the Institute of Measurement and Control 45.3 (2023): 557-572.
[10]
Lou, Taishan, et al. "A Hybrid K-means Method based on Modified Rat Swarm Optimization Algorithm for Data Clustering." (2024).
[11]
Long Wen, Cai Shaohong, Jiao Jianjun, et al. Improved Whale Optimization Algorithm for Solving Large-Scale Optimization Problems[J]. Systems Engineering Theory & Practice, 2017, 37(11): 2983-2994.
[12]
Wang Jianhao, Zhang Liang, Shi Chao, et al. Whale Optimization Algorithm Based on Chaos Search Strategy [J]. Control and Decision, 2019, 34(9): 1893-1900.
[13]
Kong Zhi, Yang Qingfeng, Zhao Jie, et al. Whale Optimization Algorithm Based on Adaptive Adjustment of Weight and Search Strategy [J]. Journal of Northeastern University (Natural Science Edition), 2020, 41(1): 35.
[14]
Li Andong, Liu Sheng. Improved Whale Optimization Algorithm with Hybrid Strategy [J]. Application Research of Computers, 2022, 39(5): 1415-1421.
[15]
Joe, Stephen, and Frances Y. Kuo. "Constructing Sobol sequences with better two-dimensional projections." SIAM Journal on Scientific Computing 30.5 (2008): 2635-2654.
[16]
Mao, Q., & Zhang, Q. (2021). Improved Sparrow Algorithm with Fusion of Cauchy Mutation and Reverse Learning. Journal of Frontiers of Computer Science & Technology, 15(6).
[17]
He, X., Zhang, G., Chen, Y., et al. (2021). WOA-SVM Multi-classification Algorithm with Fusion of Lévy Flight and Elite Reverse Learning. Application Research of Computers, 38(12).
[18]
Trojovský, Pavel, and Mohammad Dehghani. "A new bio-inspired metaheuristic algorithm for solving optimization problems based on walruses behavior." Scientific Reports 13.1 (2023): 8775.
[19]
Zhou, C., et al. (2003). Particle Swarm Optimization Algorithm. Application Research of Computers, 20(12), 7-11.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
BDIOT '24: Proceedings of the 2024 8th International Conference on Big Data and Internet of Things
September 2024
412 pages
ISBN:9798400717529
DOI:10.1145/3697355
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 December 2024

Check for updates

Author Tags

  1. I-WOA
  2. clustering
  3. k-means
  4. optimization

Qualifiers

  • Research-article

Conference

BDIOT 2024

Acceptance Rates

Overall Acceptance Rate 75 of 136 submissions, 55%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 17
    Total Downloads
  • Downloads (Last 12 months)17
  • Downloads (Last 6 weeks)17
Reflects downloads up to 13 Jan 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

Full Text

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media