skip to main content
10.1109/FUZZ-IEEE.2017.8015560guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
research-article

A new iterative fuzzy clustering algorithm for multiple imputation of missing data

Published: 09 July 2017 Publication History

Abstract

This paper proposes a new iterative fuzzy clustering (IFC) algorithm to impute missing values of datasets. The information provided by fuzzy clustering is used to update the imputed values through iterations. The performance of the IFC algorithm is examined by conducting experiments on three commonly used datasets and a case study on a city mobility database. Experimental results show that the IFC algorithm not only works well for datasets with a small number of missing values but also provides an effective imputation result for datasets where the proportion of missing data is high.

References

[1]
M. Sarkar, and T.Y. Leong, “Fuzzy K-means clustering with missing values”, Proceedings of the AMIA Symposium, pp. 588–592, 2001.
[2]
D. Li, J. Deogun, W. Spaulding, and B. Shuart, “Towards Missing Data Imputation: A study of fuzzy K-means clustering method”, in Rough Sets and Current Trends in Computing, vol. 3066, S. Tsumoto et al. Eds. Berlin: Springer-Verlag, 2004, pp. 573–579.
[3]
R.J.A. Little, and D.B. Rubin, Statistical analysis with missing data. New York, US: John Wiley & Sons, 1987.
[4]
I. Myrtveit, E. Stensrud, and E. Olsson, “Analyzing data sets with missing data: an empirical evaluation of imputation methods and likelihood-based methods”, IEEE Transactions on Software Engineering, vol. 27, pp. 999–1013, 2001.
[5]
J. Luengo, S. Garcia, and F. Herrara, “On the choice of the best imputation methods for missing values considering three groups of classification methods”, Knowledge and Information Systems, vol. 32, pp. 77–108, 2012.
[6]
M. Sato-Ilic, and L.C. Jain, Innovations in fuzzy clustering: theory and applications. Berlin Heidelberg: Springer-Verlag, 2006.
[7]
S. Panda, S. Sahu, P. Jena, and S. Chattopadhyay, “Comparing Fuzzy-C Means and K-Means Clustering Techniques: A Comprehensive Study”, in Advances in Computer Science, Engineering & Applications, vol. 166, D.C. Wyld et al. Eds. Berlin: Springer-Verlag, 2012, pp. 451–460.
[8]
R. Mikaeil, S.S. Haghshenas, S.S. Haghshenas, and M. Ataei, “Performance prediction of circular saw machine using imperialist competitive algorithm and fuzzy clustering technique”, Neural Computing and Applications, in press.
[9]
M.B. Ferraro, and P. Giordani, “A toolbox for fuzzy clustering using the R programming language”, Fuzzy Sets and Systems, vol. 279, pp. 1–16, 2015.
[10]
UITP, Mobility in cities database. Brussels: International Association of Public Transport, 2015.
[11]
R.J.G.B. Campello, and E.R. Hruschka, “A fuzzy extension of the silhouette width criterion for cluster analysis”, Fuzzy Sets and Systems, vol. 157, pp. 2858–2875, 2006.
[12]
D. Albalate, and G. Bel, “Tourism and urban public transport: Holding demand pressure under supply constraints”, Tourism Management, vol. 31, pp. 425–433, 2010.
[13]
D. Albalate, and G. Bel, “What shapes local public transportation in Europe? Economics, mobility, institutions, and geography”, Transportation Research Part E: Logistics and Transportation Review, vol. 46, pp. 775–790, 2010.
[14]
J. Abonyi, and B. Feil, “Aggregation and visualization of fuzzy clusters based on fuzzy similarity measures”, in Advances in Fuzzy Clustering and its applications, vol. 166, J. Valente De Oliveira, and W. Pedrycz Eds. Chichester: John Wiley and sons, 2007, pp. 95–121.
[15]
J.C. Bezdek, and R.J. Hathaway, “Visual cluster validity (VCV) displays for prototype generator clustering methods”, The 12th IEEE International Conference on Fuzzy Systems, Vol. 2, pp. 875–880, 2003.
[16]
C. M. Musil, C. B. Warner, P. K. Yobas and S. L. Jones, “A comparison of imputation techniques for handling missing data”, Western Journal of Nursing Research, vol 24, pp. 815–829, 2002.
[17]
J. Tian, B. Yu, D. Yu and S. Ma, “Missing data analyses: a hybrid multiple imputation algorithm using Gray System Theory and entropy based on clustering”, Applied Intelligence, vol 40, pp. 376–388, 2014.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
2017 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE)
Jul 2017
2327 pages

Publisher

IEEE Press

Publication History

Published: 09 July 2017

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 01 Feb 2025

Other Metrics

Citations

Cited By

View all

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media