skip to main content
10.1007/978-3-642-34707-8_36guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Advance missing data processing for collaborative filtering

Published: 28 November 2012 Publication History

Abstract

Memory-based collaborative filtering (CF) is widely used in the recommendation system based on the similar users or items. But all of these approaches suffer from data sparsity. In many cases, the user-item matrix is quite sparse, which directly leads to inaccurate recommend results. This paper focuses the memory-based collaborative filtering problem on the factor: missing data processing. We propose an advance missing data processing includes two steps: (1) using enhanced CHARM algorithm for mining closed subsets --- group of users that share interest in some items, (2) using adjusted Slope One algorithm base on subsets for utilizing not only information of both users and items but also information that fall neither in the user array nor in the item array. After that, we use Pearson Correlation Coefficient algorithm for predicting rating for active user. Finally, the empirical evaluation results reveal that the proposed approach outperforms other state-of-the-art CF algorithms and it is more robust against data sparsity.

References

[1]
Ansari, A., Essegaier, S., Kohli, R.: Internet Recommendations Systems. J. Marketing Research, 363-375 (2000)
[2]
Sarwar, B. M., Karypis, G., Konstan, J. A., Riedl, J.: Item-based collaborative filtering recommender algorithms. In: WWW10 (2001)
[3]
Lemire, D.: Scale and translation invariant collaborative filtering systems. Information Retrieval 8(1), 129-150 (2005)
[4]
Sarwar, B., Karypis, G., Konstan, J., Riedl, J.: Application of Dimensionality Reduction in Recommender Systems--A Case Study. In: Proc. ACM WebKDD Workshop (2000)
[5]
Basu, C., Hirsh, H., Cohen, W.: Recommendation as Classification: Using Social and Content-Based Information in Recommendation, Recommender Systems. Papers from 1998 Workshop, Technical Report WS 98-08. AAAI Press (1998)
[6]
Aggarwal, C.C., Wolf, J. L., Wu, K.-L., Yu, P. S.: Horting Hatches an Egg: A New Graph-Theoretic Approach to Collaborative Filtering. In: Proc. Fifth ACM SIGKDD Int'l Conf. Knowledge Discovery and Data Mining (August 1999)
[7]
Billsus, D., Pazzani, M.: Learning Collaborative Information Filters. In: Proc. Int'l Conf. Machine Learning (1998)
[8]
Pennock, D. M., Horvitz, E., Lawrence, S., Giles, C. L.: Collaborative filtering by personality diagnosis: A hybrid memory- and model-based approach. In: Proc. of UAI (2000)
[9]
Adomavicius, G., Tuzhilin, A.: Expert-Driven Validation of Rule-Based User Models in Personalization Applications. Data Mining and Knowledge Discovery 5(1 and 2), 33-58 (2001)
[10]
Adomavicius, G., Tuzhilin, A.: Multidimensional Recommender Systems: A Data Warehousing Approach. In: Fiege, L., Mühl, G., Wilhelm, U. G. (eds.) WELCOM 2001. LNCS, vol. 2232, pp. 180-192. Springer, Heidelberg (2001)
[11]
Adomavicius, G., Sankaranarayanan, R., Sen, S., Tuzhilin, A.: Incorporating Contextual Information in Recommender Systems Using a Multidimensional Approach. ACM Trans. Information Systems 23(1) (January 2005)
[12]
Linden, G., Smith, B., York, J.: Amazon.com recommendations: Item-to-item collaborative filtering. IEEE Internet Computing, 76-80 (January/February 2003)
[13]
Xue, G.-R., Lin, C., Yang, Q., Xi, W., Zeng, H.-J., Yu, Y., Chen, Z.: Scalable collaborative filtering using cluster-based smoothing. In: Proc. of SIGIR 2005 (2005)
[14]
Herlocker, J. L., Konstan, J. A., Borchers, A., Riedl, J.: An algorithmic framework for performing collaborative filtering. In: Proc of SIGIR (1999)
[15]
Breese, J. S., Heckerman, D., Kadie, C.: Empirical analysis of predictive algorithms for collaborative filtering. In: Proc. of UAI (1998)
[16]
Ghazanfar, M., Prugel-Bennett, A.: An Improved Switching Hybrid Recommender System Using Naive Bayes Classifier and Collaborative filtering. IAENG International Journal of Computer Science, 37 (2010)
[17]
Armstrong, J. S.: Principles of Forecasting--A Handbook for Researchers and Practitioners. Kluwer Academic (2001)
[18]
Canny, J.: Collaborative Filtering With Privacy Via Factor Analysis. In: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 238-245 (2002)
[19]
Deshpande, M., Karypis, G.: Item-based top-n recommendation. ACM Trans. Inf. Syst. 22(1), 143-177 (2004)
[20]
Buhmann, M. D.: Approximation and Interpolation with Radial Functions. In: Dyn, N., Leviatan, D., Levin, D., Pinkus, A. (eds.) Multivariate Approximation and Applications. Cambridge Univ. Press (2001)
[21]
Belkin, N., Croft, B.: Information Filtering and Information Retrieval. Comm. ACM 35(12), 29-37 (1992)
[22]
Resnick, P., Iacovou, N., Suchak, M., Bergstrom, P., Riedl, J.: Grouplens: An open architecture for collaborative filtering of net news. In: Proc. of ACM Conference on Computer Supported Cooperative Work (1994)
[23]
Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. Addison-Wesley (1999)
[24]
Jin, R., Chai, J.Y., Si, L.: An automatic weighting scheme for collaborative filtering. In: Proc. of SIGIR (2004)
[25]
Huang, Z., Chen, H., Zeng, D.: Applying Associative Retrieval Techniques to Alleviate the Sparsity Problem in Collaborative Filtering. ACM Trans. Information Systems 22(1), 116-142 (2004)
[26]
Zaki, M. J., Hsiao, C.: CHARM: An Efficient Algorithm for Closed. Itemset Mining. In: SDM 2002 (2002)
[27]
Lemire, D., Maclachlan, A.: Slope one predictors for online rating-based collaborative filtering. In: SIAM Data Mining (2005)

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
ICCCI'12: Proceedings of the 4th international conference on Computational Collective Intelligence: technologies and applications - Volume Part II
November 2012
561 pages
ISBN:9783642347061
  • Editors:
  • Ngoc-Thanh Nguyen,
  • Kiem Hoang,
  • Piotr Jędrzejowicz

Sponsors

  • Inha University: Inha University
  • Hue Univ.: Hue University
  • National Foundation for Science and Technology Development: National Foundation for Science and Technology Development

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 28 November 2012

Author Tags

  1. collaborative filtering
  2. missing data
  3. pearson correlation coefficient
  4. recommender
  5. slope one
  6. sparsity

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 0
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 29 Jan 2025

Other Metrics

Citations

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media