skip to main content
10.1145/1651461.1651473acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Beyond the stars: exploiting free-text user reviews to improve the accuracy of movie recommendations

Published: 06 November 2009 Publication History

Abstract

In this paper we show that the extraction of opinions from free-text reviews can improve the accuracy of movie recommendations. We present three approaches to extract movie aspects as opinion targets and use them as features for the collaborative filtering. Each of these approaches requires different amounts of manual interaction. We collected a data set of reviews with corresponding ordinal (star) ratings of several thousand movies to evaluate the different features for the collaborative filtering. We employ a state-of-the-art collaborative filtering engine for the recommendations during our evaluation and compare the performance with and without using the features representing user preferences mined from the free-text reviews provided by the users. The opinion mining based features perform significantly better than the baseline, which is based on star ratings and genre information only.

References

[1]
S. Aciar, D. Zhang, S. Simoff, and J. Debenham. Informed recommender: Basing recommendations on consumer product reviews. IEEE Intelligent Systems, 22(3):39--47, May/June 2007.
[2]
N. Bhatti and S. H. Weber. Handbook of Research on Social Dimensions of Semantic Technologies and Web Services, chapter Semantic Visualization to Support Knowledge Discovery in Multi-Relational Service Communities, pages 281--303. IGI Global, 2009.
[3]
S. Blair-Goldensohn, K. Hannan, R. McDonald, T. Neylon, G. Reis, and J. Reynar. Building a sentiment summarizer for local service reviews. In Proceedings of the WWW2008 Workshop: NLP in the Information Explosion Era, Beijing, China, April 2008.
[4]
D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. Journal of Machine Learning Research, 3:993--1022, March 2003.
[5]
J. S. Breese, D. Heckerman, and C. Kadie. Empirical analysis of predictive algorithms for collaborative filtering. In Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence, pages 43--52, Madison, WI, USA, 1998.
[6]
X. Cheng and F. Xu. Fine-grained opinion topic and polarity identification. In Proceedings of LREC, pages 2710--2714, Marrekech, Morocco, May 2008.
[7]
T. Fawcett. Roc graphs: Notes and practical considerations for researchers. Technical report, HP Laboratories, 2004.
[8]
L. Ferreira, N. Jakob, and I. Gurevych. A comparative study of feature extraction algorithms in customer reviews. In Proceedings of ICSC, pages 144--151, Santa Clara, CA, USA, August 2008.
[9]
E. Gabrilovich and S. Markovitch. Computing semantic relatedness using wikipedia-based explicit semantic analysis. In Proceedings of IJCAI, pages 1606--1611, Hyderabad, India, January 2007.
[10]
M. Gamon, A. Aue, S. Corston-Oliver, and E. Ringger. Pulse: Mining customer opinions from free text. In Advances in Intelligent Data Analysis VI, volume 3646 of Lecture Notes in Computer Science, pages 121--132. Springer Berlin, August 2005.
[11]
J. L. Herlocker, J. A. Konstan, L. G. Terveen, John, and T. Riedl. Evaluating collaborative filtering recommender systems. ACM Transactions on Information Systems, 22:5--53, 2004.
[12]
M. Hu and B. Liu. Mining and summarizing customer reviews. In Proceedings of KDD-04, pages 168--177, Seattle, WA, USA, August 2004.
[13]
S.-M. Kim and E. Hovy. Extracting opinions, opinion holders, and topics expressed in online news media text. In Proceedings of the ACL Workshop on Sentiment and Subjectivity in Text, pages 1--8, Sydney, Australia, July 2006.
[14]
C. Lippert, S. H. Weber, Y. Huang, V. Tresp, M. Schubert, and H.-P. Kriegel. Relation-prediction in multi-relational domains using matrix-factorization. In Proceedings of the NIPS 2008 Workshop: Structured Input - Structured Output, Vancouver, Canada, December 2008.
[15]
B. Long, X. Wu, Z. M. Zhang, and P. S. Yu. Unsupervised learning on k-partite graphs. In Proceedings of the KDD 2006, pages 317--326, Philadelphia, PA, USA, 2006.
[16]
B. Long, Z. M. Zhang, X. Wu, and P. S. Yu. Spectral clustering for multi-type relational data. In Proceedings of the ICML 2006, pages 585--592, Pittsburgh, PA, USA, 2006.
[17]
Y. Lu and C. Zhai. Opinion integration through semi-supervised topic modeling. In Proceedings of WWW'08, pages 121--130, Beijing, China, April 2008.
[18]
A.-M. Popescu and O. Etzioni. Extracting product features and opinions from reviews. In Proceedings of HLT and EMNLP, pages 339--346, Vancouver, Canada, October 2005.
[19]
J. B. Schafer, D. Frankowski, J. L. Herlocker, and S. Sen. Collaborative filtering recommender systems. In The Adaptive Web, volume 4321 of Lecture Notes in Computer Science, pages 291--324. Springer, 2007.
[20]
N. Srebro, J. D. M. Rennie, and T. S. Jaakkola. Maximum-margin matrix factorization. In Advances in Neural Information Processing Systems 17, pages 1329--1336, 2005.
[21]
G. Takacs, I. Pilaszy, B. Nemeth, and D. Tikk. On the gravity recommendation system. In Proceedings of KDD Cup Workshop at SIGKDD'07, 13th ACM Int. Conf. on Knowledge Discovery and Data Mining, pages 22--30, San Jose, CA, USA, 2007.
[22]
I. Titov and R. McDonald. Modeling online reviews with multi-grain topic models. In Proceedings of WWW ’08, pages 111--120, Beijing, China, April 2008.
[23]
T. Wilson, J. Wiebe, and P. Hoffmann. Recognizing contextual polarity in phrase-level sentiment analysis. In Proceedings of HLT and EMNLP, pages 347--354, Vancouver, Canada, October 2005.
[24]
J. Yi, T. Nasukawa, R. Bunescu, and W. Niblack. Sentiment analyzer: Extracting sentiments about a given topic using natural language processing techniques. In Proceedings of ICDM, pages 427--434, Melbourne, FL, USA, December 2003.
[25]
K. Yu, S. Yu, and V. Tresp. Multi-label informed latent semantic indexing. In SIGIR'05: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, pages 258--265, Salvador, Brazil, 2005. ACM Press.
[26]
L. Zhuang, F. Jing, and X.-Y. Zhu. Movie review mining and summarization. In Proceedings of CIKM, pages 43--50, Arlington, VA, USA, November 2006.

Cited By

View all

Index Terms

  1. Beyond the stars: exploiting free-text user reviews to improve the accuracy of movie recommendations
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image ACM Conferences
          TSA '09: Proceedings of the 1st international CIKM workshop on Topic-sentiment analysis for mass opinion
          November 2009
          94 pages
          ISBN:9781605588056
          DOI:10.1145/1651461
          • General Chairs:
          • Maojin Jiang,
          • Bei Yu,
          • Program Chair:
          • Bei Yu
          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Sponsors

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          Published: 06 November 2009

          Permissions

          Request permissions for this article.

          Check for updates

          Author Tags

          1. collaborative filtering
          2. multi-relational learning
          3. opinion mining
          4. recommendation system
          5. sentiment analysis

          Qualifiers

          • Research-article

          Conference

          CIKM '09
          Sponsor:

          Upcoming Conference

          CIKM '25

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Downloads (Last 12 months)38
          • Downloads (Last 6 weeks)2
          Reflects downloads up to 25 Jan 2025

          Other Metrics

          Citations

          Cited By

          View all

          View Options

          Login options

          View options

          PDF

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          Figures

          Tables

          Media

          Share

          Share

          Share this Publication link

          Share on social media