skip to main content
10.1145/2983323.2983793acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article
Public Access

Data-Driven Contextual Valence Shifter Quantification for Multi-Theme Sentiment Analysis

Published: 24 October 2016 Publication History

Abstract

Users often write reviews on different themes involving linguistic structures with complex sentiments. The sentiment polarity of a word can be different across themes. Moreover, contextual valence shifters may change sentiment polarity depending on the contexts that they appear in. Both challenges cannot be modeled effectively and explicitly in traditional sentiment analysis. Studying both phenomena requires multi-theme sentiment analysis at the word level, which is very interesting but significantly more challenging than overall polarity classification. To simultaneously resolve the multi-theme and sentiment shifting problems, we propose a data-driven framework to enable both capabilities: (1) polarity predictions of the same word in reviews of different themes, and (2) discovery and quantification of contextual valence shifters. The framework formulates multi-theme sentiment by factorizing the review sentiments with theme/word embeddings and then derives the shifter effect learning problem as a logistic regression. The improvement of sentiment polarity classification accuracy demonstrates not only the importance of multi-theme and sentiment shifting, but also effectiveness of our framework. Human evaluations and case studies further show the success of multi-theme word sentiment predictions and automatic effect quantification of contextual valence shifters.

References

[1]
D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. J. Mach. Learn. Res., 3:993--1022, Mar. 2003.
[2]
N. Boubel, T. François, H. Naets, and I. Cental. Automatic extraction of contextual valence shifters. In RANLP, pages 98--104, 2013.
[3]
A. M. Dai and Q. V. Le. Semi-supervised sequence learning. In NIPS, pages 3079--3087. 2015.
[4]
J. Han, J. Pei, and Y. Yin. Mining frequent patterns without candidate generation. In SIGMOD, pages 1--12, 2000.
[5]
D. Ikeda, H. Takamura, L.-A. Ratinov, and M. Okumura. Learning to shift the polarity of words for sentiment classification. In IJCNLP, pages 296--303, 2008.
[6]
L. Jia, C. Yu, and W. Meng. The effect of negation on sentiment analysis and retrieval effectiveness. In CIKM, pages 1827--1830, 2009.
[7]
Y. Jo and A. H. Oh. Aspect and sentiment unification model for online review analysis. In WSDM, pages 815--824, 2011.
[8]
N. Kalchbrenner, E. Grefenstette, and P. Blunsom. A convolutional neural network for modelling sentences. In ACL, pages 655--665, June 2014.
[9]
A. Kennedy and D. Inkpen. Sentiment classification of movie reviews using contextual valence shifters. Computational intelligence, 22(2):110--125, 2006.
[10]
W. Kessler and H. Schütze. Classification of inconsistent sentiment words using syntactic constructions. In COLING, pages 569--578, 2012.
[11]
Y. Kim. Convolutional neural networks for sentence classification. In EMNLP, pages 1746--1751, Doha, Qatar, October 2014.
[12]
Q. V. Le and T. Mikolov. Distributed representations of sentences and documents. In ICML, pages 1188--1196, 2014.
[13]
S. Li, S. Y. M. Lee, Y. Chen, C.-R. Huang, and G. Zhou. Sentiment classification and polarity shifting. In COLING, pages 635--643, 2010.
[14]
C. Lin and Y. He. Joint sentiment/topic model for sentiment analysis. In CIKM, pages 375--384, 2009.
[15]
B. Liu. Sentiment analysis and opinion mining. Synthesis lectures on human language technologies, 5(1):1--167, 2012.
[16]
J. Liu, X. Ren, J. Shang, T. Cassidy, C. R. Voss, and J. Han. Representing documents via latent keyphrase inference. In WWW, pages 1057--1067, 2016.
[17]
Y. Lu, M. Castellanos, U. Dayal, and C. Zhai. Automatic construction of a context-aware sentiment lexicon: An optimization approach. In WWW, pages 347--356, 2011.
[18]
A. L. Maas, R. E. Daly, P. T. Pham, D. Huang, A. Y. Ng, and C. Potts. Learning word vectors for sentiment analysis. In ACL, pages 142--150, 2011.
[19]
A. K. McCallum. Mallet: A machine learning for language toolkit. http://mallet.cs.umass.edu, 2002.
[20]
Q. Mei, X. Ling, M. Wondra, H. Su, and C. Zhai. Topic sentiment mixture: Modeling facets and opinions in weblogs. In WWW, pages 171--180, 2007.
[21]
T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean. Distributed representations of words and phrases and their compositionality. In NIPS, pages 3111--3119. 2013.
[22]
J. Nocedal. Updating quasi-newton matrices with limited storage. Mathematics of computation, 35(151):773--782, 1980.
[23]
B. Pang and L. Lee. Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales. In ACL, pages 115--124, 2005.
[24]
B. Pang, L. Lee, and S. Vaithyanathan. Thumbs up?: Sentiment classification using machine learning techniques. In EMNLP, pages 79--86, 2002.
[25]
L. Polanyi and A. Zaenen. Contextual valence shifters. In Computing Attitude and Affect in Text: Theory and Applications, volume 20 of The Information Retrieval Series, pages 1--10. 2006.
[26]
M. Pontiki, D. Galanis, J. Pavlopoulos, H. Papageorgiou, I. Androutsopoulos, and S. Manandhar. Semeval-2014 task 4: Aspect based sentiment analysis. In Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014), pages 27--35, Dublin, Ireland, August 2014.
[27]
J. Shang, T. Chen, H. Li, Z. Lu, and Y. Yu. A parallel and efficient algorithm for learning to match. In ICDM, pages 971--976. IEEE, 2014.
[28]
B. Sharrack, R. A. Hughes, S. Soudain, and G. Dunn. The psychometric properties of clinical rating scales used in multiple sclerosis. Brain, 122(1):141--159, 1999.
[29]
R. Socher, B. Huval, C. D. Manning, and A. Y. Ng. Semantic compositionality through recursive matrix-vector spaces. In EMNLP, pages 1201--1211, 2012.
[30]
R. Socher, A. Perelygin, J. Y. Wu, J. Chuang, C. D. Manning, A. Y. Ng, and C. Potts. Recursive deep models for semantic compositionality over a sentiment treebank. In EMNLP, volume 1631, page 1642, 2013.
[31]
K. S. Tai, R. Socher, and C. D. Manning. Improved semantic representations from tree-structured long short-term memory networks. In ACL, pages 1556--1566, Beijing, China, July 2015.
[32]
I. Titov and R. T. McDonald. A joint model of text and aspect ratings for sentiment summarization. In ACL, pages 308--316, 2008.
[33]
H. Wang, Y. Lu, and C. Zhai. Latent aspect rating analysis on review text data: A rating regression approach. In SIGKDD, pages 783--792, 2010.
[34]
H. Wang, Y. Lu, and C. Zhai. Latent aspect rating analysis without aspect keyword supervision. In SIGKDD, pages 618--626, 2011.
[35]
S. Wang and C. D. Manning. Baselines and bigrams: Simple, good sentiment and topic classification. In ACL, pages 90--94, 2012.
[36]
J. Wiebe, T. Wilson, and C. Cardie. Annotating expressions of opinions and emotions in language. Language resources and evaluation, 39(2--3):165--210, 2005.
[37]
M. Wiegand, A. Balahur, B. Roth, D. Klakow, and A. Montoyo. A survey on the role of negation in sentiment analysis. In NeSp-NLP, pages 60--68, 2010.
[38]
Y. Wu and M. Ester. Flame: A probabilistic model combining aspect based opinion mining and collaborative filtering. In WSDM, pages 199--208, New York, NY, USA, 2015.
[39]
X. Yan, J. Guo, Y. Lan, and X. Cheng. A biterm topic model for short texts. In WWW, pages 1445--1456, 2013.
[40]
H. Zou and T. Hastie. Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 67(2):301--320, 2005.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '16: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management
October 2016
2566 pages
ISBN:9781450340731
DOI:10.1145/2983323
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 October 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. multi-theme
  2. sentiment analysis
  3. sentiment shifting

Qualifiers

  • Research-article

Funding Sources

Conference

CIKM'16
Sponsor:
CIKM'16: ACM Conference on Information and Knowledge Management
October 24 - 28, 2016
Indiana, Indianapolis, USA

Acceptance Rates

CIKM '16 Paper Acceptance Rate 160 of 701 submissions, 23%;
Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)114
  • Downloads (Last 6 weeks)18
Reflects downloads up to 08 Feb 2025

Other Metrics

Citations

Cited By

View all

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media