research-article

Public Access

Data-Driven Contextual Valence Shifter Quantification for Multi-Theme Sentiment Analysis

Authors:

Malu Castellanos,

Jiawei HanAuthors Info & Claims

CIKM '16: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management

Pages 939 - 948

https://doi.org/10.1145/2983323.2983793

Published: 24 October 2016 Publication History

Abstract

Users often write reviews on different themes involving linguistic structures with complex sentiments. The sentiment polarity of a word can be different across themes. Moreover, contextual valence shifters may change sentiment polarity depending on the contexts that they appear in. Both challenges cannot be modeled effectively and explicitly in traditional sentiment analysis. Studying both phenomena requires multi-theme sentiment analysis at the word level, which is very interesting but significantly more challenging than overall polarity classification. To simultaneously resolve the multi-theme and sentiment shifting problems, we propose a data-driven framework to enable both capabilities: (1) polarity predictions of the same word in reviews of different themes, and (2) discovery and quantification of contextual valence shifters. The framework formulates multi-theme sentiment by factorizing the review sentiments with theme/word embeddings and then derives the shifter effect learning problem as a logistic regression. The improvement of sentiment polarity classification accuracy demonstrates not only the importance of multi-theme and sentiment shifting, but also effectiveness of our framework. Human evaluations and case studies further show the success of multi-theme word sentiment predictions and automatic effect quantification of contextual valence shifters.

References

[1]

D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. J. Mach. Learn. Res., 3:993--1022, Mar. 2003.

[2]

N. Boubel, T. François, H. Naets, and I. Cental. Automatic extraction of contextual valence shifters. In RANLP, pages 98--104, 2013.

[3]

A. M. Dai and Q. V. Le. Semi-supervised sequence learning. In NIPS, pages 3079--3087. 2015.

Digital Library

[4]

J. Han, J. Pei, and Y. Yin. Mining frequent patterns without candidate generation. In SIGMOD, pages 1--12, 2000.

Digital Library

[5]

D. Ikeda, H. Takamura, L.-A. Ratinov, and M. Okumura. Learning to shift the polarity of words for sentiment classification. In IJCNLP, pages 296--303, 2008.

[6]

L. Jia, C. Yu, and W. Meng. The effect of negation on sentiment analysis and retrieval effectiveness. In CIKM, pages 1827--1830, 2009.

Digital Library

[7]

Y. Jo and A. H. Oh. Aspect and sentiment unification model for online review analysis. In WSDM, pages 815--824, 2011.

Digital Library

[8]

N. Kalchbrenner, E. Grefenstette, and P. Blunsom. A convolutional neural network for modelling sentences. In ACL, pages 655--665, June 2014.

[9]

A. Kennedy and D. Inkpen. Sentiment classification of movie reviews using contextual valence shifters. Computational intelligence, 22(2):110--125, 2006.

[10]

W. Kessler and H. Schütze. Classification of inconsistent sentiment words using syntactic constructions. In COLING, pages 569--578, 2012.

[11]

Y. Kim. Convolutional neural networks for sentence classification. In EMNLP, pages 1746--1751, Doha, Qatar, October 2014.

[12]

Q. V. Le and T. Mikolov. Distributed representations of sentences and documents. In ICML, pages 1188--1196, 2014.

Digital Library

[13]

S. Li, S. Y. M. Lee, Y. Chen, C.-R. Huang, and G. Zhou. Sentiment classification and polarity shifting. In COLING, pages 635--643, 2010.

Digital Library

[14]

C. Lin and Y. He. Joint sentiment/topic model for sentiment analysis. In CIKM, pages 375--384, 2009.

Digital Library

[15]

B. Liu. Sentiment analysis and opinion mining. Synthesis lectures on human language technologies, 5(1):1--167, 2012.

Digital Library

[16]

J. Liu, X. Ren, J. Shang, T. Cassidy, C. R. Voss, and J. Han. Representing documents via latent keyphrase inference. In WWW, pages 1057--1067, 2016.

Digital Library

[17]

Y. Lu, M. Castellanos, U. Dayal, and C. Zhai. Automatic construction of a context-aware sentiment lexicon: An optimization approach. In WWW, pages 347--356, 2011.

Digital Library

[18]

A. L. Maas, R. E. Daly, P. T. Pham, D. Huang, A. Y. Ng, and C. Potts. Learning word vectors for sentiment analysis. In ACL, pages 142--150, 2011.

Digital Library

[19]

A. K. McCallum. Mallet: A machine learning for language toolkit. http://mallet.cs.umass.edu, 2002.

[20]

Q. Mei, X. Ling, M. Wondra, H. Su, and C. Zhai. Topic sentiment mixture: Modeling facets and opinions in weblogs. In WWW, pages 171--180, 2007.

Digital Library

[21]

T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean. Distributed representations of words and phrases and their compositionality. In NIPS, pages 3111--3119. 2013.

Digital Library

[22]

J. Nocedal. Updating quasi-newton matrices with limited storage. Mathematics of computation, 35(151):773--782, 1980.

[23]

B. Pang and L. Lee. Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales. In ACL, pages 115--124, 2005.

Digital Library

[24]

B. Pang, L. Lee, and S. Vaithyanathan. Thumbs up?: Sentiment classification using machine learning techniques. In EMNLP, pages 79--86, 2002.

Digital Library

[25]

L. Polanyi and A. Zaenen. Contextual valence shifters. In Computing Attitude and Affect in Text: Theory and Applications, volume 20 of The Information Retrieval Series, pages 1--10. 2006.

[26]

M. Pontiki, D. Galanis, J. Pavlopoulos, H. Papageorgiou, I. Androutsopoulos, and S. Manandhar. Semeval-2014 task 4: Aspect based sentiment analysis. In Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014), pages 27--35, Dublin, Ireland, August 2014.

[27]

J. Shang, T. Chen, H. Li, Z. Lu, and Y. Yu. A parallel and efficient algorithm for learning to match. In ICDM, pages 971--976. IEEE, 2014.

Digital Library

[28]

B. Sharrack, R. A. Hughes, S. Soudain, and G. Dunn. The psychometric properties of clinical rating scales used in multiple sclerosis. Brain, 122(1):141--159, 1999.

[29]

R. Socher, B. Huval, C. D. Manning, and A. Y. Ng. Semantic compositionality through recursive matrix-vector spaces. In EMNLP, pages 1201--1211, 2012.

Digital Library

[30]

R. Socher, A. Perelygin, J. Y. Wu, J. Chuang, C. D. Manning, A. Y. Ng, and C. Potts. Recursive deep models for semantic compositionality over a sentiment treebank. In EMNLP, volume 1631, page 1642, 2013.

[31]

K. S. Tai, R. Socher, and C. D. Manning. Improved semantic representations from tree-structured long short-term memory networks. In ACL, pages 1556--1566, Beijing, China, July 2015.

[32]

I. Titov and R. T. McDonald. A joint model of text and aspect ratings for sentiment summarization. In ACL, pages 308--316, 2008.

[33]

H. Wang, Y. Lu, and C. Zhai. Latent aspect rating analysis on review text data: A rating regression approach. In SIGKDD, pages 783--792, 2010.

Digital Library

[34]

H. Wang, Y. Lu, and C. Zhai. Latent aspect rating analysis without aspect keyword supervision. In SIGKDD, pages 618--626, 2011.

Digital Library

[35]

S. Wang and C. D. Manning. Baselines and bigrams: Simple, good sentiment and topic classification. In ACL, pages 90--94, 2012.

Digital Library

[36]

J. Wiebe, T. Wilson, and C. Cardie. Annotating expressions of opinions and emotions in language. Language resources and evaluation, 39(2--3):165--210, 2005.

[37]

M. Wiegand, A. Balahur, B. Roth, D. Klakow, and A. Montoyo. A survey on the role of negation in sentiment analysis. In NeSp-NLP, pages 60--68, 2010.

Digital Library

[38]

Y. Wu and M. Ester. Flame: A probabilistic model combining aspect based opinion mining and collaborative filtering. In WSDM, pages 199--208, New York, NY, USA, 2015.

Digital Library

[39]

X. Yan, J. Guo, Y. Lan, and X. Cheng. A biterm topic model for short texts. In WWW, pages 1445--1456, 2013.

Digital Library

[40]

H. Zou and T. Hastie. Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 67(2):301--320, 2005.

Cited By

Debnath REbanks DMohaddes KRoulet TAlvarez R(2023)Do fossil fuel firms reframe online climate and sustainability communication? A data-driven analysisnpj Climate Action10.1038/s44168-023-00086-x2:1Online publication date: 18-Dec-2023
https://doi.org/10.1038/s44168-023-00086-x
Singh AJenamani MThakkar JDwivedi Y(2022)A Text Analytics Framework for Performance Assessment and Weakness Detection From Online ReviewsJournal of Global Information Management10.4018/JGIM.30406930:8(1-26)Online publication date: 28-Jul-2022
https://doi.org/10.4018/JGIM.304069
Singh AJenamani MThakkar JRana N(2022)Quantifying the effect of eWOM embedded consumer perceptions on sales: An integrated aspect-level sentiment analysis and panel data modeling approachJournal of Business Research10.1016/j.jbusres.2021.08.060138(52-64)Online publication date: Jan-2022
https://doi.org/10.1016/j.jbusres.2021.08.060
Show More Cited By

Index Terms

Data-Driven Contextual Valence Shifter Quantification for Multi-Theme Sentiment Analysis
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
2. Information systems
  1. Information retrieval
    1. Retrieval tasks and goals
      1. Sentiment analysis
  2. Information systems applications
    1. Data mining
      1. Collaborative filtering

Recommendations

Joint sentiment/topic model for sentiment analysis
CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge management

Sentiment analysis or opinion mining aims to use automated tools to detect subjective information such as opinions, attitudes, and feelings expressed in text. This paper proposes a novel probabilistic modeling framework based on Latent Dirichlet ...
Contextual semantics for sentiment analysis of Twitter

We propose a semantic sentiment representation of words called SentiCircle.SentiCircle captures the contextual semantic of words from their co-occurrences.SentiCircle updates the sentiment of words based on their contextual semantics.SentiCircle can be ...
Domain-specific sentiment analysis using contextual feature generation
TSA '09: Proceedings of the 1st international CIKM workshop on Topic-sentiment analysis for mass opinion

This paper presents a novel framework for sentiment analysis, which exploits sentiment topic information for generating context-driven features. Since the domain-specific nature of sentiment classification led the task more problematic, considering more ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

CIKM '16: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management

October 2016

2566 pages

ISBN:9781450340731

DOI:10.1145/2983323

General Chairs:
Snehasis Mukhopadhyay
Indiana University Purdue University Indianapolis, USA
,
ChengXiang Zhai
University of Illinois at Urbana-Champaign, USA
,
Program Chairs:
Elisa Bertino
Purdue University
,
Fabio Crestani
University of Lugano
,
Javed Mostafa
University of North Carolina
,
Jie Tang
Tsinghua University
,
Luo Si
Alibaba Group Inc & Purdue University
,
Xiaofang Zhou
University of Queensland
,
Yi Chang
Yahoo Research
,
Yunyao Li
IBM Research - Almaden
,
Parikshit Sondhi
WalmartLabs

Copyright © 2016 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 October 2016

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Conference

CIKM'16

Sponsor:

CIKM'16: ACM Conference on Information and Knowledge Management

October 24 - 28, 2016

Indiana, Indianapolis, USA

Acceptance Rates

CIKM '16 Paper Acceptance Rate 160 of 701 submissions, 23%;

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Sponsor:
sigir
sigir

The 34th ACM International Conference on Information and Knowledge Management

November 10 - 14, 2025

Seoul , Republic of Korea

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

6
Total Citations
View Citations
751
Total Downloads

Downloads (Last 12 months)114
Downloads (Last 6 weeks)18

Reflects downloads up to 08 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Debnath REbanks DMohaddes KRoulet TAlvarez R(2023)Do fossil fuel firms reframe online climate and sustainability communication? A data-driven analysisnpj Climate Action10.1038/s44168-023-00086-x2:1Online publication date: 18-Dec-2023
https://doi.org/10.1038/s44168-023-00086-x
Singh AJenamani MThakkar JDwivedi Y(2022)A Text Analytics Framework for Performance Assessment and Weakness Detection From Online ReviewsJournal of Global Information Management10.4018/JGIM.30406930:8(1-26)Online publication date: 28-Jul-2022
https://doi.org/10.4018/JGIM.304069
Singh AJenamani MThakkar JRana N(2022)Quantifying the effect of eWOM embedded consumer perceptions on sales: An integrated aspect-level sentiment analysis and panel data modeling approachJournal of Business Research10.1016/j.jbusres.2021.08.060138(52-64)Online publication date: Jan-2022
https://doi.org/10.1016/j.jbusres.2021.08.060
Ayeste ZNoferesti S(2021)A semantic approach based on domain knowledge for polarity shift detection using distant supervisionProgress in Artificial Intelligence10.1007/s13748-021-00267-x11:2(169-180)Online publication date: 23-Nov-2021
https://doi.org/10.1007/s13748-021-00267-x
Wanganga GQu Y(2020)A Deep Learning based Customer Sentiment Analysis Model to Enhance Customer Retention and Loyalty in the Payment Industry2020 International Conference on Computational Science and Computational Intelligence (CSCI)10.1109/CSCI51800.2020.00086(473-478)Online publication date: Dec-2020
https://doi.org/10.1109/CSCI51800.2020.00086
Rahimi ZNoferesti SShamsfard M(2019)Applying data mining and machine learning techniques for sentiment shifter identificationLanguage Resources and Evaluation10.1007/s10579-018-9432-053:2(279-302)Online publication date: 1-Jun-2019
https://dl.acm.org/doi/10.1007/s10579-018-9432-0

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Figures

Tables

Media

View Table of Conten