Article

Free access

Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews

Author:

Peter D. TurneyAuthors Info & Claims

ACL '02: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics

Pages 417 - 424

https://doi.org/10.3115/1073083.1073153

Published: 06 July 2002 Publication History

PDF eReader

Abstract

This paper presents a simple unsupervised learning algorithm for classifying reviews as recommended (thumbs up) or not recommended (thumbs down). The classification of a review is predicted by the average semantic orientation of the phrases in the review that contain adjectives or adverbs. A phrase has a positive semantic orientation when it has good associations (e.g., "subtle nuances") and a negative semantic orientation when it has bad associations (e.g., "very cavalier"). In this paper, the semantic orientation of a phrase is calculated as the mutual information between the given phrase and the word "excellent" minus the mutual information between the given phrase and the word "poor". A review is classified as recommended if the average semantic orientation of its phrases is positive. The algorithm achieves an average accuracy of 74% when evaluated on 410 reviews from Epinions, sampled from four different domains (reviews of automobiles, banks, movies, and travel destinations). The accuracy ranges from 84% for automobile reviews to 66% for movie reviews.

References

[1]

Agresti, A. 1996. An introduction to categorical data analysis. New York: Wiley.]]

Google Scholar

[2]

Brill, E. 1994. Some advances in transformation-based part of speech tagging. Proceedings of the Twelfth National Conference on Artificial Intelligence (pp. 722--727). Menlo Park, CA: AAAI Press.]]

Digital Library

Google Scholar

[3]

Church, K. W., & Hanks, P. 1989. Word association norms, mutual information and lexicography. Proceedings of the 27th Annual Conference of the ACL (pp. 76--83). New Brunswick, NJ: ACL.]]

Digital Library

Google Scholar

[4]

Frank, E., & Hall, M. 2001. A simple approach to ordinal classification. Proceedings of the Twelfth European Conference on Machine Learning (pp. 145--156). Berlin: Springer-Verlag.]]

Digital Library

Google Scholar

[5]

Hatzivassiloglou, V., & McKeown, K. R. 1997. Predicting the semantic orientation of adjectives. Proceedings of the 35th Annual Meeting of the ACL and the 8th Conference of the European Chapter of the ACL (pp. 174--181). New Brunswick, NJ: ACL.]]

Digital Library

Google Scholar

[6]

Hatzivassiloglou, V., & Wiebe, J. M. 2000. Effects of adjective orientation and gradability on sentence subjectivity. Proceedings of 18th International Conference on Computational Linguistics. New Brunswick, NJ: ACL.]]

Digital Library

Google Scholar

[7]

Hearst, M. A. 1992. Direction-based text interpretation as an information access refinement. In P. Jacobs (Ed.), Text-Based Intelligent Systems: Current Research and Practice in Information Extraction and Retrieval. Mahwah, NJ: Lawrence Erlbaum Associates.]]

Digital Library

Google Scholar

[8]

Landauer, T. K., & Dumais, S. T. 1997. A solution to Plato's problem: The latent semantic analysis theory of the acquisition, induction, and representation of knowledge. Psychological Review, 104, 211--240.]]

Crossref

Google Scholar

[9]

Santorini, B. 1995. Part-of-Speech Tagging Guidelines for the Penn Treebank Project (3rd revision, 2nd printing). Technical Report, Department of Computer and Information Science, University of Pennsylvania.]]

Google Scholar

[10]

Spertus, E. 1997. Smokey: Automatic recognition of hostile messages. Proceedings of the Conference on Innovative Applications of Artificial Intelligence (pp. 1058--1065). Menlo Park, CA: AAAI Press.]]

Digital Library

Google Scholar

[11]

Tong, R. M. 2001. An operational system for detecting and tracking opinions in on-line discussions. Working Notes of the ACM SIGIR 2001 Workshop on Operational Text Classification (pp. 1--6). New York, NY: ACM.]]

Google Scholar

[12]

Turney, P. D. 2001. Mining the Web for synonyms: PMI-IR versus LSA on TOEFL. Proceedings of the Twelfth European Conference on Machine Learning (pp. 491--502). Berlin: Springer-Verlag.]]

Digital Library

Google Scholar

[13]

Wiebe, J. M. 2000. Learning subjective adjectives from corpora. Proceedings of the 17th National Conference on Artificial Intelligence. Menlo Park, CA: AAAI Press.]]

Digital Library

Google Scholar

[14]

Wiebe, J. M., Bruce, R., Bell, M., Martin, M., & Wilson, T. 2001. A corpus study of evaluative and speculative language. Proceedings of the Second ACL SIG on Dialogue Workshop on Discourse and Dialogue. Aalborg, Denmark.]]

Digital Library

Google Scholar

Cited By

View all

Babina O(2024)Topic Modeling for Mining Opinion Aspects from a Customer Feedback CorpusAutomatic Documentation and Mathematical Linguistics10.3103/S000510552401006058:1(63-79)Online publication date: 2-Apr-2024
https://dl.acm.org/doi/10.3103/S0005105524010060
Rajabiyazdi FRamesh SLangstone BKulik DPontalba J(2024)TextVista: NLP-Enriched Time-Series Text Data VisualizationsProceedings of the 50th Graphics Interface Conference10.1145/3670947.3670971(1-12)Online publication date: 3-Jun-2024
https://dl.acm.org/doi/10.1145/3670947.3670971
Ramzy MIbrahim B(2024)User satisfaction with Arabic COVID-19 appsInformation Processing and Management: an International Journal10.1016/j.ipm.2024.10364461:3Online publication date: 2-Jul-2024
https://dl.acm.org/doi/10.1016/j.ipm.2024.103644
Show More Cited By

Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews
1. Computing methodologies
  1. Artificial intelligence
2. Hardware
  1. Power and energy
    1. Power estimation and optimization

Recommendations

Thumbs up?: sentiment classification using machine learning techniques
EMNLP '02: Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10

We consider the problem of classifying documents not by topic, but by overall sentiment, e.g., determining whether a review is positive or negative. Using movie reviews as data, we find that standard machine learning techniques definitively outperform ...
Hindi Word Sense Disambiguation Using Lesk Approach on Bigram and Trigram Words
AICTC '16: Proceedings of the International Conference on Advances in Information Communication Technology & Computing

Word Sense Disambiguation (WSD) is a vital task which provides the definition of particular words according to their sense or according to given context. Lesk algorithm is originally based on the gloss overlap that can be observed as the measure, ...
SemEval-2010 task 3: cross-lingual word sense disambiguation
SEW '09: Proceedings of the Workshop on Semantic Evaluations: Recent Achievements and Future Directions

We propose a multilingual unsupervised Word Sense Disambiguation (WSD) task for a sample of English nouns. Instead of providing manually sensetagged examples for each sense of a polysemous noun, our sense inventory is built up on the basis of the ...

Comments

Information & Contributors

Information

Published In

ACL '02: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics

July 2002

543 pages

General Chair:
Pierre Isabelle

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 06 July 2002

Qualifiers

Article

Acceptance Rates

Overall Acceptance Rate 85 of 443 submissions, 19%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

712
Total Citations
View Citations
11,117
Total Downloads

Downloads (Last 12 months)231
Downloads (Last 6 weeks)35

Reflects downloads up to 16 Oct 2024

Other Metrics

View Author Metrics

Citations

Cited By

View all

Babina O(2024)Topic Modeling for Mining Opinion Aspects from a Customer Feedback CorpusAutomatic Documentation and Mathematical Linguistics10.3103/S000510552401006058:1(63-79)Online publication date: 2-Apr-2024
https://dl.acm.org/doi/10.3103/S0005105524010060
Rajabiyazdi FRamesh SLangstone BKulik DPontalba J(2024)TextVista: NLP-Enriched Time-Series Text Data VisualizationsProceedings of the 50th Graphics Interface Conference10.1145/3670947.3670971(1-12)Online publication date: 3-Jun-2024
https://dl.acm.org/doi/10.1145/3670947.3670971
Ramzy MIbrahim B(2024)User satisfaction with Arabic COVID-19 appsInformation Processing and Management: an International Journal10.1016/j.ipm.2024.10364461:3Online publication date: 2-Jul-2024
https://dl.acm.org/doi/10.1016/j.ipm.2024.103644
Das RSingh T(2023)Multimodal Sentiment Analysis: A Survey of Methods, Trends, and ChallengesACM Computing Surveys10.1145/358607555:13s(1-38)Online publication date: 13-Jul-2023
https://dl.acm.org/doi/10.1145/3586075
Alqahtani YAl-Twairesh NAlsanad A(2023)Improving sentiment domain adaptation for Arabic using an unsupervised self-labeling frameworkInformation Processing and Management: an International Journal10.1016/j.ipm.2023.10333860:3Online publication date: 1-May-2023
https://dl.acm.org/doi/10.1016/j.ipm.2023.103338
Bhardwaj AVeeramani DZhou S(2023)Identifying equipment health status from maintenance records using Lexicon based Unsupervised Sentiment Analysis Adjusted for Negation (LUSAA-N)Computers and Industrial Engineering10.1016/j.cie.2023.109693186:COnline publication date: 1-Dec-2023
https://dl.acm.org/doi/10.1016/j.cie.2023.109693
Chen BJiang LPan XZhou GSun ALi D(2023)Exploring Emotion Trends in Product Reviews: A Multi-modal Analysis with Malicious Comment Filtering and User Privacy ProtectionInformation Security and Cryptology10.1007/978-981-97-0942-7_19(379-396)Online publication date: 9-Dec-2023
https://dl.acm.org/doi/10.1007/978-981-97-0942-7_19
Tran TDinh HPhan TBalas V(2022)Building an enhanced sentiment classification framework based on natural language processingJournal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology10.3233/JIFS-21927843:2(1771-1777)Online publication date: 1-Jan-2022
https://dl.acm.org/doi/10.3233/JIFS-219278
Chebolu SRosso PKar SSolorio T(2022)Survey on Aspect Category DetectionACM Computing Surveys10.1145/354455755:7(1-37)Online publication date: 15-Dec-2022
https://dl.acm.org/doi/10.1145/3544557
P JBalakrishnan K(2022)Text-to-Speech Synthesis: Literature Review with an Emphasis on Malayalam LanguageACM Transactions on Asian and Low-Resource Language Information Processing10.1145/350139721:4(1-56)Online publication date: 19-Jan-2022
https://dl.acm.org/doi/10.1145/3501397
Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Abstract

References

Cited By

Recommendations

Thumbs up?: sentiment classification using machine learning techniques

Hindi Word Sense Disambiguation Using Lesk Approach on Bigram and Trigram Words

SemEval-2010 task 3: cross-lingual word sense disambiguation

Comments

Information

Published In

Publisher

Publication History

Qualifiers

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

View options

PDF

eReader

Get Access

Login options

Full Access

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations