DOI: 10.1145/1367497.1367627
research-article

Hidden sentiment association in Chinese web opinion mining

Published: 21 April 2008

Abstract

The boom of product review websites, blogs and forums on the web has attracted many research efforts on opinion mining. Recently, there has been growing interest in finer-grained opinion mining, which detects opinions on individual review features rather than on the review as a whole. Research on feature-level opinion mining mainly relies on identifying the explicit relatedness between product feature words and opinion words in reviews. However, the sentiment relatedness between the two kinds of objects is usually complicated. In many cases, product feature words are only implied by the opinion words in reviews. Detecting such hidden sentiment association is still a big challenge in opinion mining. Feature-level opinion mining is even harder on Chinese reviews because of the nature of the Chinese language. In this paper, we propose a novel mutual reinforcement approach to the feature-level opinion mining problem. More specifically, 1) the approach clusters product features and opinion words simultaneously and iteratively by fusing both their content information and their sentiment link information; 2) under the same framework, based on the resulting product feature categories and opinion word groups, we construct the sentiment association set between the two groups of data objects by identifying their strongest n sentiment links. Moreover, knowledge from multiple sources is incorporated to enhance clustering in the procedure. Based on the pre-constructed association set, our approach can largely predict opinions relating to different product features, even when product feature words do not appear explicitly in a review. It thus provides a more accurate opinion evaluation. The experimental results demonstrate that our method outperforms state-of-the-art algorithms.
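
The second step of the abstract, building an association set from the strongest n sentiment links, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the abstract does not say how link strength is measured, so the sketch simply counts co-occurrences, and the cluster assignments are taken as given instead of being produced by the iterative mutual reinforcement clustering. The function name `strongest_links` and the toy data are hypothetical and do not come from the paper.

```python
from collections import Counter


def strongest_links(pairs, feature_cluster, opinion_group, n):
    """Return the n most frequent (feature category, opinion group) links.

    pairs: iterable of (feature_word, opinion_word) co-occurrences.
    feature_cluster / opinion_group: dicts mapping a word to its cluster label.
    Link strength is a raw co-occurrence count (an assumption, see lead-in).
    """
    counts = Counter()
    for feature, opinion in pairs:
        # Skip words that were not assigned to any cluster.
        if feature in feature_cluster and opinion in opinion_group:
            counts[(feature_cluster[feature], opinion_group[opinion])] += 1
    # most_common returns links sorted by count, largest first.
    return [link for link, _ in counts.most_common(n)]


# Hypothetical toy data, not taken from the paper.
feature_cluster = {"price": "cost", "fee": "cost", "screen": "display"}
opinion_group = {"expensive": "costly", "pricey": "costly", "bright": "vivid"}
pairs = [
    ("price", "expensive"),
    ("fee", "pricey"),
    ("price", "pricey"),
    ("screen", "bright"),
    ("screen", "bright"),
    ("screen", "expensive"),
]

association_set = strongest_links(pairs, feature_cluster, opinion_group, n=2)

# A review that says only "pricey" names no feature word; the association
# set suggests which feature category the opinion most likely refers to.
implied = [f for f, o in association_set if o == opinion_group["pricey"]]
print(association_set, implied)
```

In this toy setting the weak link between the display category and the "costly" group is dropped, so an opinion word from the "costly" group appearing without a feature word is attributed to the cost category only.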

    Published In

    WWW '08: Proceedings of the 17th international conference on World Wide Web
    April 2008
    1326 pages
    ISBN:9781605580852
    DOI:10.1145/1367497

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. association
    2. mutual reinforcement
    3. opinion mining
    4. opinion word
    5. product feature

    Qualifiers

    • Research-article

    Conference

    WWW '08

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%
