skip to main content
research-article

Modeling information-seeker satisfaction in community question answering

Published: 21 April 2009 Publication History

Abstract

Question Answering Communities such as Naver, Baidu Knows, and Yahoo! Answers have emerged as popular, and often effective, means of information seeking on the web. By posting questions for other participants to answer, information seekers can obtain specific answers to their questions. Users of CQA portals have already contributed millions of questions, and received hundreds of millions of answers from other participants. However, CQA is not always effective: in some cases, a user may obtain a perfect answer within minutes, and in others it may require hours—and sometimes days—until a satisfactory answer is contributed. We investigate the problem of predicting information seeker satisfaction in collaborative question answering communities, where we attempt to predict whether a question author will be satisfied with the answers submitted by the community participants. We present a general prediction model, and develop a variety of content, structure, and community-focused features for this task. Our experimental results, obtained from a large-scale evaluation over thousands of real questions and user ratings, demonstrate the feasibility of modeling and predicting asker satisfaction. We complement our results with a thorough investigation of the interactions and information seeking patterns in question answering communities that correlate with information seeker satisfaction. We also explore personalized models of asker satisfaction, and show that when sufficient interaction history exists, personalization can significantly improve prediction accuracy over a “one-size-fits-all” model. Our models and predictions could be useful for a variety of applications, such as user intent inference, answer ranking, interface design, and query suggestion and routing.

Supplementary Material

Agichtein Appendix (a10-agichtein-apndx.pdf)
Online appendix to modeling information-seeker satisfaction in community question answering. The appendix supports the information on article 10.

References

[1]
Agarwal, N., Liu, H., Tang, L., and Yu, P. S. 2008. Identifying the influential bloggers in a community. In Proceedings of 1st ACM International Conference on Web Search and Data Mining (WSDM). 207--218.
[2]
Agichtein, E., Brill, E., Dumais, S., and Ragno, R. 2006. Learning user interaction models for predicting web search result preferences. In Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR).
[3]
Agichtein, E., Castillo, C., Donato, D., Gionis, A., and Mishne, G. 2008. Finding high-quality content in social media. In Proceedings of 1st ACM International Conference on Web Search and Data Mining (WSDM). 183--194.
[4]
Belkin, N., Oddy, R. N., and Brooks, H. M. 1982. Information retrieval: Part ii. results of a design study. J. Documen. 38, 3, 145--164.
[5]
Belkin, N. J. 1997. User modeling in information retrieval. In Proceedings of the 6th International Conference on User Modelling (UM'97).
[6]
Brill, E., Dumais, S., and Banko, M. 2002. An analysis of the askmsr question-answering system. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP).
[7]
Cutrell, E. and Guan, Z. 2007. Eye tracking in msn search: Investigating snippet length, target position and task types. MSR-TR-2007-01.
[8]
Dang, H. T., Kelly, D., and Lin, J. 2007. Overview of the TREC 2007 question answering track. In Proceedings of the 16th Text Retrieval Conference (TREC).
[9]
Demner-Fushman, D. and Lin, J. 2007. Answering clinical questions with knowledge-based and statistical techniques. Comput. Linguist. 33, 1, 63--103.
[10]
Downey, D., Dumais, S. T., and Horvitz, E. 2007. Models of searching and browsing: Languages, studies, and applications. In Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI).
[11]
Freund, Y. and Schapire, R. 1996. Experiments with a new boosting algorithm. In Proceedings of the 13th International Conference on Machine Learning (ICML).
[12]
Harter, S. P. and Hert, C. A. 1997. Evaluation of information retrieval systems: Approaches, issues, and methods. Annual Review of Information Science and Technology (ARIST) 32, 3--94.
[13]
Jeon, J., Croft, W., and Lee, J. 2005. Finding similar questions in large question and answer archives. In Proceedings of the 14th ACM International Conference on Information and Knowledge Management (CIKM).
[14]
Jeon, J., Croft, W., Lee, J., and Park, S. 2006. A framework to predict the quality of answers with non-textual features. In Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR).
[15]
Joachims, T., Granka, L., Pan, B., Hembrooke, H., Radlinski, F., and Gay, G. 2007. Evaluating the accuracy of implicit feedback from clicks and query reformulations in web search. ACM Trans. Inf. Syst. 25, 2.
[16]
Kobayashi, M. and Takeda, K. 2000. Information retrieval on the web. ACM Compu. Surv. 32, 2, 144--173.
[17]
Lin, J. and Demner-Fushman, D. 2006. Methods for automatically evaluating answers to complex questions. Inform. Retrie. 9, 5, 565--587.
[18]
Lin, J. and Zhang, P. 2007. Deconstructing nuggets: the stability and reliability of complex question answering evaluation. In Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR). 327--334.
[19]
Liu, Y. and Agichtein, E. 2008a. On the evolution of the yahoo! answers qa community. In Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR). 737--738.
[20]
Liu, Y. and Agichtein, E. 2008b. You've got answers: Towards personalized models for predicting success in community question answering. In Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics (ACL). 97--100.
[21]
Liu, Y., Bian, J., and Agichtein, E. 2008. Predicting information seeker satisfaction in community question answering. In Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR).
[22]
Platt, J. C. 1998. Fast training of support vector machines using sequential minimal optimization. Advances in Kernal Methods—Support Vector Learning, 185--208.
[23]
Quinlan, J. 1996. Improved use of continuous attributes in c4.5. In J. Artif. Intell. Resear.
[24]
Rose, D. E. and Levinson, D. 2004. Understanding user goals in web search. In Proceedings of the 13th International Conference on World Wide Web (WWW).
[25]
Ruthven, I., Glasgow, L. A., Baillie, M., Bierig, R., Nicol, E., Sweeney, S., and Yakici, M. 2007. Intra-assessor consistency in question answering. In Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR). 727--728.
[26]
Soricut, R. and Brill, E. 2004. Automatic question answering: Beyond the factoid. In Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL).
[27]
Su, Q., Pavlov, D., Chow, J., and Baker, W. 2007. Internet-scale collection of human-reviewed data. In Proceedings of the 16th International Conference on World Wide Web (WWW).
[28]
Voorhees, E. M. 2001. The philosophy of information retrieval evaluation. In Proceedings of the Workshop of the Cross-Language Evaluation Forum (CLEF).
[29]
Voorhees, E. M. 2003. Overview of the TREC 2003 question answering track. In Proceedings of the 12th Text Retrieval Conference (TREC).
[30]
White, R., Bilenko, M., and Cucerzan, S. 2007. Studying the use of popular destinations to enhance web search interaction. In Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR).
[31]
White, R. W. and Drucker, S. M. 2007. Investigating behavioral variability in web search. In Proceedings of the 16th International Conference on World Wide Web (WWW).
[32]
Witten, I. and Frank, E. 2005. Data Mining: Practical Machine Learning Tools and Techniques, 2nd ed. Morgan Kaufman.
[33]
Zobel, J. 1998. How reliable are the results of large-scale information retrieval experiments? In Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR). 307--314.

Cited By

View all

Index Terms

  1. Modeling information-seeker satisfaction in community question answering

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Transactions on Knowledge Discovery from Data
        ACM Transactions on Knowledge Discovery from Data  Volume 3, Issue 2
        April 2009
        111 pages
        ISSN:1556-4681
        EISSN:1556-472X
        DOI:10.1145/1514888
        Issue’s Table of Contents
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 21 April 2009
        Accepted: 01 January 2009
        Revised: 01 January 2009
        Received: 01 September 2008
        Published in TKDD Volume 3, Issue 2

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. Community question answering
        2. information seeker satisfaction

        Qualifiers

        • Research-article
        • Research
        • Refereed

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)23
        • Downloads (Last 6 weeks)5
        Reflects downloads up to 06 Jan 2025

        Other Metrics

        Citations

        Cited By

        View all

        View Options

        Login options

        Full Access

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media