DOI: 10.1145/1518701.1518986
research-article

Amplifying community content creation with mixed initiative information extraction

Published: 04 April 2009

Abstract

Although existing work has explored both information extraction and community content creation, most research has examined them in isolation. In contrast, we see the greatest leverage in pairing these methods as two interlocking feedback cycles. This paper explores the synergy that results when these cycles accelerate each other, exploiting the same edits to advance both community content creation and learning-based information extraction. We examine this synergy in the context of Wikipedia infoboxes and the Kylin information extraction system. After developing and refining a set of interfaces that present the verification of Kylin extractions as a non-primary task in the context of Wikipedia articles, we develop an innovative use of Web search advertising services to study people engaged in some other primary task. We demonstrate the synergy by analyzing our deployment from two complementary perspectives: (1) we show that we accelerate community content creation by using Kylin's information extraction to significantly increase the likelihood that a person visiting a Wikipedia article as part of some other primary task will spontaneously choose to help improve the article's infobox, and (2) we show that we accelerate information extraction by using contributions collected from people interacting with our designs to significantly improve Kylin's extraction performance.



    Published In

    CHI '09: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
    April 2009
    2426 pages
    ISBN:9781605582467
    DOI:10.1145/1518701
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. community content creation
    2. information extraction
    3. mixed-initiative interfaces

    Qualifiers

    • Research-article

    Conference

    CHI '09

    Acceptance Rates

    CHI '09 Paper Acceptance Rate 277 of 1,130 submissions, 25%;
    Overall Acceptance Rate 6,199 of 26,314 submissions, 24%
