skip to main content
research-article

Learning to Recommend Descriptive Tags for Questions in Social Forums

Published: 01 January 2014 Publication History

Abstract

Around 40% of the questions in the emerging social-oriented question answering forums have at most one manually labeled tag, which is caused by incomprehensive question understanding or informal tagging behaviors. The incompleteness of question tags severely hinders all the tag-based manipulations, such as feeds for topic-followers, ontological knowledge organization, and other basic statistics. This article presents a novel scheme that is able to comprehensively learn descriptive tags for each question. Extensive evaluations on a representative real-world dataset demonstrate that our scheme yields significant gains for question annotation, and more importantly, the whole process of our approach is unsupervised and can be extended to handle large-scale data.

References

[1]
Sameer Agarwal, Kristin Branson, and Serge Belongie. 2006. Higher order learning with graphs. In Proceedings of the International Conference on Machine Learning.
[2]
Morgan Ames and Mor Naaman. 2007. Why we tag: Motivations for annotation in mobile and online media. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems.
[3]
Christopher H. Brooks and Nancy Montanez. 2006. Improved annotation of the blogosphere via autotagging and hierarchical clustering. In Proceedings of the International Conference on World Wide Web.
[4]
Gustavo Carneiro and Nuno Vasconcelos. 2005. Formulating semantic image annotation as a supervised learning problem. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
[5]
Pi-Chuan Chang, Huihsin Tseng, Dan Jurafsky, and Christopher D. Manning. 2009. Discriminative reordering with Chinese grammatical relations features. In Proceedings of the Workshop on Syntax and Structure in Statistical Translation.
[6]
Tat-Seng Chua, Jinhui Tang, Richang Hong, Haojie Li, Zhiping Luo, and Yantao Zheng. 2009. NUS-WIDE: A real-world Web image database from National University of Singapore. In Proceeding of the ACM International Conference on Image and Video Retrieval.
[7]
Pinar Duygulu, Kobus Barnard, Nando de Freitas, and David Forsyth. 2002. Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary. In Proceedings of the European Conference on Computer Vision.
[8]
Zhouyu Fu, Guojun Lu, Kai ming Ting, and Dengsheng Zhang. 2011. A survey of audio-based music classification and annotation. IEEE Trans. Multimedia 13, 2, 303--319.
[9]
Yue Gao, Meng Wang, Zheng-Jun Zha, Jialie Shen, Xuelong Li, and Xindong Wu. 2012. Visual-textual joint relevance learning for tag-based social image search. IEEE Trans. Image Process. 22, 1, 363--376.
[10]
Scott A. Golder and Bernardo A. Huberman. 2006. Usage patterns of collaborative tagging systems. J. Inf. Sci. 32, 2, 198--208.
[11]
Winston H. Hsu, Lyndon S. Kennedy, and Shih-Fu Chang. 2007. Video search reranking through random walk over document-level context graph. In Proceedings of the ACM International Conference on Multimedia.
[12]
Yuchi Huang, Qingshan Liu, Shaoting Zhang, and Dimitris N. Metaxas. 2010. Image retrieval via probabilistic hypergraph ranking. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
[13]
Jiwoon Jeon, Victor Lavrenko, and R. Manmatha. 2003. Automatic image annotation and retrieval using cross-media relevance models. In Proceedings of the International ACM SIGIR Conference.
[14]
Feng Kang, Rong Jin, and Rahul Sukthankar. 2006. Correlated label propagation with application to multi-label learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
[15]
Xirong Li, Cees G. M. Snoek, and Marcel Worring. 2009. Annotating images by harnessing worldwide user-tagged photos. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing.
[16]
Dong Liu, Xian-Sheng Hua, Linjun Yang, Meng Wang, and Hong-Jiang Zhang. 2009. Tag ranking. In Proceedings of the International Conference on World Wide Web.
[17]
Dong Liu, Xian-Sheng Hua, Meng Wang, and Hong-Jiang Zhang. 2010. Image retagging. In Proceedings of the ACM International Conference on Multimedia.
[18]
Hans Peter Luhn. 1958. The automatic creation of literature abstracts. IBM J. Res. Develop. 2, 2, 159--165.
[19]
Gilad Mishne. 2006. Autotag: A collaborative approach to automated tag assignment for weblog posts. In Proceedings of the International Conference on World Wide Web.
[20]
Florent Monay and Daniel Gatica-Perez. 2004. Plsa-based image auto-annotation: Constraining the latent space. In Proceedings of the ACM International Conference on Multimedia.
[21]
Yasuhide Hironobu Mori, Hironobu Takahashi, and Ryuichi Oka. 1999. Image-to-word transformation based on dividing and vector quantizing images with words. In Proceedings of the International Workshop on Multimedia Intelligent Storage and Retrieval Management.
[22]
Sascha Narr, Ernesto William De Luca, and Sahin Albayrak. 2011. Extracting semantic annotations from Twitter. In Proceedings of the Workshop on Exploiting Semantic Annotations in Information Retrieval.
[23]
Liqiang Nie, Meng Wang, Zheng-Jun Zha, Guangda Li, and Tat-Seng Chua. 2011. Multimedia answering: Enriching text QA with media information. In Proceedings of the International ACM SIGIR Conference.
[24]
Liqiang Nie, Meng Wang, Zheng-Jun Zha, and Tat-Seng Chua. 2012a. Oracle in image search: A content-based approach to performance prediction. ACM Trans. Inf. Syst. 30, 2, Article 3.
[25]
Liqiang Nie, Shuicheng Yan, Meng Wang, Richang Hong, and Tat-Seng Chua. 2012b. Harvesting visual concepts for image search with complex queries. In Proceedings of the ACM International Conference on Multimedia.
[26]
Liqiang Nie, Meng Wang, Yue Gao, Zheng-Jun Zha, and Tat-Seng Chua. 2013. Beyond text QA: Multimedia answer generation by harvesting Web information. IEEE Trans. Multimedia 15, 2, 426--441.
[27]
Guo-Jun Qi, Xian-Sheng Hua, Yong Rui, Jinhui Tang, Tao Mei, and Hong-Jiang Zhang. 2007. Correlative multi-label video annotation. In Proceedings of the ACM International Conference on Multimedia.
[28]
Börkur Sigurbjörnsson and Roelof van Zwol. 2008. Flickr tag recommendation based on collective knowledge. In Proceedings of the International Conference on World Wide Web.
[29]
Sanjay Sood, Sara Owsley, Kristian Hammond, and Larry Birnbaum. 2007. Tagassist: Automatic tag suggestion for blog posts. In Proceedings of the International Conference on Weblogs and Social Media.
[30]
Shankara B. Subramanya and Huan Liu. 2008. Socialtagger - Collaborative tagging for blogs in the long tail. In Proceedings of the ACM Workshop on Search in Social Media.
[31]
Jinhui Tang, Haojie Li, Guo-Jun Qi, and Tat-Seng Chua. 2010. Image annotation by graph-based inference with integrated multiple/single instance representations. IEEE Trans. Multimedia 12, 2, 131--141.
[32]
Xinmei Tian, Linjun Yang, Jingdong Wang, Yichen Yang, Xiuqing Wu, and Xian-Sheng Hua. 2008. Bayesian video search reranking. In Proceedings of the ACM International Conference on Multimedia.
[33]
Kai Wang, Zhaoyan Ming, and Tat-Seng Chua. 2009. A syntactic tree matching approach to finding similar questions in community-based QA services. In Proceedings of the International ACM SIGIR Conference.
[34]
Matthijs J. Warrens. 2010. Inequalities between multi-rater kappas. Adv. Data Anal. Classification 4, 4, 271--286.
[35]
Pengcheng Wu, Steven Chu-Hong Hoi, Peilin Zhao, and Ying He. 2011. Mining social images with distance metric learning for automated image tagging. In Proceedings of the ACM International Conference on Web Search and Data Mining.
[36]
Wei Wu, Bin Zhang, and Mari Ostendorf. 2010. Automatic generation of personalized annotation tags for twitter users. In Proceedings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics.
[37]
Yu Xiang, Xiangdong Zhou, Tat-Seng Chua, and Chong-Wah Ngo. 2009. A revisit of generative model for automatic image annotation using markov random fields. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
[38]
Zhichen Xu, Yun Fu, Jianchang Mao, and Difu Su. 2006. Towards the Semantic Web: Collaborative tag suggestions. In Proceedings of the International Conference on World Wide Web.
[39]
Rong Yan, Alexander Hauptmann, and Rong Jin. 2003. Multimedia search with pseudo-relevance feedback. In Proceedings of the International Conference on Image and Video.
[40]
Changbo Yang, Ming Dong, and Jing Hua. 2006. Region-based image annotation using asymmetrical support vector machine-based multiple-instance learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
[41]
Yang Yang, Yi Yang, and Heng Tao Shen. 2013. Effective transfer tagging from image to video. ACM Trans. Multimedia Comput. Commun. Appl. 9, 2, Article 14.
[42]
Jun Yu, Dacheng Tao, and Meng Wang. 2012. Adaptive hypergraph learning and its application in image classification. IEEE Trans. Image Process. 21, 7, 3262--3272.
[43]
Dengyong Zhou, Olivier Bousquet, Thomas Navin Lal, Jason Weston, and Bernhard Schölkopf. 2004. Learning with local and global consistency. In Proceedings of the Advances in Neural Information Processing Systems Conference.
[44]
Dengyong Zhou, Jiayuan Huang, and Bernhard Schölkopf. 2006. Learning with hypergraphs: Clustering, classification, and embedding. In Proceedings of the Advances in Neural Information Processing Systems Conference.

Cited By

View all

Index Terms

  1. Learning to Recommend Descriptive Tags for Questions in Social Forums

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Information Systems
    ACM Transactions on Information Systems  Volume 32, Issue 1
    January 2014
    123 pages
    ISSN:1046-8188
    EISSN:1558-2868
    DOI:10.1145/2576772
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 January 2014
    Accepted: 01 November 2013
    Revised: 01 September 2013
    Received: 01 February 2013
    Published in TOIS Volume 32, Issue 1

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Question annotation
    2. knowledge organization
    3. social QA

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)16
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 25 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media