tutorial

Which Tweets Will Be Headlines? A Hierarchical Bayesian Model for Bridging Social Media and Traditional Media

Authors:

Luo SiAuthors Info & Claims

SNAKDD'14: Proceedings of the 8th Workshop on Social Network Mining and Analysis

Article No.: 5, Pages 1 - 9

https://doi.org/10.1145/2659480.2659497

Published: 24 August 2014 Publication History

Abstract

Microblogging platforms such as Twitter provide a convenient channel for people to express their feelings, report news, and communicate with friends. Most existing work on social media analysis has been focused on predicting users' behaviors, analyzing the corresponding social networks, tracking the popular topics, etc. However, there is limited research effort on uncovering the relationships between social media (e.g. Twitter) and traditional media (e.g., Washington Post and New York Times), which has a big impact in our daily lives and our society. This paper targets on a novel and important research problem as which and whose tweets are favored by the traditional media. The basic intuition is that whether a tweet could be picked up or not by traditional media depends not only on whether its content matches traditional media's interests towards this specific user but also the writer's personal influence, reflected by factors such as the number of followers. Based on this intuition, this paper proposes a Twitter Pick-Up Relational (TPUR) model to simultaneously integrate these factors. In particular, the dependence between the traditional media's interests towards a user and the content of each tweet, and the influence of each user are integrated in a hierarchical bayesian model. An extensive set of experiments are conducted on two datasets from two popular microblogging platforms, i.e., Twitter and Sina Weibo (Chinese version Twitter), to demonstrate the advantages of our algorithm against baseline methods on the proposed problem.

References

[1]

D. Agarwal and B.-C. Chen. Regression-based latent factor models. In KDD, pages 19--28, 2009.

Digital Library

[2]

D. Agarwal and B.-C. Chen. flda: matrix factorization through latent dirichlet allocation. In WSDM, pages 91--100, 2010.

Digital Library

[3]

L. Backstrom and J. Leskovec. Supervised random walks: predicting and recommending links in social networks. In WSDM, pages 635--644, 2011.

Digital Library

[4]

E. Bakshy, J. M. Hofman, W. A. Mason, and D. J. Watts. Everyone's an influencer: quantifying influence on twitter. In WSDM, pages 65--74, 2011.

Digital Library

[5]

D. P. Bertsekas and D. P. Bertsekas. Nonlinear Programming. Athena Scientific, 2nd edition, Sept. 1999.

[6]

A. Bifet and E. Frank. Sentiment knowledge discovery in twitter streaming data. In Discovery Science, pages 1--15, 2010.

Digital Library

[7]

D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. Journal of Machine Learning Research, 3:993--1022, 2003.

Digital Library

[8]

M. Cha, H. Haddadi, F. Benevenuto, and P. K. Gummadi. Measuring user influence in twitter: The million follower fallacy. In ICWSM, 2010.

[9]

W. Croft, D. Metzler, and T. Strohman. Search engines: information retrieval in practice. Alternative Etext Formats. Addison-Wesley, 2010.

Digital Library

[10]

P. Cui, F. Wang, S. Liu, M. Ou, S. Yang, and L. Sun. Who should share what?: item-level social influence prediction for users and posts ranking. In SIGIR, pages 185--194, 2011.

Digital Library

[11]

I. S. Dhillon, S. Mallela, and D. S. Modha. Information-theoretic co-clustering. In KDD, pages 89--98, 2003.

Digital Library

[12]

E. Gilbert and K. Karahalios. Predicting tie strength with social media. In CHI, pages 211--220, 2009.

Digital Library

[13]

A. Go, R. Bhayani, and L. Huang. Twitter Sentiment Classification using Distant Supervision. Technical report, Stanford University.

[14]

M. Gupta, P. Zhao, and J. Han. Evaluating event credibility on twitter. In SDM, 2012.

[15]

J. L. Herlocker, J. A. Konstan, A. Borchers, and J. Riedl. An algorithmic framework for performing collaborative filtering. In SIGIR, pages 230--237, 1999.

Digital Library

[16]

M. Kim and J. Leskovec. The network completion problem: Inferring missing nodes and edges in networks. In SDM, pages 47--58, 2011.

[17]

Y. Koren, R. M. Bell, and C. Volinsky. Matrix factorization techniques for recommender systems. IEEE Computer, 42(8):30--37, 2009.

Digital Library

[18]

H. Kwak, C. Lee, H. Park, and S. B. Moon. What is twitter, a social network or a news media? In WWW, pages 591--600, 2010.

Digital Library

[19]

M. Rosen-Zvi, T. L. Griffiths, M. Steyvers, and P. Smyth. The author-topic model for authors and documents. In UAI, pages 487--494, 2004.

Digital Library

[20]

R. Salakhutdinov and A. Mnih. Probabilistic matrix factorization. In NIPS, 2007.

Digital Library

[21]

R. Salakhutdinov and A. Mnih. Bayesian probabilistic matrix factorization using markov chain monte carlo. In ICML, pages 880--887, 2008.

Digital Library

[22]

T. tajner, B. Thomee, A. M. Popescu, M. Pennacchiotti, and A. Jaimes. Automatic selection of social media responses to news. In KDD, 2013.

Digital Library

[23]

M. Tsagkias, M. de Rijke, and W. Weerkamp. Linking online news and social media. In WSDM, pages 565--574, 2011.

Digital Library

[24]

C. Wang and D. M. Blei. Collaborative topic modeling for recommending scientific articles. In KDD, pages 448--456, 2011.

Digital Library

[25]

M. J. Welch, U. Schonfeld, D. He, and J. Cho. Topical semantics of twitter links. In WSDM, pages 327--336, 2011.

Digital Library

[26]

J. Weng, E.-P. Lim, J. Jiang, and Q. He. Twitterrank: finding topic-sensitive influential twitterers. In WSDM, pages 261--270, 2010.

Digital Library

[27]

J. Yang and J. Leskovec. Modeling information diffusion in implicit networks. In ICDM, pages 599--608, 2010.

Digital Library

[28]

J. Yang and J. Leskovec. Patterns of temporal variation in online media. In WSDM, pages 177--186, 2011.

Digital Library

[29]

S.-H. Yang, B. Long, A. J. Smola, N. Sadagopan, Z. Zheng, and H. Zha. Like like alike: joint friendship and interest propagation in social networks. In WWW, pages 537--546, 2011.

Digital Library

[30]

K. Yu, J. D. Lafferty, S. Zhu, and Y. Gong. Large-scale collaborative prediction using a nonparametric random effects model. In ICML, page 149, 2009.

Digital Library

[31]

W. X. Zhao, J. Jiang, J. Weng, J. He, E.-P. Lim, H. Yan, and X. Li. Comparing twitter and traditional media using topic models. In ECIR, pages 338--349, 2011.

Digital Library

Cited By

Kourogi SFujishiro HKimura ANishikawa HBailey JMoffat AAggarwal Cde Rijke MKumar RMurdock VSellis TYu J(2015)Identifying Attractive News Headlines for Social MediaProceedings of the 24th ACM International on Conference on Information and Knowledge Management10.1145/2806416.2806631(1859-1862)Online publication date: 17-Oct-2015
https://dl.acm.org/doi/10.1145/2806416.2806631

Recommendations

Traditional media seen from social media
WebSci '13: Proceedings of the 5th Annual ACM Web Science Conference

With the advent of social media services, media outlets have started reaching audiences on social-networking sites. On Twitter, users actively follow a wide set of media sources, form interpersonal networks, and propagate interesting stories to their ...
Social Media: An Exploratory Study of Information, Misinformation, Disinformation, and Malinformation
Abstract
The widespread use of social media all around the globe has affected the way of life in all aspects, not only for individuals but for businesses as well. Businesses share their upcoming events, reveal their products, and advertise to their ...
Measuring and Detecting Virality on Social Media: The Case of Twitter’s Viral Tweets Topic
WWW '23 Companion: Companion Proceedings of the ACM Web Conference 2023

Social media posts may go viral and reach large numbers of people within a short period of time. Such posts may threaten the public dialogue if they contain misleading content, making their early detection highly crucial. Previous works proposed their ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

SNAKDD'14: Proceedings of the 8th Workshop on Social Network Mining and Analysis

August 2014

90 pages

ISBN:9781450331920

DOI:10.1145/2659480

Program Chair:
Feida Zhu

Copyright © 2014 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 August 2014

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Qualifiers

Tutorial
Research
Refereed limited

Conference

KDD '14

Sponsor:

KDD '14: The 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

August 24 - 27, 2014

NY, New York, USA

Upcoming Conference

KDD '25

Sponsor:
sigkdd
sigkdd

The 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining

August 3 - 7, 2025

Toronto , ON , Canada

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

1
Total Citations
View Citations
225
Total Downloads

Downloads (Last 12 months)5
Downloads (Last 6 weeks)0

Reflects downloads up to 23 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Kourogi SFujishiro HKimura ANishikawa HBailey JMoffat AAggarwal Cde Rijke MKumar RMurdock VSellis TYu J(2015)Identifying Attractive News Headlines for Social MediaProceedings of the 24th ACM International on Conference on Information and Knowledge Management10.1145/2806416.2806631(1859-1862)Online publication date: 17-Oct-2015
https://dl.acm.org/doi/10.1145/2806416.2806631

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents