skip to main content
10.1145/2659480.2659497acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
tutorial

Which Tweets Will Be Headlines? A Hierarchical Bayesian Model for Bridging Social Media and Traditional Media

Published: 24 August 2014 Publication History

Abstract

Microblogging platforms such as Twitter provide a convenient channel for people to express their feelings, report news, and communicate with friends. Most existing work on social media analysis has been focused on predicting users' behaviors, analyzing the corresponding social networks, tracking the popular topics, etc. However, there is limited research effort on uncovering the relationships between social media (e.g. Twitter) and traditional media (e.g., Washington Post and New York Times), which has a big impact in our daily lives and our society. This paper targets on a novel and important research problem as which and whose tweets are favored by the traditional media. The basic intuition is that whether a tweet could be picked up or not by traditional media depends not only on whether its content matches traditional media's interests towards this specific user but also the writer's personal influence, reflected by factors such as the number of followers. Based on this intuition, this paper proposes a Twitter Pick-Up Relational (TPUR) model to simultaneously integrate these factors. In particular, the dependence between the traditional media's interests towards a user and the content of each tweet, and the influence of each user are integrated in a hierarchical bayesian model. An extensive set of experiments are conducted on two datasets from two popular microblogging platforms, i.e., Twitter and Sina Weibo (Chinese version Twitter), to demonstrate the advantages of our algorithm against baseline methods on the proposed problem.

References

[1]
D. Agarwal and B.-C. Chen. Regression-based latent factor models. In KDD, pages 19--28, 2009.
[2]
D. Agarwal and B.-C. Chen. flda: matrix factorization through latent dirichlet allocation. In WSDM, pages 91--100, 2010.
[3]
L. Backstrom and J. Leskovec. Supervised random walks: predicting and recommending links in social networks. In WSDM, pages 635--644, 2011.
[4]
E. Bakshy, J. M. Hofman, W. A. Mason, and D. J. Watts. Everyone's an influencer: quantifying influence on twitter. In WSDM, pages 65--74, 2011.
[5]
D. P. Bertsekas and D. P. Bertsekas. Nonlinear Programming. Athena Scientific, 2nd edition, Sept. 1999.
[6]
A. Bifet and E. Frank. Sentiment knowledge discovery in twitter streaming data. In Discovery Science, pages 1--15, 2010.
[7]
D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. Journal of Machine Learning Research, 3:993--1022, 2003.
[8]
M. Cha, H. Haddadi, F. Benevenuto, and P. K. Gummadi. Measuring user influence in twitter: The million follower fallacy. In ICWSM, 2010.
[9]
W. Croft, D. Metzler, and T. Strohman. Search engines: information retrieval in practice. Alternative Etext Formats. Addison-Wesley, 2010.
[10]
P. Cui, F. Wang, S. Liu, M. Ou, S. Yang, and L. Sun. Who should share what?: item-level social influence prediction for users and posts ranking. In SIGIR, pages 185--194, 2011.
[11]
I. S. Dhillon, S. Mallela, and D. S. Modha. Information-theoretic co-clustering. In KDD, pages 89--98, 2003.
[12]
E. Gilbert and K. Karahalios. Predicting tie strength with social media. In CHI, pages 211--220, 2009.
[13]
A. Go, R. Bhayani, and L. Huang. Twitter Sentiment Classification using Distant Supervision. Technical report, Stanford University.
[14]
M. Gupta, P. Zhao, and J. Han. Evaluating event credibility on twitter. In SDM, 2012.
[15]
J. L. Herlocker, J. A. Konstan, A. Borchers, and J. Riedl. An algorithmic framework for performing collaborative filtering. In SIGIR, pages 230--237, 1999.
[16]
M. Kim and J. Leskovec. The network completion problem: Inferring missing nodes and edges in networks. In SDM, pages 47--58, 2011.
[17]
Y. Koren, R. M. Bell, and C. Volinsky. Matrix factorization techniques for recommender systems. IEEE Computer, 42(8):30--37, 2009.
[18]
H. Kwak, C. Lee, H. Park, and S. B. Moon. What is twitter, a social network or a news media? In WWW, pages 591--600, 2010.
[19]
M. Rosen-Zvi, T. L. Griffiths, M. Steyvers, and P. Smyth. The author-topic model for authors and documents. In UAI, pages 487--494, 2004.
[20]
R. Salakhutdinov and A. Mnih. Probabilistic matrix factorization. In NIPS, 2007.
[21]
R. Salakhutdinov and A. Mnih. Bayesian probabilistic matrix factorization using markov chain monte carlo. In ICML, pages 880--887, 2008.
[22]
T. tajner, B. Thomee, A. M. Popescu, M. Pennacchiotti, and A. Jaimes. Automatic selection of social media responses to news. In KDD, 2013.
[23]
M. Tsagkias, M. de Rijke, and W. Weerkamp. Linking online news and social media. In WSDM, pages 565--574, 2011.
[24]
C. Wang and D. M. Blei. Collaborative topic modeling for recommending scientific articles. In KDD, pages 448--456, 2011.
[25]
M. J. Welch, U. Schonfeld, D. He, and J. Cho. Topical semantics of twitter links. In WSDM, pages 327--336, 2011.
[26]
J. Weng, E.-P. Lim, J. Jiang, and Q. He. Twitterrank: finding topic-sensitive influential twitterers. In WSDM, pages 261--270, 2010.
[27]
J. Yang and J. Leskovec. Modeling information diffusion in implicit networks. In ICDM, pages 599--608, 2010.
[28]
J. Yang and J. Leskovec. Patterns of temporal variation in online media. In WSDM, pages 177--186, 2011.
[29]
S.-H. Yang, B. Long, A. J. Smola, N. Sadagopan, Z. Zheng, and H. Zha. Like like alike: joint friendship and interest propagation in social networks. In WWW, pages 537--546, 2011.
[30]
K. Yu, J. D. Lafferty, S. Zhu, and Y. Gong. Large-scale collaborative prediction using a nonparametric random effects model. In ICML, page 149, 2009.
[31]
W. X. Zhao, J. Jiang, J. Weng, J. He, E.-P. Lim, H. Yan, and X. Li. Comparing twitter and traditional media using topic models. In ECIR, pages 338--349, 2011.

Cited By

View all
  • (2015)Identifying Attractive News Headlines for Social MediaProceedings of the 24th ACM International on Conference on Information and Knowledge Management10.1145/2806416.2806631(1859-1862)Online publication date: 17-Oct-2015

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SNAKDD'14: Proceedings of the 8th Workshop on Social Network Mining and Analysis
August 2014
90 pages
ISBN:9781450331920
DOI:10.1145/2659480
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 August 2014

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Tutorial
  • Research
  • Refereed limited

Conference

KDD '14
Sponsor:

Upcoming Conference

KDD '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)0
Reflects downloads up to 23 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2015)Identifying Attractive News Headlines for Social MediaProceedings of the 24th ACM International on Conference on Information and Knowledge Management10.1145/2806416.2806631(1859-1862)Online publication date: 17-Oct-2015

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media