research-article

Distilling Information Reliability and Source Trustworthiness from Digital Traces

Authors:

Behzad Tabibian,

Mehrdad Farajtabar,

Bernhard Schölkopf,

Manuel Gomez-RodriguezAuthors Info & Claims

WWW '17: Proceedings of the 26th International Conference on World Wide Web

Pages 847 - 855

https://doi.org/10.1145/3038912.3052672

Published: 03 April 2017 Publication History

Abstract

Online knowledge repositories typically rely on their users or dedicated editors to evaluate the reliability of their contents. These explicit feedback mechanisms can be viewed as noisy measurements of both information reliability and information source trustworthiness. Can we leverage these noisy measurements, often biased, to distill a robust, unbiased and interpretable measure of both notions?

In this paper, we argue that the large volume of digital traces left by the users within knowledge repositories also reflect information reliability and source trustworthiness. In particular, we propose a temporal point process modeling framework which links the temporal behavior of the users to information reliability and source trustworthiness. Furthermore, we develop an efficient convex optimization procedure to learn the parameters of the model from historical traces of the evaluations provided by these users. Experiments on real-world data gathered from Wikipedia and Stack Overflow show that our modeling framework accurately predicts evaluation events, provides an interpretable measure of information reliability and source trustworthiness, and yields interesting insights about real-world events.

References

[1]

A. Borodin, G. O. Roberts, J. S. Rosenthal, and P. Tsaparas. Link analysis ranking: algorithms, theory, and experiments. ACM Transactions on Internet Technology, 5(1):231--297, 2005.

Digital Library

[2]

Z. Gyöngyi, H. Garcia-Molina, and J. Pedersen. Combating web spam with trustrank. In VLDB, 2004.

Digital Library

[3]

M. Wu and A. Marian. Corroborating answers from multiple web sources. In WebDB, 2007.

[4]

X. L. Dong, E. Gabrilovich, G. Heitz, W. Horn, K. Murphy, S. Sun, and W. Zhang. From data fusion to knowledge fusion. VLDB, 2014.

Digital Library

[5]

X. L. Dong, E. Gabrilovich, K. Murphy, V. Dang, W. Horn, C. Lugaresi, S. Sun, and W. Zhang. Knowledge-based trust: Estimating the trustworthiness of web sources. VLDB, 2015.

Digital Library

[6]

H. Xiao, J. Gao, Q. Li, F. Ma, L. Su, Y Feng., and A. Zhang. Towards confidence in the truth: A bootstrapping based truth discovery approach. In KDD, 2016.

Digital Library

[7]

B. T. Adler and L. De Alfaro. A content-driven reputation system for the wikipedia. In WWW, 2007.

Digital Library

[8]

J. Pasternack and D. Roth. Latent credibility analysis. In WWW, 2013.

Digital Library

[9]

X. Yin and W. Tan. Semi-supervised truth discovery. In WWW, 2011.

Digital Library

[10]

B. Zhao and J. Han. A probabilistic model for estimating real-valued truth from conflicting sources. Proceedings of QDB, 2012.

[11]

B. Zhao, B. I. Rubinstein, J. Gemmell, and J. Han. A bayesian approach to discovering truth from conflicting sources for data integration. VLDB, 2012.

Digital Library

[12]

Y. Li, Q. Li, J. Gao, L. Su, B. Zhao, W. Fan, and J. Han. On the discovery of evolving truth. In KDD, 2015.

Digital Library

[13]

X. Liu, X. L. Dong, B. C. Ooi, and D. Srivastava. Online data fusion. VLDB, 2011.

[14]

A. Pal, V. Rastogi, A. Machanavajjhala, and P. Bohannon. Information integration over time in unreliable and uncertain environments. In WWW, 2012.

Digital Library

[15]

S. Wang, D. Wang, L. Su, L. Kaplan, and T. F. Abdelzaher. Towards cyber-physical systems in social spaces: The data reliability challenge. In RTSS, 2014.

[16]

M. Gomez-Rodriguez, D. Balduzzi, and B. Schölkopf. Uncovering the temporal dynamics of diffusion networks. In ICML, 2011.

Digital Library

[17]

N. Du, L. Song, M. Gomez-Rodriguez, and H. Zha. Scalable influence estimation in continuous-time diffusion networks. In NIPS, 2013.

Digital Library

[18]

H. Daneshmand, M. Gomez-Rodriguez, L. Song, and B. Schölkopf. Estimating diffusion network structures: Recovery conditions, sample complexity & soft-thresholding algorithm. In ICML, 2014.

Digital Library

[19]

M. Farajtabar, X. Ye, S. Harati, L. Song, and H. Zha. Multistage campaigning in social networks. In NIPS, 2016.

[20]

M. Karimi, E. Tavakoli, M. Farajtabar, L. Song, and M. Gomez-Rodriguez. Smart Broadcasting: Do you want to be seen? In KDD, 2016.

Digital Library

[21]

M. Farajtabar, N. Du, M. Gomez-Rodriguez, I. Valera, H. Zha, and L. Song. Shaping social activity by incentivizing users. In NIPS, 2014.

Digital Library

[22]

N. Du, H. Dai, R. Trivedi, U. Upadhyay, M. Gomez-Rodriguez, and L. Song. Recurrent Marked Temporal Point Process: Embedding Event History to Vector. In KDD, 2016.

Digital Library

[23]

D. Hunter, P. Smyth, D. Q. Vu, and A. U. Asuncion. Dynamic egocentric models for citation networks. In ICML, 2011.

[24]

M. Farajtabar, Y. Wang, M. Gomez-Rodriguez, S. Li, H. Zha, and L. Song. Coevolve: A joint point process model for information diffusion and network co-evolution. In NIPS, 2015.

Digital Library

[25]

A. De, I. Valera, N. Ganguly, S. Bhattacharya, and M. Gomez-Rodriguez. Learning and forecasting opinion dynamics in social networks. In NIPS, 2016.

[26]

I. Valera and M. Gomez-Rodriguez. Modeling adoption and usage of competing products. In ICDM, 2015.

Digital Library

[27]

O. Aalen, O. Borgan, and H. K. Gjessing. Survival and event history analysis: a process point of view. Springer, 2008.

[28]

K. Zhou, H. Zha, and L. Song. Learning triggering kernels for multi-dimensional hawkes processes. In ICML, 2013.

Digital Library

[29]

S. Diamond and S. Boyd. CVXPY: A Python-embedded modeling language for convex optimization. Journal of Machine Learning Research, 2016.

Digital Library

[30]

R. Řehůřek and P. Sojka. Software Framework for Topic Modelling with Large Corpora. In LREC, 2010.

[31]

A. Anderson, J. Kleinberg, and S. Mullainathan. Assessing Human Error Against a Benchmark of Perfection. In KDD, 2016.

Digital Library

[32]

S. Greenstein and F. Zhu. Is wikipedia biased? The American economic review, 102(3):343--348, 2012.

[33]

U. Upadhyay, I. Valera, and M. Gomez-Rodriguez. Uncovering the dynamics of crowdlearning and the value of knowledge. In WSDM, 2017.

Digital Library

Cited By

Santos TLemmerich FHelic D(2023)Bayesian estimation of decay parameters in Hawkes processesIntelligent Data Analysis10.3233/IDA-21628327:1(223-240)Online publication date: 30-Jan-2023
https://doi.org/10.3233/IDA-216283
Wu LLong YGao CWang ZZhang Y(2023)MFIR: Multimodal fusion and inconsistency reasoning for explainable fake news detectionInformation Fusion10.1016/j.inffus.2023.101944100(101944)Online publication date: Dec-2023
https://doi.org/10.1016/j.inffus.2023.101944
Morato JDiaz-Nafria JSanchez-Cuadrado S(2023)Factors Affecting the Reliability of Information: The Case of ChatGPTAdvanced Research in Technologies, Information, Innovation and Sustainability10.1007/978-3-031-48930-3_12(151-164)Online publication date: 20-Dec-2023
https://doi.org/10.1007/978-3-031-48930-3_12
Show More Cited By

Index Terms

Distilling Information Reliability and Source Trustworthiness from Digital Traces
1. Information systems
  1. World Wide Web

Recommendations

Distilling Information Reliability and Source Trustworthiness from Digital Traces
WWW '17 Companion: Proceedings of the 26th International Conference on World Wide Web Companion

Online knowledge repositories typically rely on their users or dedicated editors to evaluate the reliability of their content. These evaluations can be viewed as noisy measurements of both information reliability and information source trustworthiness. ...
Information Reliability Evaluation: From Arabic Storytelling to Computer Sciences

The literature on information retrieval shows the importance of information reliability as a key criterion for relevance judgment. However, information reliability evaluation is discussed in many disciplines such as history, Arabic storytelling, and ...
Information sharing and the impact of shutdown policy in a supply chain with market disruption risk in the social media era
Abstract
This paper investigates the information sharing issue in a simple supply chain with one manufacturer and one retailer. The market demand might be subject to disruption, but the retailer can get access to some signals from social media ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

WWW '17: Proceedings of the 26th International Conference on World Wide Web

April 2017

1678 pages

ISBN:9781450349130

General Chairs:
Rick Barrett
W3Events
,
Rick Cummings
Murdoch University
,
Program Chairs:
Eugene Agichtein
Emory University
,
Evgeniy Gabrilovich
Google Research

Copyright © 2017 Copyright is held by the International World Wide Web Conference Committee (IW3C2).

Sponsors

IW3C2: International World Wide Web Conference Committee

In-Cooperation

SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web

Publisher

International World Wide Web Conferences Steering Committee

Republic and Canton of Geneva, Switzerland

Publication History

Published: 03 April 2017

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

WWW '17

Sponsor:

IW3C2

WWW '17: 26th International World Wide Web Conference

April 3 - 7, 2017

Perth, Australia

Acceptance Rates

WWW '17 Paper Acceptance Rate 164 of 966 submissions, 17%;

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

16
Total Citations
View Citations
362
Total Downloads

Downloads (Last 12 months)28
Downloads (Last 6 weeks)3

Reflects downloads up to 06 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

Santos TLemmerich FHelic D(2023)Bayesian estimation of decay parameters in Hawkes processesIntelligent Data Analysis10.3233/IDA-21628327:1(223-240)Online publication date: 30-Jan-2023
https://doi.org/10.3233/IDA-216283
Wu LLong YGao CWang ZZhang Y(2023)MFIR: Multimodal fusion and inconsistency reasoning for explainable fake news detectionInformation Fusion10.1016/j.inffus.2023.101944100(101944)Online publication date: Dec-2023
https://doi.org/10.1016/j.inffus.2023.101944
Morato JDiaz-Nafria JSanchez-Cuadrado S(2023)Factors Affecting the Reliability of Information: The Case of ChatGPTAdvanced Research in Technologies, Information, Innovation and Sustainability10.1007/978-3-031-48930-3_12(151-164)Online publication date: 20-Dec-2023
https://doi.org/10.1007/978-3-031-48930-3_12
Noorbakhsh KRodriguez MKoyejo SMohamed SAgarwal ABelgrave DCho KOh A(2022)Counterfactual temporal point processesProceedings of the 36th International Conference on Neural Information Processing Systems10.5555/3600270.3602069(24810-24823)Online publication date: 28-Nov-2022
https://dl.acm.org/doi/10.5555/3600270.3602069
Safavi SLogothetis NBesserve M(2021)From Univariate to Multivariate Coupling Between Continuous Signals and Point Processes: A Mathematical FrameworkNeural Computation10.1162/neco_a_0138933:7(1751-1817)Online publication date: 11-Jun-2021
https://doi.org/10.1162/neco_a_01389
Dito FAlqadhi HAlasaadi A(2020)Detecting Medical Rumors on Twitter Using Machine Learning2020 International Conference on Innovation and Intelligence for Informatics, Computing and Technologies (3ICT)10.1109/3ICT51146.2020.9311957(1-7)Online publication date: 20-Dec-2020
https://doi.org/10.1109/3ICT51146.2020.9311957
Santos TWalk SKern RStrohmaier MHelic D(2019)Self- and Cross-Excitation in Stack Exchange Question & Answer CommunitiesThe World Wide Web Conference10.1145/3308558.3313440(1634-1645)Online publication date: 13-May-2019
https://dl.acm.org/doi/10.1145/3308558.3313440
Guo RLi JLiu H(2018)INITIATORProceedings of the 27th International Joint Conference on Artificial Intelligence10.5555/3304889.3304964(2191-2197)Online publication date: 13-Jul-2018
https://dl.acm.org/doi/10.5555/3304889.3304964
De ABhattacharya SGanguly NAndre EKoenig SDastani MSukthankar G(2018)Shaping Opinion Dynamics in Social NetworksProceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems10.5555/3237383.3237899(1336-1344)Online publication date: 9-Jul-2018
https://dl.acm.org/doi/10.5555/3237383.3237899
Yardim AKristof VMaystre LGrossglauser MGuo YFarooq F(2018)Can Who-Edits-What Predict Edit Survival?Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining10.1145/3219819.3219979(2604-2613)Online publication date: 19-Jul-2018
https://dl.acm.org/doi/10.1145/3219819.3219979
Show More Cited By

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents