When is it biased? Assessing the representativeness of Twitter's streaming API

F Morstatter, J Pfeffer, H Liu - … of the 23rd international conference on …, 2014 - dl.acm.org
Proceedings of the 23rd International Conference on World Wide Web, 2014
Twitter shares a free 1% sample of its tweets through the "Streaming API". Recently, research has pointed to evidence of bias in this source. The methodologies proposed in previous work rely on the restrictive and expensive Firehose to find the bias in the Streaming API data. We tackle the problem of finding sample bias without costly and restrictive Firehose data. We propose a solution that focuses on using an open data source to find bias in the Streaming API.
ACM Digital Library