AI

Data workers detail exploitation by tech industry in DAIR report

Comment

Image Credits: DAIR/TU Berlin/Data Workers Inquiry

The essential labor of data work, like moderation and annotation, is systematically hidden from those who benefit from the fruits of that labor. A new project puts the lived experiences of data workers around the world in the spotlight, showing firsthand the costs and opportunities of tech work abroad.

Many tedious, thankless, or psychologically damaging tasks have been outsourced to poorer countries, where workers are happy to take on jobs for a fraction of an American or European wage. This labor market joins other jobs of the “dull, dirty, or dangerous” category like electronics “recycling” and shipbreaking. The conditions in moderation or annotation work aren’t as likely to cost you an arm or give you cancer, but that doesn’t make them safe, much less pleasant or rewarding.

The Data Workers’ Inquiry, a collaboration between AI ethics research group DAIR and TU Berlin, are nominally modeled on Marx’s work from the late 19th century identifying labor conditions in reports that are “collectively produced and politically actionable.”

All the reports are freely available and were launched today at an online event where those running the project discussed it.

The ever-expanding scope of AI applications is built by necessity on human expertise, and that expertise is bought to this day for the lowest dollar value companies can offer without incurring a public relations problem. When you report a post, it doesn’t say “great, we’ll send this to a guy in Syria who will be paid 3 cents to take care of it.” But the volume of reports (and of content deserving of report) is so high that solutions other than mass outsourcing of the work to cheap labor markets don’t really make sense to the companies involved.

Perusing the reports, they are largely anecdotal, and deliberately so. These reports are more on the level of systematic anthropological observation than quantitative analyses.

Quantifying experiences like these often fails to capture the real costs — the statistics you end up with are the type that companies love to trumpet (and therefore to solicit in studies): higher wages than other companies in the area, job creation, savings passed on to clients. Seldom are things like moderation workers losing sleep to nightmares or rampant chemical dependency mentioned, let alone measured and presented.

Take Fasica Berhane Gebrekidan’s report on Kenyan data workers struggling with mental health and drug issues. (The full PDF is here.)

She and her colleagues worked for Sama, which bills itself as a more ethical data work pipeline, but the reality of the job, as the actual people describe it, is unrelenting misery and a lack of support from the local office.

A whistleblower’s image of the moderation work space at Samasource in Kenya.
Image Credits: Fasica Berhane Gebrekidan

Recruited to handle tickets (i.e., flagged content) in local languages and dialects, they are exposed to a never-ending stream of violence, gore, sexual abuse, hate speech and other content that they must view and “action” quickly lest their performance fall below expected levels, leading to docked pay, the report says. For some that’s more than one per minute, meaning they view a minimum of around 500 such items a day. (In case you’re wondering where the AI is here — they are likely providing the training data.)

“It’s absolutely soul-crushing. I’ve watched the worst things one can imagine. I’m afraid that I will be scarred for life for doing this job,” said Rahel Gebrekirkos, one of the contractors interviewed.

Support personnel were “ill-equipped, unprofessional, and under-qualified,” and moderators frequently turned to drugs to cope, and complained of intrusive thoughts, depression, and other problems.

We’ve heard some of this before, but it is relevant to hear that it is happening still. There are several reports of this type, but others are more personal stories or take different formats.

For instance, Yasser Yousef Alrayes is a data annotator in Syria, working to pay for his higher education. He and his roommate work together on visual annotation tasks like parsing images of text that, as he points out, are often poorly defined, with frustrating demands from clients.

He chose to document his work in the form of a short film that is well worth eight minutes of your time.

Workers like Yasser are often obscured behind many organizational layers, acting as subcontractors to subcontractors so that lines of responsibility are obfuscated should there ever be a problem or lawsuit.

DAIR and TU Berlin’s Milagros Miceli, one of the leaders of the project, told me that they had not seen any comment or changes from the companies indicated in the report but that it was still early. But the results seem strong enough for them to go back for more: “We’re planning to continue this work with a second cohort of data workers,” she wrote, “most probably from Brazil, Finland, China, and India.”

No doubt there are some who will discount these reports for the very quality that makes them valuable: their anecdotal nature. But while it’s easy to lie with statistics, anecdotes always carry at least some truth in them, for these stories are taken direct from the source. Even if these were the only dozen moderators in Kenya, or Syria, or Venezuela with these problems, what they say should concern anyone who relies on them — which is to say, just about everyone.

More TechCrunch

AWS today announced that it is transitioning OpenSearch, its open source fork of the popular Elasticsearch search and analytics engine, to the Linux Foundation with the launch of the very…

AWS brings OpenSearch under the Linux Foundation umbrella

Insight Partners is reportedly on the cusp of on more than $10 billion in capital commitments for its 13th fund, per the FT.  The FT report notes that two of…

Insight Partners is closing in on a whopping $10B+ new fund

The Port of Seattle released a statement Friday confirming that it was targeted by a ransomware attack. The attack occurred on August 24, with the Port (which also operates the…

Port of Seattle shares ransomware attack details

A decade after the wildly popular game Flappy Bird disappeared, an organization calling itself The Flappy Bird Foundation announced plans to “re-hatch the official Flappy Bird® game.” But this morning,…

Flappy Bird’s creator disavows ‘official’ new version of the game

Platforms to connect apps that wouldn’t normally talk to each other have been around for a minute (see: Zapier). But they have not gotten dramatically simpler to use if you’re…

DryMerge promises to connect apps that normally don’t talk to each other — and when it works, it’s great

Featured Article

Cohere co-founder Nick Frosst’s indie band, Good Kid, is almost as successful as his AI company

Nick Frosst, the co-founder of $5.5 billion Canadian AI startup Cohere, has been a musician his whole life. He told TechCrunch that once he started singing, he never shut up. That’s still true today. In addition to his full-time job at Cohere, Frosst is also the front man of Good…

Cohere co-founder Nick Frosst’s indie band, Good Kid, is almost as successful as his AI company

Blockchain technology is all about decentralization and virtualization. So it’s a little ironic that humans love to come together in person at big blockchain events. Such was the case last…

A walk through the crypto jungle at Korea Blockchain Week

I have a guilty pleasure, and it’s not that I just rewatched “Glee” in its entirety (yes, even the awful later seasons), or that I have read an ungodly amount…

The LinkedIn games are fun, actually

It’s looking increasingly likely that OpenAI will soon alter its complex corporate structure. Reports earlier this week suggested that the AI company was in talks to raise $6.5 billion at…

OpenAI could shake up its nonprofit structure next year

Fusion startups have raised $7.1 billion to date, with the majority of it going to a handful of companies. 

Every fusion startup that has raised over $300M

Netflix has never quite cracked the talk show formula, but maybe it can borrow an existing hit from YouTube. According to Bloomberg, the streamer is in talks with BuzzFeed to…

‘Hot Ones’ could add some heat to Netflix’s live lineup

Alex Parmley has been thinking about building his latest company, ORNG, since he was working on his last company, Phood.  Launched in 2018, Phood was a payments app that let…

Why ORNG’s founder pivoted from college food ordering to real-time money transfer

Lawyers representing Sam Bankman-Fried, the FTX CEO and co-founder who was convicted of fraud and money laundering late last year, are seeking a new trial. Following crypto exchange FTX’s collapse,…

Sam Bankman-Fried appeals conviction, criticizes judge’s ‘unbalanced’ decisions

OpenAI this week unveiled a preview of OpenAI o1, also known as Strawberry. The company claims that o1 can more effectively reason through math and science, as well as fact-check…

OpenAI previews its new Strawberry model

There’s something oddly refreshing about starting the day by solving the Wordle. According to DeepWell DTx, there’s a scientific explanation for why our brains might feel just a bit better…

DeepWell DTx receives FDA clearance for its therapeutic video game developer tools

Soundiiz is a free third-party tool that builds portability tools through existing APIs and acts as a translator between the services.

These two friends built a simple tool to transfer playlists between Apple Music and Spotify, and it works great

In early 2018, VC Mike Moritz wrote in the FT that “Silicon Valley would be wise to follow China’s lead,” noting the pace of work at tech companies was “furious”…

This is how bad China’s startup scene looks now

Fei-Fei Li, the Stanford professor many deem the “Godmother of AI,” has raised $230 million for her new startup, World Labs, from backers including Andreessen Horowitz, NEA, and Radical Ventures.…

Fei-Fei Li’s World Labs comes out of stealth with $230M in funding

Bolt says it has settled its long-standing lawsuit with its investor Activant Capital. One-click payments startup Bolt is settling the suit by buying out the investor’s stake “after which Activant…

Fintech Bolt is buying out the investor suing over Ryan Breslow’s $30M loan

The rise of neobanks has been fascinating to witness, as a number of companies in recent years have grown from merely challenging traditional banks to being massive players in and…

Dave and Varo Bank execs are coming to TechCrunch Disrupt 2024

OpenAI released its new o1 models on Thursday, giving ChatGPT users their first chance to try AI models that pause to “think” before they answer. There’s been a lot of…

First impressions of OpenAI o1: An AI designed to overthink it

Featured Article

Investors rebel as TuSimple pivots from self-driving trucks to AI gaming

TuSimple, once a buzzy startup considered a leader in self-driving trucks, is trying to move its assets to China to fund a new AI-generated animation and video game business. The pivot has not only puzzled and enraged several shareholders, but also threatens to pull the company back into a legal…

Investors rebel as TuSimple pivots from self-driving trucks to AI gaming

Welcome to Startups Weekly — your weekly recap of everything you can’t miss from the world of startups. Want it in your inbox every Friday? Sign up here. This week…

Shrinking teams, warped views, and risk aversion in this week’s startup news

Silicon Valley startup accelerator Y Combinator will expand the number of cohorts it runs each year from two to four starting in 2025, Bloomberg reported Thursday, and TechCrunch confirmed today.…

Y Combinator expanding to four cohorts a year in 2025

Telegram has had a tough few weeks. The messaging app’s founder, Pavel Durov, was arrested in late August and later released on a €5 million bail in France, charged with…

Telegram CEO Durov’s arrest hasn’t dampened enthusiasm for its TON blockchain

Martin Casado, a general partner at Andreessen Horowitz, will tackle one of the most pressing issues facing today’s tech world — AI regulation — only at TechCrunch Disrupt 2024, taking…

A fireside chat with Andreessen Horowitz partner Martin Casado at TechCrunch Disrupt 2024

Christina Cacioppo, CEO and co-founder of Vanta, will be on the SaaS Stage at TechCrunch Disrupt 2024 to reveal how Vanta is redefining security and compliance automation and driving innovation…

Vanta’s Christina Cacioppo takes the stage at TechCrunch Disrupt 2024

On Thursday, cybersecurity giant Fortinet disclosed a breach involving customer data.  In a statement posted online, Fortinet said an individual intruder accessed “a limited number of files” stored on a…

Fortinet confirms customer data breach

Meta has confirmed that it’s restarting efforts to train its AI systems using public Facebook and Instagram posts from its U.K. userbase. The company claims it has “incorporated regulatory feedback” into a…

Meta reignites plans to train AI using UK users’ public Facebook and Instagram posts

Following the moves of other tech giants, Spotify announced on Friday it’s introducing in-app parental controls in the form of “managed accounts” for listeners under the age of 13. The…

Spotify begins piloting parent-managed accounts for kids on family plans