AI

This Week in AI: Addressing racism in AI image generators

Comment

Google logo is seen during the sales launch event of Google Inc. Pixel 3
Image Credits: Tomohiro Ohsumi / Getty Images

Keeping up with an industry as fast-moving as AI is a tall order. So until an AI can do it for you, here’s a handy roundup of recent stories in the world of machine learning, along with notable research and experiments we didn’t cover on their own.

This week in AI, Google paused its AI chatbot Gemini’s ability to generate images of people after a segment of users complained about historical inaccuracies. Told to depict “a Roman legion,” for instance, Gemini would show an anachronistic, cartoonish group of racially diverse foot soldiers while rendering “Zulu warriors” as Black.

It appears that Google — like some other AI vendors, including OpenAI — had implemented clumsy hardcoding under the hood to attempt to “correct” for biases in its model. In response to prompts like “show me images of only women” or “show me images of only men,” Gemini would refuse, asserting such images could “contribute to the exclusion and marginalization of other genders.” Gemini was also loath to generate images of people identified solely by their race — e.g. “white people” or “black people” — out of ostensible concern for “reducing individuals to their physical characteristics.”

Right wingers have latched on to the bugs as evidence of a “woke” agenda being perpetuated by the tech elite. But it doesn’t take Occam’s razor to see the less nefarious truth: Google, burned by its tools’ biases before (see: classifying Black men as gorillas, mistaking thermal guns in Black people’s hands as weapons, etc.), is so desperate to avoid history repeating itself that it’s manifesting a less biased world in its image-generating models — however erroneous.

In her best-selling book “White Fragility,” anti-racist educator Robin DiAngelo writes about how the erasure of race — “color blindness,” by another phrase — contributes to systemic racial power imbalances rather than mitigating or alleviating them. By purporting to “not see color” or reinforcing the notion that simply acknowledging the struggle of people of other races is sufficient to label oneself “woke,” people perpetuate harm by avoiding any substantive conservation on the topic, DiAngelo says.

Google’s ginger treatment of race-based prompts in Gemini didn’t avoid the issue, per se — but disingenuously attempted to conceal the worst of the model’s biases. One could argue (and many have) that these biases shouldn’t be ignored or glossed over, but addressed in the broader context of the training data from which they arise — i.e. society on the world wide web.

Yes, the datasets used to train image generators generally contain more white people than Black people, and yes, the images of Black people in those data sets reinforce negative stereotypes. That’s why image generators sexualize certain women of color, depict white men in positions of authority and generally favor wealthy Western perspectives.

Some may argue that there’s no winning for AI vendors. Whether they tackle — or choose not to tackle — models’ biases, they’ll be criticized. And that’s true. But I posit that, either way, these models are lacking in explanation — packaged in a fashion that minimizes the ways in which their biases manifest.

Were AI vendors to address their models’ shortcomings head on, in humble and transparent language, it’d go a lot further than haphazard attempts at “fixing” what’s essentially unfixable bias. We all have bias, the truth is — and we don’t treat people the same as a result. Nor do the models we’re building. And we’d do well to acknowledge that.

‘Embarrassing and wrong’: Google admits it lost control of image-generating AI

Here are some other AI stories of note from the past few days:

  • Women in AI: TechCrunch launched a series highlighting notable women in the field of AI. Read the list here.
  • Stable Diffusion v3: Stability AI has announced Stable Diffusion 3, the latest and most powerful version of the company’s image-generating AI model, based on a new architecture.
  • Chrome gets GenAI: Google’s new Gemini-powered tool in Chrome allows users to rewrite existing text on the web — or generate something completely new.
  • Blacker than ChatGPT: Creative ad agency McKinney developed a quiz game, Are You Blacker than ChatGPT?, to shine a light on AI bias.
  • Calls for laws: Hundreds of AI luminaries signed a public letter earlier this week calling for anti-deepfake legislation in the U.S.
  • Match made in AI: OpenAI has a new customer in Match Group, the owner of apps including Hinge, Tinder and Match, whose employees will use OpenAI’s AI tech to accomplish work-related tasks.
  • DeepMind safety: DeepMind, Google’s AI research division, has formed a new org, AI Safety and Alignment, made up of existing teams working on AI safety but also broadened to encompass new, specialized cohorts of GenAI researchers and engineers.
  • Open models: Barely a week after launching the latest iteration of its Gemini models, Google released Gemma, a new family of lightweight open-weight models.
  • House task force: The U.S. House of Representatives has founded a task force on AI that — as Devin writes — feels like a punt after years of indecision that show no sign of ending.

More machine learnings

AI models seem to know a lot, but what do they actually know? Well, the answer is nothing. But if you phrase the question slightly differently… they do seem to have internalized some “meanings” that are similar to what humans know. Although no AI truly understands what a cat or a dog is, could it have some sense of similarity encoded in its embeddings of those two words that is different from, say, cat and bottle? Amazon researchers believe so.

Their research compared the “trajectories” of similar but distinct sentences, like “the dog barked at the burglar” and “the burglar caused the dog to bark,” with those of grammatically similar but different sentences, like “a cat sleeps all day” and “a girl jogs all afternoon.” They found that the ones humans would find similar were indeed internally treated as more similar despite being grammatically different, and vice versa for the grammatically similar ones. OK, I feel like this paragraph was a little confusing, but suffice it to say that the meanings encoded in LLMs appear to be more robust and sophisticated than expected, not totally naïve.

Neural encoding is proving useful in prosthetic vision, Swiss researchers at EPFL have found. Artificial retinas and other ways of replacing parts of the human visual system generally have very limited resolution due to the limitations of microelectrode arrays. So no matter how detailed the image is coming in, it has to be transmitted at a very low fidelity. But there are different ways of downsampling, and this team found that machine learning does a great job at it.

Image Credits: EPFL

“We found that if we applied a learning-based approach, we got improved results in terms of optimized sensory encoding. But more surprising was that when we used an unconstrained neural network, it learned to mimic aspects of retinal processing on its own,” said Diego Ghezzi in a news release. It does perceptual compression, basically. They tested it on mouse retinas, so it isn’t just theoretical.

An interesting application of computer vision by Stanford researchers hints at a mystery in how children develop their drawing skills. The team solicited and analyzed 37,000 drawings by kids of various objects and animals, and also (based on kids’ responses) how recognizable each drawing was. Interestingly, it wasn’t just the inclusion of signature features like a rabbit’s ears that made drawings more recognizable by other kids.

Image Credits: Stanford

“The kinds of features that lead drawings from older children to be recognizable don’t seem to be driven by just a single feature that all the older kids learn to include in their drawings. It’s something much more complex that these machine learning systems are picking up on,” said lead researcher Judith Fan.

Chemists (also at EPFL) found that LLMs are also surprisingly adept at helping out with their work after minimal training. It’s not just doing chemistry directly, but rather being fine-tuned on a body of work that chemists individually can’t possibly know all of. For instance, in thousands of papers there may be a few hundred statements about whether a high-entropy alloy is single or multiple phase (you don’t have to know what this means — they do). The system (based on GPT-3) can be trained on this type of yes/no question and answer, and soon is able to extrapolate from that.

It’s not some huge advance, just more evidence that LLMs are a useful tool in this sense. “The point is that this is as easy as doing a literature search, which works for many chemical problems,” said researcher Berend Smit. “Querying a foundational model might become a routine way to bootstrap a project.”

Last, a word of caution from Berkeley researchers, though now that I’m reading the post again I see EPFL was involved with this one too. Go Lausanne! The group found that imagery found via Google was much more likely to enforce gender stereotypes for certain jobs and words than text mentioning the same thing. And there were also just way more men present in both cases.

Not only that, but in an experiment, they found that people who viewed images rather than reading text when researching a role associated those roles with one gender more reliably, even days later. “This isn’t only about the frequency of gender bias online,” said researcher Douglas Guilbeault. “Part of the story here is that there’s something very sticky, very potent about images’ representation of people that text just doesn’t have.”

With stuff like the Google image generator diversity fracas going on, it’s easy to lose sight of the established and frequently verified fact that the source of data for many AI models shows serious bias, and this bias has a real effect on people.

More TechCrunch

Once linked, parents will be alerted to their teen’s channel activity, including the number of uploads, subscriptions and comments.

YouTube debuts new parental controls aimed at teens

No one is putting the remote working genie back in the bottle. Which is good news for Oyster, a payroll and HR platform that specializes in distributed workforces – or…

As remote working keeps rolling, Oyster raises $59M Series D at $1.2B valuation

For the college students who are satisfied with dating apps, which may not be many, Tinder announced Wednesday a series of updates to Tinder U, its in-app feature that caters…

Tinder update targets college students as dating apps struggle

The exact contents of X’s (now permanent) undertaking with the DPC have not been made public, but it’s assumed the agreement limits how it can use people’s data.

Ireland’s privacy watchdog ends legal fight with X over data use for AI after it agrees to permanent limits

Years ago, Twitter tried but eventually walked away from building TV apps after getting a lukewarm reception. Now, as it looks to revive its advertising business, its new incarnation X…

X doubles down on video with a new TV app

Apple is likely to unveil its iPhone 16 series of phones and maybe even some Apple Watches at its Glowtime event on September 9.

Apple event 2024: How to watch the iPhone 16 launch

Korea’s Institute of Machinery and Materials this week showcased a robotic wheelchair with large, deformable wheels that can manage rocks, stairs and other obstacles. During normal operation, the wheel maintains…

Watch this robotic wheelchair’s compliant wheels take on bumps, rocks and stairs

Mayfield is launching AI Garage, a $100 million initiative for ideation-stage founders interested in building “AI teammate” companies.

Mayfield allocates $100M to AI incubator modeled after its entrepreneur-in-residence program

Anthropic is launching a new subscription plan for its AI chatbot, Claude, catered towards enterprise customers that want more administrative controls and increased security. Claude Enterprise will compete with OpenAI’s…

Anthropic launches Claude Enterprise plan to compete with OpenAI

Time is running out to take advantage of our Student Pass discount for TechCrunch Disrupt 2024. Students and recent graduates can still save up to $200 until September 6 at…

Students and recent grads: Only 3 days left to save on TechCrunch Disrupt 2024 Student Passes

Fast-forward to today, Slauson & Co. remains even more committed to the mission of inclusivity in its funding, and it seems limited partners have its back. 

Slauson & Co. raises $100M Fund II proving appetite for inclusion persists

Safe Superintelligence (SSI), the AI startup co-founded by former OpenAI chief scientist Ilya Sutskever, has raised over $1 billion in capital from investors including NFDG (an investment partnership run by…

Ilya Sutskever’s startup, Safe Superintelligence, raises $1B

The American sports betting market produced $10.9 billion in revenue in 2023 for casinos, sportsbooks and iGaming, according to the American Gambling Association. One of the reasons this industry is…

DubClub wants amateur sports bettors to win more

New climate tech VC firms have emerged in recent years, but existing ones are also raising larger funds. Founded in 2007, Dutch firm SET Ventures is one of the latter.…

Dutch clean energy investor SET Ventures lands new €200 million fund, which will go toward digital tech

Revefi connects to a company’s data stores and databases (e.g. Snowflake, Databricks and so on) and attempts to automatically detect and troubleshoot data-related issues.

Revefi seeks to automate companies’ data operations

If you build an AI search product, you compete with Google. But Google has a lot easier time answering queries with a single, simple answer, such as “how many is…

You.com ‘refocuses’ from AI search to deeper productivity agents with new $50M round

Featured Article

reMarkable’s Paper Pro adds color, light and more but keeps the focus on ‘focus’

The $499 Paper Pro — a new naming convention to indicate it is a higher-end alternative to the now-$379 reMarkable 2, not a direct successor — is momentous for its addition of both color and a “frontlight,” though both features are what you might call muted.

reMarkable’s Paper Pro adds color, light and more but keeps the focus on ‘focus’

Good news for Microsoft: The U.K.’s antitrust regulator says that the tech titan’s high-profile acquihire of the team behind AI startup Inflection doesn’t cause competition concerns, and thus it won’t…

UK regulator greenlights Microsoft’s Inflection acquihire, but also designates it a merger

In the summer of 2023, Lyft was contemplating the sale of its micromobility business after receiving strong interest from prospective buyers. Today, the ride-hail company is doubling down on its…

Why Lyft’s CEO says ‘it would be insane’ not to go all in on bikeshare

Here’s a look at what’s going to change with Siri, and what the introduction of Apple Intelligence will allow you to do with the digital assistant. 

How Apple Intelligence is changing the way you use Siri on your iPhone 

Apple Intelligence was designed to leverage things that generative AI already does well, like text and image generation, to improve upon existing features.

What is Apple Intelligence, when is it coming and who will get it?

Spotify is launching daylist globally. It’s a personalized playlist that evolves throughout the day depending on your listening habits. This rollout comes after the company introduced it first to English-speaking…

Spotify launches its evolving playlist, daylist, globally

Digital lending platforms have become an easy and swift alternative source of credit for microenterprises and individuals overlooked by traditional banking institutions. These platforms have turned into a lifeline for…

Impact investors FMO and BlueOrchard back Ghana’s digital lender Fido in $30M Series B round

Indian online pharmacy startup PharmEasy, once valued at a lofty $5.6 billion, is still about 92% below its peak valuation, according to new estimates by its investor Janus Henderson. The…

PharmEasy still 92% below its peak $5.6 billion valuation, investor estimates

Palm launched in 2023 with the goal of making cash management for enterprise treasury teams easier.

From their experiences at Uber and PayPal, Palm founders want to make moving cash easier for big companies

Canva, the design platform, is increasing prices steeply for some customers. And it’s blaming the move in part on generative AI. In the U.S., some Canva Teams subscribers on older…

Canva has increased prices for its Teams product

Featured Article

Apple Event 2024: iPhone 16, Apple Intelligence and all the other expected ‘Glowtime’ reveals

Apple’s Glowtime iPhone event will include the iPhone 16, but may also feature new AirPods, a new Apple Watch and possibly even new Macs.

Apple Event 2024: iPhone 16, Apple Intelligence and all the other expected ‘Glowtime’ reveals

Snap is testing a “simplified version of Snapchat,” CEO Evan Spiegel wrote in a verbose letter to employees published on Snap’s website Tuesday. The CEO says the simplified version aims…

Snapchat to test a ‘simplified’  app, CEO says

Prevention is better than cure, as the saying goes. Today, a splashy startup that has taken that concept to heart — literally and figuratively — is expanding. Neko Health was…

Neko Health, the body-scanning AI health startup from Spotify’s Daniel Ek, opens in London

The Federal Trade Commission (FTC) published a report about increasing fraud at Bitcoin ATMs. These ATMs allow people to turn their cash into crypto, but they’ve become a tool for…

Bitcoin ATMs are a hotbed for scams, FTC says