Anthropic’s latest model can take ‘The Great Gatsby’ as input

Historically and even today, poor memory has been an impediment to the usefulness of text-generating AI. As a recent piece in The Atlantic aptly puts it, even sophisticated generative text AI like ChatGPT has the memory of a goldfish. Each time the model generates a response, it takes into account only a very limited amount of text — preventing it from, say, summarizing a book or reviewing a major coding project.

But Anthropic’s trying to change that.

Today, the AI research startup announced that it’s expanded the context window for Claude (its flagship text-generating AI model, still in preview) from 9,000 tokens to 100,000 tokens. The context window is the text the model considers before generating additional text, while tokens are chunks of raw text (e.g., the word “fantastic” would be split into the tokens “fan,” “tas” and “tic”).

So what’s the significance, exactly? Well, as alluded to earlier, models with small context windows tend to “forget” the content of even very recent conversations, leading them to veer off topic. After a few thousand words or so, they also forget their initial instructions, extrapolating their behavior from the most recent information within their context window rather than from the original request.
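
To make the “forgetting” concrete, here is a toy sketch in Python. It is not how Claude or any real model handles its context: it assumes whitespace-separated words stand in for tokens (real models use subword tokenizers) and that text beyond the window is simply dropped from the front. The function name `visible_context` and the sample conversation are hypothetical, made up for illustration.

```python
def visible_context(tokens, window_size):
    """Return the most recent `window_size` tokens.

    Toy assumption: anything older than the window is simply invisible
    to the model when it generates its next response.
    """
    if window_size <= 0:
        return []
    return tokens[-window_size:]


# A made-up conversation: one instruction followed by 20 filler "tokens".
conversation = "INSTRUCTION: answer only in French . " + "chat " * 20
tokens = conversation.split()  # whitespace split stands in for real tokenization

small = visible_context(tokens, 10)   # small window: only the latest filler
large = visible_context(tokens, 100)  # large window: the whole conversation

print("INSTRUCTION:" in small)  # the instruction has fallen out of view
print("INSTRUCTION:" in large)  # the instruction is still in view
```

With the small window, the original instruction is no longer part of what the model sees, so it can only extrapolate from the latest text. The larger window keeps the instruction in view.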

Given the benefits of large context windows, it’s not surprising that figuring out ways to expand them has become a major focus of AI labs like OpenAI, which devoted an entire team to the issue. OpenAI’s GPT-4 held the previous crown in terms of context window sizes, weighing in at 32,000 tokens on the high end — but the improved Claude API blows past that.

With a bigger “memory,” Claude should be able to converse relatively coherently for hours — several days, even — as opposed to minutes. And perhaps more importantly, it should be less likely to go off the rails.

In a blog post, Anthropic touts the other benefits of Claude’s increased context window, including the ability for the model to digest and analyze hundreds of pages of materials. Beyond reading long texts, the upgraded Claude can help retrieve information from multiple documents or even a book, Anthropic says, answering questions that require “synthesis of knowledge” across many parts of the text.

Anthropic lists a few possible use cases:

  • Digesting, summarizing, and explaining documents such as financial statements or research papers
  • Analyzing risks and opportunities for a company based on its annual reports
  • Assessing the pros and cons of a piece of legislation
  • Identifying risks, themes, and different forms of argument across legal documents
  • Reading through hundreds of pages of developer documentation and surfacing answers to technical questions
  • Rapidly prototyping by dropping an entire codebase into the context and intelligently building on or modifying it

“The average person can read 100,000 tokens of text in around five hours, and then they might need substantially longer to digest, remember, and analyze that information,” Anthropic continues. “Claude can now do this in less than a minute. For example, we loaded the entire text of The Great Gatsby into Claude … and modified one line to say Mr. Carraway was ‘a software engineer that works on machine learning tooling at Anthropic.’ When we asked the model to spot what was different, it responded with the correct answer in 22 seconds.”

Now, longer context windows don’t solve the other memory-related challenges around large language models. Claude, like most models in its class, can’t retain information from one session to the next. And unlike the human brain, it treats every piece of information as equally important, making it a not particularly reliable narrator. Some experts believe that solving these problems will require entirely new model architectures.

For now, though, Anthropic appears to be at the forefront.
