Random Image Display on Page Reload

Amazon’s AGI Lab Reveals Its First Work: Advanced AI Agents

Mar 31, 2025 9:00 AM

Amazon’s AGI Lab Reveals Its First Work: Advanced AI Agents

Led by a former OpenAI executive, Amazon’s AI lab focuses on the decisionmaking capabilities of next-generation software agents—and borrows insights from physical robots.

25 March 2025 Bavaria Munich The logo and lettering of global online mail order company Amazon can be seen under the...

The Amazon logo on the façade of one of their global headquarters.Photograph: Matthias Balk/Getty Images

Amazon is still seen as a bit of a laggard in the race to develop advanced artificial intelligence, but it has quietly created a lab that is now setting records when it comes to AI performance. Amazon’s AGI SF Lab, which is located in San Francisco and dedicated to building artificial general intelligence, or AI that surpasses the capabilities of humans, revealed the first fruits of its work today: A new AI model capable of powering some of the most advanced AI agents available anywhere.

The new model, called Amazon Nova Act, outperforms ones from OpenAI and Anthropic on several benchmarks designed to gauge the intelligence and aptitude of AI agents, Amazon says. On the benchmarks GroundUI Web and ScreenSpot, Amazon Nova Act performs better than Claude 3.7 Sonnet and OpenAI Computer Use Agent. A major part of Amazon’s plan to compete in the AI market is to focus on building agents, and the new model’s abilities reflect its efforts to build a generation of tools that can measure up to the very best available.

“I believe that the basic atomic unit of computing in the future is going to be a call to a giant [AI] agent,” says David Luan, who leads Amazon’s AGI SF Lab. He was previously a vice president of engineering at OpenAI and later cofounded Adept, a startup that pioneered work on AI agents, before joining Amazon in 2024.

Most of the leading AI labs are now focused on building increasingly capable AI agents. Getting AI to master independent actions, as well as conversation, promises to make the technology more useful and valuable. The shift from chat to action is still very much a work in progress, however.

In the past six months, OpenAI, Anthropic, Google, and others have demonstrated web-browsing agents that take actions in response to a prompt. But for the most part, these agents are still unreliable, and they can easily be tripped up by open-ended requests.

Luan says that Amazon’s goal is building AI agents that are dependable rather than flashy. The thing holding agents back is not the need for “more cool demos of interesting capabilities that work 60 percent of the time, it’s the Waymo problem,” he says, referring to how self-driving cars needed to be trained to deal with unusual edge cases before they could take to the streets unsupervised.

Many so-called agents are built by combining large language models with multiple human-written rules that are designed to prevent them from veering off course, but also makes their behavior brittle. Amazon Nova Act is a version of the company's most powerful homegrown model Amazon Nova that has received additional training to help it make decisions about what actions to take and at what time. In general, Luan says, AI models struggle to decide when they should intervene in a task.

To improve Nova’s agential abilities, Amazon is using reinforcement learning, a method that has helped other AI models better simulate reasoning.

Amazon is also taking inspiration from physical robots with its new models. Luan’s team is working with another group at Amazon based in San Francisco led by Pieter Abbeel, a professor at UC Berkeley who works on finding AI applications for robotics. Abbeel, a fellow early OpenAI employee, joined Amazon in August 2024 after it invested in his startup, Covariant. Amazon is well positioned to make progress in robotics given the vast numbers of those already deployed in its fulfillment centers.

The release of Amazon Nova Act suggests that Amazon could emerge as a dark horse in the race to create useful software agents. The company was slow in responding to ChatGPT, but it has more recently shown signs of getting its act together. In February, the company announced a new version of its voice assistant Alexa with improved conversational abilities as well as the capacity to automate certain web tasks.

One use case Amazon cited is Alexa helping to book a repair service for a broken oven. Luan says Alexa’s new agentic capabilities were developed by his team.

As WIRED revealed in October, Amazon has also done research on how agents might eventually improve ecommerce by automating the process of finding and buying things. Such an agent might preemptively add items to a user’s cart based on their interests and habits, Amazon’s engineers said.

Besides unveiling the new model, Amazon today announced a software development kit (SDK) designed to make it easier for computer engineers to use Amazon Nova Act to build software agents. The SDK lets developers give their agent specific instructions to help them navigate an internet built for human users. For example, an agent can be instructed “don’t accept the insurance upsell” when booking a rental car.

Ultimately, Luan says, Amazon’s agents should become smart enough not to fall for the upsell on their own. “Nova Act is really like the very first step in that vision,” he says.

Update 3/31/25 12 pm EST: This story has been updated to clarify that Amazon did not take a stake in Adept.

Will Knight is a senior writer for WIRED, covering artificial intelligence. He writes the AI Lab newsletter, a weekly dispatch from beyond the cutting edge of AI—sign up here. He was previously a senior editor at MIT Technology Review, where he wrote about fundamental advances in AI and China’s AI … Read more
Senior Writer

Read More

The AI Race Has Gotten Crowded—and China Is Closing In on the US

New research from Stanford suggests artificial intelligence isn’t ruled by just OpenAI and Google, as competition increases across the US, China, and France.
Will Knight

Google’s Gemini Robotics AI Model Reaches Into the Physical World

Google has developed an AI model that gives humanoids and other robots more intelligence—and a tool designed to give them a moral compass too.
Will Knight

Nvidia Bets Big on Synthetic Data

Nvidia has acquired synthetic data startup Gretel to bolster the AI training data used by the chip maker's customers and developers.
Lauren Goode

Databricks Has a Trick That Lets AI Models Improve Themselves

Using several recent innovations, the company Databricks will let customers boost the IQ of their AI models even if they don’t have squeaky clean data.
Will Knight

This Tool Probes Frontier AI Models for Lapses in Intelligence

A new platform from data training company Scale AI will let artificial intelligence developers find their models’ weak spots.
Will Knight

Trump’s Tariffs Could Reshape the US Tech Industry

Apple, Amazon, and other tech companies reliant on global supply chains stand to lose the most from President Trump’s trade policies, but some software firms expect more demand for their services.
Lauren Goode

OpenAI’s Deep Research Agent Is Coming for White-Collar Work

The research-focused agent shows how a new generation of more capable AI models could automate some office tasks.
Will Knight

Researchers Propose a Better Way to Report Dangerous AI Flaws

After identifying major flaws in popular AI models, researchers are pushing for a new system to identify and report bugs.
Will Knight

Inside Google’s Two-Year Frenzy to Catch Up With OpenAI

The search giant should’ve been first to the chatbot revolution. It wasn’t. So it punched back with late nights, layoffs—and lowering some guardrails.
Arielle Pardes

FTC Removes Posts Critical of Amazon, Microsoft, and AI Companies

Business-guidance content published during the Biden administration has been removed from the Federal Trade Commission website.
Makena Kelly

OpenAI’s Sora Is Plagued by Sexist, Racist, and Ableist Biases

WIRED tested the popular AI video generator from OpenAI and found that it amplifies sexist stereotypes and ableist tropes, perpetuating the same biases already present in AI image tools.
Reece Rogers

An AI Image Generator’s Exposed Database Reveals What People Really Used It For

An unsecured database used by a generative AI app revealed prompts and tens of thousands of explicit images—some of which are likely illegal. The company deleted its websites after WIRED reached out.
Matt Burgess

*****
Credit belongs to : www.wired.com

Check Also

GM’s New Battery Tech Could Be a Breakthrough for Affordable EVs

John Voelcker Gear May 13, 2025 10:00 AM GM’s New Battery Tech Could Be a …