Random Image Display on Page Reload

AI’s Hacking Skills Are Approaching an ‘Inflection Point’

AI’s Hacking Skills Are Approaching an ‘Inflection Point’

AI models are getting so good at finding vulnerabilities that some experts say the tech industry might need to rethink how software is built.

Image may contain Purple
Photo-Illustration: WIRED Staff; Getty Images

Vlad Ionescu and Ariel Herbert-Voss, cofounders of the cybersecurity startup RunSybil, were momentarily confused when their AI tool, Sybil, alerted them to a weakness in a customer’s systems last November.

Sybil uses a mix of different AI models—as well as a few proprietary technical tricks—to scan computer systems for issues that hackers might exploit, like an unpatched server or a misconfigured database.

In this case, Sybil flagged a problem with the customer’s deployment of federated GraphQL, a language used to specify how data is accessed over the web through application programming interfaces (APIs). The issue meant that the customer was inadvertently exposing confidential information.

What puzzled Ionescu and Herbert-Voss was that spotting the issue required a remarkably deep knowledge of several different systems and how those systems interact. RunSybil says it has since found the same problem with other deployments of GraphQL—before anybody else made it public “We scoured the internet, and it didn’t exist,” Herbert-Voss says. “Discovering it was a reasoning step in terms of models’ capabilities—a step change.”

The situation points to a growing risk. As AI models continue to get smarter, their ability to find zero-day bugs and other vulnerabilities also continues to grow. The same intelligence that can be used to detect vulnerabilities can also be used to exploit them.

Dawn Song, a computer scientist at UC Berkeley who specializes in both AI and security, says recent advances in AI have produced models that are better at finding flaws. Simulated reasoning, which involves splitting problems into constituent pieces, and agentic AI, like searching the web or installing and running software tools, have amped up models’ cyber abilities.

“The cyber security capabilities of frontier models have increased drastically in the last few months,” she says. “This is an inflection point.”

Last year, Song cocreated a benchmark called CyberGym to determine how well large language models find vulnerabilities in large open-source software projects. CyberGym includes 1,507 known vulnerabilities found in 188 projects.

In July 2025, Anthropic’s Claude Sonnet 4 was able to find about 20 percent of the vulnerabilities in the benchmark. By October 2025, a new model, Claude Sonnet 4.5, was able to identify 30 percent. “AI agents are able to find zero-days, and at very low cost,” Song says.

Song says this trend shows the need for new countermeasures, including having AI help cybersecurity experts. “We need to think about how to actually have AI help more on the defense side, and one can explore different approaches,” she says.

One idea is for frontier AI companies to share models with security researchers before launch, so they can use the models to find bugs and secure systems prior to a general release.

Another countermeasure, says Song, is to rethink how software is built in the first place. Her lab has shown that it is possible to use AI to generate code that is more secure than what most programmers use today. “In the long run we think this secure-by-design approach will really help defenders,” Song says.

The RunSybil team says that, in the near term, the coding skills of AI models could mean that hackers gain the upper hand. “AI can generate actions on a computer and generate code, and those are two things that hackers do,” Herbert-Voss says. “If those capabilities accelerate, that means offensive security actions will also accelerate.”


This is an edition ofWill Knight’sAI Lab newsletter. Read previous newslettershere.

You Might Also Like

Will Knight is a senior writer for WIRED, covering artificial intelligence. He writes the AI Lab newsletter, a weekly dispatch from beyond the cutting edge of AI—sign up here. He was previously a senior editor at MIT Technology Review, where he wrote about fundamental advances in AI and China’s AI … Read More
Senior Writer

Read More

Former CISA Director Jen Easterly Will Lead RSAC Conference

The longtime cybersecurity professional says she’s taking the helm of the legacy security organization at “an inflection point” for tech and the world beyond.

ICE Can Now Spy on Every Phone in Your Neighborhood

Plus: Iran shuts down its internet amid sweeping protests, an alleged scam boss gets extradited to China, and more.

So Long, GPT-5. Hello, Qwen

In the AI boom, chatbots and GPTs come and go quickly. (Remember Llama?) GPT-5 had a big year, but 2026 will be all about Qwen.

People Are Using AI to Falsely Identify the Federal Agent Who Shot Renee Good

Online detectives are inaccurately claiming to have identified the federal agent who shot and killed a 37-year-old woman in Minnesota based on AI-manipulated images.

Grok Is Being Used to Mock and Strip Women in Hijabs and Saris

A substantial number of AI images generated or edited with Grok are targeting women in religious and cultural clothing.

AI Devices Are Coming. Will Your Favorite Apps Be Along for the Ride?

Tech companies are calling AI the next platform. But some developers are reluctant to let AI agents stand between them and their users.

Why Are Grok and X Still Available in App Stores?

Elon Musk’s chatbot has been used to generate thousands of sexualized images of adults and apparent minors. Apple and Google have removed other “nudify” apps—but continue to host X and Grok.

People Are Using Sora 2 to Make Disturbing Videos With AI-Generated Kids

Videos such as fake ads featuring AI children playing with vibrators or Jeffrey Epstein- and Diddy-themed play sets are being made with Sora 2 and posted to TikTok.

Grok Is Pushing AI ‘Undressing’ Mainstream

Paid tools that “strip” clothes from photos have been available on the darker corners of the internet for years. Elon Musk’s X is now removing barriers to entry—and making the results public.

Google’s and OpenAI’s Chatbots Can Strip Women in Photos Down to Bikinis

Users of AI image generators are offering each other instructions on how to use the tech to alter pictures of women into realistic, revealing deepfakes.

AI-Powered Dating Is All Hype. IRL Cruising Is the Future

Dating apps and AI companies have been touting bot wingmen for months. But the future might just be good old-fashioned meet-cutes.

Billion-Dollar Data Centers Are Taking Over the World

The battle for AI dominance has left a large footprint—and it’s only getting bigger and more expensive.

*****
Credit belongs to : www.wired.com

Check Also

Norse Atlantic Airways Offers Dirt-Cheap Tickets. There’s a Catch

Norse Atlantic Airways Offers Dirt-Cheap Tickets. There’s a Catch

Caroline Haskins Business Jun 1, 2026 7:00 AM Norse Atlantic Airways Offers Dirt-Cheap Tickets. There’s …