Random Image Display on Page Reload

Meta’s Open Source Llama 3 Is Already Nipping at OpenAI’s Heels

Apr 25, 2024 12:00 PM

Meta’s Open Source Llama 3 Is Already Nipping at OpenAI’s Heels

Meta’s decision to give away powerful AI software for free could threaten the business models of OpenAI and Google.

Digital generated image of layered blue speech bubbles against a blue background

Illustration: Andriy Onufriyenko/Getty Images

Jerome Pesenti has a few reasons to celebrate Meta’s decision last week to release Llama 3, a powerful open source large language model that anyone can download, run, and build on.

Pesenti used to be vice president ofartificial intelligence at Meta and says he often pushed the company to consider releasing its technology for others to use and build on. But his main reason to rejoice is that his new startup will get access to an AI model that he says is very close in power to OpenAI’s industry-leading text generator GPT-4, but considerably cheaper to run and more open to outside scrutiny and modification.

“The release last Friday really feels like a game-changer,” Pesenti says. His new company, Sizzle, an AI tutor, currently uses GPT-4 and other AI models, both closed and open, to craft problem sets and curricula for students. His engineers are evaluating whether Llama 3 could replace OpenAI’s model in many cases.

Sizzle’s story may augur a broader shift in the balance of power in AI. OpenAI changed the world with ChatGPT, setting off a wave of AI investment and drawing more than 2 million developers to its cloud APIs. But if open source models prove competitive, developers and entrepreneurs may decide to stop paying to access the latest model from OpenAI or Google and use Llama 3 or one of the other increasingly powerful open source models that are popping up.

“It’s going to be an interesting horse race,” Pesenti says of competition between open models like Llama 3 and closed ones such as GPT-4 and Google’s Gemini.

Meta’s previous model, Llama 2, was already influential, but the company says it made the latest version more powerful by feeding it larger amounts of higher-quality training data, with new techniques developed to filter out redundant or garbled content and to select the best mixture of datasets to use.

Pesenti says running Llama 3 on a cloud platform such as Fireworks.ai costs just a 20th of the cost of accessing GPT-4 through an API. He adds that Llama 3 can be configured to respond to queries extremely quickly, a key consideration for developers at companies like his that rely on tapping into models from different providers. “It's an equation between latency, cost, and accuracy,” he says.

Open models appear to be dropping at an impressive clip. A couple of weeks ago, I went inside startup Databricks to witness the final stages of an effort to build DBRX, a language model built that was briefly the best open one around. That crown is now Llama 3’s. Ali Ghodsi, CEO of Databricks, also describes Llama 3 as “game-changing” and says the larger model “is approaching the quality of GPT 4—that levels the playing field between open and closed-source LLMs.”

Llama 3 also showcases the potential for making AI models smaller, so they can be run on less powerful hardware. Meta released two versions of its latest model, one with 70 billion parameters—a measure of the variables it uses to learn from training data—and another with 8 billion. The smaller model is compact enough to run on a laptop but is remarkably capable, at least in WIRED’s testing.

Two days before Meta’s release, Mistral, a French AI company founded by alumni of Pesenti’s team at Meta, open sourced Mixtral 8x22B. It has 141 billion parameters but uses only 39 billion of them at any one time, a design known as a mixture of experts. Thanks to this trick, the model is considerably more capable than some models that are much larger.

Meta isn’t the only tech giant releasing open source AI. This week Microsoft released Phi-3-mini and Apple released OpenELM, two tiny but capable free-to-use language models that can run on a smartphone.

Coming months will show whether Llama 3 and other open models really can displace premium AI models like GPT-4 for some developers. And even more powerful open source AI is coming. The company is working on a massive 400-billion-parameter version of Llama 3 that chief AI scientist Yann LeCun says should be one of the most capable in the world.

Of course all this openness is not purely altruistic. Meta CEO Mark Zuckerberg says opening up its AI models should ultimately benefit the company by lowering the cost of technologies it relies on, for example by spawning compatible tools and services that Meta can use for itself. He left unsaid that it may also be to Meta’s benefit to prevent OpenAI, Microsoft, or Google from dominating the field.

Will Knight is a senior writer for WIRED, covering artificial intelligence. He writes the Fast Forward newsletter that explores how advances in AI and other emerging technology are set to change our lives—sign up here. He was previously a senior editor at MIT Technology Review, where he wrote about fundamental… Read more
Senior Writer

Read More

Meta Is Already Training a More Powerful Successor to Llama 3

The open source Llama 3 AI model released by Meta today is just the start, according to the company’s chief AI scientist, Yann LeCun. He said a new, much larger version is in the works.

Will Knight

Google Thinks It Can Cash In on Generative AI. Microsoft Already Has

While both Alphabet and Microsoft boasted strong quarterly earnings, only one tech giant showed that its generative AI bet is starting to pay off.

Paresh Dave

To Build a Better AI Supercomputer, Let There Be Light

OpenAI and other AI leaders think new leaps in machine intelligence will require new forms of computer hardware. One proposal involves connecting GPUs with light.

Will Knight

What Really Made Geoffrey Hinton Into an AI Doomer

The AI pioneer is alarmed by how clever the technology he helped create has become. And it all started with a joke.

Will Knight

How to Stop ChatGPT’s Voice Feature From Interrupting You

ChatGPT’s conversation tools are fantastic—when the chatbot isn’t constantly talking over you. Try these tips for a better AI audio experience.

Reece Rogers

Ads for Explicit ‘AI Girlfriends’ Are Swarming Facebook and Instagram

WIRED found thousands of ads running on Meta’s social platforms promoting sexually explicit “AI girlfriend" apps. Some human sex workers say the platform unfairly polices their own posts more harshly.

Lydia Morrish

The Unsexy Future of Generative AI

Some startups that launched buzzy generative AI products are now narrowing their offerings to try to make them more useful to business clients.

Lauren Goode

A Lawsuit Argues Meta Is Required by Law to Let You Control Your Own Feed

Academic Ethan Zuckerman is suing Meta to win protections for add-ons that help researchers study the platform and give users more control over their feeds.

Vittoria Elliott

*****
Credit belongs to : www.wired.com

Check Also

Climate Protesters Storm Tesla’s Gigafactory in Europe

Morgan Meaker Business May 10, 2024 9:08 AM Climate Protesters Storm Tesla’s Gigafactory in Europe …