Google Assistant Finally Gets a Generative AI Glow-Up

Oct 4, 2023 10:30 AM

Google Assistant Finally Gets a Generative AI Glow-Up

Google is adding AI capabilities from its chatbot Bard to the humble Google Assistant, allowing the virtual helper to make sense of images and draw on data in documents and emails.

Sissie Hsiao sepaking on stage in front of a screen that reads Assistant with Bard

Google went big when it launched its generative AI fight-back against OpenAI's ChatGPT in May. The company added AI text-generation to its signature search engine, showed off an AI-customized version of the Android operating system, and offered up its own chatbot, Bard. But one Google product didn’t get a generative AI infusion: Google Assistant, the company’s answer to Siri and Alexa.

Today, at its Pixel hardware event in New York, Google Assistant at last got its upgrade for the ChatGPT era. Sissie Hsiao, Google’s vice president and general manager for Google Assistant, revealed a new version of the AI helper that is a mashup of Google Assistant and Bard.

Hsiao says Google envisions this new, “multimodal” assistant to be a tool that goes beyond just voice queries, including by also making sense of images. It can handle “big tasks and small tasks from your to-do list, everything from planning a new trip to summarizing your inbox to writing a fun social media caption for a picture,” she said in an interview with WIRED earlier this week.

Courtesy of Google

The new generative AI experience is so early in its rollout that Hsiao said it didn’t even qualify as an “app” yet. When asked for more information about how it might appear on someone’s phone, company representatives were generally unclear on what final form it might take. (Did Google rush out the announcement to coincide with its hardware event? Quite possibly.)

Whatever container it appears in, the Bard-ified Google Assistant will use generative AI to process text, voice, or image queries, and respond accordingly in either text or voice. It’s limited to approved users for an unknown period of time, will run on mobile only, not smart speakers, and will require users to opt in. On Android, it may operate as either a full-screen app or as an overlay, similar to how Google Assistant runs today. On iOS, it will likely live within one of Google's apps.

The Google Assistant’s generative glow-up comes on the heels of Amazon’s Alexa getting more conversational and OpenAI’s ChatGPT also going multimodal, becoming able to respond using a synthetic voice and describe the content of images shared with the app. One capability apparently unique to Google’s upgraded assistant is an ability to converse about the webpage a user is visiting on their phone.

For Google in particular, the introduction of generative AI to its virtual assistant raises questions around how quickly the search giant will start using large language models across more of its products. That could fundamentally change how some of them work—and how Google monetizes them.

Most Popular

Gain of Function

Google has spent the past several years touting the capabilities of its Google Assistant, which was first introduced to smartphones in 2016, and the past several months touting the capabilities of Bard, which the company has positioned as a kind of chatty, AI-powered collaborator. So what does combining them—within the existing Assistant app—actually do?

Hsiao said the move combines the Assistant’s personalized help with the reasoning and generative capabilities of Bard. One example: Because of the way Bard now works within Google’s productivity apps, it can help find and summarize emails and answer questions about work documents. Those same functions would now theoretically be accessed through Google Assistant—you could request information about your docs or emails using voice and have those summaries read aloud to you.

Its new connection with Bard also gives the Google Assistant new powers to make sense of images. Google already has an image recognition tool, Google Lens, that can be accessed through the Google Assistant or the all-encompassing Google app. But if you capture a photo of a painting or a pair of sneakers and feed it to Lens, Lens will either identify the painting or try to sell you the sneakers—by showing links to buy them—and leave it at that.

The Bard-ified version of Assistant, on the other hand, will understand the content of the photo you’ve shared with it, Hsiao claims. In the future that could allow deep integration with other Google products. “Say you’re scrolling through Instagram and you see a picture of a beautiful hotel. You should be able to one-button press, open Assistant, and ask, ‘Show me more information about this hotel, and tell me if it’s available on my birthday weekend,’” she said. “And it should be able to not only figure out which hotel it is, but actually go check Google Hotels for availability.”

A similar workflow could make the new Google Assistant into a powerful shopping tool if it could connect products in images with online stores. Hsiao said Google hasn’t yet integrated commercial product listings into Bard results but didn’t deny that might be coming in the future.

“If users really want that, if they’re looking to buy things through Bard, that’s something we can look into,” she said. “We need to look at how people want to shop with Bard and really explore that and build that into the product.” (Although Hsiao framed this as something users might want, it could also provide new opportunities for Google’s ad business.)

Get More From WIRED

Will Knight is a senior writer for WIRED, covering artificial intelligence. He writes the Fast Forward newsletter that explores how advances in AI and other emerging technology are set to change our lives—sign up here. He was previously a senior editor at MIT Technology Review, where he wrote about fundamental… Read more

Senior Writer

Lauren Goode is a senior writer at WIRED covering consumer tech issues. She focuses on the intersection of new technologies and humanity, often through experiential or investigative personal essays. Her coverage areas include communications apps, trends in commerce, AR and VR, subscription services, data and device ownership, and how Silicon… Read more

Senior Writer