OpenAI’s new GPT-4o lets people interact using voice or video in the same model
GPT-4 offered similar capabilities, giving users multiple ways to interact with OpenAI’s AI offerings. But it siloed them in separate models: a voice prompt, for example, had to pass through a speech-recognition model, then GPT-4 itself, then a text-to-speech model, leading to longer response times and presumably higher computing costs. GPT-4o has now merged those capabilities into a single model, which Murati called an “omnimodel.” That means faster responses and smoother transitions between tasks, she said.
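From a developer’s perspective, the merged design means a single endpoint can accept mixed modalities in one request. As a rough illustration (not part of OpenAI’s demo), here is a minimal sketch using the standard OpenAI Python SDK to send text and an image to GPT-4o in a single call; the image URL is a placeholder:

```python
# A minimal sketch (not from OpenAI's demo): sending text and an image
# to GPT-4o in a single request via the OpenAI Python SDK.
# Assumes OPENAI_API_KEY is set; the image URL is a placeholder.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe what is happening in this image."},
                {
                    "type": "image_url",
                    "image_url": {"url": "https://example.com/photo.jpg"},  # placeholder
                },
            ],
        }
    ],
)
print(response.choices[0].message.content)
```

Note that the real-time voice and video interaction shown onstage went beyond what the public API exposed at launch; the sketch above only illustrates the one-model, many-modalities idea.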
The result, the company’s demonstration suggests, is a conversational assistant much in the vein of Siri or Alexa but capable of fielding far more complex prompts.
“We’re looking at the future of interaction between ourselves and the machines,” Murati said of the demo. “We think that GPT-4o is really shifting that paradigm into the future of collaboration, where this interaction becomes much more natural.”
Barret Zoph and Mark Chen, both researchers at OpenAI, walked through a number of applications for the new model. Most impressive was its facility with live conversation. You could interrupt the model during its responses, and it would stop, listen, and adjust course.
OpenAI showed off the ability to change the model’s tone, too. Chen asked the model to read a bedtime story “about robots and love,” then quickly jumped in to demand a more dramatic voice. The model grew progressively more theatrical until Murati asked it to pivot to a convincing robot voice (which it excelled at). There were, predictably, some short pauses as the model worked out what to say next, but the exchange stood out as a remarkably naturally paced AI conversation.