OpenAI could unveil new multimodal AI model that talks to people & recognises objects: Details here
OpenAI could unveil a new multimodal AI model.
This multimodal AI could talk to you and recognise objects.
It is expected to be a part of what the company intends to unveil today.
For some time now, reports have suggested that OpenAI will announce its AI-powered search engine today to compete with Google Search. The timing is notable: OpenAI appears to be planning to steal Google’s spotlight just before Google’s annual event.
Now, a new report suggests that OpenAI could unveil a new multimodal AI model. Read on to find out what the new model is expected to offer.
According to a report by The Information (via The Verge), OpenAI’s multimodal AI model can talk to you and recognise objects. This AI model is expected to be a part of what the company intends to unveil on Monday.
The new model will likely provide faster and more precise interpretation of images and audio compared to OpenAI’s current separate transcription and text-to-speech models.
It seems the model would enable customer service agents to “better understand the intonation of callers’ voices or whether they’re being sarcastic,” and “theoretically,” it could assist students with maths problems or translate real-world signs.
The model might surpass GPT-4 Turbo in “answering some types of questions,” but it remains prone to making confident errors.
OpenAI seems to be working on having phone calls inside of chatGPT. This is probably going to be a small part of the event announced on Monday.
— Ananay (@ananayarora) May 11, 2024
(1/n) pic.twitter.com/KT8Hb54DwA
It’s possible that OpenAI is also preparing an integrated ChatGPT feature for making phone calls, as suggested by developer Ananay Arora, who shared a screenshot of code related to calls. Arora also noted evidence indicating that OpenAI had provisioned servers for real-time audio and video communication.
Given that all these details stem from reports and leaks, it’s wise to await an official announcement from OpenAI. Whatever OpenAI has in store will be revealed via livestream on its website on Monday at 10AM PT / 10:30PM IST.
Ayushi Jain
Tech news writer by day, BGMI player by night. Combining my passion for tech and gaming to bring you the latest in both worlds.