Tuesday, November 5, 2024

ChatGPT ‘Voice Mode’ that lets you talk to AI like a human lands next week

Must read

CHATGPT’S long-awaited upgraded Voice Mode powered by the company’s smartest AI system is nearly here.

Chatbot creator OpenAI said that the new voice features – which allow for spoken humanlike conversations – will roll out next week.’

3

You can speak to ChatGPT using your voice – and it’ll respond back in kindCredit: Getty
The new Voice Mode is powered by ChatGPT's latest and most powerful GPT-4o model

3

The new Voice Mode is powered by ChatGPT’s latest and most powerful GPT-4o modelCredit: Getty

It’s all powered by GPT-4o, a large language model that was unveiled back in May.

As part of the demo, OpenAI showed off stunning demos of the new Voice Mode in action.

In one particularly impressive AI feat, the chatbot was able to translate in real-time for two people speaking different languages – letting them hold a conversation.

OpenAI’s GPT-4o is available right now for text-based conversations, and the upgraded Voice Mode is coming next week.

That’s according to OpenAI chief Sam Altman, who shared that the feature would only be available to ChatGPT Plus subscribers.

That’s a paid-for version of ChatGPT that costs $20 a month.

It’ll give you access to the “alpha” version of the feature, which means it’s still in testing.

It’s important to note that ChatGPT already has a voice mode that’s powered by the older GPT-4 model.

But GPT-4o is much smarter and, importantly, far quicker.

There’s a small delay when voice-chatting with ChatGPT using GPT-4.

ChatGPT’s astonishing new skills: The future of AI interaction

That will almost disappear entirely when you’re using the new GPT-4o version.

“It can respond to audio inputs in as little as 232 milliseconds, with an average of 320 milliseconds,” explained OpenAI.

“Which is similar to human response time in a conversation.”

That’s compared to Voice Mode with GPT-4, which had average delays of 5.4 seconds.

OpenAI chief Sam Altman confirmed that the new GPT-4o Voice Mode would be coming to ChatGPT 'next week' – but only for ChatGPT Plus subscribers

3

OpenAI chief Sam Altman confirmed that the new GPT-4o Voice Mode would be coming to ChatGPT ‘next week’ – but only for ChatGPT Plus subscribersCredit: AFP

What is ChatGPT?

ChatGPT is a new artificial intelligence tool

ChatGPT, which was launched in November 2022, was created by San Francisco-based startup OpenAI, an AI research firm.

It’s part of a new generation of AI systems.

ChatGPT is a language model that can produce text.

It can converse, generate readable text on demand and produce images and video based on what has been learned from a vast database of digital books, online writings and other media.

ChatGPT essentially works like a written dialogue between the AI system and the person asking it questions

GPT stands for Generative Pre-Trained Transformer and describes the type of model that can create AI-generated content.

If you prompt it, for example ask it to “write a short poem about flowers,” it will create a chunk of text based on that request.

ChatGPT can also hold conversations and even learn from things you’ve said.

It can handle very complicated prompts and is even being used by businesses to help with work.

But note that it might not always tell you the truth.

“ChatGPT is incredibly limited, but good enough at some things to create a misleading impression of greatness,” OpenAI CEO Sam Altman said in 2022.

You’ll be able to ask the GPT-4o to talk in different tones of voice or change its pace.

And it can even translate foreign languages in real time, acting as a translator between two people.

This means two people who speak totally different languages can have a conversation using the app’s Voice Mode.

You could also ask the new Voice Mode to explain something more simply, slow down its pace, and speak in the style of a friendly teacher.

What is ChatGPT Plus?

Here’s what you need to know…

  • ChatGPT Plus is the premium version of OpenAI’s chatbot
  • It costs $20 a month and comes with additional benefits
  • OpenAI says: “It offers availability even when demand is high, faster response speed, and priority access to new features.”
  • For instance, you’ll gain access to the more powerful GPT-4 language model.
  • You can browse, create and use GPTs
  • You’ll gain access to extra tools like DALL-E image generation
  • You can get access to current information courtesy of search engines
  • And you can have voice conversations with ChatGPT too

It’s unclear how long the new GPT-4o Voice Mode will remain in “alpha” testing for – or whether it’ll eventually be available to free users who don’t pay for the monthly subscription.

Latest article