It’s been a couple of days since OpenAI rolled out ChatGPT’s new advanced voice mode, and the small group of ChatGPT Plus subscribers given access to it seems pretty impressed so far. Various clips of the feature in action have appeared online, demonstrating its ability to sing, imitate accents, correct language pronunciation, and perform narrative storytelling.
An example of the latter can be seen in the below videos, in which X user @nickfloats asks ChatGPT to “tell me a story as if you’re an airline pilot telling it to passengers on a flight.” The chatbot jumps into action barely a second later and even alters the audio to sound more like it’s coming from an intercom. ChatGPT struggled to accommodate more complex requests like layering on engine sounds, but the voice itself is clear and emotive, and ChatGPT handles user interruptions well.
In a conversation uploaded to YouTube, ChatGPT says it can handle inputs in “dozens of languages,” but the exact number can vary “depending on how you count dialects and regional variations.” One clip demonstrates the chatbot’s ability to correct the pronunciation of French words, giving specific pointers on adjusting inflection. Another language demo shows ChatGPT speaking Turkish after following a detailed request to tell an emotive story. While some Turkish X users noted that the accent didn’t sound native, it was able to complete the story request and react appropriately by laughing and crying at certain points.
A few different male- and female-sounding voices were present across these demonstrations, though these notably don’t include the Scarlett Johansson-like “Sky” voice that was pulled from the service in May.
As for anyone who feels left out of these fun demonstrations, OpenAI spokesperson Taya Christianson told The Verge that advanced voice mode will be available to all ChatGPT Plus subscribers (which costs $20 per month) sometime this fall.