San Francisco, CA – August 5, 2024 In a significant step forward for AI technology, OpenAI has launched ChatGPT’s advanced voice mode, an innovative feature that redefines user interaction with AI. This new mode, currently available to a small group of ChatGPT Plus subscribers, allows for real-time voice conversations, bringing a human-like interaction experience to users worldwide.
What is It?
OpenAI’s advanced voice mode is a cutting-edge feature integrated into the ChatGPT app, enabling users to engage in natural, real-time conversations with the AI chatbot. This new feature allows for dynamic voice interaction, where users can speak directly to ChatGPT, receiving real-time responses that are not only informative but also emotionally resonant, thanks to the AI’s ability to mimic human speech patterns and non-verbal cues like tone and intonation.
How to Use It
Using ChatGPT’s new voice mode is straightforward. Users can activate it by tapping the microphone icon located at the bottom-left of the screen. After granting microphone permission, users can start a voice chat with ChatGPT, which will respond in real time. This feature is designed to offer a smooth and intuitive user experience, making it possible for users to switch between text and voice interaction seamlessly.
OpenAI has also introduced preset voices, including professional voice actors, to enhance the realism of the interactions. One preset voice named “Sky” has since been removed after accusations that it had an uncanny resemblance to actress Scarlett Johansson who voiced the role of an AI-powered operating system in the movie Her. The voice mode also supports different languages, catering to a global audience and expanding the AI’s use cases.
According to the OpenAI website, CEO Sam Altman, released a statement on the use of Scarlett Johansson’s voice:
“The voice of Sky is not Scarlett Johansson’s, and it was never intended to resemble hers. We cast the voice actor behind Sky’s voice before any outreach to Ms. Johansson. Out of respect for Ms. Johansson, we have paused using Sky’s voice in our products. We are sorry to Ms. Johansson that we didn’t communicate better.”
Standout Features
One of the most remarkable aspects of ChatGPT’s advanced voice mode is its ability to handle real-time voice conversations with high safety and reliability standards. The AI has been designed to understand and respond to complex queries while maintaining a conversational flow that feels natural and engaging. This is part of OpenAI’s iterative deployment strategy, ensuring the feature meets the highest safety and reliability bar before being rolled out to a broader audience.
The new mode also includes innovative features like sound effects and the ability to recognize and respond to emotional tones, making interactions more lifelike. Additionally, OpenAI has incorporated screen-sharing support and custom instructions, enhancing the overall user experience.
The human-like voice even takes natural pauses in conversation to “breathe.” One user posted such a demonstration on X:
ChatGPT Advanced Voice Mode counting as fast as it can to 10, then to 50 (this blew my mind – it stopped to catch its breath like a human would) pic.twitter.com/oZMCPO5RPh
— Cristiano Giardina (@CrisGiardina) July 31, 2024
The Road Ahead
Currently, this advanced voice mode is in the alpha phase, being tested by a small group of ChatGPT Plus users, with plans for a broader rollout in the coming weeks. Sam Altman has indicated that the company is committed to refining this cutting-edge feature based on user feedback, ensuring it meets the needs of a diverse user base. Android users can expect to see this feature on a rolling basis, with exact timelines yet to be announced.
About OpenAI and ChatGPT
OpenAI, a leader among AI companies, has been at the forefront of artificial intelligence research and development. Founded with the mission to ensure that AI benefits all of humanity, OpenAI has continually pushed the boundaries of what AI can achieve. ChatGPT, one of its flagship products, has evolved from a simple chatbot into a sophisticated AI model capable of handling everything from casual conversation to solving complex math problems.
In recent months, OpenAI has introduced several technological advancements, including the new voice mode, which enhances the model’s ability to interact with users in a more human-like manner. As part of its commitment to user safety and data privacy, OpenAI has implemented robust data controls and settings, ensuring that voice data and personal information are handled with the utmost care.
With the debut of ChatGPT’s advanced voice mode, OpenAI is once again setting a new standard for AI interactions, offering users a more immersive and personalized experience. As this feature continues to evolve, it promises to unlock new possibilities for how we interact with technology, from smart home management to customer service, and beyond.
Photo by Solen Feyissa on Unsplash
Related: