ChatGPT Advanced Voice Mode


San Francisco, CA – August 5, 2024 – In a significant step forward for AI technology, OpenAI has launched ChatGPT’s advanced voice mode, a feature that lets users hold real-time, human-like voice conversations with the AI. The new mode is currently available to a small group of ChatGPT Plus subscribers.

What is It?

OpenAI’s advanced voice mode is a cutting-edge feature integrated into the ChatGPT app, enabling users to engage in natural, real-time conversations with the AI chatbot. This new feature allows for dynamic voice interaction, where users can speak directly to ChatGPT, receiving real-time responses that are not only informative but also emotionally resonant, thanks to the AI’s ability to mimic human speech patterns and non-verbal cues like tone and intonation.

How to Use It

Using ChatGPT’s new voice mode is straightforward. Users can activate it by tapping the microphone icon located at the bottom-left of the screen. After granting microphone permission, users can start a voice chat with ChatGPT, which will respond in real time. This feature is designed to offer a smooth and intuitive user experience, making it possible for users to switch between text and voice interaction seamlessly.

OpenAI has also introduced preset voices, recorded by professional voice actors, to enhance the realism of the interactions. One preset voice, named “Sky,” has since been removed after accusations that it bore an uncanny resemblance to the voice of actress Scarlett Johansson, who voiced an AI-powered operating system in the movie Her. The voice mode also supports multiple languages, catering to a global audience and expanding the AI’s use cases.

According to the OpenAI website, CEO Sam Altman released a statement on the resemblance to Scarlett Johansson’s voice:

“The voice of Sky is not Scarlett Johansson’s, and it was never intended to resemble hers. We cast the voice actor behind Sky’s voice before any outreach to Ms. Johansson. Out of respect for Ms. Johansson, we have paused using Sky’s voice in our products. We are sorry to Ms. Johansson that we didn’t communicate better.”

Standout Features

One of the most remarkable aspects of ChatGPT’s advanced voice mode is its ability to handle real-time voice conversations with high safety and reliability standards. The AI has been designed to understand and respond to complex queries while maintaining a conversational flow that feels natural and engaging. This is part of OpenAI’s iterative deployment strategy, ensuring the feature meets the highest safety and reliability bar before being rolled out to a broader audience.

The new mode also includes innovative features like sound effects and the ability to recognize and respond to emotional tones, making interactions more lifelike. Additionally, OpenAI has incorporated screen-sharing support and custom instructions, enhancing the overall user experience.

The human-like voice even takes natural pauses in conversation to “breathe.” One user posted a demonstration of this on X.

The Road Ahead

Currently, this advanced voice mode is in the alpha phase, being tested by a small group of ChatGPT Plus users, with plans for a broader rollout in the coming weeks. Sam Altman has indicated that the company is committed to refining this cutting-edge feature based on user feedback, ensuring it meets the needs of a diverse user base. Android users can expect to see this feature on a rolling basis, with exact timelines yet to be announced.

About OpenAI and ChatGPT

OpenAI, a leader among AI companies, has been at the forefront of artificial intelligence research and development. Founded with the mission to ensure that AI benefits all of humanity, OpenAI has continually pushed the boundaries of what AI can achieve. ChatGPT, one of its flagship products, has evolved from a simple chatbot into a sophisticated AI model capable of handling everything from casual conversation to solving complex math problems.

In recent months, OpenAI has introduced several technological advancements, including the new voice mode, which enhances the model’s ability to interact with users in a more human-like manner. As part of its commitment to user safety and data privacy, OpenAI has implemented robust data controls and settings, ensuring that voice data and personal information are handled with the utmost care.

With the debut of ChatGPT’s advanced voice mode, OpenAI is once again setting a new standard for AI interactions, offering users a more immersive and personalized experience. As this feature continues to evolve, it promises to unlock new possibilities for how we interact with technology, from smart home management to customer service, and beyond.

Photo by Solen Feyissa on Unsplash
