(Reuters) – ChatGPT maker OpenAI said on Tuesday it was delaying the release of its “Voice Mode” feature by a month to July due to technical problems.
The company originally planned to roll out the realistic voice calling experience to a small group of ChatGPT Plus users in late June, but said it postponed this as it needed time to reach launch standard.
“For example, we’re improving the model’s ability to detect and reject certain content. We’re also working to improve the user experience and prepare our infrastructure to scale to millions while maintaining real-time responses,” OpenAI said in a post on social media platform
The feature will initially be released to a small group of users to gather feedback and will be made available to all Plus users in the fall, subject to security and reliability checks, the company said.
OpenAI is also working on rolling out new video and screen sharing capabilities.
In May, it said it would release a new AI model called GPT-4o capable of realistic voice conversations and text-to-image interaction, the latest move to stay ahead in the race to adopt emerging AI technology dominate.
The new audio capabilities allow users to talk to ChatGPT and get real-time answers without delay, and pause ChatGPT while it speaks – both features of realistic conversations that AI voice assistants have found challenging.