OpenAI had initially revealed plans for the voice feature back in May, which got quite a bit of attention due to a voice closely resembling Scarlett Johansson’s from the movie Her. That, however, was short-lived.
read more
ChatGPT is stepping up its game with a brand-new voice feature, making conversations with the AI more natural and fluid. OpenAI announced that its popular chatbot can now engage in audio chats, but for now, it’s only available for those who subscribe to the premium service.
This advanced voice feature offers a smoother experience, allowing users to have real-time conversations, with the AI able to pause if interrupted — a pretty cool feature for anyone who enjoys a back-and-forth chat with their tech.
While this exciting new capability is being rolled out over the week, it’s not yet available in the EU, the UK, and a few other European countries. OpenAI had initially revealed plans for the voice feature back in May, which got quite a bit of attention due to a voice closely resembling Scarlett Johansson’s from the movie Her.
That, however, was short-lived, as legal action was quickly taken, and the company had to pause using the voice. Since then, though, users on the free tier have been able to play around with other voices. The advanced version offers nine voices and lets users customise instructions for these chats in the app’s settings.
In a playful nod to the wait, OpenAI’s co-founder and CEO Sam Altman posted on X (formerly Twitter), saying, “Hope you think it was worth the wait.”
Rising Competition in AI Voice Tech
OpenAI isn’t the only player in the AI voice game, though. Google recently released its Gemini Live voice feature, and Meta is getting in on the action too, with plans to launch celebrity voices through popular platforms like Facebook, Instagram, and WhatsApp later this week.
This competitive landscape highlights how important AI-powered voice features have become, especially for tech giants. OpenAI, backed by Microsoft, continues to lead the charge, having gained a massive head start with ChatGPT since its launch in late 2022. The chatbot already boasts over 200 million weekly active users, according to an August report.
However, the voice feature is only available to those who subscribe to OpenAI’s Plus, Team, or Enterprise plans, with the most affordable option priced at $20 per month.
Upgrades for GPT-4o Mini
In addition to voice features, OpenAI has rolled out some significant updates for its smaller GPT-4o mini model. Previously considered less powerful, the mini model has now been enhanced to offer four major features previously reserved for the larger GPT-4o version.
Firstly, the mini model can now generate images from text prompts using the DALL-E 3 model, similar to its bigger sibling. This upgrade is expected to be a hit with users looking for quicker image generation without sacrificing quality.
Secondly, the mini model now has internet browsing capabilities, meaning users can access up-to-date information and conduct real-time research. This upgrade brings it closer to the larger GPT-4o in terms of functionality, giving users more flexibility when fact-checking or gathering information.
Another new feature is the ability to upload and analyse documents and pictures, which will make working with complex visual data much easier. Users can now interact with both texts and visuals, opening the door for educational and personal use cases.
Lastly, the GPT-4o mini model can now remember past conversations with users, much like its more advanced counterparts. This memory feature enables the model to provide more relevant follow-up responses and recognise user preferences, making long-term interactions smoother.