Sep 25, 2023 at 8:05 PM

OpenAI has launched Voice Command and Image Upload features for ChatGPT Plus

OpenAI has unveiled new features for its AI chatbot, ChatGPT, including voice commands and image uploads. Initially, these features will be accessible to premium users (ChatGPT Plus), with a wider release scheduled later.

The voice command feature lets users ask questions verbally. The speech is transcribed into text using OpenAI's Whisper speech recognition system, processed by the chatbot, and answered in spoken form. ChatGPT will offer users a selection of five synthetic voices. While the company recognizes the potential of synthetic voice technology, it also acknowledges risks such as impersonation and fraud. It therefore plans to restrict the technology to specific use cases and partnerships, such as its recent collaboration with Spotify, which uses OpenAI's text-to-speech technology to translate podcasts into multiple languages while preserving the original podcaster's voice.

On the other hand, the chatbot's new image upload feature lets users submit a photo and receive responses based on the image, much like Google Lens. Users can also use drawing tools or text and voice queries to specify their image-based questions, enabling a dialogue with the AI. OpenAI has deliberately limited ChatGPT's capacity to analyze and comment directly on people, for accuracy and privacy reasons.

The update will roll out to Plus subscribers over the next two weeks. During that time, they will be able to hold voice conversations with ChatGPT (iOS and Android) and include images in their conversations (all platforms).

Sep 25, 2023 by Mauricio B. Holguin

MORE ABOUT: #AI Chatbots #Large Language Model (LLM) Tools #AI Writing Tools #ChatGPT

ChatGPT

AI Chatbot
Freemium
Proprietary

ChatGPT is an AI chatbot and writing tool developed by OpenAI. It is engineered to produce human-like text in response to user input, using OpenAI's GPT-3.5 and GPT-4 language models to generate text in numerous styles and formats. It has a rating of 4.5. Key features include AI-powered capabilities, chatbot functionality, and web-based operation. Notable alternatives to ChatGPT are Google Bard, ChatSonic, and HuggingChat.

External links

ChatGPT can now see, hear, and speak
OpenAI Blog • Official source
OpenAI's Thread on X
@OpenAI on X • Official source
OpenAI Is Rolling Out Two New Ways to Chat With ChatGPT
Lifehacker
OpenAI gives ChatGPT an update that allows it to hear, see, and speak like a human
CNN
OpenAI’s ChatGPT chatbot now supports prompting with voice and images
The Verge
