
ChatGPT adds video and screenshare capabilities to Advanced Voice Mode for visual context
OpenAI has expanded ChatGPT Advanced Voice Mode (AVM) with video and screenshare capabilities, building on the audio features introduced with GPT-4o in May. The update lets users point their phone cameras at their surroundings, giving ChatGPT visual context so it can "see" what the user sees and respond accordingly.
During a livestream, OpenAI's Chief Product Officer Kevin Weil demonstrated the new features by showing how AVM could guide a user through making pour-over coffee, recognizing the coffee maker and offering step-by-step instructions.
The update, which also includes playful touches like a seasonal Santa voice and the ability to interpret screenshared messages, is currently available to ChatGPT Plus and Pro users, with broader access planned for January.