May 9, 2026 at 6:41 AM

OpenAI API Adds GPT-Realtime-2, Live Translation and Transcription Models

OpenAI has expanded its API with three new voice intelligence models for real-time voice applications. The release includes GPT-Realtime-2, a conversational model designed for realistic spoken interactions with GPT-5-class reasoning, allowing it to handle more complex live conversations than earlier versions.

The company also introduced GPT-Realtime-Translate for real-time spoken translation, with support for more than 70 input languages and 13 output languages. The third model, GPT-Realtime-Whisper, adds live speech-to-text transcription, letting applications capture spoken interactions as they happen.

GPT-Realtime-2 is billed by token usage, while translation and transcription are billed by the minute. All three models are available through OpenAI’s Realtime API and can support use cases such as customer service, education, media, events, and creator platforms. OpenAI also noted potential risks around spam, fraud, and online abuse, and said it has added safety triggers that can halt a conversation if harmful content is detected.
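
For developers budgeting across the two billing schemes, the difference can be sketched roughly as follows. This is only an illustration: the rates are placeholder numbers and not OpenAI prices, the helper names are hypothetical, and the assumption that partial minutes are prorated rather than rounded up is not confirmed by the announcement.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TokenRate:
    """Hypothetical per-token pricing; fill in from the official price list."""
    usd_per_million_tokens: float


@dataclass(frozen=True)
class MinuteRate:
    """Hypothetical per-minute pricing; fill in from the official price list."""
    usd_per_minute: float


def token_billed_cost(tokens_used: int, rate: TokenRate) -> float:
    """Cost for a token-billed model (GPT-Realtime-2, per the article)."""
    return tokens_used / 1_000_000 * rate.usd_per_million_tokens


def minute_billed_cost(audio_seconds: float, rate: MinuteRate) -> float:
    """Cost for a minute-billed model (translation or transcription).

    Assumes partial minutes are prorated, which may not match real billing.
    """
    return audio_seconds / 60 * rate.usd_per_minute


# Placeholder numbers only, NOT real OpenAI prices.
example_token_rate = TokenRate(usd_per_million_tokens=10.0)
example_minute_rate = MinuteRate(usd_per_minute=0.5)

conversation_cost = token_billed_cost(250_000, example_token_rate)
transcription_cost = minute_billed_cost(90, example_minute_rate)
print(f"conversation: ${conversation_cost:.2f}, transcription: ${transcription_cost:.2f}")
```

The practical point is that the cost of a GPT-Realtime-2 conversation depends on how many tokens are exchanged, while translation and transcription costs depend only on how long the audio runs.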

May 9, 2026 by Mauricio B. Holguin


OpenAI Platform

Paid
Proprietary

OpenAI Platform offers an API to integrate GPT-5 series models for diverse applications such as text generation, image analysis and creation, audio processing, and structured output production. It also supports building tool-using agents. Key features include API Integration and Management. Notable alternatives include Google Cloud AI, Microsoft Azure AI, and IBM Watson.

External links

Advancing voice intelligence with new models in the API
OpenAI Blog • Official source
OpenAI launches new voice intelligence features in its API
TechCrunch
OpenAI unveils three audio models for real-time voice tasks
Reuters
OpenAI has new voice models that reason, translate, and transcribe as you speak
9to5Mac
