OpenAI API Adds GPT-Realtime-2, Live Translation and Transcription Models

OpenAI has expanded its API with three new voice intelligence models for real-time voice applications. The release includes GPT-Realtime-2, a conversational model designed for realistic spoken interactions with GPT-5-class reasoning, allowing it to handle more complex live conversations than earlier versions.

The company also introduced GPT-Realtime-Translate for real-time spoken translation, with support for more than 70 input languages and 13 output languages. The third model, GPT-Realtime-Whisper, adds live speech-to-text transcription, letting applications capture spoken interactions as they happen.

GPT-Realtime-2 is billed by token usage, while translation and transcription are billed by the minute. All three models are available through OpenAI’s Realtime API and can support use cases such as customer service, education, media, events, and creator platforms. OpenAI also noted potential risks around spam, fraud, and online abuse, adding safety triggers that can halt conversations if harmful content is detected.
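
To make the difference between the two billing schemes concrete, here is a minimal Python sketch of how a developer might estimate costs under each. The article gives no prices, so every rate below is a placeholder, and the names `TokenRate`, `token_cost`, and `minute_cost` are hypothetical helpers, not part of OpenAI's API. The sketch also assumes per-minute usage is prorated by the second, which the announcement does not confirm.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TokenRate:
    """Hypothetical per-token pricing, in USD per million tokens."""
    usd_per_million_input: float
    usd_per_million_output: float


def token_cost(input_tokens: int, output_tokens: int, rate: TokenRate) -> float:
    """Estimate the cost of a token-billed session (the GPT-Realtime-2 scheme)."""
    total = (input_tokens * rate.usd_per_million_input
             + output_tokens * rate.usd_per_million_output)
    return total / 1_000_000


def minute_cost(audio_seconds: float, usd_per_minute: float) -> float:
    """Estimate the cost of a minute-billed session (translation or transcription).

    Assumes usage is prorated by the second; actual rounding rules may differ.
    """
    return audio_seconds / 60 * usd_per_minute


if __name__ == "__main__":
    # Placeholder rates chosen only for illustration, not real prices.
    example_rate = TokenRate(usd_per_million_input=2.0, usd_per_million_output=4.0)
    print(token_cost(1_000_000, 500_000, example_rate))
    print(minute_cost(90, usd_per_minute=0.5))
```

The practical difference is that token-billed costs depend on how much is said and generated, while minute-billed costs depend only on how long the audio runs.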

by Mauricio B. Holguin

OpenAI Platform offers an API to integrate GPT-5 series models for diverse applications such as text generation, image analysis and creation, audio processing, and structured output production. It also supports building tool-using agents. Key features include API Integration and Management. Notable alternatives include Google Cloud AI, Microsoft Azure AI, and IBM Watson.
