xAI launches Custom Voices for instant voice cloning in TTS and agents

xAI launches Custom Voices for instant voice cloning in TTS and agents

xAI has introduced Custom Voices, allowing users to create voice clones by recording about one minute of natural speech in the xAI console, then use them across Grok Text-to-Speech and Voice Agent APIs. The process takes under two minutes and includes verification, processing, and delivery of a production-ready model. Once generated, custom voices can be used anywhere xAI’s built-in voices are supported.

To address voice security concerns, xAI uses a two-stage verification process. Users first read a passphrase, which is transcribed in real time to confirm consent and presence. The system then compares speaker data from the passphrase and full recording to confirm both belong to the same person, preventing cloning from pre-existing recordings or unauthorized samples.

Custom voices support speech tags, multilingual output, REST API access, and WebSocket streaming, with use cases including creator narration, brand voice agents, accessibility, gaming, and audiobook production. xAI also introduced Voice Library, a console section for managing and previewing built-in and custom voices, with more than 80 built-in voices across 28 languages and no extra charge for using custom voices with its APIs.

by Mauricio B. Holguin

dailylenssoul1472
dailylens found this interesting
Grok iconGrok
  58
  • ...

Grok is an AI-powered assistant developed by xAI, designed to provide truthful and useful insights. It allows users to ask questions, generate images, and upload pictures for analysis. Grok operates ad-free and requires no coding skills. Despite its features, it holds a rating of 2.4.

Comments

DailyLens.app
1

Just finished our first internal beta tests and I have to admit – the transcription is blazing fast and really gets the job done. Pricing-wise, it's quite a bit more expensive than a standard Whisper approach, but still comes out cheaper than Deepgram. A very solid middle-ground option.

Review by a new / low-activity user.
Gu