Voicebox Studio

2 likes

The open-source voice synthesis studio.

Cost / License

Free
Open Source (MIT)

Origin

Canada

Platforms

Windows
Mac
Linux
Docker

Alternatives

2likes

0comments

26alternatives

0articles

Features

No features, maybe you want to suggest one?

Voicebox Studio News & Activities

Highlights All activities

Recent activities

niksavc liked Voicebox Studio
17 days ago
CoHarmonify added Voicebox Studio as alternative to CoHarmonify
17 days ago
muhammadfarag liked Voicebox Studio
18 days ago
muhammadfarag added Voicebox Studio
19 days ago
muhammadfarag added Voicebox Studio as alternative to ElevenLabs, Chatterbox TTS, Balabolka and Kokoro + 21 similar activities
20 days ago

Voicebox Studio information

Developed by
jamiepine
Licensing
Open Source (MIT) and Free product.
Written in
TypeScript
Alternatives
26 alternatives listed
Supported Languages
- English

GitHub repository

14,430 Stars
1,715 Forks
191 Open Issues
Updated Mar 31, 2026

View on GitHub

Popular alternatives

View all

Voicebox Studio was added to AlternativeTo by Muhammad Farag on Mar 22, 2026 and this page was last updated Mar 22, 2026.

No comments or reviews, maybe you want to be first?

What is Voicebox Studio?

Voicebox: The Open-Source Voice Cloning Studio

Voicebox is a powerful, local-first alternative to services like ElevenLabs, designed for high-fidelity voice cloning and speech synthesis. It functions as a comprehensive creative suite, allowing users to clone voices from seconds of audio, generate speech in 23 languages, and orchestrate complex audio projects via a multi-track timeline—all while running entirely on your own hardware.

Key Capabilities

Multi-Engine Synthesis Voicebox integrates five distinct Text-to-Speech (TTS) engines, allowing users to choose the best tool for the task:

Qwen3-TTS: High-quality multilingual cloning with support for delivery instructions (e.g., "whisper").
LuxTTS: A lightweight, ultra-fast engine optimized for 48kHz CPU generation.
Chatterbox (Multilingual & Turbo): Offers the broadest language support and paralinguistic tags for expressive speech (laughs, sighs, gasps).
TADA: A speech-language model designed for long-form, coherent audio (up to 700s+).

Advanced Audio Post-Processing Powered by Spotify’s pedalboard library, Voicebox includes eight real-time effects (Pitch Shift, Reverb, Compression, etc.). Users can build custom presets or use built-in profiles like "Radio" or "Robotic" to polish their clones.
Professional Workflow Tools

Unlimited Generation: Uses smart auto-chunking and crossfading to generate up to 50,000 characters without breaks.
Stories Editor: A multi-track timeline editor for composing podcasts, conversations, and narratives with drag-and-drop ease.
Version Control: Tracks "Takes" and "Effects versions" for every generation, ensuring the original clean output is always preserved.
Async Queue: A non-blocking generation system that allows you to queue multiple tasks without crashing your GPU.

Voice & Model Management

Profile Management: Create voice identities from recordings or files, supporting multi-sample inputs for higher cloning accuracy.
Recording & Transcription: Built-in system audio capture and Whisper-powered transcription for seamless content creation.
Hardware Efficiency: Local model management allows users to load/unload models to optimize VRAM usage.

Voicebox Studio

Cost / License

Origin

Platforms

Voicebox Studio

Features

Tags

Voicebox Studio News & Activities

Recent activities

Voicebox Studio information

Developed by

Licensing

Written in

Alternatives

Supported Languages

GitHub repository

Popular alternatives

What is Voicebox Studio?

Official Links

AppStores & Other Links

Social Networks