/voice-cloning and /voice-design routes redirect into this
workspace when Voice Studio is enabled.
What You Can Do
| Workflow | What it does |
|---|---|
| Saved voices | List, preview, reuse, and delete voices saved from cloning or design workflows. |
| Built-in voices | Browse model-provided speaker presets and generate preview samples. |
| New Voice | Start a cloned voice from reference audio or a designed voice from a text prompt. |
| Models | Download, load, unload, or switch the models needed for built-in voices and voice design. |
| Use in TTS | Send a saved voice or built-in speaker directly to the Text to Speech page. |
Using the Web UI
- Open
http://localhost:8080. - Select Voices in the sidebar.
- Use the Saved, Built-in, or All views to find a voice.
- Select New Voice to create a cloned or designed voice.
- Select Use in TTS to continue generation in Text to Speech.
Saved Voices
Saved voices are reusable voice assets stored by the local server. A saved voice can come from:- A reference-audio voice clone
- A prompt-designed voice
- A generated voice source saved through the API
/v1/voices, and izwi tts can reuse a saved voice with
--saved-voice-id.
Built-In Voices
Built-in voices are model-provided speaker presets. Voice Studio shows the available speakers for the selected model, can generate short preview samples, and can send a speaker into Text to Speech. Use built-in voices when you want quick, repeatable presets without reference audio or a design prompt. Use saved voices when you want a custom reusable voice asset. See Voice Presets for the current speaker IDs.Creating New Voices
Select New Voice from Voice Studio to choose a creation flow:| Flow | Input | Best for |
|---|---|---|
| Clone | Reference audio plus reference transcript | Reusing an existing permitted voice |
| Design | Natural-language voice description | Creating a new synthetic character or style |
API Routes
| Route family | Purpose |
|---|---|
/v1/voices | Saved voice list/create/delete/audio routes |
/v1/voice-clones | Persisted voice-cloning generation history |
/v1/voice-designs | Persisted voice-design generation history |
/v1/audio/speech | Immediate generation with built-in speakers, saved voices, reference audio, or design instructions |