Models

Current Model Catalog

Use `izwi list` (or `GET /v1/models`) to see the live, currently enabled catalog. Both show only variants that are enabled for download and use.

Izwi accepts many legacy aliases (for example lowercase IDs), but the canonical IDs below match `izwi list` output.
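The two ways of reading the live catalog can be sketched as below. This is a minimal sketch, not a documented recipe: it assumes the API is served at the same address as the web UI (`http://localhost:8080`), and `IZWI_BASE_URL` is a hypothetical variable used only by this script, not an Izwi setting. The response format of `GET /v1/models` is not described on this page, so the body is printed unparsed.

```shell
# Assumption: the API shares the web UI address; adjust for your setup.
# IZWI_BASE_URL is a hypothetical variable used only by this sketch.
base_url="${IZWI_BASE_URL:-http://localhost:8080}"

# Build the catalog endpoint URL from a base URL, tolerating a trailing slash.
models_endpoint() {
  printf '%s/v1/models\n' "${1%/}"
}

# Option 1: the CLI, if it is installed.
if command -v izwi >/dev/null 2>&1; then
  izwi list || echo "izwi list failed" >&2
fi

# Option 2: the HTTP API; print the raw response body.
if command -v curl >/dev/null 2>&1; then
  curl -fsS --max-time 5 "$(models_endpoint "$base_url")" \
    || echo "Izwi server not reachable at $base_url" >&2
fi
```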

Text-to-Speech (TTS)

Family	Canonical IDs
Qwen3 Base (reference-voice cloning)	`Qwen3-TTS-12Hz-0.6B-Base`, `Qwen3-TTS-12Hz-0.6B-Base-4bit`, `Qwen3-TTS-12Hz-1.7B-Base`, `Qwen3-TTS-12Hz-1.7B-Base-4bit`
Qwen3 CustomVoice (built-in speakers)	`Qwen3-TTS-12Hz-0.6B-CustomVoice`, `Qwen3-TTS-12Hz-0.6B-CustomVoice-4bit`, `Qwen3-TTS-12Hz-1.7B-CustomVoice`, `Qwen3-TTS-12Hz-1.7B-CustomVoice-4bit`
Qwen3 VoiceDesign	`Qwen3-TTS-12Hz-1.7B-VoiceDesign`, `Qwen3-TTS-12Hz-1.7B-VoiceDesign-4bit`
Voxtral TTS	`Voxtral-4B-TTS-2603`
VibeVoice TTS	`VibeVoice-1.5B`
Kokoro	`Kokoro-82M`

Kokoro-82M requires espeak-ng, which must be installed separately on macOS, Linux, or Windows.

Voxtral-4B-TTS-2603 includes bundled voice assets licensed under CC BY-NC 4.0 and supports 20 preset voices with 24 kHz output.

VibeVoice-1.5B is a Microsoft long-form TTS model with reference-voice cloning. It uses saved or direct reference voices rather than built-in speaker presets.

For built-in speaker IDs, see Voice Presets.

Speech Recognition (ASR)

Model	Notes
`Parakeet-TDT-0.6B-v3`	CLI default for transcription/diarization ASR
`Whisper-Large-v3-Turbo`	Whisper ASR option
`Qwen3-ASR-0.6B-GGUF`	Smaller Qwen3 ASR
`Qwen3-ASR-1.7B-GGUF`	Higher-accuracy Qwen3 ASR
`VibeVoice-ASR`	Microsoft long-form ASR checkpoint
`Nemotron-3.5-ASR-Streaming-0.6B`	NVIDIA multilingual FastConformer-RNNT `.nemo`; native artifact/config/tokenizer and streaming-state support
`Granite-Speech-4.1-2B-Plus`	IBM Granite Speech rich transcription model with prompt guidance, speaker-attributed output, and word timestamp support
`LFM2.5-Audio-1.5B-GGUF`	Unified audio model (ASR + speech generation)
`Voxtral-Mini-4B-Realtime-2602`	Mistral Voxtral offline transcription; realtime support planned

Diarization and Alignment

Task	Model
Speaker diarization	`diar_streaming_sortformer_4spk-v2.1`
Forced alignment	`Qwen3-ForcedAligner-0.6B`, `Qwen3-ForcedAligner-0.6B-4bit`

Chat

Family	Canonical IDs
Qwen3 GGUF	`Qwen3-0.6B-GGUF`, `Qwen3-1.7B-GGUF`, `Qwen3-4B-GGUF`, `Qwen3-8B-GGUF`
Qwen3.5 GGUF	`Qwen3.5-0.8B`, `Qwen3.5-2B`, `Qwen3.5-4B`, `Qwen3.5-9B`
LFM2.5 text	`LFM2.5-1.2B-Instruct-GGUF`, `LFM2.5-1.2B-Thinking-GGUF`
Gemma	`Gemma-3-1b-it`

Currently Disabled (Not Listed by `izwi list`)

These variants exist in the catalog but are not currently enabled for standard listing/download:

Legacy Qwen3 chat IDs: `Qwen3-0.6B`, `Qwen3-0.6B-4bit`, `Qwen3-1.7B`, `Qwen3-1.7B-4bit`
`Qwen3-14B-GGUF`
`Gemma-3-4b-it`
TTS 8-bit and BF16 metadata variants such as `Qwen3-TTS-12Hz-0.6B-Base-8bit` and `Qwen3-TTS-12Hz-1.7B-VoiceDesign-bf16`. Selected 4-bit variants are the standard low-memory downloads exposed by `izwi list`.

Downloading Models

Via CLI

# List enabled catalog models
izwi list

# Download a model
izwi pull Qwen3-TTS-12Hz-0.6B-Base

# Download an ASR model
izwi pull Qwen3-ASR-0.6B-GGUF

# Download NVIDIA Nemotron 3.5 ASR
izwi pull Nemotron-3.5-ASR-Streaming-0.6B

# Download IBM Granite Speech rich ASR
izwi pull Granite-Speech-4.1-2B-Plus

# Download Microsoft VibeVoice models
izwi pull VibeVoice-1.5B
izwi pull VibeVoice-ASR
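
When several models are needed at once, the individual `izwi pull` calls above can be wrapped in a small loop. The sketch below assumes `izwi pull` exits non-zero on failure, which this page does not confirm. `pull_models` and `DRY_RUN` are names made up for this example, not Izwi features.

```shell
# Pull each model ID given as an argument, stopping at the first failure.
# Set DRY_RUN=1 (a variable of this sketch only) to print the commands
# instead of running them.
pull_models() {
  for model_id in "$@"; do
    if [ "${DRY_RUN:-0}" = "1" ]; then
      echo "izwi pull $model_id"
    else
      # Assumption: a failed download returns a non-zero exit code.
      izwi pull "$model_id" || return 1
    fi
  done
}

# Preview the pulls for a small TTS + ASR setup.
DRY_RUN=1 pull_models Qwen3-TTS-12Hz-0.6B-Base Parakeet-TDT-0.6B-v3
```

Dropping `DRY_RUN=1` from the last line performs the downloads.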

Via Web UI

Open http://localhost:8080
Go to Models in the sidebar
Click Download on a model

Managing Models

For the complete UI, CLI, and API workflow, see Model Management.

View Downloaded Models

izwi list --local

Get Model Information

izwi models info Qwen3-TTS-12Hz-0.6B-Base

Load a Model into Memory

izwi models load Qwen3-TTS-12Hz-0.6B-Base

Unload a Model

izwi models unload Qwen3-TTS-12Hz-0.6B-Base

Delete a Model

izwi rm Qwen3-TTS-12Hz-0.6B-Base
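
The load and unload commands above pair naturally around a unit of work. The following is a rough sketch of that pattern. It assumes `izwi models load` returns only once the model is usable, which is not stated here. `run_izwi`, `with_loaded_model`, and `DRY_RUN` are hypothetical helpers for this example.

```shell
# Run one izwi subcommand, or just print it when DRY_RUN=1.
run_izwi() {
  if [ "${DRY_RUN:-0}" = "1" ]; then
    echo "izwi $*"
  else
    izwi "$@"
  fi
}

# Load a model, run a caller-supplied command, then unload the model again.
# The caller's exit code is preserved so failures are not hidden by the unload.
with_loaded_model() {
  model_id="$1"
  shift
  run_izwi models load "$model_id" || return 1
  "$@"
  rc=$?
  run_izwi models unload "$model_id"
  return "$rc"
}

# Dry run: shows the load, the wrapped command's output, then the unload.
DRY_RUN=1 with_loaded_model Qwen3-TTS-12Hz-0.6B-Base echo "model in use"
```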

Model Storage

Platform	Location
macOS	`~/Library/Application Support/izwi/models/`
Linux	`~/.local/share/izwi/models/`
Windows	`%APPDATA%\izwi\models\`

Custom Model Directory

# CLI flag
izwi serve --models-dir /path/to/models

# Environment variable
export IZWI_MODELS_DIR=/path/to/models
izwi serve
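
To find out where models currently live, for example to check disk usage, the storage table and the `IZWI_MODELS_DIR` override can be combined in a script. This sketch mirrors the table above for macOS and Linux only, and it cannot see a `--models-dir` flag passed to a running server. The function names are illustrative, and the precedence shown (environment variable first, then platform default) is an assumption about how Izwi resolves the directory.

```shell
# Print the default model directory for an OS name as reported by `uname -s`.
default_models_dir() {
  case "$1" in
    Darwin) echo "$HOME/Library/Application Support/izwi/models" ;;
    Linux)  echo "$HOME/.local/share/izwi/models" ;;
    *)      return 1 ;;  # Windows (%APPDATA%\izwi\models\) is not handled here
  esac
}

# Prefer IZWI_MODELS_DIR when set, otherwise fall back to the platform default.
resolve_models_dir() {
  if [ -n "${IZWI_MODELS_DIR:-}" ]; then
    echo "$IZWI_MODELS_DIR"
  else
    default_models_dir "$(uname -s)"
  fi
}

# Report disk usage if the directory exists.
if models_dir="$(resolve_models_dir)" && [ -d "$models_dir" ]; then
  du -sh "$models_dir"
else
  echo "no model directory found"
fi
```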

Manual Downloads

Some models (for example Gemma) may require manual Hugging Face access setup. See Manual Model Downloads and Manual Download: Gemma 3 1B Instruct.

Model Status

Status	Description
not_downloaded	Available but not on disk
downloading	Currently downloading
downloaded	On disk but not loaded
loading	Being loaded into memory
ready	Loaded and ready for inference

Check status:

izwi status --detailed
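
The status table reads as a simple progression toward `ready`. A sketch of that progression as a lookup follows. It takes the status string as an argument rather than parsing `izwi status --detailed`, whose output format is not documented on this page. `next_step` is a hypothetical helper.

```shell
# Map a model status to the command that would move it toward "ready".
next_step() {
  state="$1"
  model_id="$2"
  case "$state" in
    not_downloaded)      echo "izwi pull $model_id" ;;
    downloaded)          echo "izwi models load $model_id" ;;
    downloading|loading) echo "wait" ;;   # already in progress
    ready)               echo "none" ;;   # nothing left to do
    *)                   echo "unknown status: $state" >&2; return 1 ;;
  esac
}

next_step downloaded Qwen3-ASR-0.6B-GGUF
# prints: izwi models load Qwen3-ASR-0.6B-GGUF
```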

Quantization Notes

`-4bit` / `-8bit` / `-bf16` are reduced-precision variants.
`-GGUF` variants are quantized GGUF artifacts.
Smaller/quantized variants reduce memory and disk use at some quality/accuracy tradeoff.
`izwi list` shows enabled variants only. Some catalog metadata exists for experimental 8-bit/BF16 TTS artifacts, but the standard downloadable low-memory TTS variants are the explicit `-4bit` entries shown above.
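
The suffix conventions above can be checked mechanically when scripting over model IDs. The sketch below is a heuristic based only on the ID suffix. IDs without one of these suffixes are reported as `base` even if the underlying artifact is quantized (for example the Qwen3.5 IDs, which are listed under a GGUF family but carry no `-GGUF` suffix). `variant_kind` is a name chosen for this example.

```shell
# Classify a model ID by its suffix. This is a naming heuristic only.
variant_kind() {
  case "$1" in
    *-4bit) echo "4bit" ;;
    *-8bit) echo "8bit" ;;
    *-bf16) echo "bf16" ;;
    *-GGUF) echo "gguf" ;;
    *)      echo "base" ;;
  esac
}

variant_kind Qwen3-TTS-12Hz-0.6B-Base-4bit
# prints: 4bit
```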

Next Steps

Model Management

Voice Presets

Manual Model Downloads

Manual Download: Gemma 3 1B Instruct