What is Izwi?

Izwi is a privacy-focused audio AI platform that runs entirely on your machine: no cloud services, no API keys, and no data leaving your device.

Key capabilities:
  • Voice Mode — Real-time voice conversations with AI
  • Chat — Text-based AI conversations
  • Transcription — Convert audio to text with high accuracy
  • Voices — Manage built-in, cloned, and designed voices
  • Text-to-Speech — Generate natural speech from text
  • Voice Cloning — Clone any voice from a short audio sample
  • Voice Design — Create custom voices from text descriptions
  • Diarization — Identify and separate multiple speakers

Section                 Description
Getting Started         Install Izwi and run your first command
Installation            Platform-specific installation guides
Runtime Support Matrix  Supported OS, hardware, artifact, and API surfaces
API Reference           Stable, preview, first-party, operator, and realtime HTTP/WebSocket APIs
Features                Learn about each feature in detail
Models                  Understand and manage AI models
CLI Reference           Complete command-line reference
Troubleshooting         Common issues and solutions

System Requirements

Requirement  Minimum            Recommended
macOS        12.0+ (Monterey)   14.0+ (Sonoma)
Linux        Ubuntu 20.04+      Ubuntu 22.04+
Windows      Windows 10         Windows 11
RAM          8 GB               16 GB+
Storage      10 GB free         50 GB+ free
GPU          Apple Silicon / NVIDIA GPU (see support matrix)
Note: Izwi is optimized for Apple Silicon Macs with Metal acceleration. NVIDIA CUDA support exists in the runtime, but artifact-level support varies by source build, Docker image, and release package. See the Runtime Support Matrix.
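
As a quick way to read the table above, the RAM and storage rows can be expressed as a small check. This is a minimal sketch and not part of Izwi: the names `classify`, `check_requirements`, and `free_storage_gb` are hypothetical, the thresholds are copied from the table, and the RAM figure is supplied by hand because the Python standard library has no portable call for it.

```python
import shutil

# Thresholds copied from the System Requirements table (values in GB).
MINIMUM = {"ram_gb": 8, "storage_gb": 10}
RECOMMENDED = {"ram_gb": 16, "storage_gb": 50}


def classify(value, minimum, recommended):
    """Return 'below minimum', 'minimum', or 'recommended' for one resource."""
    if value < minimum:
        return "below minimum"
    if value < recommended:
        return "minimum"
    return "recommended"


def check_requirements(ram_gb, free_storage_gb):
    """Compare a machine's RAM and free storage against the table (hypothetical helper)."""
    return {
        "ram": classify(ram_gb, MINIMUM["ram_gb"], RECOMMENDED["ram_gb"]),
        "storage": classify(
            free_storage_gb, MINIMUM["storage_gb"], RECOMMENDED["storage_gb"]
        ),
    }


def free_storage_gb(path="."):
    """Free disk space at `path` in GB (10^9 bytes), read via shutil.disk_usage."""
    return shutil.disk_usage(path).free / 1e9


if __name__ == "__main__":
    # ram_gb is an example value entered by hand, not detected.
    print(check_requirements(ram_gb=16, free_storage_gb=free_storage_gb()))
```

The check covers only RAM and storage. OS version and GPU support depend on the build or package in use, so the Runtime Support Matrix remains the reference for those.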

Getting Help


License

Izwi is open source software licensed under Apache 2.0.