Kitten TTS v0.8: Lightweight 15M-80M Parameter Text-to-Speech on CPU

KittenML released Kitten TTS v0.8 with three new open-source ONNX-based text-to-speech models (15M, 40M, 80M parameters) requiring no GPU. Models range from 25 MB to 80 MB on disk and deliver high-quality voice synthesis in 8 voices across Raspberry Pi, smartphones, wearables, and browsers. Targets on-device AI applications without cloud dependency.

Key Takeaways

  • Three new models: 15M parameters (<25 MB int8), 40M parameters (41 MB), 80M parameters (80 MB); int8 quantization support
  • CPU-optimized inference via ONNX; 8 built-in voices; 24 kHz output; speech speed adjustment; runs on Raspberry Pi, mobile, wearables, browsers
  • Production-ready developer preview; multi-lingual release planned; commercial support available through Stellon Labs

Original source: Hacker News / KittenML GitHub