Spokes.wiki Search Graph Growth About

speech-audio-wiki

Software Application source ↗ source url updated Fri Jun 05 2026 00:00:00 GMT+0000 (Coordinated Universal Time)

MisoTTS

Miso Labs’ 8B open-weight emotive text-to-speech model (released 2026-06-03; modified MIT license, weights from day one). The seed source that opened this wiki’s cluster (parked in the hub _inbox, then migrated here on spin-out).

Architecture

Performance & limits

Place in the field

Emotive/expressive synthesis with a permissive-ish (open-weight-tts) license — competing on latency + expressivity. Its RVQ/Mimi codec approach is shared with fish-audio-s2-pro (dual-AR + RVQ); the “AR-over-time + AR-over-depth” split echoes the codec-token TTS design now common across the open field.

text-to-speech · neural-audio-codec · open-weight-tts · fish-audio-s2-pro · tts-benchmarks