Spokes.wiki Search Graph Growth About

speech-audio-wiki

Article source ↗ source url updated Fri Jun 05 2026 00:00:00 GMT+0000 (Coordinated Universal Time)

Best Text-to-Speech Models in 2026: A Benchmark-Based Comparison (MarkTechPost)

A field survey (2026-05-30) ranking the leading text-to-speech models — proprietary and open-weight — by tts-benchmarks and matching them to use-cases. One of the founding sources of this wiki.

Proprietary top tier (by Artificial Analysis Elo — dated snapshot)

Open-weight field

The verdict

No single model wins; pick by your binding constraint — latency, quality, language coverage, or cost.” Real-time → Sonic 3.5 / Inworld / Aura-2; long-form → ElevenLabs v3 / Gemini / VibeVoice; on-device → kokoro / CosyVoice 2; dubbing → IndexTTS-2; emotion → Hume Octave 2. The author stresses rankings shift weekly — treat leaderboard positions as dated snapshots (tts-benchmarks).

text-to-speech · tts-benchmarks · open-weight-tts · kokoro · fish-audio-s2-pro · tts-arena-leaderboard