speech-audio-wiki

Organization · updated 10 June 2026

ElevenLabs

The leading commercial voice-AI company, referenced across the spoke (Scribe STT WER, TTS Elo) but not previously given its own page. It is the clearest single embodiment of the synthesis's closed-frontier pole, and unusual in that one proprietary provider spans all three branches of speech-audio-ai: TTS, STT, and now music. Founded in 2022 by Piotr Dąbkowski (ex-Google ML) and Mati Staniszewski (ex-Palantir), reportedly motivated by poor film dubbing.

Products — across all three branches

Why it matters

ElevenLabs is the commercial archetype the open wedge competes against in every branch: proprietary, polished, premium, and closed. It is the foil to kokoro/fish-audio-s2-pro (TTS), whisper/canary-qwen (STT), and stable-audio/musicgen (music). Its trajectory also marks how commercially central voice AI has become: an $11B valuation (Series D, Feb 2026), up from a $100M seed-stage valuation in 2023; 1M+ users by mid-2023; a place on the Forbes AI 50. On the rights axis the synthesis tracks, it sits on both sides: its voice cloning feeds the deepfake-fraud risk, and it ships an AI Speech Classifier to detect AI-generated audio (the SynthID/ASVspoof detection thread of audio-deepfake).

text-to-speech · speech-to-text · voice-cloning · audio-deepfake · tts-arena-leaderboard · stt-apis-comparison · audio-music-generation · elevenlabs-expressive-mode