MODELS & PROVIDERS

Supporting 25+ speech models

All the text-to-speech models supported by SpeechSDK in one place.

Provider       Model                                Languages   Release Date  Open Source  Voice Clone*  Zero Data Retention
Mistral        mistral/voxtral-mini-tts-2603        en, fr, de  Mar 23, 2026
Fish Audio     fish-audio/s2-pro                    ja, en, zh  Mar 9, 2026
Cartesia       cartesia/sonic-3                     en, fr, de  Oct 27, 2025
Hume           hume/octave-2                        en, fr, de  Oct 1, 2025
fal            fal-ai/index-tts-2                   en, zh      Sep 8, 2025
Resemble       resemble/default                     en, ar, da  Sep 4, 2025
ElevenLabs     elevenlabs/eleven_v3                 af, ar, hy  Jun 8, 2025
Unreal Speech  unreal-speech/default                en, zh, hi  Jun 1, 2025
Google         google/gemini-2.5-flash-preview-tts  en, fr, de  May 1, 2025
Google         google/gemini-2.5-pro-preview-tts    en, fr, de  May 1, 2025
fal            fal-ai/dia-tts                       en          Apr 21, 2025
Deepgram       deepgram/aura-2                      en, es, de  Apr 15, 2025
OpenAI         openai/gpt-4o-mini-tts               af, ar, bg  Mar 20, 2025
fal            fal-ai/orpheus-tts                   en, es, fr  Mar 18, 2025
Cartesia       cartesia/sonic-2                     en          Mar 13, 2025
Hume           hume/octave-1                        en          Mar 1, 2025
fal            fal-ai/kokoro                        en, fr, ko  Jan 27, 2025
Murf           murf/GEN2                            en, de, es  Jan 1, 2025
Murf           murf/FALCON                          en          Jan 1, 2025
ElevenLabs     elevenlabs/eleven_flash_v2_5         ar, bg, cs  Dec 1, 2024
ElevenLabs     elevenlabs/eleven_flash_v2           en          Dec 1, 2024
fal            fal-ai/f5-tts                        en, zh, fr  Oct 8, 2024
OpenAI         openai/tts-1                         af, ar, bg  Nov 6, 2023
OpenAI         openai/tts-1-hd                      af, ar, bg  Nov 6, 2023
ElevenLabs     elevenlabs/eleven_multilingual_v2    ar, bg, cs  Aug 22, 2023

*Voice Clone refers to passing inline audio references instead of selecting a pre-defined voice.
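
The model identifiers listed above all take the form prefix/model-name, so a caller can recover the two parts by splitting on the first slash. The snippet below is a minimal sketch of that idea in plain Python. It does not use SpeechSDK itself, and the helper names split_model_id and group_by_provider are hypothetical, introduced only for illustration. Note that the prefix is not always the provider's display name: the fal models are listed under the fal-ai/ prefix.

```python
from collections import defaultdict


def split_model_id(model_id: str) -> tuple[str, str]:
    """Split an identifier such as 'openai/tts-1' into (prefix, model name).

    Hypothetical helper: it assumes only the 'prefix/model' shape seen in
    the table above, not any behaviour of SpeechSDK.
    """
    prefix, sep, model = model_id.partition("/")
    if not sep or not prefix or not model:
        raise ValueError(f"expected 'prefix/model', got {model_id!r}")
    return prefix, model


def group_by_provider(model_ids: list[str]) -> dict[str, list[str]]:
    """Group model names under their identifier prefix, preserving order."""
    grouped: dict[str, list[str]] = defaultdict(list)
    for model_id in model_ids:
        prefix, model = split_model_id(model_id)
        grouped[prefix].append(model)
    return dict(grouped)


if __name__ == "__main__":
    # A few identifiers copied from the table above.
    sample = ["openai/tts-1", "openai/tts-1-hd", "fal-ai/kokoro"]
    for prefix, models in group_by_provider(sample).items():
        print(prefix, models)
```

Splitting on the first slash only, rather than every slash, keeps the sketch tolerant of model names that might themselves contain a slash, although none of the identifiers listed here do.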