Day 0 support for Google Gemini 3.1 Flash TTS

MODELS & PROVIDERS

Supporting 25+ speech models

All the text-to-speech models supported by SpeechSDK in one place.

| Provider | Model | Languages | Release Date | Open Source | Streaming | Audio Tags | Voice Clone* |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Google | google/gemini-3.1-flash-tts-preview | af, am, ar | Apr 15, 2026 | | | | |
| Mistral | mistral/voxtral-mini-tts-2603 | en, fr, de | Mar 23, 2026 | | | | |
| Fish Audio | fish-audio/s2-pro | ja, en, zh | Mar 9, 2026 | | | | |
| xAI | xai/grok-tts | en, ar, bn | Nov 1, 2025 | | | | |
| Cartesia | cartesia/sonic-3 | en, fr, de | Oct 27, 2025 | | | | |
| Hume | hume/octave-2 | en, fr, de | Oct 1, 2025 | | | | |
| Resemble | resemble/default | en, ar, da | Sep 4, 2025 | | | | |
| Inworld | inworld/inworld-tts-1.5-max | en, es, fr | Aug 15, 2025 | | | | |
| Inworld | inworld/inworld-tts-1.5-mini | en, es, fr | Aug 15, 2025 | | | | |
| ElevenLabs | elevenlabs/eleven_v3 | af, ar, hy | Jun 8, 2025 | | | | |
| Google | google/gemini-2.5-flash-preview-tts | en, fr, de | May 1, 2025 | | | | |
| Google | google/gemini-2.5-pro-preview-tts | en, fr, de | May 1, 2025 | | | | |
| Deepgram | deepgram/aura-2 | en, es, de | Apr 15, 2025 | | | | |
| OpenAI | openai/gpt-4o-mini-tts | af, ar, bg | Mar 20, 2025 | | | | |
| fal | fal-ai/orpheus-tts | en, es, fr | Mar 18, 2025 | | | | |
| Cartesia | cartesia/sonic-2 | en | Mar 13, 2025 | | | | |
| Hume | hume/octave-1 | en | Mar 1, 2025 | | | | |
| fal | fal-ai/kokoro | en, fr, ko | Jan 27, 2025 | | | | |
| Murf | murf/GEN2 | en, de, es | Jan 1, 2025 | | | | |
| Murf | murf/FALCON | en | Jan 1, 2025 | | | | |
| Smallest AI | smallest-ai/lightning-v3.1 | en, hi, es | Jan 1, 2025 | | | | |
| ElevenLabs | elevenlabs/eleven_flash_v2_5 | ar, bg, cs | Dec 1, 2024 | | | | |
| ElevenLabs | elevenlabs/eleven_flash_v2 | en | Dec 1, 2024 | | | | |
| fal | fal-ai/f5-tts | en, zh, fr | Oct 8, 2024 | | | | |
| OpenAI | openai/tts-1 | af, ar, bg | Nov 6, 2023 | | | | |
| OpenAI | openai/tts-1-hd | af, ar, bg | Nov 6, 2023 | | | | |
| ElevenLabs | elevenlabs/eleven_multilingual_v2 | ar, bg, cs | Aug 22, 2023 | | | | |
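
Every entry in the Model column follows a `provider/model` pattern, and the provider prefix is not always the display name (for example `fal-ai` for fal, `smallest-ai` for Smallest AI). As a rough illustration of working with these identifiers, the sketch below splits them at the first slash and groups models by prefix. The helper names `split_model_id` and `group_by_provider` are hypothetical and are not part of SpeechSDK.

```python
from collections import defaultdict


def split_model_id(model_id: str) -> tuple[str, str]:
    """Split a 'provider/model' identifier at the first slash.

    Hypothetical helper for illustration; not a SpeechSDK function.
    """
    provider, sep, model = model_id.partition("/")
    if not sep or not provider or not model:
        raise ValueError(f"expected 'provider/model', got {model_id!r}")
    return provider, model


def group_by_provider(model_ids: list[str]) -> dict[str, list[str]]:
    """Group model names under their provider prefix, preserving input order."""
    groups = defaultdict(list)
    for model_id in model_ids:
        provider, model = split_model_id(model_id)
        groups[provider].append(model)
    return dict(groups)


if __name__ == "__main__":
    # A few identifiers copied from the table above.
    ids = ["openai/tts-1", "openai/tts-1-hd", "fal-ai/kokoro", "murf/GEN2"]
    for provider, models in group_by_provider(ids).items():
        print(provider, models)
```

Splitting at the first slash only keeps the sketch safe if a model name ever contains a slash of its own, although none of the identifiers listed here do.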

Audio Tags: bracket syntax such as [laughs], [sighs], or [whispers] that adds expressive audio cues to generated speech. For models without audio-tag support, the tags are stripped from the input and warnings are returned.
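
The strip-and-warn behavior described above can be sketched as follows. This is a minimal illustration under stated assumptions, not SpeechSDK's actual implementation: the function name `prepare_text`, the `supports_audio_tags` flag, the tag pattern (letters and spaces inside square brackets), and the warning wording are all hypothetical.

```python
import re

# Assumed tag shape: a short run of letters and spaces in square brackets,
# e.g. [laughs] or [whispers]. The real SDK's matching rules may differ.
TAG_PATTERN = re.compile(r"\[([A-Za-z][A-Za-z ]*)\]")


def prepare_text(text: str, supports_audio_tags: bool) -> tuple[str, list[str]]:
    """Return (text_to_send, warnings) for a model.

    Tags are left untouched when the model supports them. Otherwise each
    tag is removed and one warning per removed tag is collected.
    """
    if supports_audio_tags:
        return text, []

    warnings = [
        f"audio tag [{match.group(1)}] is not supported by this model and was removed"
        for match in TAG_PATTERN.finditer(text)
    ]
    stripped = TAG_PATTERN.sub("", text)
    # Tidy the gaps that removed tags leave behind.
    stripped = re.sub(r"\s{2,}", " ", stripped)
    stripped = re.sub(r"\s+([,.;:!?])", r"\1", stripped)
    return stripped.strip(), warnings


if __name__ == "__main__":
    cleaned, notes = prepare_text("[laughs] That was great. [sighs] Anyway.", False)
    print(cleaned)
    for note in notes:
        print("warning:", note)
```

Returning the warnings alongside the cleaned text, rather than raising an error, mirrors the behavior the paragraph describes: synthesis still proceeds, and the caller can decide whether to surface the dropped tags.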

*Voice Clone refers to passing inline audio references instead of selecting a pre-defined voice.