Day 0 support for Google Gemini 3.1 Flash TTS Try it now →
Providers

Inworld

Inworld text-to-speech with streaming and multilingual voices.

Prefixinworld
Default modelinworld-tts-1.5-max
Env varINWORLD_API_KEY
Official docsdocs.inworld.ai

Models

ModelStreamingAudio TagsVoice CloningNotes
inworld-tts-1.5-maxYesNoNoFlagship Llama TTS 1.5 Max — best balance of quality and speed
inworld-tts-1.5-miniYesNoNoLlama TTS 1.5 Mini — ultra-fast, most cost-efficient

Both models support 15 languages: English, Chinese, Japanese, Korean, Russian, Italian, Spanish, Portuguese, French, German, Polish, Dutch, Hindi, Hebrew, and Arabic, with enhanced timestamps.

Usage

import { generateSpeech } from "@speech-sdk/core"

const result = await generateSpeech({
  model: "inworld/inworld-tts-1.5-max",
  text: "Hello from SpeechSDK!",
  voice: "Ashley",
})

Built-in voices include Ashley, Dominus, Edward, Hades, Priya, and others — see the Inworld docs for the full list.

Provider Options

await generateSpeech({
  model: "inworld/inworld-tts-1.5-max",
  text: "Hello!",
  voice: "Ashley",
  providerOptions: {
    temperature: 0.8,
    language: "en",
  },
})

Custom Configuration

import { generateSpeech } from "@speech-sdk/core"
import { createInworld } from "@speech-sdk/core/providers"

const inworld = createInworld({
  apiKey: process.env.INWORLD_API_KEY,
})

const result = await generateSpeech({
  model: inworld(),
  text: "Hello!",
  voice: "Ashley",
})

On this page