Skip to main content

Documentation Index

Fetch the complete documentation index at: https://dev.docs.inworld.ai/llms.txt

Use this file to discover all available pages before exploring further.

Inworld provides a family of state-of-the-art TTS models, optimized for different use cases, quality levels, and performance requirements.

Realtime TTS 2.0

Our most powerful and expressive model, available in Research Preview

  • Natural language steering for more contextually aware speech
  • Support for 100+ languages
  • Optimized for real-time use
  • High quality instant voice cloning
  • Enhanced timestamps with phonetic details and visemes

Realtime TTS 1.5 Max

Our flagship model, delivering the best balance of quality and speed

  • Rich, expressive, contextually aware speech
  • Support for 15 languages
  • Optimized for real-time use (<200ms median latency)
  • High quality instant voice cloning
  • Enhanced timestamps with phonetic details and visemes

Realtime TTS 1.5 Mini

Our ultra-fast, most cost-efficient model. For when latency is the top priority.

  • Ultra-low latency (~120ms median latency)
  • Support for 15 languages
  • Radically affordable pricing
  • High quality instant voice cloning
  • Enhanced timestamps with phonetic details and visemes

Models overview

NameModel IDDescriptionSupported languages
Llama Realtime TTS 2.0inworld-tts-2              Our newest, most powerful model with natural language steering and stronger multilingual capabilities100+ languages — see Languages
Llama Realtime TTS 1.5 Maxinworld-tts-1.5-max              #1 ranked model, best balance of quality and speed, with enhanced timestampsen, zh, ja, ko, ru, it, es, pt, fr, de, pl, nl, hi, he, ar
Llama Realtime TTS 1.5 Miniinworld-tts-1.5-mini                                Ultra-fast, most cost-efficient model, with enhanced timestampsen, zh, ja, ko, ru, it, es, pt, fr, de, pl, nl, hi, he, ar
Llama Realtime TTS Max Deprecatedinworld-tts-1-max              Our most powerful previous generation model, with basic timestamps supporten, de, es, fr, it, ja, ko, nl, pl, pt, ru, zh, hi
Llama Realtime TTS Deprecatedinworld-tts-1              Our fastest previous generation model, with basic timestamps supporten, de, es, fr, it, ja, ko, nl, pl, pt, ru, zh, hi
inworld-tts-1 and inworld-tts-1-max are deprecated and will be retired in the near future. We will communicate the exact retirement date once finalized to users with advance notice to ensure a smooth transition. We recommend migrating to inworld-tts-1.5-mini and inworld-tts-1.5-max as soon as possible to avoid disruptions.