curl --request POST \
  --url https://api.inworld.ai/stt/v1/transcribe \
  --header "Authorization: Basic $INWORLD_API_KEY" \
  --header "Content-Type: application/json" \
  --data '{
    "transcribeConfig": {
      "modelId": "groq/whisper-large-v3",
      "audioEncoding": "LINEAR16",
      "language": "en-US",
      "sampleRateHertz": 16000,
      "numberOfChannels": 1
    },
    "audioData": {
      "content": "<YOUR_AUDIO>"
    }
  }'
{
  "transcription": {
    "transcript": "Hey, I just wanted to check in on the delivery status for my order.",
    "isFinal": true,
    "wordTimestamps": []
  },
  "usage": null
}
Send the entire audio in a single request and receive a single transcription in response.
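The same request can be sketched in Python using only the standard library. This is a minimal illustration, not an official client: the helper names (`build_transcribe_body`, `build_request`, `extract_transcript`) are hypothetical, and it assumes that `audioData.content` carries the audio bytes as base64 text, which the page does not state explicitly. The endpoint, headers, and field values are copied from the curl example.

```python
import base64
import json
import urllib.request

API_URL = "https://api.inworld.ai/stt/v1/transcribe"


def build_transcribe_body(audio_bytes: bytes) -> dict:
    """Build the JSON body used in the curl example."""
    return {
        "transcribeConfig": {
            "modelId": "groq/whisper-large-v3",
            "audioEncoding": "LINEAR16",
            "language": "en-US",
            "sampleRateHertz": 16000,
            "numberOfChannels": 1,
        },
        "audioData": {
            # Assumption: raw bytes are sent as base64 text, the usual
            # way to carry binary data in JSON. Not confirmed by the page.
            "content": base64.b64encode(audio_bytes).decode("ascii"),
        },
    }


def build_request(audio_bytes: bytes, api_key: str) -> urllib.request.Request:
    """Prepare (but do not send) the POST request."""
    body = json.dumps(build_transcribe_body(audio_bytes)).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Basic {api_key}",
            "Content-Type": "application/json",
        },
    )


def extract_transcript(response_json: dict) -> str:
    """Pull the transcript text out of a response shaped like the sample."""
    return response_json["transcription"]["transcript"]
```

To send the request, pass the result of `build_request(...)` to `urllib.request.urlopen`, decode the response with `json.load`, and hand the resulting dictionary to `extract_transcript`.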
Your authentication credentials. For Basic authentication, set the Authorization header to Basic $INWORLD_API_KEY.
Request message for the synchronous TranscribeSpeech API.
Configuration for transcribing audio. Contains model selection, audio format settings, and optional feature configurations. Provider-specific configurations are mutually exclusive: set at most one of groqConfig or inworldSttV1Config.
Container for raw audio data bytes.
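The "at most one of groqConfig or inworldSttV1Config" rule can be checked on the client before sending a request. The sketch below is a hypothetical helper, not part of any Inworld SDK. It assumes both fields sit directly inside `transcribeConfig` and treats their contents as opaque, since the child attributes are not listed here.

```python
# Field names taken from the description above; their contents are not
# documented on this page, so they are treated as opaque values.
PROVIDER_CONFIG_KEYS = ("groqConfig", "inworldSttV1Config")


def check_provider_config(transcribe_config: dict) -> None:
    """Raise ValueError if more than one provider-specific config is set."""
    present = [
        key for key in PROVIDER_CONFIG_KEYS
        if transcribe_config.get(key) is not None
    ]
    if len(present) > 1:
        raise ValueError(
            "Set at most one provider-specific config, got: " + ", ".join(present)
        )
```

Calling `check_provider_config` on the `transcribeConfig` dictionary before serializing the request surfaces the conflict locally instead of relying on the server to reject it.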