Synthesize speech

POST

speak

Handler for the /speak endpoint

curl --request POST \
  --url https://api.sayna.ai/speak \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "text": "Hello, world!",
  "tts_config": {
    "model": "aura-asteria-en",
    "provider": "deepgram",
    "audio_format": "linear16",
    "connection_timeout": 30,
    "pronunciations": [
      {
        "pronunciation": "A P I",
        "word": "API"
      }
    ],
    "request_timeout": 60,
    "sample_rate": 24000,
    "speaking_rate": 1,
    "voice_id": "aura-asteria-en"
  }
}
'

Kick off one-shot synthesis jobs for short responses. This endpoint reuses the same provider layer and cache that powers the streaming WebSocket experience.

Reuse identical tts_config inputs to hit the cache and avoid extra provider round-trips.

Authorizations

Authorization

string

header

required

JWT token obtained from the authentication service. Required when AUTH_REQUIRED is enabled.

Body

application/json

Request body for the speak endpoint

text

string

required

The text to synthesize

Example:

"Hello, world!"

tts_config

object

required

TTS configuration (without API key)

Show child attributes

Response

Audio generated successfully

LiveKit tokenGenerates a LiveKit JWT token for a participant to join a specific room. When authentication is enabled (`auth.id` is present), this handler: 1. Creates the room if it doesn't exist 2. Sets `room.metadata.auth_id` to the authenticated tenant's ID 3. Issues the token only after metadata is verified/set # Arguments * `state` - Shared application state containing LiveKit configuration * `request` - Token request with room name and participant details # Returns * `Response` - JSON response with token or error status # Errors * 400 Bad Request - Invalid request data (empty fields) * 403 Forbidden - Room exists with a different tenant's `auth_id` * 500 Internal Server Error - LiveKit service not configured, room creation failed, or token generation failed

⌘I

Handler for the /speak endpoint

curl --request POST \
  --url https://api.sayna.ai/speak \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "text": "Hello, world!",
  "tts_config": {
    "model": "aura-asteria-en",
    "provider": "deepgram",
    "audio_format": "linear16",
    "connection_timeout": 30,
    "pronunciations": [
      {
        "pronunciation": "A P I",
        "word": "API"
      }
    ],
    "request_timeout": 60,
    "sample_rate": 24000,
    "speaking_rate": 1,
    "voice_id": "aura-asteria-en"
  }
}
'

Overview

REST endpoints

Synthesize speech

Authorizations

Body

Response