Stream Input
Generate emotionally expressive speech.
HandshakeTry it
WSS
wss://api.hume.ai/v0/tts/stream/input
Query parameters
context_generation_id
The ID of a prior TTS generation to use as context for generating consistent speech style and prosody across multiple requests. Including context may increase audio generation times.
format_type
Allowed values:
strip_headers
If enabled, the audio for all the chunks of a generation, once concatenated together, will constitute a single audio file. Otherwise, if disabled, each chunk’s audio will be its own audio file, each with its own headers (if applicable).
instant_mode
Accelerates processing to reduce streaming latency.Incurs approximately 10% additional cost while preserving full voice quality.
no_binary
If enabled, no binary websocket messages will be sent to the client.
Send
InputMessage
Receive
SnippetAudioChunk