Stream Input

Generate emotionally expressive speech.

HandshakeTry it

WSS
wss://api.hume.ai/v0/tts/stream/input

Query parameters

context_generation_idstringOptional
The ID of a prior TTS generation to use as context for generating consistent speech style and prosody across multiple requests. Including context may increase audio generation times.
format_typeenumOptional
Allowed values:
strip_headersbooleanOptionalDefaults to false

If enabled, the audio for all the chunks of a generation, once concatenated together, will constitute a single audio file. Otherwise, if disabled, each chunk’s audio will be its own audio file, each with its own headers (if applicable).

instant_modebooleanOptionalDefaults to true

Accelerates processing to reduce streaming latency.Incurs approximately 10% additional cost while preserving full voice quality.

no_binarybooleanOptionalDefaults to false
If enabled, no binary websocket messages will be sent to the client.

Send

InputMessageobjectRequired

Receive

SnippetAudioChunkobjectRequired