Text-to-speech (File)
Synthesizes one or more input texts into speech using the specified voice. If no voice is provided, a novel voice will be generated dynamically. Optionally, additional context can be included to influence the speech’s style and prosody.
The response contains the generated audio file in the requested format.
Headers
X-Hume-Api-Key
Request
This endpoint expects an object.
utterances
Utterances to be converted to speech output.
context
Utterances to use as context for generating consistent speech style and prosody across multiple requests. These will not be converted to speech output.
format
Specifies the output audio file format.
num_generations
Number of generations of the audio to produce.
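As an illustration of how the request fields above fit together, here is a minimal Python sketch that builds the request without sending it. Several details are assumptions that this section does not state: the endpoint URL, the shape of each utterance (an object with a "text" field), the shape of context (an object wrapping a list of utterances), and the shape of format (an object with a "type" field). The helper name build_tts_file_request is hypothetical. Check the full schema before relying on any of these.

```python
import json
import urllib.request

# Assumed endpoint URL; it is not stated in this section.
TTS_FILE_URL = "https://api.hume.ai/v0/tts/file"


def build_tts_file_request(api_key, texts, context_texts=None,
                           audio_format="mp3", num_generations=1):
    """Build (but do not send) a POST request for this endpoint."""
    body = {
        # Assumed shape: each utterance is an object with a "text" field.
        "utterances": [{"text": t} for t in texts],
        # Assumed shape: the format is an object with a "type" field.
        "format": {"type": audio_format},
        "num_generations": num_generations,
    }
    if context_texts:
        # Context influences style and prosody but is not synthesized.
        # Assumed shape: an object wrapping a list of utterances.
        body["context"] = {"utterances": [{"text": t} for t in context_texts]}
    return urllib.request.Request(
        TTS_FILE_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "X-Hume-Api-Key": api_key,
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_tts_file_request(
    api_key="YOUR_API_KEY",  # placeholder, replace with a real key
    texts=["Hello there.", "How are you today?"],
    context_texts=["Spoken calmly and warmly."],
)
payload = json.loads(request.data)
```

Passing the request to urllib.request.urlopen and reading the response body would then yield the audio bytes, provided the assumptions above match the actual schema.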
Response
OK. The response body is the generated audio file in the requested format.
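Because the response is an audio file, a client typically writes the raw response body to disk. The sketch below assumes the body has already been read into a bytes object and that the file extension should match the requested format; the helper name save_audio and its parameters are hypothetical.

```python
from pathlib import Path


def save_audio(audio_bytes, stem, audio_format="mp3", directory="."):
    """Write the raw response body to <directory>/<stem>.<audio_format>."""
    if not audio_bytes:
        # An empty body most likely means the request failed upstream.
        raise ValueError("empty response body; nothing to save")
    path = Path(directory) / f"{stem}.{audio_format}"
    path.write_bytes(audio_bytes)
    return path
```

For example, save_audio(body, "speech", "wav") would write speech.wav to the current directory.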