Enables ultra-low latency streaming, significantly reducing the time until the first audio chunk is received. Recommended for real-time applications requiring immediate audio playback. For further details, see our documentation on instant mode.
If enabled, the audio for all the chunks of a generation, once concatenated together, will constitute a single audio file. Otherwise, if disabled, each chunk’s audio will be its own audio file, each with its own headers (if applicable).
API key used for authenticating the client. If not provided, an access_token must be provided to authenticate.
For more details, refer to the Authentication Strategies Guide.
Access token used for authenticating the client. If not provided, an api_key must be provided to authenticate.
The access token is generated using both an API key and a Secret key, which provides an additional layer of security compared to using just an API key.
For more details, refer to the Authentication Strategies Guide.
Sampling temperature for the speech generation model. Higher values increase variation; lower values increase consistency.
This is an experimental parameter. It is recommended to use the default values for most use cases.
Defaults when omitted:
0.90.80.75