Voice Guide

Guide to using a saved voice or a Voice Library voice in your TTS API requests.

Hume’s text-to-speech (TTS) API lets you specify which voice to use when synthesizing speech. You can use a custom voice that you have saved or select one from Hume’s Voice Library.

This guide explains how to specify a voice across all of Hume’s TTS endpoints.

To learn how to create or manage voices, see the Voice Design Guide, Voice Cloning Guide, and Voice Management Guide.

Voice reference options

You can specify a voice by name or by id. When you reference a voice by name, you can also set a provider; if you omit it, the provider defaults to CUSTOM_VOICE. To reference a voice from Hume’s Voice Library by name, set the provider to HUME_AI.

By ID: specify either a custom voice or one from Hume's Voice Library by its ID.
{
  "voice": {
    "id": "9e068547-5ba4-4c8e-8e03-69282a008f04"
  }
}
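
The by-name form described above can be sketched the same way. This is a minimal illustration, assuming the voice object takes `name` and `provider` fields as the text describes; the helper `voice_by_name` and the voice name "Example Voice" are hypothetical placeholders, not part of Hume's API or Voice Library.

```python
import json


def voice_by_name(name: str, provider: str = "CUSTOM_VOICE") -> dict:
    """Build a by-name voice reference.

    The default provider mirrors the CUSTOM_VOICE default described in this guide.
    """
    return {"voice": {"name": name, "provider": provider}}


# "Example Voice" is a placeholder name, not a real Voice Library entry.
library_voice = voice_by_name("Example Voice", provider="HUME_AI")
print(json.dumps(library_voice, indent=2))
```

Calling `voice_by_name("My Saved Voice")` without a provider produces a reference to a custom voice, matching the default behavior.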

Get voice IDs and names from /v0/tts/voices or from the Platform’s Voice Library page.
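
As a rough sketch, a request to the voices endpoint could be prepared like this. The base URL and the `X-Hume-Api-Key` header are taken from the TTS example in this guide; using them for `/v0/tts/voices`, and the `provider` query parameter, are assumptions to verify against the API reference. The helper name is hypothetical, and the request is only built here, not sent.

```python
import urllib.parse
import urllib.request

BASE_URL = "https://api.hume.ai"


def build_list_voices_request(api_key: str, provider: str = "CUSTOM_VOICE") -> urllib.request.Request:
    """Prepare (but do not send) a GET request for /v0/tts/voices."""
    # Assumption: the endpoint filters by a `provider` query parameter.
    query = urllib.parse.urlencode({"provider": provider})
    return urllib.request.Request(
        f"{BASE_URL}/v0/tts/voices?{query}",
        headers={"X-Hume-Api-Key": api_key},
        method="GET",
    )


list_request = build_list_voices_request("<apiKey>", provider="HUME_AI")
print(list_request.get_method(), list_request.full_url)
```

To actually fetch the list, pass the request to `urllib.request.urlopen` with a real API key; the response shape is not covered in this guide, so it is left out of the sketch.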

Specify a voice in your request

To set a voice, include the voice field in the first utterance of your request. That voice is used for all following utterances unless you override it later.
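
The carry-forward rule can be modeled on the client side as follows. This is an illustrative sketch of the behavior described above, not the server's implementation; it assumes that an override also carries forward to later utterances, which this guide does not state explicitly. The function name and the by-name voice are hypothetical.

```python
from typing import List, Optional


def effective_voices(utterances: List[dict]) -> List[Optional[dict]]:
    """Return the voice each utterance would use under the carry-forward rule."""
    current: Optional[dict] = None
    resolved = []
    for utterance in utterances:
        # An utterance without its own "voice" keeps the most recent one.
        current = utterance.get("voice", current)
        resolved.append(current)
    return resolved


utterances = [
    {"text": "First line.", "voice": {"id": "9e068547-5ba4-4c8e-8e03-69282a008f04"}},
    {"text": "Second line, no voice field."},
    {"text": "Third line, overridden.", "voice": {"name": "Example Voice", "provider": "HUME_AI"}},
]

for utterance, voice in zip(utterances, effective_voices(utterances)):
    print(utterance["text"], "->", voice)
```

Here the second utterance inherits the voice set on the first, and the third uses its own override.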

Voice specification works the same across streaming and non-streaming endpoints. The example below shows how to set the voice in a TTS request.

curl https://api.hume.ai/v0/tts/stream/json \
  -H "X-Hume-Api-Key: <apiKey>" \
  --json '{
    "version": "2",
    "utterances": [
      {
        "text": "Beauty is no quality in things themselves: It exists merely in the mind which contemplates them.",
        "voice": {
          "id": "9e068547-5ba4-4c8e-8e03-69282a008f04"
        }
      }
    ]
  }'
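
For readers working in Python, the same request can be assembled with the standard library alone. This is a minimal sketch rather than Hume's SDK: the URL, header, and payload fields mirror the curl command above, the helper name `build_tts_request` is hypothetical, and the request is built but not sent.

```python
import json
import urllib.request

TTS_STREAM_URL = "https://api.hume.ai/v0/tts/stream/json"


def build_tts_request(api_key: str, text: str, voice: dict) -> urllib.request.Request:
    """Prepare (but do not send) a TTS request with the voice on the first utterance."""
    payload = {
        "version": "2",
        "utterances": [{"text": text, "voice": voice}],
    }
    return urllib.request.Request(
        TTS_STREAM_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "X-Hume-Api-Key": api_key,
            # curl's --json flag sets these two headers implicitly.
            "Content-Type": "application/json",
            "Accept": "application/json",
        },
        method="POST",
    )


tts_request = build_tts_request(
    "<apiKey>",
    "Beauty is no quality in things themselves: It exists merely in the mind which contemplates them.",
    {"id": "9e068547-5ba4-4c8e-8e03-69282a008f04"},
)
print(tts_request.get_method(), tts_request.full_url)
```

Sending it requires a real API key and a call to `urllib.request.urlopen(tts_request)`; handling the streamed response is outside the scope of this guide.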

Octave 1 voices are supported for both Octave 1 and Octave 2 requests, while Octave 2 voices are supported only for Octave 2 requests. If you specify an Octave 2 voice in an Octave 1 request, the API returns an error.
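
The compatibility rule above can be expressed as a small client-side check. This sketch assumes you already know which Octave version a voice belongs to (how to look that up is not covered here) and represents versions as integers, whereas the request body uses a string such as "2". The function name is hypothetical.

```python
SUPPORTED_VERSIONS = (1, 2)


def is_voice_compatible(voice_octave_version: int, request_version: int) -> bool:
    """Return True if a voice can be used with a request of the given Octave version."""
    for version in (voice_octave_version, request_version):
        if version not in SUPPORTED_VERSIONS:
            raise ValueError(f"unknown Octave version: {version}")
    # Octave 1 voices work with both versions; Octave 2 voices need an Octave 2 request.
    return voice_octave_version <= request_version


for voice_version in SUPPORTED_VERSIONS:
    for request_version in SUPPORTED_VERSIONS:
        ok = is_voice_compatible(voice_version, request_version)
        print(f"Octave {voice_version} voice, Octave {request_version} request: {ok}")
```

Running a check like this before sending a request avoids a round trip that would end in an error for the one unsupported combination.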

Resources

Voice Design Guide

Learn how to design and create custom voices.

Voice Cloning Guide

Create a voice clone from a live recording or an audio file.

Acting Instructions

Control speech delivery using expressive performance cues.

Continuation Guide

Generate speech that leverages previous generations as context.