For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
Start buildingGet support
DocumentationAPI ReferenceChangelogDiscord
  • Introduction
    • Welcome to Hume AI
    • Getting your API keys
    • Support
    • Pricing
  • Voice
    • Overview
    • Voice design
    • Voice cloning
    • Voice management
  • Text-to-Speech (TTS)
    • Overview
    • Voice
    • Acting instructions
    • Voice conversion
    • Continuation
    • Timestamps
    • FAQ
  • Speech-to-Speech (EVI)
    • Overview
    • FAQ
  • Expression Measurement
    • Overview
    • About the science
    • FAQ
  • Integrations
    • MCP
    • Vercel AI SDK
    • LiveKit
    • Pipecat
    • Vapi
    • Twilio
    • Agora
  • Resources
    • Terms of use
    • Use case guidelines
    • Billing
    • Errors
    • Privacy
    • Status
Start buildingGet support
LogoLogo
LogoLogo
On this page
  • Create a voice clone
  • Record your voice
  • Upload an audio file
  • Use your voice clone
Voice

Voice Cloning

Create custom voice clones from speech using Octave, either by recording your voice or uploading a sample.
Was this page helpful?
Edit this page
Previous

Voice Management

A guide to viewing, renaming, and deleting custom voices via the Platform UI or API.
Next
Built with

Octave, Hume’s speech-language model, deeply models language and speech patterns to generate voice. While Octave supports voice design from natural language descriptions, it can also create voices from audio samples, reflecting the speaker’s tone, accent, cadence, and vocal identity.

Voice cloning availability depends on your subscription tier. Check your access and usage limits on the billing page.

Create a voice clone

You can create a voice clone using one of two supported methods:

  1. Record your voice using your microphone in a guided session.
  2. Upload an audio file containing a speech sample from a consenting speaker.

To create a voice clone, start from the Platform’s Voice Library page. Click Voice Cloning to open the cloning menu.

The sections below walk through each method step by step.

Record your voice

Follow the steps below to record a speech sample for voice cloning.

1

Start recording setup

Click RECORD AUDIO to begin the recording session flow.

Voice cloning menu
2

Name your voice clone

Input a name for your voice, and click CONTINUE.

Recording session start menu
3

Select a microphone

Select your microphone from the dropdown menu.

If you’re using an external microphone and don’t see it listed, ensure it’s properly connected.

Click START to begin recording.

Recording session microphone select menu
4

Record your voice

During the session, text prompts are streamed one line at a time for you to read aloud.

The full session typically takes less than 30 seconds.

Recording session active
5

Save the voice clone

After the recording session is complete, the recorded audio is uploaded and your voice clone is created.

Click SAVE VOICE to complete the flow and be redirected to the My Voices page.

Recording session complete

Upload an audio file

Follow the steps below to upload a pre-recorded audio file and create a voice clone.

Upload only voice samples for which you have the necessary rights or consent to clone. Users must comply with Hume’s Terms of Use, Ethical Guidelines, Privacy Policy, and applicable laws.

1

Upload an audio file

Click BROWSE FILES to select a file to upload.

Voice cloning menu
2

Create voice clone

Input a name for the voice and fill out the legal agreements, confirming you have the necessary rights or consents to upload and clone the provided voice sample.

Click CREATE VOICE to complete the flow and be redirected to the My Voices page.

Legal agreement

Use your voice clone

You can use your voice clones in Hume products that support speech synthesis. Reference them by name or ID in TTS requests, or configure EVI to use the voice.

Use the playgrounds to preview how your cloned voice sounds in different scenarios:

EVI Playground

Chat with an assistant configured with your voice clone, to see how it sounds in conversation.

TTS Playground

See how your voice clone sounds with specific text input, or when given acting instructions.

See guides below for details on how to use your voice clone in your project or integration.

Empathic Voice Interface (EVI)

Configure EVI to use your voice clone.

Text-to-Speech (TTS)

Specify your voice clone in TTS requests.