Voice

Use Hume’s Voice Library or design custom voices tailored to your application.

Voice is foundational to any system that generates speech. It sets the tone, style, and pacing for how content is delivered. Whether it’s the friendly demeanor of a virtual assistant, the immersive narration of an audiobook, or the distinct personality of a character, the chosen voice shapes the listener’s experience.

Octave is Hume’s speech-language model for generating expressive speech with LLM intelligence. Unlike conventional TTS systems that rely on acoustic templates or phoneme-based pipelines, Octave understands what the text means and how it should be spoken.

Voices, whether selected from the Voice Library or created using prompts, are used in Hume’s two voice products: Empathic Voice Interface (EVI) and Text-to-speech (TTS). If you’re getting started with either, selecting or designing a voice is often your first step.

Try our free voice design demo to hear how Octave generates expressive speech from natural language descriptions, with no signup or code required.

Voice design

Octave deeply models language and speech patterns to generate new voices from natural language descriptions. These prompts can specify tone, emotion, accent, and other stylistic traits with a high degree of control.
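As a rough illustration of what a voice design prompt might look like in a request, the sketch below builds a JSON body that pairs the text to be spoken with a natural language description of the voice. The field names `utterances`, `text`, and `description` are assumptions here, not a confirmed schema, and the helper `build_voice_design_request` is hypothetical; check the TTS API reference for the exact request shape before relying on it.

```python
import json


def build_voice_design_request(text: str, description: str) -> dict:
    """Build a JSON-serializable body pairing text with a voice description.

    The keys used here ("utterances", "text", "description") are assumed
    for illustration and may differ from the actual API schema.
    """
    if not text.strip():
        raise ValueError("text must not be empty")
    if not description.strip():
        raise ValueError("description must not be empty")
    # One utterance: what to say, plus a prompt describing how the voice sounds.
    return {"utterances": [{"text": text, "description": description}]}


if __name__ == "__main__":
    body = build_voice_design_request(
        text="Welcome back. Shall we pick up where we left off?",
        description="A warm, unhurried narrator with a soft British accent.",
    )
    # Print the payload only; sending it requires an API key and endpoint.
    print(json.dumps(body, indent=2))
```

The description carries the stylistic traits mentioned above (tone, emotion, accent), so iterating on a voice is mostly a matter of editing that one string.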

The Voice Library offers over 100 voices crafted by Hume with Octave, each reflecting a unique style, personality, or accent. These voices can be used directly or serve as inspiration for creating your own.

Voice cloning

In addition to designing voices from natural language descriptions, Octave can create a voice from audio samples, capturing the speaker’s tone, accent, cadence, and vocal identity.

Voice management

Manage your custom voices using the Platform UI or programmatically through the API. Use the guide below that best matches your preferred workflow.

Voice integration

Voices you design or select from the Voice Library can be used across all Hume products that support speech synthesis. The guides below explain how to configure a voice for each API.
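To make the idea of configuring a voice concrete, here is a minimal sketch of how a request might reference a voice by name, distinguishing a Voice Library voice from a custom one. The `voice` object, its `name` and `provider` keys, and the provider values `HUME_AI` and `CUSTOM_VOICE` are assumptions for illustration, and the voice names are placeholders; the per-API guides are the authority on the real configuration.

```python
import json

# Assumed provider identifiers; confirm them against the API reference.
LIBRARY_PROVIDER = "HUME_AI"
CUSTOM_PROVIDER = "CUSTOM_VOICE"


def voice_reference(name: str, from_library: bool) -> dict:
    """Return a voice specifier for a library voice or a custom voice."""
    if not name.strip():
        raise ValueError("voice name must not be empty")
    provider = LIBRARY_PROVIDER if from_library else CUSTOM_PROVIDER
    return {"name": name, "provider": provider}


def build_tts_request(text: str, voice: dict) -> dict:
    """Attach a voice specifier to a single utterance (assumed schema)."""
    if not text.strip():
        raise ValueError("text must not be empty")
    return {"utterances": [{"text": text, "voice": voice}]}


if __name__ == "__main__":
    # "Example Library Voice" and "my-custom-narrator" are placeholder names.
    library_voice = voice_reference("Example Library Voice", from_library=True)
    custom_voice = voice_reference("my-custom-narrator", from_library=False)
    for voice in (library_voice, custom_voice):
        print(json.dumps(build_tts_request("Good morning.", voice), indent=2))
```

Keeping the voice specifier separate from the utterance text means the same voice can be reused across requests without repeating its details.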