Welcome to Hume AI

Octave 2 (preview) and EVI 4-mini are live, with expanded language support and lower latency for faster, more natural responses.

Hume is a research lab and technology company with a mission to ensure that artificial intelligence is built to serve human goals and emotional well-being.

Hume develops speech-language models that interpret and generate expressive speech, available through two APIs: the Empathic Voice Interface (EVI) for real-time voice interaction and Text-to-Speech (TTS) for expressive speech synthesis.

Speech-to-Speech (EVI)

Hume’s Empathic Voice Interface (EVI) is a real-time, emotionally intelligent voice AI. EVI measures the nuanced modulations in a user’s voice and responds to them using a speech-language model that guides both language and speech generation. Trained on millions of human interactions, this model unites language modeling and text-to-speech, with improved emotional intelligence (EQ), prosody, end-of-turn detection, interruptibility, and alignment.

  • Interviewing & Coaching: Simulate lifelike interviews or leadership coaching sessions with dynamic tone adjustment.
  • Digital Companions: Build emotionally aware companions for seniors, kids, or mental wellness support.
  • Digital Assistants: Respond with empathy and modulate tone to reduce user frustration or improve engagement.

Text-to-Speech (TTS)

Octave TTS is the first text-to-speech system built on LLM intelligence. Unlike conventional TTS that merely “reads” words, Octave is a “speech-language model” that understands what words mean in context, unlocking a new level of expressiveness and nuance.

  • Creative Tools: Narration for video, podcasting, and audiobooks.
  • Education/Coaching: Deliver lessons with an engaging, emotionally varied voice.
  • Digital Avatars: Give realistic voice to AI-powered characters in apps, games, or virtual experiences.

Voice

Voice defines how speech is delivered, shaping tone, pacing, accent, and personality. It plays a central role in how listeners perceive meaning and emotion.

All voices in Hume’s platform are powered by Octave, a speech-language model built on LLM intelligence. Octave enables expressive, context-aware speech generation from both text and natural language descriptions.

Voices can be used across both EVI and TTS to tailor how content is spoken.
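
As a rough illustration of pairing text with a natural language description and a voice, the sketch below assembles a request body in plain Python. It makes no network call and does not use the Hume SDK. The field names (`utterances`, `text`, `description`, `voice`), the helper `build_utterance`, and the voice name `my-narrator` are assumptions made for this example and are not taken from this page; check the API reference for the actual request schema.

```python
import json
from typing import Optional


def build_utterance(text: str,
                    description: Optional[str] = None,
                    voice_name: Optional[str] = None) -> dict:
    """Assemble one utterance for a hypothetical speech-synthesis request."""
    if not text.strip():
        raise ValueError("text must not be empty")
    utterance = {"text": text}
    # Optional natural-language guidance on delivery (assumed field name).
    if description:
        utterance["description"] = description
    # Optional named voice (assumed field name and shape).
    if voice_name:
        utterance["voice"] = {"name": voice_name}
    return utterance


payload = {
    "utterances": [
        build_utterance(
            "Welcome back. Let's pick up where we left off.",
            description="warm, unhurried, gently encouraging",
            voice_name="my-narrator",  # hypothetical voice name
        )
    ]
}

# Serialize the payload as it might be sent in a request body.
body = json.dumps(payload, indent=2)
print(body)
```

Keeping the description and voice optional reflects the idea that the same text can be delivered differently depending on the voice and the guidance supplied alongside it.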

SDKs

Jumpstart your development with SDKs built for Hume APIs. They handle authentication, requests, and workflows to make integration straightforward. With support for React, TypeScript, and Python, our SDKs provide the tools you need to build efficiently across different environments.

Example Code

Explore step-by-step guides and sample projects for integrating Hume APIs. Our GitHub repositories include ready-to-use code and open-source SDKs to support your development process in various environments.

Get Support

Need help? Our team is here to support you.