Welcome to Hume AI
🚀 EVI 3 API support coming soon! Our most powerful real-time voice interactions API yet is launching soon with enhanced expressiveness, lowest latency, and custom voices! Stay tuned for early access!
This is your starting point for building applications that understand and generate human expression. Hume’s APIs are designed for real-time processing of voice and text, enabling you to measure nuanced emotional signals and create expressive, emotionally intelligent experiences. We offer REST and WebSocket interfaces, low-latency streaming support, and language-agnostic examples to help you get up and running quickly.
Empathic Voice Interface (EVI)
Hume’s Empathic Voice Interface (EVI) is an advanced, real-time emotionally intelligent voice AI. EVI measures users’ nuanced vocal modulations and responds to them using a speech-language model, which guides language and speech generation. Trained on millions of human interactions, our speech-language model unites language modeling and text-to-speech with better EQ, prosody, end-of-turn detection, interruptibility, and alignment.
- Interviewing & Coaching: Simulate lifelike interviews or leadership coaching sessions with dynamic tone adjustment.
- Digital Companions: Build emotionally aware companions for seniors, kids, or mental wellness support.
- Digital Assistants: Respond with empathy and modulate tone to reduce user frustration or improve engagement.
Text-to-speech (TTS)
Octave TTS is the first text-to-speech system built on LLM intelligence. Unlike conventional TTS that merely “reads” words, Octave is a “speech-language model” that understands what words mean in context, unlocking a new level of expressiveness and nuance.
- Creative Tools: Narration for video, podcasting, and audiobooks.
- Education/Coaching: Deliver lessons with engaging, emotionally varied voice.
- Digital Avatars: Give realistic voice to AI-powered characters in apps, games, or virtual experiences.
Expression Measurement
Hume’s state-of-the-art expression measurement models for the voice, face, and language are built on 10+ years of research and advances in semantic space theory pioneered by Alan Cowen. Our expression measurement models are able to capture hundreds of dimensions of human expression in audio, video, and images.
- Health & Wellness: Monitor patient tone and emotion during therapy or check-ins.
- Call Center Analytics: Detect caller frustration or distress for triage and escalation.
- UX/CX Research: Analyze user interviews and testing sessions for sentiment trends.
🔍 Learn from real-world examples: Check out our case studies to see how companies are integrating empathic AI into their products and solutions today.
API Reference
Our API reference provides detailed descriptions of our REST and WebSocket endpoints. Explore request and response formats, usage examples, and everything you need to integrate Hume APIs.
API that measures nuanced vocal modulations and responds to them using an empathic large language model
Synthesize text to speech using Octave, Hume’s state-of-the-art speech-language model
Analyze facial, vocal, and linguistic expressions across 48+ dimensions to unlock deeper emotional insights
SDKs
Jumpstart your development with SDKs built for Hume APIs. They handle authentication, requests, and workflows to make integration straightforward. With support for React, TypeScript, and Python, our SDKs provide the tools you need to build efficiently across different environments.
Integrate Hume’s Empathic Voice Interface into React apps with tools for audio recording, playback, and API interaction
Work with Hume’s APIs using type-safe utilities and API wrappers for TypeScript and JavaScript
Access Hume’s APIs in Python with async/sync clients, error handling, and streaming tools
Example Code
Explore step-by-step guides and sample projects for integrating Hume APIs. Our GitHub repositories include ready-to-use code and open-source SDKs to support your development process in various environments.
Browse sample code and projects designed to help you integrate Hume APIs
Explore all of Hume’s open-source SDKs, examples, and public-facing code
Get Support
Need help? Our team is here to support you with any questions or challenges.