Speech-to-Speech (EVI)
Speech-to-Speech (EVI)
Hume’s Empathic Voice Interface (EVI) is an advanced, real-time emotionally intelligent voice AI.
Speech-to-Speech (EVI)
Hume’s Empathic Voice Interface (EVI) is an advanced, real-time emotionally intelligent voice AI.
Hume’s Empathic Voice Interface (EVI) is an advanced, real-time emotionally intelligent voice AI. EVI measures users’ nuanced vocal modulations and responds to them using a speech-language model, which guides language and speech generation.
By processing the tune, rhythm, and timbre of speech, EVI unlocks a variety of new capabilities, like knowing when to speak and generating more empathic language with the right tone of voice.
These features enable smoother and more satisfying voice-based interactions between humans and AI, opening new possibilities for personal AI, customer service, accessibility, robotics, immersive gaming, VR experiences, and much more.
To try EVI in your browser, use the EVI Playground in the Hume platform.
Kickstart your integration with our quickstart guides for Next.js, TypeScript, and Python. Each guide walks you through integrating the EVI API, capturing user audio, and playing back EVI’s response so you can get up and running quickly.
Build web applications using our React client SDK in Next.js.
Develop server-side or frontend applications using our TypeScript SDK.
Create integrations in Python using our Python SDK.
EVI chat sessions run over a real-time WebSocket connection, enabling fluid, interactive dialogue. Users speak naturally while EVI analyzes their vocal expression and responds with emotionally intelligent speech.
REST endpoints support the API key authentication strategy.
specify your API key in the X-HUME-API-KEY header of your request.
The EVI WebSocket endpoint supports both the API key and Token authentication strategies, specify your API key or Access token in the query parameters of your request.
Before starting a session, you’ll need a voice and a configuration.
The EVI Playground is the easiest way to test your configuration. It lets you speak directly with EVI using your selected voice and settings, without writing any code.
To begin a conversation, connect using the EVI WebSocket URL start streaming the user’s audio input, via audio_input messages. EVI responds in real time with a sequence of structured messages:
assistant_messageHume provides a suite of developer tools to integrate and customize EVI.
Connect with EVI via WebSocket, including message formats and response types.
Manage EVI configurations and access your chat history.
Use official SDKs to streamline integration in Python and web-based projects.
Browse example projects demonstrating EVI integration in different frameworks.
The following limits apply to Hume’s Speech-to-Speech (EVI) API.
The EVI API supports thousands of concurrent sessions. To increase limits: