Hume MCP Server
Use Hume AI’s Octave TTS with your favorite MCP clients like Claude Desktop, Cursor, and Windsurf.
The Hume MCP Server implements the Model Context Protocol (MCP) for Hume AI’s Octave Text-To-Speech, allowing you to use MCP clients like Claude Desktop, Cursor, and Windsurf to collaborate with AI assistants on your voice projects.
What for?
If you hope to narrate a large source text, such as a book, play, or long-form video, there’s a lot more to the project than just converting the text to speech. You have to
- Design voices
- Break the text into pieces
- Assign each line of dialogue to a voice
- Separate acting instructions from spoken text
LLMs can perform some of these tasks and help you keep these efforts organized. MCP is an industry protocol that lets you easily give an AI assistant the ability to use tools like Octave TTS on your behalf.
Available Tools
The Hume MCP Server exposes the following tools to compatible MCP clients:
Quickstart
To get started with the Hume MCP Server, you’ll need to configure your MCP Client Application to use it:
Claude Desktop
Cursor
Windsurf
Add the following to the .mcpServers
property in claude_desktop_config.json
configuration file.
Prerequisites
Before you can use the Hume MCP Server, you’ll need:
- An account and API Key from Hume AI
- Node.js installed on your system
- (optional) A command-line audio player
- ffplay from FFMpeg is recommended, but the server will attempt to detect and use any of several common players
The MCP server calls Hume APIs on your behalf and will use credits from your account, incurring costs just as if you were making the API calls directly or doing Text-to-Speech through the web interface.
Source Code
The Hume MCP Server is open source. You can view and contribute to the source code in the GitHub repository.
Prompt Examples
Here are some example prompts to help you get started with the Hume MCP Server.
These examples assume that the assistant has the ability to read and write from a filesystem. This usually already the case for MCP clients like Cursor that are attached to an editor. For standalone chat apps like Claude Desktop, you can give the assistant filesystem access through the Filesystem MCP Server.
https://github.com/modelcontextprotocol/servers/tree/main/src/filesystem
Basic Voice Generation
Ask the assistant to create a voice with specific characteristics:
Reader Instructions
Have the assistant read you content.
Audiobook Narration Project
This comprehensive prompt helps the assistant break down an audiobook chapter into segments and design appropriate voices:
Voice Variant Chaining
This prompt explains how to create distinct character voices through a technique called “variant chaining”:
Command Line Options
The Hume MCP Server accepts several command line options to customize its behavior:
Environment Variables
You can configure the behavior of the Hume MCP Server using these environment variables:
Default API Parameters
The MCP Server applies several default parameters to API requests for convenience:
Related Resources
Learn more about Hume’s Octave TTS capabilities and features.
Best practices for prompting Octave for voice creation and voice modulation.
Guide to controlling voice expression in Octave TTS.