Hume MCP Server
The Hume MCP Server implements the Model Context Protocol (MCP) for Hume AI’s TTS API, allowing you to use MCP-compatible clients like Claude Desktop, Cursor, and Windsurf to collaborate with AI assistants on your voice projects.
What for?
If you hope to narrate a large source text, such as a book, play, or long-form video, there’s a lot more to the project than just converting the text to speech. You have to
- Design voices
- Break the text into pieces
- Assign each line of dialogue to a voice
- Separate acting instructions from spoken text
LLMs can perform some of these tasks and help you keep these efforts organized. MCP is an industry protocol that lets you easily give an AI assistant the ability to use tools like Octave TTS on your behalf.
Available tools
The Hume MCP Server exposes the following tools to compatible MCP clients:
Prerequisites
Before using the Hume MCP Server, make sure you have the following:
- An Hume account and API Key.
- Node.js installed on your machine.
- (Optional) A command-line audio player.
- We recommend ffplay from FFMpeg.
- The server will try to auto-detect and use any of several common players.
The MCP server calls Hume APIs on your behalf and will use credits from your account, incurring costs just as if you were making the API calls directly or using Hume’s TTS through the web interface.
Configure your MCP client
To get started with the Hume MCP Server, you’ll need to configure your MCP Client Application to use it:
Claude Desktop
Cursor
Windsurf
Add the following to the .mcpServers
property in claude_desktop_config.json
configuration file.
Source code
The Hume MCP Server is open source. You can view and contribute to the source code in the GitHub repository.
Prompt examples
Here are some example prompts to help you get started with the Hume MCP Server.
These examples assume that the assistant has the ability to read and write from a filesystem. This usually already the case for MCP clients like Cursor that are attached to an editor. For standalone chat apps like Claude Desktop, you can give the assistant filesystem access through the Filesystem MCP Server.
Basic Voice Generation
Ask the assistant to create a voice with specific characteristics:
Reader Instructions
Have the assistant read you content.
Audiobook Narration Project
This comprehensive prompt helps the assistant break down an audiobook chapter into segments and design appropriate voices:
Voice Variant Chaining
This prompt explains how to create distinct character voices through a technique called “variant chaining”:
Command line options
The Hume MCP Server accepts several command line options to customize its behavior:
Environment variables
You can configure the behavior of the Hume MCP Server using these environment variables:
Default API parameters
The MCP Server applies several default parameters to API requests for convenience:
Related resources
Learn more about Hume’s Octave TTS capabilities and features.
Best practices for prompting Octave for voice creation and voice modulation.
Guide to controlling voice expression in Octave TTS.