MiniMax MCP

★ 1,500

from MiniMax-AI

Interact with MiniMax's powerful Text-to-Speech, image, and video generation APIs.

🔥🔥🔥🔥 ✓ Verified · Account required · Needs API keys

Official MiniMax Model Context Protocol (MCP) server for MiniMax's Text-to-Speech and video/image generation APIs. It lets MCP clients such as Claude Desktop, Cursor, Windsurf, OpenAI Agents, and others generate speech, clone voices, generate videos and images, and more.

💡 Recommended: MiniMax CLI (mmx-cli), our official command-line tool with the latest models and additional features including text, vision, search, and music cover. Works as an AI agent skill for Claude Code, Cursor, OpenClaw, etc.

Documentation

Transport

We support two transport types: stdio and sse.

| stdio | SSE |
|---|---|
| Run locally | Can be deployed locally or in the cloud |
| Communication through stdout | Communication through network |
| Input: Supports processing local files or valid URL resources | Input: When deployed in the cloud, it is recommended to use URL for input |
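The input guidance in the transport comparison above can be expressed as a small pre-flight check. This is only an illustrative sketch: `classify_input` and `input_advice` are hypothetical helpers, not part of the MiniMax MCP server, and the `cloud` flag is an assumed way of modelling "deployed in the cloud".

```python
from urllib.parse import urlparse


def classify_input(source: str) -> str:
    """Return "url" for http(s) resources, otherwise "local_file"."""
    parsed = urlparse(source)
    if parsed.scheme in ("http", "https") and parsed.netloc:
        return "url"
    return "local_file"


def input_advice(transport: str, source: str, cloud: bool = False) -> str:
    """Hypothetical helper mirroring the stdio/SSE input guidance."""
    if transport not in ("stdio", "sse"):
        raise ValueError(f"unknown transport: {transport!r}")
    kind = classify_input(source)
    # Assumption: a cloud-hosted SSE server cannot see the client's disk,
    # which is why a URL is the recommended input in that case.
    if transport == "sse" and cloud and kind == "local_file":
        return "prefer a URL: the remote server may not be able to read this path"
    return f"ok: {kind}"


print(input_advice("stdio", "/tmp/sample.wav"))
print(input_advice("sse", "/tmp/sample.wav", cloud=True))
```

The check is advisory rather than enforcing, since the source only recommends URLs for cloud deployments and does not say local paths are rejected.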

Available Tools

| tool | description |
|---|---|
| text_to_audio | Convert text to audio with a given voice |
| list_voices | List all available voices |
| voice_clone | Clone a voice using provided audio files |
| generate_video | Generate a video from a prompt |
| text_to_image | Generate an image from a prompt |
| query_video_generation | Query the result of a video generation task |
| music_generation | Generate a music track from a prompt and lyrics |
| voice_design | Generate a voice from a prompt using preview text |
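MCP clients invoke these tools through the protocol's `tools/call` method, which is a JSON-RPC 2.0 request. The sketch below shows roughly what such a message looks like for `text_to_audio`. The helper `tool_call_request` is hypothetical, and the `text` argument name is an assumption; the actual argument schema for each tool should be read from the server's `tools/list` response.

```python
import json
from itertools import count

_request_ids = count(1)  # simple incrementing JSON-RPC ids


def tool_call_request(tool: str, arguments: dict) -> str:
    """Serialize an MCP tools/call request as a JSON-RPC 2.0 message."""
    message = {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": tool, "arguments": arguments},
    }
    return json.dumps(message)


# "text" is an assumed argument name used for illustration only.
request = tool_call_request("text_to_audio", {"text": "Hello from MiniMax MCP"})
print(request)
```

In practice a client such as Claude Desktop or Cursor builds and sends these messages itself, so this is mainly useful for understanding or debugging the traffic.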

Release Notes

July 2, 2025

🆕 What's New

  • Voice Design: New voice_design tool - create custom voices from descriptive prompts with preview audio

  • Video Enhancement: Added MiniMax-Hailuo-02 model with ultra-clear quality and duration/resolution controls

  • Music Generation: Enhanced music_generation tool powered by music-1.5 model

📈 Enhanced Tools

  • voice_design - Generate personalized voices from text descriptions

  • generate_video - Now supports MiniMax-Hailuo-02 with 6s/10s duration and 768P/1080P resolution options

  • music_generation - High-quality music creation with music-1.5 model
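The duration and resolution options listed for `generate_video` can be checked on the client side before a request is sent. The following is a minimal sketch: `video_options` is a hypothetical helper, the keys `model`, `duration`, and `resolution` and the default values are assumptions rather than the tool's documented parameters, and the sketch does not check whether every duration/resolution combination is accepted by the API.

```python
ALLOWED_DURATIONS_S = (6, 10)            # seconds, per the release notes
ALLOWED_RESOLUTIONS = ("768P", "1080P")  # per the release notes


def video_options(duration: int = 6, resolution: str = "768P") -> dict:
    """Validate options and return a hypothetical argument dict."""
    if duration not in ALLOWED_DURATIONS_S:
        raise ValueError(f"duration must be one of {ALLOWED_DURATIONS_S}")
    normalized = resolution.upper()  # accept "1080p" as well as "1080P"
    if normalized not in ALLOWED_RESOLUTIONS:
        raise ValueError(f"resolution must be one of {ALLOWED_RESOLUTIONS}")
    # Key names below are assumed for illustration.
    return {
        "model": "MiniMax-Hailuo-02",
        "duration": duration,
        "resolution": normalized,
    }


print(video_options(10, "768p"))
```

Failing early with a `ValueError` avoids submitting a generation task that would later have to be polled with `query_video_generation` only to discover it was rejected.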