Overview
MiniMax Voice Clone is a state-of-the-art voice synthesis model developed by MiniMax. It enables high-quality voice cloning from a short reference clip, producing speech that closely mimics the tone, accent, and personality of the original speaker.

Authentication
This endpoint requires authentication using a Bearer token. Your API key in the format:
YOUR_API_KEY

Request Body
The uploaded audio file to clone. Supported formats include MP3, M4A, and WAV.
Custom user-defined voice ID. Minimum 8 characters; must include both letters and numbers and start with a letter (e.g., gptproto0001). A voice ID that already exists returns an error.
Specifies the TTS model used for the preview. It applies only to the preview generated after cloning. Once the voice has been cloned, any MiniMax Turbo or HD voice model can be used for inference.
Enable noise reduction. Default is false (no noise reduction).
Specify whether to enable volume normalization. If not provided, the default value is false.
Text validation accuracy threshold. Valid range: [0, 1] (0.00 to 1.00). If not provided, the default value is 0.7.
Text for audio preview. Limited to 2000 characters.
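
The constraints above can be checked on the client before a request is sent. The following is a minimal sketch in Python, not the official client: the payload field names (`voice_id`, `accuracy`, `noise_reduction`, `volume_normalization`, `preview_text`) and the helper functions are hypothetical, since this page does not list the actual parameter names, and the `Authorization: Bearer ...` header is assumed from the Bearer token requirement. The audio file and the preview model are left out because their identifiers are not given here. No network call is made.

```python
from typing import Dict, Tuple

MAX_PREVIEW_CHARS = 2000   # limit stated for the preview text
DEFAULT_ACCURACY = 0.7     # documented default for the accuracy threshold


def is_valid_voice_id(voice_id: str) -> bool:
    """Check the documented rules: at least 8 characters, starts with a
    letter, and contains at least one digit."""
    return (
        len(voice_id) >= 8
        and voice_id[0].isalpha()
        and any(ch.isdigit() for ch in voice_id)
    )


def build_clone_request(
    api_key: str,
    voice_id: str,
    *,
    preview_text: str = "",
    accuracy: float = DEFAULT_ACCURACY,
    noise_reduction: bool = False,
    volume_normalization: bool = False,
) -> Tuple[Dict[str, str], Dict[str, object]]:
    """Validate the inputs and return (headers, payload).

    All payload keys below are hypothetical placeholders; substitute the
    real parameter names from the API reference.
    """
    if not is_valid_voice_id(voice_id):
        raise ValueError("voice ID must be 8+ characters, start with a "
                         "letter, and include a digit")
    if not 0.0 <= accuracy <= 1.0:
        raise ValueError("accuracy must be in the range [0, 1]")
    if len(preview_text) > MAX_PREVIEW_CHARS:
        raise ValueError("preview text is limited to 2000 characters")

    # Assumed header shape for Bearer token authentication.
    headers = {"Authorization": f"Bearer {api_key}"}
    payload = {
        "voice_id": voice_id,
        "accuracy": accuracy,
        "noise_reduction": noise_reduction,
        "volume_normalization": volume_normalization,
        "preview_text": preview_text,
    }
    return headers, payload
```

Calling `build_clone_request("YOUR_API_KEY", "gptproto0001")` returns the headers and a payload filled with the documented defaults. Validating locally does not replace server-side checks; for example, a duplicate voice ID can only be detected by the service.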

