POST
/
voice-cloning
curl -X POST https://api.console.tts.monster/voice-cloning \
-H "Content-Type: application/json" \
-H "Authorization: ttsm_12345-abcdef" \
-d '{
      "action": "create",
      "voice_name": "My Custom Voice",
      "language": "en",
      "file": "data:audio/wav;base64,..."
    }'
{
  "voice_id": "9aad4a1b-f04e-43a1-8ff5-4830115a10a8",
  "name": "My Custom Voice",
  "language": "en",
  "training": "https://script.tts.monster/abc123.wav",
  "sample": "https://script-samples.tts.monster/abc123.wav"
}
Authorization
string
required
Add your API token here. You can find it by reading the /authentication page.

Body

action
string
required
The action to perform. Must be one of: create, delete, or edit.

create Action Parameters

file
string
required
Base64 encoded audio file (WAV or MP3). Must be between 30 seconds and 5 minutes in length and under 10MB.
voice_name
string
required
Name for the custom voice. Must be between 3 and 50 characters.
language
string
required
The language of the voice. Must be one of the supported languages.

delete Action Parameters

voice_id
string
required
The ID of the custom voice to delete.

edit Action Parameters

voice_id
string
required
The ID of the custom voice to edit.
voice_name
string
New name for the custom voice. Must be between 3 and 50 characters.
language
string
New language for the voice. Must be one of the supported languages:

Response

create Action Response

voice_id
string
Unique identifier for the created voice.
name
string
Name of the custom voice.
language
string
Language of the voice.
training
string
URL to the training audio file.
sample
string
URL to a sample generated using the voice. This counts towards your character usage.

delete Action Response

message
string
Confirmation message of the deletion.

edit Action Response

voice_id
string
ID of the edited voice.
name
string
Updated name (if changed).
language
string
Updated language (if changed).
sample
string
New sample URL (if language was changed). This counts towards your character usage.

Limitations

  • Voice creation is limited based on your subscription plan
  • Audio files must be between 30 seconds and 5 minutes in duration
  • File size limit is 10MB
  • Voice names must be between 3 and 50 characters
  • Only WAV and MP3 file formats are supported

Supported Languages

  • English v3 (New) en-v3
    • Our newest English voice model has greatly improved quality and more natural sounding speech.
  • English v2 en-v2
  • English v1 en
  • Spanish es
  • French fr
  • Japanese ja
  • Korean ko
  • Italian it
  • Polish pl
  • German de
  • Portuguese pt
  • Arabic ar
  • Dutch nl
  • Russian ru
curl -X POST https://api.console.tts.monster/voice-cloning \
-H "Content-Type: application/json" \
-H "Authorization: ttsm_12345-abcdef" \
-d '{
      "action": "create",
      "voice_name": "My Custom Voice",
      "language": "en",
      "file": "data:audio/wav;base64,..."
    }'
{
  "voice_id": "9aad4a1b-f04e-43a1-8ff5-4830115a10a8",
  "name": "My Custom Voice",
  "language": "en",
  "training": "https://script.tts.monster/abc123.wav",
  "sample": "https://script-samples.tts.monster/abc123.wav"
}