SoundClone - Create Preview Task
SoundClone
SoundClone - Create Preview Task
Submit a voice-cloning preview task. Poll GET /api/open-api/v1/audios/ for modelId and preview audioUrl.
POST
SoundClone - Create Preview Task
SoundClone tasks are async. The create response returns a task
id. Poll Query SoundClone Task until completion to obtain modelId and preview audioUrl, then call Create Audio Task for production audio.All responses use the envelope { "code": 20000, "msg": "ok", "data": { ... } }. Examples below show the data payload.Overview
Submit a voice-cloning preview task from a source audio or video URL. When the task completes, you receive a preview audio URL and amodelId for formal audio generation.
Request body
| Field | Type | Required | Description |
|---|---|---|---|
fileUrl | string | Yes | Public URL of source audio or video. No local paths or Chinese characters in the URL. Audio: mp3, ogg, wav, m4a, aac. Video: mp4, avi, mov, mkv, flv. Spoken content must be > 15s and < 60s. |
contentText | string | No | Preview script, max 270 characters. Uses a default line when omitted. |
soundVersion | string | No | v1 (24 languages) or v2 (40 languages). Default v1. |
language | string | No | Language code, default auto. Example: Chinese, English. Some languages require v2; see Create Audio Task. |
Billing
Preview is billed by character count in units of 10,000 characters (price_mode: per_10k_char).
| Model config | Description |
|---|---|
sound-cloning-clone | Preview character fee; unit price is per 10k characters |
- Characters are counted as Unicode runes;
<#x#>pause markers are excluded. - The default preview text counts toward billing when
contentTextis omitted. - Balance is checked before submission; failed tasks are refunded.
Example
Response example
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Body
application/json