Skip to main content
POST
/
api
/
v3
/
alibaba
/
wan-2.5
/
image-to-video-fast
Image to Video
curl --request POST \
  --url https://gptproto.com/api/v3/alibaba/wan-2.5/image-to-video-fast \
  --header 'Authorization: <authorization>' \
  --header 'Content-Type: application/json' \
  --data '{
  "image": "<string>",
  "audio": "<string>",
  "prompt": "<string>",
  "negative_prompt": "<string>",
  "resolution": "<string>",
  "duration": 123,
  "enable_prompt_expansion": true,
  "seed": 123
}'
{
  "error": {
    "message": "Invalid signature",
    "type": "401"
  }
}

Overview

Alibaba WAN 2.5 is an advanced image-to-video model on Alibaba Cloud’s DashScope. It generates high-quality videos from images and supports output resolutions of 480p, 720p, and 1080p.

Authentication

This endpoint requires authentication using a Bearer token.
Authorization
string
default:"sk-***********"
required
Your API key in the format: YOUR_API_KEY

Request Body

image
string
required
The image for generating the output.
audio
string
Audio URL to guide generation (optional).
prompt
string
required
The positive prompt for the generation.
negative_prompt
string
required
The negative prompt for the generation.
resolution
string
default:"720p"
480p, 720p, 1080p . The resolution of the generated media.
duration
integer
default:"5"
3, 4, 5, 6, 7, 8, 9, 10 . The duration of the generated media in seconds.
enable_prompt_expansion
boolean
default:"false"
If set to true, the prompt optimizer will be enabled.
seed
integer
default:"-1"
-1 ~ 2147483647 . The random seed to use for the generation. -1 means a random seed will be used.

Request Example


curl --location 'https://gptproto.com/api/v3/alibaba/wan-2.5/image-to-video' \
--header 'Authorization: sk-xx' \
--header 'Content-Type: application/json' \
--data '{
    "image": "https://d1q70pf5vjeyhc.cloudfront.net/media/fb8f674bbb1a429d947016fd223cfae1/images/1758532447268876299_CLRNKHEA.jpeg",
    "audio": "",
    "prompt": "A confident woman in her 40s stands on a stage with a microphone. The background shows a large LED screen with abstract visuals. She smiles and begins speaking to the audience: “Good evening everyone. Tonight, I want to share three powerful lessons about leadership and innovation.” Her lip movements match her voice, and she uses expressive hand gestures while speaking.",
    "negative_prompt": "",
    "resolution": "720p",
    "duration": 5,
    "enable_prompt_expansion": false,
    "seed": -1
}'

Response

data.id
string
Unique identifier for the prediction, Task Id
data.status
string
Status of the task: created, processing, completed, or failed
{
  "data": {
    "id": "5d87273c75exxxx",
    "model": "alibaba/wan-2.5/image-to-video",
    "outputs": [],
    "urls": {
      "get": "https://gptproto.com/api/v3/predictions/5d87273c75exxxx/result"
    },
    "hasNsfwContents": [],
    "status": "created",
    "createdAt": "2025-11-10T09:50:05.977Z",
    "error": "",
    "executionTime": 0,
    "timings": {
      "inference": 0
    }
  },
  "message": "success",
  "code": 200
}

{
  "error": {
    "message": "Invalid signature",
    "type": "401"
  }
}