Seedance 2.0 15% OFF | Create in Video Generator →
Alibaba·video·From $0.020/run

Wan 2.2 API

Alibaba's Wan 2.2 — open-weight video toolkit deployed on WaveSpeedAI with 35+ first-party variants: Animate (120s character animation), Video Edit, Speech-to-Video (10-min audio-driven), Fun-Control (Apache 2.0 licensed), plus image-to-video and text-to-video at multiple model sizes (5B, A14B) and resolutions (480p / 720p).

WaveSpeedAI-hosted variants only. Animate produces 720p clips up to 120 seconds; Speech-to-Video produces 480p clips up to 10 minutes; Fun-Control uses preset Control Codes under Apache 2.0 for commercial use. LoRA training endpoints fine-tune in minutes.

About the Wan 2.2 API

What Wan 2.2 does, how it fits in the Alibaba model lineup, and why teams reach for it.

Wan 2.2 is a video generation model from Alibaba, available through the WaveSpeedAI REST API. Alibaba's Wan 2.2 — open-weight video toolkit deployed on WaveSpeedAI with 35+ first-party variants: Animate (120s character animation), Video Edit, Speech-to-Video (10-min audio-driven), Fun-Control (Apache 2.0 licensed), plus image-to-video and text-to-video at multiple model sizes (5B, A14B) and resolutions (480p / 720p).

WaveSpeedAI-hosted variants only. Animate produces 720p clips up to 120 seconds; Speech-to-Video produces 480p clips up to 10 minutes; Fun-Control uses preset Control Codes under Apache 2.0 for commercial use. LoRA training endpoints fine-tune in minutes.

The Wan 2.2 family on WaveSpeedAI ships 35 REST endpoints covering Lora-Support, Image-To-Video, Video-Extend, Motion-Control, Video-To-Video, Image-To-Image, Digital-Human, Training, Text-To-Image, Text-To-Video workflows. Each variant carries its own pricing, parameter knobs, and example outputs — pick the one that matches your input modality and production constraints, or call several from the same API key to compose multi-step pipelines.

Run Wan 2.2 through the same API key, billing account, and rate-limit envelope you use for the other 1,000+ AI models on WaveSpeedAI. No separate vendor setup, no per-provider SDKs, no per-vendor rate-limit envelopes — one integration covers everything from text-to-image and text-to-video through audio synthesis, 3D generation, upscaling, and editing.

All Wan 2.2 API endpoints

35 endpoints available now on WaveSpeedAI — pick the variant that matches your workflow.

Image To Video Lora — Wan 2.2 lora-support preview from Alibaba

Image To Video Lora

Wan-2.2/image-to-video-lora enables unlimited image-to-video generation from a single image, producing smooth, cinematic motion with clean detail. Supports custom LoRAs for style and character consistency. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

lora-supportfrom $0.20
Image To Video — Wan 2.2 image-to-video preview from Alibaba

Image To Video

Wan 2.2 Image-to-Video turns a single image into smooth, cinematic motion with clean detail—ideal for storyboards, mood shots, and product demos. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

image-to-videofrom $0.15
Video Extend Lora — Wan 2.2 lora-support preview from Alibaba

Video Extend Lora

Extend clips into unlimited longer videos with WAN 2.2 Spicy, producing smooth animation and supporting custom LoRA weights. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

lora-supportfrom $0.20
Video Extend — Wan 2.2 video-extend preview from Alibaba

Video Extend

Extend short clips into unlimited, high-quality longer videos with smooth animation using WAN 2.2 Spicy. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

video-extendfrom $0.15
Image To Video Lora — Wan 2.2 lora-support preview from Alibaba

Image To Video Lora

Generate AI videos with personalized styles using LoRA. Upload images and apply a trained style model to WAN 2.2 — create unique, stylized videos with consistent visual identity.

lora-supportfrom $0.20
Image To Video — Wan 2.2 image-to-video preview from Alibaba

Image To Video

WAN 2.2 Spicy converts images into unlimited high-quality videos with smooth animations optimized for scalable content generation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-videofrom $0.15
Animate — Wan 2.2 motion-control preview from Alibaba

Animate

Wan2.2-Animate unified character animation & replacement model replicating movement and expression; generates 720p videos up to 120s. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

motion-controlfrom $0.20
Video Edit — Wan 2.2 video-to-video preview from Alibaba

Video Edit

Wan 2.2 Video Edit lets you modify videos via text prompts (e.g., change clothing or characters). Powered by Wan 2.2, it supports 480p ($0.20/5s) and 720p ($0.40/5s), up to 120s. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

video-to-videofrom $0.20
Image To Image — Wan 2.2 image-to-image preview from Alibaba

Image To Image

WAN 2.2 (14B) is an image-to-image model for high-resolution photorealistic image editing with exceptional precision and fidelity. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-imagefrom $0.020
Speech To Video — Wan 2.2 digital-human preview from Alibaba

Speech To Video

Wan-2.2-S2V turns images and speech into high-fidelity videos with realistic face and body motion; supports up to 10-minute clips in 480p, from $0.15/5s. Ready-to-use REST API, no coldstarts, affordable pricing.

digital-humanfrom $0.15
Text To Image Lora — Wan 2.2 lora-support preview from Alibaba

Text To Image Lora

WAN 2.2 generates super-detailed images from text prompts and supports custom LoRAs for fine-grained style and subject control. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

lora-supportfrom $0.025
Fun Control — Wan 2.2 motion-control preview from Alibaba

Fun Control

Wan2.2-Fun-Control uses Control Codes and multi-modal inputs to generate preset-controlled videos up to 120s at 720p; released under Apache 2.0 for commercial use. Ready-to-use REST API, no coldstarts, affordable.

motion-controlfrom $0.20
Wan 2.2 Image Lora Trainer — Wan 2.2 training preview from Alibaba

Wan 2.2 Image Lora Trainer

Train custom Wan 2.2 character/style LoRA models 10x faster. Style training, character training, object training. From concept to model in minutes, not hours. Upload a ZIP file containing images to start!

trainingfrom $3.00
Wan 2.2 I2v Lora Trainer — Wan 2.2 training preview from Alibaba

Wan 2.2 I2v Lora Trainer

Train custom Wan 2.2 I2V LoRA models 10x faster. Action training, motion training, video efect training. From concept to model in minutes, not hours. Upload a ZIP file containing videos to start!

trainingfrom $5.00
Text To Image Realism — Wan 2.2 text-to-image preview from Alibaba

Text To Image Realism

WAN 2.2 delivers ultra-realistic text-to-image generation, converting prompts into photoreal images with high fidelity and detail. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-imagefrom $0.025
I2v 5b 720p Lora — Wan 2.2 lora-support preview from Alibaba

I2v 5b 720p Lora

Wan 2.2 i2v-5B-720p is a 5B image-to-video model producing 720p videos with LoRA support for style customization. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

lora-supportfrom $0.10
T2v 5b 720p Lora — Wan 2.2 lora-support preview from Alibaba

T2v 5b 720p Lora

Wan 2.2 T2V 5B is a 5B text-to-video model with LoRA support that generates 720p videos from text prompts for easy personalization. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

lora-supportfrom $0.10
I2v 480p Lora Ultra Fast (Fast) — Wan 2.2 lora-support preview from Alibaba

I2v 480p Lora Ultra Fast (Fast)

Wan 2.2 i2v delivers ultra-fast Image-to-Video at 480p with support for custom LoRAs for tailored styles. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

lora-supportfrom $0.10
I2v 480p Ultra Fast (Fast) — Wan 2.2 image-to-video preview from Alibaba

I2v 480p Ultra Fast (Fast)

Wan 2.2 A14B Image-to-Video (i2v-480p) produces ultra-fast 480p videos from single images, enabling unlimited AI video generation with high throughput. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-videofrom $0.050
I2v 720p Ultra Fast (Fast) — Wan 2.2 image-to-video preview from Alibaba

I2v 720p Ultra Fast (Fast)

Generate unlimited ultra-fast 720p AI videos from images with Wan 2.2 A14B image-to-video model. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-videofrom $0.10
T2v 480p Lora Ultra Fast (Fast) — Wan 2.2 lora-support preview from Alibaba

T2v 480p Lora Ultra Fast (Fast)

Ultra-fast Wan 2.2 text-to-video model producing 480p videos with custom LoRA support—generate unlimited AI videos with personalized styles. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

lora-supportfrom $0.10
I2v 720p Lora Ultra Fast (Fast) — Wan 2.2 lora-support preview from Alibaba

I2v 720p Lora Ultra Fast (Fast)

Wan 2.2 i2v 720P is an ultra-fast Image-to-Video model that generates unlimited AI videos and supports custom LoRAs for personalized outputs. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

lora-supportfrom $0.15
T2v 480p Ultra Fast (Fast) — Wan 2.2 text-to-video preview from Alibaba

T2v 480p Ultra Fast (Fast)

Wan 2.2 t2v 480p Ultra-Fast generates unlimited AI videos from text prompts at 480p with ultra-fast inference. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-videofrom $0.050
T2v 5b 720p — Wan 2.2 text-to-video preview from Alibaba

T2v 5b 720p

Wan 2.2 T2V 5B is a 720P text-to-video model that generates unlimited AI videos from simple text prompts, producing consistent high-quality 720p outputs. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-videofrom $0.050
I2v 5b 720p — Wan 2.2 image-to-video preview from Alibaba

I2v 5b 720p

Wan 2.2 I2V 5B converts images into high-quality 720P videos using a 5B image-to-video model for AI video generation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-videofrom $0.050
I2v 480p — Wan 2.2 image-to-video preview from Alibaba

I2v 480p

Wan 2.2 A14B converts images into 480p videos, enabling unlimited AI video generation from single images. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-videofrom $0.15
I2v 480p Lora — Wan 2.2 lora-support preview from Alibaba

I2v 480p Lora

WAN 2.2 A14B Image-to-Video model generates unlimited 480p videos from images and supports custom LoRAs for personalized styles and fine-tuning. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

lora-supportfrom $0.20
I2v 720p Lora — Wan 2.2 lora-support preview from Alibaba

I2v 720p Lora

WAN 2.2 Image-to-Video (i2v) 720p converts images into 720p videos and supports custom LoRAs for style personalization. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

lora-supportfrom $0.35
I2v 720p — Wan 2.2 image-to-video preview from Alibaba

I2v 720p

WAN 2.2 A14B i2v-720p converts images into smooth 720p videos, enabling unlimited AI video generation with the Wan 2.2 image-to-video model. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-videofrom $0.30
T2v 480p — Wan 2.2 text-to-video preview from Alibaba

T2v 480p

Wan 2.2 t2v-480p generates unlimited AI videos from text prompts at 480p resolution, ideal for rapid prototyping and content creation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-videofrom $0.15
T2v 480p Lora — Wan 2.2 lora-support preview from Alibaba

T2v 480p Lora

WAN 2.2 T2V 480p with LoRA generates text-to-video at 480p and supports custom LoRAs for personalized styles. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

lora-supportfrom $0.20
T2v 720p — Wan 2.2 text-to-video preview from Alibaba

T2v 720p

Wan 2.2 t2v-720p converts text prompts into native 720P videos, producing high-quality 720P clips from simple prompts. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-videofrom $0.30
T2v 720p Lora — Wan 2.2 lora-support preview from Alibaba

T2v 720p Lora

Wan 2.2 T2V 720p with custom LoRA support turns text prompts into 720p AI videos and enables unlimited video generation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

lora-supportfrom $0.35
T2v 720p Lora Ultra Fast (Fast) — Wan 2.2 lora-support preview from Alibaba

T2v 720p Lora Ultra Fast (Fast)

Ultra-fast Wan 2.2 Text-to-Video generates unlimited 720p AI videos with custom LoRAs for personalized styles. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

lora-supportfrom $0.15
T2v 720p Ultra Fast (Fast) — Wan 2.2 text-to-video preview from Alibaba

T2v 720p Ultra Fast (Fast)

WAN 2.2 T2V 720p Ultra-Fast generates high-quality 720p videos from text prompts with unlimited output and ultra-fast throughput. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-videofrom $0.10

See Wan 2.2 in action

Real outputs generated by the Wan 2.2 API. Hover any video to preview, click to open the full-size viewer.

How to use the Wan 2.2 API

Four steps from signup to a finished generation. Full Python, Node.js, and cURL examples are in the API section below.

  1. 1

    Get an API key

    Sign up for a WaveSpeedAI account and copy your API key from the dashboard. New accounts come with free starter credits — enough to run the playground a few dozen times before billing kicks in.

  2. 2

    Submit a prediction

    POST your input as JSON to https://api.wavespeed.ai/api/v3/wavespeed-ai/wan-2.2/image-to-video. The endpoint returns a prediction id immediately — generations are async so you don't hold an open connection during inference.

  3. 3

    Poll for completion

    GET https://api.wavespeed.ai/api/v3/predictions/{request_id}/result every 1-2 seconds. The response includes a status field; keep polling until it flips from"queued" or"processing" to"completed".

  4. 4

    Read the output URL

    Once status is"completed", read the URL from data.outputs[0]. The URL points to your generated media on the WaveSpeedAI CDN — image, video, audio, or 3D file depending on the Wan 2.2 variant you called.

What you can build with Wan 2.2

Common workflows developers and creators use the Wan 2.2 API for.

Wan 2.2 Animate — character animation up to 120s

wavespeed-ai/wan-2.2/animate is a "unified character animation & replacement model replicating movement and expression; generates 720p videos up to 120s." Catalog claim. Significantly longer than most pose-driven animation tools.

animate120spose

Speech-to-Video up to 10 minutes

wavespeed-ai/wan-2.2/speech-to-video "turns images and speech into high-fidelity videos with realistic face and body motion; supports up to 10-minute clips in 480p." Useful for long-form talking content.

speech-to-video10-minutetalking

Video Edit with prompt-driven changes

wavespeed-ai/wan-2.2/video-edit lets you modify videos via text prompts (catalog examples: change clothing or characters). Supports 480p and 720p, up to 120s.

video-editmodifyprompt-driven

Fun-Control with Apache 2.0 license

wavespeed-ai/wan-2.2/fun-control uses "Control Codes and multi-modal inputs to generate preset-controlled videos up to 120s at 720p; released under Apache 2.0 for commercial use." The Apache 2.0 licensing is a real differentiator for commercial pipelines.

fun-controlapachecommercial

LoRA training (10× faster)

wavespeed-ai/wan-2.2-image-lora-trainer for image LoRA, wavespeed-ai/wan-2.2-i2v-lora-trainer for I2V LoRA. Catalog claim: "10x faster training." Style, character, object, motion, action, and video-effect training supported. Upload a ZIP file to start.

loratrainingfine-tune

Image-to-Video at multiple sizes (5B / A14B)

Pick the model size: 5B (smaller, -0.10 depending on res) for speed/cost, or A14B for full quality. Standard i2v for 480p; LoRA-supported variants; ultra-fast variants at -0.10.

i2vmodel-size5b-a14b

Tips for prompting Wan 2.2

Practical advice for getting better outputs from Wan 2.2 — drawn from the patterns that work across video models in production pipelines.

Pick the variant for your task

Wan 2.2 ships specialized endpoints rather than one general-purpose model. Animate for pose-driven motion, video-edit for targeted changes, speech-to-video for talking content, image-to-video for general generation. Match endpoint to task — significantly better outputs than asking a general model to do everything.

Train a LoRA for production-scale consistency

Wan 2.2's LoRA trainer endpoints are first-class API features, not a side-tool. For productions that need recurring identity across hundreds of generations (brand mascot, recurring character, signature style), train a LoRA once and call the LoRA-inference endpoint per generation.

Use Spicy variants for stylized output

wan-2.2-spicy variants load community LoRA checkpoints for stylized i2v and video-extend. Use when your creative needs a specific artistic look (anime, painterly, branded) the base model doesn't produce. Pair with stylized reference images for the strongest effect.

Open weights enable fine-tuning

Closed-weight competitors lock you into the vendor's model behavior. Wan 2.2 underneath is open-weight, so when you hit a limitation the base model can't solve, fine-tuning is on the table — most other commercial video APIs don't offer this path.

Combine Animate with Kling Motion Control

Both ship pose-driven animation with different trade-offs. Wan 2.2 Animate has LoRA support and open weights; Kling Motion Control has stronger identity preservation. Pick based on whether fine-tuning matters more or whether identity quality matters more.

Wan 2.2 API pricing

Pricing is per-output. The final charge scales with the parameters you set in each variant's playground (resolution, duration, output count, references).

EndpointTypeStarting price
wavespeed-ai/wan-2.2/image-to-video-loralora-support$0.20
wavespeed-ai/wan-2.2/image-to-videoimage-to-video$0.15
wavespeed-ai/wan-2.2-spicy/video-extend-loralora-support$0.20
wavespeed-ai/wan-2.2-spicy/video-extendvideo-extend$0.15
wavespeed-ai/wan-2.2-spicy/image-to-video-loralora-support$0.20
wavespeed-ai/wan-2.2-spicy/image-to-videoimage-to-video$0.15
wavespeed-ai/wan-2.2/animatemotion-control$0.20
wavespeed-ai/wan-2.2/video-editvideo-to-video$0.20
wavespeed-ai/wan-2.2/image-to-imageimage-to-image$0.020
wavespeed-ai/wan-2.2/speech-to-videodigital-human$0.15
wavespeed-ai/wan-2.2/text-to-image-loralora-support$0.025
wavespeed-ai/wan-2.2/fun-controlmotion-control$0.20
wavespeed-ai/wan-2.2-image-lora-trainertraining$3.00
wavespeed-ai/wan-2.2-i2v-lora-trainertraining$5.00
wavespeed-ai/wan-2.2/text-to-image-realismtext-to-image$0.025
wavespeed-ai/wan-2.2/i2v-5b-720p-loralora-support$0.10
wavespeed-ai/wan-2.2/t2v-5b-720p-loralora-support$0.10
wavespeed-ai/wan-2.2/i2v-480p-lora-ultra-fastlora-support$0.10
wavespeed-ai/wan-2.2/i2v-480p-ultra-fastimage-to-video$0.050
wavespeed-ai/wan-2.2/i2v-720p-ultra-fastimage-to-video$0.10
wavespeed-ai/wan-2.2/t2v-480p-lora-ultra-fastlora-support$0.10
wavespeed-ai/wan-2.2/i2v-720p-lora-ultra-fastlora-support$0.15
wavespeed-ai/wan-2.2/t2v-480p-ultra-fasttext-to-video$0.050
wavespeed-ai/wan-2.2/t2v-5b-720ptext-to-video$0.050
wavespeed-ai/wan-2.2/i2v-5b-720pimage-to-video$0.050
wavespeed-ai/wan-2.2/i2v-480pimage-to-video$0.15
wavespeed-ai/wan-2.2/i2v-480p-loralora-support$0.20
wavespeed-ai/wan-2.2/i2v-720p-loralora-support$0.35
wavespeed-ai/wan-2.2/i2v-720pimage-to-video$0.30
wavespeed-ai/wan-2.2/t2v-480ptext-to-video$0.15
wavespeed-ai/wan-2.2/t2v-480p-loralora-support$0.20
wavespeed-ai/wan-2.2/t2v-720ptext-to-video$0.30
wavespeed-ai/wan-2.2/t2v-720p-loralora-support$0.35
wavespeed-ai/wan-2.2/t2v-720p-lora-ultra-fastlora-support$0.15
wavespeed-ai/wan-2.2/t2v-720p-ultra-fasttext-to-video$0.10

Call the Wan 2.2 API

Sign up for an API key at wavespeed.ai/accesskey, then submit a prediction via REST. The playground generates ready-to-paste samples for any combination of inputs.

HTTP example
# 1. Submit a prediction
curl -X POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/wan-2.2/image-to-video" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{}'

# 2. Poll the result until status = "completed"
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# Read the output URL from data.outputs[0].
Node.js example
// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY

const result = await client.run("wavespeed-ai/wan-2.2/image-to-video", {});
console.log(result.outputs[0]); // → URL of the generated output
Python example
# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "wavespeed-ai/wan-2.2/image-to-video",
    {}
)
print(output["outputs"][0])  # → URL of the generated output

Wan 2.2 vs alternatives

When to pick Wan 2.2 over similar models on WaveSpeedAI.

Wan 2.2 vs Wan 2.7

Wan 2.7 (alibaba/wan-2.7/*) is the newer Alibaba architecture with reference-to-video, video-edit, image-edit, and text-to-image in one family — broader cross-modal toolkit. Wan 2.2 (WaveSpeedAI variants) ships specialized endpoints — Animate (120s), Speech-to-Video (10-min), Fun-Control (Apache 2.0), LoRA trainers — that 2.7 doesn't expose.

Wan 2.2 vs Seedance 2.0

Seedance 2.0 ships Hollywood-grade output with native audio across every variant. Wan 2.2 wins on the variant surface (35+ endpoints) and the LoRA training story — the right pick when you need a specific specialized capability (Animate, Speech-to-Video, Fun-Control) rather than a general-purpose video model.

Wan 2.2 vs Kling 3.0 Motion Control

Both ship pose-driven character animation. Wan 2.2 Animate produces 120s 720p clips and ships LoRA fine-tuning. Kling Motion Control is constrained to reference video duration (3-30s) but trained on Kuaishou's video corpus for stronger motion priors.

Wan 2.2 API — Frequently asked questions

Pricing, license, integration — common questions about running Wan 2.2 on WaveSpeedAI.

What is the Wan 2.2 API?

Wan 2.2 is a Alibaba video generation model exposed as a REST API on WaveSpeedAI. Alibaba's Wan 2.2 — open-weight video toolkit deployed on WaveSpeedAI with 35+ first-party variants: Animate (120s character animation), Video Edit, Speech-to-Video (10-min audio-driven), Fun-Control (Apache 2.0 licensed), plus image-to-video and text-to-video at multiple model sizes (5B, A14B) and resolutions (480p / 720p). You can call it programmatically or try it from the playground linked above.

How do I call the Wan 2.2 API?

Sign up for a WaveSpeedAI account, copy your API key from /accesskey, then POST to https://api.wavespeed.ai/api/v3/wavespeed-ai/wan-2.2/image-to-video with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to "completed", then read the output URL from data.outputs[0]. Full Python / Node.js / cURL examples are above.

How much does the Wan 2.2 API cost?

Wan 2.2 starts at $0.020 per run. The exact cost scales with the parameters you set (resolution, duration, output count, references). The live cost preview next to the Generate button in the playground shows the exact price for your current input.

Which Wan 2.2 variants are available?

WaveSpeedAI hosts 35 Wan 2.2 endpoints: wavespeed-ai/wan-2.2/image-to-video-lora, wavespeed-ai/wan-2.2/image-to-video, wavespeed-ai/wan-2.2-spicy/video-extend-lora, wavespeed-ai/wan-2.2-spicy/video-extend, wavespeed-ai/wan-2.2-spicy/image-to-video-lora, wavespeed-ai/wan-2.2-spicy/image-to-video, wavespeed-ai/wan-2.2/animate, wavespeed-ai/wan-2.2/video-edit, and more. Each variant has its own playground page and pricing.

Can I use Wan 2.2 outputs commercially?

Commercial usage rights follow the Alibaba model license. Most Alibaba models permit commercial output use; see each model's playground page for the specific license summary, and WaveSpeedAI's Terms of Service for platform-level conditions.

Why use Wan 2.2 on WaveSpeedAI instead of going direct?

One API key + one billing account across Wan 2.2 AND 1,000+ other AI models from other providers. No per-vendor SDK setup, no separate rate-limit envelopes, no rewrite-per-vendor integration code. Pricing is typically at parity with or below Alibaba's direct API.

About Alibaba

The team behind Wan 2.2 and the broader Alibaba model lineup on WaveSpeedAI.

Alibaba's Tongyi Lab produces the Wan family of video models and the Qwen family of LLMs. Wan is notable for being released with open weights, broad variant coverage (text-to-video, image-to-video, reference-to-video, video-edit, video-extend, image-edit, text-to-image), and consistent strength on motion stability and prompt adherence across multilingual prompts.

Start building with Wan 2.2 on WaveSpeedAI

Free starter credits on signup. One API key across 1,000+ AI models from Alibaba and every other provider.