Alibaba·video·From $0.030/run

Wan 2.7 API

Alibaba WAN 2.7 — coherent cinematic video with crisp detail, stable motion, and strong instruction-following. Separate endpoints for text-to-video, image-to-video, reference-to-video, video-edit, video-extend, plus image-edit and text-to-image variants in the same family.

Video output at 720p (default) or 1080p. Image edits via the base Edit variant or the Pro variant (up to 2K output). Text-to-image at the base variant or the Pro variant (up to 4K with thinking mode). Image-to-video supports first/last frame control; video-extend supports last-frame and audio.

Open Playground →View API Docs

About the Wan 2.7 API

What Wan 2.7 does, how it fits in the Alibaba model lineup, and why teams reach for it.

Wan 2.7 is a video generation model from Alibaba, available through the WaveSpeedAI REST API. Alibaba WAN 2.7 — coherent cinematic video with crisp detail, stable motion, and strong instruction-following. Separate endpoints for text-to-video, image-to-video, reference-to-video, video-edit, video-extend, plus image-edit and text-to-image variants in the same family.

Video output at 720p (default) or 1080p. Image edits via the base Edit variant or the Pro variant (up to 2K output). Text-to-image at the base variant or the Pro variant (up to 4K with thinking mode). Image-to-video supports first/last frame control; video-extend supports last-frame and audio.

The Wan 2.7 family on WaveSpeedAI ships 11 REST endpoints covering Text-To-Image, Video-Extend, Image-To-Video, Text-To-Video, Image-To-Image, Video-To-Video workflows. Each variant carries its own pricing, parameter knobs, and example outputs — pick the one that matches your input modality and production constraints, or call several from the same API key to compose multi-step pipelines.

Run Wan 2.7 through the same API key, billing account, and rate-limit envelope you use for the other 1,000+ AI models on WaveSpeedAI. No separate vendor setup, no per-provider SDKs, no per-vendor rate-limit envelopes — one integration covers everything from text-to-image and text-to-video through audio synthesis, 3D generation, upscaling, and editing.

All Wan 2.7 API endpoints

11 endpoints available now on WaveSpeedAI — pick the variant that matches your workflow.

Text To Image

WAN 2.7 Text-to-Image generates high-quality images from text prompts with thinking mode for enhanced image quality. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-imagefrom $0.030

Text To Image Pro

WAN 2.7 Text-to-Image Pro generates high-quality images up to 4K from text prompts with thinking mode for enhanced image quality. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-imagefrom $0.075

Video Extend

WAN 2.7 Video Extend extends existing videos with optional last frame control and audio support, supporting 720p/1080p output. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

video-extendfrom $0.50

Reference To Video

WAN 2.7 Reference-to-Video turns character, prop, or scene references from images or videos into new video shots with preserved identity, style, and layout plus smooth, coherent motion. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

image-to-videofrom $0.50

Text To Video

WAN 2.7 Text-to-Video turns plain prompts into coherent, cinematic clips with crisp detail, stable motion, and strong instruction-following—great for ads, explainers, and social posts. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

text-to-videofrom $0.50

Image Edit Pro

WAN 2.7 Image Edit Pro performs prompt-driven image editing with multi-image reference support and up to 2K output. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-imagefrom $0.075

Video Edit

WAN 2.7 Video Edit performs prompt-driven video editing with multi-image reference support, supporting 720p/1080p output. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

video-to-videofrom $0.50

Image Edit

WAN 2.7 Image Edit performs prompt-driven image editing with support for multiple-image references. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-imagefrom $0.030

Image To Video Spicy

Wan 2.7 Spicy Image to Video is a fast AI image-to-video generation model that converts images into high-quality videos with smooth animations optimized for scalable content generation. Ready-to-use REST inference API for animating images, social media clips, product videos, advertising creatives, creative storytelling, and professional image-to-video workflows with simple integration, no coldstarts, and affordable pricing.

image-to-videofrom $0.50

Image To Video

WAN 2.7 converts images into videos (720p/1080p) with optional audio, supporting first and last frame control. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

image-to-videofrom $0.50

Image To Video Pro

Wan 2.7 Image to Video Pro is a fast AI image-to-video generation model that converts images into premium-quality videos with superior motion dynamics, enhanced visual fidelity, and professional cinematic output. Ready-to-use REST inference API for product videos, advertising creatives, cinematic clips, social media content, character animation, visual storytelling, and professional image-to-video workflows with simple integration, no coldstarts, and affordable pricing.

image-to-videofrom $0.60

See Wan 2.7 in action

Real outputs generated by the Wan 2.7 API. Hover any video to preview, click to open the full-size viewer.

How to use the Wan 2.7 API

Four steps from signup to a finished generation. Full Python, Node.js, and cURL examples are in the API section below.

1
Get an API key
Sign up for a WaveSpeedAI account and copy your API key from the dashboard. New accounts come with free starter credits — enough to run the playground a few dozen times before billing kicks in.
2
Submit a prediction
POST your input as JSON to https://api.wavespeed.ai/api/v3/alibaba/wan-2.7/text-to-video. The endpoint returns a prediction id immediately — generations are async so you don't hold an open connection during inference.
3
Poll for completion
GET https://api.wavespeed.ai/api/v3/predictions/{request_id}/result every 1-2 seconds. The response includes a status field; keep polling until it flips from"queued" or"processing" to"completed".
4
Read the output URL
Once status is"completed", read the URL from data.outputs[0]. The URL points to your generated media on the WaveSpeedAI CDN — image, video, audio, or 3D file depending on the Wan 2.7 variant you called.

What you can build with Wan 2.7

Common workflows developers and creators use the Wan 2.7 API for.

Text-to-video with strong instruction-following

alibaba/wan-2.7/text-to-video turns plain prompts into coherent cinematic clips. Catalog framing: "great for ads, explainers, and social posts." 720p default, 1080p available.

text-to-videoadsexplainer

Image-to-video with frame control

alibaba/wan-2.7/image-to-video supports first and last frame control plus optional audio — useful for clips where you need the generation to start AND end at specific stills.

image-to-videofirst-last-framecontrol

Reference-to-video for identity

alibaba/wan-2.7/reference-to-video uses character, prop, or scene references (from images or videos) to generate new shots with preserved identity, style, and layout. Smooth coherent motion across the generation.

referenceidentitystyle

Video editing with multi-image references

alibaba/wan-2.7/video-edit performs prompt-driven editing on input videos with multi-image reference support, 720p/1080p output. Useful for stylistic re-edits and targeted modifications on existing footage.

video-editmulti-referencerestyle

Image editing in the same family

alibaba/wan-2.7/image-edit and image-edit-pro (up to 2K output) handle prompt-driven image editing with multi-image references — same family as the video tools, useful for stills-to-video pipelines.

image-editpipelinemulti-ref

Text-to-image with thinking mode

alibaba/wan-2.7/text-to-image-pro generates up to 4K images with "thinking mode" for enhanced quality. Useful for stills work in the same Wan 2.7 family as your video generations.

text-to-imagethinking-mode4k

Tips for prompting Wan 2.7

Practical advice for getting better outputs from Wan 2.7 — drawn from the patterns that work across video models in production pipelines.

Pick the variant for your task

Wan 2.7 ships separate endpoints for text-to-video, image-to-video, reference-to-video, video-edit, video-extend, image-edit (and image-edit-pro), text-to-image (and text-to-image-pro). Pick the endpoint that matches your input — significantly better output than asking one variant to do everything.

Reference-to-video preserves identity

alibaba/wan-2.7/reference-to-video uses character, prop, or scene references (from images or videos) to generate new shots with preserved identity, style, and layout. The right pick when the source material has specific subjects that must remain recognizable.

Image-to-video supports first/last frame control

alibaba/wan-2.7/image-to-video lets you specify first and last frame for the generated clip, plus optional audio. Useful when the start and end states are locked and the model needs to fill the connecting motion.

Video-extend with last-frame and audio control

alibaba/wan-2.7/video-extend extends an existing video with optional last-frame control and audio support, 720p/1080p output. Useful for stitching extended sequences where you want explicit control over where the extension lands.

Use "thinking mode" on text-to-image-pro

alibaba/wan-2.7/text-to-image-pro generates up to 4K with "thinking mode" for enhanced image quality. Useful when stills work shares the Wan 2.7 family with your video generations and you want consistent style across stills + motion.

Pick 720p or 1080p deliberately

Wan 2.7 supports both 720p and 1080p output across most variants. 720p is the default and the cheaper option; pick 1080p when delivery resolution matters and the extra detail justifies the cost.

Wan 2.7 API pricing

Pricing is per-output. The final charge scales with the parameters you set in each variant's playground (resolution, duration, output count, references).

Endpoint	Type	Starting price
alibaba/wan-2.7/text-to-image	text-to-image	$0.030
alibaba/wan-2.7/text-to-image-pro	text-to-image	$0.075
alibaba/wan-2.7/video-extend	video-extend	$0.50
alibaba/wan-2.7/reference-to-video	image-to-video	$0.50
alibaba/wan-2.7/text-to-video	text-to-video	$0.50
alibaba/wan-2.7/image-edit-pro	image-to-image	$0.075
alibaba/wan-2.7/video-edit	video-to-video	$0.50
alibaba/wan-2.7/image-edit	image-to-image	$0.030
alibaba/wan-2.7/image-to-video-spicy	image-to-video	$0.50
alibaba/wan-2.7/image-to-video	image-to-video	$0.50
alibaba/wan-2.7/image-to-video-pro	image-to-video	$0.60

Call the Wan 2.7 API

Sign up for an API key at wavespeed.ai/accesskey, then submit a prediction via REST. The playground generates ready-to-paste samples for any combination of inputs.

HTTP example

# 1. Submit a prediction
curl -X POST "https://api.wavespeed.ai/api/v3/alibaba/wan-2.7/text-to-video" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{}'

# 2. Poll the result until status = "completed"
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# Read the output URL from data.outputs[0].

Node.js example

// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY

const result = await client.run("alibaba/wan-2.7/text-to-video", {});
console.log(result.outputs[0]); // → URL of the generated output

Python example

# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "alibaba/wan-2.7/text-to-video",
    {}
)
print(output["outputs"][0])  # → URL of the generated output

Wan 2.7 vs alternatives

When to pick Wan 2.7 over similar models on WaveSpeedAI.

Wan 2.7 vs Seedance 2.0

Seedance 2.0 ships native audio across every variant (Wan 2.7 has optional audio on image-to-video and video-extend only) and the Turbo tier (1080p at near-480p speed). Wan 2.7 wins on cross-modal breadth — image-edit and text-to-image variants in the same family.

Wan 2.7 vs Kling 3.0

Kling 3.0 has Pro and 4K tiers plus a motion-control endpoint. Wan 2.7 stays at base across most variants and adds reference-to-video, image-edit, image-edit-pro, and text-to-image variants Kling doesn't ship.

Wan 2.7 vs Wan 2.2

Wan 2.7 is the newer architecture. Wan 2.2 (WaveSpeedAI variants) ships specialized endpoints — Animate (120s character animation), Speech-to-Video (10-min audio-driven), Fun-Control (Apache 2.0), LoRA trainers — that 2.7 doesn't expose.

Wan 2.7 API — Frequently asked questions

Pricing, license, integration — common questions about running Wan 2.7 on WaveSpeedAI.

What is the Wan 2.7 API?

Wan 2.7 is a Alibaba video generation model exposed as a REST API on WaveSpeedAI. Alibaba WAN 2.7 — coherent cinematic video with crisp detail, stable motion, and strong instruction-following. Separate endpoints for text-to-video, image-to-video, reference-to-video, video-edit, video-extend, plus image-edit and text-to-image variants in the same family. You can call it programmatically or try it from the playground linked above.

How do I call the Wan 2.7 API?

Sign up for a WaveSpeedAI account, copy your API key from /accesskey, then POST to https://api.wavespeed.ai/api/v3/alibaba/wan-2.7/text-to-video with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to "completed", then read the output URL from data.outputs[0]. Full Python / Node.js / cURL examples are above.

How much does the Wan 2.7 API cost?

Wan 2.7 starts at $0.030 per run. The exact cost scales with the parameters you set (resolution, duration, output count, references). The live cost preview next to the Generate button in the playground shows the exact price for your current input.

Which Wan 2.7 variants are available?

WaveSpeedAI hosts 11 Wan 2.7 endpoints: alibaba/wan-2.7/text-to-image, alibaba/wan-2.7/text-to-image-pro, alibaba/wan-2.7/video-extend, alibaba/wan-2.7/reference-to-video, alibaba/wan-2.7/text-to-video, alibaba/wan-2.7/image-edit-pro, alibaba/wan-2.7/video-edit, alibaba/wan-2.7/image-edit, and more. Each variant has its own playground page and pricing.

Can I use Wan 2.7 outputs commercially?

Commercial usage rights follow the Alibaba model license. Most Alibaba models permit commercial output use; see each model's playground page for the specific license summary, and WaveSpeedAI's Terms of Service for platform-level conditions.

Why use Wan 2.7 on WaveSpeedAI instead of going direct?

One API key + one billing account across Wan 2.7 AND 1,000+ other AI models from other providers. No per-vendor SDK setup, no separate rate-limit envelopes, no rewrite-per-vendor integration code. Pricing is typically at parity with or below Alibaba's direct API.

About Alibaba

The team behind Wan 2.7 and the broader Alibaba model lineup on WaveSpeedAI.

Alibaba's Tongyi Lab produces the Wan family of video models and the Qwen family of LLMs. Wan is notable for being released with open weights, broad variant coverage (text-to-video, image-to-video, reference-to-video, video-edit, video-extend, image-edit, text-to-image), and consistent strength on motion stability and prompt adherence across multilingual prompts.

Related model APIs on WaveSpeedAI

Other AI APIs from Alibaba and the rest of the video model lineup — one API key, one billing account.

Happy Horse 1.0 API

Alibaba

Alibaba Happy Horse 1.0 — cinematic 720p / 1080p video with smooth camera movement, expressive motion, and strong prompt fidelity. Includes reference-to-video for consistent character/style identity across generations.

Wan 2.2 API

Alibaba

Alibaba's Wan 2.2 — open-weight video toolkit deployed on WaveSpeedAI with 35+ first-party variants: Animate (120s character animation), Video Edit, Speech-to-Video (10-min audio-driven), Fun-Control (Apache 2.0 licensed), plus image-to-video and text-to-video at multiple model sizes (5B, A14B) and resolutions (480p / 720p).

Wan 2.6 API

Alibaba

Alibaba WAN 2.6 — text-to-video and image-to-video with synced audio at 720p/1080p, plus reference-to-video, video-extend, image-edit, and text-to-image in the same family. Flash and Spicy tiers for speed and scalable content generation.

Qwen Image API

Alibaba

Alibaba Qwen-Image — 20B MMDiT next-gen text-to-image and editing toolkit with bilingual Chinese/English support, multi-image editing, LoRA customization, layered compositing, and a 96-pose camera-angle system.

Seedance 2.0 API

ByteDance

ByteDance Seedance 2.0 — Hollywood-grade cinematic video with native audio-visual synchronization, director-level camera and lighting control, and exceptional motion stability. Built on Seed's unified multimodal architecture.

Seedance 1.5 Pro API

ByteDance

ByteDance Seedance 1.5 Pro — cinematic, live-action-leaning clips with strong prompt adherence, expressive motion, and stable aesthetics. 4-12s duration with Smart Duration, multiple aspect ratios, reproducible generation via seeds.

Start building with Wan 2.7 on WaveSpeedAI

Free starter credits on signup. One API key across 1,000+ AI models from Alibaba and every other provider.

Open Wan 2.7 Playground →Get an API Key

Wan 2.7 API

About the Wan 2.7 API

All Wan 2.7 API endpoints

Text To Image

Text To Image Pro

Video Extend

Reference To Video

Text To Video

Image Edit Pro

Video Edit

Image Edit

Image To Video Spicy

Image To Video

Image To Video Pro

See Wan 2.7 in action

How to use the Wan 2.7 API

Get an API key

Submit a prediction

Poll for completion

Read the output URL

What you can build with Wan 2.7

Text-to-video with strong instruction-following

Image-to-video with frame control

Reference-to-video for identity

Video editing with multi-image references

Image editing in the same family

Text-to-image with thinking mode

Tips for prompting Wan 2.7

Pick the variant for your task

Reference-to-video preserves identity

Image-to-video supports first/last frame control

Video-extend with last-frame and audio control

Use "thinking mode" on text-to-image-pro

Pick 720p or 1080p deliberately

Wan 2.7 API pricing

Call the Wan 2.7 API

Wan 2.7 vs alternatives

Wan 2.7 vs Seedance 2.0

Wan 2.7 vs Kling 3.0

Wan 2.7 vs Wan 2.2

Wan 2.7 API — Frequently asked questions

About Alibaba

Related model APIs on WaveSpeedAI

Happy Horse 1.0 API

Wan 2.2 API

Wan 2.6 API

Qwen Image API

Seedance 2.0 API

Seedance 1.5 Pro API

Start building with Wan 2.7 on WaveSpeedAI