Wan 2.7 API
Alibaba's Wan 2.7 generates coherent, cinematic video with crisp detail, stable motion, and strong instruction-following. The family has separate endpoints for text-to-video, image-to-video, reference-to-video, video-edit, and video-extend, plus image-edit and text-to-image variants.
Video output is 720p (default) or 1080p. Image editing comes in a base Edit variant and a Pro variant (up to 2K output); text-to-image comes in a base variant and a Pro variant (up to 4K with thinking mode). Image-to-video supports first/last frame control; video-extend supports last-frame control and audio.
About the Wan 2.7 API
What Wan 2.7 does, how it fits in the Alibaba model lineup, and why teams reach for it.
Wan 2.7 is a video generation model from Alibaba, available through the WaveSpeedAI REST API.
The Wan 2.7 family on WaveSpeedAI ships 11 REST endpoints covering text-to-image, image-to-image, text-to-video, image-to-video, video-to-video, and video-extend workflows. Each variant carries its own pricing, parameters, and example outputs. Pick the one that matches your input modality and production constraints, or call several with the same API key to compose multi-step pipelines.
Run Wan 2.7 through the same API key, billing account, and rate-limit envelope you use for the other 1,000+ AI models on WaveSpeedAI. No separate vendor setup, no per-provider SDKs, no per-vendor rate-limit envelopes — one integration covers everything from text-to-image and text-to-video through audio synthesis, 3D generation, upscaling, and editing.
All Wan 2.7 API endpoints
11 endpoints available now on WaveSpeedAI — pick the variant that matches your workflow.

Text To Image
WAN 2.7 Text-to-Image generates high-quality images from text prompts with thinking mode for enhanced image quality. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Text To Image Pro
WAN 2.7 Text-to-Image Pro generates high-quality images up to 4K from text prompts with thinking mode for enhanced image quality. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Video Extend
WAN 2.7 Video Extend extends existing videos with optional last frame control and audio support, supporting 720p/1080p output. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Reference To Video
WAN 2.7 Reference-to-Video turns character, prop, or scene references from images or videos into new video shots with preserved identity, style, and layout plus smooth, coherent motion. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Text To Video
WAN 2.7 Text-to-Video turns plain prompts into coherent, cinematic clips with crisp detail, stable motion, and strong instruction-following—great for ads, explainers, and social posts. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Image Edit Pro
WAN 2.7 Image Edit Pro performs prompt-driven image editing with multi-image reference support and up to 2K output. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Video Edit
WAN 2.7 Video Edit performs prompt-driven video editing with multi-image reference support, supporting 720p/1080p output. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Image Edit
WAN 2.7 Image Edit performs prompt-driven image editing with support for multiple-image references. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Image To Video Spicy
Wan 2.7 Spicy Image to Video is a fast AI image-to-video generation model that converts images into high-quality videos with smooth animations, optimized for scalable content generation. Ready-to-use REST inference API for animating images, social media clips, product videos, advertising creatives, creative storytelling, and professional image-to-video workflows with simple integration, no cold starts, and affordable pricing.

Image To Video
WAN 2.7 converts images into videos (720p/1080p) with optional audio, supporting first and last frame control. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Image To Video Pro
Wan 2.7 Image to Video Pro is a fast AI image-to-video generation model that converts images into premium-quality videos with superior motion dynamics, enhanced visual fidelity, and professional cinematic output. Ready-to-use REST inference API for product videos, advertising creatives, cinematic clips, social media content, character animation, visual storytelling, and professional image-to-video workflows with simple integration, no cold starts, and affordable pricing.
How to use the Wan 2.7 API
Four steps from signup to a finished generation. Full Python, Node.js, and cURL examples are in the API section below.
1. Get an API key
Sign up for a WaveSpeedAI account and copy your API key from the dashboard. New accounts come with free starter credits, enough to run the playground a few dozen times before billing kicks in.
2. Submit a prediction
POST your input as JSON to https://api.wavespeed.ai/api/v3/alibaba/wan-2.7/text-to-video. The endpoint returns a prediction id immediately; generations are async, so you don't hold an open connection during inference.
3. Poll for completion
GET https://api.wavespeed.ai/api/v3/predictions/{request_id}/result every 1-2 seconds. The response includes a status field; keep polling until it flips from "queued" or "processing" to "completed".
4. Read the output URL
Once status is "completed", read the URL from data.outputs[0]. The URL points to your generated media on the WaveSpeedAI CDN: an image or a video, depending on the Wan 2.7 variant you called.
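The four steps above can be sketched in Python with only the standard library. This is a minimal sketch, not the official client: the page confirms the two URLs, the `status` field, and `data.outputs[0]`, but the location of the prediction id (`data.id`), the `"failed"` status value, and the helper names are assumptions made for illustration.

```python
import json
import time
import urllib.request

API_BASE = "https://api.wavespeed.ai/api/v3"


def submit_url(model: str) -> str:
    """URL for step 2: submitting a prediction to a model endpoint."""
    return f"{API_BASE}/{model}"


def result_url(request_id: str) -> str:
    """URL for step 3: polling a prediction's result."""
    return f"{API_BASE}/predictions/{request_id}/result"


def first_output(response: dict):
    """Return data.outputs[0] once status is "completed", else None."""
    data = response.get("data", {})
    if data.get("status") != "completed":
        return None
    outputs = data.get("outputs") or []
    return outputs[0] if outputs else None


def _call(url: str, api_key: str, payload=None) -> dict:
    """Send an authenticated JSON request; POST when a payload is given, else GET."""
    body = json.dumps(payload).encode() if payload is not None else None
    req = urllib.request.Request(url, data=body, headers={
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    })
    with urllib.request.urlopen(req, timeout=30) as resp:
        return json.load(resp)


def generate(model: str, payload: dict, api_key: str,
             interval: float = 2.0, timeout: float = 600.0) -> str:
    """Submit a prediction, poll every `interval` seconds, return the output URL."""
    submitted = _call(submit_url(model), api_key, payload)
    request_id = submitted["data"]["id"]  # ASSUMPTION: where the prediction id lives
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        response = _call(result_url(request_id), api_key)
        output = first_output(response)
        if output is not None:
            return output
        if response.get("data", {}).get("status") == "failed":  # ASSUMPTION: error status
            raise RuntimeError(f"prediction {request_id} failed")
        time.sleep(interval)
    raise TimeoutError(f"prediction {request_id} did not complete in {timeout}s")
```

Under these assumptions, `generate("alibaba/wan-2.7/text-to-video", {...}, api_key)` would block until the clip is ready and return its CDN URL. The input dict is left elided here because the page does not list the endpoint's input fields; the playground shows them. The overall timeout is a design choice so a stuck prediction cannot poll forever.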
What you can build with Wan 2.7
Common workflows developers and creators use the Wan 2.7 API for.
Text-to-video with strong instruction-following
alibaba/wan-2.7/text-to-video turns plain prompts into coherent cinematic clips. The catalog describes it as "great for ads, explainers, and social posts." Output is 720p by default, with 1080p available.
Image-to-video with frame control
alibaba/wan-2.7/image-to-video supports first and last frame control plus optional audio. This is useful for clips where the generation must start and end at specific stills.
Reference-to-video for identity
alibaba/wan-2.7/reference-to-video uses character, prop, or scene references (from images or videos) to generate new shots with preserved identity, style, and layout, and with smooth, coherent motion.
Video editing with multi-image references
alibaba/wan-2.7/video-edit performs prompt-driven editing on input videos with multi-image reference support and 720p/1080p output. Useful for stylistic re-edits and targeted modifications on existing footage.
Image editing in the same family
alibaba/wan-2.7/image-edit and alibaba/wan-2.7/image-edit-pro (up to 2K output) handle prompt-driven image editing with multi-image references. They belong to the same family as the video tools, which makes them useful for stills-to-video pipelines.
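A stills-to-video pipeline of this kind could be chained as in the sketch below. The endpoint ids come from this page, but every input field name (`prompt`, `images`, `image`) is a hypothetical placeholder; check each endpoint's playground for the real schema. The `run` argument is any callable you supply that submits a prediction and returns the output URL, which keeps the sketch independent of a particular client.

```python
from typing import Callable, Dict

# A runner takes (endpoint id, input dict) and returns the output URL.
Runner = Callable[[str, dict], str]


def stills_to_video(run: Runner, still_prompt: str,
                    edit_prompt: str, motion_prompt: str) -> Dict[str, str]:
    """Generate a still, edit it, then animate the edited still."""
    # Step 1: text-to-image. "prompt" is a HYPOTHETICAL field name.
    still = run("alibaba/wan-2.7/text-to-image", {"prompt": still_prompt})

    # Step 2: image-edit, using the still as the reference image.
    # "images" is a HYPOTHETICAL field name.
    edited = run("alibaba/wan-2.7/image-edit",
                 {"prompt": edit_prompt, "images": [still]})

    # Step 3: image-to-video, using the edited still as the first frame.
    # "image" is a HYPOTHETICAL field name.
    clip = run("alibaba/wan-2.7/image-to-video",
               {"prompt": motion_prompt, "image": edited})

    return {"still": still, "edited": edited, "clip": clip}
```

Each stage consumes the URL produced by the previous one, so all three calls can share a single API key and billing account.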
Text-to-image with thinking mode
alibaba/wan-2.7/text-to-image-pro generates up to 4K images with "thinking mode" for enhanced quality. Useful for stills work in the same Wan 2.7 family as your video generations.
Tips for prompting Wan 2.7
Practical advice for getting better outputs from Wan 2.7 — drawn from the patterns that work across video models in production pipelines.
Pick the variant for your task
Wan 2.7 ships separate endpoints for text-to-video, image-to-video, reference-to-video, video-edit, video-extend, image-edit (and image-edit-pro), and text-to-image (and text-to-image-pro). Pick the endpoint that matches your input; a purpose-built variant gives significantly better output than asking one variant to do everything.
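One way to encode that choice is a small lookup from input and output modality to endpoint id. The ids below are the ones listed on this page; the routing rules and the `pick_endpoint` helper itself are an illustrative assumption, not part of any WaveSpeedAI SDK.

```python
BASE = "alibaba/wan-2.7"

# (input modality, output modality) -> default variant, per this page's endpoint list.
_ROUTES = {
    ("text", "image"): "text-to-image",
    ("text", "video"): "text-to-video",
    ("image", "image"): "image-edit",
    ("image", "video"): "image-to-video",
    ("video", "video"): "video-edit",
}

# Variants that this page lists with a "-pro" sibling.
_HAS_PRO = {"text-to-image", "image-edit", "image-to-video"}


def pick_endpoint(source: str, target: str, *, references: bool = False,
                  extend: bool = False, pro: bool = False) -> str:
    """Return the Wan 2.7 endpoint id for a given task description."""
    if (source, target) not in _ROUTES:
        raise ValueError(f"no Wan 2.7 endpoint for {source} -> {target}")
    if references and target == "video" and source != "text":
        # Identity-preserving shots from image or video references.
        variant = "reference-to-video"
    elif extend and source == "video":
        # Continue an existing clip instead of editing it.
        variant = "video-extend"
    else:
        variant = _ROUTES[(source, target)]
    if pro:
        if variant not in _HAS_PRO:
            raise ValueError(f"{variant} has no Pro variant on this page")
        variant += "-pro"
    return f"{BASE}/{variant}"
```

For example, `pick_endpoint("image", "image", pro=True)` returns `alibaba/wan-2.7/image-edit-pro`, while asking for a Pro tier on text-to-video raises a `ValueError` because the page lists none.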
Reference-to-video preserves identity
alibaba/wan-2.7/reference-to-video uses character, prop, or scene references (from images or videos) to generate new shots with preserved identity, style, and layout. The right pick when the source material has specific subjects that must remain recognizable.
Image-to-video supports first/last frame control
alibaba/wan-2.7/image-to-video lets you specify first and last frame for the generated clip, plus optional audio. Useful when the start and end states are locked and the model needs to fill the connecting motion.
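As a rough illustration, an input payload with locked start and end frames might be assembled like this. The page confirms the capabilities (first frame, last frame, optional audio, 720p or 1080p) but not the request schema, so every key name below (`prompt`, `image`, `last_image`, `audio`, `resolution`) is a hypothetical placeholder to be replaced with the names shown in the playground.

```python
ALLOWED_RESOLUTIONS = ("720p", "1080p")  # from this page; 720p is the default


def build_image_to_video_input(prompt, first_frame_url, last_frame_url=None,
                               audio_url=None, resolution="720p"):
    """Assemble an image-to-video input dict with HYPOTHETICAL field names."""
    if resolution not in ALLOWED_RESOLUTIONS:
        raise ValueError(f"resolution must be one of {ALLOWED_RESOLUTIONS}")
    payload = {
        "prompt": prompt,            # HYPOTHETICAL key
        "image": first_frame_url,    # HYPOTHETICAL key: first frame
        "resolution": resolution,    # HYPOTHETICAL key
    }
    if last_frame_url is not None:
        payload["last_image"] = last_frame_url  # HYPOTHETICAL key: last frame
    if audio_url is not None:
        payload["audio"] = audio_url            # HYPOTHETICAL key: optional audio
    return payload
```

Optional inputs are omitted from the dict rather than sent as nulls, so the same builder covers both the first-frame-only case and the locked start-and-end case.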
Video-extend with last-frame and audio control
alibaba/wan-2.7/video-extend extends an existing video with optional last-frame control and audio support, at 720p or 1080p output. Useful for stitching extended sequences where you want explicit control over where the extension lands.
Use "thinking mode" on text-to-image-pro
alibaba/wan-2.7/text-to-image-pro generates up to 4K with "thinking mode" for enhanced image quality. Useful when stills work shares the Wan 2.7 family with your video generations and you want consistent style across stills and motion.
Pick 720p or 1080p deliberately
Wan 2.7 supports both 720p and 1080p output across most variants. 720p is the default and the cheaper option; pick 1080p when delivery resolution matters and the extra detail justifies the cost.
Wan 2.7 API pricing
Pricing is per-output. The final charge scales with the parameters you set in each variant's playground (resolution, duration, output count, references).
| Endpoint | Type | Starting price |
|---|---|---|
| alibaba/wan-2.7/text-to-image | text-to-image | $0.030 |
| alibaba/wan-2.7/text-to-image-pro | text-to-image | $0.075 |
| alibaba/wan-2.7/video-extend | video-extend | $0.50 |
| alibaba/wan-2.7/reference-to-video | image-to-video | $0.50 |
| alibaba/wan-2.7/text-to-video | text-to-video | $0.50 |
| alibaba/wan-2.7/image-edit-pro | image-to-image | $0.075 |
| alibaba/wan-2.7/video-edit | video-to-video | $0.50 |
| alibaba/wan-2.7/image-edit | image-to-image | $0.030 |
| alibaba/wan-2.7/image-to-video-spicy | image-to-video | $0.50 |
| alibaba/wan-2.7/image-to-video | image-to-video | $0.50 |
| alibaba/wan-2.7/image-to-video-pro | image-to-video | $0.60 |
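The starting prices in the table give a lower bound for budgeting. The sketch below copies them into a lookup and multiplies by planned run counts. The `minimum_cost` helper is illustrative only, and because the final charge scales with resolution, duration, output count, and references, the real bill can be higher than this floor.

```python
from decimal import Decimal
from typing import Mapping

# Starting prices in USD, copied from the table above.
STARTING_PRICE_USD = {
    "alibaba/wan-2.7/text-to-image": Decimal("0.030"),
    "alibaba/wan-2.7/text-to-image-pro": Decimal("0.075"),
    "alibaba/wan-2.7/video-extend": Decimal("0.50"),
    "alibaba/wan-2.7/reference-to-video": Decimal("0.50"),
    "alibaba/wan-2.7/text-to-video": Decimal("0.50"),
    "alibaba/wan-2.7/image-edit-pro": Decimal("0.075"),
    "alibaba/wan-2.7/video-edit": Decimal("0.50"),
    "alibaba/wan-2.7/image-edit": Decimal("0.030"),
    "alibaba/wan-2.7/image-to-video-spicy": Decimal("0.50"),
    "alibaba/wan-2.7/image-to-video": Decimal("0.50"),
    "alibaba/wan-2.7/image-to-video-pro": Decimal("0.60"),
}


def minimum_cost(runs: Mapping[str, int]) -> Decimal:
    """Lower-bound cost in USD for a plan of {endpoint id: number of runs}."""
    total = Decimal("0")
    for endpoint, count in runs.items():
        if count < 0:
            raise ValueError("run counts must be non-negative")
        total += STARTING_PRICE_USD[endpoint] * count
    return total
```

For instance, 100 text-to-image runs plus 20 image-to-video runs come to at least 100 × $0.030 + 20 × $0.50 = $13.00. `Decimal` is used instead of floats so that cent-level sums stay exact.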
Call the Wan 2.7 API
Sign up for an API key at wavespeed.ai/accesskey, then submit a prediction via REST. The playground generates ready-to-paste samples for any combination of inputs.
HTTP example
# 1. Submit a prediction
curl -X POST "https://api.wavespeed.ai/api/v3/alibaba/wan-2.7/text-to-video" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{}'
# 2. Poll the result until status = "completed"
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"
# Read the output URL from data.outputs[0].
Node.js example
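// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY
// CommonJS modules have no top-level await, so run the call inside an async function.
async function main() {
  const result = await client.run("alibaba/wan-2.7/text-to-video", {});
  console.log(result.outputs[0]); // → URL of the generated output
}
main().catch(console.error);
Python example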
# pip install wavespeed
import wavespeed
output = wavespeed.run(
    "alibaba/wan-2.7/text-to-video",
    {}
)
print(output["outputs"][0])  # → URL of the generated output
Wan 2.7 vs alternatives
When to pick Wan 2.7 over similar models on WaveSpeedAI.
Wan 2.7 vs Seedance 2.0
Seedance 2.0 ships native audio across every variant (Wan 2.7 has optional audio on image-to-video and video-extend only) and offers a Turbo tier (1080p at near-480p speed). Wan 2.7 wins on cross-modal breadth, with image-edit and text-to-image variants in the same family.
Wan 2.7 vs Kling 3.0
Kling 3.0 has Pro and 4K tiers plus a motion-control endpoint. Wan 2.7 offers only the base tier for most variants, but adds reference-to-video, image-edit, image-edit-pro, and text-to-image variants that Kling doesn't ship.
Wan 2.7 vs Wan 2.2
Wan 2.7 is the newer architecture. Wan 2.2 (WaveSpeedAI variants) ships specialized endpoints that 2.7 doesn't expose: Animate (120s character animation), Speech-to-Video (10-min audio-driven), Fun-Control (Apache 2.0), and LoRA trainers.
Wan 2.7 API — Frequently asked questions
Pricing, license, integration — common questions about running Wan 2.7 on WaveSpeedAI.
What is the Wan 2.7 API?
Wan 2.7 is an Alibaba video generation model exposed as a REST API on WaveSpeedAI. It produces coherent, cinematic video with crisp detail, stable motion, and strong instruction-following, with separate endpoints for text-to-video, image-to-video, reference-to-video, video-edit, and video-extend, plus image-edit and text-to-image variants in the same family. You can call it programmatically or try it from the playground linked above.
How do I call the Wan 2.7 API?
Sign up for a WaveSpeedAI account, copy your API key from wavespeed.ai/accesskey, then POST to https://api.wavespeed.ai/api/v3/alibaba/wan-2.7/text-to-video with your input as JSON. The endpoint returns a prediction id; poll https://api.wavespeed.ai/api/v3/predictions/{request_id}/result until status flips to "completed", then read the output URL from data.outputs[0]. Full Python / Node.js / cURL examples are above.
How much does the Wan 2.7 API cost?
Wan 2.7 starts at $0.030 per run. The exact cost scales with the parameters you set (resolution, duration, output count, references). The live cost preview next to the Generate button in the playground shows the exact price for your current input.
Which Wan 2.7 variants are available?
WaveSpeedAI hosts 11 Wan 2.7 endpoints: alibaba/wan-2.7/text-to-image, alibaba/wan-2.7/text-to-image-pro, alibaba/wan-2.7/video-extend, alibaba/wan-2.7/reference-to-video, alibaba/wan-2.7/text-to-video, alibaba/wan-2.7/image-edit-pro, alibaba/wan-2.7/video-edit, alibaba/wan-2.7/image-edit, alibaba/wan-2.7/image-to-video-spicy, alibaba/wan-2.7/image-to-video, and alibaba/wan-2.7/image-to-video-pro. Each variant has its own playground page and pricing.
Can I use Wan 2.7 outputs commercially?
Commercial usage rights follow the Alibaba model license. Most Alibaba models permit commercial output use; see each model's playground page for the specific license summary, and WaveSpeedAI's Terms of Service for platform-level conditions.
Why use Wan 2.7 on WaveSpeedAI instead of going direct?
One API key and one billing account cover Wan 2.7 and 1,000+ other AI models from other providers. No per-vendor SDK setup, no separate rate-limit envelopes, no rewrite-per-vendor integration code. Pricing is typically at parity with or below Alibaba's direct API.
About Alibaba
The team behind Wan 2.7 and the broader Alibaba model lineup on WaveSpeedAI.
Alibaba's Tongyi Lab produces the Wan family of video models and the Qwen family of LLMs. Wan is notable for being released with open weights, broad variant coverage (text-to-video, image-to-video, reference-to-video, video-edit, video-extend, image-edit, text-to-image), and consistent strength on motion stability and prompt adherence across multilingual prompts.
Start building with Wan 2.7 on WaveSpeedAI
Free starter credits on signup. One API key across 1,000+ AI models from Alibaba and every other provider.