Wan 2.7 Text-to-Image Pro generates high-quality images up to 4K from text prompts, with a thinking mode for enhanced image quality. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.
![Lookbook photo of a model wearing a [dark leather bomber jacket], minimalist studio, soft side light, confident pose, magazine cover aesthetic, clean backdrop, subtle film grain, high fashion. With a "Fashion Magazine" book name in art word style at the bottom](https://static.wavespeed.ai/examples/547b706ac3f445f0b521797d7a24a2d5/1.png)
$0.075 per run · ~13 runs per $1

two young people eating dessert together, close-up shot, wide angle lens, exaggerated perspective, sitting at an outdoor table, feeding each other with spoons, playful expressions, summer vibe, bright sunlight, pastel umbrellas above, blue sky, casual candid moment, lifestyle photography, vibrant colors, high contrast, natural skin texture, modern editorial style, high detail
Lookbook photo of a model wearing a [dark leather bomber jacket], minimalist studio, soft side light, confident pose, magazine cover aesthetic, clean backdrop, subtle film grain, high fashion. With a "Fashion Magazine" book name in art word style at the bottom
![[High-end Wireless Headphones], centered on pure white background, studio high-key lighting, crisp hard shadow, commercial packshot, 35mm perspective, ultra-sharp details, subtle floor reflection, dust-free, 8k, realistic product photography](https://static.wavespeed.ai/examples/0359a2c6e8e740bb95c61b510915c688/1.png)
[High-end Wireless Headphones], centered on pure white background, studio high-key lighting, crisp hard shadow, commercial packshot, 35mm perspective, ultra-sharp details, subtle floor reflection, dust-free, 8k, realistic product photography

cinematic fashion editorial, blonde woman leaning on a glossy red car hood, reflection on surface, golden hour sunlight, dramatic shadows, retro american street, pharmacy sign in background, palm trees, shallow depth of field, high fashion styling, fur detail on shoulder, jewelry and sunglasses on car, confident expression, slightly parted lips, moody atmosphere, film photography look, rich contrast, warm tones, ultra realistic, 35mm lens, editorial photography, vogue style, sharp focus, highly detailed
Wan 2.7 Text-to-Image Pro is the professional tier of the Wan 2.7 text-to-image generation model, supporting output resolutions up to 4K (4096×4096). With a built-in thinking mode for enhanced reasoning and custom size control, it delivers higher-fidelity compositions suited to print-ready assets, large-format displays, and any workflow where resolution and quality are the priority.
Up to 4K resolution output: Generate images up to 4096×4096 pixels, ideal for print, large-format displays, and high-DPI screens where standard resolution falls short.
Thinking mode for smarter generation: Built-in thinking mode lets the model reason about prompt intent before generating, producing more coherent compositions and better prompt adherence.
Custom size output: Set output width and height directly (512–8192 per dimension) to match banners, thumbnails, posters, or social formats exactly.
Seeded iteration: Use a fixed seed to refine style and layout with more repeatable variations.
Prompt Enhancer: Built-in tool that automatically improves your text descriptions for richer results.
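
Seeded iteration can be sketched as a small helper that holds the seed fixed while the prompt wording changes. This is a minimal sketch under stated assumptions, not part of any WaveSpeedAI SDK: `seeded_variants` is a hypothetical helper, and the payload keys are taken from the request examples on this page. It only builds request bodies and sends nothing.

```python
def seeded_variants(base_prompt, modifiers, seed, size="1024*1024"):
    """Build one request payload per modifier, all sharing the same seed.

    Hypothetical helper: it assembles JSON-ready dicts and makes no API call.
    """
    if seed == -1:
        # -1 asks the service for a random seed, which defeats repeatability.
        raise ValueError("pass a fixed seed; -1 requests a random seed")
    payloads = []
    for modifier in modifiers:
        payloads.append({
            "prompt": f"{base_prompt}, {modifier}",
            "size": size,
            "thinking_mode": True,
            "seed": seed,  # identical across variants so only the prompt changes
        })
    return payloads


variants = seeded_variants(
    "A cinematic shot of a city at sunset",
    ["soft golden light", "dramatic storm clouds"],
    seed=42,
)
for payload in variants:
    print(payload["seed"], payload["prompt"])
```

Each dict can be sent as the JSON body of a separate request, so differences between the resulting images come from the prompt change rather than from a new random seed.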
| Parameter | Required | Description |
|---|---|---|
| prompt | Yes | Text description of the image subject, scene, style, lighting, and mood. |
| size | No | Output dimensions as a `width*height` string, for example `1024*1024`. Range: 512–8192 per dimension. Default: 1024×1024. |
| thinking_mode | No | Enable thinking mode for enhanced reasoning and better image quality. Default: enabled. |
| seed | No | Fixed seed for repeatable iterations. Use -1 for a random seed. |
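
The parameter table above can be turned into a small client-side check before a request is sent. The following is a sketch only: `build_payload` is a hypothetical helper, the 512–8192 range and the defaults come from the table, and the `width*height` string format is inferred from the request examples on this page. The service may apply further limits that are not reproduced here.

```python
MIN_DIM, MAX_DIM = 512, 8192  # per-dimension range from the parameter table


def build_payload(prompt, width=1024, height=1024, thinking_mode=True, seed=-1):
    """Validate inputs against the documented ranges and return a JSON-ready dict."""
    if not prompt or not prompt.strip():
        raise ValueError("prompt is required")
    for name, value in (("width", width), ("height", height)):
        if not MIN_DIM <= value <= MAX_DIM:
            raise ValueError(
                f"{name} must be between {MIN_DIM} and {MAX_DIM}, got {value}"
            )
    return {
        "prompt": prompt,
        "size": f"{width}*{height}",  # "width*height", as in the request examples
        "thinking_mode": thinking_mode,
        "seed": seed,  # -1 asks for a random seed
    }


payload = build_payload("A cinematic shot of a city at sunset", width=2048, height=1152)
print(payload["size"])
```

Validating locally catches an out-of-range dimension before any request is submitted.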
Just $0.075 per generated image.
Grab a WaveSpeedAI API key, then call `POST https://api.wavespeed.ai/api/v3/alibaba/wan-2.7/text-to-image-pro` with your input as JSON. The endpoint returns a prediction ID; poll the prediction endpoint until status flips to `completed`, then read the output URL from `data.outputs[0]`. Examples for Wan 2.7 Text-to-Image Pro in cURL, JavaScript, and Python follow.
```bash
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/alibaba/wan-2.7/text-to-image-pro" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "size": "1024*1024",
    "thinking_mode": true,
    "seed": -1
  }'

# Response includes a prediction id. Replace {request_id} with it and poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].
```

```javascript
// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

// Top-level await is not available in CommonJS, so run inside an async function.
(async () => {
  const result = await client.run("alibaba/wan-2.7/text-to-image-pro", {
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "size": "1024*1024",
    "thinking_mode": true,
    "seed": -1
  });
  console.log(result.outputs[0]); // → URL of the generated output
})();
```

```python
# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "alibaba/wan-2.7/text-to-image-pro",
    {
        "prompt": "A cinematic shot of a city at sunset, soft golden light",
        "size": "1024*1024",
        "thinking_mode": True,
        "seed": -1,
    },
)
print(output["outputs"][0])  # → URL of the generated output
```

Wan 2.7 Text-to-Image Pro is an Alibaba model for image generation, exposed as a REST API on WaveSpeedAI. You can call it programmatically or try it from the playground above.
POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/alibaba/alibaba-wan-2.7-text-to-image-pro.
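
The submit-then-poll flow described above can be sketched with the Python standard library alone. This is an illustrative sketch, not the official client: the result URL is the one shown in the cURL example on this page, while the response shape (`data.status`, `data.outputs`), the `"failed"` status value, and the helper names `fetch_result` and `wait_for_output` are assumptions to check against the API reference.

```python
import json
import os
import time
import urllib.request

RESULT_URL = "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result"


def fetch_result(request_id):
    """GET the prediction result; reads WAVESPEED_API_KEY from the environment."""
    request = urllib.request.Request(
        RESULT_URL.format(request_id=request_id),
        headers={"Authorization": f"Bearer {os.environ['WAVESPEED_API_KEY']}"},
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


def wait_for_output(request_id, fetch=fetch_result, interval=1.0, max_attempts=120):
    """Poll until the prediction completes, then return the first output URL.

    Assumes a body shaped like {"data": {"status": ..., "outputs": [...]}}.
    """
    for _ in range(max_attempts):
        data = fetch(request_id).get("data", {})
        status = data.get("status")
        if status == "completed":
            return data["outputs"][0]
        if status == "failed":  # assumed failure status; confirm in the API reference
            raise RuntimeError(f"prediction {request_id} failed")
        time.sleep(interval)
    raise TimeoutError(
        f"prediction {request_id} not completed after {max_attempts} polls"
    )
```

Calling `wait_for_output(request_id)` with the ID returned by the POST request blocks until an output URL is available. The `fetch` parameter is injectable so the loop can be exercised with canned responses, and the `max_attempts` bound keeps a stuck prediction from polling indefinitely.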
Wan 2.7 Text-to-Image Pro starts at $0.075 per run. That figure is the base price: the final charge scales with the parameters you set in the form (such as output size), so a larger or higher-quality output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.
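
For rough budgeting, the base price can be turned into a quick lower-bound estimate. This is a sketch that assumes every run is billed at the $0.075 base price; actual charges may be higher depending on the parameters chosen, and the function names are illustrative.

```python
from decimal import Decimal, ROUND_DOWN

BASE_PRICE = Decimal("0.075")  # USD per run, the base price quoted on this page


def estimate_cost(runs, price_per_run=BASE_PRICE):
    """Lower-bound cost in USD for a number of runs at the base price."""
    return price_per_run * runs


def runs_for_budget(budget_usd, price_per_run=BASE_PRICE):
    """Whole number of base-price runs that fit within a budget."""
    ratio = Decimal(str(budget_usd)) / price_per_run
    return int(ratio.to_integral_value(rounding=ROUND_DOWN))


print(estimate_cost(100))  # 7.500
print(runs_for_budget(1))  # 13
```

The second result matches the "~13 runs per $1" figure quoted with the price. `Decimal` is used instead of floats so that small per-run prices do not accumulate rounding error.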
Key inputs: `prompt`, `size`, `seed`, `thinking_mode`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/alibaba/alibaba-wan-2.7-text-to-image-pro.
Sign up for a free WaveSpeedAI account to claim starter credits, copy your API key from /accesskey, then call the endpoint shown in the API tab of the playground. The playground also auto-generates a code sample in Python, JavaScript, or cURL for the parameters you've set.
Commercial usage rights depend on the model's license, set by its provider (Alibaba). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.