Seedance 2.0 15% OFF | Create in Video Generator →
Home/Explore/Alibaba/Wan 2.7/Text To Image Pro

Wan 2.7 Text to Image Pro

alibaba /

WAN 2.7 Text-to-Image Pro generates high-quality images up to 4K from text prompts with thinking mode for enhanced image quality. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-image
Input
width
height
2048 × 2048 px
Range: 512 - 8192
Enable thinking mode for enhanced reasoning and better image quality. Increases generation time.

Idle

Lookbook photo of a model wearing a [dark leather bomber jacket], minimalist studio, soft side light, confident pose, magazine cover aesthetic, clean backdrop, subtle film grain, high fashion. With a "Fashion Magazine" book name in art word style at the bottom

$0.075per run·~13 / $1

Next:

ExamplesView all

two young people eating dessert together, close-up shot, wide angle lens, exaggerated perspective, sitting at an outdoor table, feeding each other with spoons, playful expressions, summer vibe, bright sunlight, pastel umbrellas above, blue sky, casual candid moment, lifestyle photography, vibrant colors, high contrast, natural skin texture, modern editorial style, high detail

two young people eating dessert together, close-up shot, wide angle lens, exaggerated perspective, sitting at an outdoor table, feeding each other with spoons, playful expressions, summer vibe, bright sunlight, pastel umbrellas above, blue sky, casual candid moment, lifestyle photography, vibrant colors, high contrast, natural skin texture, modern editorial style, high detail

Lookbook photo of a model wearing a [dark leather bomber jacket], minimalist studio, soft side light, confident pose, magazine cover aesthetic, clean backdrop, subtle film grain, high fashion. With a "Fashion Magazine" book name in art word style at the bottom

Lookbook photo of a model wearing a [dark leather bomber jacket], minimalist studio, soft side light, confident pose, magazine cover aesthetic, clean backdrop, subtle film grain, high fashion. With a "Fashion Magazine" book name in art word style at the bottom

[High-end Wireless Headphones], centered on pure white background, studio high-key lighting, crisp hard shadow, commercial packshot, 35mm perspective, ultra-sharp details, subtle floor reflection, dust-free, 8k, realistic product photography

[High-end Wireless Headphones], centered on pure white background, studio high-key lighting, crisp hard shadow, commercial packshot, 35mm perspective, ultra-sharp details, subtle floor reflection, dust-free, 8k, realistic product photography

cinematic fashion editorial, blonde woman leaning on a glossy red car hood, reflection on surface, golden hour sunlight, dramatic shadows, retro american street, pharmacy sign in background, palm trees, shallow depth of field, high fashion styling, fur detail on shoulder, jewelry and sunglasses on car, confident expression, slightly parted lips, moody atmosphere, film photography look, rich contrast, warm tones, ultra realistic, 35mm lens, editorial photography, vogue style, sharp focus, highly detailed

cinematic fashion editorial, blonde woman leaning on a glossy red car hood, reflection on surface, golden hour sunlight, dramatic shadows, retro american street, pharmacy sign in background, palm trees, shallow depth of field, high fashion styling, fur detail on shoulder, jewelry and sunglasses on car, confident expression, slightly parted lips, moody atmosphere, film photography look, rich contrast, warm tones, ultra realistic, 35mm lens, editorial photography, vogue style, sharp focus, highly detailed

Related Models

README

Wan 2.7 Text-to-Image Pro

Wan 2.7 Text-to-Image Pro is the professional tier of text-to-image generation model, supporting output resolutions up to 4K (4096×4096). With built-in thinking mode for enhanced reasoning and custom size control, it delivers higher-fidelity compositions ideal for print-ready assets, large-format displays, and any workflow where resolution and quality are the priority.

Why Choose This?

  • Up to 4K resolution output Generate images up to 4096×4096 pixels — ideal for print, large-format displays, and high-DPI screens where standard resolution falls short.

  • Thinking mode for smarter generation Built-in thinking mode enables the model to reason about prompt intent before generating, producing more coherent compositions and better prompt adherence.

  • Custom size output Set output width and height directly (512–8192 per dimension) to match banners, thumbnails, posters, or social formats exactly.

  • Seeded iteration Use a fixed seed to refine style and layout with more repeatable variations.

  • Prompt Enhancer Built-in tool to automatically improve your text descriptions for richer results.

Parameters

ParameterRequiredDescription
promptYesText description of the image subject, scene, style, lighting, and mood.
sizeNoOutput dimensions (width × height). Range: 512–8192 per dimension. Default: 1024×1024.
thinking_modeNoEnable thinking mode for enhanced reasoning and better image quality. Default: enabled.
seedNoFixed seed for repeatable iterations. Use -1 for a random seed.

How to Use

  1. Write your prompt — describe the subject, setting, and style. Use the Prompt Enhancer for better results.
  2. Choose a size — select a preset aspect ratio or set custom width and height. Examples: 2048×2048 for square, 4096×2048 for ultra-wide, 2048×4096 for tall posters.
  3. Set thinking_mode — leave enabled (default) for best quality, or disable for faster generation.
  4. Set seed (optional) — fix a seed to make iterative prompt refinements more comparable.
  5. Submit — review the result and iterate as needed.

Pricing

Just $0.075 per generated image.

Best Use Cases

  • Print & Large Format — Generate 4K-resolution assets for magazines, posters, and physical print campaigns.
  • Fashion & Lookbook — Produce high-detail model and product images at magazine-cover quality.
  • Marketing & Advertising — Create polished campaign visuals at production-ready resolutions.
  • Product Visualization — Generate fine-textured, high-fidelity product imagery for e-commerce and presentations.
  • Concept Art — Render detailed scene compositions with complex lighting, materials, and environments.

Pro Tips

  • Structure your prompt as subject + environment + style: "A modern tea shop interior, warm afternoon light, minimalist wood design, cinematic photography."
  • Add camera and composition cues when framing matters: "wide shot, shallow depth of field, 35mm film look."
  • For 4K outputs, include fine detail cues (textures, materials, lighting) to take full advantage of the higher resolution.
  • Keep thinking_mode enabled for best results — disable it only if generation speed is the priority.
  • Fix a seed while tweaking your prompt to isolate the effect of each change.

Notes

  • Only prompt is required; all other parameters are optional.
  • Output size range is 512–8192 pixels per dimension, with total pixels between 768×768 and 4096×4096 and aspect ratio between 1:8 and 8:1.
  • Thinking mode is enabled by default and improves quality but adds some latency.
  • Higher resolutions (e.g. 4096×4096) will take longer to generate than standard sizes.

Related Models

  • Wan 2.7 Text-to-Image — Standard version at lower cost for everyday generation needs.
  • Wan 2.6 Text-to-Image — Previous generation Wan text-to-image model with prompt expansion support.
  • Seedream V4 Text-to-Image — Style-consistent text-to-image for posters, campaigns, and brand-friendly illustration batches.
  • FLUX.2 Dev Text-to-Image — High-quality text-to-image with strong prompt adherence and fine detail for creative and production workflows.
Accessibility:This website uses AI models provided by third parties.

Wan 2.7 Text To Image Pro API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/alibaba/wan-2.7/text-to-image-pro with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Wan 2.7 Text To Image Pro below.

HTTP example
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/alibaba/wan-2.7/text-to-image-pro" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "size": "1024*1024",
    "thinking_mode": true,
    "seed": -1
}'

# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].
Node.js example
// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

const result = await client.run("alibaba/wan-2.7/text-to-image-pro", {
        "prompt": "A cinematic shot of a city at sunset, soft golden light",
        "size": "1024*1024",
        "thinking_mode": true,
        "seed": -1
});

console.log(result.outputs[0]); // → URL of the generated output
Python example
# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "alibaba/wan-2.7/text-to-image-pro",
    {
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "size": "1024*1024",
    "thinking_mode": true,
    "seed": -1
}
)

print(output["outputs"][0])  # → URL of the generated output

Wan 2.7 Text To Image Pro API — Frequently asked questions

What is the Wan 2.7 Text To Image Pro API?

Wan 2.7 Text To Image Pro is a Alibaba model for image generation, exposed as a REST API on WaveSpeedAI. WAN 2.7 Text-to-Image Pro generates high-quality images up to 4K from text prompts with thinking mode for enhanced image quality. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.

How do I call the Wan 2.7 Text To Image Pro API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/alibaba/alibaba-wan-2.7-text-to-image-pro.

How much does Wan 2.7 Text To Image Pro cost per run?

Wan 2.7 Text To Image Pro starts at $0.075 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Wan 2.7 Text To Image Pro accept?

Key inputs: `prompt`, `size`, `seed`, `thinking_mode`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/alibaba/alibaba-wan-2.7-text-to-image-pro.

How do I get started with the Wan 2.7 Text To Image Pro API?

Sign up for a free WaveSpeedAI account to claim starter credits, copy your API key from /accesskey, then call the endpoint shown in the API tab of the playground. The playground also auto-generates a code sample in Python, JavaScript, or cURL for the parameters you've set.

Can I use Wan 2.7 Text To Image Pro outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (Alibaba). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.