Seedance 2.0 Fast (Text-to-Video) generates cinematic videos from text prompts with native audio-visual synchronization, director-level camera and lighting control, and exceptional motion stability — optimized for faster generation at lower cost. Built on Seed's unified multimodal architecture.
Idle
$0.5per run·~20 / $10
A 19-year-old girl with paint-stained fingers and a overstuffed backpack rides the Trans-Siberian Railway alone through a blizzard, sketchbook open on her lap. Shot 1: Close-up — her pencil moves furiously across the page, drawing the frozen birch forest blurring past the window, breath fogging the glass beside her face. Shot 2: Wide shot — the train car is nearly empty, a single dim overhead light swaying, snow pressing against every window, she is the only thing warm and alive in the frame. Shot 3: Medium shot — an elderly Russian man across the aisle silently sets a cup of tea on her table without a word and returns to his seat; she looks up, stunned. Shot 4: Close-up on the sketchbook — the drawing is extraordinary, hyperdetailed, and in the corner she has written in tiny letters: day 4. still not scared. Shot 5: She presses her forehead against the cold window, eyes wide open, watching the endless white world fly past — and very slowly, she begins to smile. Quiet and immense, the film captures the exact feeling of being 19 and choosing the world over safety.
An extremely frail elderly ballerina, 80s, in a tattered tutu, performs alone on an abandoned theater stage lit only by a single spotlight. Shot 1: Close-up on her gnarled, arthritic feet slowly sliding into first position on the dusty stage floor, the sound of creaking wood beneath her. Shot 2: Wide shot — she raises her arms overhead with trembling elegance, spine straightening inch by inch, the empty velvet seats stretching into darkness behind her. Shot 3: Medium shot — she begins to turn, slowly at first, then faster, her tutu catching the light, dust swirling around her ankles like smoke. Shot 4: Low-angle shot — she launches into a grand jeté, body suspended impossibly in the air for a breathless moment, face locked in pure, fierce concentration. Shot 5: She lands, staggers one step, then stands perfectly still — chest heaving, tears streaming silently — and takes a deep, solitary bow to no one. The mood is bittersweet and haunting, soaked in faded glory and unbroken love for a life lived in motion.
Seedance 2.0 Fast is the speed-optimized version of Seed's latest video generation model. The Text-to-Video mode generates cinematic videos from text prompts with native audio synchronization — faster and at 33% lower cost than the standard version, ideal for rapid iteration and high-volume production.
Speed-optimized generation Faster processing for quick turnaround on video projects, perfect for iteration and prototyping.
33% lower cost $0.80 per 5 seconds vs $1.20 for the standard version — ideal for high-volume production.
Unified multimodal architecture Same Seedance 2.0 foundation handling text, image, audio, and video inputs.
Native audio-visual synchronization Generates video with synchronized audio in a single pass.
Director-level control Camera movement, lighting, shadows, and character performance controlled through prompts.
Strong motion stability Coherent motion with stable subjects and fluid transitions.
| Parameter | Required | Description |
|---|---|---|
| prompt | Yes | Detailed description of the cinematic scene |
| aspect_ratio | No | Output format: 16:9 (default), 9:16, 4:3, 3:4, 1:1, 21:9 |
| duration | No | Video length in seconds: 4-15 (default: 5) |
| resolution | No | Output resolution: 480p, 720p (default), or 1080p |
| reference_images | No | Reference image URLs to guide style, characters, or composition |
| reference_videos | No | Reference video URLs (total length must not exceed 15 seconds) |
| reference_audios | No | Reference audio URLs (total length must not exceed 15 seconds) |
Billed per 5-second block of output duration.
| Resolution | Duration | Cost |
|---|---|---|
| 480p | 5 s | $0.50 |
| 480p | 10 s | $1.00 |
| 480p | 15 s | $1.50 |
| 720p | 5 s | $1.00 |
| 720p | 10 s | $2.00 |
| 720p | 15 s | $3.00 |
| 1080p | 5 s | $2.50 |
| 1080p | 10 s | $5.00 |
| 1080p | 15 s | $7.50 |
When reference_videos are provided, billing follows the same scheme as Seedance 2.0 Fast Video-Edit: billed per second across input duration + output duration, where input duration is the total length of the supplied reference videos clamped to the 2-15 s range.
| Resolution | Per second |
|---|---|
| 480p | $0.065 |
| 720p | $0.13 |
| 1080p | $0.325 |
Examples (reference videos totaling 5 s, output 5 s = 10 billed seconds):
| Resolution | Cost |
|---|---|
| 480p | $0.65 |
| 720p | $1.30 |
| 1080p | $3.25 |
Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/bytedance/seedance-2.0-fast/text-to-video with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Seedance 2.0 Fast Text To Video below.
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/bytedance/seedance-2.0-fast/text-to-video" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $WAVESPEED_API_KEY" \
-d '{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"aspect_ratio": "16:9",
"resolution": "720p",
"duration": 5,
"enable_web_search": false,
"generate_audio": true
}'
# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
-H "Authorization: Bearer $WAVESPEED_API_KEY"
# When status is "completed", read the output from data.outputs[0].// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env
const result = await client.run("bytedance/seedance-2.0-fast/text-to-video", {
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"aspect_ratio": "16:9",
"resolution": "720p",
"duration": 5,
"enable_web_search": false,
"generate_audio": true
});
console.log(result.outputs[0]); // → URL of the generated output# pip install wavespeed
import wavespeed
output = wavespeed.run(
"bytedance/seedance-2.0-fast/text-to-video",
{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"aspect_ratio": "16:9",
"resolution": "720p",
"duration": 5,
"enable_web_search": false,
"generate_audio": true
}
)
print(output["outputs"][0]) # → URL of the generated outputSeedance 2.0 Fast Text To Video is a ByteDance model for video generation, exposed as a REST API on WaveSpeedAI. Seedance 2.0 Fast (Text-to-Video) generates cinematic videos from text prompts with native audio-visual synchronization, director-level camera and lighting control, and exceptional motion stability — optimized for faster generation at lower cost. Built on Seed's unified multimodal architecture. You can call it programmatically or try it from the playground above.
POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/bytedance/bytedance-seedance-2.0-fast-text-to-video.
Seedance 2.0 Fast Text To Video starts at $0.50 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.
Key inputs: `prompt`, `aspect_ratio`, `resolution`, `duration`, `reference_images`, `enable_web_search`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/bytedance/bytedance-seedance-2.0-fast-text-to-video.
Sign up for a free WaveSpeedAI account to claim starter credits, copy your API key from /accesskey, then call the endpoint shown in the API tab of the playground. The playground also auto-generates a code sample in Python, JavaScript, or cURL for the parameters you've set.
Commercial usage rights depend on the model's license, set by its provider (ByteDance). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.