
Seedance 2.0 Fast Text to Video


Seedance 2.0 Fast (Text-to-Video) generates cinematic videos from text prompts with native audio-visual synchronization, director-level camera and lighting control, and exceptional motion stability — optimized for faster generation at lower cost. Built on Seed's unified multimodal architecture.

Input

  • enable_web_search: Enable web search for real-time information.
  • generate_audio: Whether to generate native audio synchronized with the output video. Defaults to true.


Examples

A 19-year-old girl with paint-stained fingers and an overstuffed backpack rides the Trans-Siberian Railway alone through a blizzard, sketchbook open on her lap. Shot 1: Close-up — her pencil moves furiously across the page, drawing the frozen birch forest blurring past the window, breath fogging the glass beside her face. Shot 2: Wide shot — the train car is nearly empty, a single dim overhead light swaying, snow pressing against every window, she is the only thing warm and alive in the frame. Shot 3: Medium shot — an elderly Russian man across the aisle silently sets a cup of tea on her table without a word and returns to his seat; she looks up, stunned. Shot 4: Close-up on the sketchbook — the drawing is extraordinary, hyperdetailed, and in the corner she has written in tiny letters: day 4. still not scared. Shot 5: She presses her forehead against the cold window, eyes wide open, watching the endless white world fly past — and very slowly, she begins to smile. Quiet and immense, the film captures the exact feeling of being 19 and choosing the world over safety.

An extremely frail elderly ballerina, 80s, in a tattered tutu, performs alone on an abandoned theater stage lit only by a single spotlight. Shot 1: Close-up on her gnarled, arthritic feet slowly sliding into first position on the dusty stage floor, the sound of creaking wood beneath her. Shot 2: Wide shot — she raises her arms overhead with trembling elegance, spine straightening inch by inch, the empty velvet seats stretching into darkness behind her. Shot 3: Medium shot — she begins to turn, slowly at first, then faster, her tutu catching the light, dust swirling around her ankles like smoke. Shot 4: Low-angle shot — she launches into a grand jeté, body suspended impossibly in the air for a breathless moment, face locked in pure, fierce concentration. Shot 5: She lands, staggers one step, then stands perfectly still — chest heaving, tears streaming silently — and takes a deep, solitary bow to no one. The mood is bittersweet and haunting, soaked in faded glory and unbroken love for a life lived in motion.


README

Seedance 2.0 Fast Text-to-Video

Seedance 2.0 Fast is the speed-optimized version of Seed's latest video generation model. The Text-to-Video mode generates cinematic videos from text prompts with native audio synchronization — faster and at 33% lower cost than the standard version, ideal for rapid iteration and high-volume production.

Key Features

  • Speed-optimized generation: Faster processing for quick turnaround on video projects, well suited to iteration and prototyping.

  • 33% lower cost: $0.80 per 5 seconds vs $1.20 for the standard version, ideal for high-volume production.

  • Unified multimodal architecture: Same Seedance 2.0 foundation handling text, image, audio, and video inputs.

  • Native audio-visual synchronization: Generates video with synchronized audio in a single pass.

  • Director-level control: Camera movement, lighting, shadows, and character performance controlled through prompts.

  • Strong motion stability: Coherent motion with stable subjects and fluid transitions.
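The "33% lower cost" figure can be checked from the two per-5-second prices quoted in the feature list. The snippet below is only a quick arithmetic check; the variable names are illustrative.

```python
# Prices per 5 seconds as quoted in the feature list above.
fast_price = 0.80
standard_price = 1.20

# Relative saving of the Fast version compared with the standard version.
saving = (standard_price - fast_price) / standard_price
print(f"{saving:.1%}")  # prints 33.3%
```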

Parameters

| Parameter | Required | Description |
| --- | --- | --- |
| prompt | Yes | Detailed description of the cinematic scene |
| aspect_ratio | No | Output format: 16:9 (default), 9:16, 4:3, 3:4, 1:1, 21:9 |
| duration | No | Video length in seconds: 4-15 (default: 5) |
| resolution | No | Output resolution: 480p, 720p (default), or 1080p |
| reference_images | No | Reference image URLs to guide style, characters, or composition |
| reference_videos | No | Reference video URLs (total length must not exceed 15 seconds) |
| reference_audios | No | Reference audio URLs (total length must not exceed 15 seconds) |
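As a rough illustration of how these parameters fit together, the sketch below checks a request body against the documented values before it is sent. The helper name `build_payload` is hypothetical and not part of any WaveSpeedAI SDK, and the server may validate differently (for example, it is an assumption that non-integer durations are accepted).

```python
# Allowed values copied from the parameter table above.
ASPECT_RATIOS = {"16:9", "9:16", "4:3", "3:4", "1:1", "21:9"}
RESOLUTIONS = {"480p", "720p", "1080p"}


def build_payload(prompt, aspect_ratio="16:9", duration=5, resolution="720p",
                  reference_images=None, reference_videos=None,
                  reference_audios=None):
    """Hypothetical helper: validate inputs and return a JSON-ready dict."""
    if not prompt or not prompt.strip():
        raise ValueError("prompt is required")
    if aspect_ratio not in ASPECT_RATIOS:
        raise ValueError(f"unsupported aspect_ratio: {aspect_ratio}")
    if not 4 <= duration <= 15:
        raise ValueError("duration must be between 4 and 15 seconds")
    if resolution not in RESOLUTIONS:
        raise ValueError(f"unsupported resolution: {resolution}")

    payload = {
        "prompt": prompt,
        "aspect_ratio": aspect_ratio,
        "duration": duration,
        "resolution": resolution,
    }
    # Optional reference lists are only included when provided.
    optional = {
        "reference_images": reference_images,
        "reference_videos": reference_videos,
        "reference_audios": reference_audios,
    }
    for key, urls in optional.items():
        if urls:
            payload[key] = list(urls)
    return payload
```

Calling `build_payload("A city at sunset")` returns a body with the documented defaults (16:9, 5 seconds, 720p); an out-of-range duration raises `ValueError` before any request is made.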

How to Use

  1. Write your prompt — describe the scene with cinematic detail.
  2. Select aspect ratio — 16:9 for widescreen, 9:16 for vertical, 4:3 or 3:4 for classic formats.
  3. Set duration — choose any duration from 4 to 15 seconds.
  4. Optionally add references — provide reference images, videos, or audios for style guidance.
  5. Run — submit and download your video.

Pricing

Without Reference Videos

Billed per 5-second block of output duration.

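| Resolution | Duration | Cost |
| --- | --- | --- |
| 480p | 5 s | $0.50 |
| 480p | 10 s | $1.00 |
| 480p | 15 s | $1.50 |
| 720p | 5 s | $1.00 |
| 720p | 10 s | $2.00 |
| 720p | 15 s | $3.00 |
| 1080p | 5 s | $2.50 |
| 1080p | 10 s | $5.00 |
| 1080p | 15 s | $7.50 |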
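The table can be reproduced with a small calculation: $0.50 per 5-second block at 480p, multiplied by 2 for 720p and by 5 for 1080p. The sketch below is an estimate only. The function name `block_cost` is hypothetical, and rounding partial blocks up is an assumption, since the table lists only 5, 10, and 15 seconds.

```python
import math

BASE_480P_PER_BLOCK = 0.50  # USD per 5-second block at 480p
RESOLUTION_MULTIPLIER = {"480p": 1.0, "720p": 2.0, "1080p": 5.0}
BLOCK_SECONDS = 5


def block_cost(resolution, duration_s):
    """Estimate the cost in USD of a run without reference videos."""
    # ASSUMPTION: a partial block is billed as a full block.
    blocks = math.ceil(duration_s / BLOCK_SECONDS)
    return round(BASE_480P_PER_BLOCK * RESOLUTION_MULTIPLIER[resolution] * blocks, 2)


print(block_cost("720p", 10))   # 2.0
print(block_cost("1080p", 15))  # 7.5
```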

With Reference Videos

When reference_videos are provided, billing follows the same scheme as Seedance 2.0 Fast Video-Edit: billed per second across input duration + output duration, where input duration is the total length of the supplied reference videos clamped to the 2-15 s range.

| Resolution | Per second |
| --- | --- |
| 480p | $0.065 |
| 720p | $0.13 |
| 1080p | $0.325 |

Examples (reference videos totaling 5 s, output 5 s = 10 billed seconds):

| Resolution | Cost |
| --- | --- |
| 480p | $0.65 |
| 720p | $1.30 |
| 1080p | $3.25 |
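The per-second scheme can be sketched as follows: clamp the total reference-video length to the 2-15 s range, add the output duration, and multiply by the per-second rate. The function name `reference_video_cost` is hypothetical, and the result is an estimate, not the platform's billing code.

```python
# Per-second rates in USD, copied from the table above.
PER_SECOND = {"480p": 0.065, "720p": 0.13, "1080p": 0.325}


def reference_video_cost(resolution, reference_total_s, output_s):
    """Estimate the cost in USD of a run that supplies reference videos."""
    # Input duration is the total reference length clamped to 2-15 seconds.
    billed_input = min(max(reference_total_s, 2), 15)
    billed_seconds = billed_input + output_s
    return round(PER_SECOND[resolution] * billed_seconds, 3)


# 5 s of reference video plus 5 s of output = 10 billed seconds.
print(reference_video_cost("720p", 5, 5))  # 1.3
```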

Billing Rules

  • Without reference videos: $0.50 per 5 seconds at 480p, scaled by resolution.
  • With reference videos: per-second billing matching Seedance 2.0 Fast Video-Edit, using the total reference-video duration as input (clamped 2-15 s) plus the output duration.
  • 720p: 2x the 480p price.
  • 1080p: 5x the 480p price (2.5x the 720p price).
  • Duration range: 4-15 seconds (continuous).

Best Use Cases

  • Rapid Prototyping — Quickly iterate on concepts before committing to the standard version.
  • High-Volume Production — Cost-effective generation for large content libraries.
  • Social Media Content — Fast turnaround for short-form video needs.
  • A/B Testing — Generate multiple variations efficiently to find the best creative direction.

Pro Tips

  • Use Fast for iteration and testing, then switch to standard Seedance 2.0 for final quality.
  • Write prompts like a film director — include lighting, camera angles, and mood.
  • Start with 5s to iterate, then extend once the look is right.

Notes

  • Native audio generation included.
  • Duration range: 4-15 seconds (continuous).
  • For highest quality output, consider the standard Seedance 2.0.


Seedance 2.0 Fast Text To Video API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/bytedance/seedance-2.0-fast/text-to-video with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status becomes completed, then read the output URL from data.outputs[0]. HTTP, Node.js, and Python examples follow.

HTTP example
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/bytedance/seedance-2.0-fast/text-to-video" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "aspect_ratio": "16:9",
    "resolution": "720p",
    "duration": 5,
    "enable_web_search": false,
    "generate_audio": true
}'

# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].

Node.js example
// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

// Top-level await is not available in CommonJS modules,
// so the call is wrapped in an async function.
async function main() {
  const result = await client.run("bytedance/seedance-2.0-fast/text-to-video", {
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "aspect_ratio": "16:9",
    "resolution": "720p",
    "duration": 5,
    "enable_web_search": false,
    "generate_audio": true
  });

  console.log(result.outputs[0]); // → URL of the generated output
}

main().catch(console.error);

Python example
# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "bytedance/seedance-2.0-fast/text-to-video",
    {
        "prompt": "A cinematic shot of a city at sunset, soft golden light",
        "aspect_ratio": "16:9",
        "resolution": "720p",
        "duration": 5,
        "enable_web_search": False,
        "generate_audio": True,
    },
)

print(output["outputs"][0])  # → URL of the generated output

Seedance 2.0 Fast Text To Video API — Frequently asked questions

What is the Seedance 2.0 Fast Text To Video API?

Seedance 2.0 Fast Text To Video is a ByteDance video generation model, exposed as a REST API on WaveSpeedAI. It generates cinematic videos from text prompts with native audio-visual synchronization, director-level camera and lighting control, and strong motion stability, and it is optimized for faster generation at lower cost. It is built on Seed's unified multimodal architecture. You can call it programmatically or try it from the playground above.

How do I call the Seedance 2.0 Fast Text To Video API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/bytedance/bytedance-seedance-2.0-fast-text-to-video.
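The polling step can be sketched with the Python standard library alone. This is a minimal sketch under stated assumptions, not the official client. The function names are hypothetical. It assumes the status field sits under `data` alongside `outputs`, and that a failed prediction reports the status `"failed"`. Check the API reference for the real response shape.

```python
import json
import time
import urllib.request

API_BASE = "https://api.wavespeed.ai/api/v3"


def fetch_result(request_id, api_key):
    """GET the prediction result endpoint and return the parsed JSON body."""
    req = urllib.request.Request(
        f"{API_BASE}/predictions/{request_id}/result",
        headers={"Authorization": f"Bearer {api_key}"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def wait_for_output(request_id, api_key, fetch=fetch_result,
                    interval_s=2.0, timeout_s=300.0, sleep=time.sleep):
    """Poll until the prediction completes, then return the first output URL."""
    deadline = time.monotonic() + timeout_s
    while time.monotonic() < deadline:
        # ASSUMPTION: status and outputs both sit under the "data" key.
        data = fetch(request_id, api_key).get("data", {})
        status = data.get("status")
        if status == "completed":
            return data["outputs"][0]
        if status == "failed":  # ASSUMPTION: name of the failure status
            raise RuntimeError(f"prediction {request_id} failed")
        sleep(interval_s)
    raise TimeoutError(f"prediction {request_id} did not complete in {timeout_s} s")
```

Passing `fetch` and `sleep` as arguments keeps the loop testable without network access; in normal use, `wait_for_output(request_id, api_key)` is enough.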

How much does Seedance 2.0 Fast Text To Video cost per run?

Seedance 2.0 Fast Text To Video starts at $0.50 per run, the price of a 5-second 480p video without reference videos. The final charge scales with resolution, duration, and whether reference videos are supplied, so a longer or higher-resolution output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Seedance 2.0 Fast Text To Video accept?

Key inputs: `prompt`, `aspect_ratio`, `resolution`, `duration`, `reference_images`, `enable_web_search`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/bytedance/bytedance-seedance-2.0-fast-text-to-video.

How do I get started with the Seedance 2.0 Fast Text To Video API?

Sign up for a free WaveSpeedAI account to claim starter credits, copy your API key from /accesskey, then call the endpoint shown in the API tab of the playground. The playground also auto-generates a code sample in Python, JavaScript, or cURL for the parameters you've set.

Can I use Seedance 2.0 Fast Text To Video outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (ByteDance). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.