Kuaishou · video · From $0.42/run

Kling 3.0 API

Kuaishou Kling 3.0 — text-to-video and image-to-video with smooth motion, cinematic visuals, accurate prompt adherence, and native audio. Three tiers: Standard, Pro, and 4K.

Standard tier for default delivery; Pro tier for top-tier quality; 4K tier for delivery at 4K with multi-prompt and start/end frame control. A separate motion-control endpoint (Std / Pro) transfers motion from reference videos.

About the Kling 3.0 API

What Kling 3.0 does, how it fits in the Kuaishou model lineup, and why teams reach for it.

Kling 3.0 is a video generation model from Kuaishou, available through the WaveSpeedAI REST API. It covers text-to-video and image-to-video generation with smooth motion, cinematic visuals, accurate prompt adherence, and native audio, across three tiers: Standard, Pro, and 4K.

The Kling 3.0 family on WaveSpeedAI ships 8 REST endpoints covering text-to-video, image-to-video, and motion-control workflows. Each variant carries its own pricing, parameters, and example outputs. Pick the one that matches your input modality and production constraints, or call several from the same API key to compose multi-step pipelines.

Run Kling 3.0 through the same API key, billing account, and rate-limit envelope you use for the other 1,000+ AI models on WaveSpeedAI. There is no separate vendor setup and no per-provider SDK: one integration covers everything from text-to-image and text-to-video through audio synthesis, 3D generation, upscaling, and editing.

All Kling 3.0 API endpoints

8 endpoints available now on WaveSpeedAI — pick the variant that matches your workflow.

Text To Video

Kling V3.0 4K delivers top-tier 4K text-to-video generation with smooth motion, cinematic visuals, accurate prompt adherence, and optional audio. Supports flexible aspect ratios, multi-prompt, and element references. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

text-to-video · from $2.10

Text To Video

Kling 3.0 Pro delivers top-tier text-to-video generation with smooth motion, cinematic visuals, accurate prompt adherence, and native audio for ready-to-share clips. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

text-to-video · from $0.56

Text To Video

Kling 3.0 Standard delivers high-quality text-to-video generation with smooth motion, cinematic visuals, accurate prompt adherence, and native audio for ready-to-share clips. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

text-to-video · from $0.42

Motion Control

Kling 3.0 Standard Motion Control transfers motion from reference videos to animate still images. Upload a character image and a motion clip (dance, action, gesture), and the model extracts the movement to generate smooth, realistic video. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

motion-control · from $0.63

Motion Control

Kling 3.0 Pro Motion Control transfers motion from reference videos to animate still images. Upload a character image and a motion clip (dance, action, gesture), and the model extracts the movement to generate smooth, realistic video. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

motion-control · from $0.84

Image To Video

Kling V3.0 4K delivers top-tier 4K image-to-video generation with smooth motion, cinematic visuals, accurate prompt adherence, and optional audio. Supports start/end frame control, multi-prompt, and element references. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

image-to-video · from $2.10

Image To Video

Kling 3.0 Pro delivers top-tier image-to-video generation with smooth motion, cinematic visuals, accurate prompt adherence, and native audio for ready-to-share clips. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

image-to-video · from $0.56

Image To Video

Kling 3.0 Standard delivers high-quality image-to-video generation with smooth motion, cinematic visuals, accurate prompt adherence, and native audio for ready-to-share clips. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

image-to-video · from $0.42

How to use the Kling 3.0 API

Four steps from signup to a finished generation. Full Python, Node.js, and cURL examples are in the API section below.

1. Get an API key
Sign up for a WaveSpeedAI account and copy your API key from the dashboard. New accounts come with free starter credits, enough to run the playground a few dozen times before billing kicks in.
2. Submit a prediction
POST your input as JSON to https://api.wavespeed.ai/api/v3/kwaivgi/kling-v3.0-std/text-to-video. The endpoint returns a prediction id immediately; generations are asynchronous, so you don't hold a connection open during inference.
3. Poll for completion
GET https://api.wavespeed.ai/api/v3/predictions/{request_id}/result every 1-2 seconds, substituting the prediction id for {request_id}. The response includes a status field; keep polling until it changes from "queued" or "processing" to "completed".
4. Read the output URL
Once status is "completed", read the URL from data.outputs[0]. The URL points to your generated video on the WaveSpeedAI CDN.
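
The four steps above can be sketched in Python as a submit call followed by a polling loop. This is a minimal sketch under stated assumptions, not the official client: it assumes the submit response carries the prediction id at data.id, that the status field sits at data.status, that a "failed" status exists alongside the ones named above, and that the request body accepts a prompt field. The function names are hypothetical, and the HTTP transport is passed in as an argument so the loop can be exercised without network access.

```python
import json
import time
import urllib.request

API_BASE = "https://api.wavespeed.ai/api/v3"
MODEL = "kwaivgi/kling-v3.0-std/text-to-video"


def http_json(method, url, api_key, body=None):
    """Send one JSON request with urllib and return the decoded response."""
    data = json.dumps(body).encode("utf-8") if body is not None else None
    req = urllib.request.Request(url, data=data, method=method)
    req.add_header("Authorization", f"Bearer {api_key}")
    req.add_header("Content-Type", "application/json")
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def generate(payload, api_key, send=http_json, interval=2.0, timeout=600.0,
             sleep=time.sleep):
    """Submit a prediction, poll until it finishes, and return the output URL."""
    submitted = send("POST", f"{API_BASE}/{MODEL}", api_key, payload)
    request_id = submitted["data"]["id"]  # assumed location of the prediction id
    result_url = f"{API_BASE}/predictions/{request_id}/result"

    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        data = send("GET", result_url, api_key)["data"]
        status = data["status"]  # assumed to sit under "data"
        if status == "completed":
            return data["outputs"][0]
        if status == "failed":  # assumed failure status
            raise RuntimeError(f"prediction {request_id} failed: {data}")
        sleep(interval)  # still "queued" or "processing"
    raise TimeoutError(f"prediction {request_id} did not finish in {timeout}s")
```

A call would look like generate({"prompt": "..."}, api_key), with the key read from the WAVESPEED_API_KEY environment variable. The explicit timeout keeps a stuck prediction from polling forever.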

What you can build with Kling 3.0

Common workflows developers and creators use the Kling 3.0 API for.

Text-to-video with native audio

kwaivgi/kling-v3.0-std/text-to-video generates clips with smooth motion, cinematic visuals, and native audio, ready to share. The default Kling variant for delivery work.

text-to-video · audio · cinematic

Image-to-video animation

kwaivgi/kling-v3.0-std/image-to-video — animate a still image with the same audio and motion characteristics as the text-to-video variant.

image-to-video · animate · reference

Pro tier for top quality

kwaivgi/kling-v3.0-pro/text-to-video and kwaivgi/kling-v3.0-pro/image-to-video: about 33% more than Standard at the starting price ($0.56 vs. $0.42) for top-tier output. The same prompt format works across Std and Pro.

pro-tier · quality · delivery

4K tier with multi-prompt + start/end control

kwaivgi/kling-v3.0-4k/text-to-video and kwaivgi/kling-v3.0-4k/image-to-video: top-tier 4K generation with support for start/end frame control and multi-prompt. At the starting price, the 4K tier costs 5× the Standard tier ($2.10 vs. $0.42).

4k · multi-prompt · start-end

Motion control (separate endpoint)

kwaivgi/kling-v3.0-std/motion-control and kwaivgi/kling-v3.0-pro/motion-control transfer motion from a reference video to a still character image, replaying a dance, action, or gesture without motion capture.

motion-control · pose · animate

Native audio across tiers

The sound parameter gives Kling 3.0 native audio synthesis across Std, Pro, and 4K tiers — toggle it per generation rather than picking a separate audio model. Audio adds to the per-call cost; disable it if you'll add audio in post anyway.

audio · native · toggle
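
The workflows above map onto the eight endpoint paths. A small helper along these lines could build the request URL and reject combinations this page does not list, such as motion-control on the 4K tier. It is a sketch only: the function and constant names are hypothetical, and it assumes every variant follows the same URL pattern as the Standard text-to-video URL shown earlier.

```python
API_BASE = "https://api.wavespeed.ai/api/v3"

# Tier/task combinations listed on this page (8 endpoints in total).
TASKS_BY_TIER = {
    "std": {"text-to-video", "image-to-video", "motion-control"},
    "pro": {"text-to-video", "image-to-video", "motion-control"},
    "4k": {"text-to-video", "image-to-video"},  # no 4K motion-control is listed
}


def endpoint_url(tier, task):
    """Return the REST URL for a Kling 3.0 tier/task pair, or raise ValueError."""
    tasks = TASKS_BY_TIER.get(tier)
    if tasks is None:
        raise ValueError(f"unknown tier {tier!r}; expected one of {sorted(TASKS_BY_TIER)}")
    if task not in tasks:
        raise ValueError(f"{task!r} is not available on the {tier!r} tier")
    return f"{API_BASE}/kwaivgi/kling-v3.0-{tier}/{task}"
```

Validating the pair locally turns a typo in the tier or task into an immediate error instead of a failed HTTP request.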

Tips for prompting Kling 3.0

Practical advice for getting better outputs from Kling 3.0 — drawn from the patterns that work across video models in production pipelines.

Pick the right tier (Standard / Pro / 4K)

Standard is the default delivery-grade tier. Pro is the premium quality tier (33% more than Standard). 4K is the top tier — significantly more expensive but ships start/end frame control and multi-prompt that the lower tiers don't expose.
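
As a rough illustration of the guidance above, the tier choice can be written as a small decision function. The function name and its flags are hypothetical; the returned slugs mirror the std, pro, and 4k segments used in the endpoint paths.

```python
def pick_tier(needs_keyframes=False, needs_multi_prompt=False, top_quality=False):
    """Map production requirements to a Kling 3.0 tier slug: "std", "pro", or "4k"."""
    if needs_keyframes or needs_multi_prompt:
        return "4k"   # only tier described as exposing start/end frames and multi-prompt
    if top_quality:
        return "pro"  # premium quality tier
    return "std"      # default delivery-grade tier
```

The feature checks come first because start/end frame control and multi-prompt are hard requirements, while quality is a preference that can be traded against cost.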

Native audio is toggle-able

The sound parameter is exposed across Kling 3.0 variants — enable audio when you need ready-to-share clips with synchronized sound, disable when you'll add audio in post.

4K tier ships start/end frame control

kwaivgi/kling-v3.0-4k/* supports start/end frame control and multi-prompt features that aren't on the Standard or Pro tiers. Pick 4K when you need keyframe-level control of the generated clip.

Motion-control is a separate sub-endpoint

When you need to transfer specific motion from a reference clip to a still character image, switch to kwaivgi/kling-v3.0-std/motion-control (or the Pro version). Standard Kling 3.0 generates plausible motion; motion-control replays the specific motion from your reference.

Don't use Kling for precise camera-language work

Specific cinematography vocab (anamorphic, crane, jib, dolly-zoom) maps more cleanly to Seedance 2.0 or Veo 3.1. Use Kling 3.0 for its quality tiers and motion-control sub-endpoint; switch models when precise camera control is the requirement.

Kling 3.0 API pricing

Pricing is per-output. The final charge scales with the parameters you set in each variant's playground (resolution, duration, output count, references).

Endpoint	Type	Starting price
kwaivgi/kling-v3.0-4k/text-to-video	text-to-video	$2.10
kwaivgi/kling-v3.0-pro/text-to-video	text-to-video	$0.56
kwaivgi/kling-v3.0-std/text-to-video	text-to-video	$0.42
kwaivgi/kling-v3.0-std/motion-control	motion-control	$0.63
kwaivgi/kling-v3.0-pro/motion-control	motion-control	$0.84
kwaivgi/kling-v3.0-4k/image-to-video	image-to-video	$2.10
kwaivgi/kling-v3.0-pro/image-to-video	image-to-video	$0.56
kwaivgi/kling-v3.0-std/image-to-video	image-to-video	$0.42
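
To make the tier premiums concrete, the starting prices in the table can be compared directly. This is a minimal sketch: the helper names are hypothetical, and because the figures are starting prices, the results are lower bounds, not quotes for a specific resolution or duration.

```python
from decimal import Decimal

# Starting prices in USD, copied from the table above.
STARTING_PRICE = {
    "kwaivgi/kling-v3.0-4k/text-to-video": Decimal("2.10"),
    "kwaivgi/kling-v3.0-pro/text-to-video": Decimal("0.56"),
    "kwaivgi/kling-v3.0-std/text-to-video": Decimal("0.42"),
    "kwaivgi/kling-v3.0-std/motion-control": Decimal("0.63"),
    "kwaivgi/kling-v3.0-pro/motion-control": Decimal("0.84"),
    "kwaivgi/kling-v3.0-4k/image-to-video": Decimal("2.10"),
    "kwaivgi/kling-v3.0-pro/image-to-video": Decimal("0.56"),
    "kwaivgi/kling-v3.0-std/image-to-video": Decimal("0.42"),
}


def relative_to_standard(endpoint):
    """Starting price as a multiple of the Standard price for the same task."""
    task = endpoint.rsplit("/", 1)[1]
    base = STARTING_PRICE[f"kwaivgi/kling-v3.0-std/{task}"]
    return STARTING_PRICE[endpoint] / base


def min_budget(endpoint, runs):
    """Lower bound on spend for `runs` generations at the starting price."""
    return STARTING_PRICE[endpoint] * runs
```

With these figures, Pro works out to about 1.33× Standard for all three tasks, and 4K to 5× Standard for text-to-video and image-to-video.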

Call the Kling 3.0 API

Sign up for an API key at wavespeed.ai/accesskey, then submit a prediction via REST. The playground generates ready-to-paste samples for any combination of inputs.

HTTP example

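# 1. Submit a prediction
# The "prompt" field name is an assumption; check the playground for the exact input schema.
curl -X POST "https://api.wavespeed.ai/api/v3/kwaivgi/kling-v3.0-std/text-to-video" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{"prompt": "A paper boat drifting down a rain-soaked street"}'

# 2. Poll the result until status = "completed"
# Replace {request_id} with the prediction id returned by step 1.
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# 3. Read the output URL from data.outputs[0].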

Node.js example

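// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY

// CommonJS has no top-level await, so run the call inside an async function.
async function main() {
  // Input field names such as "prompt" are assumptions; check the playground schema.
  const input = { prompt: "A paper boat drifting down a rain-soaked street" };
  const result = await client.run("kwaivgi/kling-v3.0-std/text-to-video", input);
  console.log(result.outputs[0]); // → URL of the generated output
}
main().catch(console.error);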

Python example

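# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "kwaivgi/kling-v3.0-std/text-to-video",
    # The "prompt" field name is an assumption; check the playground schema.
    {"prompt": "A paper boat drifting down a rain-soaked street"},
)
print(output["outputs"][0])  # → URL of the generated output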

Kling 3.0 vs alternatives

When to pick Kling 3.0 over similar models on WaveSpeedAI.

Kling 3.0 vs Seedance 2.0

Seedance 2.0 ships native audio across every variant and offers a Turbo tier (1080p at near-480p speed). Kling 3.0 has a 4K tier with multi-prompt and start/end frame control, which Seedance doesn't ship.

Kling 3.0 vs Veo 3.1

Veo 3.1 has a stronger reputation for photorealistic human faces and outputs native 1080p across tiers. Kling 3.0 is cheaper for most tiers and has a 4K option with start/end frame control.

Kling 3.0 vs Wan 2.7

Wan 2.7 has reference-to-video, video-edit, video-extend, and image-edit variants in one family. Kling 3.0 stays focused on text-to-video and image-to-video with three quality tiers (Standard / Pro / 4K) plus motion-control as a sub-endpoint.

Kling 3.0 API — Frequently asked questions

Pricing, license, integration — common questions about running Kling 3.0 on WaveSpeedAI.

What is the Kling 3.0 API?

Kling 3.0 is a Kuaishou video generation model exposed as a REST API on WaveSpeedAI. It supports text-to-video and image-to-video generation with smooth motion, cinematic visuals, accurate prompt adherence, and native audio, in three tiers: Standard, Pro, and 4K. You can call it programmatically or try it in the playground.

How do I call the Kling 3.0 API?

Sign up for a WaveSpeedAI account, copy your API key from wavespeed.ai/accesskey, then POST to https://api.wavespeed.ai/api/v3/kwaivgi/kling-v3.0-std/text-to-video with your input as JSON. The endpoint returns a prediction id; poll https://api.wavespeed.ai/api/v3/predictions/{request_id}/result until status changes to "completed", then read the output URL from data.outputs[0]. Full Python / Node.js / cURL examples are above.

How much does the Kling 3.0 API cost?

Kling 3.0 starts at $0.42 per run. The exact cost scales with the parameters you set (resolution, duration, output count, references). The live cost preview next to the Generate button in the playground shows the exact price for your current input.

Which Kling 3.0 variants are available?

WaveSpeedAI hosts 8 Kling 3.0 endpoints: kwaivgi/kling-v3.0-4k/text-to-video, kwaivgi/kling-v3.0-pro/text-to-video, kwaivgi/kling-v3.0-std/text-to-video, kwaivgi/kling-v3.0-std/motion-control, kwaivgi/kling-v3.0-pro/motion-control, kwaivgi/kling-v3.0-4k/image-to-video, kwaivgi/kling-v3.0-pro/image-to-video, kwaivgi/kling-v3.0-std/image-to-video. Each variant has its own playground page and pricing.

Can I use Kling 3.0 outputs commercially?

Commercial usage rights follow the Kuaishou model license. Most Kuaishou models permit commercial output use; see each model's playground page for the specific license summary, and WaveSpeedAI's Terms of Service for platform-level conditions.

Why use Kling 3.0 on WaveSpeedAI instead of going direct?

One API key and one billing account cover Kling 3.0 and 1,000+ other AI models from other providers. No per-vendor SDK setup, no separate rate-limit envelopes, no integration code rewritten for each vendor. Pricing is typically at parity with or below Kuaishou's direct API.

About Kuaishou

The team behind Kling 3.0 and the broader Kuaishou model lineup on WaveSpeedAI.

Kuaishou is a major Chinese short-video platform and the team behind the Kling family of video generation models. Kling 3.0 ships Standard, Pro, and 4K tiers with native audio synthesis (a sound parameter on every variant), plus a dedicated motion-control endpoint that transfers motion from a reference video to animate a still character image.

Related model APIs on WaveSpeedAI

Other AI APIs from Kuaishou and the rest of the video model lineup — one API key, one billing account.

Kling 3.0 Motion Control API

Kuaishou

Kuaishou Kling 3.0 Motion Control — transfers motion from a reference video to animate a still character image. Upload a character image plus a motion clip (dance, action, gesture); the model extracts the movement to generate smooth, realistic video. Standard and Pro tiers.

Kling Omni O3 API

Kuaishou

Kuaishou Kling Omni Video O3 — advanced unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Standard, Pro, and 4K tiers for text-to-video, image-to-video, reference-to-video, and conversational video-edit.

Seedance 2.0 API

ByteDance

ByteDance Seedance 2.0 — Hollywood-grade cinematic video with native audio-visual synchronization, director-level camera and lighting control, and exceptional motion stability. Built on Seed's unified multimodal architecture.

Seedance 1.5 Pro API

ByteDance

ByteDance Seedance 1.5 Pro — cinematic, live-action-leaning clips with strong prompt adherence, expressive motion, and stable aesthetics. 4-12s duration with Smart Duration, multiple aspect ratios, reproducible generation via seeds.

Veo 3.1 API

Google

Google Veo 3.1 — text-to-video with synchronized native audio at 1080p. Three tiers (Standard, Fast, Lite) with text-to-video, image-to-video, reference-to-video, and video-extend, plus start-end-to-video on the Lite tier.

Wan 2.7 API

Alibaba

Alibaba WAN 2.7 — coherent cinematic video with crisp detail, stable motion, and strong instruction-following. Separate endpoints for text-to-video, image-to-video, reference-to-video, video-edit, video-extend, plus image-edit and text-to-image variants in the same family.

Start building with Kling 3.0 on WaveSpeedAI

Free starter credits on signup. One API key across 1,000+ AI models from Kuaishou and every other provider.
