Hailuo Video Models

MiniMax Hailuo 02 for professional video generation, plus speech synthesis models.

11 models
minimax/video-01
image-to-video

MiniMax Video-01 is a text-to-video model offering high compression, strong text responsiveness, cinematic styles, and native HD output. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

minimax/video-02
image-to-video

Hailuo 02 is an AI video generation model fine-tuned for ultra-clear 1080P output and for handling complex physics-driven scenes. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

minimax/hailuo-02/standard
image-to-video

Hailuo 02 is an AI video generation model delivering 768P output with fast responsiveness and strong handling of complex physics scenes. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

minimax/hailuo-02/pro
image-to-video

MiniMax Hailuo 02 Pro produces ultra-clear 1080P AI videos with responsive, physics-aware rendering of complex scenes. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

minimax/hailuo-02/t2v-standard
text-to-video

Hailuo 02 is a text-to-video model from MiniMax, fine-tuned to produce 768P videos responsively, even for complex physics-driven scenes. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

minimax/hailuo-02/i2v-standard
image-to-video

Hailuo 02 by Hailuo AI is an image-to-video model delivering ultra-clear 768P video with responsive handling of physics-driven scenes. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

minimax/hailuo-02/t2v-pro
text-to-video

Hailuo 02 T2V-Pro is a text-to-video model fine-tuned for ultra-clear 1080P video and responsive handling of physics-driven scenes. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

minimax/hailuo-02/i2v-pro
image-to-video

MiniMax Hailuo 02 Pro is an image-to-video model tuned for clear 1080P output and responsive handling of complex physics-driven scenes. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

minimax/speech-02-hd
text-to-audio

MiniMax Speech 02 HD is MiniMax's high-definition text-to-speech model delivering clear HD voices. Cost: $0.05 per 1,000 characters. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

minimax/speech-02-turbo
text-to-audio

MiniMax Speech-02 Turbo is a high-definition text-to-speech model delivering natural voice output. Cost: $0.03 per 1,000 characters. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

minimax/voice-clone
audio-to-audio

MiniMax Voice Clone creates high-quality voice clones from short reference clips, closely matching tone, accent, and speaking style. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.
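
As a rough illustration of the per-character speech pricing quoted above ($0.05 per 1,000 characters for Speech-02-HD and $0.03 for Speech-02-Turbo), the following Python sketch estimates the cost of a synthesis job. It assumes billing is strictly proportional to character count, with no rounding or minimum charge, and that characters are counted the way Python's len() counts them. Neither assumption is confirmed on this page, and the function name estimate_speech_cost is illustrative only.

```python
from decimal import Decimal

# Per-1,000-character prices quoted in the model descriptions above.
PRICE_PER_1K_CHARS = {
    "minimax/speech-02-hd": Decimal("0.05"),
    "minimax/speech-02-turbo": Decimal("0.03"),
}


def estimate_speech_cost(model_id: str, text: str) -> Decimal:
    """Estimate the USD cost of synthesizing `text`.

    Assumption: cost scales linearly with len(text); the actual
    billing rules (rounding, minimums, character counting) may differ.
    """
    rate = PRICE_PER_1K_CHARS[model_id]
    return rate * len(text) / Decimal(1000)
```

Under those assumptions, a 2,500-character script would cost about $0.125 with Speech-02-HD and $0.075 with Speech-02-Turbo.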

MiniMax Hailuo 02 and Speech Models

Hailuo 02 is MiniMax's next-generation AI video model, ranked #2 globally. It offers a 2.5x efficiency improvement, an 85% response rate on complex instructions, and extreme physics simulation capabilities.

The model demonstrates exceptional cost-effectiveness while maintaining superior output quality, making it ideal for both professional and enterprise applications.


Video Generation Versions

Text-to-Video Pro (t2v-pro)

  • Professional-grade video generation
  • Advanced text prompt processing
  • Extreme physics simulation

Image-to-Video Pro (i2v-pro)

  • Superior image-to-video transformation
  • Professional-grade output
  • High-fidelity conversion
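
The Hailuo 02 variants in the catalog differ by input type (text or image) and by tier (Standard at 768P, Pro at 1080P). A minimal Python sketch of choosing a model ID from those two properties might look like the following. The model IDs and resolutions are taken from the listing on this page; the helper name pick_model and the lookup-table layout are hypothetical.

```python
# Model IDs and output resolutions as listed in the catalog on this page.
HAILUO_02_MODELS = {
    ("text-to-video", "standard"): ("minimax/hailuo-02/t2v-standard", "768P"),
    ("image-to-video", "standard"): ("minimax/hailuo-02/i2v-standard", "768P"),
    ("text-to-video", "pro"): ("minimax/hailuo-02/t2v-pro", "1080P"),
    ("image-to-video", "pro"): ("minimax/hailuo-02/i2v-pro", "1080P"),
}


def pick_model(task: str, tier: str = "standard") -> str:
    """Return the model ID for a task ("text-to-video" or "image-to-video")
    and a tier ("standard" or "pro"). Hypothetical helper, not a platform API."""
    try:
        model_id, _resolution = HAILUO_02_MODELS[(task, tier)]
    except KeyError:
        raise ValueError(f"no Hailuo 02 model for task={task!r}, tier={tier!r}") from None
    return model_id
```

For example, pick_model("image-to-video", "pro") returns "minimax/hailuo-02/i2v-pro".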

Audio and Speech Models

Speech-02-HD

  • High-definition text-to-speech
  • Natural pronunciation and clear articulation
  • Multiple voice options and emotions
  • Adjustable speed, volume, and pitch

Speech-02-Turbo

  • Fast speech generation
  • Efficient processing
  • Optimized for real-time applications

Voice Clone

  • Advanced voice cloning capabilities
  • Custom voice training support
  • High-fidelity voice reproduction

Hailuo Video Models API — pricing & performance

Run any model in the Hailuo Video Models collection through a single REST API. Pay per generation — no subscriptions, no minimums — with industry-leading latency on a 99.9% uptime infrastructure.
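
To make the "single REST API" idea concrete, here is a hedged Python sketch that only builds an HTTP request object and does not send it. This page does not document the endpoint, so the base URL (https://api.example.com/v1), the route layout with the model ID in the path, the bearer-token header, and the "prompt" field are all placeholders. Consult the model page for the actual request format.

```python
import json
import urllib.request

# Placeholder only: the real base URL and route are not given on this page.
BASE_URL = "https://api.example.com/v1"


def build_generation_request(
    model_id: str,
    payload: dict,
    api_key: str,
    base_url: str = BASE_URL,
) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request for one model.

    Assumptions: the model ID forms the URL path and the API key is
    passed as a bearer token. Both are illustrative, not confirmed.
    """
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/{model_id}",
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# The same helper would serve any model in the collection; only the ID changes.
request = build_generation_request(
    "minimax/hailuo-02/t2v-pro",
    {"prompt": "A glass marble rolling down a wooden staircase"},  # hypothetical field
    api_key="YOUR_API_KEY",
)
```

Sending the request (for example with urllib.request.urlopen) is left out deliberately, since the real endpoint and response format are not described here.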

Why run Hailuo Video Models on WaveSpeedAI

Transparent pricing

Every model in the Hailuo Video Models collection is priced per call. The price is listed on each model page, with no platform fees on top.

Optimized for low latency

Most image models in the Hailuo Video Models collection complete in under 2 seconds. Video and 3D models run several times faster than self-hosted alternatives.

99.9% uptime

Multi-region failover and automatic retries keep your production traffic online — even during provider outages.

Frequently asked questions

How much does the Hailuo Video Models API cost?

Each model has its own per-call price listed on the model page. We bill per successful generation, with no subscription fees or minimums.

How fast are the Hailuo Video Models on WaveSpeedAI?

Image models in this collection typically complete in under 2 seconds. Video and 3D models depend on duration and resolution but are usually several times faster than self-hosted runs.

Can I try the API without a credit card?

Yes. Every account gets $1 in free credits on signup, enough to try most models in the Hailuo Video Models collection without a credit card.

Are there rate limits?

Standard accounts have generous concurrent-job limits. Enterprise plans offer custom RPM, higher concurrency, and dedicated capacity — contact sales for details.
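
Because accounts have a concurrent-job limit, a client may want to cap how many jobs it has in flight at once. The sketch below shows one way to do that in Python with a thread pool. The limit of 3 is an arbitrary example value (the actual limit is not stated on this page), and submit_job stands in for whatever function the caller uses to submit a generation job.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Iterable, List, TypeVar

T = TypeVar("T")
R = TypeVar("R")


def run_bounded(
    submit_job: Callable[[T], R],
    jobs: Iterable[T],
    max_concurrent: int = 3,  # example value; the real account limit is not stated here
) -> List[R]:
    """Run submit_job over jobs with at most max_concurrent running at once.

    Results are returned in the same order as the input jobs.
    """
    with ThreadPoolExecutor(max_workers=max_concurrent) as pool:
        return list(pool.map(submit_job, jobs))
```

The pool size is the only knob: setting max_concurrent at or below the account's limit keeps the client from submitting more simultaneous jobs than the plan allows.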