Seedance 2.0 15% OFF | Create in Video Generator →
Google Models

Google Models

Google's cutting-edge AI models deliver high-performance image and video models

Google's cutting-edge AI models deliver high-performance image and video models

All models

15 models
google/nano-banana/edit
image-to-image

google/nano-banana/edit

Nano-Banana is an advanced image generation and editing model that produces photorealistic or stylized visuals and performs precise inpainting, outpainting, and background replacement. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

google/nano-banana/text-to-image
text-to-image

google/nano-banana/text-to-image

Google Nano Banana is a cutting-edge text-to-image model that generates images from natural language prompts. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

google/imagen4
text-to-image

google/imagen4

Google's Imagen 4 is the flagship text-to-image model for generating images from text prompts with strong fidelity and creative control. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

google/imagen4-ultra
text-to-image

google/imagen4-ultra

Imagen4 Ultra is Google's highest-quality text-to-image model, generating high-fidelity images from simple text prompts. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

google/imagen4-fast
text-to-image

google/imagen4-fast

Google Imagen4 Fast is the fast variant of Google's Imagen 4 flagship text-to-image model for high-quality image generation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

google/imagen3
text-to-image

google/imagen3

Imagen3 is Google's highest-quality text-to-image model, generating highly detailed, beautifully lit and photoreal images from text prompts. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

google/imagen3-fast
text-to-image

google/imagen3-fast

Imagen3 Fast is Google's top text-to-image model, creating richly detailed, beautifully lit images. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

google/veo3
text-to-video

google/veo3

Google Veo3 is Google's flagship text-to-video model with built-in audio, producing synchronized video and sound from text prompts. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

google/veo3-fast
text-to-video

google/veo3-fast

Google Veo 3 Fast creates text-to-video with synchronized audio, delivering faster, more cost-effective results than standard Veo 3; commercial use allowed and pricing starts at $0.25/second. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

google/veo3-fast/image-to-video
image-to-video

google/veo3-fast/image-to-video

Google Veo3 Fast provides faster, more cost-effective Image-to-Video generation vs Veo 3, with commercial use allowed and $0.25/sec pricing. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

google/veo3/image-to-video
image-to-video

google/veo3/image-to-video

Google Veo 3 is Google's flagship image-to-video model that creates audio-enabled videos from images. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

google/veo2
text-to-video

google/veo2

Google Veo2 creates high-quality image-to-video outputs with realistic motion and extensive camera controls for customizable styles. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

google/veo2/image-to-video
image-to-video

google/veo2/image-to-video

Google Veo2 Image-to-Video creates high-quality videos with realistic motion, varied styles, and precise camera controls for cinematic results. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

google/gemini-2.5-flash-image/text-to-image
text-to-image

google/gemini-2.5-flash-image/text-to-image

Google Gemini 2.5 Flash Image offers advanced text-to-image generation and image editing with creative controls for quality images. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

google/gemini-2.5-flash-image/edit
image-to-image

google/gemini-2.5-flash-image/edit

Nano Banana (Gemini 2.5 Flash Image) offers image-to-image generation and precise editing with deep reasoning for improved accuracy. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Google Models

Google AI Models

Google Cloud's Vertex AI platform offers a comprehensive suite of state-of-the-art AI models for image and video generation. These models represent the cutting edge of generative AI technology, combining high performance with enterprise-grade reliability.

Imagen Series - Text-to-Image Generation

Imagen 3

The latest generation of Google's high-quality image generation model, Imagen 3 excels at creating detailed, photorealistic images with rich lighting and artistic beauty. Key features include:


  • High-Quality Generation: Produces detailed, photorealistic images with rich lighting and artistic elements
  • Safety Features: Advanced content filtering and safety controls
  • Watermarking: Optional invisible watermarking for image attribution
  • Multi-Language Support: Supports 7 languages including English, Chinese, Hindi, Japanese, Korean, Portuguese, and Spanish

Imagen 4 (Preview)

Google's next-generation flagship image generation model, offering enhanced capabilities and commercial usage rights. Features include


  • Enhanced Quality: Further improved image quality and detail
  • Advanced Controls: Better prompt understanding and artistic control
  • Commercial License: Suitable for commercial applications
  • Enterprise Integration: Seamless integration with Google Cloud services

Veo Series - Text-to-Video Generation

Veo 3


  • High Resolution: Supports both 720p and 1080p output
  • Audio Generation: Optional audio generation capability
  • Prompt Enhancement**: Advanced prompt expansion for better results

Veo 3 Fast

An optimized version of Veo 3 designed for rapid video generation:


  • Quick Generation: Faster processing times
  • Resource Efficient: Optimized resource usage
  • Quality Balance: Maintains good quality while prioritizing speed
  • Same Format Support: Supports all standard resolutions and aspect ratios

Google Models API — pricing & performance

Run any model in the Google Models collection through a single REST API. Pay per generation — no subscriptions, no minimums — with industry-leading latency on a 99.9% uptime infrastructure.

Why run Google Models on WaveSpeedAI

Transparent pricing

Per-call pricing for every Google Models model. The price is listed on each model page — no platform fees on top.

Optimized for low latency

Most Google Models image models complete in under 2 seconds. Video and 3D models run several times faster than self-hosted alternatives.

99.9% uptime

Multi-region failover and automatic retries keep your production traffic online — even during provider outages.

Frequently asked questions

How much does the Google Models API cost?+

Each model has its own per-call price listed on the model page. We bill per successful generation, with no subscription fees or minimums.

How fast are Google Models models on WaveSpeedAI?+

Image models in this collection typically complete in under 2 seconds. Video and 3D models depend on duration and resolution but are usually several times faster than self-hosted runs.

Can I try the API without a credit card?+

Yes — every account gets $1 in free credits on signup, enough to try most Google Models models without a credit card.

Are there rate limits?+

Standard accounts have generous concurrent-job limits. Enterprise plans offer custom RPM, higher concurrency, and dedicated capacity — contact sales for details.