Vidu Contest
WaveSpeed.ai
Home/Explore/Best Image Tool/openai/gpt-image-1/text-to-image
text-to-image

text-to-image

OpenAI GPT Image 1

openai/gpt-image-1/text-to-image

OpenAI GPT Image-1 generates images from text prompts from OpenAI's latest text-to-image model, ideal for creating visual assets. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Input
If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API.
If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.

Idle

A small robot exploring an abandoned city, stylized cartoon look, bright and soft color palette, charming illustration

Your request will cost $0.042 per run.

For $1 you can run this model approximately 23 times.

One more thing:

ExamplesView all

A young woman with curly hair sitting at a café table, wearing a beige trench coat, soft morning light on her face, shallow depth of field, realistic skin texture, candid photography style
A small robot exploring an abandoned city, stylized cartoon look, bright and soft color palette, charming illustration
Make an image of a birthday card for my mom's 50th birthday, include all the gifts that I got her illustrated as a single black ink drawing. add a headline drawn in an elegant black script: Happy 50th Birthday, Mom!
Create a professional and visually engaging magazine cover for a lifestyle magazine called "Urban Pulse." Include these featured article headlines clearly: "10 Hidden Cafés You'll Love in NYC" "Minimalist Apartments: Small Spaces, Big Ideas" "Exclusive Interview: Behind the Scenes with Indie Band Echo District" Use contemporary typography, vibrant colors, and include an eye-catching main photograph with a person standing in front of a city scene
Generate an image of a sleek, red sports car with a polished chrome grille and alloy wheels. The car is parked on a sunlit beach with waves gently lapping at the shore and palm trees swaying in the background. The scene has a bright and cheerful tone with warm sunlight casting soft shadows on the car. The image is taken from a slightly elevated angle to capture the car's sleek design and the beach in the background. The image should be in a photorealistic style with high-resolution details.
A cute cartoon fox wearing a tiny wizard hat, sitting on a giant mushroom, colorful whimsical forest background, hand-drawn style, playful illustration
A highly detailed portrait of an elderly man with deep wrinkles, wearing a dark blue coat, sitting in a sunlit library, realistic lighting and textures, photograph style
Starry night over a modern city, in the style of Van Gogh, swirling sky, expressive brush strokes, oil painting texture
Abstract geometric shapes in pastel colors, inspired by Kandinsky, modern art painting, textured brush strokes
A teenage skateboarder performing a trick in an urban skatepark, casual streetwear, sunlight casting dynamic shadows, high-detail, action photography style

README

OpenAI GPT Image 1

GPT Image 1 is OpenAI’s latest multimodal image generation model, built to understand both text and image inputs and produce visually coherent, high-quality image outputs. It combines the reasoning power of GPT-4-Turbo with DALL·E-class visual synthesis—allowing for creative, controllable, and context-aware generation across illustration, photography, design, and visualization tasks.

🧠 Key Features

  • Multimodal Understanding Accepts both text and image inputs, enabling style transfer, editing, or contextual composition.

  • Flexible Styles Produces photorealistic renders, stylized artwork, concept art, infographics, and 3D-style illustrations.

  • High Visual Fidelity Maintains object relationships, lighting consistency, and color balance with strong adherence to prompts.

  • Accurate Text Rendering Capable of generating clean typography—ideal for posters, memes, comics, and branding visuals.

  • Knowledge-Grounded Creativity Uses GPT-4’s world knowledge to generate factual, contextually appropriate visuals.

⚙️ Parameters

  • Prompt: Required text description of the desired image.
  • Size: Supports 1024×1024, 1024×1536, and 1536×1024.
  • Quality: Choose between low, medium, and high.

💰 Pricing

ResolutionLow ($)Medium ($)High ($)
1024 × 10240.0110.0420.167
1024 × 1536 / 1536 × 10240.0160.0630.250

💡 Tips for Best Results

  1. Write prompts that specify style, subject, composition, and lighting.

    Example: “A small robot exploring an abandoned city, cartoon style, bright colors.”

  2. Use high quality for detailed or large-format outputs.

  3. Prefer landscape (1536×1024) for cinematic or wide compositions, and portrait (1024×1536) for characters or vertical art.

📝 Notes

  • All generated content follows OpenAI’s safety and content policies.
  • If a prompt triggers moderation, rephrase or simplify it.
  • This model supports multi-image input via API, enabling creative editing and composition workflows.
  • For performance and latency-sensitive cases, use medium quality as the balanced default.