Home/Explore/wavespeed-ai/moondream3-preview/point

vision-language

wavespeed-ai/moondream3-preview/point

Locate specific objects in images and get their coordinate points. Moondream3 Point provides precise point localization for computer vision applications.

Hint: You can drag and drop a file or click to upload

If set to true, the function will wait for the result before returning the response. This property is only available through the API.

Idle

Your request will cost $0.001 per run.

For $1 you can run this model approximately 1000 times.

README

Moondream3 Point - Point Localization

Moondream3 Point is a specialized vision-language model for locating specific objects in images and returning their coordinate points.

Features

  • Point Localization: Find exact positions of objects
  • Coordinate Points: Get precise x, y coordinates
  • Multiple Objects: Locate multiple instances of the same object
  • Natural Language: Specify objects using plain English

Example Usage

Locate Person

{
  "image": "https://example.com/photo.jpg",
  "prompt": "person"
}

Locate Object

{
  "image": "https://example.com/photo.jpg",
  "prompt": "car"
}

Locate Specific Item

{
  "image": "https://example.com/photo.jpg",
  "prompt": "laptop"
}

Output Format

Returns coordinate points in the format: [x, y] where x and y are the pixel coordinates of the object's center or key point.

Best Practices

  • Use specific object names for better accuracy
  • Supported formats: JPEG, PNG, WebP
  • Max image size: 10MB

Pricing

Fixed price per request. Contact WaveSpeed for volume discounts.