Moondream3 Point - Point Localization

Point Localization: Find exact positions of objects
Coordinate Points: Get precise x, y coordinates
Multiple Objects: Locate multiple instances of the same object
Natural Language: Specify objects using plain English

Moondream3 Point is a specialized vision-language model for locating specific objects in images and returning their coordinate points.

Features

{
  "image": "https://example.com/photo.jpg",
  "prompt": "person"
}

{
  "image": "https://example.com/photo.jpg",
  "prompt": "car"
}

{
  "image": "https://example.com/photo.jpg",
  "prompt": "laptop"
}

Returns coordinate points in the format: [x, y] where x and y are the pixel coordinates of the object's center or key point.

Fixed price per request. Contact WaveSpeed for volume discounts.