vision-language
Idle
Your request will cost $0.01 per run.
For $1 you can run this model approximately 100 times.
Moondream3 Query is a specialized vision-language model for asking natural language questions about images and receiving intelligent answers.
{
"image": "https://example.com/photo.jpg",
"prompt": "What is the person in the image doing?"
}
{
"image": "https://example.com/photo.jpg",
"prompt": "What emotions are visible in this scene?",
"reasoning": true
}
Fixed price per request. Contact WaveSpeed for volume discounts.