speech-to-text
Idle
Your request will cost $0.001 per run.
For $1 you can run this model approximately 1000 times.
WaveSpeed's Whisper deployment delivers production-ready speech recognition built on the large-v3-turbo checkpoint. Upload audio (MP3, WAV, FLAC) and receive accurate transcripts with automatic language detection.
Example output:
{
"outputs": {
"text": "Hello everyone, welcome to the show."
}
}
Usage is billed per request based on audio duration. Contact the WaveSpeed team for volume discounts and custom SLAs.