InfiniteTalk is an audio-driven conversational AI video generation model. It creates talking or singing videos from a single image and an audio input. Our endpoint starts at $0.15 per 5 seconds of generated video (480p) and supports a maximum generation length of 120 seconds.
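The pricing above can be turned into a rough cost estimate. The following is a minimal sketch, not an official billing formula: the function name `estimate_cost_usd` is hypothetical, it covers only the 480p rate quoted above, and it assumes that a partial 5-second block is billed as a full block, which the description does not state.

```python
import math

# Figures taken from the description above (480p rate).
PRICE_CENTS_PER_BLOCK = 15   # $0.15 per block
BLOCK_SECONDS = 5            # one block = 5 seconds of generated video
MAX_SECONDS = 120            # maximum generation length


def estimate_cost_usd(duration_seconds: float) -> float:
    """Estimate the 480p generation cost in USD for a clip of the given length.

    Assumption (not confirmed by the source): partial 5-second blocks
    are rounded up and billed as whole blocks.
    """
    if duration_seconds <= 0:
        raise ValueError("duration must be positive")
    if duration_seconds > MAX_SECONDS:
        raise ValueError(f"duration exceeds the {MAX_SECONDS}-second maximum")

    blocks = math.ceil(duration_seconds / BLOCK_SECONDS)
    # Work in integer cents to avoid accumulating floating-point error.
    return blocks * PRICE_CENTS_PER_BLOCK / 100


if __name__ == "__main__":
    for seconds in (5, 12, 120):
        print(f"{seconds:>3} s -> ${estimate_cost_usd(seconds):.2f}")
```

Under these assumptions, a 12-second clip rounds up to three blocks ($0.45), and a clip at the 120-second maximum comes to 24 blocks ($3.60).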
infinitetalk: $0.15
wan-2.2/speech-to-video: $0.15
veo3-fast/image-to-video: $1.8
veo3/image-to-video: $5.5
multitalk: $0.15
lipsync-2-pro: $0.05
lipsync-2: $0.05
lipsync-1.9.0-beta: $0.05
song-generation: $0.05
avatar-omni-human: $0.12
kling-lipsync/audio-to-video: $0.14
kling-lipsync/text-to-video: $0.14
lipsync/audio-to-video: $0.14