microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 350k • 1.6k
jq/whisper-large-v3-salt-plus-xog-myx-kin-swa Automatic Speech Recognition • 2B • Updated Feb 4, 2025 • 5 • 1
Running on Zero Agents Featured 965 MMAudio — generating synchronized audio from video/text 🔊 965 Generate synchronized audio for videos from text prompts