Running on CPU Upgrade Agents 28 FFASR Leaderboard π 28 Far-Field ASR β clean / noisy / reverberant benchmark
Paused 240 Omnilingual ASR Media Transcription π 240 Transcribe audio/video files into text instantly
Running on Zero Agents Featured 100 CapSpeech TTS π§’ 100 Stylized TTS β design voice, accent, and emotion your way
openai/whisper-large-v3 Automatic Speech Recognition β’ 2B β’ Updated Aug 12, 2024 β’ 6.05M β’ β’ 5.83k