Voz (TTS + STT) - a mangaba-ai Collection

mangaba-ai 's Collections

Edicao de Imagem

Codigo / Coding LLMs

Embeddings & RAG

Visao / Multimodal (VLM)

Voz (TTS + STT)

Voz (TTS + STT)

updated 10 days ago

Melhores modelos open-source de voz: sintese (TTS) e ASR (STT). · Curadoria Mangaba AI 🥭

Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice

Text-to-Speech • 2B • Updated Jan 29 • 2.14M • 1.63k
openbmb/VoxCPM2

Text-to-Speech • 2B • Updated Apr 16 • 529k • 1.42k
Supertone/supertonic-3

Text-to-Speech • Updated May 18 • 53.9k • 851
Qwen/Qwen3-ASR-1.7B

Automatic Speech Recognition • 2B • Updated Jan 30 • 1.62M • 898
nvidia/parakeet-tdt-0.6b-v3

Automatic Speech Recognition • 0.6B • Updated May 20 • 172k • • 941
CohereLabs/cohere-transcribe-03-2026

Automatic Speech Recognition • 2B • Updated 14 days ago • 736k • 1.01k
hexgrad/Kokoro-82M

Text-to-Speech • Updated Apr 10, 2025 • 16.2M • • 6.38k
k2-fsa/OmniVoice

Text-to-Speech • 0.6B • Updated May 7 • 1.26M • 1.07k
fishaudio/s2-pro

Text-to-Speech • 5B • Updated Mar 11 • 369k • 1.05k