neuphonic/neutts-air
Text-to-Speech β’ 0.7B β’ Updated β’ 14k β’ 873
State-of-the-art target speech extractor
Extreme Super-Resolution via Scale Autoregression
Explore Direct3DβS2 gigascale 3D generation via embedded demo
Watch and experiment with realtime AIβs with visuals
Voice Activity Detection using MarbleNet model
Filter multilingual data for high-quality language models
Transcribe speech and highlight emphasized words