AEmotionStudio commited on
Commit
e3f1910
·
verified ·
1 Parent(s): cde2ca8

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +39 -0
README.md ADDED
@@ -0,0 +1,39 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ - zh
6
+ tags:
7
+ - singing-voice-synthesis
8
+ - singing-voice-conversion
9
+ - svs
10
+ - svc
11
+ - zero-shot
12
+ - text-to-audio
13
+ - music
14
+ pipeline_tag: text-to-speech
15
+ ---
16
+
17
+ # SoulX-Singer Models (Safetensors Mirror)
18
+
19
+ Safetensors conversion of [Soul-AILab/SoulX-Singer](https://huggingface.co/Soul-AILab/SoulX-Singer) weights for use in the [MAESTRO AI Workstation](https://github.com/AEmotionStudio/Maestraea).
20
+
21
+ ## Models
22
+
23
+ | Path | Size | Description |
24
+ |------|------|-------------|
25
+ | svs/model.safetensors | ~2.82 GB | Singing Voice Synthesis (lyrics+MIDI → singing) |
26
+ | svc/model.safetensors | ~2.79 GB | Singing Voice Conversion (audio-to-audio) |
27
+ | config.yaml | 579 B | Model architecture configuration |
28
+ | phone_set.json | ~30 KB | Phoneme mapping for SVS |
29
+
30
+ ## Architecture
31
+
32
+ - Flow-matching based (F5-TTS foundation)
33
+ - 22-layer transformer with 1024 hidden size, 16 heads
34
+ - 128-dim mel spectrogram, 24kHz output
35
+ - Trained on 42,000+ hours of aligned vocals (Mandarin, English, Cantonese)
36
+
37
+ ## License
38
+
39
+ Apache 2.0