Instructions to use alakxender/mms-tts-div-finetuned-md-m01 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use alakxender/mms-tts-div-finetuned-md-m01 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-to-audio", model="alakxender/mms-tts-div-finetuned-md-m01")# Load model directly from transformers import AutoTokenizer, AutoModelForMultimodalLM tokenizer = AutoTokenizer.from_pretrained("alakxender/mms-tts-div-finetuned-md-m01") model = AutoModelForMultimodalLM.from_pretrained("alakxender/mms-tts-div-finetuned-md-m01") - Notebooks
- Google Colab
- Kaggle
Upload 8 files
Browse files- .gitattributes +4 -0
- outputs/audio/001.wav +3 -0
- outputs/audio/002.wav +0 -0
- outputs/audio/003.wav +3 -0
- outputs/mos_scores.txt +4 -0
- outputs/report.txt +3 -0
- outputs/spectrograms/001.png +3 -0
- outputs/spectrograms/002.png +0 -0
- outputs/spectrograms/003.png +3 -0
.gitattributes
CHANGED
|
@@ -33,3 +33,7 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
+
outputs/audio/001.wav filter=lfs diff=lfs merge=lfs -text
|
| 37 |
+
outputs/audio/003.wav filter=lfs diff=lfs merge=lfs -text
|
| 38 |
+
outputs/spectrograms/001.png filter=lfs diff=lfs merge=lfs -text
|
| 39 |
+
outputs/spectrograms/003.png filter=lfs diff=lfs merge=lfs -text
|
outputs/audio/001.wav
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e1a97fbd0030912904f6d0524d03ca93dd59111437f0d348d0322d7bfb35ea12
|
| 3 |
+
size 121422
|
outputs/audio/002.wav
ADDED
|
Binary file (93.3 kB). View file
|
|
|
outputs/audio/003.wav
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1a872a3e68708e5296c7a686cb945c8fdf1cac9df2bcbf67ee8272829db6fce9
|
| 3 |
+
size 104014
|
outputs/mos_scores.txt
ADDED
|
@@ -0,0 +1,4 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
001.wav 3.468
|
| 2 |
+
002.wav 2.977
|
| 3 |
+
003.wav 3.239
|
| 4 |
+
Avg MOS 3.228
|
outputs/report.txt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
001 އީދު ފާހަގަކުރުމުގައި ހުއްދަ މުނިފޫހިފިލުވުމަކީ ކޮބާ؟ ./outputs/audio/001.wav ./outputs/spectrograms/001.png
|
| 2 |
+
002 އައިންމަތީ ދޫނި މަދުވެއްޖެ، ތޭރަވާ ގެއްލިއްޖެ ./outputs/audio/002.wav ./outputs/spectrograms/002.png
|
| 3 |
+
003 "ނުކުމެވޭ ވަރެއް ނޫން، މަގުތައް ބުޅާ ނަޖިހުން ފުރިފައި" ./outputs/audio/003.wav ./outputs/spectrograms/003.png
|
outputs/spectrograms/001.png
ADDED
|
Git LFS Details
|
outputs/spectrograms/002.png
ADDED
|
outputs/spectrograms/003.png
ADDED
|
Git LFS Details
|