juanquivilla/sotto-cleanup-lfm25-350m
Text Generation
Safetensors
juanquivilla/sotto-transcript-cleanup
English
lfm2
speech-to-text
transcript-cleanup
text-correction
asr-post-processing
LFM
LiquidAI
grpo
full-fine-tune
inverse-text-normalization
conversational
License: mit
8d24c18
sotto-cleanup-lfm25-350m
714 MB
1 contributor
History: 22 commits
juanquivilla
v23+paragraphs: ROUGE-L 0.9506, Filler-Free 90.2%, paragraph rate 91.5% (0% in v22)
8d24c18
verified
3 months ago
.gitattributes
Safe
1.52 kB
initial commit
3 months ago
README.md
Safe
10.1 kB
v23+paragraphs: ROUGE-L 0.9506, Filler-Free 90.2%, paragraph rate 91.5% (0% in v22)
3 months ago
chat_template.jinja
Safe
1.3 kB
Upload folder using huggingface_hub
3 months ago
config.json
Safe
1.31 kB
Full FT: ROUGE-L 0.907 - new record, +1.6 over prompted 2B
3 months ago
generation_config.json
Safe
141 Bytes
v15: ROUGE-L 0.960, 70% exact match - LR 2.5e-5 breakthrough
3 months ago
model.safetensors
709 MB
xet
v23+paragraphs: ROUGE-L 0.9506, Filler-Free 90.2%, paragraph rate 91.5% (0% in v22)
3 months ago
tokenizer.json
Safe
4.73 MB
Upload folder using huggingface_hub
3 months ago
tokenizer_config.json
Safe
519 Bytes
v22+GRPO: ROUGE-L 0.953 val set, 91% filler-free - GRPO works on proper benchmark
3 months ago
training_args.bin
5.71 kB
xet
v15: ROUGE-L 0.960, 70% exact match - LR 2.5e-5 breakthrough
3 months ago