Add library_name and links to paper and GitHub

#1
opened by nielsr (HF Staff)
Files changed (1)
  1. README.md +11 -6
README.md CHANGED
@@ -1,8 +1,11 @@
  ---
+ base_model: Qwen/Qwen3-1.7B
  language:
  - tr
  - en
  license: apache-2.0
+ pipeline_tag: text-generation
+ library_name: transformers
  tags:
  - text-generation
  - turkish
@@ -14,14 +17,18 @@ tags:
  - continual-pretraining
  - TRUBA
  - MN5
- base_model: Qwen/Qwen3-1.7B
- pipeline_tag: text-generation
  ---

  # Mecellem-Qwen3-1.7B-TR

  [![License](https://img.shields.io/badge/License-Apache%202.0-blue.svg)](https://opensource.org/licenses/Apache-2.0)

+ Mecellem-Qwen3-1.7B-TR is a Turkish legal language model presented in [Mecellem Models: Turkish Models Trained from Scratch and Continually Pre-trained for the Legal Domain](https://huggingface.co/papers/2601.16018).
+
+ **Resources:**
+ - **Code:** [GitHub Repository](https://github.com/newmindai/mecellem-models)
+ - **Paper:** [arXiv:2601.16018](https://arxiv.org/abs/2601.16018)
+
  ## Model Description

  Mecellem-Qwen3-1.7B-TR is a Turkish legal language model adapted through Continual Pre-training (CPT) on Turkish legal and official texts. The model is based on Qwen3-1.7B decoder architecture (1.7B parameters) and trained using a four-phase curriculum learning strategy specifically designed to account for Turkish linguistic complexity. The CPT process progressively transitions from general-purpose texts to domain-specific legal content, achieving 36.2% perplexity reduction on Turkish legal text compared to the base Qwen3-1.7B model.
@@ -237,7 +244,7 @@ If you use this model, please cite our paper:
  ```bibtex
  @article{mecellem2026,
  title={Mecellem Models: Turkish Models Trained from Scratch and Continually Pre-trained for the Legal Domain},
- author={Uğur, Özgür and Göksu, Mahmut and Çimen, Mahmut and Yılmaz, Musa and Şavirdi, Esra and Demir, Alp Talha and Güllüce, Rumeysa and Çetin, İclal and Sağbaş, Ömer Can},
+ author={Uğur, Özgür and Göksu, Mahmut and Çimen, Mahmut and Yılmaz, Musa and Şavirdi, Esra and Demir, Alp Talha and Güllüce, Rumeysa and İclal Çetin and Sağbaş, Ömer Can},
  journal={arXiv preprint arXiv:2601.16018},
  year={2026},
  month={January},
@@ -257,6 +264,4 @@ If you use this model, please cite our paper:
  journal={arXiv preprint arXiv:2409.00000},
  year={2024}
  }
- ```
-
- <!-- Updated: 2026-01-15 09:38:29 -->
+ ```
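
The model description quotes a 36.2% perplexity reduction relative to the base Qwen3-1.7B model. The README excerpt does not say how that figure was measured, so the following is only a minimal sketch that assumes the standard definition (perplexity as the exponential of the mean per-token negative log-likelihood) and a relative reduction. The function names and the log-probability values are hypothetical and are not taken from the paper.

```python
import math


def perplexity(token_log_probs):
    """Perplexity = exp(mean negative log-likelihood per token), natural log."""
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    mean_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(mean_nll)


def relative_reduction(base_ppl, adapted_ppl):
    """Fractional drop in perplexity relative to the base model."""
    return (base_ppl - adapted_ppl) / base_ppl


# Hypothetical per-token log-probabilities on the same text.
# These are illustrative values, NOT measurements from the paper.
base_log_probs = [-2.0, -3.0, -2.5, -2.5]
adapted_log_probs = [-1.5, -2.5, -2.0, -2.0]

base_ppl = perplexity(base_log_probs)
adapted_ppl = perplexity(adapted_log_probs)
reduction = relative_reduction(base_ppl, adapted_ppl)

print(f"base={base_ppl:.2f} adapted={adapted_ppl:.2f} reduction={reduction:.1%}")
```

Under this reading, a 36.2% reduction would mean the adapted model's perplexity is 0.638 times the base model's on the same evaluation text. Both models must be scored on identical text and with a comparable tokenization for the ratio to be meaningful.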