Instructions to use build-small-hackathon/facade-of-jade-qwen3-4b-lora with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use build-small-hackathon/facade-of-jade-qwen3-4b-lora with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3-4B-Instruct-2507") model = PeftModel.from_pretrained(base_model, "build-small-hackathon/facade-of-jade-qwen3-4b-lora") - Notebooks
- Google Colab
- Kaggle
Upload Facade of Jade LoRA adapter trained on Modal A100
Browse files
README.md
CHANGED
|
@@ -1,62 +1,25 @@
|
|
| 1 |
-
---
|
| 2 |
-
base_model: Qwen/Qwen3-4B-Instruct-2507
|
| 3 |
-
library_name: peft
|
| 4 |
-
|
| 5 |
-
|
| 6 |
-
-
|
| 7 |
-
-
|
| 8 |
-
-
|
| 9 |
-
-
|
| 10 |
-
-
|
| 11 |
-
|
| 12 |
-
|
| 13 |
-
|
| 14 |
-
|
| 15 |
-
|
| 16 |
-
|
| 17 |
-
|
| 18 |
-
|
| 19 |
-
|
| 20 |
-
|
| 21 |
-
|
| 22 |
-
|
| 23 |
-
|
| 24 |
-
|
| 25 |
-
|
| 26 |
-
generator = pipeline("text-generation", model="None", device="cuda")
|
| 27 |
-
output = generator([{"role": "user", "content": question}], max_new_tokens=128, return_full_text=False)[0]
|
| 28 |
-
print(output["generated_text"])
|
| 29 |
-
```
|
| 30 |
-
|
| 31 |
-
## Training procedure
|
| 32 |
-
|
| 33 |
-
|
| 34 |
-
|
| 35 |
-
|
| 36 |
-
|
| 37 |
-
This model was trained with SFT.
|
| 38 |
-
|
| 39 |
-
### Framework versions
|
| 40 |
-
|
| 41 |
-
- PEFT 0.19.1
|
| 42 |
-
- TRL: 1.5.1
|
| 43 |
-
- Transformers: 5.10.2
|
| 44 |
-
- Pytorch: 2.12.0
|
| 45 |
-
- Datasets: 5.0.0
|
| 46 |
-
- Tokenizers: 0.22.2
|
| 47 |
-
|
| 48 |
-
## Citations
|
| 49 |
-
|
| 50 |
-
|
| 51 |
-
|
| 52 |
-
Cite TRL as:
|
| 53 |
-
|
| 54 |
-
```bibtex
|
| 55 |
-
@software{vonwerra2020trl,
|
| 56 |
-
title = {{TRL: Transformers Reinforcement Learning}},
|
| 57 |
-
author = {von Werra, Leandro and Belkada, Younes and Tunstall, Lewis and Beeching, Edward and Thrush, Tristan and Lambert, Nathan and Huang, Shengyi and Rasul, Kashif and Gallouédec, Quentin},
|
| 58 |
-
license = {Apache-2.0},
|
| 59 |
-
url = {https://github.com/huggingface/trl},
|
| 60 |
-
year = {2020}
|
| 61 |
-
}
|
| 62 |
-
```
|
|
|
|
| 1 |
+
---
|
| 2 |
+
base_model: Qwen/Qwen3-4B-Instruct-2507
|
| 3 |
+
library_name: peft
|
| 4 |
+
tags:
|
| 5 |
+
- lora
|
| 6 |
+
- qwen3
|
| 7 |
+
- build-small-hackathon
|
| 8 |
+
- facade-of-jade
|
| 9 |
+
- modal
|
| 10 |
+
---
|
| 11 |
+
|
| 12 |
+
# Facade of Jade Qwen3-4B LoRA
|
| 13 |
+
|
| 14 |
+
LoRA adapter trained for **Facade of Jade**, a Build Small Hackathon interactive wuxia drama demo.
|
| 15 |
+
|
| 16 |
+
- Base model: `Qwen/Qwen3-4B-Instruct-2507`
|
| 17 |
+
- Training records: 50
|
| 18 |
+
- Epochs: 3
|
| 19 |
+
- Final train loss: `2.969015`
|
| 20 |
+
- Adapter size reported by Modal runner: `483.63 MB`
|
| 21 |
+
- Modal run evidence: https://modal.com/apps/t-abdullah-rashid/main/ap-W54lCMfJu4eu3UCVQvVpQK
|
| 22 |
+
- Source repo: https://github.com/tuancookiez-hub/facade-of-jade
|
| 23 |
+
- Live Space: https://build-small-hackathon-facade-of-jade.hf.space
|
| 24 |
+
|
| 25 |
+
This adapter was produced by `train_lora_modal.py` on Modal A100-80GB and saved from Modal volume `facade-of-jade-lora-out`.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|