build-small-hackathon
/

facade-of-jade-qwen3-4b-lora

@@ -1,62 +1,25 @@
----
-base_model: Qwen/Qwen3-4B-Instruct-2507
-library_name: peft
-model_name: facade-of-jade-qwen3-4b-lora
-tags:
-- base_model:adapter:Qwen/Qwen3-4B-Instruct-2507
-- lora
-- sft
-- transformers
-- trl
-licence: license
-pipeline_tag: text-generation
----
-# Model Card for facade-of-jade-qwen3-4b-lora
-This model is a fine-tuned version of [Qwen/Qwen3-4B-Instruct-2507](https://huggingface.co/Qwen/Qwen3-4B-Instruct-2507).
-It has been trained using [TRL](https://github.com/huggingface/trl).
-## Quick start
-```python
-from transformers import pipeline
-question = "If you had a time machine, but could only go to the past or the future once and never return, which would you choose and why?"
-generator = pipeline("text-generation", model="None", device="cuda")
-output = generator([{"role": "user", "content": question}], max_new_tokens=128, return_full_text=False)[0]
-print(output["generated_text"])
-```
-## Training procedure
-This model was trained with SFT.
-### Framework versions
-- PEFT 0.19.1
-- TRL: 1.5.1
-- Transformers: 5.10.2
-- Pytorch: 2.12.0
-- Datasets: 5.0.0
-- Tokenizers: 0.22.2
-## Citations
-Cite TRL as:
-```bibtex
-@software{vonwerra2020trl,
-  title   = {{TRL: Transformers Reinforcement Learning}},
-  author  = {von Werra, Leandro and Belkada, Younes and Tunstall, Lewis and Beeching, Edward and Thrush, Tristan and Lambert, Nathan and Huang, Shengyi and Rasul, Kashif and Gallouédec, Quentin},
-  license = {Apache-2.0},
-  url     = {https://github.com/huggingface/trl},
-  year    = {2020}
-}
-```

+---
+base_model: Qwen/Qwen3-4B-Instruct-2507
+library_name: peft
+tags:
+- lora
+- qwen3
+- build-small-hackathon
+- facade-of-jade
+- modal
+---
+# Facade of Jade Qwen3-4B LoRA
+LoRA adapter trained for **Facade of Jade**, a Build Small Hackathon interactive wuxia drama demo.
+- Base model: `Qwen/Qwen3-4B-Instruct-2507`
+- Training records: 50
+- Epochs: 3
+- Final train loss: `2.969015`
+- Adapter size reported by Modal runner: `483.63 MB`
+- Modal run evidence: https://modal.com/apps/t-abdullah-rashid/main/ap-W54lCMfJu4eu3UCVQvVpQK
+- Source repo: https://github.com/tuancookiez-hub/facade-of-jade
+- Live Space: https://build-small-hackathon-facade-of-jade.hf.space
+This adapter was produced by `train_lora_modal.py` on Modal A100-80GB and saved from Modal volume `facade-of-jade-lora-out`.