tunacookie commited on
Commit
25b665c
·
verified ·
1 Parent(s): ff015ff

Upload Facade of Jade LoRA adapter trained on Modal A100

Browse files
Files changed (1) hide show
  1. README.md +25 -62
README.md CHANGED
@@ -1,62 +1,25 @@
1
- ---
2
- base_model: Qwen/Qwen3-4B-Instruct-2507
3
- library_name: peft
4
- model_name: facade-of-jade-qwen3-4b-lora
5
- tags:
6
- - base_model:adapter:Qwen/Qwen3-4B-Instruct-2507
7
- - lora
8
- - sft
9
- - transformers
10
- - trl
11
- licence: license
12
- pipeline_tag: text-generation
13
- ---
14
-
15
- # Model Card for facade-of-jade-qwen3-4b-lora
16
-
17
- This model is a fine-tuned version of [Qwen/Qwen3-4B-Instruct-2507](https://huggingface.co/Qwen/Qwen3-4B-Instruct-2507).
18
- It has been trained using [TRL](https://github.com/huggingface/trl).
19
-
20
- ## Quick start
21
-
22
- ```python
23
- from transformers import pipeline
24
-
25
- question = "If you had a time machine, but could only go to the past or the future once and never return, which would you choose and why?"
26
- generator = pipeline("text-generation", model="None", device="cuda")
27
- output = generator([{"role": "user", "content": question}], max_new_tokens=128, return_full_text=False)[0]
28
- print(output["generated_text"])
29
- ```
30
-
31
- ## Training procedure
32
-
33
-
34
-
35
-
36
-
37
- This model was trained with SFT.
38
-
39
- ### Framework versions
40
-
41
- - PEFT 0.19.1
42
- - TRL: 1.5.1
43
- - Transformers: 5.10.2
44
- - Pytorch: 2.12.0
45
- - Datasets: 5.0.0
46
- - Tokenizers: 0.22.2
47
-
48
- ## Citations
49
-
50
-
51
-
52
- Cite TRL as:
53
-
54
- ```bibtex
55
- @software{vonwerra2020trl,
56
- title = {{TRL: Transformers Reinforcement Learning}},
57
- author = {von Werra, Leandro and Belkada, Younes and Tunstall, Lewis and Beeching, Edward and Thrush, Tristan and Lambert, Nathan and Huang, Shengyi and Rasul, Kashif and Gallouédec, Quentin},
58
- license = {Apache-2.0},
59
- url = {https://github.com/huggingface/trl},
60
- year = {2020}
61
- }
62
- ```
 
1
+ ---
2
+ base_model: Qwen/Qwen3-4B-Instruct-2507
3
+ library_name: peft
4
+ tags:
5
+ - lora
6
+ - qwen3
7
+ - build-small-hackathon
8
+ - facade-of-jade
9
+ - modal
10
+ ---
11
+
12
+ # Facade of Jade Qwen3-4B LoRA
13
+
14
+ LoRA adapter trained for **Facade of Jade**, a Build Small Hackathon interactive wuxia drama demo.
15
+
16
+ - Base model: `Qwen/Qwen3-4B-Instruct-2507`
17
+ - Training records: 50
18
+ - Epochs: 3
19
+ - Final train loss: `2.969015`
20
+ - Adapter size reported by Modal runner: `483.63 MB`
21
+ - Modal run evidence: https://modal.com/apps/t-abdullah-rashid/main/ap-W54lCMfJu4eu3UCVQvVpQK
22
+ - Source repo: https://github.com/tuancookiez-hub/facade-of-jade
23
+ - Live Space: https://build-small-hackathon-facade-of-jade.hf.space
24
+
25
+ This adapter was produced by `train_lora_modal.py` on Modal A100-80GB and saved from Modal volume `facade-of-jade-lora-out`.