NikitaKlimenko commited on
Commit
87f9335
·
verified ·
1 Parent(s): d9a7566

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +45 -7
README.md CHANGED
@@ -1,12 +1,50 @@
1
  ---
2
  license: apache-2.0
3
  language:
4
- - en
5
- base_model:
6
- - Qwen/Qwen3-4B-Instruct-2507
7
  pipeline_tag: text-generation
 
8
  tags:
9
- - lora
10
- - peft
11
- - qwen3
12
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
  language:
4
+ - en
5
+ base_model: Qwen/Qwen3-4B-Instruct-2507
 
6
  pipeline_tag: text-generation
7
+ library_name: peft
8
  tags:
9
+ - lora
10
+ - peft
11
+ - qwen3
12
+ - floorplan
13
+ - hypergraph
14
+ ---
15
+
16
+ # HypergraphFormer
17
+
18
+ LoRA adapters fine-tuning **Qwen/Qwen3-4B-Instruct-2507** for hypergraph-based
19
+ floorplan generation (RPLAN). The repo contains several adapters trained on
20
+ different dataset sizes.
21
+
22
+ ## Checkpoints
23
+
24
+ | Subfolder | Train samples | Step |
25
+ |---|---|---|
26
+ | `qwen_hypergraphformer_1000_samples/checkpoint-240` | 1,000 | 240 |
27
+ | `qwen_hypergraphformer_5000_samples/checkpoint-750` | 5,000 | 750 |
28
+ | `qwen_hypergraphformer_10000_samples/checkpoint-1500`| 10,000 | 1500 |
29
+ | `qwen_hypergraphformer_25000_samples/checkpoint-3900`| 25,000 | 3900 |
30
+ | `qwen_hypergraphformer/checkpoint-8700` | full | 8700 |
31
+
32
+ ## LoRA configuration
33
+
34
+ - Rank `r = 64`, `lora_alpha = 128`, `lora_dropout = 0.1`
35
+ - Target modules: `q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj`
36
+ - Task: `CAUSAL_LM`
37
+
38
+ ## Usage
39
+
40
+ ```python
41
+ from transformers import AutoModelForCausalLM, AutoTokenizer
42
+ from peft import PeftModel
43
+
44
+ base_id = "Qwen/Qwen3-4B-Instruct-2507"
45
+ repo_id = "NikitaKlimenko/HypergraphFormer"
46
+ subfolder = "qwen_hypergraphformer_25000_samples/checkpoint-3900"
47
+
48
+ tok = AutoTokenizer.from_pretrained(repo_id, subfolder=subfolder)
49
+ base = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype="auto", device_map="auto")
50
+ model = PeftModel.from_pretrained(base, repo_id, subfolder=subfolder)