code2lora committed · Commit 2728b31 · verified · 1 Parent(s): baf34f4

Update dataset/model card

Files changed (1)
  1. README.md +27 -0
README.md ADDED
@@ -0,0 +1,27 @@
+ ---
+ license: mit
+ tags: [code, lora, hypernetwork, peft]
+ ---
+
+ # Code2LoRA — direct-projection hypernetwork
+
+ Final checkpoint of the **direct-projection** Code2LoRA hypernetwork used in
+ the paper. Maps a repository-level embedding into a rank-16 LoRA adapter for
+ `Qwen/Qwen2.5-Coder-1.5B` in a single forward pass.
+
+ ## Files
+
+ | File | Description |
+ |---|---|
+ | `code2lora_direct.pt` | Trained `Code2LoRAHead` weights (~2.7 GB, fp32). Loaded with `torch.load(map_location="cpu")`. |
+
+ ## Training recipe
+
+ * 3 epochs on the `code2lora/code2lora-data-snapshots` dataset.
+ * AdamW + cosine schedule, max-seq-len 8192, bf16, single H100 80 GB.
+ * See [`code2lora/code2lora`](https://github.com/) for the trainer code.
+
+ ## Companion model
+
+ `code2lora/code2lora-gru` -- the streaming-recurrent variant trained on
+ commit deltas.
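
Note on the sizes quoted in the README: the Files table lists the checkpoint as ~2.7 GB in fp32, and the description mentions a rank-16 LoRA adapter. As a rough sanity check, the sketch below converts a file size into an approximate parameter count and counts the values in one rank-16 LoRA pair. It assumes 1 GB = 10^9 bytes and 4 bytes per fp32 parameter; the 1536 x 1536 matrix shape and both function names are hypothetical, chosen only for illustration and not taken from the README or the trainer code.

```python
def params_from_size(size_bytes: float, bytes_per_param: int = 4) -> float:
    """Approximate parameter count for a checkpoint of the given size.

    Assumes every parameter is stored at the same width (4 bytes for fp32)
    and ignores any file-format overhead.
    """
    return size_bytes / bytes_per_param


def lora_param_count(d_in: int, d_out: int, rank: int) -> int:
    """Number of values in one LoRA pair for a (d_out x d_in) weight matrix.

    LoRA factors the update as B @ A, with A of shape (rank, d_in) and
    B of shape (d_out, rank), so the pair holds rank * (d_in + d_out) values.
    """
    return rank * d_in + d_out * rank


if __name__ == "__main__":
    # ~2.7 GB in fp32, taking 1 GB = 10**9 bytes (assumption).
    approx_params = params_from_size(2.7e9)
    print(f"approx. hypernetwork parameters: {approx_params:,.0f}")

    # Hypothetical square projection of width 1536 at rank 16.
    per_matrix = lora_param_count(d_in=1536, d_out=1536, rank=16)
    print(f"values in one rank-16 LoRA pair: {per_matrix:,}")
```

Under these assumptions the checkpoint holds on the order of several hundred million hypernetwork parameters, far more than a single LoRA pair; the README does not state which target matrices the hypernetwork covers, so the per-matrix figure is only an illustration of the formula.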