voidstream commited on
Commit
6d43155
Β·
verified Β·
1 Parent(s): d582e02

Update model card: LoRA fine-tuning now supported via HelixLinearSTE

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -71,7 +71,7 @@ That's it. `import helix_substrate` registers the quantizer. `from_pretrained()`
71
 
72
  - **+8.0% PPL delta** β€” higher than transformers at this scale, but down from +18.4% at 130M. SSM compression quality scales with model size.
73
  - **GPU and CPU supported** β€” runs on any CUDA GPU or CPU via standard PyTorch. Fused kernels for additional speedup are in progress.
74
- - **Not fine-tunable** β€” compressed weights are read-only (`is_trainable = False`).
75
  - **Requires `helix-substrate`** β€” the quantizer is not built into transformers. You need `pip install "helix-substrate[hf]"`.
76
  - **`mamba-ssm` recommended** β€” without it, falls back to a slower sequential code path.
77
  - **Requires `transformers >= 4.45`** β€” for Mamba2 architecture support.
 
71
 
72
  - **+8.0% PPL delta** β€” higher than transformers at this scale, but down from +18.4% at 130M. SSM compression quality scales with model size.
73
  - **GPU and CPU supported** β€” runs on any CUDA GPU or CPU via standard PyTorch. Fused kernels for additional speedup are in progress.
74
+ - **Fine-tunable via LoRA** β€” compressed weights remain frozen, but LoRA adapters attach to each `HelixLinear` layer via `HelixLinearSTE`. See `helix-substrate` for training infrastructure.
75
  - **Requires `helix-substrate`** β€” the quantizer is not built into transformers. You need `pip install "helix-substrate[hf]"`.
76
  - **`mamba-ssm` recommended** β€” without it, falls back to a slower sequential code path.
77
  - **Requires `transformers >= 4.45`** β€” for Mamba2 architecture support.