Commit History

Dual-signature export v3: aligned chat+classifier decomposition pipelines so cross-program constant dedup works (3.9 GB, matches single-sig). 4 signatures: prefill_128_chat / decode_chat (no LoRA, no classifier) + prefill_128_classifier / decode_classifier (full).
aa91a6c
verified

DarrenJiaImbue commited on

Dual-signature export: decode_chat / prefill_128_chat (no LoRA, no classifier) + decode_classifier / prefill_128_classifier (with LoRA + classifier head). iOS Metal fused-attention shader should now match the chat graph.
2143079
verified

DarrenJiaImbue commited on

QAT-trained base + classifier (gemma4_mixed48 fake-quant in the loop)
0ad7a03
verified

DarrenJiaImbue commited on

QAT-trained LoRA (gemma4_mixed48 fake-quant in the loop)
4ff68a3
verified

DarrenJiaImbue commited on

Upload tokenizer_config.json with huggingface_hub
619ea7c
verified

DarrenJiaImbue commited on

Upload tokenizer.json with huggingface_hub
e40da92
verified

DarrenJiaImbue commited on

Upload lora_adapter.tflite with huggingface_hub
5232860
verified

DarrenJiaImbue commited on

Upload model.litertlm with huggingface_hub
590177c
verified

DarrenJiaImbue commited on

Upload README.md with huggingface_hub
af4faf4
verified

DarrenJiaImbue commited on

initial commit
e0edb1f
verified

DarrenJiaImbue commited on