Fix config.json: remove ghost embedding_file + add per_layer_embedding_file (Cycle 10 ship-bug) 2792aa3 verified darkmaniac7 commited on Apr 26
Remove legacy PLE-schema files (taobao schema is canonical) 34bdda0 verified darkmaniac7 commited on Apr 25
V2 (FusedAttention disabled + force_full_decode_recompute): llm.mnn.json e2c2f2c verified darkmaniac7 commited on Apr 12
V2 (FusedAttention disabled + force_full_decode_recompute): llm_config.json 8818649 verified darkmaniac7 commited on Apr 12
V2 (FusedAttention disabled + force_full_decode_recompute): llm.mnn 406048c verified darkmaniac7 commited on Apr 12
V2 (FusedAttention disabled + force_full_decode_recompute): export_args.json 6a94b5a verified darkmaniac7 commited on Apr 12
V2 (FusedAttention disabled + force_full_decode_recompute): llm.mnn.json fbd4c9d verified darkmaniac7 commited on Apr 12
V2 (FusedAttention disabled + force_full_decode_recompute): llm_config.json 430d263 verified darkmaniac7 commited on Apr 12
V2 (FusedAttention disabled + force_full_decode_recompute): llm.mnn 7c02481 verified darkmaniac7 commited on Apr 12
Add llm.mnn.weight (Q4+int4 PLE, requires TokForge 3.4.9) abf702c verified darkmaniac7 commited on Apr 12
Add per_layer_embeddings_int4.bin (Q4+int4 PLE, requires TokForge 3.4.9) f38a2f6 verified darkmaniac7 commited on Apr 12
Add embeddings_int4.bin (Q4+int4 PLE, requires TokForge 3.4.9) b2163a0 verified darkmaniac7 commited on Apr 12
Add tokenizer.txt (Q4+int4 PLE, requires TokForge 3.4.9) e423548 verified darkmaniac7 commited on Apr 12
Add config.json (Q4+int4 PLE, requires TokForge 3.4.9) 15a5cfe verified darkmaniac7 commited on Apr 12
Add llm_config.json (Q4+int4 PLE, requires TokForge 3.4.9) 8b5978c verified darkmaniac7 commited on Apr 12