Full BF16 variant
#3 opened 5 days ago
by
KeinNiemand
Knowledge degradation across the 504B REAP variants — empirical findings and which cut to pick
👍 2
1
#2 opened 8 days ago
by
rene98c
Parity with the unpruned model?
#1 opened 9 days ago
by
CosmicRaisins