What is the reason to use this model instead of simple mlx 4 bit?

#1
by zaskara - opened

Please explain; I didn't get it. On the provided benchmark, it has 89 vs. 91 for the basic 4-bit version of this model. And it seems to have the same size. Then, what is the reason to use it instead of basic 4 bit?

MLX Community org

This was still under active development the calibration mix has been improved and made more diverse. New benchmarks were also added that show clear improvement over uniform 4 bit quants.

Sign up or log in to comment