---
license: apache-2.0
---

This model is converted from `Qwen/Qwen2.5-1.5B`.

This model contains bias terms in the `q_proj` and `k_proj` layers. Another model with bias in `q_proj` and `k_proj` is [deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B).
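
One way to confirm which attention projections carry a bias is to scan the checkpoint's parameter names. The sketch below is a minimal illustration, not part of this repository: the helper `attention_bias_projections` is hypothetical, the names in `example_names` are made up for demonstration, and the `...self_attn.<proj>.bias` naming pattern is an assumption based on the convention commonly used for this model family.

```python
import re

# Assumed naming convention: "model.layers.<i>.self_attn.<proj>.bias".
_BIAS_PATTERN = re.compile(r"\.self_attn\.(q_proj|k_proj|v_proj|o_proj)\.bias$")


def attention_bias_projections(param_names):
    """Return the sorted attention projection names that have a bias parameter."""
    found = set()
    for name in param_names:
        match = _BIAS_PATTERN.search(name)
        if match:
            found.add(match.group(1))
    return sorted(found)


# Illustrative parameter names only (hypothetical, not read from this checkpoint).
example_names = [
    "model.layers.0.self_attn.q_proj.weight",
    "model.layers.0.self_attn.q_proj.bias",
    "model.layers.0.self_attn.k_proj.weight",
    "model.layers.0.self_attn.k_proj.bias",
    "model.layers.0.self_attn.o_proj.weight",
]

print(attention_bias_projections(example_names))
```

With a model loaded as a PyTorch module, the same helper could be fed `[name for name, _ in model.named_parameters()]` in place of `example_names`.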