You guys should be proud!

#3
by gemstonebro - opened

You guys should be proud. Your gemma4 31b QAT, when given a 3 KB Python file, sometimes takes up to 10 attempts to redo it by my rules... at best it "fucks up" only once or twice... whereas ANY qwen that fits in up to 24GB VRAM is so bad it doesn't fix the issues in 20 tries.

Overall it's like a 70% job well done, while for any qwen or any other popular model the success rate is 5%. I kid you not, I can give a 5 KB Python file and it's a 90% failure rate until I run out of 60k tokens explaining what the heck the LLM is doing badly. Other models refuse to listen to simple coding rules...

I highly recommend this model. If I had a 5090, I would use gemma4 31b, but probably the Q8...
