A great model, but...

#10
by hklamaa - opened

This is the most all-rounded mid- to small-sized open source model I have tried so far: stable, accurate, with quite high t/s.

But just an issue when using this llm via llama.cpp to do skill call / file search. It always terminated without taking further action. Below is an example:

"This appears to be documentation about a JSON config, not the actual JSON itself β€” let me check what's actually in that file and also look for any sites*.json files:"

Not sure if others encounter the same problem? It would be perfect if such issue can be solved....

this usually means you have your temp and/or penalties set too high. I use --presence-penalty 0.0 --repeat-penalty 1.0 --temp 0.6 and it works well.

Thanks a lot - it works now!

Sign up or log in to comment