Model Looping in Reasoning

#12
by remichu - opened

Hi,

The model is working very well, however i encountered occassionally model looping during the reasoning. The looping pattern look like this:

Need …
Need …
Need …

Do you have any advice on how i can hande this? I currently use the recommended sampling parameters from the README.md

That seems to be normal part of the model reasoning traces. I've seen this, but it never got stuck forever. In llama.cpp I'm running with --temp 0.7 --top-p 0.95 --top-k 40.

I managed to not have loop on my quant with temp 0.9 and 1.15 repetition penalty

remichu changed discussion status to closed

Sign up or log in to comment