Search Results
6/13/2025, 10:20:39 AM
Why does llama.cpp produce a different result when you regenerate the answer even with greedy decoding? The first answer is always different from a regenerated answer and all regenerated answers are the same. So the pattern is A B B B B...
The screenshot shows the entire conversation, there is nothing in front, the first answer is completely schizo. qwen 235
The screenshot shows the entire conversation, there is nothing in front, the first answer is completely schizo. qwen 235
Page 1