Anonymous
8/21/2025, 3:26:46 PM
No.106335810
Gpt-5/horizon alpha ignored the "Length: 1000 words." prompt and thus have inflated scores on eq-bench writing tasks. Most model outputs have ~1000 words/~7000 characters. Gpt-5 outputs have ~2000 words/~14000 characters