►Recent Highlights from the Previous Thread: >>106236127

--Paper: Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens:
>106237976 >106238003 >106238047 >106238091 >106238043 >106238046 >106238164 >106238352 >106238182 >106238229
--Papers:
>106243214
--Extraction of base model from GPT-OSS reveals fragility of alignment:
>106242752 >106242778 >106242781 >106243233 >106243306 >106242819 >106243237
--Testing multimodal AI on an anime image reveals widespread failure in object counting and OCR tasks:
>106240801 >106240818 >106240916 >106241023 >106241158 >106241198 >106241568 >106242968 >106243022 >106243656 >106243668 >106243144 >106243204
--Prompt engineering strategies to reduce model repetition in interactive scenarios:
>106238216 >106238307 >106238383 >106241211 >106241225 >106241287 >106241231
--SXM2 NVLink boards for multi-GPU speedup over PCIe:
>106236838 >106236862 >106236971 >106237389 >106237040
--Intel's slowing contributions to llama.cpp and concerns about AI software maturity:
>106239892
--Jinx models' high performance and zero safety refusals:
>106236916 >106236935 >106236969 >106236999 >106237019 >106237032 >106237029 >106237030 >106237057 >106237077 >106237151 >106237175 >106237675 >106237705 >106237713
--ZLUDA implements on-disk kernel cache for persistent PTX code storage:
>106241335 >106241347 >106241420
--MistralAI's internal Creative Writing benchmark revealed in model performance chart:
>106237528 >106237560
--Mixture-of-Agents as a practical alternative to scaling monolithic models:
>106236258 >106236274 >106236295 >106236285 >106236524 >106236567 >106236592 >106236526 >106236699
--GPT-5 leads in GitHub issue resolution but metric validity questioned:
>106239502 >106239551 >106239555 >106239581
--Release v3.10 - Multimodal support!:
>106242120
--Miku (free space):
>106237067 >106241198 >106243226

►Recent Highlight Posts from the Previous Thread: >>106236131

Why?: 9 reply limit >>102478518
Fix: https://rentry.org/lmg-recap-script