Search Results

Found 1 results for "bad538e9c5e94bc4f2dd74a10034e5d5" across all boards searching md5.

Anonymous /g/105611492#105620326
6/17/2025, 1:52:55 PM
Qwen won't make dense models larger than 30B anymore.

https://x.com/JustinLin610/status/1934809653004939705
> For dense models larger than 30B, it is a bit hard to optimize effectiveness and efficiency (either training or inference). We prefer to use MoE for large models.