>>538183918
It's the number of parameters in the model, i.e. the weights it learns during training. Think of parameters as its knowledge without context. Large models like Claude, Gemini, etc. are rumored to have hundreds of billions to trillions of them. The smallest Gemma has 270 million parameters and is basically functionally retarded without proper context; the 4B one has some knowledge but it's still kind of stupid. The nice part about smaller models is that they're fast and can still interpret natural language well, they'll just get facts wrong unless you feed them some external information first.
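Here's roughly what "feeding it context" looks like in practice, a minimal sketch with Hugging Face transformers. The model id (google/gemma-3-270m) and the example prompt are just assumptions, swap in whatever small checkpoint you actually have access to (Gemma weights are gated, so you need to accept the license on the hub first):
```python
# Sketch: giving a tiny model external context so it only has to READ the
# answer instead of recalling it from its 270M parameters.
# Model id is an assumption; replace with your own small checkpoint.
from transformers import pipeline

generator = pipeline("text-generation", model="google/gemma-3-270m")

# Without context: the model has to rely on whatever facts fit in its weights.
bare_prompt = "Question: When was the Eiffel Tower completed?\nAnswer:"

# With context: paste the relevant info into the prompt, the model just
# rephrases it, which even small models handle fine.
context = "The Eiffel Tower was completed in 1889 for the World's Fair in Paris."
grounded_prompt = (
    f"Context: {context}\n"
    "Question: When was the Eiffel Tower completed?\nAnswer:"
)

for prompt in (bare_prompt, grounded_prompt):
    out = generator(prompt, max_new_tokens=40, do_sample=False)
    print(out[0]["generated_text"])
    print("-" * 40)
```
Same idea scales up to RAG: retrieve the relevant text from somewhere, stuff it in the prompt, and the small model's lack of built-in knowledge matters a lot less.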