/vg/ - /mjg/ - Mahjong General
Anonymous No.538185031
>>538183918
It's the number of parameters the model has, meaning the weights it learns during training. Think of parameters as knowledge without context. Large models like Claude, Gemini, etc. are believed to have hundreds of billions to trillions of them, though the exact counts aren't published. The smallest Gemma model has 270 million parameters and is basically useless unless you give it proper context. 4b has some knowledge but it's still kind of stupid. The nice part about smaller models is that they are fast and can still interpret natural language well, even though they will get things wrong if you don't feed them some external information first.
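If it helps to see what "parameters" means in numbers, here's a rough sketch. It assumes a plain fully connected network, which is not how Gemma or any real LLM is actually laid out, and the function names and the 2 bytes per parameter (16-bit weights) are just my assumptions for illustration.

```python
def dense_param_count(layer_sizes):
    """Count weights and biases in a toy fully connected network.

    layer_sizes is a hypothetical list like [inputs, hidden, outputs].
    """
    total = 0
    for n_in, n_out in zip(layer_sizes, layer_sizes[1:]):
        total += n_in * n_out  # one weight per input/output pair
        total += n_out         # one bias per output unit
    return total


def approx_size_gb(n_params, bytes_per_param=2):
    """Rough memory for the weights alone, assuming 16-bit storage."""
    return n_params * bytes_per_param / 1e9


if __name__ == "__main__":
    print(dense_param_count([4, 8, 2]))   # toy network
    print(approx_size_gb(270_000_000))    # 270M-parameter model
    print(approx_size_gb(4_000_000_000))  # 4b-parameter model
```

Same idea at any scale: more parameters means more room to store stuff, and also more memory and compute to run it, which is why the small ones are fast.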
/vg/ - /mjg/ - Mahjong General
Anonymous No.537672631
To everyone who participated in the Basic Sanma Tourney: if you have any suggestions about the tourney, such as the length of the qualification period, please let me know either here or on the IRC.
BST2 will likely have more qualification matches, tobi nashi in the playoffs, and a tighter finals schedule.
/vg/ - /mjg/ - Mahjong General
Anonymous No.536514574
2 subs are needed for the upcoming sanma match in 20 minutes.
Anyone streaming this time?