>>106213785 (OP)
The musing that Large Language Models are merely a complex form of lossy compression is somewhat reductive. Training actively cultivates generalization: the model's representation of the data is deliberately smoothed and expanded so it can produce a wider range of outputs. In effect this builds fuzzy logic circuits, in which the model learns to map input patterns to output predictions by probabilistic similarity, generalizing beyond the exact instances in the original training data.
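A minimal sketch of the distinction, using hypothetical toy data rather than a real LLM: a lookup table (the compression view) returns nothing for an input it never stored, while even a single trained softmax layer maps any input, seen or not, to a graded probability distribution over outputs.

import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy data: 4-dim input patterns mapped to one of 3 outputs.
X_train = rng.normal(size=(30, 4))
y_train = rng.integers(0, 3, size=30)

# 1) Memorizer: a pure lookup table, like compressed storage of the
#    training pairs. Anything unseen has no entry at all.
table = {tuple(np.round(x, 3)): y for x, y in zip(X_train, y_train)}

def memorizer(x):
    return table.get(tuple(np.round(x, 3)))  # None for unseen inputs

# 2) Parametric model: one softmax layer trained by gradient descent.
#    Its weights blur across training examples, so it assigns
#    probabilities to inputs by similarity, not exact match.
W = np.zeros((4, 3))

def softmax(z):
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

for _ in range(500):
    probs = softmax(X_train @ W)                        # (30, 3)
    grad = X_train.T @ (probs - np.eye(3)[y_train]) / 30
    W -= 0.5 * grad

x_new = rng.normal(size=4)       # a pattern never seen during training
print(memorizer(x_new))          # None: the table has nothing for it
print(softmax(x_new @ W))        # a graded distribution over all outputs

The memorizer is faithful to its stored data and useless off it; the parametric model trades exact recall for a smooth mapping, which is the sense in which training expands rather than merely compresses the data.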