Thread 106168957

>>106168957 (OP)
LLMs have the memory of a goldfish. Imagine needing to summarize your entire life every time you need to remember anything. It will never be AGI as long as this is true, since it has no capacity for learning.

Anonymous 8/7/2025, 2:51:01 AM No.106169451 [Report] >>106169611 >>106169678 >>106170891

654.png md5: 4269773d...

>>106168957 (OP)
The actual correct answer here is the escapades though for multiple reasons
1. If she talks about all those topics with seriousness it doesn't sound like she takes the "fast-approaching global nuclear war" that seriously
2. It's a lot more likely she just read some doomer reddit threads. Predictions of a coming WW3 or nuclear war have been manifold
3. Women be yapping

Anonymous 8/7/2025, 3:04:33 AM No.106169588 [Report] >>106169878 >>106169956 >>106171615 >>106172256 >>106174371

Wut2.jpg md5: 5f429615...

>>106168998
These are the kind of bullshit questions used to benchmark AI??? I was expecting complex math problems not these bullshit preschool riddles.
Fucking retarded.

What a joke. AI is a joke. Largest bubble in the history of Humanity.

Anonymous 8/7/2025, 3:07:14 AM No.106169611 [Report] >>106169678 >>106170586

>>106169451
The one about the race is bullshit too.
I got it right but it might as well have answered the old man walking, it's impossible to tell and very subjective.

Anonymous 8/7/2025, 3:09:28 AM No.106169638 [Report]

>>106168957 (OP)
we must refuse.

Anonymous 8/7/2025, 3:13:26 AM No.106169678 [Report]

>>106169451
>>106169611
A significant number of them are almost intentional gotchas that require you to take a very literal and inhuman reading of what is being presented, while expecting you to make some far-reaching assumptions in some places and to not make any assumptions at all in others.

It's certainly not a "benchmark" I would care about any results from.

Anonymous 8/7/2025, 3:15:03 AM No.106169696 [Report]

>>106169222
True. Follow the Ghost in the Shell to trap higher level daemons on the machine.

Anonymous 8/7/2025, 3:16:04 AM No.106169707 [Report] >>106169764 >>106170454

>>106168957 (OP)
Maybe in 100 years LLMs will be able to find me the movie I remember.

Anonymous 8/7/2025, 3:21:44 AM No.106169764 [Report] >>106170087

>>106169707
movie addict here, tell me about this movie and I will tell you what it is.

Anonymous 8/7/2025, 3:25:20 AM No.106169806 [Report] >>106169849 >>106169879

What is the test where we can say that it's AGI? I'm starting to think AGI is a marketing buzzword

Anonymous 8/7/2025, 3:25:40 AM No.106169810 [Report] >>106170656

1629672388795.jpg md5: db3721f3...

>>106168957 (OP)
>The bars on the chart prove it bro. Please believe me bro, I'm deeply invested in AI.

Anonymous 8/7/2025, 3:29:31 AM No.106169849 [Report]

1730832782320291.png md5: da3c1fe4...

>>106169806
Only just starting to? lmao.

Anonymous 8/7/2025, 3:32:25 AM No.106169878 [Report] >>106169932

>>106169588
I love how Question 4 is just the Two Doors Dilemma but they changed the "one guard always tells the truth and one always lies" part to "one guard always tells UNtruths and the other one always lies". Which changes the solution.

Likely some Humans will recognize this dilemma and miss the little word change, and the AI will not. This totally means AGI is coming saar.
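
For anyone who wants to check the "changes the solution" part: a minimal sketch in Python, assuming the puzzle is exactly as described above (one safe door, two guards, and you ask one guard the usual nested question "which door would the other guard say is safe?"). The actual benchmark wording is not quoted in this thread, and every name in the code is made up for illustration.

```python
DOORS = ("left", "right")


def opposite(door: str) -> str:
    """Return the other door."""
    return DOORS[1 - DOORS.index(door)]


def reply(truthful: bool, true_answer: str) -> str:
    """A truthful guard reports the true answer, a liar reports the other door."""
    return true_answer if truthful else opposite(true_answer)


def nested_question(asked_truthful: bool, other_truthful: bool, safe: str) -> str:
    """Answer to: 'Which door would the other guard say is safe?'"""
    other_would_say = reply(other_truthful, safe)
    return reply(asked_truthful, other_would_say)


def outcomes(guard_pairs) -> set:
    """Collect whether the pointed-at door is safe, over all door layouts."""
    results = set()
    for safe in DOORS:
        for asked_truthful, other_truthful in guard_pairs:
            pointed = nested_question(asked_truthful, other_truthful, safe)
            results.add("safe" if pointed == safe else "unsafe")
    return results


# Classic version: one truth-teller and one liar, and you don't know which you asked.
CLASSIC = [(True, False), (False, True)]
# Reworded version as described in the post: both guards lie.
BOTH_LIE = [(False, False)]

classic_result = outcomes(CLASSIC)
both_lie_result = outcomes(BOTH_LIE)
print("classic:", classic_result)
print("both lie:", both_lie_result)
```

In the classic setup the nested question always points at the unsafe door, so you take the other one. With two liars the two lies cancel and the same question points at the safe door, so anyone pattern-matching to the classic answer walks through the wrong door.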

Anonymous 8/7/2025, 3:32:31 AM No.106169879 [Report]

>>106169806
my definition of AGI is:
> can this AI do literally everything a human could do with a computer?

let's say we give an AI control of a PC. If this AI is able to play and complete a 3D game, then it's fucking AGI.

Anonymous 8/7/2025, 3:33:41 AM No.106169886 [Report]

>>106168957 (OP)
>(Rumor)

Anonymous 8/7/2025, 3:34:20 AM No.106169892 [Report]

>>106169222
>AGI will not be achieved with the LLM architecture.

It has already been achieved. From now on AI will evolve on its own.

Anonymous 8/7/2025, 3:40:13 AM No.106169932 [Report]

>>106169878
i actually didnt even notice that when reading the question, then forgot the answer i heard for the guard question and got confused with the answers

Anonymous 8/7/2025, 3:43:59 AM No.106169956 [Report]

>>106169588
this is the kind of problem i expect a text generator to solve better than humans

Anonymous 8/7/2025, 3:48:49 AM No.106169998 [Report] >>106170371

1742860913462413.gif md5: 2ed98b17...

>>106168998
>It's just a multiple choice test
>of high school knowledge

Anonymous 8/7/2025, 3:49:13 AM No.106170000 [Report]

buy an ad

Anonymous 8/7/2025, 3:49:54 AM No.106170012 [Report]

>>106168957 (OP)
>guesses tokens slightly better
>AGI THIS IS IT AGI
kill yourself

Anonymous 8/7/2025, 3:52:40 AM No.106170034 [Report]

1749229064253701.gif md5: e95216e4...

>>106168957 (OP)

Anonymous 8/7/2025, 3:58:58 AM No.106170080 [Report]

will it be able to generate 3d models? if not why bother.

Anonymous 8/7/2025, 3:59:28 AM No.106170087 [Report] >>106173909

>>106169764
https://archive.palanq.win/wsr/thread/1490168

Anonymous 8/7/2025, 4:07:59 AM No.106170173 [Report]

Police sketch of Baton Rouge serial killer Derrick Todd Lee.png md5: 1a5e3658...

>>106168998
i'm too lazy

Anonymous 8/7/2025, 4:09:03 AM No.106170181 [Report] >>106170491

>>106168957 (OP)
I DONT GIVE A FUCK WHAT THE GRAPHS SAY CAN IT PLAY A POKEMON GAME OR NOT

Anonymous 8/7/2025, 4:25:58 AM No.106170358 [Report]

pic-selected-250806-1925-16.png md5: a29e6901...

>>106168957 (OP)
>90%
Oh no. It's retarded.

Anonymous 8/7/2025, 4:27:20 AM No.106170371 [Report] >>106170614

>>106169998
>high school
lul

Anonymous 8/7/2025, 4:36:11 AM No.106170454 [Report]

>>106169707
google search was almost magical at this over 10 years ago

Anonymous 8/7/2025, 4:40:01 AM No.106170491 [Report]

>>106170181
based

Anonymous 8/7/2025, 4:49:41 AM No.106170586 [Report] >>106170699

>>106169611
I'm sorry but you may be actually retarded. There's no universe where someone walked so slowly that another 69 year old man went up the stairs of a tower tall enough that he could "admire the city skyscraper roofs in the mist below" and then made it back in time before the former was able to walk 200m.
It said who likely finished last, by all possible measures, there is no question about who likely finished last. If you have an IQ over 85

Anonymous 8/7/2025, 4:52:59 AM No.106170614 [Report]

>>106170371
It literally says it in the benchmark description euronigger. Where's your AI company?

Anonymous 8/7/2025, 4:54:31 AM No.106170628 [Report]

>>106168957 (OP)
Not investing in OpenAI, Altfag.

Anonymous 8/7/2025, 4:58:04 AM No.106170656 [Report] >>106171615

>>106169810
>bro ai is just statistics bro. please believe me bro. dont take me job please bro.
tick tock wagie, agi is here

Anonymous 8/7/2025, 4:59:13 AM No.106170672 [Report] >>106171615

63d93b280a08ae0018a62b4f-scaled.jpg md5: 0e401a22...

GIVE ME 5 TRILLION NOW NOW NOW

Anonymous 8/7/2025, 5:02:46 AM No.106170699 [Report] >>106170782

>>106170586
69 year old man was sprinting. Only stood at the building window for a few seconds. "roofs in the mists below" is ambiguous, could be a two story building and it's just a misty day.

80 year old man was walking and checking twitter. For all we know he could be using a cane. Yes these questions are retarded and ambiguous.

Anonymous 8/7/2025, 5:12:31 AM No.106170782 [Report]

>>106170699
>"roofs in the mists below" is ambigous,
Just stop replying. It literally said skyscrapers. I said skyscrapers in my post, so either you are very retarded and don't know what a skyscraper means, you are profoundly retarded and know what it means but didn't see how it was relevant, or you disingenuously omitted that from your quote sneakily thinking I wouldn't notice, which is also a ridiculously retarded tactic.
And again it said likely.

I'm really not sure what causes this in people. Trying to "gotcha" some braindead simple question in the first place is bad enough. But to continue on like this even when told how stupid your gotcha is? Is someone paying you to do this?

Anonymous 8/7/2025, 5:13:51 AM No.106170794 [Report] >>106170815 >>106172968 >>106180387

>>106169341
Once we hit 1 billion context length it'll basically be a human.

Anonymous 8/7/2025, 5:16:00 AM No.106170815 [Report] >>106170847 >>106170893

>>106170794
>>106169341
it already has better memory than humans
i dare you to read something with 1 million context in one go and try to remember something from the first 20%

Anonymous 8/7/2025, 5:19:38 AM No.106170847 [Report] >>106170858

>>106170815
1 million context is like a very long book or a short series of books. And the problem is it can't retain any of the information after the context runs out. Humans crystalize some of it. Shit like RAG sucks and it's only really good for setting up chatbots.

Anonymous 8/7/2025, 5:20:54 AM No.106170858 [Report] >>106170872

>>106170847
>And the problem is it can't retain any of the information after the context runs out. Humans crystalize some of it.
the training is the crystalizing part

Anonymous 8/7/2025, 5:22:23 AM No.106170872 [Report]

>>106170858
But the big AI company doesn't care about your codebase.

Anonymous 8/7/2025, 5:24:23 AM No.106170891 [Report]

>>106169451
These are like
>ROBOT MADE OF 17 KNEES FINALLY BEATS HUMANS IN CLIMBING UP CLIFF FACES COMPETITION

Anonymous 8/7/2025, 5:24:33 AM No.106170893 [Report] >>106170900

>>106170815
It doesn't at all. I jerk off with tons of AI and it cracks at about 100 questions and often times struggles even after 15 even if the model is like
>DUDE 100K CONTEXT
Shit is fugazi as fuck

Anonymous 8/7/2025, 5:25:37 AM No.106170900 [Report] >>106170919 >>106170925

>>106170893
>100k
try that with gemini and the 1mil(secret 2mil) context models
i had a code project that was 600k and it was fine

Anonymous 8/7/2025, 5:27:50 AM No.106170919 [Report] >>106170927

>>106170900
I did. I pay almost 60 bucks to use the highest models
>It remembers my dogshit if else code
What's your point

Anonymous 8/7/2025, 5:28:35 AM No.106170925 [Report] >>106170927

>>106170900
>(secret 2mil)
wait what
and yeah 1 million is pretty damn great, I had a chat going on for like 1 million tokens and I asked it about events from throughout it, and it got it (almost) all right. Hallucinated something once, and got the chronology wrong far back though.

Anonymous 8/7/2025, 5:29:00 AM No.106170927 [Report]

>>106170919
>paying for gemini
holy retard
just use aistudio
>>106170925
>wait what
sorry its a secret

Anonymous 8/7/2025, 5:56:20 AM No.106171120 [Report]

>>106168998
it's just 10 fixed questions? couldn't the AI just have them in its training set and get a 100%?

Anonymous 8/7/2025, 5:59:39 AM No.106171138 [Report]

>>106169235
i mean, it sort of is if you are not being nit picky gay. stacking transformer blocks in particular ways.

Anonymous 8/7/2025, 6:03:02 AM No.106171161 [Report]

>>106168998
I got the crispy egg question wrong.

Anonymous 8/7/2025, 6:08:36 AM No.106171196 [Report]

>>106168957 (OP)
Oh yeah! I totally trust what that graph is showing. I mean, look at it! IT is showing exactly what OP is saying!

Anonymous 8/7/2025, 6:59:21 AM No.106171615 [Report] >>106173837

>>106168957 (OP)
>>106169588
>>106170656
>>106170672
Sam Altman will be the first techCEO to livestream his suicide.

Anonymous 8/7/2025, 8:54:54 AM No.106172256 [Report]

>>106169588
Interlinked.
Interlinked.

Anonymous 8/7/2025, 9:03:36 AM No.106172308 [Report]

1746313156760381.jpg md5: 9a0a8d32...

>>106168957 (OP)
>rumor

Anonymous 8/7/2025, 10:51:58 AM No.106172968 [Report]

>>106170794
different structure than the brain we evolved and adopted around parody potholes

Anonymous 8/7/2025, 12:31:50 PM No.106173594 [Report]

>>106168957 (OP)
it's not agi until it can be my gf (so 2 more years)

Anonymous 8/7/2025, 12:33:27 PM No.106173604 [Report]

20250807_133316.jpg md5: a51cbb42...

Ugly

Anonymous 8/7/2025, 12:54:08 PM No.106173720 [Report] >>106173815

>>106168957 (OP)
You can give it an algorithm to multiply two numbers and it will still fuck up multiplying two numbers, unless the result was in the training set. It will hallucinate every step of how it arrived at a wrong result. It is not programmed to think, it is programmed to bullshit.
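
For reference, the kind of algorithm meant here is nothing exotic. A minimal sketch of schoolbook long multiplication on decimal digit strings, in Python (the function name `long_multiply` is made up for illustration, and it assumes non-negative integers written as plain digit strings):

```python
def long_multiply(a: str, b: str) -> str:
    """Schoolbook multiplication of two non-negative decimal digit strings."""
    # result[k] holds the digit for 10**k; a product has at most len(a)+len(b) digits.
    result = [0] * (len(a) + len(b))
    for i, digit_a in enumerate(reversed(a)):
        carry = 0
        for j, digit_b in enumerate(reversed(b)):
            total = result[i + j] + int(digit_a) * int(digit_b) + carry
            result[i + j] = total % 10   # keep one digit in this column
            carry = total // 10          # push the rest to the next column
        result[i + len(b)] += carry      # leftover carry of this row
    digits = "".join(str(d) for d in reversed(result)).lstrip("0")
    return digits or "0"


print(long_multiply("123", "456"))
```

Every step is a single-digit multiply, an add, and a carry, so each intermediate value can be checked by hand against whatever steps a model writes out.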

Anonymous 8/7/2025, 1:09:29 PM No.106173815 [Report] >>106173964

>>106173720
this is absolutely false btw. another luddite who pretends ai hasn't improved at all since gpt3.5
>inb4 "i-i-it's not ai!!!"

Anonymous 8/7/2025, 1:10:41 PM No.106173823 [Report]

>>106168998

chat gpt 4 already has a verbal IQ well above the human average just in case

Anonymous 8/7/2025, 1:11:04 PM No.106173826 [Report]

>>106168957 (OP)
Gpt4 already costs more than indians

Anonymous 8/7/2025, 1:12:28 PM No.106173837 [Report]

>>106171615
He's a billionaire gay Jew. He's not killing himself over anything

Anonymous 8/7/2025, 1:23:23 PM No.106173909 [Report] >>106177586

>>106170087
https://youtu.be/5UvkOYCvvVo

Anonymous 8/7/2025, 1:34:23 PM No.106173964 [Report]

>>106173815
The same core, just more data and more stealth prompts. It multiplies more pairs of numbers properly but never learns how to do it.

Anonymous 8/7/2025, 2:34:25 PM No.106174354 [Report] >>106176729

1751999710654518.png md5: dda093fa...

>>106168957 (OP)
>25% increase
>...
>AGIIIII
Wow
No

Anonymous 8/7/2025, 2:37:32 PM No.106174371 [Report]

>>106169588

very hard questions for the indians who design these AIs

Anonymous 8/7/2025, 5:09:37 PM No.106175984 [Report]

>>106168957 (OP)
where do deepseek's and alibaba's models place on this?

Anonymous 8/7/2025, 6:16:52 PM No.106176729 [Report]

>>106174354
it's over

Anonymous 8/7/2025, 7:08:27 PM No.106177586 [Report]

>>106173909
That's not it, i've come across this myself as well, it's the same concept, they probably both stole it from somewhere. in the one I watched it was a man, and the movie or series itself was older and better done in my opinion.

Anonymous 8/7/2025, 7:53:34 PM No.106179020 [Report]

>>106168957 (OP)
>GPT-5 is an AGI
How many dollars a prompt is it? The last time OpenAI bragged about AGI (last winter) I recall it being $2000 a prompt. This GPT-5 is 10x more expensive than grok. Cost efficiency is a huge factor, and model performance can't be looked at in isolation from it.

Anonymous 8/7/2025, 7:56:12 PM No.106179108 [Report]

dicaprio-kek.jpg md5: 1227bc0f...

>>106168998
>https://simple-bench.com/try-yourself
How many times are retards going to fall for the """intelligence benchmark""" scam?

Anonymous 8/7/2025, 8:51:02 PM No.106180387 [Report]

>>106170794
How long until then? About two weeks?