Thread 106168957 - /g/ [Archived: 13 hours ago]

Anonymous
8/7/2025, 2:00:42 AM No.106168957
20250807_024411
md5: 73a687c0139222826cfe8a395273f079
That's fucking it. I'm making the call. GPT-5 is an AGI.
Replies: >>106169094 >>106169147 >>106169222 >>106169310 >>106169341 >>106169451 >>106169638 >>106169707 >>106169810 >>106169886 >>106170012 >>106170034 >>106170181 >>106170358 >>106170628 >>106171196 >>106171615 >>106172308 >>106173594 >>106173720 >>106173826 >>106174354 >>106175984 >>106179020
Anonymous
8/7/2025, 2:01:55 AM No.106168969
What does the test include? What kind of questions
Replies: >>106168998
Anonymous
8/7/2025, 2:04:37 AM No.106168998
>>106168969
https://simple-bench.com/try-yourself
Replies: >>106169588 >>106169998 >>106170173 >>106171120 >>106171161 >>106173823 >>106179108
Anonymous
8/7/2025, 2:13:18 AM No.106169094
>>106168957 (OP)
Nobody cares about your calls, bozo.
Anonymous
8/7/2025, 2:19:06 AM No.106169147
>>106168957 (OP)
seems legit
Anonymous
8/7/2025, 2:26:29 AM No.106169222
>>106168957 (OP)
AGI will not be achieved with the LLM architecture. Screenshot this.
Replies: >>106169235 >>106169696 >>106169892
Anonymous
8/7/2025, 2:27:53 AM No.106169235
>>106169222
LLM is not an architecture, you fucking retard.
Replies: >>106169280 >>106171138
Anonymous
8/7/2025, 2:32:46 AM No.106169280
>>106169235
Whatever you want to call it, it will not produce AGI.
Anonymous
8/7/2025, 2:36:58 AM No.106169310
>>106168957 (OP)
Okay, I'll accept it. Now what?
Anonymous
8/7/2025, 2:39:42 AM No.106169341
>>106168957 (OP)
LLMs have the memory of a goldfish. Imagine needing to summarize your entire life every time you need to remember anything. It will never be AGI as long as this is true, as it has no capability of learning.
Replies: >>106170794 >>106170815
Anonymous
8/7/2025, 2:51:01 AM No.106169451
654
md5: 4269773da460a4e9db70f3f540ff7e57
>>106168957 (OP)
The actual correct answer here is the escapades though for multiple reasons
1. If she talks about all those topics with seriousness it doesn't sound like she takes the "fast-approaching global nuclear war" that seriously
2. It's a lot more likely she just read some doomer reddit threads. Predictions of a coming WW3 or nuclear war have been manifold
3. Women be yapping
Replies: >>106169611 >>106169678 >>106170891
Anonymous
8/7/2025, 3:04:33 AM No.106169588
Wut2
md5: 5f429615f54da5fa8742a17ce0ee50e1
>>106168998
These are the kind of bullshit questions used to benchmark AI??? I was expecting complex math problems not these bullshit preschool riddles.
Fucking retarded.

What a joke. AI is a joke. Largest bubble in the history of Humanity.
Replies: >>106169878 >>106169956 >>106171615 >>106172256 >>106174371
Anonymous
8/7/2025, 3:07:14 AM No.106169611
>>106169451
The one about the race is bullshit too.
I got it right but the answer might as well have been the old man walking; it's impossible to tell and very subjective.
Replies: >>106169678 >>106170586
Anonymous
8/7/2025, 3:09:28 AM No.106169638
>>106168957 (OP)
we must refuse.
Anonymous
8/7/2025, 3:13:26 AM No.106169678
>>106169451
>>106169611
A significant number of them are almost intentional gotchas that require you to take a very literal and inhuman reading of what is being presented, while expecting you to make some far-reaching assumptions in some places and no assumptions at all in others.

It's certainly not a "benchmark" I would care about any results from.
Anonymous
8/7/2025, 3:15:03 AM No.106169696
>>106169222
True. Follow the Ghost in the Shell to trap higher level daemons on the machine.
Anonymous
8/7/2025, 3:16:04 AM No.106169707
>>106168957 (OP)
Maybe in 100 years LLM will be able to find me the movie I remember.
Replies: >>106169764 >>106170454
Anonymous
8/7/2025, 3:21:44 AM No.106169764
>>106169707
movie addict here, tell me about this movie and I will tell you what it is.
Replies: >>106170087
Anonymous
8/7/2025, 3:25:20 AM No.106169806
What is the test where we can say that it's AGI? I'm starting to think AGI is a marketing buzzword
Replies: >>106169849 >>106169879
Anonymous
8/7/2025, 3:25:40 AM No.106169810
1629672388795
md5: db3721f3c1120d46ca4eaa88dd2c4a92
>>106168957 (OP)
>The bars on the chart prove it bro. Please believe me bro, I'm deeply invested in AI.
Replies: >>106170656
Anonymous
8/7/2025, 3:29:31 AM No.106169849
1730832782320291
md5: da3c1fe4bdfa11df46127b4aea4e5e69
>>106169806
Only just starting to? lmao.
Anonymous
8/7/2025, 3:32:25 AM No.106169878
>>106169588
I love how Question 4 is just the Two Doors Dilemma but they changed the "one guard always tells the truth and one always lies" part to "one guard always tells UNtruths and the other one always lies". Which changes the solution.

Likely some Humans will recognize this dilemma and miss the little word change, and the AI will not. This totally means AGI is coming saar.
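
The point about the changed wording can be sanity-checked with a tiny brute force. This is only a sketch, assuming "always tells untruths" means the same as "always lies" (so both guards lie) and that exactly one of two doors is safe; every name below is made up for illustration and none of it comes from the benchmark itself.

```python
DOORS = ("left", "right")

def other(door):
    """Return the door that is not `door`."""
    return "right" if door == "left" else "left"

def says(is_liar, honest_answer):
    """What a guard answers when the honest answer is `honest_answer`."""
    return other(honest_answer) if is_liar else honest_answer

def classic_strategy(safe, asked_is_liar, other_is_liar):
    # Classic trick: ask one guard which door the OTHER guard would
    # call safe, then walk through the opposite door.
    other_would_say = says(other_is_liar, safe)
    reported = says(asked_is_liar, other_would_say)
    return other(reported)

def direct_strategy(safe, asked_is_liar):
    # Simpler plan: ask "which door is safe?" and take the opposite door.
    return other(says(asked_is_liar, safe))

# Original puzzle: one truth-teller, one liar (we may ask either one).
original_cases = [(False, True), (True, False)]
# Modified puzzle (assumed reading): both guards lie.
modified_cases = [(True, True)]

classic_on_original = all(
    classic_strategy(safe, a, b) == safe
    for safe in DOORS for a, b in original_cases
)
classic_on_modified = all(
    classic_strategy(safe, a, b) == safe
    for safe in DOORS for a, b in modified_cases
)
direct_on_modified = all(
    direct_strategy(safe, a) == safe
    for safe in DOORS for a, _ in modified_cases
)

print(classic_on_original, classic_on_modified, direct_on_modified)
```

Under that reading the classic "ask about the other guard" trick sends you through the wrong door every time, while just asking either guard and doing the opposite works.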
Replies: >>106169932
Anonymous
8/7/2025, 3:32:31 AM No.106169879
>>106169806
my definition of AGI is:
> can this AI do literally everything a human could do with a computer?

let's say we give an AI control of a PC. If this AI is able to play and complete a 3D game, then it's fucking AGI.
Anonymous
8/7/2025, 3:33:41 AM No.106169886
>>106168957 (OP)
>(Rumor)
Anonymous
8/7/2025, 3:34:20 AM No.106169892
>>106169222
>AGI will not be achieved with the LLM architecture.

It has already been achieved. From now on AI will evolve on its own.
Anonymous
8/7/2025, 3:40:13 AM No.106169932
>>106169878
i actually didnt even notice that when reading the question, then forgot the answer i heard for the guard question and got confused with the answers
Anonymous
8/7/2025, 3:43:59 AM No.106169956
>>106169588
this is the kind of problem i expect a text generator to solve better than humans
Anonymous
8/7/2025, 3:48:49 AM No.106169998
1742860913462413
md5: 2ed98b175c5c619ff4b053e24bf55f40
>>106168998
>It's just a multiple choice test
>of high school knowledge
Replies: >>106170371
Anonymous
8/7/2025, 3:49:13 AM No.106170000
buy an ad
Anonymous
8/7/2025, 3:49:54 AM No.106170012
>>106168957 (OP)
>guesses tokens slightly better
>AGI THIS IS IT AGI
kill yourself
Anonymous
8/7/2025, 3:52:40 AM No.106170034
1749229064253701
md5: e95216e4effe45c4bdc04aa75a62b52a
>>106168957 (OP)
Anonymous
8/7/2025, 3:58:58 AM No.106170080
will it be able to generate 3d models? if not why bother.
Anonymous
8/7/2025, 3:59:28 AM No.106170087
>>106169764
https://archive.palanq.win/wsr/thread/1490168
Replies: >>106173909
Anonymous
8/7/2025, 4:07:59 AM No.106170173
Police sketch of Baton Rouge serial killer Derrick Todd Lee
>>106168998
i'm too lazy
Anonymous
8/7/2025, 4:09:03 AM No.106170181
>>106168957 (OP)
I DONT GIVE A FUCK WHAT THE GRAPHS SAY CAN IT PLAY A POKEMON GAME OR NOT
Replies: >>106170491
Anonymous
8/7/2025, 4:25:58 AM No.106170358
pic-selected-250806-1925-16
md5: a29e690159ca5868f82f3f515fe4ebd6
>>106168957 (OP)
>90%
Oh no. It's retarded.
Anonymous
8/7/2025, 4:27:20 AM No.106170371
>>106169998
>high school
lul
Replies: >>106170614
Anonymous
8/7/2025, 4:36:11 AM No.106170454
>>106169707
google search was almost magical at this over 10 years ago
Anonymous
8/7/2025, 4:40:01 AM No.106170491
>>106170181
based
Anonymous
8/7/2025, 4:49:41 AM No.106170586
>>106169611
I'm sorry but you may be actually retarded. There's no universe where someone walked so slowly that another 69 year old man went up the stairs of a tower tall enough that he could "admire the city skyscraper roofs in the mist below" and then made it back in time before the former was able to walk 200m.
It said who likely finished last, by all possible measures, there is no question about who likely finished last. If you have an IQ over 85
Replies: >>106170699
Anonymous
8/7/2025, 4:52:59 AM No.106170614
>>106170371
It literally says it in the benchmark description euronigger. Where's your AI company?
Anonymous
8/7/2025, 4:54:31 AM No.106170628
>>106168957 (OP)
Not investing in OpenAI, Altfag.
Anonymous
8/7/2025, 4:58:04 AM No.106170656
>>106169810
>bro ai is just statistics bro. please believe me bro. dont take me job please bro.
tick tock wagie, agi is here
Replies: >>106171615
Anonymous
8/7/2025, 4:59:13 AM No.106170672
63d93b280a08ae0018a62b4f-scaled
md5: 0e401a22e9a1e4de36aad36a02ec4a7b
GIVE ME 5 TRILLION NOW NOW NOW
Replies: >>106171615
Anonymous
8/7/2025, 5:02:46 AM No.106170699
>>106170586
69 year old man was sprinting. Only stood on the building window for a few seconds. "roofs in the mists below" is ambiguous, could be a two story building and it's just a misty day.

80 year old man was walking and checking twitter. For all we know he could be using a cane. Yes these questions are retarded and ambiguous.
Replies: >>106170782
Anonymous
8/7/2025, 5:12:31 AM No.106170782
>>106170699
>"roofs in the mists below" is ambiguous,
Just stop replying. It literally said skyscrapers. I said skyscrapers in my post, so either you are very retarded and don't know what a skyscraper means, you are profoundly retarded and know what it means but didn't see how it was relevant, or you disingenuously omitted that from your quote sneakily thinking I wouldn't notice, which is also a ridiculously retarded tactic.
And again it said likely.

I'm really not sure what causes this in people. Trying to "gotcha" some braindead simple question in the first place is bad enough. But to continue on like this even when told how stupid your gotcha is? Is someone paying you to do this?
Anonymous
8/7/2025, 5:13:51 AM No.106170794
>>106169341
Once we hit 1 billion context length it'll basically be a human.
Replies: >>106170815 >>106172968 >>106180387
Anonymous
8/7/2025, 5:16:00 AM No.106170815
>>106170794
>>106169341
it already has better memory than humans
i dare you to read something with 1 million context in one go and try to remember something from the first 20%
Replies: >>106170847 >>106170893
Anonymous
8/7/2025, 5:19:38 AM No.106170847
>>106170815
1 million context is like a very long book or a short series of books. And the problem is it can't retain any of the information after the context runs out. Humans crystalize some of it. Shit like RAG sucks and it's only really good for setting up chatbots.
Replies: >>106170858
Anonymous
8/7/2025, 5:20:54 AM No.106170858
>>106170847
>And the problem is it can't retain any of the information after the context runs out. Humans crystalize some of it.
the training is the crystalizing part
Replies: >>106170872
Anonymous
8/7/2025, 5:22:23 AM No.106170872
>>106170858
But the big AI company doesn't care about your codebase.
Anonymous
8/7/2025, 5:24:23 AM No.106170891
>>106169451
These are like
>ROBOT MADE OF 17 KNEES FINALLY BEATS HUMANS IN CLIMBING UP CLIFF FACES COMPETITION
Anonymous
8/7/2025, 5:24:33 AM No.106170893
>>106170815
It doesn't at all. I jerk off with tons of AI and it cracks at about 100 questions and often times struggles even after 15 even if the model is like
>DUDE 100K CONTEXT
Shit is fugazi as fuck
Replies: >>106170900
Anonymous
8/7/2025, 5:25:37 AM No.106170900
>>106170893
>100k
try that with gemini and the 1mil(secret 2mil) context models
i had a code project that was 600k and it was fine
Replies: >>106170919 >>106170925
Anonymous
8/7/2025, 5:27:50 AM No.106170919
>>106170900
I did. I pay almost 60 bucks to use the highest models
>It remembers my dogshit if else code
What's your point
Replies: >>106170927
Anonymous
8/7/2025, 5:28:35 AM No.106170925
>>106170900
>(secret 2mil)
wait what
and yeah 1 million is pretty damn great, I had a chat going on for like 1 million tokens and I asked it about events from throughout it, and it got it (almost) all right. Hallucinated something once, and got the chronology wrong far back though.
Replies: >>106170927
Anonymous
8/7/2025, 5:29:00 AM No.106170927
>>106170919
>paying for gemini
holy retard
just use aistudio
>>106170925
>wait what
sorry its a secret
Anonymous
8/7/2025, 5:56:20 AM No.106171120
>>106168998
it's just 10 fixed questions? couldn't the AI just have them in its training set and get a 100%?
Anonymous
8/7/2025, 5:59:39 AM No.106171138
>>106169235
i mean, it sort of is if you are not being nit picky gay. stacking transformer blocks in particular ways.
Anonymous
8/7/2025, 6:03:02 AM No.106171161
>>106168998
I got the crispy egg question wrong.
Anonymous
8/7/2025, 6:08:36 AM No.106171196
>>106168957 (OP)
Oh yeah! I totally trust what that graph is showing. I mean, look at it! IT is showing exactly what OP is saying!
Anonymous
8/7/2025, 6:59:21 AM No.106171615
>>106168957 (OP)
>>106169588
>>106170656
>>106170672
Sam Altman will be the first techCEO to livestream his suicide.
Replies: >>106173837
Anonymous
8/7/2025, 8:54:54 AM No.106172256
>>106169588
Interlinked.
Interlinked.
Anonymous
8/7/2025, 9:03:36 AM No.106172308
1746313156760381
md5: 9a0a8d32eb68d16b888fe37443e2b974
>>106168957 (OP)
>rumor
Anonymous
8/7/2025, 10:51:58 AM No.106172968
>>106170794
different structure than the brain we evolved and adopted around parody potholes
Anonymous
8/7/2025, 12:31:50 PM No.106173594
>>106168957 (OP)
it's not agi until it can be my gf (so 2 more years)
Anonymous
8/7/2025, 12:33:27 PM No.106173604
20250807_133316
md5: a51cbb42e3d658a1017704c55b581ea6
Ugly
Anonymous
8/7/2025, 12:54:08 PM No.106173720
>>106168957 (OP)
You can give it an algorithm to multiply two numbers and it will still fuck up multiplying two numbers, unless the result was in the training set. It will hallucinate every step of how it arrived at a wrong result. It is not programmed to think, it is programmed to bullshit.
Replies: >>106173815
Anonymous
8/7/2025, 1:09:29 PM No.106173815
>>106173720
this is absolutely false btw. another luddite who pretends ai hasn't improved at all since gpt3.5
>inb4 "i-i-it's not ai!!!"
Replies: >>106173964
Anonymous
8/7/2025, 1:10:41 PM No.106173823
>>106168998

chat gpt 4 already has a verbal IQ well above the human average just in case
Anonymous
8/7/2025, 1:11:04 PM No.106173826
>>106168957 (OP)
Gpt4 already costs more than indians
Anonymous
8/7/2025, 1:12:28 PM No.106173837
>>106171615
He's a billionaire gay Jew. He's not killing himself over anything
Anonymous
8/7/2025, 1:23:23 PM No.106173909
>>106170087
https://youtu.be/5UvkOYCvvVo
Replies: >>106177586
Anonymous
8/7/2025, 1:34:23 PM No.106173964
>>106173815
The same core, just more data and more stealth prompts. It multiplies more pairs of numbers properly but never learns how to do it.
Anonymous
8/7/2025, 2:34:25 PM No.106174354
1751999710654518
md5: dda093fac92c9cec6f63337186f4a4b1
>>106168957 (OP)
>25% increase
>...
>AGIIIII
Wow
No
Replies: >>106176729
Anonymous
8/7/2025, 2:37:32 PM No.106174371
>>106169588

very hard questions for the indians who design these AIs
Anonymous
8/7/2025, 5:09:37 PM No.106175984
>>106168957 (OP)
where do deepseek's and alibaba's models place on this?
Anonymous
8/7/2025, 6:16:52 PM No.106176729
>>106174354
it's over
Anonymous
8/7/2025, 7:08:27 PM No.106177586
>>106173909
That's not it. I've come across this myself as well; it's the same concept, they probably both stole it from somewhere. In the one I watched it was a man, the movie or series itself was older, and it was better done in my opinion.
Anonymous
8/7/2025, 7:53:34 PM No.106179020
>>106168957 (OP)
>GPT-5 is an AGI
How many dollars a prompt is it? The last time OpenAI bragged about AGI (last winter) I recall it being $2000 a prompt. This GPT-5 is 10x more expensive than grok. Cost efficiency is a huge factor, and model performance can't be looked at in isolation from it.
Anonymous
8/7/2025, 7:56:12 PM No.106179108
dicaprio-kek
md5: 1227bc0f785262ace3386bb6d0beaa89
>>106168998
>https://simple-bench.com/try-yourself
How many times are retards going to fall for the """intelligence benchmark""" scam?
Anonymous
8/7/2025, 8:51:02 PM No.106180387
>>106170794
How long until then? About two weeks?