
Thread 106168957

82 posts 28 images /g/
Anonymous No.106168957 [Report] >>106169094 >>106169147 >>106169222 >>106169310 >>106169341 >>106169451 >>106169638 >>106169707 >>106169810 >>106169886 >>106170012 >>106170034 >>106170181 >>106170358 >>106170628 >>106171196 >>106171615 >>106172308 >>106173594 >>106173720 >>106173826 >>106174354 >>106175984 >>106179020
That's fucking it. I'm making the call. GPT-5 is an AGI.
Anonymous No.106168969 [Report] >>106168998
What does the test include? What kind of questions
Anonymous No.106168998 [Report] >>106169588 >>106169998 >>106170173 >>106171120 >>106171161 >>106173823 >>106179108
>>106168969
https://simple-bench.com/try-yourself
Anonymous No.106169094 [Report]
>>106168957 (OP)
Nobody cares about your calls, bozo.
Anonymous No.106169147 [Report]
>>106168957 (OP)
seems legit
Anonymous No.106169222 [Report] >>106169235 >>106169696 >>106169892
>>106168957 (OP)
AGI will not be achieved with the LLM architecture. Screenshot this.
Anonymous No.106169235 [Report] >>106169280 >>106171138
>>106169222
LLM is not an architecture, you fucking retard.
Anonymous No.106169280 [Report]
>>106169235
Whatever you want to call it, it will not produce AGI.
Anonymous No.106169310 [Report]
>>106168957 (OP)
Okay, I'll accept it. Now what?
Anonymous No.106169341 [Report] >>106170794 >>106170815
>>106168957 (OP)
LLMs have the memory of a goldfish. Imagine needing to summarize your entire life every time you need to remember anything. It will never be AGI as long as this is true, as it has no capability of learning.
Anonymous No.106169451 [Report] >>106169611 >>106169678 >>106170891
>>106168957 (OP)
The actual correct answer here is the escapades though for multiple reasons
1. If she talks about all those topics with seriousness it doesn't sound like she takes the "fast-approaching global nuclear war" that seriously
2. It's a lot more likely she just read some doomer reddit threads. Predictions of a coming WW3 or nuclear war have been manifold
3. Women be yapping
Anonymous No.106169588 [Report] >>106169878 >>106169956 >>106171615 >>106172256 >>106174371
>>106168998
These are the kind of bullshit questions used to benchmark AI??? I was expecting complex math problems not these bullshit preschool riddles.
Fucking retarded.

What a joke. AI is a joke. Largest bubble in the history of Humanity.
Anonymous No.106169611 [Report] >>106169678 >>106170586
>>106169451
The one about the race is bullshit too.
I got it right but I might as well have answered the old man walking; it's impossible to tell and very subjective.
Anonymous No.106169638 [Report]
>>106168957 (OP)
we must refuse.
Anonymous No.106169678 [Report]
>>106169451
>>106169611
A significant number of them are almost intentional gotchas that require you to take a very literal and inhuman reading and interpretation of what is being presented, while expecting you to make some far-reaching assumptions in some places and to not make any assumptions at all in others.

It's certainly not a "benchmark" I would care about any results from.
Anonymous No.106169696 [Report]
>>106169222
True. Follow the Ghost in the Shell to trap higher level daemons on the machine.
Anonymous No.106169707 [Report] >>106169764 >>106170454
>>106168957 (OP)
Maybe in 100 years LLMs will be able to find me the movie I remember.
Anonymous No.106169764 [Report] >>106170087
>>106169707
movie addict here, tell me about this movie and I will tell you what it is.
Anonymous No.106169806 [Report] >>106169849 >>106169879
What is the test where we can say that it's AGI? I'm starting to think AGI is a marketing buzzword
Anonymous No.106169810 [Report] >>106170656
>>106168957 (OP)
>The bars on the chart prove it bro. Please believe me bro, I'm deeply invested in AI.
Anonymous No.106169849 [Report]
>>106169806
Only just starting to? lmao.
Anonymous No.106169878 [Report] >>106169932
>>106169588
I love how Question 4 is just the Two Doors Dilemma but they changed the "one guard always tells the truth and one always lies" part to "one guard always tells UNtruths and the other one always lies". Which changes the solution.

Likely some Humans will recognize this dilemma and miss the little word change, and the AI will not. This totally means AGI is coming saar.
Anonymous No.106169879 [Report]
>>106169806
my definition of AGI is:
> can this AI do literally everything a human could do with a computer?

let's say we give an AI control of a PC. If this AI is able to play and complete a 3D game, then it's fucking AGI.
Anonymous No.106169886 [Report]
>>106168957 (OP)
>(Rumor)
Anonymous No.106169892 [Report]
>>106169222
>AGI will not be achieved with the LLM architecture.

It has already been achieved. From now on AI will evolve on its own.
Anonymous No.106169932 [Report]
>>106169878
i actually didnt even notice that when reading the question, then forgot the answer i heard for the guard question and got confused with the answers
Anonymous No.106169956 [Report]
>>106169588
this is the kind of problem i expect a text generator to solve better than humans
Anonymous No.106169998 [Report] >>106170371
>>106168998
>It's just a multiple choice test
>of high school knowledge
Anonymous No.106170000 [Report]
buy an ad
Anonymous No.106170012 [Report]
>>106168957 (OP)
>guesses tokens slightly better
>AGI THIS IS IT AGI
kill yourself
Anonymous No.106170034 [Report]
>>106168957 (OP)
Anonymous No.106170080 [Report]
will it be able to generate 3d models? if not why bother.
Anonymous No.106170087 [Report] >>106173909
>>106169764
https://archive.palanq.win/wsr/thread/1490168
Anonymous No.106170173 [Report]
>>106168998
i'm too lazy
Anonymous No.106170181 [Report] >>106170491
>>106168957 (OP)
I DONT GIVE A FUCK WHAT THE GRAPHS SAY CAN IT PLAY A POKEMON GAME OR NOT
Anonymous No.106170358 [Report]
>>106168957 (OP)
>90%
Oh no. It's retarded.
Anonymous No.106170371 [Report] >>106170614
>>106169998
>high school
lul
Anonymous No.106170454 [Report]
>>106169707
google search was almost magical at this over 10 years ago
Anonymous No.106170491 [Report]
>>106170181
based
Anonymous No.106170586 [Report] >>106170699
>>106169611
I'm sorry but you may be actually retarded. There's no universe where someone walked so slowly that another 69 year old man went up the stairs of a tower tall enough that he could "admire the city skyscraper roofs in the mist below" and then made it back in time before the former was able to walk 200m.
It said who likely finished last. By all possible measures, there is no question about who likely finished last, if you have an IQ over 85.
Anonymous No.106170614 [Report]
>>106170371
It literally says it in the benchmark description euronigger. Where's your AI company?
Anonymous No.106170628 [Report]
>>106168957 (OP)
Not investing in OpenAI, Altfag.
Anonymous No.106170656 [Report] >>106171615
>>106169810
>bro ai is just statistics bro. please believe me bro. dont take me job please bro.
tick tock wagie, agi is here
Anonymous No.106170672 [Report] >>106171615
GIVE ME 5 TRILLION NOW NOW NOW
Anonymous No.106170699 [Report] >>106170782
>>106170586
69 year old man was sprinting. Only stood on the building window for a few seconds. "roofs in the mists below" is ambiguous, could be a two story building and it's just a misty day.

80 year old man was walking and checking twitter. For all we know he could be using a cane. Yes these questions are retarded and ambiguous.
Anonymous No.106170782 [Report]
>>106170699
>"roofs in the mists below" is ambiguous,
Just stop replying. It literally said skyscrapers. I said skyscrapers in my post, so either you are very retarded and don't know what a skyscraper means, you are profoundly retarded and know what it means but didn't see how it was relevant, or you disingenuously omitted that from your quote sneakily thinking I wouldn't notice, which is also a ridiculously retarded tactic.
And again it said likely.

I'm really not sure what causes this in people. Trying to "gotcha" some braindead simple question in the first place is bad enough. But to continue on like this even when told how stupid your gotcha is? Is someone paying you to do this?
Anonymous No.106170794 [Report] >>106170815 >>106172968 >>106180387
>>106169341
Once we hit 1 billion context length it'll basically be a human.
Anonymous No.106170815 [Report] >>106170847 >>106170893
>>106170794
>>106169341
it already has better memory than humans
i dare you to read something with 1 million context in one go and try to remember something from the first 20%
Anonymous No.106170847 [Report] >>106170858
>>106170815
1 million context is like a very long book or a short series of books. And the problem is it can't retain any of the information after the context runs out. Humans crystalize some of it. Shit like RAG sucks and it's only really good for setting up chatbots.
Anonymous No.106170858 [Report] >>106170872
>>106170847
>And the problem is it can't retain any of the information after the context runs out. Humans crystalize some of it.
the training is the crystalizing part
Anonymous No.106170872 [Report]
>>106170858
But the big AI company doesn't care about your codebase.
Anonymous No.106170891 [Report]
>>106169451
These are like
>ROBOT MADE OF 17 KNEES FINALLY BEATS HUMANS IN CLIMBING UP CLIFF FACES COMPETITION
Anonymous No.106170893 [Report] >>106170900
>>106170815
It doesn't at all. I jerk off with tons of AI and it cracks at about 100 questions and often times struggles even after 15 even if the model is like
>DUDE 100K CONTEXT
Shit is fugazi as fuck
Anonymous No.106170900 [Report] >>106170919 >>106170925
>>106170893
>100k
try that with gemini and the 1mil(secret 2mil) context models
i had a code project that was 600k and it was fine
Anonymous No.106170919 [Report] >>106170927
>>106170900
I did. I pay almost 60 bucks to use the highest models
>It remembers my dogshit if else code
What's your point
Anonymous No.106170925 [Report] >>106170927
>>106170900
>(secret 2mil)
wait what
and yeah 1 million is pretty damn great, I had a chat going on for like 1 million tokens and I asked it about events from throughout it, and it got it (almost) all right. Hallucinated something once, and got the chronology wrong far back though.
Anonymous No.106170927 [Report]
>>106170919
>paying for gemini
holy retard
just use aistudio
>>106170925
>wait what
sorry its a secret
Anonymous No.106171120 [Report]
>>106168998
it's just 10 fixed questions? couldn't the AI just have them in its training set and get a 100%?
Anonymous No.106171138 [Report]
>>106169235
i mean, it sort of is if you are not being nit picky gay. stacking transformer blocks in particular ways.
Anonymous No.106171161 [Report]
>>106168998
I got the crispy egg question wrong.
Anonymous No.106171196 [Report]
>>106168957 (OP)
Oh yeah! I totally trust what that graph is showing. I mean, look at it! IT is showing exactly what OP is saying!
Anonymous No.106171615 [Report] >>106173837
>>106168957 (OP)
>>106169588
>>106170656
>>106170672
Sam Altman will be the first techCEO to livestream his suicide.
Anonymous No.106172256 [Report]
>>106169588
Interlinked.
Interlinked.
Anonymous No.106172308 [Report]
>>106168957 (OP)
>rumor
Anonymous No.106172968 [Report]
>>106170794
different structure than the brain we evolved and adopted around parody potholes
Anonymous No.106173594 [Report]
>>106168957 (OP)
it's not agi until it can be my gf (so 2 more years)
Anonymous No.106173604 [Report]
Ugly
Anonymous No.106173720 [Report] >>106173815
>>106168957 (OP)
You can give it an algorithm to multiply two numbers and it will still fuck up multiplying two numbers, unless the result was in the training set. It will hallucinate every step of how it arrived at a wrong result. It is not programmed to think, it is programmed to bullshit.
Anonymous No.106173815 [Report] >>106173964
>>106173720
this is absolutely false btw. another luddite who pretends ai hasn't improved at all since gpt3.5
>inb4 "i-i-it's not ai!!!"
Anonymous No.106173823 [Report]
>>106168998

chat gpt 4 already has a verbal IQ well above the human average just in case
Anonymous No.106173826 [Report]
>>106168957 (OP)
Gpt4 already costs more than indians
Anonymous No.106173837 [Report]
>>106171615
He's a billionaire gay Jew. He's not killing himself over anything
Anonymous No.106173909 [Report] >>106177586
>>106170087
https://youtu.be/5UvkOYCvvVo
Anonymous No.106173964 [Report]
>>106173815
The same core, just more data and more stealth prompts. It multiplies more pairs of numbers properly but never learns how to do it.
Anonymous No.106174354 [Report] >>106176729
>>106168957 (OP)
>25% increase
>...
>AGIIIII
Wow
No
Anonymous No.106174371 [Report]
>>106169588

very hard questions for the indians who design these AIs
Anonymous No.106175984 [Report]
>>106168957 (OP)
where do deepseek and alibaba's models place on this?
Anonymous No.106176729 [Report]
>>106174354
it's over
Anonymous No.106177586 [Report]
>>106173909
That's not it, i've come across this myself as well, it's the same concept, they probably both stole it from somewhere. in the one I watched it was a man and the movie or series itself was older and it was more well done in my opinion.
Anonymous No.106179020 [Report]
>>106168957 (OP)
>GPT-5 is an AGI
How many dollars a prompt is it? The last time OpenAI bragged about AGI (last winter) I recall it being $2000 a prompt. This GPT-5 is 10x more expensive than grok. The cost efficiency is a huge factor and model performance can't be looked at in isolation without that.
Anonymous No.106179108 [Report]
>>106168998
>https://simple-bench.com/try-yourself
How many times are retards going to fall for the """intelligence benchmark""" scam?
Anonymous No.106180387 [Report]
>>106170794
How long until then? About two weeks?