Thread 106197903 - /g/ [Archived: 4 hours ago]

Anonymous
8/9/2025, 6:37:44 AM No.106197903
u8ya5rajovdf1
u8ya5rajovdf1
md5: 6318ff96f6dd4144e55f67bfa38e1f44๐Ÿ”
Next-world prediction machine is the moon-landing moment for AI.
Replies: >>106197975 >>106197998 >>106198008 >>106198009 >>106198068 >>106198116 >>106198689 >>106198789 >>106198833 >>106198883
Anonymous
8/9/2025, 6:40:14 AM No.106197922
SAAAAAAAAAAAAAAAAAR!!!!!!
Anonymous
8/9/2025, 6:47:30 AM No.106197975
>>106197903 (OP)
>moon landing moment
They going to fake the entire thing then spend the next 5 decades pretending they didn't film it in a studio?
Anonymous
8/9/2025, 6:51:35 AM No.106197998
>>106197903 (OP)
>pre-college prodigies
so itโ€™s about as good at writing proofs as a high schooler? Am I reading that right?
Replies: >>106198904
Anonymous
8/9/2025, 6:53:28 AM No.106198008
>>106197903 (OP)
>a token guesser when given the answers guessed the answers
wow
Anonymous
8/9/2025, 6:53:40 AM No.106198009
>>106197903 (OP)
You work this upon yourself by following these witless cheerleaders on twitter. You deserve all the grief you get.
Anonymous
8/9/2025, 7:05:43 AM No.106198068
>>106197903 (OP)
LLMs can't solve unsolved math problems they are dumb
Replies: >>106198890
Anonymous
8/9/2025, 7:10:23 AM No.106198095
AI is an industry of frauds hyping up calculators. it's so fucking dumb.
Replies: >>106198754
Anonymous
8/9/2025, 7:14:10 AM No.106198116
>>106197903 (OP)
careful working is proof of deliberate fuckery, what a faggot
Anonymous
8/9/2025, 8:46:36 AM No.106198689
>>106197903 (OP)
>It's hard to overstate the significance of this
But you make it look so easy lying fag
Anonymous
8/9/2025, 8:55:39 AM No.106198741
1754645733573797
1754645733573797
md5: baca5eb4a1230a8a01711a5880818ea9๐Ÿ”
Presented without further comment.
Replies: >>106198910 >>106198917
Anonymous
8/9/2025, 8:58:09 AM No.106198754
>>106198095
>calculators
They wish they could function as the world's least efficient calculator. Doesn't matter if it's 2+2 or quadratics, LLMs get the answer by a combination of luck and repetition. We trust calculators because they always get the same answer to the same input. LLMs do not and even when it's right 90% of the time, having to check it in case it's the 10% takes so fucking long that it defeats the point.
Kinda the same reason quantum computing has gone nowhere in years. It returns an answer, but there's no way to verify the answer without doing it on normal computers anyway. Why? Because even when you think you've just done a variant of something that's worked every other time, well sometimes it doesn't work and how are you going to know if this was that time or not? Makes the whole thing worthless.
Replies: >>106198858 >>106198900 >>106198907
Anonymous
8/9/2025, 9:03:20 AM No.106198789
>>106197903 (OP)
Let me guess, the proofs written by those pre-college (see: high school) "prodigies" have been scraped and fed into the LLMs prior to this challenge?
Anonymous
8/9/2025, 9:11:37 AM No.106198833
>>106197903 (OP)
>moon landing moment
>did it once, cant ever do it again
yep thats AI
Anonymous
8/9/2025, 9:16:51 AM No.106198858
>>106198754
What I don't understand is why they always talk about "agentic" AI but they can't somehow just make their model, which already uses python, to just run the 2+2 in the instance. Or maybe its doing that sometimes, but clearly not always. Or these letter counting tasks. This is all stuff you can easily calculate with a programming language, yet this "agentic" AI is literally just sifting through its training set to find the answer to how many b's are in blueberry and then somehow givng a probabiblistic answer on something that has a definitive answer
Anonymous
8/9/2025, 9:19:14 AM No.106198883
>>106197903 (OP)
A sophisticated enough next word predictor can do anything.
Anonymous
8/9/2025, 9:20:30 AM No.106198890
fffkelgiozqx
fffkelgiozqx
md5: 96a8e199ca950b28b56a7703e6f5f71b๐Ÿ”
>>106198068
>LLMs can't solve unsolved math problems they are dumb
Replies: >>106198979
Anonymous
8/9/2025, 9:20:40 AM No.106198892
The future looks bleak
Anonymous
8/9/2025, 9:22:27 AM No.106198900
>>106198754
Right. If you have to check why didn't you just compute the right answer in the first place. LLMs are worthless.
Anonymous
8/9/2025, 9:23:15 AM No.106198904
>>106197998
Most graduates in mathematics couldn't solve these problems. The fact that the competition is for pre-college kids doesn't mean the problems are easy. These kids basically spend their life on them.
Replies: >>106198992 >>106200415
Anonymous
8/9/2025, 9:23:18 AM No.106198907
>>106198754
You're being closeminded. Yes, we expected it and were sold on the idea of it making calculators redundant. Sure, that's so far not happened and for the sake of argument may never happen. However, calculators don't do proofs. If the AI can illuminate even parts of the journey toward novel discoveries, while human intellect is the one holding the flashlight as it were, then that would still be valuable. The problem is human intelligence has sunken so low that most people want these AI to do their thinking for them, and aren't looking for an assistant per se which requires skillful and active wielding.
Replies: >>106200455
Anonymous
8/9/2025, 9:23:45 AM No.106198910
>>106198741
Retards still don't understand what tokenisation is.
Replies: >>106198917 >>106200269
Anonymous
8/9/2025, 9:24:53 AM No.106198917
>>106198741
>>106198910
qwen 2.5 answers this correctly tho
Replies: >>106198938 >>106200105
Anonymous
8/9/2025, 9:27:13 AM No.106198930
>moon-landing moment
Doing something relatively uselessL
Anonymous
8/9/2025, 9:28:26 AM No.106198938
>>106198917
Claude will also tell you the date, because every day they update the prompt with the current date. But knowing the date and reading individual characters in a token isn't what LLMs are good at. Trying to judge an AI by asking dumb questions like how many b's in blueberry is retarded.
Replies: >>106200148
Anonymous
8/9/2025, 9:29:39 AM No.106198946
Shit that never happened and boomer kool aid
Anonymous
8/9/2025, 9:33:05 AM No.106198979
>>106198890
No one is investing trillions of dollars in me on the premise that surely in two more months I will be able to.
Replies: >>106198989
Anonymous
8/9/2025, 9:34:28 AM No.106198989
>>106198979
Just because you're poor, doesn't mean that everyone else has to be.
Anonymous
8/9/2025, 9:34:44 AM No.106198992
>>106198904
Unless youre retarded and just don't have the mental capacity to understand, anyone can "solve" these math problems GIVEN the answers.
The point is the kids that have to solve these problems don't get trained on the exact answers.
Thats like claiming I know 5000 words in japanese because I have a dictionary in front of me right now
Replies: >>106199031
Anonymous
8/9/2025, 9:35:16 AM No.106198995
retardation of AI
retardation of AI
md5: 9bc177fb7990f8014832a11963421e19๐Ÿ”
Anonymous
8/9/2025, 9:35:30 AM No.106198996
The only thing ai is good for is shitting out large quantities of text and code I can't be bothered to type myself
Anonymous
8/9/2025, 9:41:46 AM No.106199031
>>106198992
I was replying to the person who implied that winning the olympiads makes you as good as a high schooler.
The assumption is that the AI was not given the answers obviously.
Replies: >>106199209
Anonymous
8/9/2025, 10:17:20 AM No.106199209
>>106199031
Okay, point taken. But I don't think that assumption is valid. You can see the LLMs fail at basic math unless the exact answer is in their training set. Kind of obvous because LLMs aren't really made for math, which is why I think its wierd that they are always pushing this "math olympiad" shit so hard when its obviously untrue
Replies: >>106199407
Anonymous
8/9/2025, 10:51:09 AM No.106199407
>>106199209
Maybe they are super good at abstract math rather then something that you can do with a calculator
Anonymous
8/9/2025, 1:06:40 PM No.106200105
>>106198917
Reasoning models can do it, gpt 5 can do it too but openAI is cheapskating and wants to give the worst model for free now despite rivals providing a much better model in free tier
Anonymous
8/9/2025, 1:13:25 PM No.106200148
>>106198938
>don't judge the artificial INTELLIGENCE for not knowing how to count!
do retards really?
Replies: >>106200175
Anonymous
8/9/2025, 1:17:49 PM No.106200175
>>106200148
Skill issue. If you had written b-l-u-e-b-e-r-r-y it would have counted correctly.
Anonymous
8/9/2025, 1:32:15 PM No.106200269
>>106198910
tokenization is their problem to solve, its not an excuse
Replies: >>106200298
Anonymous
8/9/2025, 1:37:17 PM No.106200298
>>106200269
It's not a problem because only a retard would use LLM to ask to count letters. It's like saying calculators are bad at arithmetic because they can't write poetry.
Replies: >>106200330
Anonymous
8/9/2025, 1:44:00 PM No.106200330
>>106200298
their average costumer could not give a rats ass whether its an llm or not, for them its "AI that gives magical answers", and if it fails at that then thats openais problem
if "AGI", aka their explicit goal cant solve a simple problem like this, then thats not much of an intelligence and will fail in other non obvious cases as well because of it
Replies: >>106200360
Anonymous
8/9/2025, 1:49:41 PM No.106200360
>>106200330
Again, the AI could easily answer the question if the developers changed the definition of a from a whole word to a single letter, but it would be a ridiculous decision to feed letters instead of words just to appease some autists who want to count letters.
Replies: >>106200398
Anonymous
8/9/2025, 1:56:29 PM No.106200398
>>106200360
while they may be retarded to care about it that much, it is a limitation of the architecture and therefore a valid criticism that will have to be solved eventually
Replies: >>106200410
Anonymous
8/9/2025, 1:58:56 PM No.106200410
>>106200398
It's not the architecture, it's a compromise between performance and cost.
Replies: >>106200434
Anonymous
8/9/2025, 1:59:48 PM No.106200415
>>106198904
has been trained on right answers, bound to give right answers
Anonymous
8/9/2025, 2:03:58 PM No.106200434
>>106200410
every ai architecture is filled with compromises, and when they find a way for it to not take too much performance, im sure they will implement it and then it will no longer be a limitation
Anonymous
8/9/2025, 2:09:37 PM No.106200455
>>106198907
what they can do is mix and match already existing info, they're good at that. what they cannot do is come up with novel ideas for problems. they may be able to luck out by hallucinating something which makes sense for them based on their training data, and no human figured to do so far, and it would be luck even so. but coming up with some e=mc^2 out of the blue is not going to happen, because the whole thing is not working like a human brain does. it's unable to do basic shit that human brains can, but it can do some shit that humans can, fast, way faster, that's true. but llms are not the whole package