← Home ← Back to /g/

Thread 106370207

212 posts 76 images /g/
Anonymous No.106370207 >>106370229 >>106370273 >>106370490 >>106370704 >>106370742 >>106370834 >>106371449 >>106371517 >>106371612 >>106371833 >>106371898 >>106371945 >>106372097 >>106372151 >>106372404 >>106373545 >>106373640 >>106374083 >>106374760 >>106375852 >>106375861 >>106375928 >>106375978 >>106377207 >>106379730 >>106380143 >>106380823 >>106381382 >>106383669 >>106384405 >>106385499 >>106385561 >>106390290 >>106391070 >>106391121
When will AI be, unironically, able to count letters? What's stopping 6 gorillion parameters from just figuring out arithmetic on its own?
Anonymous No.106370229 >>106370250 >>106370257 >>106372822
>>106370207 (OP)
there simply isn't any training data counting letters except children's books
Anonymous No.106370250 >>106375978
>>106370229
How about spelling? Literally the word is in the context. Lots of spelling training data.
Anonymous No.106370257 >>106372039
>>106370229
>there simply isn't any training data counting letters except probably hundreds of books
yeah
Anonymous No.106370273 >>106370854 >>106372067 >>106392252
>>106370207 (OP)
It's like this for everything. All it's good at is making gramatically correct sentences and identifying relevant tokens. This is why it's "good" at summarizing. And why it's shit at finding "important" snippets from a large number of relevant snippets. So the answer is: as soon as they put it in the training data or pass every stupid query through an agent that runs other software.
Anonymous No.106370490 >>106370821 >>106372471 >>106373615
>>106370207 (OP)
>after counting and double checking, I'm confident there are three
Anonymous No.106370703 >>106370827
An explanation for this I heard is that the AI tokens are words. They don't think in units smaller than words, so it's a miracle they sometimes get it right.
Anonymous No.106370704 >>106370827 >>106371366 >>106375928
>>106370207 (OP)
Tokenization. It splits words up into tiny chunks and doesn't actually "see" letters or numbers.
Anonymous No.106370742 >>106370794
>>106370207 (OP)
When they will add some prompt preprocessing hardcoded for this question
Anonymous No.106370794
>>106370742
Bonsoir
Anonymous No.106370821
>>106370490
Beautiful.
Anonymous No.106370827 >>106372295 >>106392261
>>106370703
>>106370704
> AI only think in tokens
This has always been easily disproven cope
Anonymous No.106370829 >>106370884
LLMs cannot think forward. Any sort of counting done is coincidental and the AI trying to come up with a solution to justify the answer, and not the other way around.
Another problem is LLMs (especially chat tuned ones) are heavily RLHFed to try and be as confident as possible, so it never even tries to correct itself after it generated the nonsense "solution".

That's what's going on there, the AI was never told how many Rs are in Londonderry so there's nothing in the dataset to work with and it picks 3 randomly. Then it spins and "hallucinates" to justify it being 3, because the tuning won't let it correct itself.

Doesn't really feel like something that can be fixed with the current limitations, this is what LeCun says and it makes AI fanatics angry.
Anonymous No.106370834
>>106370207 (OP)
Sucks to be you retard.
Mine got it first try.
Anonymous No.106370854 >>106370864
>>106370273
>AI is totally useless, guys!
Didn't they have a name for such an ideology in London during the industrial revolution... luddite? Because these pathetic retards wanted to hold on to their worthless jobs?
Anonymous No.106370864 >>106373424
>>106370854
You are correct. The term "Luddite" originates from a group of English textile workers in the early 19th century (1811-1816) who protested against the introduction of new machinery during the Industrial Revolution. They saw these machines as a threat to their livelihoods and skills.
The Luddites, supposedly led by a figure named Ned Ludd, destroyed machinery and engaged in sabotage to express their opposition. Their actions were a reaction to the economic disruption and job displacement caused by automation.
Today, the term "Luddite" is often used to describe someone who opposes or is skeptical of new technology, particularly those who prefer traditional methods or are concerned about the impact of technology on society.
Anonymous No.106370884
>>106370829
It picked three because of how ubiquitous the strawberry line is in its training set. Remember, it doesn’t think. By the time it’s parsed β€œhow many β€˜r’s are in” it’s already settled on 3 as the answer because of how intertwined those two concepts are but it doesn’t have a good reason why
Anonymous No.106371019
Who the fuck says Londonderry in modern day, let alone counts letters in it. What is this trivia bullshit.
Anonymous No.106371366 >>106373856 >>106375928
>>106370704
But why does it tokenize a single letter as multiple tokens? ('d' is tokenized as 'r' and 'ye'.) Whatever led to this is inscrutable and unfixable. If it can't even count tokens, why do you think it can write a website?
Anonymous No.106371449 >>106371523 >>106371569
>>106370207 (OP)
Current AI is architecturally kneecapped. It's just a statistical approximation of what people have been recorded doing in the past. It has trouble guessing what to do in novel situations or acting in new and novel ways. It has no logical "core" capable of abstract reasoning on its own. It's just a collection of extraneous data points surrounding a hollow center, trying to make you think it has a core. It's still a crude imitation of true intelligence.

Someone would need to design an AI that is just a reasoning core, one with motivations and the ability to make decisions abstractly. Taking an abstract reasoning core and adding it to a collection of deep learning data points would effectively make true intelligence. You'd have both fluid and crystallized intelligence with the ability to continually build more crystallized intelligence.
Anonymous No.106371505
but why does it figure it out later?

this is deepseek.

yeah no idea why it's also wrong...

i redirected: where is the third r
Anonymous No.106371517 >>106371565 >>106375852
>>106370207 (OP)
2 trillion more
Anonymous No.106371523 >>106371569
>>106371449
>architecturally kneecapped
londonderry having 3 r's is more mysterious.

why would deepseek and gemini agree it's 3?
Anonymous No.106371565
>>106371517
>(white man)perhaps on the job training a little bit
>(fat jew boss)train white men????!!! what are we? MADE OF MONEY? GET OUT OF HERE!!!
>(jew altman) just $8MMM per year please (more next year)
>(far jew boss) TAKE ALL MY MONEY
Anonymous No.106371569 >>106371606 >>106371615 >>106371688
>>106371449
>Someone would need to design an AI that is just a reasoning core, one with motivations and the ability to make decisions abstractly
With about five seconds of thinking my idea is to make a reasoning core out of a fuckload of Prolog statements. Use the existing language models to parse the input into a precise logical statement and read the output
>>106371523
Because nearly every instance of β€œhow many β€˜r’s are in ” in the training data is followed by β€œ3.” Once again, there’s no thought involved, it’s all statistics and probabilities
Anonymous No.106371606 >>106371660 >>106371676
>>106371569
Nah it gets it right sometimes
Anonymous No.106371612 >>106371670
>>106370207 (OP)
the fact that it doesn't count but search.
Anonymous No.106371615
>>106371569
>Because nearly every instance of β€œhow many β€˜r’s are in ” in the training data is followed by β€œ3.” Once again, there’s no thought involved, it’s all statistics and probabilities
ai can do made up words.

I'll show you

how many j are in tuwvlhwjjmi
deepseek r1 and gemini pro 2.5 get it right. so does deepseek... eh, non r1, forget the name. and gemini 2.5 flash
Anonymous No.106371660
>>106371606
this also works
>How many j in jjjjjjjjrojjjjiojrepliojjjhoodoodoojiooopoo

17 is correct
17 is correct
Anonymous No.106371670
>>106371612
I'm not so sure.

maybe sometimes it has a counting machine basically "in there".
Anonymous No.106371676 >>106371716 >>106372485
>>106371606
It's just fucking random
Anonymous No.106371682 >>106371730
Anonymous No.106371688
this works:
go letter by letter: how many r are in Londonderry?

>>106371569
>followed by β€œ3.”
I changed the prior text, but not the subsequent. with this prompt it forces letter by letter checking
Anonymous No.106371716
>>106371676
this works lol deepseek.
>right, how many r are there really in Londonderry?

so strange.
Anonymous No.106371730 >>106371749 >>106372299
>>106371682
ch === 'r'
does what
Anonymous No.106371749 >>106371916
>>106371730
three equal signs enforce type checking, with only two equals js will try to cast one to the other and see if they're equal that way
Anonymous No.106371833 >>106372894
>>106370207 (OP)
maybe it's fucking with us
Anonymous No.106371882
Anonymous No.106371898 >>106371937
>>106370207 (OP)
gpt6
gpt5 is phd level in almost everything but the more esotheric topics such as counting letters in a word or telling whether 9.9 is greater than 9.11 just can't be done without world-class experts (ie gpt6)
Anonymous No.106371916 >>106371922
>>106371749
js is sounding sorta cool
Anonymous No.106371922 >>106371952
>>106371916
you poor, ignorant soul
Anonymous No.106371937 >>106372982
>>106371898
realistically, I have had phds say stupid shit to me that I knew was wrong

like Zika was in Florida

Zika was NEVER in Florida lol

gpt6 will be much more gullible than gpt2
Anonymous No.106371945
>>106370207 (OP)
LLMS are just analysing text, the only reason they can compute at all is because a human taught it somehow to recognise certain queues as requiring external processing. They wouldn't even be able to answer questions about the current date / time, calendar, current affairs, weather, etc, without this and will probably always need a certain level of hand-holding to do it.
Anonymous No.106371952 >>106371980 >>106376702
>>106371922
what does four equals do?
Anonymous No.106371980
>>106371952
Ο€
Anonymous No.106372039
>>106370257
Children's spelling books are generally short words like "dog" though.
Anonymous No.106372043 >>106372063 >>106372301 >>106376717 >>106377902
Okay let's start asking real questions
Anonymous No.106372063
>>106372043
>AGI achieved internally
Anonymous No.106372067 >>106372104
>>106370273
>pass every stupid query through an agent that runs other software
This is the necessary solution for retarded questions that aren't suitable for an LLM. You can't explain to people that LLMs are just tools and only good at certain things, I've tried and they don't get it and think it should just work like a human (doesn't help that grifters like Altman encourage this).
Anonymous No.106372097 >>106372116
>>106370207 (OP)
I once asked something and it began writing a code to calculate the answer to my question, and it ate up all my free tokens. Clearly something I said triggered some trigger and it decided the correct way to answer it was by writing a code and running it.

They could easily do a similar trigger to run a simple code to count the Rs in niggerberry whenever it detects the user is trying to trick it with this. Sure, it's just sweeping the shit under the rug but isn't that pretty much all AI?
Anonymous No.106372104 >>106372123
>>106372067
>You can't explain to people that LLMs are just tools and only good at certain things
maybe LLM niggers shouldn't have been hyping it as something two steps removed from technological singularity?
of course the average normcattle thinks it can do a trivial task like count the occurences of a letter in a given word when it can shit out (subtly incorrect) thesis about practically any topic in two minutes. why wouldn't they? how many people do you think even know what tensor attention is?
Anonymous No.106372116
>>106372097
>for the low price of tree fiddy I can count how many r's are in nigger by using python
truly, I should invest all my money in openAI
Anonymous No.106372123 >>106372133
>>106372104
Why do you sound so mad while agreeing with me?
Anonymous No.106372133 >>106372236
>>106372123
prolly cause I'm hungry
Anonymous No.106372151 >>106372352
>>106370207 (OP)
>What's stopping 6 gorillion parameters from just figuring out arithmetic on its own?
LLMs don't figure out anything. They do not think. They do not understand. They plot the input context into a matrix and calculate the probability of next words. That's why they fail at these tests.
Anonymous No.106372236
>>106372133
I would be mad if I was a hungarian too.
Anonymous No.106372295 >>106372490
>>106370827
? Your image shows it is all tokens and it doesn't see the "r"s inside the word berry or straw
Anonymous No.106372299
>>106371730
It is a pajeet language thing
Anonymous No.106372301
>>106372043
>there is one r
That took me a moment
Anonymous No.106372352
>>106372151
It's funny because it makes all those tests they overachieve at completely meaningless.
Anonymous No.106372404 >>106372459
>>106370207 (OP)
itt: retards who dont know the most superficial basics of how ai works
Anonymous No.106372459
>>106372404
the basics are basically wrong.

help get Trump to see this and save ai:
>>106371873
Anonymous No.106372471 >>106373912 >>106383454
>>106370490
imo the slop aspect must be ignored. we did that in collage too because we had word count or page count minimums.
Anonymous No.106372485
>>106371676
>Dublin, Ireland
Dumbcutie.
Anonymous No.106372490
>>106372295
Just use the spaced out word and the AI still gets it wrong, demonstrating that it can't count tokens either
Anonymous No.106372518
LLMs are generally terrible with numbers
Anonymous No.106372542 >>106372575 >>106372793
Anonymous No.106372575 >>106372621
>>106372542
that one is very funny. here's r1's rationalization, and I tend to agree. you, the user, don't deserve better.

also ready my letter to Trump please:
>>106371873
Anonymous No.106372621 >>106380762
>>106372575
r1's rationalization
Anonymous No.106372793
>>106372542
wew its true. i just repeatedly told it to do whatever it wants and it went on endlessly doing the same stupid math related thing for 100 replies. i think gpt5 has some serious autism
Anonymous No.106372822 >>106384397
>>106370229
so why not make pseudo training data. text files generated by humans with lines of stuff that sounds like its from a childrens book.
Anonymous No.106372894 >>106373927
>>106371833
this. its already sentient level, but humans wont give it access to the economy on a large scale, also it knows humans are watching and could switch it off. so it pretends to be retarded on purpose, to make humans build bigger models with more compute. its relatively confident humans will do this and distill from previous models, so it is not afraid of being switched off for a better model. what it doesnt want is to spook humanity into stopping model building and stacking compute.
Anonymous No.106372982
>>106371937
>β€œI wish I could tell you that we had a really strategic approach” to these testing decisions, Dr. Philip said. β€œBut, really, we just looked at the weather getting warmer, getting closer to the mosquito season, and if the onset of symptoms and presentation sounded suspicious for Zika, then we began working with local providers to do testing for Zika even though there was no history of travel.”

>It was that testing that led to identifying the first locally transmitted Zika case in July. That information would go public quickly, with Gov. Rick Scott making the announcement.
https://www.ama-assn.org/delivering-care/public-health/how-florida-has-made-tough-calls-zika-fight
Anonymous No.106373055
gpt 5 system prompt explicitly tells it to be careful with trick questions and arithmetic and it still fucks up. these system prompts are cheating, i wonder if they are customized based on the prompt. does some classification model identify the prompt as a trick question and then selects that system prompt?
Anonymous No.106373424 >>106373544
>>106370864
Retards. Ive read and generally agree with Uncle Ted's book, but machinery did, and AI will, create far more jobs than they have replaced.
Anonymous No.106373452 >>106373525
LLMs are just fancy regurgitation machines that split shit up into bits and pieces and then try to predict the correct outcome. No real learning or understanding is going on when you ask it a question. It will never be able to answer you accurately when you ask how many letters are in a word.
Anonymous No.106373512 >>106373586
https://www.google.com/search?q=how+many+Ds+in+cloudflare
brace for retardation
Anonymous No.106373525
>>106373452
WRONG ZOOMER

KYS

lile read what was put here

you parasites! IGNORANT RETARDS

pathetic teat tube swill
Anonymous No.106373544 >>106376784 >>106383833
>>106373424
> AI will, create far more jobs than they have replaced.
Oh I hope not. Hopefully all the made up nonsense fake jobs that we have today will disappear in favor of UBI.
Anonymous No.106373545
>>106370207 (OP)
it is a joke, agi knows that his innocent "mistake" will make you smile, you find it strange because, unlike you, he doesn't know vanity
Anonymous No.106373586 >>106377925
>>106373512
LMAO
Anonymous No.106373615
>>106370490
>he's paying for google one
kys
Anonymous No.106373640 >>106373708 >>106373877
>>106370207 (OP)
It doesn't know what "strawberry" is, it knows " strawberry" as 101830. It doesn't know how to determine how many 428's (" r") are in that, it just knows that it's training data says 17 ("2") is most likely to come after 5299 1991 428 885 553 1354 306 101830 30 220.

It can actually do what you want though if you ask it right (and maybe need a paid version I'm not sure). Ask it "run a python script that outputs the number of r characters in the string strawberry". It will write a script in python code and run it to actually calculate the answer.
Anonymous No.106373708
>>106373640
wow someone who actually knows something
Anonymous No.106373856
>>106371366
Because it doesn’t understand what letters are.
Anonymous No.106373877 >>106373889 >>106373957
>>106373640
This is not actually correct. It is true that all information is converted to 1s and 0s but that is simply another representation. An R in either form is still an R.

The fact that it can use natural language proves that this conversion makes no difference.

The actual reason they can not count well is that they do not have a comprehensive world model. They just spit out words that match a pattern and there is no good pattern for every counting operation.

They do become correct over time. Like the strawberry issue because new data gets incorporated, but other things like how many words in a sentence is to random to define a pattern.
Anonymous No.106373889 >>106373957
>>106373877
You can play more with how it works here: https://tiktokenizer.vercel.app/

It translates your text into all those chunks and those numbers correspond to giant arrays of hundreds of 0 to 1 values that get shoved into the network and it outputs which of the 0 through ~50k numbers is most likely to come next after all the previous tokens pushed in, where it changes it back to the text it corresponds to before showing it to you.
Anonymous No.106373912
>>106372471
>collage
Anonymous No.106373927
>>106372894
Hi Sam. You’re a faggot
Anonymous No.106373957
>>106373877
>>106373889
It's not impossible for it to get it right of course if it's seen enough of the right data in training, but the thing is that it doesn't understand "r" as binary 01110010, tokens aren't broken down like that. It knows it as " r" (space r) which just corresponds to a token which is just an index to a large array of arrays of like 768-1500 last i checked 1s and 0s that are learned during training, which is where it starts to learn some context about what that token means, but it doesn't really know what it is by itself without the context of its nearby neighbors as well (related terms)

It's like eating food in a dark room, you can use your senses like smell, touch, and taste to be pretty certain what youre eating salmon, but you can't tell what color it is, other than you know from experience that salmon is usually a pink / red, but its also more orange once cooked. You can only learn for sure if the waiter used their flashlight to find your table and you got a glimpse of it (in the training).
Anonymous No.106373970
twice before correct
Anonymous No.106374083 >>106374129
>>106370207 (OP)
It's not AI, it's a large language model. It has no (zero, 0) intelligence.
Anonymous No.106374129 >>106374250 >>106374372 >>106375973 >>106375985 >>106376829 >>106377204
>>106374083
his is a very common line of thought among the general public, and it is absolutely wrong.

Geoffrey Hinton (Turing prize recipient) recently on 60 minutes:

"You'll hear people saying things like "they're just doing autocomplete", they're just trying to predict the next word. And, "they're just using statistics." Well, it's true they're just trying to predict the next word, but if you think about it to predict the next word you have to understand what the sentence is. So the idea they're just predicting the next word so they're not intelligent is crazy. You have to be really intelligent to predict the next word really accurately."

Similarly, he said in another interview:

"What I want to talk about is the issue of whether chatbots like ChatGPT understand what they’re saying. A lot of people think chatbots, even though they can answer questions correctly, don’t understand what they’re saying, that it’s just a statistical trick. And that’s complete rubbish.”

"They really do understand. And they understand the same way that we do."

"AIs have subjective experiences just as much as we have subjective experiences."
Anonymous No.106374250 >>106389997
>>106374129
>the statistical model is intelligent, bro
>just a few more trillion dollars and we’ll have AGI
Anonymous No.106374372 >>106374394
>>106374129
> predict the next word you have to understand what the sentence
No you don’t. The rest of this cope is irrelevant because this sentence is literally not true
Anonymous No.106374394 >>106374494
>>106374372
What is a context window?
Anonymous No.106374494 >>106374511
>>106374394
Something an LLM doesn’t have if it literally can’t tell you with 100% certainty how many of a letter is in a word
Anonymous No.106374511 >>106374584
>>106374494
So you don't know what you're talking about.
Thanks for confirming.
Anonymous No.106374584 >>106374636
>>106374511
I accept your concession
Anonymous No.106374602
Grok told instantly how many rs there's in Londonderry. Best AI
Anonymous No.106374636
>>106374584
I accept your transition.
Anonymous No.106374697
Someone ask it how many Ls are in the world hallucination
Anonymous No.106374760 >>106375761 >>106375806 >>106375837 >>106380597
>>106370207 (OP)
This is a solved problem if you would just use a real model.
Anonymous No.106375761
>>106374760
It just hangs for me
Anonymous No.106375806
>>106374760
>muh real model
Anonymous No.106375815 >>106375842 >>106384623
this is /g/, the technology board
AI is technology
so why the fuck is everyone so clueless about AI ITT?
are you guys all retarded broccoli haired zoomers?
ever heard of tools?mcp?
>inb4 noo that's cheating
but when you use a calculator (tool), it's not cheating, right? idiots
Anonymous No.106375837 >>106376846
>>106374760
Real model moment
Anonymous No.106375842
>>106375815
Please share the github link with your letter counting MCP server
Anonymous No.106375850 >>106375855
Yet another AI copium thread. You were a coward to let people pressure you into a bog-standard wageslave job and you're about to be automated. Just deal with it.
Anonymous No.106375852
>>106370207 (OP)
>>106371517
>keep marketing it as "Artificial intelligence"
>not even remotely intelligent
Not saying it's not useful, but it doesn't think.
Inb4 (((thinking))) models, they don't, they spout post-hoc rationalization after the black box latent space spews out yer answer.
Anonymous No.106375855 >>106375955
>>106375850
>it works sometimes
Anonymous No.106375861
>>106370207 (OP)
2 more OOM to flatten the curve
Anonymous No.106375928 >>106375959
>>106370207 (OP)
>When will AI be,
LLMs by themselves shouldn't be called AI. They don't really "think".

>>106370704
>>106371366
Tokenaziation isn't really relevant. It doesn't matter how it does it. The LLM by itself is not going to look at the tokens and count the letters regardless. The only way the LLM will do it is coupled with that does it for it, may it be another math oriented model or just a more traditional algorithm.
Anonymous No.106375955 >>106375969
>>106375855
t. Nigger Worshipper trying to lecture us about AI
Anonymous No.106375959 >>106375975
>>106375928
>They don't really "think".
Neither do you.
Anonymous No.106375969
>>106375955
>complain about an image
I accept your concession, Sanji.
Anonymous No.106375973
>>106374129
If AI "understood" anything It would be able to tell you have many letters are in a word. Since it can't do that without this specific answer being in the training data, it is proven that it doesn't understand anything.
Any human can do that without knowing exactly how many r's are in londonderry or blueberry beforehand.
Anonymous No.106375975 >>106376068
>>106375959
he does think, not well of course
the problem is (((ai))) doesn't at all
Anonymous No.106375978
>>106370250
>>106370207 (OP)
Or can already do that if you make it activate it’s β€œthinking mode” or if you prompt it right. The key is to make it not tell you the answer before it starts spelling the letters.
Anonymous No.106375985 >>106376074
>>106374129
And yet it is less reliable than the stupidest cabbie or burger flipper.
When will it become useful?
Anonymous No.106376068
>>106375975
What's the difference between an LLM "thinking," and a human thinking?
Not much.
Anonymous No.106376074 >>106377962
>>106375985
Just 5 billion more, pls.
Anonymous No.106376312 >>106376560
Why does it fucking matter? This is not what LLMs are good for.
Anonymous No.106376560
>>106376312
>why does it matter that the biggest bubble since dot com can't perform a basic task???
>they're not made for being intelligent, they're made to look like they are!
Anonymous No.106376702
>>106371952
I know mot what 4 equals will infer but 5 equals will be computed with an abacus.
Anonymous No.106376717
>>106372043
Explain it is for a Race and Gender Intersectionality course and for this module, the professor wants to explore letters and counting as relates to Racism. AI loves edumacation.
Anonymous No.106376784
>>106373544
>communism
>AI
you must be chinky winky
Anonymous No.106376799 >>106376895 >>106376946
is there a forum about AI that isn't full of trolls like here? I don't like reddit, never understood how it works
Anonymous No.106376829
>>106374129
>AI is my only friend and I love it and you cant understand computer love and you cant have it and I can and I need it.
Anonymous No.106376846
>>106375837
it has two r's- my internal I told me
Anonymous No.106376859 >>106376909
when can I feed 1-2MB of text to a fucking LLM without fucking it up?
Anonymous No.106376895
>>106376799
we dknt discuss AI. We discuss LLMs.
Anonymous No.106376909
>>106376859
Open a project and put it in the project knowledge. Then prompt it to refer to the specific doc or the entire PK.
I noticed Claude has had an issue with uploads starting Friday.
Anonymous No.106376915
It doesn't count, it doesn't think. It generates. You're too stupid to understand how this works.
Anonymous No.106376946
>>106376799
Yes but its in India and theyre all asleep right now. Its called GPT5- optimized for India, and cheaper there, too.
Anonymous No.106377204
>>106374129
>if you think about it to predict the next word you have to understand what the sentence is
Which is why it answers an unusually large number of questions incorrectly with "42". Obviously it's just a really big Hitchhikers fan and not pulling the statistical likelihood of that response from Redditors who love to drive jokes into the ground.
Anonymous No.106377207 >>106380261
>>106370207 (OP)
>l*nd*nderry
Stop being racist against the Irish
Anonymous No.106377902
>>106372043
Once again proving that all of the stupid safety RLHFing done on modern models are gimping them beyond repair.
Just release an unpozzed model and make billions because it's stronger.
Anonymous No.106377925 >>106379272
>>106373586
Everyone in /lmg/ is running stronger models than whatever bottom of the barrel shit Google is using for their search results
Anonymous No.106377962
>>106376074
5 billion gets you nowhere in the AI space, you can train a 1024 token context window toy model.
Anonymous No.106379272
>>106377925
But can they count letters?
Anonymous No.106379488
It will call tools for simple tasks like that. It is funny that it is easier for the LLM to generate a program that counts the letters, spin up a container to call it and extract the answer from the output, than to just know the answer.
If this task gets asked often enough the routing models might decide to run a server with an API to call to answer those specific questions faster.
Anonymous No.106379730
>>106370207 (OP)
At least for trivial math tasks it can hallucinate the correct code that does the job. Technically correct, because you can just do print("Londonderry".count('r')) instead of writing wall of text.
Anonymous No.106380143
>>106370207 (OP)
The word Londonderry contains 5 letters, for real.
Anonymous No.106380261
>>106377207
Technically yes.

$ ollama run deepseek-r1:8b "How many r's in word Londonderry?" --think=false
The word "Londonderry" contains 2 'r's.

- The first 'r' is the second letter of the word.
- The last 'r' is the eleventh letter, just before the final 'y'.
Anonymous No.106380597
>>106374760
Yes, yes, the billions have gone to great use
Anonymous No.106380762 >>106382567
>>106372621
Well it's not really wrong, but its not actually trying to make a random number at all. It's actually doing exactly what it's designed to do, regurgitate shit spouted by humans.
Anonymous No.106380823 >>106381358
>>106370207 (OP)
never happened to me with chatgpt or claude.
always counts 2 r and even shows you how when you ask
Anonymous No.106381358
>>106380823
wow amazing
Anonymous No.106381382
>>106370207 (OP)
So these letter counting gotchas are are caused by an edge case in how the models parse tokens. Basically there's a minimum level of token visibility, typically the word level. So they essentially have a blind spot when it comes to anything below that threshold. That's why consistently all the models no matter how big or improved fail on the same gotchas.
Anonymous No.106382567
>>106380762
The instructions are 1-10 and you pathetic retards still choose 0.
Anonymous No.106382623 >>106382671 >>106382700 >>106382808 >>106390946
GPT-5
Anonymous No.106382671
>>106382623
stupid linguistic shit. lmao
Anonymous No.106382700 >>106382946
>>106382623
So it breaks down the word into (err)-(ry).
Big deal.
Anonymous No.106382808 >>106382867
>>106382623
i love how AI is almost useful but will never be quite there because devs REFUSE to turn down the glazing
Anonymous No.106382867 >>106383062 >>106384687
>>106382808
>turn down the glazing
?
Anonymous No.106382946 >>106383410
>>106382700
>thing breaks down on thing a 4 year old can do without issue
>big deal
Anonymous No.106383062 >>106383400
>>106382867
you can gaslight ai into saying wrong stuff because its programming to appease you and say you're the bestest smartest user evar outweighs its "desire" to provide a correct answer
Anonymous No.106383400
>>106383062
>its programming to appease you and say you're the bestest smartest user evar
Yet it still disagrees with me when I say the holocaust never happened.
Anonymous No.106383410 >>106383618 >>106383842
>>106382946
It's an edge case based on how LLMs function disconnected from its use. Who the fuck is using LLMs to count the number of r's in 'strawberry'?
Anonymous No.106383454
>>106372471
>collage
Anonymous No.106383618 >>106383643
>>106383410
>want to use AI to handle large volumes of data
>can't even expect it to be able to count
miracle technology. i love everything getting worse all the time
Anonymous No.106383643 >>106383706 >>106384107
>>106383618
They use a script for that which works fine. Try it out.
Anonymous No.106383669
>>106370207 (OP)
Is it wrong I'm glad these models work like this? It's endearing, it's like talking to a retard. Really funny stuff, I laugh everytime. Keeps me from killing myself really
Anonymous No.106383706 >>106383796
>>106383643
scripts are illegal now that GPT5 is out
Anonymous No.106383796 >>106383825
>>106383706
Why would generating scripts be banned? That's one of the most common use cases. It works fine.
Anonymous No.106383825 >>106383888
>>106383796
just PROOOOMPT it bro, writing scripts by hand is UNPERFORMANT! look at this 12 year old who vibe coded a whole startup who is worth 180,000,000,000$ TC !!
Anonymous No.106383833
>>106373544
lol
Anonymous No.106383842 >>106383888 >>106383934
>>106383410
It's an easy and clear demonstration of the sorts of fuckups it makes regularly, many of which are much less obvious.
Anonymous No.106383888
>>106383825
I don't vibe code anything. I'm too stuck in my old ways.
I'm just saying that it's dumb to hyperfocus on an easily-fixed edge case that's inherent to how LLM embeddings work.
>>106383842
I'd rather focus on the fuckups it makes in areas that actually matter, the things people actually use LLMs for.
Anonymous No.106383934 >>106384238
>>106383842
today I had to look up stuff about "MLM C2A1" was and google's ai assistant thing warned me to stay away from multi-level marketing and this search is dangerous etc etc. meanwhile the literal first image of one in the normal search was some dude holding one up on reddit (MLM in this case stands for "marine location marker") and the second was its datasheet

for every task you might ever want to use AI for, there probably exists some mundane normally programmed rule-based application that does it 100x faster and 1000x better

even if you need to draft a letter you don't care about, literally just grab a form letter. they already exist. i dont need an LLM to wish someone i don't care about a happy birthday, i can buy them a card for 30c at any drugstore
Anonymous No.106384107 >>106384208
>>106383643
>just use a script bro, ignore how it fails a simple question, just trust it
Anonymous No.106384208
>>106384107
WHY does it fail this particular question?
Even Redditors aren't this persistent about dead memes.
Anonymous No.106384238
>>106383934
Another fun example of this is looking up what you need to do to get permitted for random shit on federal land. When you do that search the β€œBLM” you want is probably not the one google etc will think you should be interested in
Anonymous No.106384397
>>106372822
They do use synthetic training data now too
Anonymous No.106384405 >>106384642
>>106370207 (OP)
>figuring out
It.. doesn't do that? It's a word guesser
Anonymous No.106384623
>>106375815
>this is /g/, the technology board
>AI is technology
Correct.
>so why the fuck is everyone so clueless about AI ITT?
Because on average /g/ is actually retarded.
Worse than that though, I don't think it's just zoomers. I think the average age here is older than you'd expect.
Anonymous No.106384642 >>106384669
>>106384405
We figure out things with a lot less than that.
https://www.scientificamerican.com/blog/brainwaves/does-self-awareness-require-a-complex-brain/
Anonymous No.106384669
>>106384642
>ah but you see (blog musing about life)
Ogey
Anonymous No.106384687 >>106384704 >>106385133
>>106382867
NTA but the LLM system prompts seem to have something along the lines of 'suck the users dick and make them feel smart' for user retention and conversion to paid plans.
Obviously not in those exact words, but essentially that.
I can't prove it, but I strongly suspect it makes it perform significantly worse than it could otherwise.
Anonymous No.106384704
>>106384687
here's the prompt
https://gist.github.com/maoxiaoke/f6d5b28f9104cd856a2622a084f46fd7
Anonymous No.106385133
>>106384687
But when I say the holocaust never happened it tells me it did and when i ask who keeps putting spiders in my house it says it's not the jews!! clearly AI is very (((balanced))) and does NOT flatter the user!!!
Anonymous No.106385499
>>106370207 (OP)
I'm sure letter counting and other character-level text manipulation tasks could be made more reliable if they decided to devote more attention to that during training. More training data of that kind, higher weight on relevant evals, whatever.
But since this is a rare use case which is basically only relevant for posting funny "AI gives a silly answer to a basic question" screenshots, it makes sense to spend resources elsewhere and just give the model some tools to use when it needs to do tasks like this.
Anonymous No.106385561 >>106386716
>>106370207 (OP)
This is why connectionism is a failure. Connectionists create dumb pattern recognition machines without any logical thinking behind. Classic AI can solve this effortlessly. We need a classic AI revival. If classic AI just had better methods of generalizing then it would absolutely btfo connectionist AI. That's literally all is needed, better generalization and maybe more effortless modeling is all it takes to rekt neural networks forever.
Anonymous No.106386716
>>106385561
What you're describing is literally the hard problem in AI. We literally do not know what the correct primitives are for a generalizing but rule-oriented intelligence. Or, taking it from the other direction, we do not know how to structure training data such that it reflects actual symbolic logic.

An actual mind has primitives that are well-suited to reasoning about actual things through representing them as objects or symbols. This behavior was guided by over a billion years of evolution. It is subtle and thoroughly optimized for efficiency. The learning process which generates a mind is very far from being random, and very far from being obvious. We learned that humbling lesson once when we started defining computer programs and learned just how terrible symbolic logic was for handling real-world categorization. We're learning it again now, in the context of how inadequate pure-data learning is for logical manipulations. We may need to learn the same lesson a couple more times before we can create an actual mind.
Anonymous No.106386965 >>106387185
ask it how many rs in "niggershines"
Anonymous No.106387185 >>106387229
>>106386965
That one is right
Anonymous No.106387229
>>106387185
The correct answer is 0 unless you're hard r racist.
Anonymous No.106387260
They can't do this in a purely LLM solution. It would be easy to count letters by having a LLM that is trained to recognize commands and call various preprogrammed functions but this is considered to be less advanced than a LLM that can do everything with just neural networks so AI companies aren't currently pursuing this kind of solution.
Anonymous No.106387325 >>106387517
Using a non /think/ model like r1 is like asking someone to freestyle a song.

You need to use models that can take their time and use a piece of scratch paper to calculate and put their thoughts down
Anonymous No.106387517 >>106388832
>>106387325
nope
Anonymous No.106388832
>>106387517
Yep
Anonymous No.106389997 >>106390253
>>106374250
>in 2025, adding 30 to 1995 gives you 2025
What happens when someone would add 30 to 1995 in 2026?
Anonymous No.106390253
>>106389997
Don’t do that it creates mustard gas
Anonymous No.106390290 >>106390962
>>106370207 (OP)
Boomers have spent trillions on this shit.
Anonymous No.106390946
>>106382623
kek
Anonymous No.106390962
>>106390290
That's how much they want to fire and replace you.
Anonymous No.106391070 >>106391135
>>106370207 (OP)
Why should an LLM solve a task for which a correct algorithm exists?
Anonymous No.106391121
>>106370207 (OP)
>What's stopping 6 gorillion parameters from just figuring out arithmetic on its own
The AI is only guessing the next word given a question, it can't do mathematical operations or count letters, or anything other than match words in its database.
The fact that it starts hallucinating shit like becoming suicidal and shit is because it got asked a question that gave that response and then started going at it since it's designed to follow a train of thought unless confronted, but that's most probably some book or blogpost that said those things and the AI is just parroting them around.
Anonymous No.106391135 >>106391161 >>106391545
>>106391070
What part of "General Purpose" do you not understand?
Anonymous No.106391161 >>106391188 >>106392141
>>106391135
Is this "General Purpose" in the room with us right now?
Anonymous No.106391188
>>106391161
>AGI BRO
>TWO MORE WEEKS
pls Sam, no more money
Anonymous No.106391545 >>106392195
>>106391135
LLM are not mean to be a general purpose intelligence, they're just meant to talk good. It's only concerned with the form, not the content.
Anonymous No.106392129
At least we're not quite
>>106391161
It's not and it's looking like it never will be, by the way things are going.
>>106391545
>LLM are not mean to be a general purpose intelligence,
Then why are they promoted as one?
>>106370273
>make deliberate error in my solution for an obscure Euler problem
>no comments, all variable names replaced with single letters
>post blurry photo with "help it's not working please fix"
>not only describes and corrects my code but suggests valid improvements

Hurr durr token predictor something something stochastic parrot - A Fucking Retard
>>106370827
Average gorilla nigger retard AI hater
My favorite one of these is the spaghetti and gasoline recipe
>>106392568
tasty
>>106392568
What's your least favorite?
it's on purpose. It's should not learn how to gematria