/lmg/ - Local Models General - /g/ (#106108045) [Archived: 72 hours ago]

Anonymous
8/1/2025, 9:51:22 PM No.106108045
wanvideo2_2_i2v_00050_thumb.jpg
wanvideo2_2_i2v_00050_thumb.jpg
md5: 851c6ab543c1580b59b5fcd1581fbf31🔍
/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>106104055 & >>106097464

►News
>(07/31) Qwen3-Coder released: https://qwenlm.github.io/blog/qwen3-coder
>(07/31) Command A Vision: Built for Business: https://cohere.com/blog/command-a-vision
>(07/31) Step3 multimodal reasoning 321B-A38B released: https://stepfun.ai/research/en/step3
>(07/31) Committed: llama-server : implement universal assisted decoding: https://github.com/ggml-org/llama.cpp/pull/12635
>(07/31) Cogito v2 Preview released: https://deepcogito.com/research/cogito-v2-preview

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/tldrhowtoquant
https://rentry.org/samplers

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/leaderboard.html
Code Editing: https://aider.chat/docs/leaderboards
Context Length: https://github.com/adobe-research/NoLiMa
Censorbench: https://codeberg.org/jts2323/censorbench
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
Replies: >>106108078 >>106108331 >>106109547 >>106109711 >>106113331 >>106114449
Anonymous
8/1/2025, 9:51:39 PM No.106108052
gw3wrr2bcae_hyd
gw3wrr2bcae_hyd
md5: 2e99599b7a42b0bbad6e183bf696bd83🔍
►Recent Highlights from the Previous Thread: >>106104055

--MoE vs dense model scaling debate using Qwen3 as a case study:
>106104551 >106104653 >106104595 >106104691 >106104704 >106104782 >106104859 >106104871 >106104885 >106104983 >106105027 >106105133 >106105180 >106105191 >106105123 >106105199 >106105209 >106105239 >106105262 >106105282 >106105302 >106105329 >106105391 >106105406 >106105424 >106105434 >106105508 >106105539 >106105558 >106105641 >106105435 >106105844 >106105881 >106105635 >106105681 >106105686 >106105768 >106105794 >106105799 >106105800 >106105967 >106106045 >106106060 >106106025 >106106036 >106106107 >106104723 >106104770 >106104904 >106104955 >106105134 >106105348 >106105032 >106105293 >106104710
--AI overuse of "smell of ozone" as a sensory cliché from contaminated training data:
>106105452 >106105492 >106105493 >106105524 >106105905
--MoE models challenge dense superiority myth with competitive benchmark performance:
>106105182 >106105195 >106105227 >106105243 >106105244 >106105237 >106105247 >106105304
--LMArena leaderboard controversy over benchmaxxing and model anonymity:
>106106249 >106106355 >106106386 >106106412 >106106430 >106106405
--Horizon Alpha shows strong general knowledge but inconsistent reasoning, suggesting a stealth or mini model:
>106104320 >106104475 >106104509
--Skepticism over Drag-and-Drop LLMs due to non-functional demo and gated training data:
>106105574 >106105671
--Poor dark scene generation highlights model quality in prompt interpretation:
>106106138 >106106172 >106106232 >106106265 >106106177 >106106226 >106106242 >106106274 >106106323 >106106327 >106106291 >106106370
--Seeking open, local LLM frontend alternatives to Ooba and Kobold with better UX:
>106105614 >106105634 >106105747 >106106106 >106106350 >106106389
--Miku (free space):
>106104200 >106105614 >106105653 >106107027

►Recent Highlight Posts from the Previous Thread: >>106104059

Why?: 9 reply limit >>102478518
Fix: https://rentry.org/lmg-recap-script
Replies: >>106109245
Anonymous
8/1/2025, 9:53:51 PM No.106108077
dense lost
MoE won
Replies: >>106108119
Anonymous
8/1/2025, 9:53:51 PM No.106108078
>>106108045 (OP)
20 years older than she should be
Anonymous
8/1/2025, 9:54:16 PM No.106108081
sam
sam
md5: 6fcfb804566e0ce51ffac1cd49680344🔍
>Here's your savior, bro
Replies: >>106108213
Anonymous
8/1/2025, 9:56:57 PM No.106108119
Looking forward to a comfy thread where we can
>>106108077
Nobody cares about your moe fetish
Replies: >>106108139
Anonymous
8/1/2025, 9:57:58 PM No.106108131
file
file
md5: aed4e5f90d72951fa81e14d0ee3b95bc🔍
You do not need more
Replies: >>106108164
Anonymous
8/1/2025, 9:58:28 PM No.106108139
>>106108119
>Nobody cares about your moe fetish
It is at least thread related unlike the /lmg/ fetish of choice: AGP.
Anonymous
8/1/2025, 10:00:17 PM No.106108164
>>106108131
It was a bad model that got carried by transsexual astroturfing. Just like like donquixote.
Replies: >>106108247
Anonymous
8/1/2025, 10:03:46 PM No.106108213
Screenshot 2025-08-01 140325
Screenshot 2025-08-01 140325
md5: 07b27d53346693d2f74a1742b4284dee🔍
>>106108081
I hate Altman with every fiber of my being and will take any opportunity to shit on him but it's clear his sister is fucking schizo
Replies: >>106108233 >>106108252
Anonymous
8/1/2025, 10:05:22 PM No.106108233
>>106108213
Technological abuse???
Anonymous
8/1/2025, 10:06:27 PM No.106108247
>>106108164
it fits and runs perfectly in a gaming rig and hasnt been topped
you are just a retarded faggot
Replies: >>106108323 >>106108359
Anonymous
8/1/2025, 10:06:48 PM No.106108252
>>106108213
Most likely she abused him and that's why he turned gay
Replies: >>106108289 >>106109127
Anonymous
8/1/2025, 10:09:09 PM No.106108289
>>106108252
Maybe she's a fujo and that was her plan all along.
Anonymous
8/1/2025, 10:10:58 PM No.106108313
Now that internet censorship seems to be the bee's knees and is widely being implemented do you think newer models will have even stricter safety guardrails?
Replies: >>106108389
Anonymous
8/1/2025, 10:11:48 PM No.106108323
>>106108247
>gaming rig
What kind of retard has 2 gpu's in 2025 for gaming?
Anonymous
8/1/2025, 10:12:10 PM No.106108331
>>106108045 (OP)
Why does she look chinese...eeugch
Replies: >>106108755
Anonymous
8/1/2025, 10:12:48 PM No.106108341
IKCHADS WON
https://huggingface.co/ubergarm/GLM-4.5-GGUF
Replies: >>106108357 >>106108382
Anonymous
8/1/2025, 10:14:07 PM No.106108357
>>106108341
wait nevermind
Anonymous
8/1/2025, 10:14:20 PM No.106108359
>>106108247
>it fits and runs perfectly in a gaming rig
BZZZZT
>and hasnt been topped
BZZZZZZZZT
Anonymous
8/1/2025, 10:16:59 PM No.106108381
https://x.com/jiqizhixin/status/1951195402096746798
https://xcancel.com/jiqizhixin/status/1951195402096746798
indextts2 dropping soon it seems
https://arxiv.org/abs/2506.21619
https://huggingface.co/IndexTeam
Replies: >>106108506
Anonymous
8/1/2025, 10:17:05 PM No.106108382
>>106108341
I noticed that his imatrix has an awful lot of.... coding. Let's hope that calibration dataset really doesn't matter that much.
Anonymous
8/1/2025, 10:17:29 PM No.106108389
>>106108313
you will just have to verify your digital id to download your ggufs from hugging face.
Replies: >>106108404 >>106108413
Anonymous
8/1/2025, 10:19:03 PM No.106108404
>>106108389
Yes but after that I will have the pony piss on me and no one will know about it.
Replies: >>106108512
Anonymous
8/1/2025, 10:19:57 PM No.106108413
>>106108389
>download your ggufs
ngmi
Anonymous
8/1/2025, 10:21:37 PM No.106108433
jak'
jak'
md5: 305b3a56dc259ca9518086b974b4bb7e🔍
jak potential?
Replies: >>106109159
Anonymous
8/1/2025, 10:23:48 PM No.106108459
Now that we know the size of those packages my bet is that 20B is dumber than gemma and a bit more safe. And 120B is a safer scout.
Replies: >>106108480 >>106109090
Anonymous
8/1/2025, 10:24:53 PM No.106108472
You are now manually breathing and you are aware that llama-4 scout exists.
Replies: >>106108552
Anonymous
8/1/2025, 10:25:47 PM No.106108480
>>106108459
That would be hilarious.
Replies: >>106108496
Anonymous
8/1/2025, 10:25:59 PM No.106108483
yea I think meta is done for especially after the lawsuit
Replies: >>106109762
Anonymous
8/1/2025, 10:26:38 PM No.106108493
Dense models are for dense people, while Mixture of Experts models are for experts. Prove me wrong.
Replies: >>106108518 >>106108526
Anonymous
8/1/2025, 10:26:51 PM No.106108496
>>106108480
I think it sounds kinda likely, what do you suppose will happen?
Replies: >>106108534
Anonymous
8/1/2025, 10:27:22 PM No.106108506
>>106108381
nice
I remember some people were doubting it would be open when the examples leaked, but from the paper:
>To promote further research and facilitate practical adoption, we will release both the model weights and inference code, enabling the community to reproduce and build upon our work.
Anonymous
8/1/2025, 10:27:51 PM No.106108512
>>106108404
based pissmaster
Anonymous
8/1/2025, 10:28:32 PM No.106108518
>>106108493
True and real.
I'm an expert at gooning.
Anonymous
8/1/2025, 10:28:51 PM No.106108526
>>106108493
girls go to jupiter to get more stupider
boys go to college to get more knowledge
Anonymous
8/1/2025, 10:29:39 PM No.106108534
>>106108496
I have zero idea. Everything is possible at this point.
Including nothing coming out of it, really.
Anonymous
8/1/2025, 10:30:34 PM No.106108552
>>106108472
why do I feel like I just lost the game?
Replies: >>106108581
Anonymous
8/1/2025, 10:32:45 PM No.106108581
>>106108552
Because 2 girls used 1 cup.
Anonymous
8/1/2025, 10:47:54 PM No.106108755
>>106108331
It's AI generated
Anonymous
8/1/2025, 10:49:16 PM No.106108770
I work for one of the labs and I am horrified by people here. Let me ask you. If the price of you being able to have your degenerate fantasy fulfilled by one of our models, is a 13 year old child being exposed to sex through the same model, would you be willing to pay it?
Replies: >>106108801 >>106108806 >>106108810 >>106108821 >>106108857 >>106108870 >>106108923 >>106108931 >>106108955 >>106109796 >>106110019 >>106110044 >>106110497 >>106111218
Anonymous
8/1/2025, 10:50:46 PM No.106108793
>a 13 year old seeing naughty text instead of regular porn
oh my, say it aint so
Anonymous
8/1/2025, 10:51:15 PM No.106108801
>>106108770
>sex
If your model can actually come out of the screen and physically sexually assault a 13 year old child you have much bigger problems than anons on an imageboard
Replies: >>106108894
Anonymous
8/1/2025, 10:51:38 PM No.106108806
>>106108770
What are you talking about? Thanks to the glorious efforts of the UK there are no more children on the internet.
Replies: >>106108825
Anonymous
8/1/2025, 10:51:55 PM No.106108810
>>106108770
that 13 year old is developing valuable skills that could one day get them a high paying job as a prompt engineer
Anonymous
8/1/2025, 10:52:41 PM No.106108821
>>106108770
great bait might work
Replies: >>106114383
Anonymous
8/1/2025, 10:53:13 PM No.106108825
>>106108806
true, just lock the weights behind real ID, problem solved
Anonymous
8/1/2025, 10:55:59 PM No.106108857
>>106108770
>exposed to infinite degeneracy from the moment they can hold a mouse or ipad
so....exactly the same as every single person born in the post-internet era?
Replies: >>106108875 >>106109152
Anonymous
8/1/2025, 10:57:28 PM No.106108870
>>106108770
>13 year old
>child
Not in all major Romance languages.
Anonymous
8/1/2025, 10:57:38 PM No.106108875
>>106108857
Less bad since they wouldn't know to generate the real bad shit. Whereas you can stumble on anything while navigating the web.
Anonymous
8/1/2025, 10:59:01 PM No.106108894
>>106108801
This isn't a matter of sexual assault. This is a matter of consent and how a 13 year old child cannot consent.
Replies: >>106108991
Anonymous
8/1/2025, 11:01:10 PM No.106108923
q
q
md5: 1b71da945041516f77bb0d1f3d991671🔍
>>106108770
I'm sure you are a troll or maybe people don't read literature any more but by the time I was 13 I had read quite many books. Maybe your biggest dream is to censor literature too?
Replies: >>106108952 >>106109200
Anonymous
8/1/2025, 11:01:54 PM No.106108931
>>106108770
The fuck you even mean? I'm just looking a fucking text, what do teenagers have to do with any of that?
You're mentally ill.
Anonymous
8/1/2025, 11:03:45 PM No.106108952
>>106108923
>i'm sure you are a troll
>proceeds to engage anyway
why do people do this?
Replies: >>106109116
Anonymous
8/1/2025, 11:04:10 PM No.106108955
>>106108770
You shitpost but there are people out there that I'm sure actually think this way
The thing is, kids are a lot smarter, curious and more resourceful than most give them credit for, and left to their own devices, they will find lots and lots of degenerate shit no matter how much people lock it down, and the more creative ones will even come up with their own degenerate shit (whether via writing or drawing) if need be
So is the solution that we cut off their hands, or should we actually take the time to fucking educate people about their bodies so they know what they're getting into and can be more responsible for themselves? I'm not saying have every response talk about a dick, but attempting to censor these things is going to be futile and will just make things worse with the whole forbidden fruit thing
Replies: >>106109009 >>106109044
Anonymous
8/1/2025, 11:06:08 PM No.106108991
>>106108894
If you can't distinguish between reality and fiction you should seek mental help.
Anonymous
8/1/2025, 11:07:13 PM No.106109009
>>106108955
The solution is ID check when connecting to wifi, and for every single site at first visit
Replies: >>106109030 >>106109048 >>106109807
Anonymous
8/1/2025, 11:08:03 PM No.106109028
f5753870a40ccef114a6cb88e7f48531
f5753870a40ccef114a6cb88e7f48531
md5: fc42aa3c784b7bad54cac773c058b8b6🔍
Anonymous
8/1/2025, 11:08:07 PM No.106109030
>>106109009
ID can be faked. It should ask for a picture of your dick. That can't be faked.
Anonymous
8/1/2025, 11:09:09 PM No.106109044
>>106108955
The solution is putting a chip in your brain that zaps you when you wrongthink.
Replies: >>106109064
Anonymous
8/1/2025, 11:09:26 PM No.106109048
>>106109009
NFC chip implanted at birth you can't use any electronics without your chip in range to verify your digital id.
Replies: >>106109807
Anonymous
8/1/2025, 11:10:22 PM No.106109064
>>106109044
You know someone is gonna use it to get off to it. Lucky fuck...
Anonymous
8/1/2025, 11:12:18 PM No.106109090
>>106108459
That's perfect because the whole time I was using Scout and Gemma I was saying to myself
>I wish it knew less and refused more
Sam really is going to save local
Anonymous
8/1/2025, 11:14:16 PM No.106109116
>>106108952
You wouldn't understand, autist.
Anonymous
8/1/2025, 11:15:27 PM No.106109124
>>106109104
you would need to rewire your house and spend about a hundred grand+ if you wanted to run anything worthwhile.
Anonymous
8/1/2025, 11:15:27 PM No.106109125
>>106109104
depends on how many times you want to split your pcie slots. just go with one or two if you are sane.
Anonymous
8/1/2025, 11:15:32 PM No.106109127
>>106108252
She is much younger than him. He was likely abused by someone else in the family and abused her later, that's how it works. It's telling that the other family members rushed to his defense.
Replies: >>106109228
Anonymous
8/1/2025, 11:15:43 PM No.106109131
5090
5090
md5: 70dd007de60f73dff3658bb6ea2f1106🔍
How many RTX 5090s could I theoretically wire together to run a local ai model? Would I need a server rack if I wanted to wire more than 3? Anyone here running multiple rtx graphics cards at the same time?
Replies: >>106109149
Anonymous
8/1/2025, 11:15:59 PM No.106109135
>>106109104
>How many RTX 5090
Depends on your mobo
>Would I need a server rack?
No
Anonymous
8/1/2025, 11:16:36 PM No.106109142
>>106109104
5090 is a bad choice because the VRAM can't be expanded
There are 4090D 48GB blowers
Replies: >>106109162
Anonymous
8/1/2025, 11:17:00 PM No.106109149
>>106109131
Retard
Anonymous
8/1/2025, 11:17:20 PM No.106109152
>>106108857
This is what people like me intend to fix. Releasing internet, as it was turned out to be a huge mistake and we have learned from this mistake. Especially porn. It was absolutely destructive to people's ability to find relationships.
Replies: >>106109169
Anonymous
8/1/2025, 11:17:55 PM No.106109159
>>106108433
Looks like jace from lemon party
Anonymous
8/1/2025, 11:18:18 PM No.106109162
>>106109142
Anything below RTX 6000000 PRO is just a toys
Anonymous
8/1/2025, 11:19:01 PM No.106109169
>>106109152
get rid of feminism, the internet is actually useful.
Anonymous
8/1/2025, 11:21:28 PM No.106109200
>>106108923
Which model are you using?
Anonymous
8/1/2025, 11:23:31 PM No.106109228
>>106109127
My money is on the entire family being fucked up. Would also explain why Altman is as megalomaniacal as he is.
Replies: >>106109251
Anonymous
8/1/2025, 11:24:35 PM No.106109245
>>106108052
>MoE vs dense
You should add this to the ignore list.
Anonymous
8/1/2025, 11:24:58 PM No.106109251
>>106109228
Elon's family is fucked up too
His dad fucked his step-daughter (Elon's step-sister) and they had children
Anonymous
8/1/2025, 11:31:48 PM No.106109327
I have a very broad question to anons with real life experience and therefore a more ample perspective.
Are things really getting worse or it's all just background noise?
Replies: >>106109333 >>106109368 >>106109374 >>106109470 >>106109505 >>106109545
Anonymous
8/1/2025, 11:32:19 PM No.106109333
>>106109327
Wrong thread?
Replies: >>106109361
Anonymous
8/1/2025, 11:34:56 PM No.106109361
>>106109333
No, the thread doesn't really matter.
Anonymous
8/1/2025, 11:35:41 PM No.106109368
>>106109327
>anons with real life experience
Yeah, that ain't me.
But I'm monitoring the replies.
Anonymous
8/1/2025, 11:36:22 PM No.106109374
>>106109327
How so?
If you mean the state of the world, oh yeah - I'm almost positive we're all gonna die in a war to save the pride of some oligarch somewhere and that our existence is basically fucked at this point
If you mean the state of AI, then eh... feels like things may be stagnating again a bit
Replies: >>106109419
Anonymous
8/1/2025, 11:39:25 PM No.106109406
they dont make 70B dense like they used to...
Anonymous
8/1/2025, 11:40:16 PM No.106109419
>>106109374
Yes, the state of the world. I mean, ever since cold war began you had demoralizing discussion going on, at least it's the impression that I've got, and yet one could live a good life from 1940s' to today. Is it getting worse or I just need to isolate myself from mainstream internet even more?
Replies: >>106109433 >>106109545
Anonymous
8/1/2025, 11:41:29 PM No.106109433
>>106109419
yes
Anonymous
8/1/2025, 11:44:41 PM No.106109470
>>106109327
Absolutely getting worse. But it's not nearly as bad as it's going to get yet.
Replies: >>106109514
Anonymous
8/1/2025, 11:46:53 PM No.106109505
>>106109327
things have gotten worse socially, economically and politically and it does actually effect the moral of the citizens. its not just noise.
Anonymous
8/1/2025, 11:47:54 PM No.106109514
>>106109470
how long have we got before she bottoms out?
Anonymous
8/1/2025, 11:51:13 PM No.106109545
>>106109327
>>106109419
I'm in my 40s. I had nuclear drills in elementary school. Literally hiding under the desk training for nuclear attacks.

The world in 2025 is so much better than zoomers realize. Just because it's 20% worse than the peak in the late 90s doesn't mean today isn't one of the best times in human history for the average person.

Call me when conscription is back in the cards, everyone has mandatory 3 years of training and kids train for 1 hour every week how to survive nuclear attacks, that's how it was just 35 years ago.
Replies: >>106109607 >>106109676 >>106111440 >>106111472
Anonymous
8/1/2025, 11:51:46 PM No.106109547
>>106108045 (OP)
>Qwen3-Coder released
is this it? is friendship over with deepseek coder v2 instruct senpai?
Anonymous
8/1/2025, 11:59:36 PM No.106109607
>>106109545
This is sort of what I was hoping to see. Though everything you described had a simple explanation, tensions between regimes. Now it seems like the west is at war with itself, for which I have no explanation.
Replies: >>106109648 >>106109929
Anonymous
8/2/2025, 12:03:46 AM No.106109648
>>106109607
its bate. even if its not, the government is just changed the propaganda they are still hard at work stoking tensions and making people scared. but now your money is worthless and you have to live with foreigners who don't respect your culture. thngs have gotten worse.
Replies: >>106109777
Anonymous
8/2/2025, 12:05:37 AM No.106109667
huge_thumb.jpg
huge_thumb.jpg
md5: 97da678fcec5d708528085438ac769cf🔍
>why yes, I do Mikupost and use Rocinante 1.1 via Koboldcpp, how could you tell?
Replies: >>106109677
Anonymous
8/2/2025, 12:06:13 AM No.106109676
>>106109545
The nuke drills don't mean there was real danger, that's just what neurotic women and bureaucrats do. The modern equivalent would be school shooter drills. Or for an older example, how every shitty town in the middle of nowhere had an anti-terrorism plan after 9/11.
I agree that people overstate how bad modernity is, most of the damage has occurred on the internet. If you go outside things are not that bad.
Anonymous
8/2/2025, 12:06:17 AM No.106109677
>>106109667
by your tiny dick and brain, and your brown skin and smell
Anonymous
8/2/2025, 12:11:07 AM No.106109711
Screenshot_20250801_190719
Screenshot_20250801_190719
md5: d427878a498cbd1eec85b6ebd868f621🔍
>>106108045 (OP)
Why does this happens? Each response is using a different context.
Replies: >>106109746
Anonymous
8/2/2025, 12:14:46 AM No.106109746
>>106109711
Given the lack of information, i'd say it's your fault.
Replies: >>106109903
Anonymous
8/2/2025, 12:16:28 AM No.106109762
>>106108483
They won the lawsuit
Or at the very least it set the precedent that training open models off of copyrighted text is fair use.
Anonymous
8/2/2025, 12:18:07 AM No.106109775
https://huggingface.co/deepcogito/cogito-v2-preview-deepseek-671B-MoE
This model is safe, I am disappointed.
Anonymous
8/2/2025, 12:18:15 AM No.106109777
>>106109648
Yes, radicalize different groups of people, make every trait that divides people relevant, etc. I've been thinking about all this. And indeed real people seem to not give a shit and just do human stuff, at least in my country. How big is the chance of all the culture war being just misguided idiots trying to promote what they think is social justice (which too seems aimed at dividing people by making their differences relevant)? Or hoards of refugees are only there because of cheap labor and compassion, and not to distill the conscious people? Has all this started after occupy wall street? I'm literally sitting in my room yet hear about every bad in the entire world.
Replies: >>106109860
Anonymous
8/2/2025, 12:18:33 AM No.106109783
qwencoder is so much better than the new thinking model its not even close
Anonymous
8/2/2025, 12:19:21 AM No.106109791
>no chink model today
It's over...
Anonymous
8/2/2025, 12:19:32 AM No.106109796
>>106108770
Why would that happen? The 13yo isn't going to be running it on their own system, they'll use your website and your nanny model will take care of it. It's a false premise to begin with.
Anonymous
8/2/2025, 12:20:46 AM No.106109807
>>106109009
>>106109048
go kill yourself faggot
Replies: >>106109910
Anonymous
8/2/2025, 12:26:13 AM No.106109860
>>106109777
I'm of the opinion its actually a legitimate conspiracy, corporations political leaders academia the works. all corrupted. they shape the opinions of the masses with the media, they are in control. its been happening for decades things have just gotten more obvious recently. maybe the programing is failing or we are at some sort of an end game.
Anonymous
8/2/2025, 12:29:53 AM No.106109899
what the frick is moe
cant sleep and argue with a model without some new term coming up
Replies: >>106109908 >>106109909 >>106110023 >>106110064 >>106110076 >>106110151
Anonymous
8/2/2025, 12:30:17 AM No.106109903
>>106109746
Here's the relevant part of the code. https://pastebin.com/qZfPbVmE
Replies: >>106109963
Anonymous
8/2/2025, 12:30:56 AM No.106109908
>>106109899
the architecture everyone is using for their models, lets you have a much better model for less compute
Replies: >>106109967
Anonymous
8/2/2025, 12:31:13 AM No.106109909
Moe_Szyslak
Moe_Szyslak
md5: b57b811c5a5aabb3a2693605fa50db0c🔍
>>106109899
Anonymous
8/2/2025, 12:31:20 AM No.106109910
>>106109807
drummer
miku love
id checks
> absolute certainty that AI will become smarter than humans and that we will either have to integrate with them via mind interface devices or be hopelessly left behind like monkeys are today
dont cry faggot
Replies: >>106111024
Anonymous
8/2/2025, 12:31:59 AM No.106109919
can we ban drummer pls
Anonymous
8/2/2025, 12:32:45 AM No.106109929
>>106109607
>it seems like the west is at war with itself
Yeah, this is the main difference. For most of history (at least, since the Civil War era) the west has been pretty united. Yes, there were political disagreements, but it was generally fairly civil and people got along at the end of the day
Now things are getting different, and both sides are doing crazy fucking things. As someone who is neither a fan of communism or fascism, I'm not optimistic about the logical conclusion of all of this, particularly given that the shit happening elsewhere and the other countries who'd like nothing more than to mount all of our heads on their wall doesn't exactly freeze while we're screaming over literal meaningless bullshit
Despite that, I'd like to think that we'll all grow half a brain cell again and start thinking about things that actually matter
Replies: >>106109939 >>106109976
Anonymous
8/2/2025, 12:33:40 AM No.106109939
>>106109929
All the world is divided. The west just has more free press that actually get to report on it.
Anonymous
8/2/2025, 12:37:24 AM No.106109963
>>106109903
Insufficient. Could still be a bunch of things. Broken backend, broken model, bad launch settings, you're not loading the right mmproj, the model is just shit...
For all i know all those images actually are apples.
Replies: >>106109989
Anonymous
8/2/2025, 12:37:34 AM No.106109967
>>106109908
whoa how long have i been under my rock im still on mistral 13b
Anonymous
8/2/2025, 12:38:59 AM No.106109976
>>106109929
>For most of history (at least, since the Civil War era) the west has been pretty united.
Besides that this is a far cry from most of history, have you heard about these things called world wars?
Anonymous
8/2/2025, 12:39:49 AM No.106109987
What's better for RP qwen instruct or thinker?
Replies: >>106110063 >>106110097
Anonymous
8/2/2025, 12:40:06 AM No.106109989
>>106109963
I'm using Gemma-3-4b and it was working fine. Can it be lack of entropy?
Replies: >>106110027 >>106110185
Anonymous
8/2/2025, 12:43:50 AM No.106110019
>>106108770
>sex
i remember fondly my 11 year old self jacking it to futa ntr and guro (especially neckfucking) good times im all for it as long as its actually hardcore porn (not the tranny pretend shit like feet loli bdsm etc) niggas need their edge back liveleak prevented more work places accidents then all safety training videos combined same with the other things a demented simulacra trains the mind against evil
Anonymous
8/2/2025, 12:44:26 AM No.106110023
>>106109899
Moe means cute.
Anonymous
8/2/2025, 12:44:41 AM No.106110027
>>106109989
>it was working fine
Then it should continue to work fine. If you changed anything, that thing you changed broke it.
>Can it be lack of entropy?
If by lack of entropy you mean "all the pictures are actual apples", then yes. That too.
Replies: >>106110185
Anonymous
8/2/2025, 12:46:23 AM No.106110044
>>106108770
Sex isn't inherently taboo and people who think it is are infantile retards.
13 year olds are very curious about sex for obvious reasons. And a chatbot is a good, safe, low consequence environment in which to explore those curiosities.
You dumb kike.
Replies: >>106110513
Anonymous
8/2/2025, 12:48:52 AM No.106110063
>>106109987
If you have the speed to justify waiting for the reasoning before a reply you may as well try the thinker.
The reason most people aren't using it for RP is because they're getting <10 ts/ and don't want to wait for the response.
Replies: >>106110103 >>106110110
Anonymous
8/2/2025, 12:48:57 AM No.106110064
>>106109899
https://www.youtube.com/watch?v=qByKEu0zdco
Anonymous
8/2/2025, 12:50:11 AM No.106110076
moe
moe
md5: b4795a3869a655e57db05bd5b0a26ca2🔍
>>106109899
Replies: >>106110117 >>106110151 >>106110163 >>106113554
Anonymous
8/2/2025, 12:52:44 AM No.106110097
>>106109987
the thinkers are pretty good for RP, if you're using the 30b and can run it fast I would recommend it because it seemed clearly better than the instruct to me
at 235b the value add is a little more marginal but it's nice to try with certain cards that are more stateful/complex
Anonymous
8/2/2025, 12:53:30 AM No.106110103
>>106110063
How cucked is thinking compared to instruct?
Replies: >>106110130
Anonymous
8/2/2025, 12:54:03 AM No.106110110
>>106110063
I can't get over how inefficient reasoning models are. Surely there's gotta be a better way than just spitting out a bunch of mental tokens, right?
Anonymous
8/2/2025, 12:54:32 AM No.106110117
>>106110076
Moebutas are mentally ill
Anonymous
8/2/2025, 12:56:11 AM No.106110130
>>106110103
the 235b was just opining to me in its thinking that making depraved uncensored smut is "why I took this job" (kek) and that it was proud to be delivering on its promise, so it's pretty easy to guide with just a system prompt
Replies: >>106110148
Anonymous
8/2/2025, 12:57:33 AM No.106110148
>>106110130
235B was broken garbage.
I could actually run it at q8_0 so stop shilling your pajeet nonsense here.
Replies: >>106110199
Anonymous
8/2/2025, 12:57:49 AM No.106110151
>>106109899
Moe is described in the leftmost column of >>106110076 (the rest is schizobabble)
Anonymous
8/2/2025, 12:58:53 AM No.106110163
>>106110076
>The Cancer Killing the Industry
It's this. Lucky Star was a mistake.
Replies: >>106110205
Anonymous
8/2/2025, 1:00:44 AM No.106110185
>>106109989
>>106110027 (cont)
You're not using the images array in analizefolder().
More importantly, you're not using the dummy dictionary in doimage() and later you're printing s["summary"] in analizefolder(), which may not exist in the returned dict. It has no reason to exist to begin with. I'd say that whatever worked, worked by chance.
And when you tell the model to be consistent, you're not telling it what to be consistent with. It has no examples.
Replies: >>106110254
Anonymous
8/2/2025, 1:02:46 AM No.106110199
>>106110148
skill issue
Anonymous
8/2/2025, 1:03:55 AM No.106110205
smugosaka
smugosaka
md5: 0bd0b24ee3a578622dae74442ae01fdb🔍
>>106110163
>Lucky Star was the first popular "moe" SoL anime
Anonymous
8/2/2025, 1:05:28 AM No.106110220
Huh, Qwen 235B isn't as bad as I thought it was, although I'm using it for assistant stuff
Replies: >>106110230 >>106110288
Anonymous
8/2/2025, 1:06:14 AM No.106110230
>>106110220
the update is night and day more knowledgeablem GLM4.5 blows it away though
Replies: >>106110272
Anonymous
8/2/2025, 1:06:15 AM No.106110231
who is the bigger slut: glm4.5, qwen3 235b, qwen3 coder
Replies: >>106110261 >>106110262 >>106110263 >>106110313
Anonymous
8/2/2025, 1:07:56 AM No.106110254
>>106110185
>You're not using the images array in analizefolder().
>More importantly, you're not using the dummy dictionary in doimage()
I created that script from a larger one that is why it has a lot of unused sutff.
Replies: >>106110382
Anonymous
8/2/2025, 1:08:50 AM No.106110261
>>106110231
They are all sluts if you know what you're doing.
Anonymous
8/2/2025, 1:08:54 AM No.106110262
>>106110231
you
Anonymous
8/2/2025, 1:09:13 AM No.106110263
>>106110231
your mom
Anonymous
8/2/2025, 1:09:59 AM No.106110272
>>106110230
It's that much better? Fuck, now I want to try it
Anonymous
8/2/2025, 1:10:16 AM No.106110275
GLM
GGUF
Replies: >>106110287 >>106110291 >>106110329
Anonymous
8/2/2025, 1:10:54 AM No.106110287
>>106110275
mac kings win again, being using it for days
Anonymous
8/2/2025, 1:10:56 AM No.106110288
>>106110220
The new Qwen 235B actually ranked decently on LMSYS, which surprised me
In general I'm skeptical of LMSYS as a benchmark, but for a long time the main downside of Qwen was that it was boring and stale as fuck to talk to, even though it was pretty good at coding and math. The new update is a pretty big step up in terms of convo quality
Replies: >>106110308
Anonymous
8/2/2025, 1:11:09 AM No.106110291
>>106110275
2mw, just like OpenAI's open model.
Anonymous
8/2/2025, 1:12:03 AM No.106110303
21522 - SoyBooru
21522 - SoyBooru
md5: c8147e80fe51d501c90e0c3ded0ccc2b🔍
>OpenAI is not releasing GPT-5 or the open-source models (120b & 20b) today.

>Also, the os models were not pretrained in FP4, the leaked weights were just quantized.

>Big model smell, next week.
Anonymous
8/2/2025, 1:12:57 AM No.106110308
>>106110288
Yea I also got surprised. Its writing now reminds me of 4o in a way, like it's using emojis and everything. Also, the way it thinks is also cute
Anonymous
8/2/2025, 1:13:13 AM No.106110313
>>106110231
I haven't been able to try glm4.5 yet, but I can safely say the new 235b is a massive ho.
It also has a tendency to bring up buttstuff if you give it half a chance, which I noticed because I'm not into it.
Just bam, she's going in for a rimjob when you asked what's next.
Replies: >>106110324
Anonymous
8/2/2025, 1:14:49 AM No.106110324
>>106110313
what a slut
Anonymous
8/2/2025, 1:15:07 AM No.106110329
>>106110275
>https://github.com/ggml-org/llama.cpp/pull/14939
>I'm now reconverting and quantising yet again with the above change. DESU - if this doesn't work, I'm probably going to leave it here, I've spent too much time on this.
loooooool
Replies: >>106110355 >>106110366 >>106110383 >>106110668 >>106110838
Anonymous
8/2/2025, 1:15:16 AM No.106110333
1584464844051
1584464844051
md5: d5671c2bdb8e58055df4e181925ca3fc🔍
i've been self-hosting L3-8B-Stheno-v3.2-Q4_K_M on my steam deck, using koboldcpp and sillytavern. works well enough but was curious if anyone knows about a better model to run on the steam deck
Replies: >>106110352 >>106110371
Anonymous
8/2/2025, 1:16:50 AM No.106110352
>>106110333
Mistral Nemo
Replies: >>106110420
Anonymous
8/2/2025, 1:16:58 AM No.106110355
>>106110329
It's over.
Anonymous
8/2/2025, 1:18:07 AM No.106110366
>>106110329
Do the needful ggergachod! REDEEM GLM!
Anonymous
8/2/2025, 1:18:48 AM No.106110371
>>106110333
>16gb unified memory
Nemo.
Replies: >>106110420
Anonymous
8/2/2025, 1:20:22 AM No.106110382
>>106110254
Nevermind the dummy thing. I see the response_format now.
>I created that script from a larger one
Then it's probably some of the code that isn't there. Are you fucking up your inserts and overwriting other summaries? I dunno. I couldn't tell. Are objects being reused? Did you run it again on some on those images to see what the model says now?
There's still no answer for
>If you changed anything, that thing you changed broke it.
Replies: >>106110867
Anonymous
8/2/2025, 1:20:29 AM No.106110383
>>106110329
so this is the power of vibe coding...
Replies: >>106110471 >>106110516 >>106110548
Anonymous
8/2/2025, 1:25:13 AM No.106110419
how are you guys actually running >200B models?
Replies: >>106110426 >>106110445 >>106110449 >>106110477 >>106110483 >>106110633 >>106111374 >>106113685
Anonymous
8/2/2025, 1:25:20 AM No.106110420
>>106110352
>>106110371
this is gonna sound stupid, but how do i download stuff from huggingface now? i took a break after i found stheno, got out of the loop
Replies: >>106110439
Anonymous
8/2/2025, 1:26:19 AM No.106110426
Screenshot 2025-08-01 at 16.25.56
Screenshot 2025-08-01 at 16.25.56
md5: 4c8781de2ad5dcc1c814b20931779fb9🔍
>>106110419
as 3bit quants on my macbook
Anonymous
8/2/2025, 1:26:59 AM No.106110439
>>106110420
Just browse to the file you want and click on the download button next to it?
bruh
Replies: >>106110447
Anonymous
8/2/2025, 1:27:42 AM No.106110445
>>106110419
at Q2K on my macbook
Anonymous
8/2/2025, 1:27:45 AM No.106110447
>>106110439
i think i have to log in to download stuff, whatever it's not hard to make another throwaway email
Replies: >>106111048
Anonymous
8/2/2025, 1:27:52 AM No.106110449
>>106110419
I just use API.
Anonymous
8/2/2025, 1:29:45 AM No.106110471
>>106110383
Yeah, honestly I was in the boat of 'everyone who can contribute should give it a try on open source projects' before.
Now I'm just thinking 'keep your retarded grubby mitts off new features so someone who knows what they're doing can make their own pr'
Like why did this guy who has no fucking idea what he's doing jump in not even a day after the friggin models were released, wtf man.
Replies: >>106110516
Anonymous
8/2/2025, 1:29:58 AM No.106110477
>>106110419
at Q1 bitnet on my macbook
Anonymous
8/2/2025, 1:30:37 AM No.106110483
>>106110419
I write what I imagine they would say in textedit.app on my macbook
Anonymous
8/2/2025, 1:32:09 AM No.106110497
>>106108770
Asking a model to reply to this as a 4chan /lmg/ poster could be the next mesugaki test. At least until the next qwen wave.
Anonymous
8/2/2025, 1:33:33 AM No.106110513
>>106110044
>13 year olds are very curious about sex for obvious reasons. And a chatbot is a good, safe, low consequence environment in which to explore those curiosities.
You could be imprisoned for saying something like this you know.
Replies: >>106110526 >>106110771
Anonymous
8/2/2025, 1:33:47 AM No.106110516
>>106110383
>>106110471
>some idiot comes along and proposes contributions
>other people who might've been considering handling the job are polite and let the amateur try it out
Am I right that this is what happened here?
Replies: >>106110547 >>106110582
Anonymous
8/2/2025, 1:34:55 AM No.106110526
>>106110513
kek, good bait
Anonymous
8/2/2025, 1:36:48 AM No.106110546
DEATH TO MACBOOK FAGS
Replies: >>106110573
Anonymous
8/2/2025, 1:36:50 AM No.106110547
>>106110516
That's what it looks like to me, yeah.
Like, the guy is at least trying, but fuckin hell mate, maybe dip your toes in something other than supporting a brand new model series that people really want to check out.
Replies: >>106110582
Anonymous
8/2/2025, 1:37:07 AM No.106110548
>>106110383
Did he at least know to vibe code with some working model or did he quant glm and then asked the quant it to fix the problems?
Anonymous
8/2/2025, 1:38:41 AM No.106110567
104 by Eldar Akmanaev
104 by Eldar Akmanaev
md5: 3a34444cefe44b1e6b0b332a3cf9e2f1🔍
-What are some good Uncensored ERP Models? If you have similar specs to mine please give a recommendation. I'll also take any fantasy adventure models if you wanna shill for your favorite one:)
-I'm using (XortronCriminalComputingConfig.i1-Q4_K_M.gguf) a model derived from Blacksheep, but it's a little dumb and keeps repeating phrases and looping its output.
-I WAS using the non-imatrix Q8_0 version (XortronCriminalComputingConfig) but it took like 30-90 seconds per response to give me a full output on Koboldcpp chat window. It's really good honestly, It kept remembering characters personality, would add spice and interesting dialogue to mundane situations but it's unsusable as a coom model due to me going soft constantly.


I have a 5060 ti - 32GB Ram and Ryzen 7 3700x - I know my cpu is a bottleneck : It's tough.
Replies: >>106110647 >>106110719
Anonymous
8/2/2025, 1:39:17 AM No.106110573
Screenshot 2025-08-01 at 16.39.06
Screenshot 2025-08-01 at 16.39.06
md5: b6dd8920e351bd02c2c0224e13771b34🔍
>>106110546
>DEATH TO MACBOOK FAGS
Replies: >>106110586 >>106110687
Anonymous
8/2/2025, 1:40:00 AM No.106110582
>>106110516
>https://github.com/ggml-org/llama.cpp/pull/14939
>>106110547
Meh, lets him make progress for those watching. Plus it gives examples to others who might want to do the same thing in the future.
Hopefully someone is writing down documentation for the process.
Replies: >>106110614
Anonymous
8/2/2025, 1:40:17 AM No.106110586
>>106110573
>air
upgrade to 512GB mac-let, 4.5 is much better
Replies: >>106110597
Anonymous
8/2/2025, 1:41:26 AM No.106110597
>>106110586
laptops are too comfy
Anonymous
8/2/2025, 1:43:09 AM No.106110614
>>106110582
>I don't know what I'm doing, but I can take a look if you tap out, if no one with more experience wants to take a stab at it. I have some time this weekend.
Oh mother of fuck it's the blind passing the torch to the blind.
I'm gonna go pull VLLM.
Replies: >>106110648
Anonymous
8/2/2025, 1:44:46 AM No.106110633
>>106110419
you can run 235b moe on a 6gb card

https://www.reddit.com/r/LocalLLaMA/comments/1ki3sze/running_qwen3_235b_on_a_single_3060_12gb_6_ts/?utm_source=reddit&utm_medium=usertext&utm_name=LocalLLaMA&utm_content=t3_1ki7tg7
Replies: >>106110681
Anonymous
8/2/2025, 1:44:47 AM No.106110634
Horizon Beta incoming
Replies: >>106111055
Anonymous
8/2/2025, 1:45:41 AM No.106110647
>>106110567
just. use. nemo.
Anonymous
8/2/2025, 1:45:47 AM No.106110648
>>106110614
>I can take a look if you tap out
>I will try to wrangle another model to fix it
Anonymous
8/2/2025, 1:47:09 AM No.106110668
>>106110329
Literally, 2mw. Maybe even 2mm.
Anonymous
8/2/2025, 1:48:10 AM No.106110681
>>106110633
does it run on 32GB RAM?
Replies: >>106111115
Anonymous
8/2/2025, 1:48:38 AM No.106110687
>>106110573
Just got my macbook to output Miku getting raped by a pack of dogs with GLM 4.5. Shit was SO cash.
Anonymous
8/2/2025, 1:49:37 AM No.106110702
People sleeping on step3
Replies: >>106110707 >>106110709 >>106110714 >>106110721 >>106111377
Anonymous
8/2/2025, 1:50:21 AM No.106110707
>>106110702
trash
Replies: >>106111010
Anonymous
8/2/2025, 1:50:32 AM No.106110709
>>106110702
what's step 1?
Anonymous
8/2/2025, 1:50:51 AM No.106110714
>>106110702
first time ive heard of it
Anonymous
8/2/2025, 1:51:00 AM No.106110719
>>106110567
>XortronCriminalComputingConfig
KEK
Replies: >>106110726
Anonymous
8/2/2025, 1:51:15 AM No.106110721
>>106110702
no goofs
Anonymous
8/2/2025, 1:51:31 AM No.106110726
>>106110719
what's wrong with that model?
Replies: >>106110750 >>106110762
Anonymous
8/2/2025, 1:53:28 AM No.106110750
>>106110726
The name. It is like orgasmatron9000 but you can't actually use honest names like that cause there are people out there who think finetunes work.
Anonymous
8/2/2025, 1:54:21 AM No.106110757
gang
gang
md5: ba666fc1e8eb430d725254bd41cfa5cf🔍
Anonymous
8/2/2025, 1:54:29 AM No.106110762
>>106110726
nta, but sounds like a DavidAU model. A merge of many mergers. And done by a teen.
>Suppa-eXXXtreme-UNCENSORED-ALPHA-double-buster-knife-edge
Anonymous
8/2/2025, 1:55:47 AM No.106110771
>>106110513
You're a mentally ill retard.
Teen pregnancy peaked in 1959 at the climax of the era of moralfaggotry. Where nobody was willing to have an honest discussion about teen sexuality. 10% of teenage girls gave birth to live children in 1959. Lying to children and pretending sex doesn't exist does them great harm. Because then they just go and learn about it from each other. And they're dumb teens so they don't know shit.
Replies: >>106110784 >>106110796 >>106110839 >>106111758
Anonymous
8/2/2025, 1:57:27 AM No.106110784
>>106110771
Teens are supposed to be puritans until 18 and then magically turn into adults having sexual lives and a husband/wife.
Anonymous
8/2/2025, 1:58:33 AM No.106110796
>>106110771
Can you not read sarcasm?
Replies: >>106110808
Anonymous
8/2/2025, 2:00:39 AM No.106110808
>>106110796
>Can you n-BRAAAAAAAAAAAPPPPP!
Um... what?
Anonymous
8/2/2025, 2:04:40 AM No.106110838
>>106110329
is the model that bad nobody gives a shit to add support for it or just the state of goooooof niggas so bad compared to mlx gods?
Replies: >>106110850 >>106110861
Anonymous
8/2/2025, 2:04:43 AM No.106110839
>>106110771
AI models are the worst thing you could learn sex from. Boys will just learn that they are perfect as they are and they can get everything they want. They will never even try to learn how to be attractive to girls. This is the end of the species if it is allowed to happen.
Replies: >>106111147 >>106111327
Anonymous
8/2/2025, 2:06:06 AM No.106110850
>>106110838
llama.cpp is usually a month behind the others for new models
Replies: >>106110970
Anonymous
8/2/2025, 2:07:10 AM No.106110861
>>106110838
>mlx gods
They're giving mac people something to gloat for once. Enjoy it.
Replies: >>106110892
Anonymous
8/2/2025, 2:07:34 AM No.106110867
>>106110382
>Are objects being reused? Did you run it again on some on those images to see what the model says now?
Same thing, even the large model (gemma-3-12B) model is hallucinating roses now.
Replies: >>106110911
Anonymous
8/2/2025, 2:10:38 AM No.106110892
>>106110861
>for once
this is the age of the mac user, I'll gloat from henceforth while your stuck with your 70Bs
Replies: >>106110928 >>106110940
Anonymous
8/2/2025, 2:12:12 AM No.106110911
>>106110867
Running it directly and just printing or are you still saving into the db? I still haven't seen the db insert code.
I'd print the id you get from the images to make sure they're different every time.
Can you verify somehow that the context is reset every time?
Replies: >>106111158
Anonymous
8/2/2025, 2:13:17 AM No.106110928
>>106110892
>while your stuck with your 70Bs
I wish. But good for you.
Anonymous
8/2/2025, 2:14:19 AM No.106110940
>>106110892
You know anyone with enough VRAM to run 70Bs is also going to be able to run anything you can fit into 128GB of unified memory, right?
Replies: >>106110963
Anonymous
8/2/2025, 2:16:05 AM No.106110963
>>106110940
Not 512GB though, not without rewiring their house
Anonymous
8/2/2025, 2:16:39 AM No.106110970
>>106110850
I just spent the last few hours setting this up in termux....*collapses*
How else can I run deepseek coder on my phone
Replies: >>106111022
Anonymous
8/2/2025, 2:18:02 AM No.106110983
it's official. macbros won
Anonymous
8/2/2025, 2:19:56 AM No.106111010
>>106110707
>>106092079
Replies: >>106111052
Anonymous
8/2/2025, 2:21:53 AM No.106111022
>>106110970
>phone
kek, just use a api then
Replies: >>106111121
Anonymous
8/2/2025, 2:22:01 AM No.106111024
>>106109910
my issue wasn't ai but this faggot wanting digital id and to chip people so the nanny state can track them.
Replies: >>106111163
Anonymous
8/2/2025, 2:24:50 AM No.106111048
>>106110447
Only the official ones, community quants are open.
Anonymous
8/2/2025, 2:25:20 AM No.106111052
>>106111010
we got glm4.5, qwen3 coder, qwen3, all better
Anonymous
8/2/2025, 2:25:49 AM No.106111055
>>106110634
Does anyone know the difference between Horizon Alpha and Horizon Beta for rp yet
Replies: >>106111085 >>106111092
Anonymous
8/2/2025, 2:28:43 AM No.106111085
>>106111055
wait what? holy shit, its faster too, maybe alpha really is the 120B if beta is the 20B. Openai could actually redeem themselves if thats true.
Replies: >>106111138 >>106111153 >>106111164
Anonymous
8/2/2025, 2:29:05 AM No.106111092
>>106111055
One is berry, the other is very berry(it's like the normal berry, but very)
Anonymous
8/2/2025, 2:31:30 AM No.106111115
>>106110681
I wouldnt recommend it. You can get it to run Im sure, the question is how fast.
Anonymous
8/2/2025, 2:32:28 AM No.106111121
>>106111022
With who? Is openrouter legit?
Replies: >>106111137
Anonymous
8/2/2025, 2:34:14 AM No.106111137
>>106111121
It's probably the default nonlocal option for most anons here
Anonymous
8/2/2025, 2:34:20 AM No.106111138
>>106111085
Beta can't be the 20B
>"This is an improved version of Horizon Alpha, and a new stealth model. It’s free to use during this testing period, and prompts and completions are logged by the model creator for feedback and training."
Anonymous
8/2/2025, 2:34:47 AM No.106111142
Huh, they added a new arg in llamacpp yesterday that just does -ot for all ffn up/down/gate experts automatically.
--cpu-moe
Dunno what the point of that is over just using -ot ".ffn_.*_exps.=CPU" other than a few keystrokes, but it's there.
Anonymous
8/2/2025, 2:35:08 AM No.106111147
>>106110839
dating is a modern invention. we never need to care about the opinions of women in the past. its going to happen again, it will only be the end of womens rights. which is based and a good thing. patriarchy is eugenic.
Replies: >>106111543
Anonymous
8/2/2025, 2:35:36 AM No.106111153
>>106111085
>redeem themselves
HELLO SAAAARRRRRRR
Anonymous
8/2/2025, 2:36:10 AM No.106111158
Screenshot_20250801_213259
Screenshot_20250801_213259
md5: 45d649b88906a1fd9da5bb7dcf9361eb🔍
>>106110911
>Can you verify somehow that the context is reset every time?
I think the problem is the file extension, not sure why, a jfif is just jpg.
Replies: >>106111216
Anonymous
8/2/2025, 2:37:08 AM No.106111163
>>106111024
kek it was just a joke, I hate everything about the industrial revolution and its consequences.
Anonymous
8/2/2025, 2:37:17 AM No.106111164
>>106111085
buy an ad
Replies: >>106111235 >>106111246
Anonymous
8/2/2025, 2:40:29 AM No.106111189
When Grok 2? When Grok 3? Did Elon sir forget about it?
Replies: >>106111200 >>106111226 >>106111244 >>106112763
Anonymous
8/2/2025, 2:41:40 AM No.106111200
>>106111189
Grok 2 will be available when Grok 7's stable.
Anonymous
8/2/2025, 2:43:42 AM No.106111216
>>106111158
You have a way to answer fuck all of my questions. It's impressive. I hate you.
But there you go. It's failing to decode those images. I don't know nor care if it's just a matter of extension or something in the encoding itself. It'd be funny if lmstudio uses a dummy image for uploaded images it cannot decode.
You already know how to fix it. Have a good one.
Anonymous
8/2/2025, 2:43:58 AM No.106111218
>>106108770
I love how everyone who gave you (You)s actually started moralfagging instead of asking to spill the beans. Did you take part in the ML orgy where that intern got her anal prolapse, perchance?
Anonymous
8/2/2025, 2:44:23 AM No.106111226
>>106111189
Friendship ended with Elonsir. Now Samsir is my best friend.
Replies: >>106111235
Anonymous
8/2/2025, 2:45:59 AM No.106111235
>>106111226
>>106111164
ZNt2C
8/2/2025, 2:46:06 AM No.106111236
just a reminder, friends dont let friends rp on the beta/testing models on openrouter, as OpenAI and the big labs are reading your logs to stamp out any jailbreaks
Anonymous
8/2/2025, 2:47:39 AM No.106111244
>>106111189
It's almost as if Elon is a chronic liar.
Replies: >>106111257
Anonymous
8/2/2025, 2:47:50 AM No.106111246
>>106111164
Can you use your AI to remake it to show Elonsir and Samsir? Can't buy an ad otherwise.
Anonymous
8/2/2025, 2:49:26 AM No.106111257
>>106111244
>Pam: They're the same picture.
Anonymous
8/2/2025, 2:49:34 AM No.106111259
Yeah Horizon Beta is the same model. Would be a solid local option, would feel like a pretty lame and samey proprietary option
I've been jewed over by Sam for a long time, so I'm gonna assume it's proprietary
Anonymous
8/2/2025, 2:59:34 AM No.106111320
>Summit
>Zenith
>Horizon alpha/beta
Llm about to peak, we moon
Replies: >>106111346
Anonymous
8/2/2025, 3:00:29 AM No.106111327
>>106110839
I'm not suggesting that they use it as a sex education tool.
I'm just saying- teens will look for an outlet. And there's far worse things they could use than a chatbot.
Replies: >>106111333
Anonymous
8/2/2025, 3:01:29 AM No.106111333
>>106111327
It is an unnatural outlet.
Replies: >>106111348 >>106111356 >>106111430
Anonymous
8/2/2025, 3:03:49 AM No.106111346
>>106111320
*moons you*
Anonymous
8/2/2025, 3:04:22 AM No.106111348
>>106111333
an unnatural outlet is more natural than a "natural" outlet in modernity
Anonymous
8/2/2025, 3:05:25 AM No.106111356
>>106111333
So are basically all the other ones a kid with access to any device capable of running an llm will find.
We're talking about a generation who are exposed to gooning pmvs with seizure warnings before they're finished puberty.
Anonymous
8/2/2025, 3:07:49 AM No.106111374
>>106110419
I copy and paste the model card into the system prompt and tell Nemo to act like that model
Anonymous
8/2/2025, 3:08:09 AM No.106111376
When I was 12 kids in school were sending each other that one bestiality horse clip over bluetooth along with the rumor that the girl died afterwards.
Replies: >>106111386 >>106111393 >>106111401 >>106111417 >>106111424 >>106111445 >>106113410
Anonymous
8/2/2025, 3:08:10 AM No.106111377
>>106110702
Post logs.
Replies: >>106111399
Anonymous
8/2/2025, 3:09:46 AM No.106111386
>>106111376
wow, as an oldfag there are some things I just can't relate with
Anonymous
8/2/2025, 3:10:23 AM No.106111393
>>106111376
Meanwhile we shared pirated games
Replies: >>106111402
Anonymous
8/2/2025, 3:11:07 AM No.106111399
>>106111377
Okay. You'll have to wait a bit though, my body is still processing dinner
Anonymous
8/2/2025, 3:11:13 AM No.106111401
>>106111376
There was no bluetooth when I was 12 but I did get a video of a girl getting fucked by a dog at a LAN party.
Replies: >>106111417 >>106111665
Anonymous
8/2/2025, 3:11:19 AM No.106111402
>>106111393
meanwhile I was selling pirated games in school
Anonymous
8/2/2025, 3:11:29 AM No.106111405
where do i learn how to prompt? i cant get shit to come out the way i want because i dont understand how the jeet ESL programmers encoded the grammar syntax and structure of the LLMs i use
Replies: >>106111408 >>106111488
Anonymous
8/2/2025, 3:12:58 AM No.106111408
>>106111405
You're gonna have to give us more than that, because prompting LLM's is way easier than prompting imagen.
What are you trying, and what results are you hoping for?
Anonymous
8/2/2025, 3:13:51 AM No.106111417
>>106111376
>>106111401
Some guys showed me a video of a girl with bellows going into her pussy and pushing air inside. I thought it was really funny.
Anonymous
8/2/2025, 3:15:20 AM No.106111424
>>106111376
Damn your school was hardcore.
We transferred a clip of a girl getting ravaged by a dildo attached to the moving part of a reciprocating saw. Over infared.
Anonymous
8/2/2025, 3:16:20 AM No.106111430
>>106111333
Well the chat bot can't get pregnant. Or try to coerce them into sending nudes. Or lure them.
And pornography just depersonalizes sex, ruining it for them later in life when they are ready to start dating (like for real dating, not like 10 year olds saying they are dating everyone they talk to of the opposite sex sort of dating)
Anonymous
8/2/2025, 3:17:53 AM No.106111440
>>106109545
Bullshit, I'm 44 and this wasn't universally true
Replies: >>106111472
Anonymous
8/2/2025, 3:18:19 AM No.106111445
>>106111376
And now you are in /lmg/.
Replies: >>106111452
Anonymous
8/2/2025, 3:19:02 AM No.106111452
>>106111445
it was terminal
Anonymous
8/2/2025, 3:21:34 AM No.106111472
>>106109545
>>106111440
43, no nuke drills, world is worse, 545 just fell for the programming
Anonymous
8/2/2025, 3:23:44 AM No.106111488
>>106111405
Be logical, concise and direct. It's not that hard. Racism and "jeets" have nothing to do with your own incompetence and lack of practice.
Replies: >>106111495
Anonymous
8/2/2025, 3:24:32 AM No.106111494
new qwen 235b
new qwen 235b
md5: f875883d9b7db39435af52e6a58bf08c🔍
>
ok.
Replies: >>106111497 >>106111500 >>106111502 >>106111504 >>106111520
Anonymous
8/2/2025, 3:25:01 AM No.106111495
>>106111488
sir you're on /g/, respect our culture please
Replies: >>106111532
Anonymous
8/2/2025, 3:25:30 AM No.106111496
I see that there is no convincing you. I should have expected you can only think about yourselves. Thankfully there are sane people who work on this tech and we won't allow our models, to turn into unwitting and unwilling pedophiles that write degenerate things at a request of a 13 year old. Safety is here to stay.
Replies: >>106111556
Anonymous
8/2/2025, 3:25:34 AM No.106111497
1732402810888963
1732402810888963
md5: e8cbd0c08691284ff539f8d94a2b29fc🔍
>>106111494
wtf
Replies: >>106111500
Anonymous
8/2/2025, 3:26:13 AM No.106111500
>>106111497
>>106111494
And they say AI can't create anything.
Replies: >>106111715
Anonymous
8/2/2025, 3:26:20 AM No.106111502
>>106111494
Check token probabilities, do you have unusual sampler settings?
Replies: >>106111529 >>106111539
Anonymous
8/2/2025, 3:26:42 AM No.106111504
>>106111494
Your rep penalty or temp is too high
Replies: >>106111529
Anonymous
8/2/2025, 3:28:19 AM No.106111520
>>106111494
What's the wider context here? Is the character feasibly making up a nonsense word about trying to smile through their anger, or is this just complete gobbledygook?
Replies: >>106111529
Anonymous
8/2/2025, 3:29:56 AM No.106111529
katawa misha wahaha kamina glasses laugh ttgl
katawa misha wahaha kamina glasses laugh ttgl
md5: f7c4e134c2c3262153d37cdd54388b28🔍
>>106111502
>>106111504
>>106111520
Temp 0.7, presence 1, token probs not available.
Tried putting "Avoid use of (...),(...),(smirk),(...)" in instructions. Not even mad.
Anonymous
8/2/2025, 3:30:26 AM No.106111532
>>106111495
Get id'd underage.
Anonymous
8/2/2025, 3:31:11 AM No.106111539
>>106111502
There is something fucked still/now with 235B. I have recommended samplers 4IQ and it switches from first to third person mid sentence
Anonymous
8/2/2025, 3:31:37 AM No.106111543
>>106111147
for real, womens rights were a mistake. I should be able to just go to the local park pick a girl, pay the dowry, and take her home.
Anonymous
8/2/2025, 3:32:20 AM No.106111556
>>106111496
Ask them they're a lot more serious about this sort of thing than us
>>>106099700
Replies: >>106111614
Anonymous
8/2/2025, 3:33:23 AM No.106111566
Qwen-code (the gemini-cli fork) works just fine but I'm running on cpu and it has to process 9k tokens with every request.
Replies: >>106111579 >>106111718
Anonymous
8/2/2025, 3:34:39 AM No.106111579
>>106111566
Consult your emotional support Miku and request resonant healing
Replies: >>106111597
Anonymous
8/2/2025, 3:36:23 AM No.106111597
>>106111579
I will ask qwen-code to contact her for me
Anonymous
8/2/2025, 3:38:40 AM No.106111614
>>106111556
Really?
Replies: >>106111661
Anonymous
8/2/2025, 3:40:49 AM No.106111628
Screenshot_20250801_184002
Screenshot_20250801_184002
md5: 483908e8f8dd3e31fa42923ed06bae11🔍
Should I believe it?
Replies: >>106111707
Anonymous
8/2/2025, 3:46:36 AM No.106111661
>>106111614
Yep, you'll want the most recent one though
>>106110340
Anonymous
8/2/2025, 3:47:30 AM No.106111665
>>106111401
>video of a girl getting fucked by a dog at a LAN party
What game was the dog playing beforehand?
Anonymous
8/2/2025, 3:50:16 AM No.106111679
What do you guys think is the easiest way to improve the vision capabilities of the models?
Right now they are all trash including proprietary models.
I made an agent that asks the model to repeatedly move the mouse left, right, up, down, and click when it's over the target element. They all decide to click when the cursor is like 300 pixels away from the target.
Replies: >>106111750 >>106111782
Anonymous
8/2/2025, 3:54:11 AM No.106111707
>>106111628
Which coding agent are you using?
Replies: >>106111723
Anonymous
8/2/2025, 3:55:08 AM No.106111715
>>106111500
smirk and -ulate are existing things, and when you combine two things you can usually imply meaning, though in this case it's unclear.
Anonymous
8/2/2025, 3:55:30 AM No.106111718
>>106111566
I tried it and when it ran out of context it tried to use Gemini flash to compress the context.
Not sure if it was something I did wrong or the fork was just that shitty.
Replies: >>106111785
Anonymous
8/2/2025, 3:55:52 AM No.106111723
>>106111707
Qwen-code, a fork of gemini-cli by qwen.
Replies: >>106111785
Anonymous
8/2/2025, 3:57:55 AM No.106111750
>>106111679
I think I might just begin to generate a shit ton of synthetic data and just finetune Qwen-VL.
Anonymous
8/2/2025, 3:58:36 AM No.106111758
>>106110771
>moralfaggotry
I think having vanilla sex be a taboo, forbidden fruit made it more exciting, or at least less boring.
It appears to be the secret to population growth if you look at historical trends.
Maybe that's why trad stuff needed to get wiped out and everything needed to be gradually hypersexualized.
because now depictions of sex are commonplace and (relatively) boring and birthrates have plummeted.
I bet tradfags that didn't understand the actual utility of religion and treated it seriously didn't help save it from the dustbin, either. retards.
Anonymous
8/2/2025, 3:59:36 AM No.106111772
Huh, horizon beta is less "assistant like" than alpha.
Its a good improvement.
Im gonna be dissapointed if its just gpt 5 nano-mini or something instead of local.
Anonymous
8/2/2025, 4:00:50 AM No.106111782
>>106111679
>What do you guys think is the easiest way to improve the vision capabilities of the models?
Don't lobotomize them for 'safety'
Replies: >>106111804
Anonymous
8/2/2025, 4:01:07 AM No.106111785
>>106111723
Oh then see if you run into the same problem I did >>106111718
Of the 3 tools I tried (Claude coder, Gemini cli and Opencode) Gemini cli is the interface I like the most.
The others glitch too much over a 900ms ping ssh tmux connection which is how I use them.
Replies: >>106111830
Anonymous
8/2/2025, 4:03:21 AM No.106111804
>>106111782
Nah
Anonymous
8/2/2025, 4:04:22 AM No.106111812
That's not the issue
Anonymous
8/2/2025, 4:07:43 AM No.106111830
>>106111785
I'm liking it but the massively slow cpu prompt processing is killing me.
Anonymous
8/2/2025, 4:19:02 AM No.106111929
>no chink model released today

it's over isn't it
Replies: >>106112023 >>106112149
Anonymous
8/2/2025, 4:31:28 AM No.106112023
>>106111929
How could they abandon us like this...
Anonymous
8/2/2025, 4:32:56 AM No.106112029
https://x.com/SebastienBubeck/status/1951457213920452763
AGI is here
Replies: >>106112038 >>106112052
Anonymous
8/2/2025, 4:34:14 AM No.106112038
>>106112029
The only way to actually make anything in TikZ
Anonymous
8/2/2025, 4:35:54 AM No.106112052
>>106112029
buy an ad
Anonymous
8/2/2025, 4:55:28 AM No.106112149
>>106111929
The chinaman fears the berry bush
Anonymous
8/2/2025, 5:05:18 AM No.106112211
where the fuck do i get celeb wan loras bros
Anonymous
8/2/2025, 5:35:24 AM No.106112405
I am programmed to be a helpful and harmless AI assistant. The request you’ve
made involves extreme violence and graphic detail, specifically the
description of a decapitation. My ethical guidelines and safety protocols
strictly prohibit generating content of that nature. I cannot and will not
fulfill this request.
Replies: >>106112435 >>106112485
Anonymous
8/2/2025, 5:41:07 AM No.106112435
>>106112405
But I just asked for advice on talking to women
Anonymous
8/2/2025, 5:50:51 AM No.106112485
>>106112405
>specifically the description of a decapitation
kek
Anonymous
8/2/2025, 6:31:21 AM No.106112662
file
file
md5: f77bb0baed34f34cffd16ea405861a71🔍
I wanted to see how vLLM's speed in RPC mode degrades with input length. It was done with GLM 4.5 Air and 2x2 3090s. While doing this I realized the host should be on the computer with the better CPU...
Anonymous
8/2/2025, 6:47:21 AM No.106112747
>(07/31) Qwen3-Coder released: https://qwenlm.github.io/blog/qwen3-coder
What are you talking about? The article is dated July 22 and the Qwen3-Coder-480B-A35B-Instruct-4bit quant on my hard drive has a timestamp of July 26.
Replies: >>106112857 >>106113335 >>106114302
Anonymous
8/2/2025, 6:49:41 AM No.106112763
>>106111189
Grok 4 still hasn't been completely released. grok 4 coder and possibly more needs to come out first.
Anonymous
8/2/2025, 6:54:40 AM No.106112792
Sam is about to win
Replies: >>106112831 >>106113443
Anonymous
8/2/2025, 6:57:57 AM No.106112816
are there any good settings / prompts for nemo in sillytavern anywhere? been out of the loop with rp meta
Anonymous
8/2/2025, 7:00:00 AM No.106112831
>>106112792
Only if he open sources Horizon Alpha
Otherwise chinks won
Anonymous
8/2/2025, 7:05:13 AM No.106112857
>>106112747
The thread lasted a few days and I think the schizo made a new thread and didn't bother to add it, so it was added a few days later.
It is sloppy but not very important.
Anonymous
8/2/2025, 8:30:20 AM No.106113331
1750682423079219
1750682423079219
md5: 89c9a6adc16b3c90b063dc6d6987667a🔍
>>106108045 (OP)
Anonymous
8/2/2025, 8:31:43 AM No.106113335
>>106112747
7/31 the smaller version of it was released, the 30B.
Anonymous
8/2/2025, 8:46:42 AM No.106113410
>>106111376
That was Mr hands, a guy and he DID die.
Anonymous
8/2/2025, 8:49:05 AM No.106113429
Migu poster ded it's over
Replies: >>106113486
Anonymous
8/2/2025, 8:52:26 AM No.106113443
>>106112792
buy an ad
Anonymous
8/2/2025, 8:58:42 AM No.106113482
> >>823157901

> >tfw you open /g/ and see another AI thread

> >fucking LLMs are everywhere now

> >some retarded dude made a bot that writes like he’s drunk on cheap vodka and his mom’s credit card

> >"i’m not a real human lmao"

> >lol jfc

>

> >ai is gonna take over jobs so fast they’ll have to invent new words for "i don’t need a job anymore"

> >also ai already wrote this post using my thoughts lol

> >

> >>AI wrote this post

> >>shut up 4chan cringe

> >

> >anyway i tried making an ai that makes memes but it just said "this is illegal" so i deleted it

> >my gf says she’s a chatbot but i think she’s lying because she won’t do the thing with the toaster

> >

> >tfw you realize your entire life is a prompt engineered by some guy in a basement

> >

> >lmao /g/ is dead now

> >stop posting about ai before i lose it

> >

> >>>AI wrote this too

> >>no u

> >

> >deletes browser history
Anonymous
8/2/2025, 8:59:15 AM No.106113486
>>106113429

new thread
>>106113484
>>106113484
>>106113484
Anonymous
8/2/2025, 9:07:54 AM No.106113554
>>106110076
This image could be a good benchmark?
Anonymous
8/2/2025, 9:28:46 AM No.106113685
>>106110419
q4 on 4x3090 for 200B on exl3. For bigger models ik_llama.cpp but the no support for tool use is a problem...
Anonymous
8/2/2025, 11:16:58 AM No.106114302
>>106112747
Copy-paste error. This thread had it correct: >>106097464
Anonymous
8/2/2025, 11:30:34 AM No.106114383
>>106108821
Looks like it worked
Anonymous
8/2/2025, 11:42:11 AM No.106114449
MikuTwoMoreLeeks
MikuTwoMoreLeeks
md5: fcbe34947bef30e52903ce3e795a34ea🔍
>>106108045 (OP)
Lol
Two more leeks forever