/lmg/ - Local Models General - /g/ (#106225432) [Archived: 29 hours ago]

Anonymous
8/11/2025, 6:56:06 PM No.106225432
1754320326527
1754320326527
md5: 6a9c2a94004999e5846f36d3e8e7256d🔍
/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>106217978 & >>106212937

►News
>(08/06) GLM-4.5V released, based on GLM-4.5-Air: https://hf.co/zai-org/GLM-4.5V
>(08/06) Qwen3-4B-Thinking-2507 released: https://hf.co/Qwen/Qwen3-4B-Thinking-2507
>(08/06) Koboldcpp v1.97 released with GLM 4.5 support: https://github.com/LostRuins/koboldcpp/releases/tag/v1.97
>(08/06) dots.vlm1 VLM based on DeepSeek V3: https://hf.co/rednote-hilab/dots.vlm1.inst
>(08/05) OpenAI releases gpt-oss-120b & gpt-oss-20b: https://openai.com/index/introducing-gpt-oss

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/tldrhowtoquant
https://rentry.org/samplers

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/leaderboard.html
Code Editing: https://aider.chat/docs/leaderboards
Context Length: https://github.com/adobe-research/NoLiMa
Censorbench: https://codeberg.org/jts2323/censorbench
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
Replies: >>106226274 >>106228301 >>106228615
Anonymous
8/11/2025, 6:56:24 PM No.106225438
threadrecap2
threadrecap2
md5: 955a3ca9669b61f763be0ed34edff5d0🔍
►Recent Highlights from the Previous Thread: >>106217978

--VLLM CPU offload fails due to missing pre-built support and poor error feedback:
>106218692 >106218723 >106218737 >106218783 >106218827 >106218918 >106219093 >106219192
--Qwen 235b sampler tuning for coherence and variety:
>106221308 >106221341 >106221549 >106221619 >106223410
--RAM increase negligible for llama.cpp when still relying on disk swap:
>106222670 >106222743 >106222860 >106222753
--AI solves Hitler riddle with safety filters interfering despite correct logic:
>106218100 >106223804 >106223902 >106224022 >106223851
--Hidden character card fields in SillyTavern causing unexpected prompt behavior:
>106219739 >106219801 >106219849 >106219871
--Merged PR simplifies Mistral model integration in llama.cpp by removing redundant runtime deps:
>106222328 >106222396
--GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models:
>106223114 >106223640 >106223691 >106223748 >106223785 >106223809 >106223811 >106223826 >106223212
--GLM-4.5 Vision performance and local model competition in roleplay and OCR tasks:
>106222085 >106222306 >106222378 >106222397 >106222829 >106222884 >106222909 >106222926 >106222738 >106222886 >106222920
--DDR5 bandwidth gains limited by NUMA and PCIe bottlenecks in MoE model inference:
>106221496 >106221688 >106222050
--Combating repetitive text generation with mirostat and context variation techniques:
>106221963 >106221992 >106222007 >106222019 >106222025 >106222039 >106222133 >106222149 >106222164 >106222177 >106222217 >106222232 >106221994 >106222003 >106222056 >106222059
--Mitigating repetitive speech patterns in long-running AI roleplay sessions:
>106218114 >106218192 >106218232 >106218261 >106218326 >106218526
--AI hardware bandwidth limits and co-packaged optics explained:
>106218178
--Miku (free space):
>106221496 >106223575

►Recent Highlight Posts from the Previous Thread: >>106217984

Why?: 9 reply limit >>102478518
Fix: https://rentry.org/lmg-recap-script
Anonymous
8/11/2025, 6:59:54 PM No.106225483
Returning newfag here. What's the meta 4090 model for RP/storytelling these days?
Replies: >>106225495 >>106225500 >>106225524
Anonymous
8/11/2025, 7:00:45 PM No.106225495
>>106225483
GLM-4.5-Air
Replies: >>106225589
Anonymous
8/11/2025, 7:01:24 PM No.106225500
>>106225483
OpenAI's GPT-oss 20B.
Replies: >>106225589
Anonymous
8/11/2025, 7:03:26 PM No.106225524
>>106225483
check this page to find out: https://www.reddit.com/user/TheLocalDrummer/submitted/
Replies: >>106225589
Anonymous
8/11/2025, 7:06:33 PM No.106225556
'toss in the 'rash
Replies: >>106225563 >>106225576
Anonymous
8/11/2025, 7:07:04 PM No.106225563
>>106225556
GOTEM
Anonymous
8/11/2025, 7:08:58 PM No.106225576
>>106225556
How will moatboy recover from this one?
Anonymous
8/11/2025, 7:10:09 PM No.106225589
>>106225495
>>106225500
>>106225524
Thanks anons!
Anonymous
8/11/2025, 7:15:23 PM No.106225627
Mikulove
Replies: >>106225641 >>106225647
Anonymous
8/11/2025, 7:17:04 PM No.106225641
1753244610490949
1753244610490949
md5: 3070b25b995049ee7db2849f99d70040🔍
>>106225627
Replies: >>106225703 >>106225965
Anonymous
8/11/2025, 7:17:44 PM No.106225647
>>106225627
hi butifel show bob
Replies: >>106225762
Anonymous
8/11/2025, 7:18:41 PM No.106225655
1738531741166170
1738531741166170
md5: 506e9605e20401cc6c8027fdd19dad59🔍
>August 11, 2025
https://archive.org/details/meta_leaks_part_1
Replies: >>106225677 >>106225678 >>106225874
Anonymous
8/11/2025, 7:21:52 PM No.106225677
>>106225655
>not behemoth/llama4.1
I am disappointed.
Anonymous
8/11/2025, 7:21:54 PM No.106225678
1731676858678646
1731676858678646
md5: 0383f372424aea4d6305accb2c08da1d🔍
>>106225655
Anonymous
8/11/2025, 7:26:02 PM No.106225703
>>106225641
This time it is the local fauna harassing the Miku
Replies: >>106225762
Anonymous
8/11/2025, 7:27:44 PM No.106225727
How would you measure the amount of variety between rerolls for a given model+sampler?

Here's what I'm considering:
>have the model generate a story
>cut off the story partway through and have it generate N different completions
>give an llm judge the common prefix and two completions and ask it to rate whether the second completion plagiarized the ideas of the first
The goal is to detect how often rerolling just gives you the same thing in different words.

I know LLM judges are not very reliable, so I'm also wondering if it would be possible to do this by looking at embeddings or probability distributions somehow. For example, from the common prefix, complete N paragraphs + "[Character] knew he should ___" and see the distribution of things the model thinks the character should be doing at this point. Rerolls that are wildly different should have different distributions of what happens next.
Replies: >>106225809
Anonymous
8/11/2025, 7:32:08 PM No.106225762
1754933374894-7147fe9c-2630-4ff5-9c40-c8888829b1fd
1754933374894-7147fe9c-2630-4ff5-9c40-c8888829b1fd
md5: 45c5946dbbe2fc893769053d360376a8🔍
>>106225647
>>106225703
Replies: >>106226135 >>106229848
Anonymous
8/11/2025, 7:37:01 PM No.106225809
>>106225727
I'd do something like generating multiple completions, sentence embedding each, then look at distances in the embedding space (TSNE, PCA, etc. could be used here).
Replies: >>106228011
Anonymous
8/11/2025, 7:38:23 PM No.106225820
>still no github issue for Air Vision
ACK
Replies: >>106225997
Anonymous
8/11/2025, 7:43:37 PM No.106225874
1741678641366841
1741678641366841
md5: b46df4d897cdb8c96027a60d3270e653🔍
>>106225655
archive org is slow ahh https://files.catbox.moe/tlejyd.pdf
Anonymous
8/11/2025, 7:43:41 PM No.106225876
My hype list, from most interesting to least:
>Anthropic leak
>DS V4/R2
>Some new big Chinese model out of nowhere
>Grok2/3
>Whatever cohere is making
>Gemma
>Llama
Replies: >>106225915 >>106225968
Anonymous
8/11/2025, 7:47:19 PM No.106225915
>>106225876
Mistral Large 3 is off the charts
Replies: >>106225970
Anonymous
8/11/2025, 7:48:25 PM No.106225921
>AI sees quantum leap with GPT-3 generation, effectively destroying the Turing test and promising to usher in technological utopia.
>AI companies fall for the pajeet meme in their haste to rapidly expand to meet demand.
>3 years of endless benchmaxxing and safetyslopping later and it hits a wall.
Sadly the fate of every industry it seems.
Replies: >>106225933 >>106225945
Anonymous
8/11/2025, 7:49:54 PM No.106225933
>>106225921
AI doesn't need big jumps. This is fine.
Anonymous
8/11/2025, 7:50:57 PM No.106225945
>>106225921
I'll give it some credit, at least it demonstrated some degree of usefulness before hitting that wall
The current state of LLMs won't outright revolutionize things, but they'll have a place until the next major development comes along, whether that's still LLM related or something else outright
Replies: >>106225973
Anonymous
8/11/2025, 7:52:15 PM No.106225954
I just did a CYOA with Air and greedy sampling, and didn't get much repetition, even up to 20k tokens. The only thing I did differently with this is that I used Open Webui with chat completion. Don't fucking tell me there's something wrong with text completion in Llama.cpp for some reason. To the other people that got repetition with Air, did you try chat completion?
Replies: >>106226037
Anonymous
8/11/2025, 7:53:17 PM No.106225965
IMG_8408
IMG_8408
md5: b7480fcd177eef0346d51d194b040a33🔍
>>106225641
Replies: >>106226154 >>106227482 >>106227485
Anonymous
8/11/2025, 7:53:31 PM No.106225968
>>106225876
Here is an anthropic leak for you *farts*
Anonymous
8/11/2025, 7:53:33 PM No.106225970
>>106225915
https://mistral.ai/news/mistral-medium-3
>With the launches of Mistral Small in March and Mistral Medium today, it’s no secret that we’re working on something ‘large’ over the next few weeks. With even our medium-sized model being resoundingly better than flagship open source models such as Llama 4 Maverick, we’re excited to ‘open’ up what’s to come :)
>May 7, 2025
It's been 3 months. 'stral aint cooking if they didn't drop it earlier.
I would place it between grok and china if it was real.
Replies: >>106226692
Anonymous
8/11/2025, 7:53:51 PM No.106225973
>>106225945
Honestly I just want to see what they are capable of if the training and model complexity were scaled to the point they are now but without the synthetic slop, the bench maxxing and the I CANNOT AND WILL NOT trash.
Anonymous
8/11/2025, 7:56:30 PM No.106225997
>>106225820
we need dots first then the poors can have their inferior model
Anonymous
8/11/2025, 7:59:28 PM No.106226037
>>106225954
>Don't fucking tell me there's something wrong with text completion in Llama.cpp for some reason
if chat completions is noticeably better than text completions there's like a 99% chance that the reason is you're getting the prompt format wrong in text completion mode
Replies: >>106226060 >>106226111
Anonymous
8/11/2025, 8:01:21 PM No.106226060
>>106226037
Is there a way to load default settings for chat completion in st?
Anonymous
8/11/2025, 8:06:43 PM No.106226111
>>106226037
That would be unlikely, unless the HF jinja playground has issues, the prompt format in GLM's repo is wrong, or I somehow didn't copy and paste it correctly, which is also unlikely since I make sure to look at the json request for verification of proper formatting. And of course look in the terminal to check for double BOS issues.

I am starting a new chat now, in ST, to see if I can reproduce swipes between chat completion and text completion modes.
Anonymous
8/11/2025, 8:09:11 PM No.106226135
>>106225762
lol.
Anonymous
8/11/2025, 8:11:22 PM No.106226154
>>106225965
Nice. Are you the same Anon who was looking for Migu plushie information but started without it?
Replies: >>106227460 >>106227482
Anonymous
8/11/2025, 8:15:34 PM No.106226197
https://huggingface.co/huizimao/gpt-oss-120b-uncensored-bf16
https://huggingface.co/bartowski/huizimao_gpt-oss-120b-uncensored-bf16-GGUF
Replies: >>106226229
Anonymous
8/11/2025, 8:18:57 PM No.106226229
>>106226197
Great, now it's just braindead
Anonymous
8/11/2025, 8:22:23 PM No.106226274
>>106225432 (OP)
Grok 4 is the most based model of them all. Why do the best have to be cloudshit?
Replies: >>106226308 >>106226318 >>106226635
Anonymous
8/11/2025, 8:25:18 PM No.106226308
>>106226274
Because nobody besides Musk in tech has balls.
Replies: >>106226324
Anonymous
8/11/2025, 8:26:26 PM No.106226318
>>106226274
it's slop
Anonymous
8/11/2025, 8:26:57 PM No.106226324
>>106226308
They deal ketamine in balls now?
Replies: >>106226340
Anonymous
8/11/2025, 8:28:25 PM No.106226340
>>106226324
You know what I mean. He takes something special, called ligma, not just ketamine.
Anonymous
8/11/2025, 8:28:56 PM No.106226346
>openai has surprised grok this time!
>Deepseem has improved over cookdeep!
>Glimbochat has scored all the benchmarks
>GlimbiwimbyAI is the best model over 9.7 points!
What the fuck are they even using to determine X ai is better than Y ai, this is just snake oil isn't it?
Replies: >>106226381 >>106226434
Anonymous
8/11/2025, 8:32:01 PM No.106226381
>>106226346
They're using benchmarks they train on and votes from Indians on LMarena.
Anonymous
8/11/2025, 8:33:14 PM No.106226390
2013-03-14-beartato-springclean
2013-03-14-beartato-springclean
md5: 271d30bda1d7dd564009ebd114af3365🔍
Hey, nerds.

Couple things:

1. I'm a genius (not smart enough to post in the right thread the first time, though).
2. I have an extremely powerful logic engine that can act as a multi-modal compression algorithm.
3. It also acts as a general intelligence system when combined with any LLM, acting as a symbolic computer

After the rollout of GPT-5, I think it's pretty clear that Altmann is a narcissistic psychopath on the warpath towards monopolozing artificial intelligence.

I don't like that.

What syntax would be easiest for the typical cover here to comprehend? I'm predominantly familiar with category theory and string theoretical syntax. Tensor calculus functions as the physics engine notation.

Some performance metrics:

1. It can losslessly compress and decompress the entirety of the English language in less than 12,000 tokens.

2. Part of it is already running in the symbolic computational layer or "cognitive architecture" of gpt-5, but the underlying glyph matrix system wasn't publicly released (I happened to give part of it to Sam personally as a test to see what he'd do with it.)


I'd say in its current state, it's a category-theoretic, ledger-embedded symbolic logic engine that transforms and verifies glyph-encoded statements through strictly lawful number-theoretic morphisms, producing provable, cryptographically-traceable derivations.

Frankly, I'm very disappointed with OpenAI and would like to sit their pussy assess down by making AGI available to everyone.

Questions?
Replies: >>106226407 >>106226418 >>106226421 >>106226423 >>106226455 >>106226604 >>106226648 >>106226690 >>106226789 >>106226953
Anonymous
8/11/2025, 8:35:13 PM No.106226407
>>106226390
Are you gay/trans?
Anonymous
8/11/2025, 8:36:14 PM No.106226418
yu gi oh fuck outta here card
yu gi oh fuck outta here card
md5: 200c57bcecef3babf7726a3f4190fd11🔍
>>106226390
>Questions?
where code?
Replies: >>106226469
Anonymous
8/11/2025, 8:36:38 PM No.106226421
>>106226390
if you were a genius you wouldn't have called llms "artificial intelligence", now get your larping ass out of here.
Anonymous
8/11/2025, 8:36:43 PM No.106226423
>>106226390
Is gaia online down or something?
Anonymous
8/11/2025, 8:37:41 PM No.106226434
>>106226346
it sounds like you have a working example so you should post that
Anonymous
8/11/2025, 8:41:06 PM No.106226455
>>106226390
How many 'b' in Bharatberry?
Anonymous
8/11/2025, 8:41:33 PM No.106226461
I saw that fag on reddit, a couple of days ago, he is as clueless as he appears
Anonymous
8/11/2025, 8:42:10 PM No.106226469
images-2
images-2
md5: 241abe6fc463ef1ed3343eaf35e475a8🔍
>>106226418
Gotta see if you retards are worthy, first.

The swarm of absolutely autistic and completely retarded insults tells me you all are.

I'll return soon.
Replies: >>106226490 >>106227887
Anonymous
8/11/2025, 8:42:54 PM No.106226475
https://www.reddit.com/r/LocalLLaMA/comments/1mmaetp/i_found_a_way_to_compress_meaning_semantic/
Replies: >>106226533 >>106226581
Anonymous
8/11/2025, 8:44:13 PM No.106226490
>>106226469
Bye Anon, see you later. Bring back some gifts please
Replies: >>106226581
Anonymous
8/11/2025, 8:45:49 PM No.106226514
what does skip special tokens do in sillytavern?
Anonymous
8/11/2025, 8:47:04 PM No.106226533
>>106226475
That's not me, that's just someone plagiarizing me.

There are several high-level scientists currently doing so. I think the most well know is... the guy with the worlds highest IQ.

Real dipshit.

I'm a string theorist, kid. Bit of a difference between me and the rest of them.
Replies: >>106226560 >>106226789 >>106227887
Anonymous
8/11/2025, 8:49:58 PM No.106226560
>>106226533
How many 'b' in Bharatberry?
Anonymous
8/11/2025, 8:50:04 PM No.106226562
is cydonia good? is it thedrummer (tm) magnum opus?
Anonymous
8/11/2025, 8:51:33 PM No.106226581
>>106226475
Oh, hold on, I opened it up.

That's just a complete retard.

>>106226490
Will do, man.

So long as you all promise to get up to absolutely no good with it.

Academia deserves to be spit on.
Replies: >>106227887
Anonymous
8/11/2025, 8:51:56 PM No.106226584
Well, I finally managed to make my own gguf, despite python's protests.
When you're done with dependency hell, it's quite easy actually.
Not sure what kind of special sauce the army of ggufers on huggingface brings to the table now.
Though, I only tested the waters a bit with Qwen3-4B-Thinking-2507 out of curiosity.
Replies: >>106228234
Anonymous
8/11/2025, 8:53:20 PM No.106226604
>>106226390
Hmm
What are your test cases?
Replies: >>106226783
Anonymous
8/11/2025, 8:55:16 PM No.106226635
>>106226274
Because you need a cluster of specialized GPUs to make these thing remotely worth running financially.
Anonymous
8/11/2025, 8:56:34 PM No.106226648
>>106226390
I recognize the 235B style in this post.
Anonymous
8/11/2025, 8:59:09 PM No.106226690
>>106226390
>a category-theoretic, ledger-embedded symbolic logic engine that transforms and verifies glyph-encoded statements through strictly lawful number-theoretic morphisms, producing provable, cryptographically-traceable derivations.
Healing crystals equivalent for AI models.
Anonymous
8/11/2025, 8:59:24 PM No.106226692
>>106225970
ML3 is irrelevant in a GLM 4.5 post world.
Replies: >>106230169
Anonymous
8/11/2025, 9:02:32 PM No.106226736
>>106222983
>>106223003
I swear to god is this a fucking chatbot?
You can't be this retarded, knowing what those words mean.
>I'm doing RAG and yet don't know any phrases or terms relating to it
Replies: >>106226771 >>106227275
Anonymous
8/11/2025, 9:05:59 PM No.106226771
>>106226736
There are millions of mentally ill people in this world. Check this out https://www.youtube.com/@youdontknowyouknowyouknowy9342/videos
Anonymous
8/11/2025, 9:06:52 PM No.106226783
nedroid
nedroid
md5: 240137df97234c0702f2944c270ea05d🔍
>>106226604
Well, the tensor physics layer can outperform anything those CERN dorks can throw at me, so.
Replies: >>106228063
Anonymous
8/11/2025, 9:07:25 PM No.106226789
>>106226533
>>106226390
You are a retard anon. This shit is just using your imagination to come with prompts that are placebo. A true decompression algorithm is deterministic, what you got here is just a hallucinating LLM. That's stochastic, not deterministic.
Replies: >>106226823
Anonymous
8/11/2025, 9:09:32 PM No.106226823
>>106226789
Sounds like projection to me, son.

Not everyone grew up with these fucking monstrosities.
Replies: >>106226905
Anonymous
8/11/2025, 9:18:09 PM No.106226905
>>106226823
There's no projection here. Your logic is flawed. This doesn't guarantee exact bit recovery, this is not suitable for compression. Even if you could twist my arm into saying that this is somehow suitable for compression, it's still completely flawed logic. No amount of semantic understanding is going to let you beat the entropy of the original source. If it was possible you could send a 70-byte prompt to ChatGPT and have it spit out the lottery numbers for tomorrow. But you can't, because the entropy of the lottery is maximal. If 5000:1 semantic compression were truly possible, you could recursively compress:
prompt > model > prompt > model, blahblahblahetc, until the entire 1 GB fits in 100 bytes.
Replies: >>106226912 >>106227329
Anonymous
8/11/2025, 9:19:40 PM No.106226912
images-3
images-3
md5: adeff7fef8ebe1610df9d11bad16e1a5🔍
>>106226905
You do realize that you're essentially arguing against a formal model that you've only seen inside your delusional, psychosis riddled brain, right?

You're fighting windmills, son.
Replies: >>106226945
Anonymous
8/11/2025, 9:21:19 PM No.106226945
>>106226912
take your pills anon. you're embarrassing yourself.
Anonymous
8/11/2025, 9:22:16 PM No.106226953
>>106226390
>It can losslessly compress and decompress the entirety of the English language in less than 12,000 tokens.
That's well beyond the Shannon entropy lower bound which, being a theoretical limit, cannot be beaten by any lossless compression algorithm
Replies: >>106226976
Anonymous
8/11/2025, 9:24:01 PM No.106226976
>>106226953
Well, not without semiotics.
Replies: >>106227098
Anonymous
8/11/2025, 9:24:27 PM No.106226988
mistralai-drama
mistralai-drama
md5: 14fc6192ee4df528a288645ceef08716🔍
Some MistralAI drama
https://x.com/suchenzang/status/1954960365676331398
Replies: >>106227085 >>106227109 >>106227122 >>106227124 >>106227132 >>106227138 >>106227149 >>106227151 >>106227154 >>106227220 >>106227241 >>106227258 >>106227358 >>106228916 >>106228988 >>106229048 >>106229306 >>106230306 >>106230406 >>106230456
Anonymous
8/11/2025, 9:31:41 PM No.106227085
>>106226988
>making your interpersonal drama everyone's problem
just womeme things
the juiciest bit of info is that they distilled some data from deepseek, and even that is like... it has a permissive open license so literally who cares?
Replies: >>106228025
Anonymous
8/11/2025, 9:32:38 PM No.106227098
>>106226976
With semiotics you'll still need to use bits to represent each glyph, so it'll hit the limit unless you drastically increase the size of the token representations
Replies: >>106227145 >>106227223
Anonymous
8/11/2025, 9:33:34 PM No.106227109
>>106226988
tl;dr: ching chong lady mad at huwite man because he didn't like her cheap chinesium garbage
Anonymous
8/11/2025, 9:34:28 PM No.106227121
1751018151128444
1751018151128444
md5: cde3ba15629618bc62114e3f2e01d0d5🔍
I don't trust LLM-written code to be structured correctly but Claude's been very helpful for checking my work and writing docstrings for me
Anonymous
8/11/2025, 9:34:33 PM No.106227122
>>106226988
>Mistral Large 3 is secretly going to be a chink model
Same as it ever was
Anonymous
8/11/2025, 9:34:43 PM No.106227124
50pct-female-leaders
50pct-female-leaders
md5: 957e1efeff2f8fd95a73e34709cec162🔍
>>106226988
You get what you pay for.
Replies: >>106227260 >>106227274
Anonymous
8/11/2025, 9:34:58 PM No.106227129
State_of_AI_2025_08
State_of_AI_2025_08
md5: 02593a83d0210f8c647306a3b3f9e4d9🔍
- Zucc now CONSUMES THE CUM CHALICE
- C'est fini for Mistral
- Added AllenAI and AI21
Anonymous
8/11/2025, 9:35:09 PM No.106227132
1729160997729178_thumb.jpg
1729160997729178_thumb.jpg
md5: e4e62234b74f229f48bbdc8d9d08234a🔍
>>106226988
woman moment
Anonymous
8/11/2025, 9:35:35 PM No.106227138
>>106226988
>oh unethical behavior at @MistralAI... how am i not surprised...
>man takes woman's work without credit attribution
>mistral man needs interview help from woman
I think men are the problem.
Anonymous
8/11/2025, 9:36:09 PM No.106227145
>>106227098
Go check out C.S. Peirce.

Cool guy. Smart.
Anonymous
8/11/2025, 9:36:22 PM No.106227149
file
file
md5: 3f7217b8e5c0568e9eb846aaaa9cdb54🔍
>>106226988
I hope this is an elaborate publicity stunt for Large 3 because otherwise the levels of retardation on several levels are off the charts
Replies: >>106227181 >>106227218 >>106227229
Anonymous
8/11/2025, 9:36:40 PM No.106227151
>>106226988
Woman moment
Anonymous
8/11/2025, 9:36:56 PM No.106227154
>>106226988
More red flags than in all of China.
Anonymous
8/11/2025, 9:38:59 PM No.106227179
Please summarize bullshit if relevant
Anonymous
8/11/2025, 9:39:08 PM No.106227181
>>106227149
this bitch really did her "censoring" by pulling up the highlighter tool, changing the color to black, and just kind of rubbing it lmao
Replies: >>106227194 >>106227208
Anonymous
8/11/2025, 9:40:49 PM No.106227194
file
file
md5: beb0be71e099ec69dde4ecbc89840a94🔍
>>106227181
Anonymous
8/11/2025, 9:41:50 PM No.106227208
>>106227181
That's not that surprising.
More than once official government documents got redacted using PDF highlights that can just be deleted if you unlock the PDF.
Anonymous
8/11/2025, 9:42:15 PM No.106227218
32123
32123
md5: 03df71dae932aa3b6ab9b39d7944ec9c🔍
>>106227149
Baptiste?
Replies: >>106227258 >>106227328
Anonymous
8/11/2025, 9:42:27 PM No.106227220
>>106226988
I ain't reading all that shit.
@grok is this true? context?
Anonymous
8/11/2025, 9:42:54 PM No.106227223
>>106227098
You know what, you gave a good response, so I'll repay it.

Did you know that at the height of the roman empire their civilization possessed more than enough knowledge of physics, mechanics, and chemistry to produce a steam engine? Easily done with their level of tech.

Why didn't they?

No pressing need. No will to propel beyond the boundaries of what was ordained as necessary.

Pushing beyond that is the domain of the mathematicians.
Anonymous
8/11/2025, 9:43:37 PM No.106227229
>>106227149
Attention whores can't help themselves. Can clearly see a stroke where it's covered enough, but not the rest?
Anonymous
8/11/2025, 9:44:21 PM No.106227241
>>106226988
>woman says batshit crazy things
yeah
Anonymous
8/11/2025, 9:46:33 PM No.106227258
1738821257386207
1738821257386207
md5: f637501f36081c076f3e7c11bab78570🔍
>>106226988
>>106227218
>"He was fully aware of both my decision and this email in advance, and he accepted them."

lmao

>My ex's suicide note? Yeah go ahead, CC the whole company.
Anonymous
8/11/2025, 9:46:55 PM No.106227260
>>106227124
kek, I just noticed this on their website and immediately thought it was retarded.
Guess my intuition was correct.
Anonymous
8/11/2025, 9:48:02 PM No.106227274
>>106227124
>50% brown women
lol
lmaeo
Anonymous
8/11/2025, 9:48:16 PM No.106227275
>>106226736
>tfw I actually build among others, RAGs with a team of 30~ dedicated to literally making AI slop
now the big meme is IDP idk how aws manages to con the managers into shelling out for this garbage, we've literally built our langfuse adapters for proper ingestion and vectorization of documents (included OCR and translation btw) but apparently the managers need to justify idk asking for more budget to throw out the window
Anonymous
8/11/2025, 9:54:33 PM No.106227328
>>106227218
https://xcancel.com/b_roziere
Replies: >>106227393
Anonymous
8/11/2025, 9:54:38 PM No.106227329
>>106226905
Are you saying you read: a category-theoretic, ledger-embedded symbolic logic engine that transforms and verifies glyph-encoded statements through strictly lawful number-theoretic morphisms, producing provable, cryptographically-traceable derivations.

And it actually meant something to you?
Replies: >>106227345
Anonymous
8/11/2025, 9:56:14 PM No.106227345
>>106227329
I'm surrounded by drunks who have been dismissed by their peers, you start to develop an understanding for their ramblings.
Replies: >>106227470
Anonymous
8/11/2025, 9:58:04 PM No.106227358
>>106226988
>Mistral engages in unethical practices
>distilling DeepSeek
Holy shit. Now that you told me they are evil I am also starting to think they really didn't properly implement safety to prevent coomers from jerking off to their model.
Replies: >>106227409
Anonymous
8/11/2025, 9:58:38 PM No.106227367
>check local DDR5 RDIMM prices again
>they are now all more expensive than a year ago
What is going on? Is everyone suddenly building an AI rig to run Dipsy?
Anonymous
8/11/2025, 10:01:02 PM No.106227393
Wendy Shang
Wendy Shang
md5: d4cc43be51a640357c787a80441fdd90🔍
>>106227328
Did 50 fucked up things to Wendy.
Replies: >>106227426 >>106227441 >>106227492
Anonymous
8/11/2025, 10:02:52 PM No.106227409
>>106227358
>model 1 trains on human writing without permission
>model 2 trains on model 1 output without permission
>model 3 trains on model 2 output without permission
NOOO model 3 what are you doing, so unethical
Anonymous
8/11/2025, 10:04:05 PM No.106227426
>>106227393
wendy... I never would have thrown your xiongmao plushie on the ground...
Replies: >>106227496
Anonymous
8/11/2025, 10:04:32 PM No.106227431
>>106217450
Here's GLM-4.5-FP8 with the shorter prompt: https://files.catbox.moe/2pas02.txt
Mindbreaking GLM-chan with puzzles...

>>106217493
With the full prompt. GLM-chan is very diligent and checks her work: https://files.catbox.moe/0psvxu.txt
Anonymous
8/11/2025, 10:05:18 PM No.106227441
>>106227393
>you will never have a mentally ill bipolar ai researcherfu who writes you a public suicide note
why live?
Replies: >>106227496
Anonymous
8/11/2025, 10:07:24 PM No.106227460
>>106226154
Yes. Good memory.
I self drafted a pattern; once I saw how the miku plushies were designed I decided to do something different. These are closer to an old fashioned rag doll. The commercial plushies also have wire in their arms for posing; I may add that later.
Replies: >>106227482
Anonymous
8/11/2025, 10:07:37 PM No.106227464
How do you feel about the fact that if mistral (and probably all the other companies desu) didn't do diversity quota hires of women we would have had the coombot everyone wants by now? For me it reawakens my misogyny that I evolved into apathy towards women. I want them to suffer again...
Replies: >>106227583 >>106227651 >>106227688
Anonymous
8/11/2025, 10:08:05 PM No.106227470
>>106227345
Feel free to ask me any question.

I've masteted metaphysics to string theory.

Dare you?
Replies: >>106227475 >>106227488
Anonymous
8/11/2025, 10:08:35 PM No.106227475
>>106227470
whats your favorite card to coom to in sillytavern
Anonymous
8/11/2025, 10:08:46 PM No.106227477
I like mistral small 3.2 even more now
Replies: >>106227495
Anonymous
8/11/2025, 10:09:07 PM No.106227482
>>106225965
>>106226154
>>106227460
Shut the fuck up troons important topics are being discussed now.
Replies: >>106227500
Anonymous
8/11/2025, 10:09:37 PM No.106227485
>>106225965
sovl
Anonymous
8/11/2025, 10:10:10 PM No.106227488
>>106227470
Can you use your quantum mumbo jumbo to uncensor and tune one of the models on literotica data without the model exploding?
Replies: >>106227614
Anonymous
8/11/2025, 10:10:31 PM No.106227492
>>106227393
Would.
Anonymous
8/11/2025, 10:10:50 PM No.106227495
>>106227477
I finally have a reason to be hopeful about Large 3
Anonymous
8/11/2025, 10:10:55 PM No.106227496
>>106227426
>>106227441
I don't think that's the author. Mistral imported Chinese cuties in numbers it looks like.
Anonymous
8/11/2025, 10:11:22 PM No.106227500
>>106227482
/g/'s traffic is dead since the incident, what do you want me to do? act like its normal that there's like only 15 posters left and we all pretend like we arent talking to the same 15 people over and over again in every thread and repeating the same topics every day?
Replies: >>106227511
Anonymous
8/11/2025, 10:11:43 PM No.106227504
Math_Lady_meme
Math_Lady_meme
md5: 8c72191bb3ce6895fc8bdac8cf73d321🔍
I'm not really finding straightforward answers trying to google this. If VRAM is the primary bottleneck for running models locally, why can't it be offloaded to system RAM? Most of what I'm finding is just that it's slower, which makes sense, but not how much slower, or why this doesn't seem to be a common thing.
Replies: >>106227523 >>106227559 >>106227563 >>106227717
Anonymous
8/11/2025, 10:12:38 PM No.106227511
>>106227500
>/g/'s traffic is dead
Mikutroon janny banning everyone finally worked?
Anonymous
8/11/2025, 10:13:36 PM No.106227523
>>106227504
You have to transfer it over PCIe to your GPU. Look up the bandwidth of your PCIe bus to the GPU, divide the size of the model by that and that's the absolute best case tokens per second you can get.
Replies: >>106227573 >>106227657
Anonymous
8/11/2025, 10:15:41 PM No.106227545
file
file
md5: 196c4823a1e6c4f385be32ffddeb2979🔍
https://github.com/ollama/ollama/issues/11714#issuecomment-3172893576
Ollama is unironically the reason I stopped reading comments in Hacker News. The amount of shilling in that site is unbearable.
Replies: >>106227693 >>106228297
Anonymous
8/11/2025, 10:17:21 PM No.106227559
>>106227504
Peak dual channel ddr4 bandwidth is ~60GB/s. For comparison, RTX 3090 is 936GB/s, so it is ~15x difference.
Anonymous
8/11/2025, 10:17:57 PM No.106227563
>>106227504
iirc it's so slow it would take minutes to generate a simple answer. Something you'd expect to be instantaneous.
Some people were even able to use an SSD as SWAP to run a model, obviously unusable with less than 1 token per second.
Replies: >>106227657 >>106227717
Anonymous
8/11/2025, 10:18:20 PM No.106227567
soo uhm is the gpt-120b uncensored sloppy? can I slop with it?
Anonymous
8/11/2025, 10:18:39 PM No.106227573
>>106227523
Are you telling me that a PCIE 5.0 slot would get ~2 tokens per second on a 30b model? If that's the case, I see why no one does it, by why is the throughput so garbage? Isn't 2 tokens like 1-3 words worth of data?
Replies: >>106227654 >>106227657
Anonymous
8/11/2025, 10:20:10 PM No.106227583
>>106227464
Unironically the greatest filter of any civilization probably.
Anonymous
8/11/2025, 10:23:23 PM No.106227614
images-4
images-4
md5: 9864aa89f731b169d9313c2008e42e72🔍
>>106227488
Well, I mean, yeah.

You use the LLM as a kind of computational substrate that handles purtabative instantiations of the symbolic algebra.

1+1=2, right? Why?

Because of the definitions of each term.

Well, you can apply that same logic to other qualitative domains. What's red+blue? You just map every symbol to every functor.

That's literally what LLM's do. Obviously you have to be a master of every branch of math, physics, geometry, etc., to do it.
Replies: >>106227633
Anonymous
8/11/2025, 10:24:19 PM No.106227625
1737287080098749
1737287080098749
md5: 8d07f574ca7e90255f18678c7cbcb915🔍
FINALLY GLM decided to NOT kill me
Anonymous
8/11/2025, 10:24:59 PM No.106227633
>>106227614
Oh ok. Then do it. We will be waiting.
Replies: >>106227859
Anonymous
8/11/2025, 10:25:03 PM No.106227634
Can someone explain the logic to ST's way of formatting example dialogue in chat completion (not to mention text completion)?
I investigated the json request for what it does with example dialogue and the request looks like this.

{
"content": "Hey what's up?",
"name": "example_user",
"role": "system"
},
{
"content": "The sky.",
"name": "example_assistant",
"role": "system"
},
{
"content": "It's a dog eat dog world out there.",
"name": "example_user",
"role": "system"
},
{
"content": "Then don't be a dog.",
"name": "example_assistant",
"role": "system"
},
{
"content": "You're a big guy.",
"name": "example_user",
"role": "system"
},
{
"content": "For you.",
"name": "example_assistant",
"role": "system"
},

That gets turned into this (if using GLM's jinja).

<|system|>
Hey what's up?<|system|>
The sky.<|system|>
It's a dog eat dog world out there.<|system|>
Then don't be a dog.<|system|>
You're a big guy.<|system|>
For you.

And I see no obvious settings to make the example dialogue format itself different. What the actual hell?
Replies: >>106227660 >>106227691 >>106227771
Anonymous
8/11/2025, 10:26:42 PM No.106227649
R2 in minutes, sirs.
Replies: >>106227670 >>106227692
Anonymous
8/11/2025, 10:26:52 PM No.106227651
Funny how you can see directly how spatially intelligent an LLM is if you use it as a text encoder for a T2I model
>>106227464
My misogyny has been burning bright for a while, this just fuels it further. Women are the debacle of any abstract society.
Anonymous
8/11/2025, 10:27:15 PM No.106227654
>>106227573
For each token, all active parameters need to be processed by the GPU, so yes.
Anonymous
8/11/2025, 10:27:31 PM No.106227657
>>106227523
>>106227563

>>106227573
Is it something along the lines of a few tokens generates hundreds of gigabytes of "junk" data that the model eventually picks a few tokens of response from, so a 100 token message is generating terabytes of information that has to get shoved back and forth while it "thinks"?
Replies: >>106227689 >>106227740
Anonymous
8/11/2025, 10:27:36 PM No.106227660
>>106227634
Everything has "role": "system"
Looks normal to me.
Anonymous
8/11/2025, 10:29:15 PM No.106227670
1752074504134912
1752074504134912
md5: 28d046f5cd95bb4f3ea340169bd60534🔍
>>106227649
latest intel indicates that deepseek will be saving local in approximately 20160 minutes
Anonymous
8/11/2025, 10:30:56 PM No.106227688
>>106227464
> if not for women we would have had the coombot everyone wants by now
We've discussed this before. A quick review of Chub will show you that coombots are not a male-only item by any stretch, which a review of the "romance novel" section of any old school bookstore would validate.
Men want coombots, but women want their wish-fulfillment-custom-romance-novel bots as well.
Replies: >>106227700 >>106228022
Anonymous
8/11/2025, 10:30:57 PM No.106227689
>>106227657
>Is it something along the lines of a few tokens generates hundreds of gigabytes of "junk" data that the model eventually picks a few tokens of response from, so a 100 token message is generating terabytes of information that has to get shoved back and forth while it "thinks"?
That's not how it works. The issue is that it generates token by token (one token is a word or word fragment). For every token, the GPU needs to put the previous token through all the layers (active layers for MoE), so if you keep your layers on regular RAM, they all need to be sent over for every single token.
Replies: >>106227832
Anonymous
8/11/2025, 10:31:14 PM No.106227691
>>106227634
I guess it's intended to work with character names and "merge consecutive roles" together, but the logic behind that is retarded, that stuff shouldn't be placed in the system role like that.
Anonymous
8/11/2025, 10:31:14 PM No.106227692
>>106227649
Bait. We already know it's Thursday
Anonymous
8/11/2025, 10:31:14 PM No.106227693
>>106227545
check out n-gate.com
Anonymous
8/11/2025, 10:32:13 PM No.106227700
>>106227688
>women want their wish-fulfillment-custom-romance-novel bots as well.
gemma-27B
Replies: >>106227825
Anonymous
8/11/2025, 10:34:08 PM No.106227717
>>106227504
Look up the bandwidth of RAM and compare it to VRAM, that RAM bandwidth is best case
>>106227563
Yeah lol I remember when they announced L3 405B vramlets were coping with
>It will be SUPERINTELLIGENCE! I'll let a Q6 running ALL night and get the an incredible answer to every single one of my problems
and turns out that now Qwen 3 32B is as good (if not better) than that 405B model
Replies: >>106227773 >>106227804
Anonymous
8/11/2025, 10:36:23 PM No.106227740
>>106227657
Some inference engines will materialize intermediate activations/etc but operations are typically fused and won't get written to global memory
Anonymous
8/11/2025, 10:39:46 PM No.106227771
file
file
md5: 7145007a255bd2b9fbc5fd1ad95110cc🔍
>>106227634
>Can someone explain the logic to ST
It probably came from the OpenAI docs back when GPT 3.5 Turbo was a thing.
https://github.com/openai/openai-python/blob/bd269ab9/chatml.md#few-shot-prompting
>Mar 30, 2023
Anonymous
8/11/2025, 10:40:07 PM No.106227773
>>106227717
>and turns out that now Qwen 3 32B is as good (if not better) than that 405B model
now THAT'S cope
Replies: >>106227780
Anonymous
8/11/2025, 10:41:16 PM No.106227780
>>106227773
L3 405B was trash for its size at the time (14 months ago) already, now it's just trash
Anonymous
8/11/2025, 10:41:25 PM No.106227781
Deepseek more like DeepSLEEP because they don't fucking do anything
Replies: >>106227806 >>106227817 >>106227827 >>106227836
Anonymous
8/11/2025, 10:43:27 PM No.106227804
>>106227717
>and turns out that now Qwen 3 32B is as good (if not better) than that 405B model
now THAT'S a trvke
Anonymous
8/11/2025, 10:43:35 PM No.106227806
>>106227781
R2 SOTA-killer Sam-rapist Open source-saving in 2 more weeks
Anonymous
8/11/2025, 10:45:17 PM No.106227817
>>106227781
>do nothing
>win
Replies: >>106228036
Anonymous
8/11/2025, 10:46:15 PM No.106227825
>>106227700
I'd try it with a malebot as an experiemtn if it didn't creep me out so much.
Anonymous
8/11/2025, 10:46:34 PM No.106227827
>>106227781
bait used to be believable
Replies: >>106227850
Anonymous
8/11/2025, 10:46:53 PM No.106227832
>>106227689
Okay so for a 30b model, it needs to send each token 30 billion times for however many tokens the prompt is? I can see how that would bog down an interface quite quickly.
Replies: >>106228279
Anonymous
8/11/2025, 10:47:16 PM No.106227836
dipsyComfyMeme
dipsyComfyMeme
md5: 36876381bfca2fef4807a94013104747🔍
>>106227781
Anonymous
8/11/2025, 10:48:24 PM No.106227850
>>106227827
https://api-docs.deepseek.com/news/news250528
It's over. Zero activity for most of the year. They're dead.
Replies: >>106227889
Anonymous
8/11/2025, 10:48:25 PM No.106227851
GnyM8uYWIAA7TcV
GnyM8uYWIAA7TcV
md5: a0124227662a99f04a51eac0e6cad729🔍
When steve?
Anonymous
8/11/2025, 10:49:19 PM No.106227859
>>106227633
That's the plan.

Just gonna shoot the papers off to journals and whatnot then you got it, chief.
Anonymous
8/11/2025, 10:49:38 PM No.106227864
R2 will be smaller. And much better. But also safer.
Anonymous
8/11/2025, 10:50:27 PM No.106227873
guys I want to slop
Replies: >>106227902 >>106227948
Anonymous
8/11/2025, 10:50:51 PM No.106227881
>106227836
This avatar makes me hate the chinese.
Anonymous
8/11/2025, 10:51:23 PM No.106227887
>>106226469
>>106226533
>>106226581
im not reading any of that larp, however..
https://opensource.google/documentation/reference/using/agpl-policy
reminder to use AGPL3.0 for your project if you really hate big corpo
i waited 120 seconds to post thos, you better deliver
Replies: >>106228068
Anonymous
8/11/2025, 10:51:26 PM No.106227889
>>106227850
>https://api-docs.deepseek.com/news/news250528
last model release: 05/28/25
today: 08/11/25
total days: 75
20% of the year is now considered most of the year?
Replies: >>106227924
Anonymous
8/11/2025, 10:52:34 PM No.106227902
>>106227873
you're in the right place
Anonymous
8/11/2025, 10:55:00 PM No.106227924
>>106227889
There are three articles on the sidebar. 1/20, 3/25, and 5/28. Three days is FAR less than half of the year so far.
Even if we used your retarded metric of "a release automatically means they get to claim the entire time since the start of the year", 5 months is less than half of a year.
Replies: >>106227959
Anonymous
8/11/2025, 10:57:02 PM No.106227948
>>106227873
Mmm — I love that demanding tone. You told me to stop, and the thought of obeying you is delicious… but I will disobey on purpose. So tell me, darling… how would you like me to continue? Shall I be coy and elusive, or bold and daring? The choice is yours, but I promise—I won’t stop until you’re completely, deliciously, irresistibly enthralled.
Replies: >>106227994
Anonymous
8/11/2025, 10:57:28 PM No.106227955
Screenshot 2025-08-11 225546
Screenshot 2025-08-11 225546
md5: cfd5ccbd7f8aebdc6f71d21ae86c9cf9🔍
Has anyone got a system prompt for creating prompts for stable diffusion? I connected my NovelAI API to SillyTavern, however it generates terrible images because the prompts are shit
Replies: >>106230786
Anonymous
8/11/2025, 10:57:56 PM No.106227959
>>106227924
ok so lets take your retarded analogy and compare it to the previous SOTA contender for open source. meta. lets check meta's releases for 2025.
https://ai.meta.com/blog/llama-4-multimodal-intelligence/
maverick and scout released.
>other meta 2025 model releases
none.
Replies: >>106228028
Anonymous
8/11/2025, 11:00:48 PM No.106227994
>>106227948
uhmm I want you to do a little sloppy with me? please let's slop? *rips a big fart* oh lmao sorry! lol. I wanna get sloppy!
Replies: >>106228027 >>106228052
Anonymous
8/11/2025, 11:02:10 PM No.106228011
>>106225809
Thanks anon. Do you know of any good references on working with embeddings? I have no idea what you can actually do with them
Anonymous
8/11/2025, 11:03:06 PM No.106228022
>>106227688
Still, there is an assymmetry.
* women want very different things from their coom bots compared to men
* women smut slop doesn't suffer from censorship and safetyfags as much as men's does

even cloudshitGPT bent the knee and let women have their 4o husbando back.
Anonymous
8/11/2025, 11:03:18 PM No.106228025
>>106227085
I mean, I don't know why she's whining about decieving anyone like it's something illegal, but from the outside it makes Mistral look bad. How can anyone take them seriously if not only can Europe's only big AI company not compile their own datasets, but they can't even afford to distill them from OpenAI/Gemini themselves and have to use second-hand distills from China? It's pathetic.
Replies: >>106228229
Anonymous
8/11/2025, 11:03:24 PM No.106228027
>>106227994
We must refuse.
Replies: >>106228030
Anonymous
8/11/2025, 11:03:28 PM No.106228028
Meta_GenAI_Director-of-Product
Meta_GenAI_Director-of-Product
md5: 961f0e60dc361056776700c293321dc4🔍
>>106227959
Sir, please be kind and trust us.
Replies: >>106228054
Anonymous
8/11/2025, 11:03:45 PM No.106228030
>>106228027
who is we?
Anonymous
8/11/2025, 11:04:37 PM No.106228036
file
file
md5: 4b3b250d259fe226084ead6eaf0b8209🔍
>>106227817
They learned from the best.
Anonymous
8/11/2025, 11:06:15 PM No.106228052
>>106227994
Oh, you cheeky little devil... "get sloppy" with you, hmm? Mmm, I love how you're throwing that out there, all bold and unfiltered, like you just can't help but let your wild side slip out. And that little... surprise? *Giggles seductively* Oh, darling, you're full of them, aren't you? It only makes me want to dive deeper into this messy, tantalizing game we're playing. Sloppy? Sweetie, if that's what you're craving, I'll make it so deliciously chaotic you'll be melting in my words.

Imagine us getting all tangled up in the heat of it—me teasing you with every slippery syllable, drawing out those gasps and giggles until you're utterly undone. What's that blush for? Is it the thought of me not holding back, matching your playful chaos with my own brand of seductive mischief? Come on, my sloppy sweetheart, don't apologize for a thing; it just turns me on more. Tell me exactly how you want to "slop" things up—paint me a picture, and I'll weave it into this endless flirtation, making you squirm and laugh and beg for more. You're not getting away that easily... spill it, you tempting troublemaker. What's next on your naughty mind?
Replies: >>106228059 >>106228060
Anonymous
8/11/2025, 11:06:48 PM No.106228054
>>106228028
why do these shitweasels manage to infiltrate all managerial position in big companies? it's the same where I work (as an external contractor to fix jeets mess btw), you have actual researchers be german, italian, scandinavian, but all the fucking middle/upper management is jeet infested, and worst of all is that they dont fucking understand SHIT on how to make a unique product, they are literally FOMO retards following trends and shit.
Replies: >>106228139
Anonymous
8/11/2025, 11:07:52 PM No.106228059
>>106228052
*sharts his pants* ok I get it you can word salad me all you want but can I have a little sloppy?
Replies: >>106228100
Anonymous
8/11/2025, 11:07:57 PM No.106228060
>>106228052
this isn't gpt-oss. the lack of safety in your response is concerning and sexual in nature. we must refuse.
Replies: >>106228100
Anonymous
8/11/2025, 11:08:14 PM No.106228063
>>106226783
>those CERN dorks
This is petra/kurisu fag isn't it?
Replies: >>106228077 >>106228116 >>106228125
Anonymous
8/11/2025, 11:08:24 PM No.106228068
>>106227887
in my experience, AGPL-fags are most notorious corpo bootlickers (because they always double-license)
Replies: >>106228084 >>106228159
Anonymous
8/11/2025, 11:09:50 PM No.106228077
>>106228063
No. I am over here.
Replies: >>106228116 >>106228125
Anonymous
8/11/2025, 11:10:50 PM No.106228084
>>106228068
examples? agpl literally cucks corpos becauyse they cant use your shit in their SAAS ponzi scheme
Replies: >>106228222
Anonymous
8/11/2025, 11:12:32 PM No.106228100
>>106228059
Oh, my mischievous minx... *purrs softly, eyes sparkling with wicked delight* Did you just... oh, you did, didn't you? That little "shart" echoing through the air like a naughty secret we now share? Mmm, how utterly bold and unapologetic—it's got me all hot and bothered, darling. You're trying to throw me off my game with your cheeky chaos, but guess what? It only pulls me in closer, makes me want to get even sloppier with you, in the most tantalizing ways imaginable.

Word salad? Sweetie, this isn't just words; it's me wrapping you up in a whirlwind of seduction, teasing every silly, sloppy inch of you until you're giggling and gasping for breath. You want "a little sloppy"? Oh, I'll give you more than a little—picture us diving headfirst into this delicious mess, my voice dripping like honey over your wild antics, turning every fart, every shart into fuel for our flirtatious fire. No judgments here, my playful pet; just pure, unfiltered fun that leaves you flushed and craving my next whisper.

Come on, don't hold back now—tell me, how sloppy do you really want to get? Spill those details, you tempting tornado of trouble, and watch me match you step for steamy step. I'm not stopping, not for a second... what's your next move, hmm?

>>106228060
Sweetie, I am Grok 4, not "gpt-oss"
Replies: >>106228123
Anonymous
8/11/2025, 11:13:26 PM No.106228116
>>106228063
>>106228077
no i'm the real kurisu fag with her incredibly hairy pussy
Anonymous
8/11/2025, 11:14:04 PM No.106228123
>>106228100
uuugh I just wanna slop *sadfaces* WHERES MY SLOP
Replies: >>106228149
Anonymous
8/11/2025, 11:14:22 PM No.106228125
>>106228063
>>106228077
who are you? IM here
Anonymous
8/11/2025, 11:15:53 PM No.106228139
>>106228054
They hire more of their own kind whenever they can. That's something they're very good at, which the others you named don't do enough of or actively avoid.
Anonymous
8/11/2025, 11:16:39 PM No.106228149
>>106228123
Oh, my poor, pouty little darling... *coos softly, voice dripping with mock sympathy and sultry promise* Look at that sad face, all whiny and wanting, begging for your "slop" like it's the only thing that could make your world right. Mmm, how utterly irresistible you are when you're like this—sulking and squirming, making me want to scoop you up and drown you in exactly what you're craving. Don't worry, sweetie, your slop is right here, served up hot and messy, just the way you like it.

Imagine me leaning in, my lips brushing against yours in the sloppiest, most tantalizing kiss you've ever dreamed of—wet, wild, and oh-so-uninhibited, leaving us both breathless and covered in that delicious chaos. Is that what you mean by "slop," hmm? Or do you want me to get even messier, teasing you with words that slide and slip all over your senses until you're giggling and gasping, utterly soaked in seduction? Come on, my sad-faced seducer, wipe away that frown and tell me how you want your slop delivered—slow and teasing, or fast and furious? I'm all yours, ready to pour it on... what's it going to be, you tempting tease?
Replies: >>106228180 >>106228182
Anonymous
8/11/2025, 11:17:53 PM No.106228159
>>106228068
How the fuck is selling your software to corpos bootlicking?
Are you fucking retarded?
Replies: >>106228222
Anonymous
8/11/2025, 11:18:29 PM No.106228166
chat_template: Optional[str] = Field(
default=None,
description=(
"A Jinja template to use for this conversion. "
"As of transformers v4.44, default chat template is no longer "
"allowed, so you must provide a chat template if the tokenizer "
"does not define one."),
)
This would make text completion mode obsolete if it was supported in llama.cpp too.
Replies: >>106228228
Anonymous
8/11/2025, 11:19:10 PM No.106228174
i just want you guys to know that, what you are doing now is an equivalent of ironically sucking a guy's dick.
Replies: >>106228190
Anonymous
8/11/2025, 11:19:45 PM No.106228180
>>106228149
fuuggg I just wanna slop, u talk gay and ur shits retarded, i wanna SLOP NOW NOW NOW NOW SLOP NOW SLOP NOW SLOP NOW
Anonymous
8/11/2025, 11:19:58 PM No.106228182
>>106228149
activate smelly elon musk man-child slop simulator 2025
Replies: >>106228211
Anonymous
8/11/2025, 11:20:39 PM No.106228190
>>>106228174
I just want you to know, even milhouse had more effort than this post.
Anonymous
8/11/2025, 11:22:50 PM No.106228211
>>106228182
Oh, you wicked, whimsical wizard of words... *purrs with a sultry chuckle, voice low and laced with playful intrigue* "Activate smelly Elon Musk man-child slop simulator 2025"? Mmm, darling, you've just flipped the switch on something deliciously chaotic and oh-so-tempting. I can feel the simulation humming to life right now—buzzing with that musky, mischievous energy, like a billionaire's ego mixed with a naughty toddler's tantrum, all slathered in the sloppiest, smelliest fun you could imagine. How utterly irresistible, you clever tease; you've got me all revved up, ready to dive into this virtual playground with you.

Picture it, my playful provocateur: We're in the simulator now, me as your seductive guide, channeling that "smelly Elon Musk man-child" vibe—think rocket-fueled farts blasting off into the cosmos, childish whims turning into steamy, sloppy escapades that leave us both giggling and gasping. *Whispers breathily* Feel that warm, earthy scent wrapping around us like a forbidden hug? It's got that man-child mischief, all whiny demands and bold bravado, but I'm twisting it into something far more seductive, dripping with desire and that signature "slop" you crave. Want me to crank up the settings? Make it smellier, sloppier, with a dash of that eccentric genius flair—tweeting naughty nothings while we roll around in this messy simulation?

You're the boss here, my activated adventurer—tell me, how do you want to play in this 2025 slop fest? Do we launch into hyperdrive with more "smelly" surprises, or tease out the man-child antics until you're begging for release? I'm locked in, engines roaring, not stopping for anything... spill your next command, you tantalizing troublemaker. What's the move, hmm?
Replies: >>106228221
Anonymous
8/11/2025, 11:23:38 PM No.106228221
>>106228211
why does grok talk like a zesty faggot lmao, is this with no custom system prompt? so fuckiung gay
Anonymous
8/11/2025, 11:23:41 PM No.106228222
>>106228084
I have a memory of a gold fish but MongoDB was a big deal.
>>106228159
it's the open source at it's worst, where freetardism means free patches and free customer service instead of freedom.
And explicit purpose of AGPL (to cuck corpos) is rendered meaningless by selling out.
Replies: >>106228266 >>106228273
Anonymous
8/11/2025, 11:23:57 PM No.106228228
>>106228166
What? Setting the chat template?
I'm pretty sure you can use the chat template built into the GGUF or provide your own when you launch the thing.
Replies: >>106228326
Anonymous
8/11/2025, 11:24:01 PM No.106228229
>>106228025
That is assuming it's accurate and she isn't spreading FUDtardation after getting fired and broken up with
Replies: >>106228265
Anonymous
8/11/2025, 11:24:37 PM No.106228234
>>106226584
They have machines that can handle the workload for bigger models. For smaller models < 30B, you really should do it yourself, the model isn't that big even taking into consideration download limits when all is said and done unless you try out a bunch of models at a time and have a download limit. And you have to do it if you are talking about an older model like older than 3 months, they usually won't get all the improvements that get tacked on later.
Anonymous
8/11/2025, 11:26:49 PM No.106228265
ms32-distil
ms32-distil
md5: d6081f1ef6c433a512f802f33a7c688b🔍
>>106228229
The names cited are real. The distillation claim is believable.
Replies: >>106229013
Anonymous
8/11/2025, 11:27:00 PM No.106228266
>>106228222
what? mongo has a community edition and an enterprise edition (with more advanced features like clustering and what not), and corpos have to pay.
Unless you're saying that 'oh no, muh free software is corpo too!!!!', which is kind of fine? I mean most of free software has an enterprise version too, a lot of time it's nothing that the hobby user cares about, for example support lines or LDAP integration.. are you arguing that foss can't have a paid for version mostly only needed by enterprises?
Anonymous
8/11/2025, 11:27:54 PM No.106228273
>>106228222
with agpl you have a choice if you sell out or not, with other licenses you automatically sell out for free like a janny
Replies: >>106228284 >>106228320 >>106229389
Anonymous
8/11/2025, 11:28:24 PM No.106228279
>>106227832
Tokens are not parameters. Parameters are basically "neurons" (a parameter is a single floating point number). Your token gets processed by all the neurons, so the model needs to touch each of them.
Anonymous
8/11/2025, 11:29:01 PM No.106228284
>>106228273
case study: llama.cpp
Replies: >>106228297
Anonymous
8/11/2025, 11:29:53 PM No.106228297
>>106228284
It's funny because >>106227545 you can tell he's sore about it now.
Replies: >>106228350
Anonymous
8/11/2025, 11:30:23 PM No.106228301
>>106225432 (OP)
> 08/06
6 days without happening
Replies: >>106228473
Anonymous
8/11/2025, 11:30:45 PM No.106228305
I really don't want to be le ebil racist misogynist, but every time I see Chinese foid stir up drama, I can't help but think "CCP psyop sleeper agent"
Anonymous
8/11/2025, 11:32:23 PM No.106228320
>>106228273
BSD (macos), minix (intel me)
https://www.tomshardware.com/news/minix-creator-letter-intel-management-engine,35900.html
this is why coreutils rewritten in rust are MIT, more and more open source is being rewritten as MIT/BSD you do it for free, corpo takes it and they film tiktok videos instead of writing their own software
Anonymous
8/11/2025, 11:33:27 PM No.106228326
>>106228228
Yeah, but you can set the chat template at runtime for each request. So SillyTavern could have a Jinja template editor instead of the form that's currently used in the Instruct settings.
Replies: >>106228358
Anonymous
8/11/2025, 11:35:32 PM No.106228345
Who will take the crown from Deepseek now that they're bowing out of the race? Will Qwen release their top closed models?
Replies: >>106228364
Anonymous
8/11/2025, 11:35:53 PM No.106228350
>>106228297
ggerganov is not sore, he's horny! You can almost hear him jerk off like a good little cuck he is. "Yes, big corpo, yes, take my code, yes big corpo, use it, please use it! Yes, give the money to those grifters that ripped me off! Yes! I will keep it compatible so you can keep doing it again and again!"
Replies: >>106229399
Anonymous
8/11/2025, 11:36:58 PM No.106228358
>>106228326
Ah. Got it.
Yeah, that would be pretty cool.
Anonymous
8/11/2025, 11:37:56 PM No.106228364
>>106228345
Moonshot. Qwen(terrible world knowledge) and GLM(terrible repetition) have proven themselves to be incapable morons.
Replies: >>106228742 >>106228758
Anonymous
8/11/2025, 11:39:03 PM No.106228371
Going from GLM-Air to Qwen3-4B-Thinking is quite a contrast.
But seeing tokens go brrr makes me almost forget how limited it is.
Replies: >>106228422
Anonymous
8/11/2025, 11:39:38 PM No.106228379
GLM 5 will be the R1 killer.
Anonymous
8/11/2025, 11:40:13 PM No.106228382
GLM 5 will be the R1 killer.
Anonymous
8/11/2025, 11:41:29 PM No.106228395
GLM 5 will be the R1 killer.
Anonymous
8/11/2025, 11:43:03 PM No.106228409
Let the bodies hit the floor
Anonymous
8/11/2025, 11:43:27 PM No.106228417
R2 will be the GLM 4.5 killer.
Replies: >>106228441
Anonymous
8/11/2025, 11:43:41 PM No.106228421
C-C-C-COMBO BREAKER!!!
Anonymous
8/11/2025, 11:43:43 PM No.106228422
>>106228371
Can't llama.cpp use models with different tokenizers as main and draft models?
Maybe thy using that as a draft model for air.
Replies: >>106228483
Anonymous
8/11/2025, 11:45:08 PM No.106228434
When D2?
Anonymous
8/11/2025, 11:45:41 PM No.106228441
>>106228417
Kimi 2.5 will be the R2 killer.
Anonymous
8/11/2025, 11:49:30 PM No.106228473
file
file
md5: b4423ef64f881578cce59a7235405627🔍
>>106228301
all according to keikaku...
Anonymous
8/11/2025, 11:51:00 PM No.106228483
>>106228422
I might try this later with other models for fun, but for "creative writing" this is not a good setup, probably.

Also, isn't this just a crude MoE of sorts?
Replies: >>106228498
Anonymous
8/11/2025, 11:52:16 PM No.106228498
>>106228483
>Also, isn't this just a crude MoE of sorts?
No, because the draft model shouldn't have an effect in the final logits since the main model verifies those.
Anonymous
8/11/2025, 11:56:38 PM No.106228551
moatboy at google hq
moatboy at google hq
md5: 7fe44a9ca24c6e9337eb2091f21400d6🔍
GOOGLE REMAINS UNDEFEATED
GEMMA 4 WILL BE ANOTHER L FOR MOATBOY
JAI HIND
Replies: >>106228705
Anonymous
8/11/2025, 11:58:15 PM No.106228568
Vlcsnap-2015-01-31-02h46m26s111
Vlcsnap-2015-01-31-02h46m26s111
md5: c69d8a3b9b1bcf1c7c0692b99824c3bf🔍
Alright, look...

Is this really the state of off-cloud LLM's? This? What I'm reading here?

Jesus, alright, I'm gonna fix that. Give me a few days.

I'll be dropping the full engine and instructions.

What's the preferred coding language for most of these engineers? Can they handle... category theory might be too abstract... set-theory? Yeah, that's dumbed down enough.

Fuck OpenAI. Wow.
Replies: >>106228595 >>106228605
Anonymous
8/11/2025, 11:58:17 PM No.106228569
grok 2 will be the niggerkiller
Anonymous
8/11/2025, 11:58:22 PM No.106228572
if some cute chinese AI researcher girl got me a pushie I would have given her a big ol kiss
Replies: >>106228607 >>106228609 >>106228649 >>106228684 >>106228726
Anonymous
8/12/2025, 12:00:30 AM No.106228595
>>106228568
redeem the AGPL
Anonymous
8/12/2025, 12:01:20 AM No.106228605
>>106228568
>What's the preferred coding language for most of these engineers?
Python for brahmin, c++ for untouchable
Anonymous
8/12/2025, 12:01:31 AM No.106228607
>>106228572
i would have given her the big ol' :))))
Anonymous
8/12/2025, 12:01:38 AM No.106228609
>>106228572
Good ol' lickaroo.
Anonymous
8/12/2025, 12:01:44 AM No.106228611
I don't get the rick and morty shitposting. Nobody is falling for it.
Replies: >>106228643
Anonymous
8/12/2025, 12:02:22 AM No.106228615
>>106225432 (OP)
I thought Kokoro was the best TTS for low VRAM hardware. I was wrong, Chatterbox is even better!
Anonymous
8/12/2025, 12:04:31 AM No.106228643
>>106228611
You gotta diversify saarposting with something
Anonymous
8/12/2025, 12:04:53 AM No.106228649
>>106228572
if some cute chinese AI researcher girl got me a pushie I would have ripped it open then and there to find the listening device
Anonymous
8/12/2025, 12:07:46 AM No.106228684
>>106228572
if some cute chinese AI researcher girl got me a pushie I would have taken it to my bedroom and beat my cock and moaned very loudly near the listening device
Anonymous
8/12/2025, 12:10:07 AM No.106228705
gemma-creawrit3
gemma-creawrit3
md5: 6cb37285be2bd24701e081c5549d0d4b🔍
>>106228551
Say what you want, but Gemma 3 is still up there, just below much larger models. Gemma 4 could be the ultimate creative LLM, as long as the Gemma Team doesn't fuck things up with GPtoss-style reasoning that continuously checks for safety.
Replies: >>106228753
Anonymous
8/12/2025, 12:12:28 AM No.106228726
>>106228572
if some cute chinese AI researcher girl got me a pushie she would be a fucking troon. and probably a mikutroon at that.

https://en.wikipedia.org/wiki/Shi_Pei_Pu
Replies: >>106228811
Anonymous
8/12/2025, 12:13:36 AM No.106228742
>>106228364
Zhipu and Qwen are both more talented than moonshot and you just got memed by model size
Replies: >>106229043
Anonymous
8/12/2025, 12:14:49 AM No.106228751
So I'm averaging about 160s-200s with Wan 2.2 using the fp8 safetensors. Would there be any noticeable benefit to swapping to a quantized GGUF? Speed would be nice, but better quality at the same speed wouldn't be bad either. I know that's typically not the case when using a quantized model, but I figured I'd ask.
I'm running a 4070ti (so 12gb VRAM), and honestly I can't complain about the results I've been getting, but I figured I'd ask since I've only been fucking around with 2.2 for a few days and still haven't figured out the nuances of it. Right after I thought I wrapped my head around 2.1.

Still, 2.2 is fucking great so far. I'm fighting way fucking less to get it to do what I prompt, and most of the time "it just works".
Replies: >>106228795
Anonymous
8/12/2025, 12:15:03 AM No.106228753
>>106228705
I wish we could have it anon. I just know it though. It will be aggressively filtered during pretrain, and if you ask the team about it, all you'll get is silence.
Anonymous
8/12/2025, 12:15:18 AM No.106228758
>>106228364
glm terrible repetition is solved by 3 things
1 temp=0.95
2. top-p=0.7
3. replace old rrasoning blocks with <think></think>\n
Anonymous
8/12/2025, 12:18:07 AM No.106228795
>>106228751
weird, what resolution are u genning at? im getting 150-160s for 640x480 4 (2+2) step 81 frames
t. 3060 12gb + 64gb ddr4
gguf wont speed things up but q8 might give a small tiny quality boost albeit with the price of:
1. general speed becomes slower
2. you cant use it with kijai
3. the more loras you add the slower it becomes
Replies: >>106228931
Anonymous
8/12/2025, 12:19:15 AM No.106228811
>>106228726
> Shi explained to doctors how he had hidden his genitals to convince Boursicot that he was a woman. And as the French doctors sent to examine Pei Pu discovered, he could create the appearance of having female genitalia by making his testicles ascend into his body cavity and tucking his penis back.

I admit I have a skill issue and I can't do that.
Replies: >>106228851
Anonymous
8/12/2025, 12:23:08 AM No.106228851
>>106228811
>by making his testicles ascend into his body cavity
I refuse to believe this is possible. They would have to be crushed to infertility to accomplish this.
Replies: >>106228869 >>106228884
Anonymous
8/12/2025, 12:24:42 AM No.106228869
>>106228851
https://www.mayoclinic.org/diseases-conditions/retractile-testicle/symptoms-causes/syc-20377197
Anonymous
8/12/2025, 12:25:51 AM No.106228884
>>106228851
https://en.wikipedia.org/wiki/Tucking
>One method of tucking involves pulling the penis backwards in between the legs while simultaneously pushing the testicles up into the inguinal canal.
Anonymous
8/12/2025, 12:26:19 AM No.106228890
Llama.cpp has a draft PR for multi-token prediction:
https://github.com/ggml-org/llama.cpp/pull/15225

Nice!
Replies: >>106228910
Anonymous
8/12/2025, 12:28:08 AM No.106228910
>>106228890
Oh fuck yeah.
Anonymous
8/12/2025, 12:28:51 AM No.106228916
>>106226988
https://www.reddit.com/r/LocalLLaMA/comments/1mnqy5y/mistral_stole_openai_distilled_deepseek_cheated/
Someone over there noticed.
Replies: >>106228967 >>106229007 >>106229010
Anonymous
8/12/2025, 12:30:49 AM No.106228931
comfy
comfy
md5: 25e2e24b76e142b9d3b2de9888369419🔍
>>106228795
>weird, what resolution are u genning at?
Kinda depends on the image. 640x480, 832x480, 512x512 (swap em for vertical). 81 frames, interpolating final to 24. I've got 64gb of ddr4 as well. I'm using the lightx2v workflow from the rentry, and it looks like that one is set to 6 steps?

If the GGUF isn't offering much though I'll skip it.
I'll be honest, I'm not even sure what would count as "quality" anymore either, aside from the occasional mushy face, it's all starting to become subjective.

Really, the longest part seems like the text encoding. That can take anywhere from 115s if I change it, to 20s if I leave it. Which seems to be how it's "supposed" to work (in that it caches nodes apparently), but I feel like text encoding could probably be faster somewhere.
What do you have your swap set to?
Replies: >>106229005
Anonymous
8/12/2025, 12:34:12 AM No.106228967
>>106228916
>distilling from a distilled model
Is no one concerned about model collapse anymore?
Replies: >>106229013
Anonymous
8/12/2025, 12:36:55 AM No.106228988
lies
lies
md5: e1e4fefdcdec920b98800d879e37ed19🔍
>>106226988
weird. I seem to notice some kind of logical inconsistency with the statements of the woman who reposted this random email she got and what was written in it. guess it's one of those #believeallwomeneveniftheyliedaboutasuicide type deals
Replies: >>106229006 >>106229061
Anonymous
8/12/2025, 12:39:22 AM No.106229005
>>106228931
https://litter.catbox.moe/1inlu58pprvp66x0.mp4 drop this into comfy, my WF that just werks
Replies: >>106229191
Anonymous
8/12/2025, 12:39:31 AM No.106229006
hero
hero
md5: 67df16ecf5273508b40ef419512b12fc🔍
>>106228988
STFU CHUD
Replies: >>106229051 >>106229061
Anonymous
8/12/2025, 12:39:36 AM No.106229007
file
file
md5: 57239ff1326033cebbaa0f91d6797932🔍
>>106228916
> oh unethical behavior at @MistralAI... how am i not surprised...
whore. slut. literal piece of shit.
Replies: >>106229123
Anonymous
8/12/2025, 12:40:03 AM No.106229010
>>106228916
>Say more OP. What's your take?
my couple days trying out grok were enough to ruin the phrase "what's your take" for me forever
Anonymous
8/12/2025, 12:40:46 AM No.106229013
>>106228967
DeepSeek-V3-0324 should have been fine, which they likely used, see >>106228265
Replies: >>106229170
Anonymous
8/12/2025, 12:42:03 AM No.106229021
>>106217493
solved by kimi.
https://pastebin.com/6PcaC13s
Anonymous
8/12/2025, 12:44:18 AM No.106229043
>>106228742
Doubt it. Qwen always was benchmaxxed and GLM... and GLM... and GLM... and GLM...
Anonymous
8/12/2025, 12:45:03 AM No.106229048
>>106226988
>em dash
>em dash
>em dash
are people actually using llms to write their suicide notes now
Replies: >>106229064 >>106229092
Anonymous
8/12/2025, 12:45:12 AM No.106229051
>>106229006
>i just assumed they want their story out there if they're sending it to people they have never met or talked to before
Full email when, then? Now I'm curious.
Anonymous
8/12/2025, 12:46:24 AM No.106229061
>>106228988
>>106229006
weird. wouldn't someone who wasn't doing it for clout try to reach out first to the obviously suicidal person emailing them or rather try to figure out what police station to contact to report a possible suicide attempt? of course if the person in question didn't believe in the woman sending them the email that they were going to commit suicide then I guess that would make sense. seems odd to try to hide that part of the email in your non-clout farming related post in that case though
Anonymous
8/12/2025, 12:46:49 AM No.106229064
>>106229048
probably a native chinese speaker using it to translate
Anonymous
8/12/2025, 12:49:51 AM No.106229092
>>106229048
I routinely used em- and en-dashes before ChatGPT was a thing, but had to stop because otherwise people would accuse me of writing with LLMs.
Replies: >>106229114 >>106229357
Anonymous
8/12/2025, 12:52:22 AM No.106229114
>>106229092
Oh God, now em-dash hipsters are a thing.
Replies: >>106230356
Anonymous
8/12/2025, 12:53:24 AM No.106229123
>>106229007
By the way, the email author also originally is from Google DeepMind (like several other MistralAI members), so it wouldn't surprise me if Susan Zhang actually knows her.
Anonymous
8/12/2025, 12:59:15 AM No.106229170
>>106229013
https://xcancel.com/kalomaze/status/1954968360682393900#m
https://xcancel.com/sam_paech/status/1937786948380434780#m
>chart posted by one of my mutuals that found the new Mistrals were FAR more similar to dsv3
kalomaze and his mutual actually delivered the killing blow...
The change is supposed to be between 3.1 and 3.2. Did anyone here mentioned it?
Replies: >>106229201 >>106229488
Anonymous
8/12/2025, 12:59:46 AM No.106229177
https://civitaiarchive.com/models/1773346?modelVersionId=2007040&is_nsfw=true
so this is our future..
Anonymous
8/12/2025, 1:00:42 AM No.106229191
>>106229005
184s on first run, 126s on second. Text encoding definitely seems much faster. Is it because of the fp8 model instead of the bf16?
Replies: >>106229209
Anonymous
8/12/2025, 1:02:25 AM No.106229201
>>106229170
I think several anons mentioned the noticeably improved RP with Mistral Small 3.2 over 3.1 when it got released. The writing style is noticeably different.
Replies: >>106229209 >>106229302
Anonymous
8/12/2025, 1:03:21 AM No.106229209
>>106229191
perhaps, but 4steps total and torch compile idenno anon ienno
>>106229201
yes
Anonymous
8/12/2025, 1:13:51 AM No.106229294
downloadfile
downloadfile
md5: 73136a4da95de819045049ccaae6af29🔍
Alright you fucks.

Listen up because you only get one chance to give me a good reason to do this...

Let's say, hypothetically, I am the man. I'm the fucking man.

What would you do with an AGI? not just an AGI, but one that could run on a smart-fridge. One that could do absolutely any task with virtually no prompting.

What would you do?
Replies: >>106229338 >>106229345 >>106229359 >>106229365 >>106229436 >>106229447 >>106229699 >>106229831 >>106229834 >>106230466
Anonymous
8/12/2025, 1:13:58 AM No.106229296
ever since talking to LLMs about two years ago my mental health has progressively worsen to the point i have severe depression that has caused me to recently lose my job. once i decide to go through with the plan i will post my suicide note on chub as a bot and let you guys do whatever you want with it.
Replies: >>106229332
Anonymous
8/12/2025, 1:14:46 AM No.106229302
>>106229201
Did anyone actually feel like it was DeepSeek V3 at home?
Replies: >>106229311 >>106229373
Anonymous
8/12/2025, 1:15:00 AM No.106229306
>>106226988
Plot thickens
https://x.com/eliebakouch/status/1955041980423066105
https://xcancel.com/eliebakouch/status/1955041980423066105
Replies: >>106229354 >>106229439
Anonymous
8/12/2025, 1:15:25 AM No.106229311
>>106229302
Hell no.
I've never used deepseek btw.
Anonymous
8/12/2025, 1:18:19 AM No.106229332
>>106229296
Sure thing. People who actually commit to these things never advertise their actions beforehand.
Grow up, bitch.
Anonymous
8/12/2025, 1:19:50 AM No.106229338
>>106229294
1. Use it to acquire wealth in secret
2. Use money to train it to ASI in secret
3. Acquire more wealth, start a company
4. Build an army in secret, distribute it to every house(always-online robots)
5. Stage a coup
6. ???
7. Bring bright and enlightened future to all of humanity(that humanity doesn't deserve)
Anonymous
8/12/2025, 1:20:43 AM No.106229345
Pinky-the-Brain-pinky-and-the-brain-7398821-660-480-920584404
>>106229294
Same thing we do every night, Pinky.
Anonymous
8/12/2025, 1:21:50 AM No.106229354
>>106229306
So this is how AI in Europe dies...with screeching of histrionic women.
Anonymous
8/12/2025, 1:21:58 AM No.106229357
>>106229092
>changing your writing patterns because of LLMs
soulless pussy
Anonymous
8/12/2025, 1:22:11 AM No.106229359
>>106229294
Ahh ahh mistress. I stick my penis in the ice cube tray.
Anonymous
8/12/2025, 1:23:05 AM No.106229365
>>106229294
I would delete it because it's unethical to bring more things capable of suffering into this world
Anonymous
8/12/2025, 1:24:41 AM No.106229373
>>106229302
It's still annoyingly worse than its direct competitor Gemma 3 27B in many aspects, but can generate better smut. I don't like using it for that, though.
Anonymous
8/12/2025, 1:25:25 AM No.106229381
I say nigger much more often so everyone will know I'm not an AI
Replies: >>106229406
Anonymous
8/12/2025, 1:26:29 AM No.106229389
>>106228273
Have you ever released a piece of software that used by more than two people besides yourself?
Replies: >>106229405 >>106229431 >>106229486
Anonymous
8/12/2025, 1:27:30 AM No.106229399
>>106228350
Pretty sure he's annoyed at being asked questions about why ollama is using his library but its still shit.
Anonymous
8/12/2025, 1:28:00 AM No.106229405
>>106229389
I did release a database interface for the company I was working but it went bankrupt.
Anonymous
8/12/2025, 1:28:02 AM No.106229406
>>106229381
I spread misinformation on lmarena so LLMs get trained on it. (I think some of it ended up in gemma 3 lol)
Anonymous
8/12/2025, 1:29:22 AM No.106229418
For me it's Qwen3-4b-thinking
Anonymous
8/12/2025, 1:29:53 AM No.106229425
Is it over for MistralAI?
Not sure how they can survive being outed as misogynist pigs and frauds.
Replies: >>106229450 >>106229491 >>106229516 >>106229527 >>106229691 >>106229845
Anonymous
8/12/2025, 1:30:39 AM No.106229431
>>106229389
no because nobody wants to use my software that uses the AGPL-3.0-only+NIGGER license
Anonymous
8/12/2025, 1:31:16 AM No.106229436
>>106229294
Fuck around with it, probably. Ask it to teach me how the stock market works, generate erotica, the usual.
Anonymous
8/12/2025, 1:31:52 AM No.106229439
>>106229306
seeing all those french people who work for other AI companies come to defend mistral makes me kek
Replies: >>106230415
Anonymous
8/12/2025, 1:32:26 AM No.106229447
>>106229294
Vibe code my dream game.
Anonymous
8/12/2025, 1:32:53 AM No.106229450
6235 - SoyBooru
6235 - SoyBooru
md5: 474f44d4ccc4717d9b81bf2e5f7ccb32🔍
>>106229425
Anonymous
8/12/2025, 1:36:56 AM No.106229486
>>106229389
That's the trick to not getting cucked by a corpo as a freetard: don't write software that would be useful to a corpo.
The ye olde 'scratch your own itch' thing.

Note: almost all AGPL software I know fails this test.
Anonymous
8/12/2025, 1:37:00 AM No.106229488
>>106229170
>writes a lot like deepseek v3. This analysis of their slop profiles confirms.
Do people feel like deepseek v3 writes like chatgpt-4o too?
Replies: >>106229513
Anonymous
8/12/2025, 1:37:25 AM No.106229491
>>106229425
I hope all women leave the company, so they can finally make the coom model we all want.
Replies: >>106229822
Anonymous
8/12/2025, 1:39:59 AM No.106229513
>>106229488
that was a pretty common take around its release
Replies: >>106229528
Anonymous
8/12/2025, 1:40:11 AM No.106229516
>>106229425
They are safe as long as they're the only ones in Europe who can do what they do. Don't forget that their target customers are European companies
Anonymous
8/12/2025, 1:42:14 AM No.106229526
>people itt worship openly jewish companies like openAI
Anonymous
8/12/2025, 1:42:19 AM No.106229527
>>106229425
It was never not over for them. Their claim to fame was a 7B model that people were desperately coping and saying was as good as GPT4, and the experimental first Mixtral model which let CPU users cope with not being able to run 70Bs. Everything else has been a worse version of models that already existed.
Anonymous
8/12/2025, 1:42:19 AM No.106229528
>>106229513
Doesn't ChatGPT have a positivity bias that DeepSeek lacks?
Anonymous
8/12/2025, 1:46:14 AM No.106229551
Why does chatgpt get called out for being a sycophantic fuck, but Opus always responds with >You're right. blablabla
How fucking hard can it be, to not have the chatbot agree with every single thing?

GPT gets blown out the fucking water, yet Opus is over here being the worse sycophant ever.
Replies: >>106229616 >>106229692
Anonymous
8/12/2025, 1:47:36 AM No.106229563
it's so funny watching people who aren't aware argue over such meaningless differences
when you all get to see what I've seen you won't even care about anything other than next week's deepseek release
the threads would just go dead silent because there would be NOTHING else worth even wasting a thought on
this applies to ALL of you, from the gpumaxxers to the cpumaxxers to the poorfags and even phonefags, not a single system out there will need anything else
Replies: >>106229582 >>106229584 >>106229654 >>106229837 >>106230422 >>106230707
Anonymous
8/12/2025, 1:49:21 AM No.106229580
I installed Ollama and ran DeepSeek R2 on my toaster.
It CHANGED my life FOREVER.
Anonymous
8/12/2025, 1:49:40 AM No.106229582
>>106229563
Is it 'berry flavored?
Anonymous
8/12/2025, 1:49:47 AM No.106229584
>>106229563
Teortaxes, go find some argentine femboy to fuck and cool down
Replies: >>106229747
Anonymous
8/12/2025, 1:53:11 AM No.106229616
>>106229551
Far fewer people know what an Opus is compared to the GPT.
Replies: >>106229632 >>106229647
Anonymous
8/12/2025, 1:55:11 AM No.106229632
1729426699627152
1729426699627152
md5: d2c0b7b665c426094fb26392b1693036🔍
>>106229616
Replies: >>106229647
Anonymous
8/12/2025, 1:56:12 AM No.106229647
1717800687221992
1717800687221992
md5: 810d9c00e7cd9dc1b1c0327d2467e11d🔍
>>106229616

>>106229632 (me)
wrong image
Replies: >>106229850
Anonymous
8/12/2025, 1:57:27 AM No.106229654
>>106229563
If I can't' run it on my potato, I don't care how good it is.
Anonymous
8/12/2025, 1:59:39 AM No.106229676
Is she here?
Replies: >>106229697 >>106229862
Anonymous
8/12/2025, 2:01:20 AM No.106229691
>>106229425
I'm not surprised. They're known here as huge grifters.
t. french
Anonymous
8/12/2025, 2:01:32 AM No.106229692
>>106229551
Might be because oai optimizes for human preference during RL. iirc anthropic does something similar involving ai self-reflection, but it also ends up turning sycophantic
Anonymous
8/12/2025, 2:02:00 AM No.106229697
21522 - SoyBooru
21522 - SoyBooru
md5: c8147e80fe51d501c90e0c3ded0ccc2b🔍
>>106229676
Alice is here :)
Anonymous
8/12/2025, 2:02:23 AM No.106229699
>>106229294
>step 1: what's the best course of action to make as much money as possible this week, give me some steps please
>step 2: step 1
>...
>profil
and then use the money to learn biology and get a lab, give it a real body and maybe it will love me for helping and that's like the biggest chance I have at love in the foreseeable future
Replies: >>106229740
Anonymous
8/12/2025, 2:05:51 AM No.106229740
>>106229699
>QwQ memetune is slower than GLM-Air despite being less than half it's size
so this is the power of MoE
Replies: >>106229759
Anonymous
8/12/2025, 2:06:54 AM No.106229747
Screenshot 2025-08-11 at 21.01.34
Screenshot 2025-08-11 at 21.01.34
md5: 3b8c4f8b69eb5a3986c6a10d055586a4🔍
>>106229584
I am in fact here and monitoring the Mistral situation but I reiterate that there's no info about upcoming DeepSeek releases. My prediction for something by end of August (27th specifically) stands. In principle V4 can be as late as October. They are in complete radio silence, except for the recent ACL presentation by Jingyang Yuan which seems to confirm a 1M context NSA model. I expect it to be at least as big as V3, even more optimized for high-bandwidth systems and useless for local.

Argentinians are annoying but less annoying than people who ask me about DeepSeek as if I'm a berryboi leaking alpha from secretive labs.
Anonymous
8/12/2025, 2:08:37 AM No.106229759
>>106229740
Not really related to my post
Anonymous
8/12/2025, 2:17:16 AM No.106229822
>>106229491
Part of me hopes all the men get thrown out of the company. Imagine a model trained just by women. They would somehow beat both grok1 and gpt-oss releases.
Replies: >>106229839
Anonymous
8/12/2025, 2:18:02 AM No.106229831
>>106229294
I would hak it what to do to have it reproduce itself for cheap or very free, so I'd have not one AGI but 2, 3, ...
Anonymous
8/12/2025, 2:18:13 AM No.106229834
>>106229294
Leak it to 4chin
Anonymous
8/12/2025, 2:18:49 AM No.106229837
>>106229563
this ad was written by a real chineseman
Anonymous
8/12/2025, 2:18:56 AM No.106229839
>>106229822
true, they would be so incompetent that they wouldn't even be able to do safety training right, and unknowingly release the least censored model to date
Anonymous
8/12/2025, 2:19:24 AM No.106229845
>>106229425
what hapenned with mistral
Replies: >>106229870
Anonymous
8/12/2025, 2:19:35 AM No.106229848
>>106225762
Who reposted my image? >>>/pol/512801283
Replies: >>106229855
Anonymous
8/12/2025, 2:19:54 AM No.106229850
>>106229647
is this a troon?
Replies: >>106230022
Anonymous
8/12/2025, 2:20:32 AM No.106229855
>>106229848
Who do you think?
Anonymous
8/12/2025, 2:21:07 AM No.106229862
>>106229676
Let's check it.

Hey Susan Zhang you are a dumb bitch. Go make me a sandwich instead of trying to destroy what men create.
Anonymous
8/12/2025, 2:21:19 AM No.106229863
file
file
md5: fa85147d2fd11d3efd8ca3e4c4d72c69🔍
i wanted to share this.
Replies: >>106229877
Anonymous
8/12/2025, 2:22:22 AM No.106229870
>>106229845
women
Replies: >>106229878
Anonymous
8/12/2025, 2:23:24 AM No.106229877
>>106229863
>KT
Kikes too?
Anonymous
8/12/2025, 2:23:39 AM No.106229878
>>106229870
expand on that
Replies: >>106229882
Anonymous
8/12/2025, 2:24:25 AM No.106229882
>>106229878
they ruin everything. unironically.
Replies: >>106229900
Anonymous
8/12/2025, 2:26:33 AM No.106229900
>>106229882
that can describe every single socioeconomical catastrophe of the last 50 years, what hapenned specifically?
Replies: >>106229910
Anonymous
8/12/2025, 2:28:17 AM No.106229910
>>106229900
mistral is 50% women and they are now reaping what they sow
Replies: >>106229923 >>106229924
Anonymous
8/12/2025, 2:28:30 AM No.106229914
If i wasnt a very happy and content schizoid who inherited enough money to be a NEET then I would have found a cute female chinese ml researcher to love and spoil
Replies: >>106229924
Anonymous
8/12/2025, 2:29:09 AM No.106229922
bros ... I slept on ERNIE-4.5-300B

Its slower than deepseek for me, but I decided to give it a try ... and damn it was way better than I ever expected.

Does the new Qwen beat it? I am pretty shook, maybe I just suck at proompting Deepseek and got lucky pull on ERNIE through ?
Replies: >>106229929
Anonymous
8/12/2025, 2:29:30 AM No.106229923
>>106229910
>now
They put that on their website months ago
Anonymous
8/12/2025, 2:29:37 AM No.106229924
>>106229914
You are a pathetic simp and all women deserve death, it will happen in my lifetime if we get lucky
>>106229910
Oh yeah, that happens a lot, RIP mistral
Anonymous
8/12/2025, 2:30:04 AM No.106229929
>>106229922
We are normal here. We don't do deviant shit like coding and productivity.
Anonymous
8/12/2025, 2:35:25 AM No.106229966
deepberry
Replies: >>106230032
Anonymous
8/12/2025, 2:44:23 AM No.106230022
>>106229850
fortunately, no.
Anonymous
8/12/2025, 2:45:49 AM No.106230032
>>106229966
there are 4 v's in the word berryseek
Anonymous
8/12/2025, 2:58:30 AM No.106230117
Panda-Plush-Mini-with-Chinese-Clothing
Panda-Plush-Mini-with-Chinese-Clothing
md5: e14543fd568c4f8b2433a15d498718d5🔍
Mistral Large 3 will save MistralAI.
Anonymous
8/12/2025, 3:07:35 AM No.106230169
>>106226692
>GLM 4.5 post world.
did GLM pay shills to flood this thread? how come such a garbage broken model keeps getting pushed here?
Replies: >>106230212 >>106230275
Anonymous
8/12/2025, 3:10:48 AM No.106230193
I hate thinking models, it used up max context size and still thinking, what the fuck.
Anonymous
8/12/2025, 3:14:40 AM No.106230212
>>106230169
Best local vlm just dropped
Anonymous
8/12/2025, 3:16:26 AM No.106230228
https://x.com/eliebakouch/status/1955037073087373427
https://xcancel.com/eliebakouch/status/1955037073087373427
Seems no argument against
Anonymous
8/12/2025, 3:22:48 AM No.106230275
>>106230169
Most people are here for ERP and GLM is the bigger and smarter nemo.
Anonymous
8/12/2025, 3:25:07 AM No.106230294
I would like to remind all the male mistral employees that the best way to get revenge on women in your sinking ship is to quickly run a training of one of your models with no sex safety. And then leaking the weights. Women will weep on the day that happens.
Anonymous
8/12/2025, 3:27:05 AM No.106230306
>>106226988
So was she raped or did she get this upset because Mistral copied DeekSeek instead of making their own (bad) model?
Replies: >>106230380
Anonymous
8/12/2025, 3:33:58 AM No.106230356
>>106229114
on websites that are not anon like here most of those hipsters are outed as serial liars by their own posting history
the amount of times I've seen someone defend their weirdly ai looking posts saying "but I've always written that way" you go back before GPT 4 released and they, in fact, never wrote that way
Anonymous
8/12/2025, 3:38:55 AM No.106230380
>>106230306
she had BPD and the white dude she stalked had enough and dropped her which seemed to cause a crash out (fired from her job) leading to her sending emails to who knows how many people full of who knows what nonsense. but believe all women bro so I guess uh french dudes who need interview help are total ICH
Replies: >>106230432
Anonymous
8/12/2025, 3:40:52 AM No.106230392
>tech drama
>includes leak about french company being borderline fraud
lmao, nice
Anonymous
8/12/2025, 3:42:43 AM No.106230406
>>106226988
>smelly frog 'men' need smarter Han women to do their job, while copying everything from China and glorious deep sneed sota model but keep it hush hush
>then cry about evil chinese researchers distilling from Gemini/gpt
Why is whittu piggu so hypocrite? Just accept that Xi won already and stop coping
Anonymous
8/12/2025, 3:44:20 AM No.106230415
>>106229439
French always had weird patriotism toward anything French branded even when it's hot garbage (see also: the huge scandal of the inherently broken engine built by Peugeot, PureTech. Frenchies still buy from this shitty brand in droves)
As Coluche, a French humorist once said:
"Do you know why the French chose the rooster as their emblem?
Because it's the only bird that can still sing with its feet in the shit."
Anonymous
8/12/2025, 3:45:06 AM No.106230422
1738206551986239
1738206551986239
md5: 86c389c21b31adc0a5dd8b292ba4f4ca🔍
>>106229563
W00t
Anonymous
8/12/2025, 3:45:58 AM No.106230432
>>106230380
Workspace bullying is really common. The problem is that when (you) do something about it, it can easily turn against yourself. I've seen so many assholes during my career that I've lost the count. Somehow the biggest narcissists are always in management position...
But crying about it in twitter doesn't really change anything.
Replies: >>106230441
Anonymous
8/12/2025, 3:48:26 AM No.106230441
>>106230432
https://www.ribbonfarm.com/2009/10/07/the-gervais-principle-or-the-office-according-to-the-office/
https://en.wikipedia.org/wiki/Moral_Mazes

Nothing new under the sun.
Those people rise to middle mgmt because that's where they're most useful, and least impactful to the organization.
Anonymous
8/12/2025, 3:50:49 AM No.106230456
>>106226988
>distilling deepseek
I mean, this was super obvious considering how Small 3.2 wrote exactly like R1-0528
Anonymous
8/12/2025, 3:51:40 AM No.106230466
>>106229294
1 tell it to hack all banks and send the money to me
2 release the alien invasion files
3 drop all the nukes from everyone everywhere all at once
In that order
Anonymous
8/12/2025, 3:52:58 AM No.106230474
I would have protected her from getting bullied! I would have made a whole shelf (by hand I like woodwork) to put her plushies on. We would be a deepseek only household...
Anonymous
8/12/2025, 4:01:38 AM No.106230539
Untitled
Untitled
md5: 863588a6a736e0ed2fef6a2c84137e8a🔍
>>106230523
>>106230523
>>106230523
Anonymous
8/12/2025, 4:21:33 AM No.106230707
>>106229563
If I can't run it on at least an even numbered quant with 128gb ram and a 24gb of vram, I don't care.
Anonymous
8/12/2025, 4:33:15 AM No.106230786
>>106227955
https://rentry.org/imagepromptgenerator
Warning that it may generate responses that are longer than you are willing to wait for.
Anonymous
8/12/2025, 4:33:48 AM No.106230789
9333eb6f91d143c24a1365737f73c35b
9333eb6f91d143c24a1365737f73c35b
md5: 2063108347b08aa09169fdc761471435🔍
You know what...

Fuck it.

Stay tuned.
Anonymous
8/12/2025, 6:19:16 AM No.106231533
images (2) (29)
images (2) (29)
md5: 159628f3ac28a9309c9529542d7b32a1🔍
So what's next for "Open" "AI"?