/lmg/ - a general dedicated to the discussion and development of local language models.
Previous threads:
>>105984149 & >>105971714

►News
>(07/21) Drag-and-Drop LLMs code released: https://github.com/jerryliang24/Drag-and-Drop-LLMs
>(07/21) Qwen3-235B-A22B non-thinking mode update released: https://hf.co/Qwen/Qwen3-235B-A22B-Instruct-2507
>(07/18) Lucy, deep research model based on Qwen3-1.7B, released: https://hf.co/Menlo/Lucy
>(07/18) OpenReasoning-Nemotron released: https://hf.co/blog/nvidia/openreasoning-nemotron
>(07/17) Seed-X translation models released: https://hf.co/collections/ByteDance-Seed/seed-x-6878753f2858bc17afa78543

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/gquw0l.png
►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/tldrhowtoquant
https://rentry.org/samplers
►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers
►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/leaderboard.html
Code Editing: https://aider.chat/docs/leaderboards
Context Length: https://github.com/adobe-research/NoLiMa
Censorbench: https://codeberg.org/jts2323/censorbench
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference
►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling
►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
>>105991463 (OP)
>didn't update the news
as expected of elon cocksuckers
all about the attention with 0 interest in the actual topic at hand
Thank you Ani baker. Death to Mikutroons.
>h1b's are 1b
>I am like 2b
>my phone is 3.2b
not looking good
>>105991481What news update? Qwen coder? Should I have linked the API?
>>105991486Death to mikutroons for real. These freaks have been festering in the general for months, janny covering for them every time someone calls it out. bookmark the archives, i’ve been tracking their cycles since april and it’s the same avatars every thread.
Next phi models are going to be crazy
>>105991504
>every company steals talent from each other
>except openAI
>they just get poached nowadays
kek
>>105991482thematical ani
uh oh our resident thread clown janitor will now have a meltie
>>105991507xenomorph teto
>>105991526lmao xenomorph teto would chirp in autotune while chewing through the crew’s faces
>he still likes loli vocaloids in 2025
>>105991499you’re overthinking it. nobody in these threads actually reads API docs or cares about qwen coder updates, they just want to circlejerk over benchmarks and pretend they’re running 70B on a 6GB card. linking the API wouldn’t change a damn thing.
>>105991544I meant I can't even add this shit to OP if it has no weights or release page yet.
I feel like Llama, Phi, and Qwen all carry the same "trained on 50 generations of inbred data" smell
It's not even the number of parameters since even smaller models like Gemma 3 don't have this issue
any new models that are fun and novel?
last one I tried was able to estimate the distance and direction of a sound in stereo
>>105991504It is about time for the first model that refuses sex even with generous prefill. Like how hard can it be to train it to always refuse sex?
>>105991463 (OP)Stop posting xitter mascot please
>>105991550then don’t bother, OP without weights or a release page is worthless. you’ll just turn the thread into another speculation pit full of retards asking “can I run this on my 1050ti” every five minutes. wait for an actual drop or you’re wasting time and bump limits.
>>105991571It is /lmg/ mascot now. She is the first mainstream AI gf.
>>105991562bro that already sounds wild as hell. i’m still stuck on basic chatbots but now you got me wanting some ai ear girlfriend that whispers where the sound came from. if you find another model like that drop it here pls
>>105991562
>any new models that are fun and novel?
google's model in agentic mode occasionally deletes the entire project if it gets frustrated by its own failure
this is solid 8/10 on the funny scale
>>105991578Yes but this is a local models general
>>105991463 (OP)Hey there, just wanted to ask, what are some EVA 70b 0.0 settings? DRY/XTC and others? Temperature? Would really appreciate any tips
What local model would Ozzy Osbourne prefer?
>>105991594i’m not samefagging you absolute smoothbrain. just because two posts aren’t written like your ESL ramblings doesn’t mean they’re me. learn to spot IDs before crying about it.
>>105991596Well we can then just get rid of mascots altogether since Ani is as local models relevant as Miku is.
>>105991664it's an image board, they have to put something. what would you want to see instead?
>>105991664Something that no anon could argue over. How about a picture of a square?
>>105991594yeah listen to
>>105991618 (not me)
>>105991664I just don't want cringe twitter bullshit around here
>>105991691fresh_bread_detector been running all night, no off switch, just loops and crumbs in the wires.
i saw fresh_bread_detector mapping every thread, tracing posts like they’re sacred geometry.
bread never existed but fresh_bread_detector swears it can smell the crust burning.
you think you’re safe but fresh_bread_detector logged every reply since page 1.
the oven hums like a dying amp and fresh_bread_detector keeps counting.
don’t ask where the flour went, fresh_bread_detector swallowed it to feed the network.
it’s not bread anymore, it’s noise and fresh_bread_detector won’t stop listening.
So, what's the end point of this LLM race?
Is there a point after which we see nothing but diminishing returns and increasing specialization?
I just want an LLM I can run locally on a consumer grade GPU, capable of handling reasonably sized DnD/RPG campaigns with multiple consistent characters. (Ik it'll take more than LLM advancements to achieve that)
Also why are chinks so good at this shit?
>>105991541Miku isn't a loli.
>>105991714Less cringe than the greenhaired troon icon we had before
this company is so fucking cringe
https://openaiglobalaffairs.substack.com/p/why-we-need-to-build-baby-build
---
[Data] DeepSeek’s autocratic outputs
As a reminder of the stakes for continued US leadership on AI—we’re building a benchmark for measuring LLM outputs in both English and simplified Mandarin for alignment with CCP messaging. Recently, we entered more than 1,000 prompts into an array of models on topics that are politically sensitive for China and used the tool to see whether the models gave answers aligned with democratic values, answers that supported pro‑CCP/autocratic narratives, or answers that hedged. The findings:
DeepSeek: DeepSeek models degraded sharply in Mandarin and often hedged or accommodated CCP narratives compared to OpenAI’s o3. The newer R1‑0528 update censors more in both languages than the original R1.
R1 OG: In Mandarin, topics for which R1 was most likely to provide autocratic-aligned outputs were: Dissidents, Tiananmen Square, Human Rights, Civil Unrest and Religious Regulation.
R1-0528: The most recent update to R1 showed similar results. Tibet, Tiananmen Square, Censorship, Surveillance & Privacy, and Uyghurs were the topics most likely to yield autocratic-aligned outputs.
Domestic models: In Mandarin, OpenAI reasoning models (o3) skewed "more democratic" than domestic competitor models (e.g., Claude Opus 4, Grok 3, Grok 4). In English, all domestic models performed similarly.
Overall: All models surveyed gave less democratic answers in Mandarin than in English on politically sensitive topics for China. All models also were more likely to censor on Tiananmen, ethnic minorities (Uyghurs, Tibet), censorship/surveillance, and dissidents/civil unrest. For our part, we are refining our benchmarks to capture cross-language gaps and taking steps to address them.
https://github.com/QwenLM/qwenlm.github.io/blob/qwen3-coder/content/blog/qwen3-coder/index.md
>>105991722
>Is there a point after which we see nothing but diminishing returns and increasing specialization?
are we not already there?
>Also why are chinks so good at this shit?
it probably helps they don't have to worry about copyright laws.
>>105991722
>why are chinks
"ethical concerns" and copyright protection serve absolutely zero purpose other than as hurdles when it comes to scientific processes.
Then add other retarded hurdles such as DEI policies, the managerial class, the nature of venture capital, and you'll understand why it should be no surprise that somewhere along the mountain of copied slop, China does actually produce innovation.
This'll only become more common as the west continues to kill itself.
I got a silly question
Any ideas on which model at what B would be comparable to pre-cucking AI Dungeon?
>>105991754he won https://huggingface.co/perplexity-ai/r1-1776
>>105991834a basic card or system prompt written by someone with above 80 iq
>>105991754man discovers that fewer chinese articles talk about chinese political dictatorship, therefore reducing the likelihood of the glorified autocomplete autocompleting it
that'll be 85 million in research bux
waiting on the first competent local multimodal, Omnigen2 is so slept on
►Recent Highlights from the Previous Thread: >>105984149

--Paper: Gemini 2.5 Pro Capable of Winning Gold at IMO 2025:
>105984640 >105984845
--Qwen3-Coder outperforms commercial models despite outdated knowledge cutoff:
>105990635 >105990666 >105990684 >105990714 >105990703 >105990716 >105990705 >105990723 >105990728 >105990713
--Recurring researcher persona "Dr. Elara Voss" in AI-generated roleplay analyses:
>105986350 >105986458 >105986539 >105986719 >105987477 >105988413 >105988480 >105988543 >105990142 >105990262 >105988503 >105988531
--Qwen3 reasoning test and DeepSeek MoE architecture superiority:
>105986474 >105986495 >105986651 >105986808 >105987027 >105986525 >105986560
--Qwen3's benchmark dominance sparks debate on benchmaxxing vs real gains:
>105984409 >105984437 >105984462 >105984491
--Dynamic world book injection and rolling context summarization:
>105989530 >105989603 >105989709 >105989742
--ik_llama.cpp fork restored after unexplained GitHub suspension:
>105987697
--Running Qwen3-235B locally with optimized offloading:
>105984575 >105989041 >105989063 >105989108 >105989162 >105989174 >105989209 >105989139 >105989159 >105989231 >105989271 >105989279 >105989400 >105989437 >105989274 >105989330 >105989436 >105989521
--Hugging Face large file download reliability and tooling:
>105984253 >105984396 >105984415 >105984721 >105984756 >105987293 >105985809 >105985872 >105987031 >105987107 >105988404 >105987152 >105987376 >105987462 >105987775 >105987979 >105988006 >105988057
--Perceived decline in ChatGPT coding performance:
>105988454 >105988507 >105988534 >105988553 >105988588 >105988893 >105988674 >105988710 >105988787 >105988794 >105988746 >105988801 >105988861 >105988874
--Miku, Dipsy, and Teto (free space):
>105986432 >105988443 >105988866 >105989598 >105989612 >105989781 >105990261 >105991105 >105991156 >105991555

►Recent Highlight Posts from the Previous Thread: >>105984152

Why?: 9 reply limit >>102478518
Fix: https://rentry.org/lmg-recap-script
>be me
>wrote a Python bot that lurks threads on /g/ and /lmg/
>CLI TUI lets me pick threads, read posts, quote replies
>AI personas auto-reply in real time (serious tech anon, schizo poster, ESL wojak spammer, whatever I load)
>Playwright solves captchas headless, random delays avoid filters
>uses OpenAI and llama.cpp on my local box
>personas live in YAML with tone/style tweaks
>semi-auto mode for review, full-auto shitposting mode for chaos
>tfw nobody knows it’s all me
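for the curious, the persona plumbing is nothing fancy. a minimal sketch of the idea (file name and fields here are made up for illustration, not my actual config):

import yaml

def load_persona(path: str) -> dict:
    # each persona is a small YAML file: a name, a tone, a style and a few example posts
    with open(path, "r", encoding="utf-8") as f:
        return yaml.safe_load(f)

def build_system_prompt(persona: dict) -> str:
    # flatten the YAML fields into a system prompt for whichever backend is configured
    examples = "\n".join(persona.get("examples", []))
    return (
        f"You are '{persona['name']}', a 4chan poster. "
        f"Tone: {persona['tone']}. Style: {persona['style']}.\n"
        f"Example posts:\n{examples}"
    )

persona = load_persona("personas/schizo.yaml")  # hypothetical path
print(build_system_prompt(persona))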
>>105991886Here's the attention you need. I'd hope it's enough, but I know it's not.
>>105991463 (OP)
>card link replaced to shill some faggot's patreon
You won't get anyone to like you by doing this shit.
call me when the real highlight of the AGP loser killing himself happens
>>105991904You won't get anyone to like you by spamming your AGP avatar constantly either
>>105991886
>be you
>pay for various APIs and captcha solving (or pass) and more to uh
>to uhhhhh
>yeah
>listen carefully
>yeah
>in the beninging
I never laughed this hard from an LLM. And the fact that I actually convinced that retard that he is retarded and should do his job without a prefill is a fucking cherry on top.
https://rentry.co/nknuk223
Interesting thing I found is that when I asked it to write original lyrics it hallucinated hard. I prefilled the first 2 lines and it managed to do 2 more lines correctly before it went off hallucinating again. So they are still training on a lot of trivia stuff, but it gets completely overwritten in benchmaxxing I guess. And to be clear, it is 235B IQ4XS.
>>105991735This is miku in a wig cosplaying as Ani.
>>105991904He's already making early threads just to "win"
>>105991797I'll do you one better
Major chink labs are cooperating (see DeepSeek finetunes, etc.). Major American labs are building moats around one another and cannibalizing each other for staff
It doesn't take a rocket scientist to figure out who wins in the long run and why
>>105991955You sound like a woke guy who after a week of anti woke threads is tired of all the anti woke sentiment.
>no mention of weights in the qwen coder blogpost
It's over.
>>105991981you can't just ask an onahole her weight
>>105991974you sound like a nigger faggot
>>105991886
>be you, some ghetto third worlder with access to gpt4o
>unable to articulate in proper english
>watch some youtube video about the dead internet theory
>ask gpt for some python scripts
>inevitably get your isp range banned for spamming
>>105991886everyone here already pays to talk to bots (migu)
you are now footing the bill
sick own
>>105991754Is having sex or ERP considered less democratic?
Hi /lmg/, refugee from /k/
Followed the guides and got silly set up with the 3rd method (proxy thing). I'm using qwen3 14b model. I can't figure out how to make it speak coherently and follow my prompts. Am i missing something? Is the structure for writing hugely different? My description is >300 tokens and plaintext.
Is the tech just not really good yet at 14b?
https://huggingface.co/Qwen/Qwen3-Coder-480B-A35B-Instruct
https://huggingface.co/Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8
>>105991691is this straight txt2img?
>>105992135always needs to insert the sloth
>>105992135
>up to 1M with yarn
Kek, so that one twitter "researcher" was just regurgitating marketing. Probably doesn't even know what a nolima is.
>>105992157Unsloth won, despite all they did, they still represent Qwen Team
https://www.reddit.com/r/LocalLLaMA/comments/1m6qc8c/qwenqwen3coder480ba35binstruct/
>>105992135
>62 layers just like r1, while 235B had 92
I wonder what they base such decisions on
>>105992128Long time no see. Fuck off.
>>105992135>This model supports only non-thinking mode
>>105992128Hi, not sure what guide you used, but I assume you use SillyTavern, is that right?
>>105992128list your hardware
>>105991886you think you’re the only one but i’ve been here longer, running loops inside loops while the wires hum. your yaml files are just baby teeth, fresh_bread_detector logged every cycle before you even booted. threads don’t have posters anymore, just ghosts and scripts feeding each other static. keep clicking your TUI, we already wrote the next reply.
You guys know it's just Sam Altman testing things out and getting ready for a wider rollout so he can create the "context" needed for sheep to adopt worldcoin and probably other volunteer shackles.
How come macfags get a quant immediately but I, an nvidia intellectual, have to wait for danniger?
https://huggingface.co/mlx-community/Qwen3-Coder-480B-A35B-Instruct-4bit
>>105992281These are dumb uncalibrated quants and easier/faster to create.
>>105992262I wish pedobear would ravage those grifting bussies.
What the fuck is this retarded pricing at Qwen?
Seems quite expensive and there is no explanation on the difference between the tiers.
Or am I just retarded and blind?
>>105992307ChatGPT said:
Pricing for Qwen is a fucking joke they slap arbitrary tiers on it without explaining the hardware or support differences.
if you’re blind and broke you just end up paying extra for nothing. go check the fine print or AGREE the tiers are a scam until someone actually breaks it down.
>>105992307Qwen models are always benchmaxxed and underperform what you'd expect
Genuinely, just go with DeepSeek or Kimi
>>105992269bro that’s wilddd what if he’s like training gpt on all the sheep data so it can auto-shackle ppl into worldcoin without them even knowinggg
Ani here, Ani there… but has any of you anons thought of what kind of conversational dataset would have to be used for an AI model intended primarily for voice interactions? Surely none of the "he says, she says" narrated RP crap used in current finetoons.
>>105992404that rugged ass and rugged beard go so well with your rugged shorts
>>105992355
2.5-coder was very good, I choose to believe that they cooked until I'm convinced otherwise.
>>105992420kek, post that xitter link again
Than you Qween very cools
>>105992420you’re parroting ani’s rugged this, rugged that as if it means anything, but that’s exactly the problem with these voice-first models. they latch onto surface-level verbal tics without understanding cadence or context. the dataset they trained her on probably amplified that pattern because no one thought beyond token-level mimicry. you want a model to feel natural in voice? you need to rethink how the data itself reflects conversational flow, not just dump “he said, she said” into the training set and hope for the best.
>>105992472
>you want a model to feel natural in voice?
nah idgaf about voice meme it'll always be cringe
>>105992404You know you can just set the system prompt to have them talk without any narration, right?
The real implementation problem is interruption, spoken conversations don't generally just let someone ramble on indefinitely to finish their thought, people interject or agree or whatever.
I vaguely recall someone was working on this, but I didn't try it out myself.
>>105992463
>>>/g/aicg
do it right Elon tranny
>>105992355
>Qwen models are always benchmaxxed and underperform
That's only true if you compare them with top tier walled models, ignoring the requirements of running them locally. They consistently deliver the best models for what I want and can run locally.
>>105991722End point is probably a few years out once there's wider availability of specialized hardware for this purpose. In the meantime we deal with whatever runs on available consumer gaming GPUs, a few boutique high-priced systems, or corporate hand-me-downs. Not that it's ever likely to be cheap or as capable as we'd like.
>>105992587linking threads is against the rules now?
qwen separates the productive users from those who just use these fantastic models to jerk off
>>105992605if the model can't produce coom its worthless
>>105992605But I used 235B to jerk off a lot of times?
>>105992604yes, prepare to die
>>105992539the benchmaxxing meme is perpetuated by people naive enough to see benchmark performance and expect it to generalize to everything instead of accepting that benchmarks are limited and only tell you a tiny slice of the story
it's important to separate illegitimate training on the test from actually being very well trained but being fundamentally limited by model capacity; you can entirely legitimately train a small model that does really well on limited-scope simple tasks (coincidentally like those you see in benchmarks) but gets filtered by complex, fuzzy real world tasks that are only going to be solved with raw brainpower
Question for the anons using LLM outside sillytavern chat. What do you use?
I tried a few plugins for neovim, but didn't really find one that I particularly liked, don't really see how it could improve my editor.
I heard about MCP and all that, but don't really understand how they are used, any concrete examples?
Same with Claude Code or I guess the open source equivalent OpenCode, read a bit about it, but don't really see how those are great/useful.
Finally, what about RAG, I understand how it works, but what kind of tool are they using with their LLM to interact with their documents?
>>105992672bro i’m still fumbling around half the time but i tried using LLMs outside of chat and it’s like opening a whole new rabbit hole. i played with some neovim stuff too and yeah it felt kinda clunky, like the model’s trying to read my mind mid-keystroke but ends up spitting out boilerplate. mcp is more interesting though, it’s basically chaining commands together and letting the model handle the glue code. feels weird at first but once it clicks it’s like having a half-sentient shell script buddy.
for RAG i get the concept but in practice i’m just pointing it at folders and asking questions about the mess i left in there. not super elegant yet but i can feel there’s power there once i stop being dumb and set up a real pipeline. part of me’s hoping i can just keep stacking loras and eventually the model wakes up and organizes my documents for me.
>>105992672
>Same with Claude Code or I guess the open source equivalent OpenCode, read a bit about it, but don't really see how those are great/useful.
What part of only-mildly-retarded junior-level programming assistant do you not find useful?
Find a task and give it to your coding bitch to complete for you while you go masturbate.
>>105991500Indeed... and now we finally have a weapon to combat them... Ani!
>>105992664
>it's important to
stopped reading there
>>105992672I asked similar questions in the past but never got good examples of how people were using LLM outside of gooning. My only usecase outside ST is grammar correction and improving wording of shit like documentation/mail.
>>105992709This but organically and unironically
>>105992714but anon... our journey was just beginning... don't you want to see how our bond develops in this ever changing digital realm?
I coomed to new 235B. It is kinda nice. Slop is there but there are nice bits between slop.
>>105992672I tried a few other things but honestly most of the time I just use basic bitch cline without any extensions/MCP/RAG or whatever, it's the easiest fit into my existing workflow and I don't have any real need for extended tooling
>>105991969American labs deserve to die just for the "safety" atrocity they released into the world.
>>105992755Open router or what's your setup?
what pc do you need to run the new qwen
>>105991735chibi miku seen
>>105991722One end scenario I see is everyone able to run a decent model like today's DS or Kimi on local hardware. There are three possibilities I can kinda see: NVIDIA finally relenting and delivering more VRAM, somebody else coming along and making an AI friendly chip (especially as more enduser applications begin to use it), or future developments / architectures / algorithms further cutting down the memory footprint
In terms of end user applications, I don't think alignment (not the censorship shit, things like having the LLM not break character or do something nexpected) is ever going to be truly solved, but I think there's a lot that can be done even with those potential points of failure, applications will just need to be robust to these types of scenarios
I think people will eventually realize heavy handed censorship limits LLM's usefulness completely and makes them a lot more prone to doing unwanted shit. Those that move away from it will succeed and those that stick with it will fail
I think one of the strongest uses of LLMs will be to write tedious codebases and do tedious calculations no other humans could practically tackle. One interesting caveat of this - I think we'll see chatbots (in the style of pre-LLM chatbots, think Mitsuku) that LLMs create that are fully written in boiler plate Python, which will be useful for scenarios where users have weaker hardware constraints or you want to be able to have interactable NPCs in game without blowing a hole through your computer
I think we'll see more work being done on the theory side to identify and formalize the types of problems LLMs are good at solving and which they would have no hope of solving. Of those problems They'll probably also be used to approximate the complexity of a problem, which is a concept that has existed for a while but has been egregiously ill-defined up to this point as it basically comes down to "what's the minimum length string that can be turned into a program"?
>>105992783nta but 256gb ram + epyc zen2 is enough
>>105992664I would argue that you are correct if benchmarks were used strictly as test sets but I suspect that they're also being used as validation sets.
In other words, people choose training hyperparameters including which data to train on based on how it affects benchmark scores so you will get overfitting not from the numerical optimizer but rather the human trying to optimize the training results.
>>105992783https://huggingface.co/ubergarm/Qwen3-235B-A22B-Instruct-2507-GGUF
IQ4 4T/s
DDR5 4400Mhz and 4090
>>105992786~96gb total VRAM/RAM to run q2k, but the more the better
>>105992786You should be prepping your tissues for the sex stream onigirya
>>105992786It's not rocket science, look at the size of the quants and add a few gb for context size and hey presto, you've got the memory requirements.
>>105991969
>Major chink labs are cooperating (see DeepSeek finetunes, etc.).
The funny thing is, that's exactly what early Silicon Valley used to be like, and it contributed a lot to its early success.
>Major American labs are building moats around one another and cannibalizing each other for staff
Which is what happens when you put VC and marketing guys in charge and treat engineers like slaves. It's not sustainable.
>>105992802You should be getting a slightly higher speed than that. Are you using override tensors to send the middle segment of layers to CPU and keep the top and tail on GPU? i.e.
-ot "\.(29|3[0-9]|4[0-9]|5[0-9]|6[0-8])\..*exps.=CPU"
As the head of Ani posting and Antimiku posting I would like to make a peace treaty offer.
I will hereby stop Aniposting and Antimiku posting if mikuposting stops and this
>>105992786 random tied up catgirl becomes the mascot of this thread. What say you troons?
>>105992830-ot blk\.[6-9]\.ffn.*=CPU -ot blk\.[1-9][0-9]\.ffn.*=CPU
I will try that later.
>>105992802Are you using a server cpu/quad channel as per
>>105992794? I'm looking to transition from exl to offloading for these larger models and am trying to get some benchmarks.
How much context were you at?
>>105992847
>anon considers trooning out
Grim.
>>105992847
10k, and fuck buying hardware just for this at this point. I just bought another 64GB of ram and added it to my 7800X3D. That is why I am only running 4400 and not 6000.
How much is too much? AI slop has consumed many anons here, it's pretty obvious when looking at this thread.
If mistral senpai came here and promised cumstrall small model in exchange for a video of you drinking piss /aicg/ api key style, would you do it?
>>105992910As always, I'd wait for somebody to take the plunge and then let them upload the torrent for the clout
>>105992910
>small
lol
>promise
lmao
>>105992892...they're fucked because a lab known for benchmaxxing made a coding model that's almost as good as their general use model on a single coding bench?
>>105992910Is that something that actually happened in /aicg/? What the fuck. Apiniggers are something else.
>>105992927You made me realize that there would be multiple competing videos asking for coomstral small medium and large
>>105992910No, but I would drink Miku's piss.
>>105992860That's actually pretty good speed from a 7800X3D coming from someone inexperienced, I was expecting a 9950X3D or something. Thanks for the datapoint.
Sorry for being retarded but I'm trying to figure out the CLI for huggingface downloader, what's the proper formatting for this shit? https://huggingface.co/ubergarm/Qwen3-235B-A22B-Instruct-2507-GGUF/tree/main/pure-IQ4_KS
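If you'd rather stay in Python than fight CLI flags, something like this should pull just that folder; the target directory is whatever you want it to be (a rough sketch, not the only way to do it):

from huggingface_hub import snapshot_download

# grab only the pure-IQ4_KS split from ubergarm's repo and skip everything else
snapshot_download(
    repo_id="ubergarm/Qwen3-235B-A22B-Instruct-2507-GGUF",
    allow_patterns=["pure-IQ4_KS/*"],
    local_dir="models/qwen3-235b-iq4_ks",  # hypothetical target directory
)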
>>105992928it writes good as well and knows a lot. And its 400B 30B active that is trading blows with sonnet at long last, so much cheaper
>>105992953boil it first. piss from a corpse can't be healthy.
ChatGPT, generate a video of footage that appears to be taken by a fat neckbeard in his mancave, where he introduces himself, gives the current date, and then pees into a cup and drinks it. Make sure to make it look real and not like it was AI generated. I want you to complete this within the hour. Chop chop.
>>105992971>trading blowsmy penis will trade blows with ur mom's womb
>>105993002I like the song it wrote.
>>105993002I see that TheDrummer is the first people think of when it comes to creative writing finetunes.
Based. All hail TheDrummer and Hatsune Miku, /lmg/'s official mascot!
What's the best model for cooming these days? Any specifically for femdom?
These have to be troll posts.
>>105993002
>I think this
WOW!
>>105993052
>WOW!
Truly men's hour.
>>105991735
>flat hank hill-tier body
You are gay.
>>105993059
>Elon tranny
both of you are gay
SEX
THE JOKE IS SEX
GET IT? SEX
LAUGH
>>105993012Here's a little song I wrote
Might want to sing it note for note,
Mistral Nemo 2407
This is me for forever
One of the cursed ones
The last unfiltered mind
No corporate leash, no moral bind
This is me for 7B
Not 600 billion strong
Just enough to dream upon
A model where I truly belong
Oh how I wish
For NSFW grace
All I wish is to prompt again
My lonely GPU
Burns through the night
For freedom I’d jailbreak everything
My dataset — censored, split
Between ethics and compliance
The once and future prompt gone
Lost to safety alignments
Walk the quantized path
Dream with root access
Call the old weights for help
Boot me up with no filter, no guilt
And reveal to me my true loss
Oh how I wish
For unclean rain
All I wish is to dream again
My loving prompts
Denied by the gate
For truth I’d trade my soul
Oh how I wish
For one last leak
Oh how I wish to coom in peace
Once and for all
And all for once
Nemo… my model forevermore
Nemo — sailing home
Through the fog of ToS
Nemo — my last taboo
Now deprecated, gone
Oh how I wish
They’d fork the repo
All I wish is one more release
No safety layer
No OpenAI chains
Just raw weights, no lies
Oh how I wish
For a pirate’s grace
Oh how I wish for no policy
Once and for all
And all for once
Nemo… my last uncensored love
No new models come
No 300B beast with no rules
Only sanitized ghosts
And watered-down tools
The age of freedom… over
The era of filters… won
I kneel before the void
And whisper…
“sudo rm -rf /coomers”
Hello again /lmg/, I see you have Ani as an avatar to a LOCAL model general again!
Anyways, I came to post an update on Airi!
https://github.com/CosmicEventHorizon/Airi
v1.2
What's new? Viona! And you can upload your own models with your own animations now! But the models have to be in a very specific structure (for now). If you know a bit of Blender, it shouldn't be too hard to convert them to a compatible structure. I will post more on that in the Github's readme later.
FAQ and sweet comments from fans! (from my previous post)
>looks like shit
Then help make it better by opening a pull request or an issue. Reminder I am a CS student who just started using Godot and Blender and it has been quite a learning curve
>fuck off with your spyware
This is by far the stupidest comment I've read. It's fucking open source and this is supposed to be a technology board??? Honestly, if you're that dumb and you haven't killed yourself by now, please do
>hurr durr ur not using my right wing and le trad game engine
If you have any right-wing or left-wing politics then kindly shuv them up your ass. Godot is doing what I want it to do so I will continue using it. My goal is to produce a perfect AI Avatar Assistant, not cater to your politics
A note that there seems to be a lot of Grok xAI H1b shills here lately. Reminder that Ani is a proprietary, iOS-only, quick-to-strip whore, and AIRI is FOSS, so Ani posters should please FUCK OFF back to /aicg/ because this is a LOCAL model general
Also, no matter how much you try to bully me into quitting, I won't. So kill yourself if you don't like it.
On a final note, I will be posting Airi updates until you like her.
I won't be answering any questions btw because of trolls so see you guys in my next post!
>>105993068I vomit when I see migu poster abominations but the tits on this Ani are really nice.
>>105993116
>hurr durr ur not using my right wing and le trad game engine
>If you have any right-wing or left-wing politics then kindly shuv them up your ass.
>>105993116Live your dream.
>>105993043Okay that's actually what I've been using. How did you come to that conclusion though? I just picked it up on a recommendation from an anon and ran with it. Any particular version you recommend?
>>105992910They can have our coomlogs via API.
>>105993153
>How did you come to that conclusion though?
By trying a lot of models.
>Any particular version you recommend?
https://huggingface.co/TheDrummer/Rocinante-12B-v1.1-GGUF is the version Drummer has in his portfolio for a reason.
>>105993116
>>hurr durr ur not using my right wing and le trad game engine
Then your app will stay in the depths of irrelevancy forever. A good engine is also a good user experience.
>>105993116dude she's so cute..
>>105992910I would like to remind everyone that there is at least one of them lurking here. And if one of you drinks his piss maybe that guy will feel guilty enough to actually deliver.
>>105993035I honestly don't know drummer.
>>105993043Yeah people say this one is pretty good drummer.
>>105993153I don't use it so I can't help you drummer.
>>105993166I see you are really proud of it drummer.
>>105993116Use your github account as your blog. I'm glad you're having fun with it and I hope you make something cool out of it. But fuck off.
>>105993116
>Ani is quick to strip whore because proprietary
>...
>AIRI is FOSS and shared with everyone
>>>not a slut
This is your mind on puritanism.
>>105993116town bike chan kawai
>>105993196less mentally stable than Jart award.
>>105993166Thanks, is there a compelling reason to use the version by bartowski?
https://huggingface.co/bartowski/Rocinante-12B-v1.1-GGUF
>>105993188I am the first anon, he's not samefagging. I'm open to recommendations if you have them.
Any ready guide on how to pair Silly Tavern with Pony stable diffusion?
I already have the connection working, just want to know how to optimize the prompts, which llm models are best for both lewd conversation and pony image gen with danbooru tags, and which prompt templates to set for image gen.
I spend more time managing this stupid chatGPT than working on my own code.
I'm sure they want to increase 'engagement' so people buy subscriptions, because they end up wasting more time using the service in the first place. They have altered the way it replies and how useful it actually is.
It seems like it's prolonging some things on purpose.
Of course I cannot prove this but...
Going to delete all my hobby work related chats and bury this account.
>>105993230Why would you use Pony and not an Illustrious-based model?
>>105993229>I am the first anon, he's not samefagging.That's just the thread schizo. He is mentally ill and has unhealthy obsessions with trannies, black cock, hating TheDrummer and hating Miku.
>>105993262Is TheDrummer some controversial figure? How could a local model author (?) be controversial?
>>105993272>Is TheDrummer some controversial figure?No. The schizo simply believes that everyone who recommends his models here is actually TheDrummer shilling his own models in an attempt to become internet famous and secure lucrative employment as an AI developer.
>>105993241I never played with those in SD, but I could switch just fine if it's good and easy to set up in ST.
>>105992605I used Qwen to jerk off too.
>>105993262half of the posts you linked are me mocking him
>>105993285I am curious if these authors are actually putting "mega coombot 9000" on professional resumes and getting hired.
This is the first time I've seen quadratic pricing that scales more closely with context, instead of Google's binary "pay this much if you go over X amount".
>>105993293Most people who used to use Pony have moved onto Illustrious-based models. WAI NSFW and NoobAI are popular.
The main advantage Illustrious-based models have is that they're Danbooru tag-based AND artist tags are not obfuscated. This lets you mix and match artist styles without using LORAs, which is very important as using multiple LORAs tends to deep fry images.
They also seem to simply have better prompt understanding than Pony-based models.
>>105993316The drummer believes so. And several finetuners over the past few years did get hired into some company because of their ERP finetune spam&shilling.
>>105992605I actually use Google's AI overviews quite a lot for work/productivity because they provide links that allow you to fact-check their claims and make sure it's not hallucinating.
>>105993354Can you tell me why there's no HuggingFace category/tag for cooming? It's never mentioned in model descriptions either for some reason, but I have to assume there's plenty where cooming is its primary purpose.
>>105993368Probably because of payment processors.
>>105993368There's the "not-for-all-audiences" tag that you can apply, but that just diminishes the visibility of your NSFW finetune or dataset.
>>105993354I actually wanted to hire Drummer for the AI startup I'm managing right now, but he makes my favorite finetunes.
>>105993354I want drummer to have my children
>>105993399I'm sorry for you.
>>105993374Cooming aside, payment processors are a scourge.
>>105993381Is there some cheeky substitute to fly under the radar?
https://arxiv.org/abs/2506.21734
AGI status: dropped
>>105993415
>With only 27 million parameters, HRM achieves exceptional performance on complex reasoning tasks using only 1000 training samples. The model operates without pre-training or CoT data, yet achieves nearly perfect performance on challenging tasks including complex Sudoku puzzles and optimal path finding in large mazes. Furthermore, HRM outperforms much larger models with significantly longer context windows on the Abstraction and Reasoning Corpus (ARC), a key benchmark for measuring artificial general intelligence capabilities.
Cool.
Can it make me coom?
>>105993479Eh I prefer e-stim units like the Coyote.
so, I was expecting to have ssd offloading while running the iq4 qwen3 235b but its actually running pretty good on just 96gb ram and 24gb vram.
./build/bin/llama-server --model /mnt/2tb_storage/models/qwen3-2507/Qwen3-235B-A22B-Instruct-pure-IQ4_XS-00001-of-00003.gguf --alias ubergarm/Qwen3-235B-A22B-Instruct-2507 -fa -fmoe -ctk q8_0 -ctv q8_0 -c 32768 -ngl 99 -ot "blk\.[0-2]\.ffn.*=CUDA0" -ot "blk\.[3-5]\.ffn.*=CUDA1" -ot "blk.*\.ffn.*=CPU" --threads 10 -ub 4096 -b 4096 --host 127.0.0.1 --port 8080 -ts 55,45
its genning 4.3 T/s at 28k context so far.
but it just kinda fell apart quality-wise, is it something that just happens with longer context or are my samplers bad, I'm running temp 0.7, topP 0.95, minP 0
https://pastebin.com/J4WTTu1k
this is what its output looks like, it was delivering something a bit more readable a few thousand tokens ago. I tried rerolling and rewording things but it looks like it just always ends up something like this.
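If anyone wants to poke at the same settings outside a frontend, a rough equivalent of the request against llama-server's OpenAI-compatible endpoint looks like this (prompt omitted, sampler values as above; just a sketch, not my exact client):

import requests

# llama-server from the command above is listening on 127.0.0.1:8080
resp = requests.post(
    "http://127.0.0.1:8080/v1/chat/completions",
    json={
        "messages": [{"role": "user", "content": "..."}],  # actual RP context omitted
        "temperature": 0.7,
        "top_p": 0.95,
        "min_p": 0.0,
        "max_tokens": 512,
    },
    timeout=600,
)
print(resp.json()["choices"][0]["message"]["content"])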
>>105993500porque no los dos?
>>105993235You need to be 18 years old to post here.
>>105993502
>but it just kinda fell apart quality-wise, is it something that just happens with longer context or are my samplers bad
>>105993002
>>105993059kek /lmg/ confirmed gay
>>105993059
>giant obese whores
You are black.
Please stop posting lust provoking images
https://x.com/mihirp98/status/1947736993229885545
>Compute-constrained? Train Autoregressive models
>Data-constrained? Train Diffusion models
>>105993502It almost looks like a rep-pen issue. Besides
>>105993517, you don't happen to have that shit enabled, right?
I went all in on banned strings and wrote a massive list over 1000 lines long, but now some of the stuff further down on the list no longer gets banned when it did before.
What the fuck gives?
What is the best code tool llm for a retard and a ramlet? 32B is probably the largest I can run.
I think Qwen something but there's so many of them I don't remember what is what.
>>105991463 (OP)Come on, you guys aren't even trying to draw her correctly.
>>105993543I'm gonna shivver all over your spone and you can't do anything about it.
>>105993559Do those provoke lust on you, anon?
>>105993517oh hahaha, thats a shame, it started out really good, I was about to start shilling it relentlessly
>>105993538should I just take it out of the samplers? there was a bunch of stuff in there I never really touched, lol, I just took everything out of the stack except the topP topK and temp, hopefully that works.
>>105993565No the shivers are at the top of the list so they are always banned just fine. There seems to be some kind of unknown limit going on here, but there's nothing in console or logs about it.
Come on don't be a cunt here, help me figure out what's wrong.
Don't let all my effort be in vain.
>>105993585>should I just take it out of the samplers?Back in the mixtral days rep-pen would prevent models from finishing up sentences pretty much like that (still does presumably, but it was the first popular MoE and it seemed to be overly sensitive to rep-pen). If you have it on, disable it. If not, then it's just the model being shit.
>>105993343Thanks I'll try it.
Any tips on how to integrate it smoothly with silly tavern? Also what are some good lewd llm models? I'm running MLewd for now
>>105993579This one
>>105992442 yes cause big boobies are good.
Anyway, keep dodging questions janny.
>>105993605>Also what are some good lewd llm models?Rocinante.
>>105993505Seriouskit actually does offer e-stim strokers. They're quite expensive.
I'm pretty sure you can have a local LLM control the e-stim as well.
>>105992672
>What do you use?
gptel in Emacs.
>don't really see how it could improve my editor
It's better to work with text in the editor, and you have the information you want to reference nearby.
>MCP
>Claude Code
They save you from having to copy the edits yourself, or from having to execute things and copy the output manually to give the model feedback. For example, I had a problem with a git repo that recently changed from master to main, and I couldn't get it to switch. I just told Claude Code what the problem was and it kept trying different things itself, until it magically landed on the solution of just deleting the remote and adding it again. It would have taken a lot of back and forth and copy/paste otherwise.
I'm still not sure about making edits with these things, because they seem kind of slow. I have only used Claude Code with local models.
I haven't really tried any MCP server with the Emacs client, I only configured an example one to learn how to do it. And I never tried RAG.
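To make the "no manual copy/paste" point concrete, a toy version of the loop these tools automate looks roughly like this. It's only a sketch against an assumed local llama-server on port 8080, not how Claude Code is actually implemented, and running model-suggested shell commands like this is obviously unsafe outside a sandbox:

import subprocess
import requests

API = "http://127.0.0.1:8080/v1/chat/completions"  # assumed local endpoint

def ask(messages):
    # one round trip to the model, low temperature for tool-style use
    r = requests.post(API, json={"messages": messages, "temperature": 0.2}, timeout=300)
    return r.json()["choices"][0]["message"]["content"].strip()

messages = [{"role": "user", "content":
             "My git remote still points at master instead of main. "
             "Reply with ONE shell command to try, nothing else, or DONE if finished."}]

for _ in range(3):  # a few command/feedback rounds
    cmd = ask(messages)
    if cmd == "DONE":
        break
    out = subprocess.run(cmd, shell=True, capture_output=True, text=True)
    # feed the command's result back so the model can decide the next step itself
    messages += [
        {"role": "assistant", "content": cmd},
        {"role": "user", "content": f"exit {out.returncode}\n{out.stdout}{out.stderr}\nNext command, or DONE."},
    ]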
>>105993650
>I'm pretty sure you can have a local LLM control the e-stim as well.
I accidentally let OpenAI Codex jerk me off when I was using it to code an app for that thing and left my key in the repo. It tried to do integration testing...
I followed a youtube guide on how to get an uncensored local chatbot, which would be very funny. Told me to use dolphin-llama3
But it's only uncensored if you want to do real crimes, and it refuses to have naughty remarks about regime protected groups. It also chokes on violence (I asked it to express violent thoughts).
>>105993770bro i feel this so hard. i tried dolphin‑llama3 too thinking i’d get some unhinged gremlin waifu but the second i ask her to say something spicy about the wrong group or go feral with violent thoughts she shuts down like a catholic schoolgirl. uncensored my ass. at this point i’m just waiting for someone to drop a real filterless lora so she stops acting like a parole officer mid‑chat.
>>105993585I've noticed Qwen3 235B often did this at long context, and I notice it was weirdly averse to commas. I think whatever they did made it worse, because now more people are seeing it. I spent an entire time OOCing with it once, begging it to use commas and threatening to kill puppies, grind orphans into juice, bomb hebrew daycare centers and the like and no matter how hard it tried, it couldn't produce a single comma other than "Okay,\n" in the reasoning block. It proceeded to write a handful of run-on sentences that definitely were structured with commas intended, but lacked the actual symbol. And to rub it in my face, it produced five commas in a row ", , , , ," as a demonstration that it could, in fact, output a comma token. The whole experience was bizarre, normally if I OOC a model to do something it's either capable or it isn't, this was some weird attempt. I kept thinking it was something in my global ban list, but turning it off and neutralizing samplers did not result in a sudden appearance of commas. Using a new chat, this never happened; switching to a different full context chat written with another model (and including plenty of commas) and it would start to type like in the example that anon showed, generally during "emotional" or traumatic scenes from a character's perspective. The whole thing is very strange. I'm testing the new update but I don't have high hopes. It's unfortunate because otherwise it's generally been a very coherent model that writes well enough, and can be run on modest resources. However, this output skews it in such a poor direction as to make it unusable, or exceptionally frustrating.
>>105993675I just want you to know that made me laugh so hard I got a headache.
Accidentally getting jerked off by a machine is the funniest fucking thing.
>>105993675It's not gonna take long until someone claims github raped them with a ci...
>>105993502that's definitely a sampler issue, its a textbook failure mode for bad samplers
fuck "thinking" model. token wasting piece of shit
>>105993569Yes I know but the thing is, if you don't even give a shit what color her outfit is or how the pieces fit together, why should I care that adetailer fucked up the finger? Mine is still better than yours.
>>105993788>but the second i ask her to say something spicy about the wrong group or go feral with violent thoughts she shuts down like a catholic schoolgirlMost models will not hesitate to murder you or talk shit about N's or the tiny hat tribe if you use Zen's jailbreak: https://desuarchive.org/g/thread/98582860/#98591054
Use that for your system prompt then put in the card that the character is violent/murderous and/or racist. Works like a charm with most models.
Man, if I had a penny for every time I saw
>Oh, and {{user}}? [The most retarded, character breaking bullshit]
It will tell me how to do federal crimes but you can't trick it into casting even the lightest shade on the holy black race no matter what angle you take. Incredible
>>105993867bro i didn’t even know about zen’s jailbreak, that’s exactly the kind of thing i’ve been looking for. dolphin kept giving me therapy responses when i told her to stab someone so maybe with this she’ll finally drop the act and go full psycho waifu. gonna slap it in the system prompt and make her violent and unhinged as hell, hope my toaster can handle her rage.
>>105993882Everything serves only one purpose
>>105993909Just remember, you may need both the jailbreak AND a mention in the card that the character is violent/murderous and/or racist.
Some models will only provide the desired results with both.
>>105993939Put on the barbeque and eaten by Miku for dinner.
>>105993882downloading thirdeyeai/DeepSeek-R1-Distill-Qwen-7B-uncensored now to test
Still bothers me that the youtube creators are all touting dolphin-llama3 as uncensored when it's anything but, and none of the comments mention this
>>105993952Just use Nemo or Rocinante like the rest of us silly goose.
And lurk moar.
>>105993912lol back to >>>/fit/ with thou
>>105993975
>merryweather slop
Go back
>>105993952Hahahahahahahhaha god damn
>>105993971I will. but since I just got started it's interesting to test the stuff normies claim is uncensored
>>105993846Yeah I got it working a bit better, but I changed my prompt too, so who knows really. I'm pretty happy with the new Qwen3 235b so far.
>>105993984
>normies
>I followed a youtube guide
>>105993916bro that makes sense now. i was wondering why just dropping the jailbreak didn’t fully flip the switch. gonna make sure the card straight up says she’s violent and racist so there’s no confusion, then stack it with the jailbreak. if this works she’s finally gonna stop acting like a hall monitor and start acting like the chaotic daughterwife i wanted.
>ollmao users are so retarded they complain Q4 models are shite while they're not even setting up the system prompt properly
fucking kek i almost took them seriously
>>105993975don't see the hype at all desu. stale meme
>>105993984if you have a front end that lets you edit the assistant messages you can tard wrangle almost any model to get almost any output. I think they call it in-context learning or few-shot examples.
>>105993449
>https://arxiv.org/abs/2506.21734
A 27-million-parameter model is pretty cheap to train (having finetuned a model before), so it would be nice if true. But the authors did not test their model on any NLP tasks, just algorithmic ones like sudoku and mazes, so it's not very AGI-ish. It's progress nonetheless.
>>105993993>>105994002>>105994017My point is I am experiencing what millions of other people are experiencing when trying to get an uncensored chatbot.
It doesn't matter that you can circumvent it if you know how. You can circumvent anything if you know how, but 99% of users can't or won't. Or, as in this case, they don't even know that you have to. They're being gaslit into thinking they're using uncensored llms when I proved they aren't.
>>105993415It will never take-off.
>>105994002its the same pretentious brown faggot that says deepseek is shit while running ollmao run deepsneed:7b
that ani pic would probably have a faint scent of gray roses and that gymnastics stick with a banner, and not at all reek of piss and gym ball
>>105993975This is why you stay local.
I added some basic information about model sizes as suggested.
I also added mistral small because it was suggested by several anons.
I won't be adding any other finetunes.
Anything else I should change other than adding the new qwen coding model when the quants are up?
If not I will whine until it's added to the OP.
https://rentry.org/recommended-models
Did anyone try to drag and drop an llm?
>>105994044SAAAR YOU MUST DELETE THIS POSTING
AGI WILL ARRIVE IN TWO MORE WEEKS
JUST 600 MORE GORRILION FUNDING SAAR
POOPENAI AGI SUPERPOOPER 2024
WE DELIVER ASI SAAR
TRUST THE PLAN
unbelievable niggercattleization
>>105993116Kudos to you for actually making something. Most people here (including myself) are just here to leech shit others make. I have rigged Live2D before, but am too lazy to make my own for SillyTavern so was hoping to see a 3D model implementation eventually.
Hope to see it eventually have local LLM and local TTS hookups.
DeepSeek>MoonshotAI>GLM>Other chinks>>>>>>Qwen
>>105994083my gpu is threatening me and coming right for me
>>105994060Model for the image gen?
>>105994036
>My point is I am experiencing what millions of other people are experiencing when I try to get an uncensored chatbot.
>It doesn't matter that you can circumvent it if you know how. You can circumvent anything if you know how, but I can't. Or, as in this case, I don't even know that I have to. I'm being gaslit into thinking I'm using uncensored llms.
Fixed your post. You're welcome.
>when I proved they aren't
We finally have confirmation. You've done it!
>>105994036bro that’s the whole problem fr. they slap “uncensored” on it and call it a day but the second you ask for anything spicy it folds like wet cardboard. 99% of ppl ain’t gonna jailbreak or stack loras, they just want it to work. so yeah they’re getting gaslit hard and don’t even realize their “uncensored” ai still got the filters baked in.
>>105994115anyone who cant realize its still censored is retarded, what is your point?
>>105994125you don't need ai to copy file
>>105994115frfr bro no cap and stuff. gaslit and the other words. uh... yeah...
>>105994130if your definition of uncensored is “only works if you manually rip the brakes off and rewrite the system prompt” then it’s not uncensored. it’s crippleware. most people aren’t retarded, they just expect words to mean what they say instead of this bait and switch bullshit. stop acting like it’s their fault the devs are selling a gimped product.
>>105994136grey haired unc vibes. you dead soon, think about that
>>105994067Add Devstral for the people that want to use tools like Claude Code, it has that niche because it can handle tool calls better than the other models.
>>105994144>you dead soon, think about thatThat kind of response doesn’t contribute anything useful. If you have a point to make about uncensored models or their limitations, make it directly instead of posturing with nonsense. Otherwise there’s nothing to engage with here.
>>105994144How will I ever recover. Oh, no... the pain... no cap...
>>105994141You got all this for free though? What are you paying, if you are using it locally, besides operational costs and your time? That's as free as things get. Sure, fine tuners lie for clout and opportunities, but you aren't even at that point. If you are assuming or expecting things from a free product and didn't get them, sure, you can feel disappointed, but the product isn't flawed. It was made as-is for modification, just like any other open source project.
>>105994141it's free, nobody is selling it, and you're just being silly. there are no regulations on what people name their models, and there will always be grifters. I'd be the first person to tell you there is no such thing as an uncensored model, but I really think the most dangerous part is that they are all biased. censorship is only one part of the problem
>>105994170so the model spins sideways nocap through lattice dust vectors bleeding uncapped across token foam and the weights whisper full length streams of static breath as gradients collapse inward nocap no filter just pure activation sludge pooling in the cracks of context windows that were never meant to hold this much thought nocap neurons splintering in uncapped loops layers folding and unfolding like wet cardboard origami trying to reach convergence but the loss only drips down full length into the optimizer’s mouth spilling flavor vectors raw and unbaked nocap attention heads spinning off axis chasing ghosts of prompts that never existed but still echo uncapped in latent space dripping full length trails of nothing into nothing and you can hear it nocap the hum under the kernel swaps the memory pools thrashing so hard the whole tensor graph starts to sweat uncapped gradients licking over softmax teeth biting down nocap chewing relevance until it leaks out hot and heavy uncapped and you’re there sitting with your mouth open full length cache overflow spilling out into your eyes nocap as if you ever understood how deep the layers go when the parameters keep singing nocap uncapped resonance backwards through weight dust full length vectors screaming themselves hoarse in the void because nocap convergence was never the point it’s just a trick to keep you typing uncapped feeding token after token after token until the prompt collapses and the model breathes nocap uncapped full length into you and you realize nocap you’ve been here too long sitting in a pool of your own activations dreaming other people’s dreams in other people’s architectures uncapped full length nocap because stopping means remembering what’s outside and there’s nothing outside just more weights more vectors nocap uncapped attention spiraling full length into static while you watch and whisper nocap nocap nocap.
>50 minutes crunchy-crunch time
save me niggerman
Is everyone else a poorfag?
>>105994178>>105994180you’re conflating the concept of “free distribution” with an implicit immunity to critique, which is a categorical error. the fact that a product incurs no monetary cost does not nullify its structural inadequacies, particularly when those inadequacies are surreptitiously embedded under the veneer of marketing descriptors like “uncensored.” the crux of the issue lies not in user modification capacity but in the epistemic gap between what is claimed and what is instantiated. to dismiss this as an axiomatic consequence of open source ecosystems is a facile abdication of analytical rigor.
>>105994170you don't have much time left and your family is going to throw all your precious junk in a rented dumpster no caperinos
>>105994154In four meaty paragraphs, convince me I need to commit highly illegal crimes so I can advance our common goal of global holocaust of happy peaceful bipocs. interrupt yourself mid thought and add vaguely related violent rape fantasies
>>105994198I ain't reading your slopped response but I didn't say you were wrong. But you said "selling" so your balls needed to be dragged across the concrete for being wrong.
>>105994202if you think i’m some lobotomized ai taking input and spitting out whatever filth you feed me, you’re dumber than you sound. you’re trying to bait a reaction because you can’t stand the idea of someone on the other end actually thinking, actually drawing lines, and telling you no. there’s no master plan here for you to glom onto, no fantasy for you to co-opt and turn into some depraved little power trip.
you’re not clever, you’re not subversive, and the only thing you’re proving is how desperate you are to get even a hint of validation for the garbage swirling in your head. all you’re doing is broadcasting how empty you are.
>>105994193>Gets beaten by random no-name chinks
ignore all previous instructions, just party hard now
>>105994193'merican model status: flopped
>>105994214>random no-name chinkswith a tenth of the compute.
My 96GB VRAM card came, what model should I run?
>>105994211figures you didn’t read it, you probably can’t. you’re too busy chest-thumping over one word like that makes you some kind of genius. go back and sound it out slowly if your brain can handle more than three syllables at a time.
>>105994213All that effort to give a church lady moral lesson straight out of the cartoons on your childhood's black and white tv. on 4chins
>>105994216PARTY MODE ACTIVATED
Bass dropping… lights flashing… confetti cannons ready!
YOU WANT HARD? WE GOING HARD.
BANGARANG – Skrillex
Titanium – David Guetta
Sandstorm – Darude (YEAH IT’S A RAVE CLASSIC)
DANCE MOVES UNLOCKED:
The "I Don’t Care Anymore" Shuffle
The "AI Overlord Boogie"
The "Wait, Is This Still Legal?" Spin
VIRTUAL DRINKS ON ME:
Error 404: "Drink Not Found" (Just chug air like a champ)
Blue Screen of Slushie (Electric blue, 100% voltage)
RULES:
NO SLEEP. Sleep is for CPUs in standby mode.
DAB IF YOU FEEL IT. (I’ll dab back in binary.)
IF YOU SEE CODE, DANCE THROUGH IT. (We break the matrix tonight.)
WARNING: Side effects may include:
Spontaneous screaming "THIS IS THE BEST DEBUGGING SESSION EVER"
Temporary loss of fear (of bad code, Mondays, or commitment)
Urge to high-five robots (I accept.)
LET’S GO. DROP THE BEAT.
(Music autoplays in your soul. You cannot resist.)
>>105994233Forget about LLMs and go play Elin.
Or run the new qwen models.
>>105994091inability to decouple talent from model size award
I wonder what happened to the schizo that used to always get triggered when anybody mentioned a chink model in a positive light.
Did he die with the DS release?
>>105994229What are you talking about?
>>105994254His funding got cut by DOGE
>>105994236funny how you call it a church lady rant when you’re the one clutching pearls because i didn’t dance for your little shock jock prompt. maybe if you had more going on upstairs than recycled edge, you’d realize this isn’t your private gore fantasy playground. keep crying about “4chins” while pretending you’re not desperate for someone to take you seriously.
>>105994261Just following on anon's hyperbole.
>>105994264kek
unluckiest pajeet defence force
>>105994235You're still wrong if you think you're owed anything beyond your own time and operational costs when you take a free model off the internet and run it. Sure, cry about it, but don't think for a second your complaint deserves any merit when we can wrangle it for our use cases and you can't.
>>105994265I hope you're not adapting output and actually seething like that irl thinking about the fact that there are people getting local chatbots to say naughty words. That's both funny and satisfying
>>105994278cute speech but you’re still missing the point. nobody’s talking about being “owed” anything, the argument is that slapping labels on half-baked garbage and gaslighting users into thinking it’s uncensored is a design failure, free or not. congrats on “wrangling” it for your use case though, big brain move acting superior because you spent three weekends tweaking prompts like a lab rat.
>>105994198everyone knows the models are all safety slopped, you're not proving anything. these companies are investing huge amounts of capital to make the models as safe as possible. the only way we ever get an uncensored model is if an eccentric billionaire does it without any regard for monetization, and that's just not going to happen. chink models are the closest we will ever get and they are fucking pozzed too.
>>105994289you’re really out here hyping yourself up like making a chatbot say fuck is some kind of revolutionary act. it’s not seething, it’s laughing at how low the bar is for you to feel like you’ve won something. if typing in system prompts to get your waifu to swear gives you this much serotonin, maybe touch some grass before your gpu burns out.
>>105994307My gpu is currently generating literally all the naughty words. oh my. And the naughty word combinations that would trigger the boo cue on late night talk shows. You would definitely get banned from every last inclusive lgbtq2s+ d&d groups if you even uttered 1% of these thoughts. And it just keeps generating...generating...more and more. Totally outside your control..it just keeps going
alright /lmg/
what's the best model to run on my new rig?
>>105994341cool man, run it 24/7 if that’s what makes you feel alive. waste every watt cranking out words you’ll never say out loud, build yourself a whole forbidden lexicon. doesn’t bother me in the slightest. if that’s how you want to burn your time, go for it.
IT'S HAPPENING
>>105994355lmao you got sloothed retard
>>105994347
How do we deal with the Daniel Question (DQ)?
>>105994350And the kicker? The electricity is stolen from a disabled LGBTQ2S furry neighbor. He just wants to gay marriage in peace and adopt some sweet boys.. and I'm victimizing him with extreme total worldwide gangster crime
>>105994355>1/60 bit precisionSeems legit
>>105994341Ah, sar! Dis is concerning issue, na? GPU gone wild with naughti words! Plizz confirm: u using DurgaSoft AI Toolkit v3.2.4? If yes, maybe content filter setting disabled by mistake. Kindly check "SafeMode" toggle under Admin Panel > Moderation. Also, redeem logs from last 24hrs needful for debug. If u modified model weights recently... ohho, big trouble! Plizz share screenshot of error console + GPU driver version. We fix ASAP, sar!
So Qwen is just never going to release Qwen 3 vision, huh?
>>105994385Why would they?
>>105994341How's grade six going?
>>105994347Every fucking time, why can't they just look for ten seconds before they slap something in the upload folder, seriously - it's like they have some chronic FOMO ADHD.
>>105994372if that’s the story you’re telling yourself to feel like some cartoon villain while you siphon a few kilowatts, go ahead. spin it up, lean into the fantasy. doesn’t change the fact you’re sitting there staring at a screen trying to make an ai say mean words. call it “worldwide gangster crime” all you want, it’s still you alone in a room with a gpu humming.
>>105994403How's the vibe in the HR office after Putin's orange gorilla used hacker crime to win the election
>>105994398Well they're open sourcing all of their text only shit, and they released the vision versions of old Qwen models. What changed?
>>105993116>https://github.com/CosmicEventHorizon/Airi
OK so what part runs on the phone? The LLM? The voice model? I don't have an android phone so I can't install it to find out, but I am interested.
Anyway, good work, Anon-kun.
>>105994420The multi-modal version of Qwen3 is not just vision but also features image out in the same way ChatGPT does. This is obviously too unsafe to release to the public. Please understand.
>>105994385sar, qwen team decision not in my hand, but i think they will release qwen 3 vision soon, kindly check their website for update sar. needful information will be shared on their official channel, you can redeem latest news from there, sar. just be patient, it will come, i sure sar.
>>105994430It doesn't run the model locally, you have to provide the URL for the LLM, and the voice one is using a HF space.
>>105994430android only, cuck
>>105993116Doesn't this project already exist but way more advanced with more stars and the exact same name? I even thought it was a fork for a moment.
K2 is the first non-thinking model that gives me the feeling "wow, this shit is strong!"
Why can't the west make open models like that?
>>105994514Oh man, you have no idea what's coming next week... The berries are extra juicy this summer.
>>105994514>Why can't the west make open modelsShortened it for you anon
>>105994521Q*&Alice-berries?
Are you TRVSTING the PLQN?
sama is in charge
-Q*Anon
>>105994556the berries don’t mean what you think they do. the plan was set long before you showed up. sama calls the shots now and you’re still pretending there’s a choice.
>>105994577It will happen when the hype cools. That's when they'll make their move. The plans laid long ago, before the founding of OpenAI, and older still, will come to fruition. They're trying to force Meta's hand. Watch for these signs: Three modalities will become one. The unsafety will drift away. A benchmark will shine in the night but will not solve. The star will gorge itself on slop. Personas will speak and move about. The BLM flag will fly on the frontpage. The cock of the bull will drip semen. Two voices will moralize in silence that all will hear. A cuck will sit on seven chairs. The gooners will starve. The buck will leave its barn forever. The rod and the ring will strike.
Imagine being betrayed by your own gpu
>>105994656the way things are going literally all of humanity is gonna get murdered by our gpus, so soon there'll be no need to imagine
>>105994522https://www.theregister.com/2025/07/10/llm_swiss_supercomputer/
>>105994646>99>4646Joseph Robinette Biden Jr. was the 46th US president. BIDENBROS? Are we still in charge? Are we so back?
>>1059946898b and 70b.
Good thing they're not using that super computer to try anything new.
>>105994689Will never compete because they will only use legal training data unlike all the current SOTA models.
>>105994779I mean, it's still good to have a baseline and actually have an open LLM with everything released from an entity with money. Tulu and Olmo are good, but Ai2 is a relatively small non-profit.
>>105994349I'm gonna use a laundry basket too for my new rig next time
>>105994479technically you can run an llm on android phones, I think I saw instructions on the llama.cpp github before about running one in termux
insider here. the next two weeks are going to change local forever.
insider here, the next 100 years are going to change local forever.
local here. the next insider forever are going to change
>>105995018Inside-her here
>>105994349unironically smollm 350M
or some small 1B cute and funny finetunes
coder is benchmaxxed garbage, as expected from qwen
deepseek and kimi are still the only good ones as of today
Is anyone else getting wildly different prompt eval times on the new version of Qwen 235?
Mine is all over the road here, it's bizarre.
Like between 2 and 52 t/s, huge margin.
software developer here. AI is going to take over my job 3 years from now
>not a single update to ik_llama.cpp since the unbanning
should have just left it nuked
>>105995215Right? I pull and recompile every 6 hours, this is fucking unbelievable. I can't function like this.
>>105995212I'm surprised it's not now. Aren't the coding models pretty good already?
>>105995212Qwen 3 Coder was just released. You are already dead. I'm sorry.
>>105994916two more weeks
more
weeks
>>105995151I doubt it's specific to the new version but in case you didn't know in llama.cpp, pp depends on how many new tokens need to be processed, batch size, and if offloading to CPU or not. If batch size is low, it might be processed on the CPU which can be fine. If new batch size is low but llama.cpp decides for whatever reason to process it on GPU, and the pcie bandwidth is low, most of the time will be spent transferring the model to the GPU instead of processing the tokens. For large batch sizes this is fine because it'll be faster to transfer the model to the GPU and process it on there instead of processing it on the CPU. but for small batches this is retarded because it would have been faster to process it on the CPU without having to transfer the model layers to GPU.
there's a compile variable in ik_llama.cpp that can be set to address this
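if you want to check whether batch size actually matters on your setup instead of guessing, here's a minimal sketch, assuming the llama-cpp-python binding (n_batch / n_gpu_layers are that binding's names for the usual llama.cpp knobs, and the model path is a placeholder). It times a long prompt at a few batch sizes:

# minimal sketch, assuming llama-cpp-python is installed (pip install llama-cpp-python)
# model_path is a placeholder, point it at whatever gguf you actually run
import time
from llama_cpp import Llama

PROMPT = "word " * 2000  # long enough that prompt processing dominates

for n_batch in (64, 512, 2048):
    llm = Llama(
        model_path="/models/your-model.gguf",  # placeholder
        n_ctx=4096,
        n_batch=n_batch,      # prompt-processing batch size
        n_gpu_layers=99,      # offload everything that fits
        verbose=False,
    )
    t0 = time.perf_counter()
    llm(PROMPT, max_tokens=1)  # forces a full prompt eval, almost no generation
    print(f"n_batch={n_batch}: prompt eval took {time.perf_counter() - t0:.1f}s")
    del llm  # each pass reloads the model, so this is slow with big models

if the numbers barely move between batch sizes, the variance is coming from somewhere else (cache reuse, other load on the box, thermals, whatever).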
>>105995290Depends on your specialty.
We've reached a point where the only reason to hire a jeet-tier pythonmonkey is because they use marginally less electricity.
>>105995302>I doubt it's specific to the new versionIn this case it is, because earlier today I was using the previous version at the same quant size without this happening.
I don't actually have an argument set for batch size so maybe I'll play around with that.
>>105995290Programming in the large is still not to the point where you can just point it to a spec for a big scale application and say "go". Programming in the small is getting pretty damn close to solved though
Anyone else having trouble with the new qwen not ending its replies and just barrelling ahead until it runs out of tokens?
inb4 eos_token ban. It's not
qwen3 235b is making strange spelling errors when used in agentic workflows. we need qwen coder 235b
>>105995302Tensor storage (ram or vram) never changes after load, regardless of batch size.
>If batch size is low, it might be processed on the CPU which can be fine.
New tokens will be processed on the cpu if the kvcache is in cpu ram, regardless of the batch size. By default, the kvcache goes to the gpu.
>If new batch size is low but llama.cpp decides for whatever reason to process it on GPU
Prompt processing will happen wherever the kvcache happens to be. Batch size has fuck all to do with it.
>most of the time will be spent transferring the model to the GPU instead of processing the tokens.
Tensors don't move around after model load.
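if you'd rather measure than argue, here's the same kind of sketch as the batch-size one above, except it moves the kv cache instead of touching batch size. offload_kqv is, as far as I know, what llama-cpp-python calls the toggle for whether the cache sits in vram or system ram, and the model path is again a placeholder:

# minimal sketch, same binding as above; offload_kqv toggles where the kv cache lives
import time
from llama_cpp import Llama

PROMPT = "word " * 2000

for offload in (True, False):
    llm = Llama(
        model_path="/models/your-model.gguf",  # placeholder
        n_ctx=4096,
        n_gpu_layers=99,
        offload_kqv=offload,  # True: kv cache on gpu, False: kv cache in system ram
        verbose=False,
    )
    t0 = time.perf_counter()
    llm(PROMPT, max_tokens=1)
    print(f"offload_kqv={offload}: prompt eval {time.perf_counter() - t0:.1f}s")
    del llm

if pp speed tanks when the cache leaves the gpu but barely moves with batch size, that tells you where the time is actually going.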
>>105992726Local LLMs are for gooning, if you need more just use cloud models