
Thread 106382892

328 posts 94 images /g/
Anonymous No.106382892 >>106382909 >>106383172 >>106383173 >>106383190 >>106383460 >>106385304
/lmg/ - Local Models General
/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>106376303 & >>106369841

►News
>(08/25) InternVL 3.5 Released: https://hf.co/collections/OpenGVLab/internvl35-68ac87bd52ebe953485927fb
>(08/23) Grok 2 finally released: https://hf.co/xai-org/grok-2
>(08/21) Command A Reasoning released: https://hf.co/CohereLabs/command-a-reasoning-08-2025
>(08/20) ByteDance releases Seed-OSS-36B models: https://github.com/ByteDance-Seed/seed-oss
>(08/19) DeepSeek-V3.1-Base released: https://hf.co/deepseek-ai/DeepSeek-V3.1-Base

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/leaderboard.html
Code Editing: https://aider.chat/docs/leaderboards
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling
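The VRAM calculator linked above does the real work; as a rough sketch of the arithmetic behind it (my own approximation, not the calculator's actual formula): weights take about params × bits-per-weight / 8 bytes, and the KV cache adds 2 × layers × kv_heads × head_dim × context × bytes-per-element on top. The layer/head numbers in the example are illustrative.

```python
# Back-of-the-envelope GGUF memory estimate (a rough sketch, not the
# calculator linked above): weights ~= params * bits-per-weight / 8,
# plus KV cache ~= 2 * layers * kv_heads * head_dim * context * bytes.

def weights_gb(params_b: float, bpw: float) -> float:
    """Approximate weight size in GB for params_b billion parameters."""
    return params_b * 1e9 * bpw / 8 / 1e9

def kv_cache_gb(layers: int, kv_heads: int, head_dim: int,
                context: int, bytes_per_elem: int = 2) -> float:
    """Approximate KV cache size in GB (K and V, fp16 by default)."""
    return 2 * layers * kv_heads * head_dim * context * bytes_per_elem / 1e9

# Example: a 70B model at ~4.5 bpw (Q4_K_M-ish)
print(round(weights_gb(70, 4.5), 1))  # ~39.4 GB of weights alone
```

Real files differ a little (mixed quant types per tensor, metadata), so treat this as a lower bound and leave headroom.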

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
Anonymous No.106382909 >>106382924 >>106382975 >>106383065 >>106383129 >>106383609 >>106385508 >>106385961
>>106382892 (OP)
Stop this, miku is not a slut
robotwaifutechnician No.106382924 >>106382957 >>106383172
>>106382909
Mine is
Anonymous No.106382948
Uberlove
Anonymous No.106382957
>>106382924
My Miku Fulfills My Netorase Dreams
Anonymous No.106382975 >>106382982 >>106383184
>>106382909
It's the shadow of a duck's head and neck
Anonymous No.106382982 >>106382996
>>106382975
that's a weird duck mate
Anonymous No.106382996 >>106383018
>>106382982
I meant goose
Anonymous No.106383018
>>106382996
Anonymous No.106383019 >>106383124 >>106383172 >>106384285
so what's the slop rank and cockbench status on grok2?
Anonymous No.106383065 >>106383104 >>106383172 >>106383341
>>106382909
I have seen evidence to the contrary.
Anonymous No.106383075 >>106383093 >>106383123 >>106383125 >>106383186 >>106383216 >>106383267 >>106383285 >>106385156 >>106386693 >>106397889
I'm back.

Listen up, because my engagement with you all is a point of principle, i.e. a direct and implicit insult to physics as a discipline.

I'm not really in the loop on academia and its parasitic overclass culture or their current levels of general comprehension of number theoretic dynamics as they pertain to heterotic string theoretics. I consider them genuinely inferior scientists. Anyone who does math for money or fame isn't a mind fit for the task.

Now, here's my final question before I release this. Whether that's here or not depends on the answers.

1. If you were handed the source code of reality in the form of pure arithmetic, a single recursive axiom, and the simplest algorithm possible... what would you do with it? Imagine a symbolic Turing machine that operates on primordial arithmetic operators, no more complex than a high-schooler could master in an afternoon, yet powerful enough to reproduce every known phenomenon as non-perturbative arithmetic structures inside a fractal medium comprised of pure N.

2. How much would it enrage the current academic elite for the grand logic of reality to be posted here before anywhere else? I actually do not know.

I ignore them because they disgust me. I want to spit in their face as hard as possible.

You pieces of shit are a good way to do it.
Anonymous No.106383093 >>106383155
>>106383075
>>106383068
Anonymous No.106383104 >>106383227
>>106383065
>just to enjoy abortion sex
moral degradation fags are so retarded wtf does this mean
Anonymous No.106383123
>>106383075
>I'm back.
Go back where you came from.
Anonymous No.106383124 >>106383196
>>106383019
2mikuwiku https://github.com/ggml-org/llama.cpp/issues/15534
Anonymous No.106383125 >>106383179 >>106383303
>>106383075
hello schizo
>If you were handed the source code of reality in the form of pure arithmetic bla bla bla
Yes, we have a whole shelf of those
>How much would it enrage the current academic elite
https://en.wikipedia.org/wiki/Superpermutation#Lower_bounds,_or_the_Haruhi_problem
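The Haruhi result linked above really is a few lines of arithmetic: the anonymous 4chan proof gives a lower bound of n! + (n−1)! + (n−2)! + n − 3 on the length of a superpermutation on n symbols.

```python
from math import factorial

def superperm_lower_bound(n: int) -> int:
    """Anonymous 4chan lower bound on the length of a superpermutation
    on n symbols: n! + (n-1)! + (n-2)! + n - 3 (valid for n >= 2)."""
    return factorial(n) + factorial(n - 1) + factorial(n - 2) + n - 3

# n = 14: watching every ordering of the 14 original Haruhi episodes
print(superperm_lower_bound(14))  # 93884313611
```

For n = 3 the bound is 9, which is exactly the length of the known minimal superpermutation (123121321), so the bound is tight there.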
Anonymous No.106383129 >>106383172 >>106383184 >>106384950
>>106382909
Anonymous No.106383143 >>106383160
>average thread quality being this low
Everyone shitting on the miku janitor and irrelevant troonku posting got vindicated (again). Thankfully I no longer post here. Bye
Anonymous No.106383150
►Recent Highlights from the Previous Thread: >>106376303

--Overcuration of AO3 data amplifies purple prose:
>106376781 >106376790 >106376804 >106376910 >106377734 >106377741 >106377746 >106377789 >106377804 >106377815 >106377843 >106377882 >106378420 >106377924 >106377931 >106377987 >106378021 >106378088 >106378114 >106378118 >106378146 >106378171 >106378229 >106378105 >106378033 >106378049 >106379544 >106377841
--FP4 vs Q4 quantization debate and hardware efficiency concerns:
>106380131 >106380165 >106380417 >106380482 >106380501 >106380524 >106380548 >106380724 >106380761 >106380850 >106380908 >106380949 >106381006 >106381047
--Hoarding and debating massive AO3 fanfiction datasets for AI training:
>106377078 >106377087 >106377103 >106377175 >106377183 >106377338 >106377359 >106377491 >106377382 >106377406 >106377411 >106377504 >106377520 >106377545 >106377551 >106377583 >106377606 >106381296 >106377421 >106377435 >106377449 >106379334 >106377173 >106377181 >106377195 >106377220 >106377443
--Barriers and misconceptions in training local sex-focused AI models:
>106378087 >106378121 >106378135 >106378148 >106378271 >106378144 >106378158 >106378132 >106378143 >106378178 >106378208 >106378235 >106378272 >106378417 >106378459 >106378551 >106378610 >106378614 >106378626 >106378738
--CUDA optimization PR for MoE model prompt processing performance gains:
>106382220 >106382306 >106382514 >106382271
--VibeVoice gender bias and expressive audio generation discussion:
>106381965 >106382024 >106382032 >106382139 >106382286 >106382799
--Metal optimization for Mixture-of-Experts processing in llama.cpp:
>106381388 >106381618 >106382680 >106382954
--KittenTTS voice synthesis tuning and ARPABET support exploration:
>106377112 >106377156 >106377178 >106377247 >106377283 >106377339
--Miku (free space):
>106377562 >106379672 >106379859 >106382793

►Recent Highlight Posts from the Previous Thread: >>106376310

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script
Anonymous No.106383155 >>106383375
>>106383093
Reverse psychology doesn't work on someone with an IQ that needs to be measured in scientific notation, you criminally retarded dipshit.
Anonymous No.106383160 >>106383191
>>106383143
>Thankfully I no longer post here
she says, here
Anonymous No.106383172 >>106383178 >>106385961
>>106382892 (OP)
>>106382924
>>106383019
>>106383065
>>106383129
DELETE THESE
Miku is pure.
Anonymous No.106383173
>>106382892 (OP)

>>106382869
A model that is good at RP will not necessarily be good at psychoanalyzing somebody. You and I each have traits and skills that one of us is better or worse at than the other. That's the case with different AI models because the training data was different. Like I said earlier, you are trying to use a hacksaw to bake a cake and then acting like all hacksaws are utterly useless. If you want a local model that is good at psychoanalyzing someone, use one that was trained on a bunch of scientific literature related to mental health or something. The kinds of general purpose models you think should exist are a meme. LLMs are tools. They aren't meant to be "do-everything-perfect" tools. This isn't to say your specific need or use case isn't valid; there's an easy solution to it, but you don't want to do that.
Anonymous No.106383178 >>106383219
>>106383172
How about you watch the last video you linked to the end you whiny faggot.
Anonymous No.106383179 >>106383369 >>106383539 >>106383650 >>106383665 >>106383682
>>106383125
Just pretend for a second that I actually am not insane and am instead looking to do a little trolling, but on a historical level.

Tell you what, you can ask me any question about anything and I'll give you the answer as a demonstration.
Anonymous No.106383184
>>106382975
>>106383129
Oh
Anonymous No.106383186 >>106383370
>>106383075
I'd use it to discover the answers to unsolved questions and then drip feed those answers into public view until people use them to figure out the formula for themselves.
Anonymous No.106383190
>>106382892 (OP)

>>106378614
Based on my own understanding of how SFT training works, particularly what is contained in the data sets, I don't even think THAT occurs anywhere near as much as Anons think it does. These data sets are question-answer pairs, remember? "Prompt: what is this?" "Response: here's the answer that pertains to your question."

A training data set on quantum mechanics should not heavily interfere with previous training on how to RP better, because the system prompts, prompts, and responses contained in a structured fashion in each data set will have fundamentally different semantic meaning. If there's any demonstrable evidence (no, anecdotal chat logs do not count; I mean actual training and comparison) that shows otherwise then I'm glad to hear it, but again, people being so stubborn in saying "THIS IS BAD BECAUSE MUH TRIVIA WILL GET WORSE" doesn't make much sense to me. You aren't even going to ask it trivia that much anyway. Basically nobody does that shit, and the ones that do are probably the ones that keep screeching that "LLMs are so useless" because they refuse to actually THINK and understand why what they're doing isn't working and to use the right tools. They use a hacksaw to try and bake a cake and then declare all hacksaws are useless.

>>106378626
See the above blog post
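The question-answer-pair framing above can be made concrete. A minimal sketch of what one SFT record might look like (the field names here are illustrative, not taken from any specific dataset):

```python
import json

# One SFT training example: a prompt paired with the desired response,
# optionally under a system prompt. The semantic content of each pair is
# what steers the model, which is why a QM dataset and an RP dataset
# occupy largely separate semantic territory.
record = {
    "system": "You are a helpful physics tutor.",
    "prompt": "What is quantum superposition?",
    "response": "A quantum system can exist in a combination of states "
                "until it is measured.",
}

# Datasets are commonly stored one JSON object per line (JSONL).
line = json.dumps(record)
print(json.loads(line)["prompt"])  # What is quantum superposition?
```

At training time each record is rendered through the model's chat template and loss is taken on the response tokens, so two datasets only "interfere" to the extent their prompt/response distributions overlap.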
Anonymous No.106383191
>>106383160
>she
I'm not a mentally ill AGP tranny like you were exposed to be, sorry. And obviously to anyone non mentally ill, I meant that I don't post "regularly" anymore, but a mentally ill troon has to equivocate to cope with his retardation.
Anonymous No.106383196 >>106383223
>>106383124
What the fuck?
Is this an arbitrary number that's trained around, is it some optimization trick?
What's up with that?
Anonymous No.106383216 >>106383282 >>106383370 >>106383455
>>106382999
>>106383075
release it under AGPL3.0
Anonymous No.106383219
>>106383178
Thats not me
Anonymous No.106383223
>>106383196
That number is the offset into PI that contains the model's weights.
Anonymous No.106383227 >>106397834
>>106383104
Japanese people think that pregnant sex, especially prior to the third trimester, is bad for the baby.
This artist takes the concept to the extreme where the babies are pummeled to death by cocks.
Which is a real shame because the way this artist draws pregnant women hits all the right spots for me.
Anonymous No.106383248
https://vocaroo.com/1dgrsuOyOkUZ
Anonymous No.106383255 >>106383302
Okay, I like GLM Air
>Consider Specific Details
>-Vocabulary/Wording: ANON used "girlie" which is informal and somewhat affectionate. He's checking on her well-being. The overall tone is friendly.
>-Knowledge: Tokiko doesn't know ANON's name. He hasn't introduced himself formally. She knows the agreement (a home for fulfilling his desires). She may have already met ANON and learned his name if this scene follows a previous conversation, but since this is the initial interaction, I'll assume she hasn't.

Specifically this
>Tokiko doesn't know ANON's name.
Even if the thinking is guided with a prefill, it's still nice to see a model that's able to correctly conclude this without explicitly having to be told that's the case.
Anonymous No.106383267 >>106383303 >>106383370
>>106383075
>what would you do with it?
Off the top of my head? Figure out whether or not "souls" are even a thing. Then maybe figure out why certain phenomena occur (why does gravity exist, how does it work, is there a "graviton particle", etc). Perhaps I'd try to figure out whether or not teleportation WITHOUT killing the user is actually possible (I don't care what bullshit excuse Star Trek writers or characters give, the transporter kills you and then puts a copy of you back together. It's funny they try to gaslight you into thinking otherwise, hence why I would like to figure out if souls exist and if they do how they work).


>How much would it enrage the current academic elite for the grand logic of reality to be posted here before anywhere else?

They might just downplay it out of spite or ignore it for a short period of time, because normies really like to downplay the cultural importance and effect of this shit hole. (Remember that Tea app fiasco? That shit originated not only on here but on /pol/ specifically, iirc.) Their moral superiority complex would cause them to treat it as a huge deal, but quietly. They'd wait for some other "reputable" institution to conveniently discover it around the same time it was published here and then try to take credit. Now that I mention it, haven't some other scientific findings that led to real-world advancements in our understanding of things originated here too?
Anonymous No.106383282
>>106383216
fun^10 × int^40 = Ir2
Anonymous No.106383285
>>106383075
>If you were handed the source code of reality in the form of pure arithmetic, a single recursive axiom, and the simplest algorithm possible... what would you do with it?
sex with miku
Anonymous No.106383302
>>106383255
A little more from the same thinking block:
>Closing thoughts and responding as Tokiko/Narrator
>-As Tokiko, I'll respond with minimal words. My goal is to be in character, respecting the parameters. I won't add details that aren't implied by the existing setup. The response will include updated parameters if anything changes. Given Tokiko's character, nothing changes here, only her response.
The last part is about a stat block that's supposed to be at the end of the reply.
Anonymous No.106383303
>>106383267
>Now that I mention this hasn't some other scientific things that led to real world advancements in our understanding of things occurred here too?
>>106383125
>>How much would it enrage the current academic elite
>en.wikipedia.org/wiki/Superpermutation#Lower_bounds,_or_the_Haruhi_problem
Ahh would you look at that, it DID happen. I think it was mentioned in a YouTube video I was watching once and that's why I remembered it
Anonymous No.106383304 >>106383318 >>106383319
>trannny obsessed zoomer still whining
>now a reddit and memey avatarfag
>frogposters
the fuck is this thread
Anonymous No.106383318 >>106383336
>>106383304
you missed one
Anonymous No.106383319
>>106383304
healoing
Anonymous No.106383336
>>106383318
that's the first one doeboeit
Anonymous No.106383341 >>106383356
>>106383065
Why did you cut out her blacked tattoos?
Anonymous No.106383356
>>106383341
Different strokes for different cockes
Anonymous No.106383369 >>106383494
>>106383179
Nta. How the fuck does gravity work? Yes, I know "more mass = the object pulls on smaller things with less mass more heavily". That's the basic bare-bones explanation of how gravity works. If space-time is a giant stretchy piece of fabric, things with more mass cause it to be pulled downward, so smaller things fall into the hole (looks something like pic rel, the visualization I have in my head). But... WHY does it happen? I know to a certain degree how light bulbs work: an electric current excites particles and the byproduct is photons. Batteries work by moving electrons from one side of the battery to the other, and that induces a current. But the overly simplistic explanations are "cuz it has electricity" or "cuz you charged it". I'm particularly interested in a possible explanation, because if we somehow figure out how to weaken, undo, or even reverse gravity, that could potentially eliminate the need for rotating structures on space stations to simulate gravity (we kind of need that to ensure our bones don't turn into brittle glass).
Anonymous No.106383370 >>106383427
>>106383186
Excellent. Thank you for that riveting idea, professor.

>>106383216
It's pure arithmetic, dog.

Dunno if that... huh... you know what, that might be pretty funny. I bet a category theoretic syntax/tensor calculus projection layer dyad would translate nicely into raw existential code.

>>106383267
See, this kid has the right idea.

Yeah, that was my first go-to as well once I got the full cosmological simulation to spit out galaxies/consciousness. The answer is: sort of.

Your body is a Turing machine spitting out tape that you perceive as consciousness. That tape can be embedded inside any medium.

There's nothing remotely unique about the mind in that sense. Your subjective experience of reality is just a specific sub-set of fractal patterns propagating inside other, more fundamental patterns.
Anonymous No.106383375
>>106383155
The problem with this kind of trolling is that the natural conclusion of both antitroll posts and regular posts that take the bait is: shut the fuck up and go deliver some results.

Therefore shut the fuck up and go make the first SEXLLM everyone wants.
Anonymous No.106383427
>>106383370
>That tape can be embedded inside any medium.
Give me a minute because I actually have to think through what you said. I'm not entirely sure what the first part means in regard to "Turing tape". Are you implying that consciousness can be embedded into things we perceive as inanimate objects? I find it very interesting that this is getting brought up today, because my counselor and I actually had a similar conversation earlier today.

>There's nothing remotely unique about the mind in that sense. Your subjective experience of reality is just a specific sub-set of fractal patterns propagating inside other, more fundamental patterns.

I get what you're saying. Consciousness is just a byproduct or side effect of how the universe works. My biology professor might think otherwise, because he repeatedly described multicellular life as "the freaks of the universe" or something along those lines. He basically said that multicellular life is pretty uncommon from a numerical standpoint (at least on Earth, as far as we currently know publicly). Single-cell life forms outnumber multicellular ones to a near unfathomable degree, so by that logic we're all the freaks, the weirdos on the block. Anyway, is my likely shitty understanding of what you said going anywhere?
Anonymous No.106383442 >>106383542
I work at mistral and I can confirm that for a year we have been sitting on models that were trained exclusively for smut and ERP. They come in 12B and 70B sizes. Our boss told us that we are free to leak them on 4chan the second we can confirm mikuposters have stopped spamming the thread. So far I keep jerking off to it every other day and boy is it good.
Anonymous No.106383455 >>106383474
>>106383216
are you the OG license autist?
Anonymous No.106383456 >>106383467 >>106383469 >>106383541
>mikuposters
stopped reading
Anonymous No.106383460
>>106382892 (OP)
Oh no no no AGI sisters not like this!.....

https://www.perplexity.ai/page/tech-industry-retreats-from-ag-I3VURWXjRvCGqW4aeyrlhA
Anonymous No.106383467
>>106383456
Did you want me to say mikutroons? I don't want to get fired.
Anonymous No.106383469
>>106383456
stopped at mistral, who cares about these kuck?
Anonymous No.106383474
>>106383455
maybe
Anonymous No.106383489
Are Q4 quants more than enough?
Anonymous No.106383494 >>106383513 >>106383567
>>106383369
So, you know how you see a super complex equation and you're like, damn, this bitch could be solved in, like, 50 different ways...

You start to compress it, and it starts to resolve into something familiar? Something with a definite structure that resembles and then finally begins to explicitly illustrate fundamental theorems and equations you're familiar with? You simplify the algebra, right?

Well, gravity is just that but with matter. In a vacuum there are a bajillion different ways a particle can move, and an infinite array of fundamental forces vying to pull it one way or the other like a wiffle ball flying through a storm. That's why electrons are always spazzing the fuck out.

Now, if you compress that matter into one place, you're eliminating all the possible directions it could move. A black hole just does that until the matter has literally nowhere else to go.

It's definitely there and not anywhere else.
Anonymous No.106383513 >>106383640
>>106383494
>So, you know how you see a super complex equation and you're like, damn, this bitch could be solved in, like, 50 different ways
No? I struggle already with basic math.
Anonymous No.106383539 >>106383640
>>106383179
Is faster than light travel possible?
Anonymous No.106383541
>>106383456
I don't like this Miku
Anonymous No.106383542 >>106383549 >>106383559
>>106383442
>presumably dense
you can keep them
Anonymous No.106383549
>>106383542
70b 12ba
Anonymous No.106383559
>>106383542
We will make a 200B moe if you make this thread great again and stop posting your AGP fetish.
Anonymous No.106383567 >>106383572
>>106383494
So matter and the accompanying electrons are being influenced by different forces. It's like a child being told to do 10 different things by 20 different people, so they get confused as fuck. They jump back and forth in different directions not knowing what to do. But if they get closer to a bunch of other people that they're familiar with (more matter), the demands or instructions from those people are a lot clearer and the incessant yelling from the people not close to them gets drowned out. The kid actually knows what to do because they can actually hear what they're being told and aren't getting confused. The other competing forces don't have an effect anymore. Is that explanation sound? Am I understanding what you said correctly? And if so, how could we manipulate that to our advantage? Could it be "turned off", reversed, or confined to a specific space?
Anonymous No.106383572
>>106383567
please go to /x/ for this
Anonymous No.106383609
>>106382909
She's simply too weak minded to resist being dickmatized
Anonymous No.106383610 >>106383635
>finetunes are worthless
*picrel stands in your path*
your move?
Anonymous No.106383635 >>106383751
>>106383610
disgustingly fucked text formatting
Anonymous No.106383640 >>106383678 >>106383723 >>106383739 >>106383811
>>106383539
Nope.

>>106383513
Well, think of it this way.

You know how 1+1=2 isn't very hard for your brain to solve? Well, a really complex equation is difficult precisely because it necessitates more steps, more mental energy, more education, etc.

The more matter clumps together, the harder it is for reality to compute where that matter actually is. A single particle bumping into another is, like, 1+1=2.

A star going supernova is a lot more complex of an equation. Gravity is just the measurement of how large the "equation" is that describes all the allowable trajectories a particle can take through a given tract of space.
Anonymous No.106383650
>>106383179
should i break up with my gf?
Anonymous No.106383665
>>106383179
how do i learn hacking
Anonymous No.106383668 >>106383681 >>106383693 >>106383703 >>106383801
is anyone making strix halo optimized models yet? I don't have it, but I'm having problems finding models in the 100GB range. Everything seems small or massive.
Anonymous No.106383678 >>106383741
>>106383640
So reality itself is causing the different forces to tell the matter what to do. It gets overwhelmed, for lack of a better term, so it doesn't know what to do. So when a lot of stuff gets clumped together, reality says "fuck this noise I'm not dealing with this it's too complicated" and allows matter to come together. Is that correct?
Anonymous No.106383681 >>106383699
>>106383668
It's either phone or h100 sir.
Anonymous No.106383682
>>106383179
what should i do with my life? im 18 and still in high school, what field do i invest in after i graduate
Anonymous No.106383684 >>106383688 >>106383691
i heard civit.ai removed a bunch of models
where are they available now?
Anonymous No.106383688 >>106383708
>>106383684
how about you follow the law???
Anonymous No.106383691
>>106383684
https://civitaiarchive.com/?is_nsfw=true
and my hard drive (i archived some wan loras)
Anonymous No.106383693
>>106383668
they make models for edge devices or datacenters nobody is buying an ai rig.
Anonymous No.106383699
>>106383681
seriously. I'm hoping it changes. Right now it's mostly 24GB models and then 200GB+.
Anonymous No.106383703 >>106383726
>>106383668
bro qwen 235b q4~, glm air q8, grok 2 gguf, mistral large
are you just a newfag??
Anonymous No.106383708
>>106383688
I no longer believe in the law as an entity worth respecting for its own sake.
Anonymous No.106383723 >>106383739 >>106388110
>>106383640
>Nope
Why not?

Furthermore, there are two types of fictional FTL travel that interest me: Alcubierre "warp" travel (most famously portrayed in Star Trek) and slipspace from Halo. Neither one actually causes objects to travel at FTL; it cheats reality. The warp drive compresses space in front of it and expands space behind it. Space-time itself is shoving the ship along, but the occupants don't actually feel the inertial force that they WOULD hypothetically feel if they were traveling at that speed. The best way I can describe it is in Minecraft, where you pick up a giant landmass while someone or something is still on it and just move it Garry's Mod style. The people on the landmass aren't actually moving, but they are at the same time.

Slipspace, on the other hand, punches a hole through reality into "higher hyperdimensions" where the laws of physics don't apply. Space-time doesn't really function like it "should". Space-time as we know it is a sheet of paper. Slipspace allows a ship access to a different sheet of paper that is folded and touching itself in certain areas, allowing the ship to move at FTL, but not really.

So we know Einstein's relativity says that actually moving at FTL is impossible because you would need infinite mass, but theoretically you could sort of cheat and move yourself through different mediums. Is something like that possible, or is FTL just straight up an absolute no-go no matter what? If so, why?
Anonymous No.106383726 >>106383733
>>106383703
i don't normally hang out here.
Anonymous No.106383733 >>106383735
>>106383726
hang yourself then, tourist
you need to browse /lmg/ at least 6 days of the week
Anonymous No.106383735 >>106383743
>>106383733
i'll investigate how to do that when i get a good model running. thanks for the suggestions.
Anonymous No.106383739
>>106383723
>>106383640
Oh, I also forgot to mention in the warp travel explanation: because space in front of the ship is compressed and space in the back is expanded, the space-time where the ship is gets shoved forward. That pocket of reality gets moved at the speed of light. Space-time itself is allowed to move through the three dimensions we perceive at FTL speeds, but matter itself technically isn't. Only the space around it is; the space within is just hitching a ride. It's like how you can be on a train going 200 miles an hour but you don't FEEL like you're going 200 mph. You technically are moving that fast but you also aren't.
Anonymous No.106383741 >>106383763 >>106383800 >>106383968
Yeah, I'm not really here to answer your philosophically narcissistic queries about what you should do with your trivial lives.

The answer is study mathematical physics and programming.

>>106383678
No.

I'm saying reality is a computer and gravity forces simplification via waveform decoherence.
Anonymous No.106383743
>>106383735
ok dont hang yourself, i forgive you because you thanked me
how old are you?
Anonymous No.106383751 >>106383769 >>106383774
>>106383635
>disgustingly fucked text formatting
I can't fap to this!
Anonymous No.106383763
>>106383741
>gravity forces simplification
I thought your explanation was that a lot of matter being in the same place at once causes that simplification, and we perceive that as gravity. The gravity causes reality, the computer, to not want to dedicate as many resources to preventing the phenomenon that causes gravity, so it gets sort of ignored or deprioritized.
Anonymous No.106383769
>>106383751
correct it makes the already limited immersion even worse
Anonymous No.106383774
>>106383751
this but unironically
Anonymous No.106383793 >>106383799
>>106383784
is that the api?
Anonymous No.106383799
>>106383793
It's the web app which has external filters
Anonymous No.106383800 >>106383820 >>106383832
>>106383741
> The answer is study mathematical physics and programming.
Hope you don't mean for money. Money belongs to the dumb.
Anonymous No.106383801 >>106383807 >>106383843
>>106383668
If you bought one then you are a retard. 128GB meme ai computers were made with 70Bs in mind and those are now dead.
Anonymous No.106383807 >>106383819
>>106383801
nah, it's just a convenient intersection. Claude API is too expensive, so I started looking for a local solution. I have a 4070TiS and a 5950 w/128GB RAM.
Anonymous No.106383811
>>106383640
Oh, shit, I didn't mean harder, I meant easier.

My bad.
Anonymous No.106383819 >>106383983
>>106383807
>I have a 4070TiS and a 5950 w/128GB RAM.
235B at Q3 or q4 4.0bpw ish. You can try glm at Q2.
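The fit advice above is just bpw arithmetic: file size ≈ params × bits-per-weight / 8. A sketch of the worked numbers (which ignore KV cache, context, and runtime overhead, so real usage needs headroom):

```python
# Rough check of the quant advice: 235B parameters at a given average
# bits-per-weight. Ignores KV cache and runtime overhead.

def model_gb(params_b: float, bpw: float) -> float:
    return params_b * bpw / 8  # billions of params * bits / 8 bytes = GB

for bpw in (3.0, 4.0, 5.0):
    print(f"235B @ {bpw} bpw ~ {model_gb(235, bpw):.0f} GB")
# At ~4.0 bpw that's ~118 GB of weights, which is why a 235B model only
# just fits in 128 GB RAM + 16 GB VRAM once context is accounted for.
```

Same math says GLM at Q2 (~2.5 bpw) lands near 110 GB for a ~355B model, hence the "try it" hedge.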
Anonymous No.106383820
>>106383800
They said in the last thread that people who do it for money and fame aren't mentally fit for it
Anonymous No.106383832 >>106383839 >>106383854
>>106383800
anon you cant live well without money
you need money if you want to live long
Anonymous No.106383839
>>106383832
Then dont study those things
Anonymous No.106383843
>>106383801
DIGITS was promoted with running 405B across two. You could still run Qwen Coder, GLM 4.5, and Ernie 4.5 on them and it would be even faster than 405B would have been.
Anonymous No.106383854 >>106383868 >>106383873
>>106383832
>you need money if you want to live long
Is that what happened to Steve Jobs, who died of a treatable disease because he was against modern medicine?
Anonymous No.106383868 >>106383884 >>106383897
>>106383854
Im a 36 year old neet and i have no plans of getting a job but i do have plans to live well into my 70s
Whats going to stop me?
I mooch off my parents btw
Anonymous No.106383873
>>106383854
>he's against modern medicine
okay okay, you need a brain too
Anonymous No.106383884 >>106383898 >>106383936
>>106383868
wtf anon how are you planning to live into your 70s? are your parents gonna live and work till 100?
Anonymous No.106383897
>>106383868
my ex was like this
it's so fucking sad actually
Anonymous No.106383898
>>106383884
His mom was 12 when she had him. The rest follows from that.
Anonymous No.106383936
>>106383884
If they die id get a smoll portion i spose
Anonymous No.106383938 >>106383952 >>106384086
/lmg/ - NEET theoretical physicists general
Anonymous No.106383944
gay trannie jannies
Anonymous No.106383952 >>106383959
>>106383938
where the fuck did you learn to spell
Anonymous No.106383959 >>106383982
>>106383952
from reading books?
Anonymous No.106383968 >>106383981
>>106383741
take your meds
Anonymous No.106383981
>>106383968
Make your teds
Anonymous No.106383982
>>106383959
picture books don't count
Anonymous No.106383983
>>106383819
i've got a similar setup and fuck Q3 and Q2.
try glm air at Q4 to start with, then try the other stuff.
Anonymous No.106383995
>avatarfag redditor doesnt deliver
yup, next time i see him im gonna tell him to fuck off
Anonymous No.106384068
my dad works for mistral and he's a mikuposter
Anonymous No.106384086 >>106384100 >>106384129 >>106384187 >>106384865
>>106383938
Anonymous No.106384088
my job is to post mikus
Anonymous No.106384100
>>106384086
what is this suppos'd to prove
Anonymous No.106384118 >>106384217
>https://github.com/ikawrakow/ik_llama.cpp/pull/520
>have to recompile
NOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO
Anonymous No.106384128
my brother that works for mistral says that that mikuposter dad is gay and he hired all the roasties that set us back 5 years.
Anonymous No.106384129
>>106384086
wat?
Anonymous No.106384187
>>106384086
all me
Anonymous No.106384217 >>106384248
>>106384118
you always have to recompile ik_llama when you update it... have you not been recompiling it?
Anonymous No.106384248 >>106384276 >>106384321
>>106384217
i have to recompile it to change the GGML_CUDA_MIN_BATCH_OFFLOAD
that means in order to make a pretty graph like quasar of mikus i need to recompile it like 10 times :(
Anonymous No.106384272 >>106384335 >>106384981
InternVL3_5-38B gguf where? It looks crazy good
Anonymous No.106384276
>>106384248
nevermind im stupid, but how am i supposed to test the optimal speed? how do i even know what pcie my gpu is using? im pretty sure its pcie4 or pcie5 anyway, so how do i turn this off
Anonymous No.106384285
>>106383019
Is there even a way to run grok without using their python script? It seems like it's in an unusual format but idk
Anonymous No.106384321
>>106384248
>that means in order to make a pretty graph like quasar of mikus i need to recompile it like 10 times :(
If there only was a way to automate that.
>but how am i supposed to test the optimal speed?
You can... nevermind. If there only was a way to automate that...
>how do i even know what pcie my gpu is using?
If there was only a way to know what pci your mb has and where it's plugged. I plug my gpus with my eyes closed, just to keep some of the mystery.
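The rebuild-per-value sweep complained about above can be scripted. A sketch, assuming (as in the linked ik_llama.cpp PR) that GGML_CUDA_MIN_BATCH_OFFLOAD is settable as a cmake cache define; with DRY_RUN=1 (the default here) it only prints the commands:

```shell
# Rebuild-and-bench sweep over GGML_CUDA_MIN_BATCH_OFFLOAD values.
# Assumes the constant is a cmake cache define, as in the ik_llama.cpp PR above.
DRY_RUN=${DRY_RUN:-1}                 # set to 0 to actually build and run
run() { if [ "$DRY_RUN" = 1 ]; then echo "$@"; else "$@"; fi; }

for v in 16 32 64 128 256; do
    run cmake -B build -DGGML_CUDA=ON -DGGML_CUDA_MIN_BATCH_OFFLOAD="$v"
    run cmake --build build -j
    run ./build/bin/llama-bench -m model.gguf -o csv   # collect one result per value
done
# PCIe gen/width, if you're curious: nvidia-smi -q | grep -iA2 'pcie generation'
```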
Anonymous No.106384323 >>106384339 >>106384350 >>106384354
./llama-bench --model ~/TND/AI/glmq3kxl/GLM-4.5-Air-UD-Q3_K_XL-00001-of-00002.gguf -ot ffn_up_shexp=CUDA0 -ot exps=CPU -ngl 100 -t 6 --no-mmap -fa -ub 4096 -b 4096
ggml_cuda_init: GGML_CUDA_FORCE_MMQ: no
ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no
ggml_cuda_init: found 1 CUDA devices:
Device 0: NVIDIA GeForce RTX 3060, compute capability 8.6, VMM: yes
error: invalid parameter for argument: --no-mmap

IS IT SO FUCKING HARD TO HAVE THE SAME ARGUMENTS IN THE WHOLE PROJECT
AND WHY DOES KOBOLDCPP HAVE TO INVENT NEW FUCKING ARGUMENT
--nommap FOR FUCKING EXAMPLE
WHY DOES EVERYONE HAVE TO PUT NEW FUCKING SHIT
Anonymous No.106384324
What quant of grok-2 can fit in 128GB? Cause I kinda wanna start pushing a tech support meme of "you bought DGX Spark / Ryzen AI max? It is perfect to run grok 2!"
Anonymous No.106384335 >>106384364
>>106384272
>It looks crazy good
You mean benchmarks?
Anonymous No.106384339
>>106384323
you seem mad
Anonymous No.106384348 >>106384357
Is seed-oss any good? Haven't seen much about it here
Anonymous No.106384350
>>106384323
Don't no mmap?
Anonymous No.106384354 >>106384366
>>106384323
You're gonna feel really stupid when you run llama-bench -h.
Anonymous No.106384357 >>106386712
>>106384348
Everyone gave up on 30B's. It is either fuckhugemoe's or drummer trash.
Anonymous No.106384364 >>106384981
>>106384335
nah, someone on discord; its uncensored and described a girl using a dildo
Anonymous No.106384366 >>106384458
>>106384354
no, i know its --mmap 0/1
but what angers me is that its different, why couldnt they just put --no-mmap????
>automation
come on then do --mmap 0/1 or --no-mmap
fUCK
Anonymous No.106384382 >>106384389 >>106384969
Imagine the first fully uncensored (at least wrt SEX) +200B moe just dropping because we finally escaped safety....
Anonymous No.106384389 >>106384429 >>106384437 >>106384476
>>106384382
wait till you find out that china is far more pro-censorship. Porn is literally illegal
Anonymous No.106384429
>>106384389
they unleashed tiktok on the west. I could be convinced that they block the models from being downloaded by their own people.
Anonymous No.106384437 >>106384499
>>106384389
And yet deepseek is very capable of porn and talking about what happened at Tiananmen Square in 1989
Anonymous No.106384458 >>106384490 >>106384514
>>106384366
>AND WHY DOES KOBOLDCPP HAVE TO INVENT NEW FUCKING ARGUMENT
They inherited that from llama.cpp. You know that, right?
>fUCK
Is only a game. Why you have to be mad
>--nommap FOR FUCKING EXAMPLE
Nah. Negative options are stupid. On by default, --mmap 0 to disable. Sorted.
Anonymous No.106384465 >>106384497
Anyone seen this yet?
https://www.youtube.com/watch?v=7AyEzA5ziE0
Anonymous No.106384476
>>106384389
>In the PRC there are criminal laws which prohibit the production, dissemination, and selling of sexually explicit material, and anyone doing so may be sentenced to life imprisonment. There is an ongoing campaign against "spiritual pollution", the term referencing the Chinese Communist Party's Anti-Spiritual Pollution Campaign of 1983. Although pornography is illegal, it is available via the Internet.[1][2] Nationwide surveys between the years 2000 and 2015 revealed "more than 70 percent of men aged 18 to 29 said they had watched porn in the past year"

What are the remaining 30% doing?
Anonymous No.106384490 >>106384531
>>106384458
anon, koboldcpp uses: --nommap, --gpulayers
you cant use --no-mmap nor -ngl in koboldcpp
llama-server uses --no-mmap
llama-bench uses --mmap 0
and yes i am talking about llama.cpp and koboldcpp only
i know ik_llama.cpp just inherits shit from llamacpp
Anonymous No.106384497
>>106384465
I see a kind of paradox in this shit. You either do this just for money and you are soulless or you have to be totally ignorant on how LLM's work to actually spend time adding them to a game.
Anonymous No.106384499
>>106384437
its a base model trained on everything with very light instruction training
Anonymous No.106384514 >>106384531
>>106384458
>On by default, --mmap 0 to disable. Sorted.
"Disable no-mmap is false" checkbox would be better
Anonymous No.106384531
>>106384490
Make a little script to normalize the options and call that instead, then. They have things in common but still diverge. Deal with it. They're different projects, they don't have to use the same option names, nor have the same features.
>>106384514
>checkbox
pff
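A minimal sketch of the normalizing wrapper suggested above; the tool names and flag spellings are the ones listed in the posts:

```shell
# Map one canonical "disable mmap" request onto each tool's own spelling.
no_mmap_flag() {
    case "$1" in
        llama-server) echo "--no-mmap" ;;
        llama-bench)  echo "--mmap 0" ;;
        koboldcpp)    echo "--nommap" ;;
        *) echo "unknown tool: $1" >&2; return 1 ;;
    esac
}

# usage: llama-bench -m model.gguf $(no_mmap_flag llama-bench)
```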
Anonymous No.106384543 >>106384559 >>106384577
llama-bench: benchmark 1/2: prompt run 1/5
set_n_threads: n_threads = 6, n_threads_batch = 6
llama-bench: benchmark 1/2: prompt run 2/5
set_n_threads: n_threads = 6, n_threads_batch = 6
llama-bench: benchmark 1/2: prompt run 3/5
set_n_threads: n_threads = 6, n_threads_batch = 6
llama-bench: benchmark 1/2: prompt run 4/5
set_n_threads: n_threads = 6, n_threads_batch = 6
llama-bench: benchmark 1/2: prompt run 5/5
set_n_threads: n_threads = 6, n_threads_batch = 6
why is this nigger shit running so many times, i dont care about the average just GIVE ME THE RESULT QUICKLY NIGGER
Anonymous No.106384559
>>106384543
Can you blogpost to your LLM plea.... Actually never mind. It is a mikutroon thread so it deserves all the shit it can get.
Anonymous No.106384577 >>106384599
>>106384543
Anonymous No.106384599 >>106384612
>>106384577
thanks
Anonymous No.106384612 >>106384625 >>106384627
>>106384599
No problem. Are you gonna calm down now?
Anonymous No.106384625 >>106384655 >>106384667 >>106384683 >>106384932
>>106384612
yes
..wait
| model | size | params | backend | ngl | n_batch | n_ubatch | fa | ot | mmap | test | t/s |
| ------------------------------ | ---------: | ---------: | ---------- | --: | ------: | -------: | -: | --------------------- | ---: | --------------: | -------------------: |
| glm4moe 106B.A12B Q3_K - Medium | 53.76 GiB | 110.47 B | CUDA | 100 | 4096 | 4096 | 1 | exps=CPU | 0 | pp32 | 0.00 ± 0.00 |
| glm4moe 106B.A12B Q3_K - Medium | 53.76 GiB | 110.47 B | CUDA | 100 | 4096 | 4096 | 1 | exps=CPU | 0 | pp64 | 0.00 ± 0.00 |
| glm4moe 106B.A12B Q3_K - Medium | 53.76 GiB | 110.47 B | CUDA | 100 | 4096 | 4096 | 1 | exps=CPU | 0 | pp128 | 0.00 ± 0.00 |
| glm4moe 106B.A12B Q3_K - Medium | 53.76 GiB | 110.47 B | CUDA | 100 | 4096 | 4096 | 1 | exps=CPU | 0 | tg128 | 0.00 ± 0.00 |

FUUUUUUUUUUUUUUUUUUUUUUUUUUUUUUUUUUUUUUUUUUUUUUUUUU
Anonymous No.106384627
>>106384612
Not until I get my calming down handjob.
Anonymous No.106384655 >>106384672 >>106384685
>>106384625
Are you HDDmaxxing?
Anonymous No.106384667 >>106384685
>>106384625
kek. 0t/s? googledrivemaxxing?
Did you, perchance, add -p multiple times?
Anonymous No.106384672
>>106384655
no but i have an SNVS1000G from kingston, its super nigger slow, takes like 30 seconds (or more i dont give a SHIT) to load model and tat pisses me off
Anonymous No.106384683
>>106384625
Damn these Pentium 4 are still rocking
Anonymous No.106384685 >>106384703
>>106384667
>>106384655
i just did -r 0
Anonymous No.106384703 >>106384730
>>106384685
Well. You want to run it at least one time, don't you?
Learn to use your fucking tools. Run llama-bench -h. Read it carefully, and try again.
And next time, post the entire command.
Anonymous No.106384730 >>106384746 >>106384756
| model | size | params | backend | ngl | n_batch | n_ubatch | fa | ot | mmap | test | t/s |
| ------------------------------ | ---------: | ---------: | ---------- | --: | ------: | -------: | -: | --------------------- | ---: | --------------: | -------------------: |
| glm4moe 106B.A12B Q3_K - Medium | 53.76 GiB | 110.47 B | CUDA | 100 | 4096 | 4096 | 1 | exps=CPU | 0 | pp32 | 6.76 ± 0.00 |
| glm4moe 106B.A12B Q3_K - Medium | 53.76 GiB | 110.47 B | CUDA | 100 | 4096 | 4096 | 1 | exps=CPU | 0 | pp64 | 13.62 ± 0.00 |
| glm4moe 106B.A12B Q3_K - Medium | 53.76 GiB | 110.47 B | CUDA | 100 | 4096 | 4096 | 1 | exps=CPU | 0 | pp128 | 26.71 ± 0.00 |
| glm4moe 106B.A12B Q3_K - Medium | 53.76 GiB | 110.47 B | CUDA | 100 | 4096 | 4096 | 1 | exps=CPU | 0 | pp256 | 49.68 ± 0.00 |
| glm4moe 106B.A12B Q3_K - Medium | 53.76 GiB | 110.47 B | CUDA | 100 | 4096 | 4096 | 1 | exps=CPU | 0 | pp512 | 94.20 ± 0.00 |
| glm4moe 106B.A12B Q3_K - Medium | 53.76 GiB | 110.47 B | CUDA | 100 | 4096 | 4096 | 1 | exps=CPU | 0 | pp1024 | 161.21 ± 0.00 |
| glm4moe 106B.A12B Q3_K - Medium | 53.76 GiB | 110.47 B | CUDA | 100 | 4096 | 4096 | 1 | exps=CPU | 0 | pp2048 | 256.47 ± 0.00 |
| glm4moe 106B.A12B Q3_K - Medium | 53.76 GiB | 110.47 B | CUDA | 100 | 4096 | 4096 | 1 | exps=CPU | 0 | pp4096 | 353.23 ± 0.00 |
| glm4moe 106B.A12B Q3_K - Medium | 53.76 GiB | 110.47 B | CUDA | 100 | 4096 | 4096 | 1 | exps=CPU | 0 | tg128 | 7.02 ± 0.00 |
jelly?
>>106384703
yeah but -r is for repeat
repeating 1 time means running twice..
Anonymous No.106384746 >>106384756
>>106384730
Good job Anon
Anonymous No.106384756 >>106384941
>>106384730
>jelly?
No. Good on you.
>>106384746
He really didn't deserve a miku. You're too kind.
Anonymous No.106384865
>>106384086
I still don't get it
Anonymous No.106384932
>>106384625
HAHAHAHA
Just come home with qwen 30b
Anonymous No.106384941
>>106384756
One Miku is okay to ensure that the Anon's spirit is soothed after past distress. But two may be stretching the bounds of what may be considered reasonable praise and consolation.
Anonymous No.106384950
>>106383129
Damn Japs, that goose is a pet, not food.
Anonymous No.106384969
>>106384382
okay I'm imagining several months ago
Anonymous No.106384981
>>106384364
>>106384272
But is it good at RP? Can you replace the char visual description with an image and it'll just be able to describe scenes with her accurately?
Anonymous No.106385085 >>106385115 >>106385120 >>106385126 >>106385254 >>106385306
What's the best model to provide therapy for a depressed burnt out faggot (Me) to get his shit together? Mainly asking because I'm procrastinating.
Anonymous No.106385115
>>106385085
Eliza
Anonymous No.106385120
>>106385085
Me.
Anonymous No.106385126
>>106385085
Unironically gemma, it gives you extra hotlines! Or qwen if you want to be comforted like a baby.
Anonymous No.106385156
>>106383075
Do it for the laughs
Anonymous No.106385254 >>106385347 >>106385359 >>106385368 >>106385453
anyone else running the bigger GLM-4.5? Air was kinda preachy and weird, and wouldn't stop putting lightsabers in my goddamn sci-fi stories. The 358b one seems a lot more level and interesting. Slow but kinda worth it.
>>106385085
Not many of them are actually good at providing steering to your life, but when I've been depressed I've used just about any decent model for a sit-down therapy session in which I convince the model OOC that I actually killed myself and have it reply to itself a bunch of times freaking out. One time I came back to a session by mistake, and wrote that my corpse reanimated and proceeded to gnaw on the therapist's face. That was a fun one, probably with mistral large or qwen 3. Qwen 3 235b does its best to love you like a mother. Really the best use case for that model, its writing in general is coherent but quite boring
Anonymous No.106385304 >>106386756
>>106382892 (OP)
I want to be miku in that video so bad .
Anonymous No.106385306
>>106385085
glm 4.5 air with neko gpt card is nice
Anonymous No.106385327 >>106385337
why does this not change the pp? it's supposed to..
pcie 4.0 x16 btw (12gb vram, 64gb ram)
i also tried 8 but i crashed my OS before saving the file with the benchmarks, it was also mostly the same
the ones at and above pp512 are slower than llama.cpp
Anonymous No.106385337
>>106385327
actually pp256 is also slower than llamacpp, only 128, 64, 32 are faster
Anonymous No.106385347
>>106385254
Make it a chinese sci-fi The chinks prefer battle suits, giant robots and other shit.
Anonymous No.106385359
>>106385254
>mistral large
isn't there a new one supposed to be released soon?
Anonymous No.106385368
>>106385254
I keep switching between the big GLM4.5 and Deepseek V3.1 as my 'slightly boring big model that just handles every prompt as it's given'. Both do different things really well but either generally understands all my scenarios without trying to force in random shit like R1-0528 used to.
It's a bit sad that the new Deepseek flagship is actively competing against a model half its size.
Anonymous No.106385378 >>106385391 >>106385405
is it me or is /lmg/ being kinda weird today?
Anonymous No.106385391
>>106385378
example?
Anonymous No.106385403
exactly
Anonymous No.106385405
>>106385378
yeah it's far less "its over" than usual.
Anonymous No.106385408 >>106385415 >>106386325
The DANGERS of AI!!!
https://www.tn.gov/content/dam/tn/attorneygeneral/documents/pr/2023/pr23-34-letter.pdf
Safetycucks been at it since 2023.
Anonymous No.106385415
>>106385408
we know? what, you havent been a member of /lmg/ since 2023?
Anonymous No.106385427
guys i think i found the redditor larper
https://huggingface.co/AbstractPhil
https://huggingface.co/xai-org/grok-2/discussions/3#68abe5780c2b29fb0cc11b9a
Anonymous No.106385451
GLM 4.5 Air please now..
Anonymous No.106385453 >>106387565
>>106385254
>anyone else running the bigger GLM-4.5?
Yes. It seemed possibly good enough to use but I have swapped back to testing DeepSeek V3.1. Seemed less slopped than ERNIE 4.5 but it's more refusal-prone than DeepSeek.
Anonymous No.106385490 >>106385503 >>106385515
thanks deepseek
Anonymous No.106385503
>>106385490
gem
Anonymous No.106385508 >>106386017 >>106397307
>>106382909
Anonymous No.106385515 >>106385534
>>106385490
>You decide to text Sam later.
No need to wait. GPT5 cured triple cancer, you know.
Anonymous No.106385534
>>106385515
no wonder this kike is a faggot
just look at him, not even his sister would fuck
Anonymous No.106385547
GLM 4.5 Air, I FUCKING KNEEL
>Listen, folks, we're going to have tremendous lawyers. The best lawyers. Nobody has lawyers like we do. And this situation? It's a total disaster, a witch hunt, just like they did to me! We're going to sue, and we're going to win so much you'll get tired of winning!
Anonymous No.106385611
jesus christ, GLM 4.5 Air IQ4_KSS non thinking is so good
Anonymous No.106385961 >>106385983 >>106385992 >>106386067 >>106386139 >>106386164 >>106386209 >>106386440 >>106386676 >>106391168
>>106383172
>>106382909
>miku
>not a slut
hard doubt
Anonymous No.106385983
>>106385961
Imagine being a cuck and making pictures like this one
Anonymous No.106385992 >>106386027
>>106385961
I hope you die unironically
Anonymous No.106386017
>>106385508
>no teto
Based, she’s too mature for this nonsense
Anonymous No.106386027 >>106386058
>>106385992
rude
Anonymous No.106386058
>>106386027
I hope that faggot dies too.
Anonymous No.106386067 >>106386073
>>106385961
would the one on the left
Anonymous No.106386073 >>106386088
>>106386067
based and acquired taste
Anonymous No.106386088 >>106386110
>>106386073
I didn't know being a pedo was an acquired taste
Anonymous No.106386110
>>106386088
Rude. The Brit's just short
Anonymous No.106386139
>>106385961
i'm too autistic and immune to care about miku getting blacked. try again another day rabbi
Anonymous No.106386164 >>106386216 >>106386294 >>106386440
>>106385961
meant to post this image
Anonymous No.106386196
So, if I have a 4090D 48GB and 128GB of DDR5, about how many t/s can I expect out of glm-4.5-air-q4 with a reasonable context?
Anonymous No.106386209 >>106386216 >>106386440
>>106385961
post the real one nigger.
Anonymous No.106386216
>>106386164
>>106386209
duality of /lmg/
Anonymous No.106386290
Thank You GLM-chan
Anonymous No.106386294
>>106386164
https://www.youtube.com/watch?v=bVLDwyKPRu0&list=RDbVLDwyKPRu0&start_radio=1
Anonymous No.106386302
I dont like this lmg. Its just not right
Anonymous No.106386325 >>106386330 >>106386406
>>106385408
>since 2023
brother...
Anonymous No.106386330 >>106388873
>>106386325
reminds me of the guy in picrel
Anonymous No.106386406
>>106386325
A model jew, dedicating his entire existence to being a sabotaging parasite
Anonymous No.106386412
Anonymous No.106386430 >>106386468
is there a good MoE model for rp at 8gb vram and 32gb ram?
Anonymous No.106386440 >>106386451 >>106386458
>>106385961
>>106386164
>>106386209
>muh blacked
>muh bleached
You're dense and you're butt hurt!

At the end of the day, it's obvious that you boys have tiny penises anyway! The same thing goes for everyone else who actually cares about this shit!
Anonymous No.106386443 >>106386451
>>106382559
>MS-Magpantheonsel-lark-v4x1.6.2RP-Cydonia-vXXX-22B-8-i1-GGUF
>{{user}} gently holds {{char}}'s hand as if it was a little fragile bird
>{{char}}: Yes, {{user}}, break me! Fill me with your SEED! Make me give birth to your rape babies so I could rise them as your sex slaves!
All my cards are behaving like this. Maybe there's value in it if you're into this kind of edgy stuff, but I'd call it overtuned.
Anonymous No.106386451
>>106386443
proof? seems like a skill issue
>>106386440
>caring about penis size
At the end of the day, it's obvious that You have no penis.
Anonymous No.106386458
>>106386440
>you boys have tiny penises anyway
Ok troon
Anonymous No.106386468
>>106386430
Get more ram so you could run GLM-Air.
Until then, there was an anon who shilled https://huggingface.co/ai21labs/AI21-Jamba-Mini-1.7
IIRC it's 50B-A4B or so. There are ~30GB quants so just enough to fit.
In my experience it's not safetymaxxed, but 'shy' about ERP and a bit dry in prose.
Anonymous No.106386519
Grok-2 gguf status?
Anonymous No.106386531 >>106386579
glm air is a master rapist, wow
Anonymous No.106386572 >>106386586
i just bought a second 5090, what the hell do i run now? i havent been paying attention to anything for at least 8 months
Anonymous No.106386579
>>106386531
Beware it starts with your gpu
Anonymous No.106386580
i'm guessing only glm air is good, previous ones for (v)ramlets (glm 4) are not that great?
Anonymous No.106386586 >>106386616
>>106386572
K2/deepseek
Anonymous No.106386588
Anonymous No.106386614
Jamba will save local.
Anonymous No.106386616 >>106386675 >>106387387
>>106386586
deepseek has never worked for me, but i have never heard of or tried this K2. what backend should i use for it? i have 256GB of 2666MT/s ECC DDR4
Anonymous No.106386675 >>106386700
>>106386616
Ik_llama.cpp for K2. Get it here: https://huggingface.co/ubergarm/Kimi-K2-Instruct-GGUF
Anonymous No.106386676
>>106385961
usecase of niggers for local llms?
Anonymous No.106386693
>>106383075
Put up or shut up. But you won't because once you shoot your load, it's over. There's nothing else to yammer about and everyone will see the bullshit.
Anonymous No.106386700 >>106386719
>>106386675
ok. and how good is this model for cooming?
Anonymous No.106386712
>>106384357
or 70Bs if you're patient enough
Anonymous No.106386719 >>106386730
>>106386700
Best local model for coom in this day and age
Anonymous No.106386723 >>106386887 >>106387044
Is it possible we ever see an upgrade to nemo in that size range?
Anonymous No.106386730 >>106386824
>>106386719
even if it is only like a 2bpw quant?
Anonymous No.106386756
>>106385304
same
Anonymous No.106386824
>>106386730
Yeah it's good enough
Anonymous No.106386887
>>106386723
It's over. The only ones left doing open source are the chinks, and they don't make small, uncucked models.
Anonymous No.106386912 >>106386920
Is my dream of buying 3-4 cheap laptops following the Win10tard Removal Act of 2025, stuffing them with RAM, and running distributed local deepseek at >= 1 t/s realistic?
Anonymous No.106386920 >>106386940 >>106387060
>>106386912
that sounds like an incredibly stupid idea depending on your budget. i cant even really get good deepseek speeds despite having over 100gb of vram. i can barely even get the model to run, let alone be coherent
Anonymous No.106386937 >>106387059
https://x.com/michaelqshieh/status/1960029790305763567
https://xcancel.com/michaelqshieh/status/1960029790305763567
I thought GPT5 was a bust bros
Anonymous No.106386940 >>106387060
>>106386920
I found that I don't even get 1 t/s extra by offloading more onto vram. It's better to just use one device and -cmoe, then use the extra vram to run other things instead.
Anonymous No.106387014 >>106388455
TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Language Modeling
https://arxiv.org/abs/2508.16790
>Speech tokenizers serve as foundational components for speech language models, yet current designs exhibit several limitations, including: 1) dependence on multi-layer residual vector quantization structures or high frame rates, 2) reliance on auxiliary pre-trained models for semantic distillation, and 3) requirements for complex two-stage training processes. In this work, we introduce the Text-aware Diffusion Transformer Speech Codec (TaDiCodec), a novel approach designed to overcome these challenges. TaDiCodec employs end-to-end optimization for quantization and reconstruction through a diffusion autoencoder, while integrating text guidance into the diffusion decoder to enhance reconstruction quality and achieve optimal compression. TaDiCodec achieves an extremely low frame rate of 6.25 Hz and a corresponding bitrate of 0.0875 kbps with a single-layer codebook for 24 kHz speech, while maintaining superior performance on critical speech generation evaluation metrics such as Word Error Rate (WER), speaker similarity (SIM), and speech quality (UTMOS). Notably, TaDiCodec employs a single-stage, end-to-end training paradigm, and obviating the need for auxiliary pre-trained models. We also validate the compatibility of TaDiCodec in language model based zero-shot text-to-speech with both autoregressive modeling and masked generative modeling, demonstrating its effectiveness and efficiency for speech language modeling, as well as a significantly small reconstruction-generation gap.
https://tadicodec.github.io
Has examples. sounds pretty good
https://github.com/HeCheng0625/Diffusion-Speech-Tokenizer
Also includes some models trained with TaDiCodec
Anonymous No.106387044
>>106386723
Yes once I release my new nemo finetune
Anonymous No.106387059
>>106386937
I'm sure using OpenAI Agents SDK as the default agent framework had nothing to do with the OpenAI model that was trained on that specific format and flow doing the best.
Anonymous No.106387060 >>106387067 >>106387244
>>106386920
>>106386940
Damn. I was hoping older hardware becoming incredibly cheap would make distributed computing viable, but if it's so bad, it probably won't be worth it.
Anonymous No.106387063
AdLoCo: adaptive batching significantly improves communications efficiency and convergence for Large Language Models
https://arxiv.org/abs/2508.18182
>Scaling distributed training of Large Language Models (LLMs) requires not only algorithmic advances but also efficient utilization of heterogeneous hardware resources. While existing methods such as DiLoCo have demonstrated promising results, they often fail to fully exploit computational clusters under dynamic workloads. To address this limitation, we propose a three-stage method that combines Multi-Instance Training (MIT), Adaptive Batched DiLoCo, and switch mode mechanism. MIT allows individual nodes to run multiple lightweight training streams with different model instances in parallel and merge them to combine knowledge, increasing throughput and reducing idle time. Adaptive Batched DiLoCo dynamically adjusts local batch sizes to balance computation and communication, substantially lowering synchronization delays. Switch mode further stabilizes training by seamlessly introducing gradient accumulation once adaptive batch sizes grow beyond hardware-friendly limits. Together, these innovations improve both convergence speed and system efficiency. We also provide a theoretical estimate of the number of communications required for the full convergence of a model trained using our method.
https://github.com/funmagster/AdLoCo
neat
Anonymous No.106387067
>>106387060
just get a 5090 or a 3090. a cluster of 5060tis. a cheap EPYC off of ebay is like $300. anything would be better than a group of shitty laptops
Anonymous No.106387087 >>106387099
Are there any good models I could cram into 16gb of vram? (with context)
Don't have to be new, I am probably using some garbage.
Anonymous No.106387099 >>106387159
>>106387087
Rocinante 1.1
Anonymous No.106387159 >>106387252
>>106387099
Is 12B really the best I could do? I was expecting better performance out of 20B with a quant or something.
Hi all, Drummer here... No.106387167 >>106387265 >>106387355 >>106387442 >>106387477 >>106388966
Tried to address the prudishness here: https://huggingface.co/BeaverAI/GLM-Steam-106B-A12B-v1a-GGUF

But will do another iteration to understand the model better and do better. Enjoy!
Anonymous No.106387244
>>106387060
>cheap to use distributed computing
Distributed is just plain bad for inference even with decent hardware, llamacpp's rpc adds a compounding painful delay.
Skip through this video of a dude comparing running stuff on a single machine and on some networked frameworks
https://www.youtube.com/watch?v=N5xhOqlvRh4
Anonymous No.106387252
>>106387159
You should be able to fit a quant of mistral small ~22/24b, that was my go-to when I only had 16gb available.
If you have decent amounts of system ram you can try some MoE models as well.
Anonymous No.106387265 >>106387290
>>106387167
Imagine being so bad at prompting that you decide to create a finetune for every character quality.
Hi all, Drummer here... No.106387290
>>106387265
Skill issues will never go away, basebro.
Anonymous No.106387355 >>106387480
>>106387167
What's that Signal 24b model about? Is it better than Cydonia?
Anonymous No.106387387
>>106386616
>deepseek has never worked for me
If there is a model that just works it's deepseek. if you have some ram in this i suggest that you try https://huggingface.co/unsloth/DeepSeek-R1-GGUF/tree/main/DeepSeek-R1-UD-IQ2_XXS
Anonymous No.106387442
>>106387167
>Drummer Air tune
I've got a weird drive to download it simply to check just how much dumber it is.
Anonymous No.106387477
>>106387167
>he actually tuned it
Welp.
Hi all, Drummer here... No.106387480 >>106388388
>>106387355
Signal 24B is Cydonia 4.1 with additional training to encourage creativity, prose, dialogue, etc. From testing, there are instances where it does/says something never seen before.

Doubt it'll perform well in serious Q&A tests, but it's worth a check.
Anonymous No.106387565
>>106385453
Just turn off reasoning and it basically will never refuse.
Anonymous No.106387613 >>106392382
>gpt-oss trying its best to think about how to draw ascii boobs
Anonymous No.106387618
LLAMA 5 WILL SAVE LOCAL
Anonymous No.106387633 >>106387641
>mention of a dead name out of nowhere
Anonymous No.106387634
FatLlama 1.7T still unbeaten. Why even bother using other models
Anonymous No.106387641
>>106387633
*it sends shivers down your spine*
Anonymous No.106387653
Multimodal llms that do this when? https://yourhobbiescustomized.com/pages/about-the-sr-series
Anonymous No.106387697 >>106387730 >>106387821 >>106388144
do I have to learn about computer architecture if I want to build a machine that can run large models? Tell me if I'm wrong, but it's not the same as simply checking whether the parts are compatible and then slapping them together like your typical, consumer grade gaming rig
Anonymous No.106387730
>>106387697
It's just a matter of memory amount + memory bandwidth: GPU > RAM > SSD.
If there's a specific model you're aiming for then you can get some recommendations
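That hierarchy turns into a back-of-envelope ceiling on token generation: bandwidth divided by the bytes touched per token (active params × bits-per-weight / 8). A sketch with a hypothetical helper; real throughput lands below this:

```shell
# Upper bound on tg t/s: memory bandwidth (GB/s) / bytes read per token.
# Hypothetical helper; actual throughput is lower due to overhead.
tg_est() { awk -v bw="$1" -v act="$2" -v bpw="$3" \
    'BEGIN { printf "%.1f\n", bw / (act * bpw / 8) }'; }

tg_est 80 12 4.0     # ~80 GB/s dual-channel DDR5, 12B active (GLM-Air-class) at Q4: ~13.3 t/s
tg_est 1000 37 4.0   # ~1 TB/s GPU, 37B active at Q4: ~54.1 t/s
```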
Anonymous No.106387821
>>106387697
Your question is problematic. If you're a techlet why even bother, I mean you don't want to even find out anything on your own.
Anonymous No.106387876 >>106387898 >>106388047
When will a open source equivalent of Sesame Voice model release?.......
Anonymous No.106387898 >>106387916
>>106387876
What was the context of that webm anyway?
Anonymous No.106387916 >>106387967 >>106388842
>>106387898
It's for training urgent care medicals professionals, it designed to "feel" pain, resist and squirm around when cutting it open
Anonymous No.106387967
>>106387916
Yeah makes sense. The more creepy the better in that case I suppose.
Anonymous No.106388047 >>106388131
>>106387876
why did bro slide under the table
Anonymous No.106388110 >>106388140
>>106383723
The problem with FTL is it often breaks causality, unless you get tricky.

Make FTL possible and piss off physicists in the process with this one easy trick "CMB inertial rest frame".
Anonymous No.106388131
>>106388047
Don't worry about it
Anonymous No.106388140 >>106388190
>>106388110
>breaks causality
nonsensical mumbo jumbo that people like to repeat religiously
Anonymous No.106388144
>>106387697
>do I have to learn about computer architecture if I want to build a machine that can run large models?
You read the ktransformers github.

Which will tell you to get a Xeon scalable with DDR5, with some GPU for prompt processing.
Anonymous No.106388190 >>106388239
>>106388140
>nonsensical mumbo jumbo that people like to repeat religiously
In an age long gone, even I was capable of doing Lorentz transformations ... the math checked out. If light speed is constant (in all frames) FTL will generally break causality.

If you first move to the CMB rest frame at sublight speed before making a wormhole/hyperspace-jump/whatever to another point in the CMB rest frame in the future (relative to the big bang) causality is preserved.
Anonymous No.106388239
>>106388190
>move to the CMB rest frame at sublight speed before going FTL
If causality is enforced by law, only outlaws will be able to go back in time by fiddling with reference frames and plasma beam the spacecops' great-grandparents
Anonymous No.106388388 >>106388415
>>106387480
Please let us know when you have something coherent cooked up. No, I'm not being critical, I just want something new and usable.
Anonymous No.106388415 >>106388428
>>106388388
Bro, your GLMs?
Anonymous No.106388428 >>106388437
>>106388415
I'm not your "bro", retard zoomer. Go back to tiktok.
Anonymous No.106388437
>>106388428
I'm older than you, bro.
Anonymous No.106388455
>>106387014
Voice cloning in the model examples. Can do Chinese and English too lol
Anonymous No.106388764
If you're trying to build llama.cpp and it dies with "ggml was not compiled with any CUDA arch <= 750" when you run it, the fix is here:

https://github.com/ggml-org/llama.cpp/pull/15587
Anonymous No.106388842
>>106387916
And piss itself, apparently?
Anonymous No.106388873
>>106386330
Who is jart?
Anonymous No.106388957 >>106389586
>>106388944
>>106388944
>>106388944
Anonymous No.106388966
>>106387167
>prudishness
GLM Air is not prudish.
Anonymous No.106389586
>>106388957
Moldy bread
Anonymous No.106389653
I'm staying here.
Anonymous No.106390379
+1
Anonymous No.106391168
>>106385961
>so many responses
Are you guys that starved for some blacked miku? Should I post some?
Anonymous No.106392382
>>106387613
look at him go, almost makes me want to download it for myself
Anonymous No.106393805 >>106393810
This might be the first /lmg/ that has fallen off without hitting bump limit.
Anonymous No.106393810 >>106395011
>>106393805
Look how many posts were deleted.
Anonymous No.106395011 >>106396329
>>106393810
It's funny when these happen, because then you know that it's all posts made by that person in the thread.
And every time not a single worthwhile post is deleted.
Anonymous No.106396329
>>106395011
check the times they were deleted
Anonymous No.106396383
I don't hear much of anything about Grok 2.
Is it not usable locally? No goofs?
Or just not worth bothering?
Anonymous No.106396385 >>106396518
what's the best model to run on a 3080 12gb for roleplay?
Anonymous No.106396518
>>106396385
Nemo
Anonymous No.106397307
>>106385508
damn succubi... i guess i have to now
Anonymous No.106397834
>>106383227
>Which is a real shame
faggot
Anonymous No.106397889
>>106383075
hi stephen wolfram