Thread 716282706 - /v/ [Archived: 45 hours ago]

Anonymous
7/24/2025, 6:05:45 AM No.716282706
0_ti7fapm8tagrfbtz
0_ti7fapm8tagrfbtz
md5: 77965370fd971a9a9283cd4a7204eb09๐Ÿ”
>Needs top of the line power hog GPU with infinite VRAM to be able to play fucking TEXT adventure locally
Replies: >>716282908 >>716286298 >>716286367 >>716290967 >>716292069 >>716294437 >>716295048 >>716295163 >>716300321 >>716303290 >>716307074
Anonymous
7/24/2025, 6:07:11 AM No.716282782
@CollectiveShout
Ummmm I think you guys should look into this
Replies: >>716291865
Anonymous
7/24/2025, 6:09:25 AM No.716282908
1731025855856062
1731025855856062
md5: 2889639b2d47dc1e7df144dff4c41aa3๐Ÿ”
>>716282706 (OP)
Who are you kidding? LLMs are not smart enough to run a competent text adventure. You're chatbotting with pretend girls. Nothing wrong with that per se but let's not kid ourselves by calling it a text adventure
Replies: >>716288479 >>716296154
Anonymous
7/24/2025, 6:19:44 AM No.716283549
So what good NSFW chat bot sites are around now?
Yodayo seems a bit shit though everyone was shilling it last thread.
Looking for /d/ & /tg/ ERP so I can pretend like i'm in a mid-2000s MMO again
Anonymous
7/24/2025, 7:01:43 AM No.716286261
>try Stheno Q8_0
>ok i guess
>see Rocinante shilled, 12B
>it doesn't seem much better than stheno but is slightly slower
>fuck it i have 32gb
>try Mistral Small 24B Q8_0
>CPU grinds to 100% on all cores and struggles with each word coming through
huh.

Also I still don't know what the difference between CFG and Author's Note is, they seem to do the same thing.
Replies: >>716287634 >>716290027
Anonymous
7/24/2025, 7:02:15 AM No.716286298
>>716282706 (OP)
Deepseek is fine and takes 10 minutes to set up. You are just retarded.
Replies: >>716292096 >>716293164 >>716294681
Anonymous
7/24/2025, 7:03:24 AM No.716286367
>>716282706 (OP)
I'm happy the good models are gatekept well enough that retards on /v/ can't get to them.
Replies: >>716294681
Anonymous
7/24/2025, 7:27:52 AM No.716287634
>>716286261
authors note are just notes from the author
high CFG means the AI is more likely to do what you want, but less creative, where as low CFG is high creativity but less likely to follow prompt
Replies: >>716288256 >>716288302
Anonymous
7/24/2025, 7:39:11 AM No.716288256
>>716287634
Both are effectively telling the AI what to do, at least from what I can tell.

If I put "speak with a pirate accent" in either one, the result is the same.
Replies: >>716288302
Anonymous
7/24/2025, 7:40:02 AM No.716288302
>>716287634
>authors note are just notes from the author
He's talking about sillytavern itself not the note in a card.
>>716288256
https://docs.sillytavern.app/usage/core-concepts/authors-note/
Anonymous
7/24/2025, 7:43:08 AM No.716288479
>>716282908
t. imaginationlet
Replies: >>716289058
Anonymous
7/24/2025, 7:45:37 AM No.716288603
1753130200601058
1753130200601058
md5: cdba1069431625448ffbbe516c2662f2๐Ÿ”
Got any shota character cards?
Replies: >>716296076
Anonymous
7/24/2025, 7:53:56 AM No.716289058
1737481068889117
1737481068889117
md5: bbf385a7227636de2f19d10d2d2e6b59๐Ÿ”
>>716288479
>"it's smart enough to create and run a text adventure!"
>look inside
>"dude just come up with the adventure part yourself"
Replies: >>716290802
Anonymous
7/24/2025, 8:12:56 AM No.716290027
>>716286261
You can go down to Q6 if Q8 is too slow. It's only marginally worse.
Anonymous
7/24/2025, 8:19:48 AM No.716290361
doesn't matter how expensive your gpu is the good stuff is proprietary
Anonymous
7/24/2025, 8:28:07 AM No.716290802
>>716289058
yeah you have to be the game master since LLM aren't designed with foresight but to generate the next most probable token with some variance...
Replies: >>716291165
Anonymous
7/24/2025, 8:31:21 AM No.716290967
>>716282706 (OP)
Has anyone tried running this on 4GB VRAM? I wouldโ€™ve done it myself, but I donโ€™t want to install 17 Python libs just to be disappointed.
Replies: >>716291350 >>716291446
Anonymous
7/24/2025, 8:34:58 AM No.716291165
>>716290802
I want real intelligence. Why are people spending $30/mo talking to a sentence finisher?!
Replies: >>716291268 >>716293716
Anonymous
7/24/2025, 8:36:47 AM No.716291268
>>716291165
Because they are literal NPCs who can't tell the difference between fancy auto-complete and intellect.

That's all this LLM stuff really boils down to, there is a large fraction of the population who legitimately cannot tell, because they're philosophical zombies.
Replies: >>716291446
Anonymous
7/24/2025, 8:38:17 AM No.716291350
>>716290967
How much RAM do you have? You can run a decent model if you offload layers to RAM/CPU, although that's going to be a lot slower than running entirely on GPU. Tiny models won't be worth the effort.
Replies: >>716291664 >>716292413
Anonymous
7/24/2025, 8:39:04 AM No.716291403
1736085487042848
1736085487042848
md5: ab6bf8b4fdc4e697184674e37c628af8๐Ÿ”
Why the FUCK do you insist on turning this into a 24/7 general to bring attention to someone that MUST stay hidden. Why can't you let people have nice things?
When has ANYTHING, EVER, AT ANY POINT IN HISTORY, become better with mainstream exposure?
Replies: >>716291543
Anonymous
7/24/2025, 8:40:05 AM No.716291446
>>716290967
silly tavern is just a graphical frontend, running a local LLM is the bottleneck. You can run a 7B q4m at good speed but the context window will be ridiculously small and the vocabulary very constrained (source: I ran 7B model on a RX580)
Also it doesn't rely on python and you can use LM Studio as back end which is a one click setup.

>>716291268
to me you don't sound any different than a LLM
Replies: >>716291664
Anonymous
7/24/2025, 8:41:51 AM No.716291543
>>716291403
it's just a front end to display text generated by LLM in a more roleplaying way, anon, nobody is going to take it from you.
Replies: >>716293802 >>716304068
Anonymous
7/24/2025, 8:44:20 AM No.716291664
>>716291446
>>716291350
thanks i had some success running llama and qwen so ill give it shot
Replies: >>716291964
Anonymous
7/24/2025, 8:48:15 AM No.716291865
file
file
md5: 92294aa213e61a739fae4229dfae3636๐Ÿ”
>>716282782
you joke but they are partnered with a chick who was all about "no sex robots" and "its harmful to women" or something and she has been campaigning for it since 2015
Anonymous
7/24/2025, 8:50:11 AM No.716291964
>>716291664
get a model from TheDrummer (maybe be a 8B but make sure you pick a q4m quant at most) and don't ever offload every layers to the gpu, this sounds counter intuitive but if the model is larger than your vram this will causes inference to come to a crawl, it's better to offload a portion only, like half.
Replies: >>716292235
Anonymous
7/24/2025, 8:51:45 AM No.716292038
I have the IQ of an ape but still want to talk to my waifu and I don't want to do it online.
Can someone give me a guide for retards to run this on a low end AMD card locally and what models to use and shit?
Replies: >>716292235 >>716292347 >>716292490
Anonymous
7/24/2025, 8:52:20 AM No.716292069
>>716282706 (OP)
like other anons said - it is impossible to play text adventures with llm. It is only good for erp sessions.
Replies: >>716292235 >>716292723
Anonymous
7/24/2025, 8:52:46 AM No.716292096
>>716286298
Guide? Pretty please?
Replies: >>716294681
Anonymous
7/24/2025, 8:55:09 AM No.716292235
>>716292038
sillytavern is a bit overblown if you just want to talk. Get LM Studio and >>716291964

>>716292069
it's not impossible, you just need to be the game master and guide it constantly. Right now LLM are like advanced random number generators but it's gonna get better and better.
Replies: >>716292723 >>716293151 >>716293420
Anonymous
7/24/2025, 8:57:18 AM No.716292347
>>716292038
local models are all shit. even top ones like deepseek are barely coherent and frequently goes into schizo mode. low-end AMD card further limits what you can do.
if you are really afraid some wagie is gonna read your erp chatlog then
>>>/vg/lmg
Replies: >>716292430 >>716293151 >>716293420
Anonymous
7/24/2025, 8:58:40 AM No.716292413
file
file
md5: 10a620a0db47225a486e9edd49f7caa9๐Ÿ”
>>716291350
>You can run a decent model if you offload layers to RAM/CPU, although that's going to be a lot slower than running entirely on GPU.
nta but I honestly have no idea what to even set for these settings or if its even using my GPU at all. Pretty sure i've just been going full RAM/CPU this entire time, or what 5 layers even means.
Replies: >>716292628 >>716294391
Anonymous
7/24/2025, 8:59:11 AM No.716292430
>>716292347
a good 13B model can be surprisingly coherent. And Deepseek is not a top model, stop reading /pol/ and /g/
Replies: >>716292664
Anonymous
7/24/2025, 9:00:12 AM No.716292490
>>716292038
the mistral setup is crazy https://rentry.org/lmg-lazy-getting-started-guide
this is the no-bullshit guide and its very fast
it is magic to me, insane results, especially with kobold instruct storyteller and the fact that it runs with sillytavern chars makes it perfect, I kinda feel like ai chatbots peak here
Replies: >>716293151
Anonymous
7/24/2025, 9:02:50 AM No.716292628
>>716292413
first make yourself a favor and switch to LM Studio. But if you want to stick to kobolt regardless, you shouldn't touch anything there except gpu layers, the higher number you use in gpu layers, the more of the model will be offloaded to your gpu, but if you set it too high and it overflow your vram into dynamic vram and causes your inference to be very slow. What you want to do is set it to half of the max at first, then increase it if inference is too slow. If increasing it more and more makes no difference you've got a model that's way too big for your vram in the first place.
Replies: >>716292747 >>716294587 >>716298925
Anonymous
7/24/2025, 9:03:29 AM No.716292664
>>716292430
>all the poors get free DS
>NOOOO DS IS BAD YOU NEED GEMINI FOR REAL ERP NOW
A tale as old as time. All models suck absolute shit, local models suck more shit because they're slow.
Replies: >>716292731
Anonymous
7/24/2025, 9:04:31 AM No.716292723
egt
egt
md5: a50ad43e7e30dfa59b4a0e38c68d1f35๐Ÿ”
>>716292069
>>716292235
sounds about right
it can be an assistant instead of GM
and give it an outline and some room to work with
and use a RNG machine along side that for a more fair playthrough
Anonymous
7/24/2025, 9:04:43 AM No.716292731
>>716292664
you have zero fucking clue
Anonymous
7/24/2025, 9:05:01 AM No.716292747
>>716292628
That's the thing I don't even know what "max" is. Is 16 max? It's a 16gb vram card. Is it 100, like a percentage?
Replies: >>716292929
Anonymous
7/24/2025, 9:08:14 AM No.716292929
1746056449664578
1746056449664578
md5: b68fddd9589b6a28f49753f7366c886a๐Ÿ”
>>716292747
you need to load a model file first and the number of layers available will be visible.
Replies: >>716293272
Anonymous
7/24/2025, 9:12:37 AM No.716293151
>>716292235
>>716292347
>>716292490
Fine, what if I'm willing to use cloud shit to ERP but I'm not willing to pay?
Anonymous
7/24/2025, 9:12:48 AM No.716293164
>>716286298
>local deepseek
Not everyone has a stack of mac minis
Anonymous
7/24/2025, 9:15:04 AM No.716293272
>>716292929
What... I do have one loaded but none of those numbers are appearing except the use vulkan 8/9
Replies: >>716293549
Anonymous
7/24/2025, 9:18:24 AM No.716293420
>>716292235
>>716292347
Can it run on a RX6600?
Replies: >>716293549
Anonymous
7/24/2025, 9:21:08 AM No.716293549
>>716293272
make sure your model is a .gguf

>>716293420
of course. You can run 13B models, albeit a bit slowly but for roleplaying at reading speed it's enough. Heck I used to run a RX580 with 7B models. What matters is not overflowing your vram by offloading too many layers.
Replies: >>716293604 >>716293660
Anonymous
7/24/2025, 9:22:16 AM No.716293604
>>716293549
RX580 4gb, forgot to clarify,.
Anonymous
7/24/2025, 9:23:14 AM No.716293660
file
file
md5: 00171c4c1a5dbb4b93dbcf9fdfaf7298๐Ÿ”
>>716293549
>make sure your model is a .gguf
Suppose i'm cursed
Replies: >>716293757 >>716294808
Anonymous
7/24/2025, 9:24:22 AM No.716293716
>>716291165
Check back in a few years (or decades)
Anonymous
7/24/2025, 9:24:56 AM No.716293757
>>716293660
24B at 8Q??? nigga what's your gpu even??? In any case try another model for sanity check
Replies: >>716293956 >>716294227
Anonymous
7/24/2025, 9:25:41 AM No.716293802
>>716291543
Actually the ST devs had a massive shitfit when an article came out about people using it for NSFW shit. I don't think anything negative ended up happening but there was talk of a total rebrand
Replies: >>716293938 >>716293956 >>716295212
Anonymous
7/24/2025, 9:27:50 AM No.716293938
>>716293802
that was funny
they tried to pretend like roleplay of any kind wasn't the intended use, even sfw
I suspect 90% of the people making all the noise did 10% of the code contributions and the people that actually did do all the work quietly told them to shut the fuck up or they'd split off and make a competitor, just a guess but I've seen similar things happen before
Replies: >>716294020 >>716295212
Anonymous
7/24/2025, 9:28:10 AM No.716293956
>>716293757
And seriously, give LM Studio a try,

>>716293802
that's like having a fit because people can use krita or blender to draw/animate porn... Really dumb
Anonymous
7/24/2025, 9:29:29 AM No.716294020
>>716293938
>they tried to pretend like roleplay of any kind wasn't the intended use, even sfw
what? Then what the fuck would the intended use even be?
Replies: >>716294235
Anonymous
7/24/2025, 9:33:39 AM No.716294227
>>716293757
>24B at 8Q??? nigga what's your gpu even???
I don't even know what that MEANS bro
Just a 7900 with 16gb vram. I have no idea what its even capable of because all guides assume you already know what the fuck you're doing and the one calculator i've found is an absolute enigma of "input all the data you don't know so I can tell you the answer to the question you don't know how to ask"
https://smcleod.net/vram-estimator/

Seriously for all the talk of model sizes and quants nobody ever seems to mention or bothers to detail what any of that means or how it runs.
Replies: >>716294336 >>716294587 >>716294654
Anonymous
7/24/2025, 9:33:45 AM No.716294235
Untitled
Untitled
md5: 6fef895e092845f503eb6c42b7fe0e2a๐Ÿ”
>>716294020
A personal code monkey I guess.
Replies: >>716294317 >>716294640
Anonymous
7/24/2025, 9:35:43 AM No.716294317
1730968096153942
1730968096153942
md5: 670abcfd7d72d8f5924d7b5a7b1b94af๐Ÿ”
>>716294235
A coding assistant definitely needs an anime girl avatar to be usable.
Anonymous
7/24/2025, 9:36:07 AM No.716294336
>>716294227
just download a Q6 and offload 10GB to RAM, and youโ€™ll be fine
Replies: >>716294391
Anonymous
7/24/2025, 9:37:01 AM No.716294382
This is legit too hard for me.
Can someone just tell me "go fucking here, download this thing, run this thing, set up this thing by doing this"? I'm the RX6600 guy.
Anonymous
7/24/2025, 9:37:10 AM No.716294391
>>716294336
How

I still don't know what the fuck GPU layers even are as per my initial question that ran down this entire rabbit hole of replies >>716292413
What is "10gb" in gpu layers?
Replies: >>716294587 >>716294617 >>716294654
Anonymous
7/24/2025, 9:38:16 AM No.716294437
>>716282706 (OP)
and this is why generative AI are not going to be part of mainstream video games anytime soon.
they'll vibe code with AI and use pre-rendered and pre-recorded AI images and voice lines of course, but having them work in real time to hold conversations with the player or supplant game AI will not be feasible this decade without another DLSS type compromise.
Anonymous
7/24/2025, 9:41:08 AM No.716294587
>>716294227
ok first you're going to switch to LM Studio, it tells you straight on if you're about to select a model to large and recommends you what will fit better. Next the 24B means 24 billion parameters, it's how smart the model is. Q8 is the level of compression (I believe F16 is uncompressed but don't quote me on that), Q8 is barely compressed and high quality but much bigger and slower, and Q2 is so compressed it's gonna output junk at high speed. Honestly q4_K_M (as opposed to S) tend to have the best ratio quality/speed/size but if you can still do higher then go for it.

>>716294391
I explained to you >>716292628
what layers are and for the last time, switch to LM studio.
Anonymous
7/24/2025, 9:41:42 AM No.716294617
>>716294391
Mistral Small 3 models are 41 layers total. If you want to put half of it in VRAM then do 10 or 11 GPU layers
Replies: >>716294671
Anonymous
7/24/2025, 9:42:09 AM No.716294640
1725755237752054_thumb.jpg
1725755237752054_thumb.jpg
md5: a3e50312c1271818d9176eda07b454fb๐Ÿ”
>>716294235
Anonymous
7/24/2025, 9:42:26 AM No.716294654
screenshot-huggingface.co-2025.07.24-01_36_52
screenshot-huggingface.co-2025.07.24-01_36_52
md5: 36f5b9c0f3f17afa4fec3e11d8db19e7๐Ÿ”
>>716294227
Higher quant typically means better outputs. The downside is that it becomes harder to fit one on your GPU the higher it is. Most of the time you want a size that matches your VRAM if you want it to be as fast as possible.
>>716294391
GPU layers is something you need to experiment with. Kobold can only guess but you can tweak it manually and test to see if you get faster or slower outputs by going up/down. Try going up/down a few steps at a time until you dial in something that seems reasonable. If it's still slow as shit make sure your model fits your GPU.
Replies: >>716295112
Anonymous
7/24/2025, 9:42:43 AM No.716294671
>>716294617
Oops. Meant 20 or 21
Anonymous
7/24/2025, 9:42:54 AM No.716294681
>>716292096
>>716286298
>>716286367
I would say don't spoonfeed, but I know for a fact some fag on reddit is probably spoonfeeding or some normie who wants "internet fame" is probably making a guide on it and flood the services with 100k new users,so the 10-20 anons in this thread won't make a difference that much for traffic, especially since I think a lot of us are already using it.
Anonymous
7/24/2025, 9:44:33 AM No.716294754
I usually run Q4 on 12gb of VRAM so I can save space for more context and card size, is it worth bumping up to Q6 or even Q8? Is the quality difference actually that significant?
Anyone have any logs that compare quants?
Anonymous
7/24/2025, 9:45:24 AM No.716294805
wow you fags really are useless for advice, do you really expect anyone to understand literal technobabble?
Replies: >>716294859 >>716294871
Anonymous
7/24/2025, 9:45:29 AM No.716294808
>>716293660
also that model you picked is probably sanitized to hell. Get a model from TheDrummer
Replies: >>716294997
Anonymous
7/24/2025, 9:46:14 AM No.716294853
The filter is working
Anonymous
7/24/2025, 9:46:18 AM No.716294859
>>716294805
/v/ is literally the only place that will actually give good advice. /g/ is somehow even more retarded than niche pseudo-generals on /v/.
Anonymous
7/24/2025, 9:46:30 AM No.716294871
>>716294805
nigga what part of download LM Studio is too fucking technobabble to you you fucking mentally disabled inbred? It's literally designed for tterminal retards like you.
Replies: >>716294996 >>716295007
Anonymous
7/24/2025, 9:48:53 AM No.716294996
>>716294871
You could at least give a few steps process on what to do.
So you download it and install it, then what? Does it have a setup guide? What local models to use for RP?
Replies: >>716295112
Anonymous
7/24/2025, 9:48:54 AM No.716294997
>>716294808
I already picked one up from him based on Rocinate (I think the name is) and it immediately tries to femdom me with the most retarded prose at any slight action while we're trying to finish our stealth sabotage mission in our MX-2 shared cockpit mech.
Replies: >>716295161
Anonymous
7/24/2025, 9:49:01 AM No.716295007
00264-2708861977-Abstract art, modern art, by Arthur Dove, masterpiece
>>716294871
Setting up stable diffusion was only a few clicks back when AI art started getting popular on /v/ but we still had people that couldn't figure out anything more complicated than copy paste. /v/ anons are either very knowledgeable or completely tech illiterate.
Replies: >>716295145
Anonymous
7/24/2025, 9:49:45 AM No.716295048
161951095809_thumb.jpg
161951095809_thumb.jpg
md5: 8ecd7c943e66e99da80def3f4f9d9336๐Ÿ”
>>716282706 (OP)
>falling for the local meme
Replies: >>716295707 >>716296229
Anonymous
7/24/2025, 9:51:15 AM No.716295112
>>716294996
I'm the anon he's trying to help and if you're too retarded to download a different program after somehow getting kobold set up like me, theres no hope for you. I appreciate the spoonfeeding like the big fucking baby I am OPENING WIDE FOR THE AIRPLANE.

>>716294654
I see, so Q6 would be a more reasonable step down. I used Stheno at Q8 and it ran just fine so I wasn't expecting Mistral to jam to a halt.
Does file size have anything to do with ram/vram limits or is that just kind of arbitrary?
Replies: >>716295336 >>716295370
Anonymous
7/24/2025, 9:51:38 AM No.716295132
So is anyone finally gonna share a Chorbo JB for /ss/ or are you gonna be gatekeeping niggers like /aicg/?
>but they will le patch it
They can see your JB whenever you send a message, if (You) are using it they already have it and will patch it soon. NIGGER.
Anonymous
7/24/2025, 9:51:53 AM No.716295145
>>716295007
it's baffling how some of these people even found their way on 4chan. And the worst part is they act like victims surrounded by idiots who refuse to help them.
Replies: >>716295187
Anonymous
7/24/2025, 9:52:17 AM No.716295161
>>716294997
Drummer's models have like 3 personalities and they're all horny
Replies: >>716295434
Anonymous
7/24/2025, 9:52:21 AM No.716295163
>>716282706 (OP)
I just use Featherless

works fine
Anonymous
7/24/2025, 9:52:53 AM No.716295187
>>716295145
I think its just one guy shitposting pretending to be an ungrateful asshole after someone else getting their question answered.
Anonymous
7/24/2025, 9:53:16 AM No.716295212
>>716293802
>>716293938
is that why they got rid of the default included character cards? felix the cat, the coder guy and the anime girl?
Replies: >>716295356 >>716299231
Anonymous
7/24/2025, 9:55:39 AM No.716295336
>>716295112
Those file sizes directly correspond to loading the model. So as an example if you had an 8GB card and wanted the model to have any amount of speed you could only really use the 2bit quants (which would be shit and not worth it). However you could still technically load a bigger model as long as you have enough RAM.

If you go over VRAM it will dump it into regular RAM which is much slower.
Replies: >>716295591 >>716295625
Anonymous
7/24/2025, 9:56:00 AM No.716295356
>>716295212
Seraphina was still there when I last reinstalled it, which is funny considering that's the pure RP character.
Anonymous
7/24/2025, 9:56:19 AM No.716295370
>>716295112
yes the file size is what is going to try to fit in vram depending on the number of gpu layers you set (higher mean more in vram)

looks like Stheno is primarily a 8B paramters so it make sense that it ran fine in Q8 but 8B is fucking garbage, especially for your gpu. with 16GB you could use a 24B model at Q4_M_S relatively painlessly but not at full layers, probably half.
Replies: >>716295591
Anonymous
7/24/2025, 9:57:41 AM No.716295434
>>716295161
I tried many models and there isn't a magic recipe, abliteration is a bit like a lobotomy.
Anonymous
7/24/2025, 10:00:46 AM No.716295591
file
file
md5: 8b182d0850cc90ce9acbeaa8eae466c1๐Ÿ”
>>716295336
>>716295370
I see, i've got 32gb of regular ram and I didn't mind the speed of stheno while it was running purely RAM only so even offloading half of it to GPU would be an improvement. I just had no idea what the limits are.
Working on the LM studio install, thanks for the rec.
Replies: >>716295760 >>716295975
Anonymous
7/24/2025, 10:01:25 AM No.716295625
>>716295336
you can go over vram and still get decent speed as long as you don't offload everything to the gpu which would just trigger swapping which is far worse
Anonymous
7/24/2025, 10:02:56 AM No.716295707
599
599
md5: 27bec306cebbe5a3156af6b75d7f95c4๐Ÿ”
>>716295048
>external service provider
>meanwhile, 9000 threads on this board about payment processors fucking around with denials
Does that not tip you off that anything that could be easily cut off is a humongously retarded idea?
Replies: >>716295950
Anonymous
7/24/2025, 10:04:04 AM No.716295760
>>716295591
to be clear you don't need to have to pick a model/quant that fits entirely, that's how gpu offload comes to help, if the model is slightly bigger you can simply offload less so that the vram isn't overflowing which is worse in term of performances than having a static portion in vram and the rest in ram.
Anonymous
7/24/2025, 10:07:27 AM No.716295950
>>716295707
Generally they aren't advertising their models as porn models.
Anonymous
7/24/2025, 10:07:54 AM No.716295975
>>716295591
and make sure you use the vulkan runtime in LM studio (i think it's gonna be picked by default since you have an AMD), you can click the magnifier on the left and click the runtime tab (or models to look for models)
Anonymous
7/24/2025, 10:10:08 AM No.716296076
>>716288603
>post
kill yourself
>game you posted
i love mmbn. there are like, 2 other games like it, it's so sad. the grid layout is so unique and so good, the battle chip concept is awesome, the leveling was so good, and no one has really copied it in a sensible way. we have the something something step eden game and we have berserk bits which reduces the whole concept to an idle game.
Anonymous
7/24/2025, 10:12:05 AM No.716296154
>>716282908
Yeah, even the top of the line models everyone brags about start losing the plot of any RP I try after around 60-70 messages or so. By 80-90 they start getting confused and characters start more or less acting randomly and forgetting things that happened previously, and after this it's basically the AI just hallucinating non-stop.
It's fun for short term shit, but anything longer term than that is kind of a gamble.
Replies: >>716296374 >>716299463 >>716303550
Anonymous
7/24/2025, 10:13:41 AM No.716296229
1743059930663405
1743059930663405
md5: 221bfa2f5e5160781eebd9815059ab6a๐Ÿ”
>>716295048
>provider catches wind of your loli cunny ERP
>API access key invalidated
Anonymous
7/24/2025, 10:16:55 AM No.716296374
>>716296154
define top of the line model. Dilution is real but if the context limit is reached it's a straight up hard cut off starting at the beginning and getting worse over time.
Replies: >>716296790
Anonymous
7/24/2025, 10:18:16 AM No.716296436
>LM studio is constantly trying to make internet connections
>Nothing in settings to make it stop or run offline mode only
>have to make several firewall rules blocking it
>more new connections keep being made
This is already looking grim and I haven't even found out how to pipe this shit through to SillyTavern from CLI
Replies: >>716296548 >>716296656
Anonymous
7/24/2025, 10:20:42 AM No.716296548
>>716296436
it has an updater for the app and its runtimes and download stuff straight from huggingface, of course it's making connections...

>and I haven't even found out how to pipe this shit through to SillyTavern from CLI
skill issue
Replies: >>716296626
Anonymous
7/24/2025, 10:21:53 AM No.716296601
Anyone use text-to-speech with ST? Did they come up with a model that can make sex noises yet?
Anonymous
7/24/2025, 10:22:27 AM No.716296626
>>716296548
>it has an updater for the app and its runtimes and download stuff straight from huggingface, of course it's making connections...
The entire point of me downloading multiple gigs of models beforehand and running them locally is to not connect to the internet. Kobold didn't need to connect to shit.
Replies: >>716296669 >>716296689
Anonymous
7/24/2025, 10:23:00 AM No.716296656
>>716296436
Just don't use LM Studio then??
Replies: >>716296694
Anonymous
7/24/2025, 10:23:06 AM No.716296658
I switched from stheno 8b to nemo 12b and it's noticeably better. Makes me wonder what kind of god tier adventures 70b models give. I still have that obnoxious "she says calmly/angrily/happily/*insert any emotion here*" after every sentance once getting to around 100 messages though.
Replies: >>716296724 >>716296731
Anonymous
7/24/2025, 10:23:19 AM No.716296669
>>716296626
Kobold justwerksโ„ข
Anonymous
7/24/2025, 10:23:46 AM No.716296689
>>716296626
nigga you're on windows lmao. Unplug your ethernet cable if you're that paraonoid.
Replies: >>716296732
Anonymous
7/24/2025, 10:23:49 AM No.716296694
file
file
md5: 9f344892db470ea6e863c56ccac530d6๐Ÿ”
>>716296656
Anonymous
7/24/2025, 10:24:36 AM No.716296724
>>716296658
If shit gets too annoying just ban those tokens. No more tasty morsels or predatory looks for me.
Anonymous
7/24/2025, 10:24:47 AM No.716296731
>>716296658
there is a diminutive factor when going to higher models.
Anonymous
7/24/2025, 10:24:47 AM No.716296732
>>716296689
Congrats on most retarded post in the thread award (still 400 posts to go, maybe someone will dethrone you)
Replies: >>716296778
Anonymous
7/24/2025, 10:25:50 AM No.716296778
>>716296732
I accept your concession. Now stop being tech illiterate or stick to softwares made for retards.
Replies: >>716296979
Anonymous
7/24/2025, 10:26:05 AM No.716296790
>>716296374
The ones everyone always brags about are Deepseek these days and it's fun but starts losing the plot after a while.
It also has an annoying habit of, any card that has one than one person in it, trying to have every character contribute to every scene. Which means if me and one character are explicitly separated from the others, then I have to constantly narrate what those other people are doing because the second I don't, they're immediately blasting through the nearest door to come in and comment on what we're doing, no matter how far away they were last time they were mentioned.
Replies: >>716296963 >>716297532 >>716299463
Anonymous
7/24/2025, 10:29:38 AM No.716296963
>>716296790
deepseek is /pol/ overhyped chink trash

>It also has an annoying habit of, any card that has one than one person in it, trying
I don't know exactly how the card system works but sounds like sillytavern just keep inserting them at regular interval which prompt the model to piggyback on the other characters in them. I don't think multi people card is a smart choice.
Replies: >>716297107
Anonymous
7/24/2025, 10:29:56 AM No.716296979
>>716296778
Oh I see its just a reading comprehension issue. I feel bad for you.
Do winbabies even use CLI for anything? They want one-click apps which is what LM Studio appears to be, opposed to kobold happily running from terminal.
Replies: >>716297085
Anonymous
7/24/2025, 10:32:17 AM No.716297085
>>716296979
then stick to kobold? fucking mongoloid nigger what even is the point of your replies?
Replies: >>716298356
Anonymous
7/24/2025, 10:32:31 AM No.716297107
>>716296963
Some of them work ok, but yeah I generally prefer to stick to 1-on-1 cards.
That said, I GENERALLY haven't had any real issues with Deepseek. Poking at others, but it's a decent enough fallback. My only real issue with it is it's really fucking obsessed with how everything smells, and when things get even remotely sexual it starts constantly chewing on your ears for some reason.
Replies: >>716297258
Anonymous
7/24/2025, 10:36:05 AM No.716297258
>>716297107
Were the smells thick and cloying
Replies: >>716297323
Anonymous
7/24/2025, 10:37:28 AM No.716297323
>>716297258
No, they usually smelled like cheap booze, perfume, and desperation.
Or iron and blood when I'm getting raped by sexy vampire women which has been happening a lot more often than you'd think lately.
Replies: >>716299913
Anonymous
7/24/2025, 10:41:48 AM No.716297532
>>716296790
Deepseek is shit, it's used by everyone because it's basically free.
I gave it chances every update and it keeps failing to be consistent and adding irrelevant shit and losing the original plot.
Replies: >>716297613
Anonymous
7/24/2025, 10:43:10 AM No.716297613
>>716297532
Any good alternatives? I'm poking at Gemini right now and a bit surprised it's letting me get away with blatantly sexual stuff considering how anal Google is about that kind of thing.
Replies: >>716297778 >>716299620 >>716299753
Anonymous
7/24/2025, 10:47:04 AM No.716297778
>>716297613
you have to be more specific about what fine tuned models you're referring to
Anonymous
7/24/2025, 10:49:03 AM No.716297869
Why is this now a ritual post?
Replies: >>716297985
Anonymous
7/24/2025, 10:51:58 AM No.716297985
>>716297869
Probably because all the AI chatbot generals are shit and full of schizos
Replies: >>716298165
Anonymous
7/24/2025, 10:55:28 AM No.716298165
>>716297985
many general turn to shit with a small clique of tribal autists talking about unrelated stuff
Anonymous
7/24/2025, 10:59:17 AM No.716298356
>>716297085
It's either contrarian troll or someone who has some Kobolb vs LMstudio schizo life goals
Replies: >>716298563
Anonymous
7/24/2025, 11:04:00 AM No.716298563
>>716298356
who is the schizo right now? you complain that LM doesn't do what you want it to do and that kobold does the job, so what the fuck are you even arguing about? It's a fact that LM is designed for the lowest common denominator, for better or worse which is why i recommend it to total beginners. Now piss off.
Replies: >>716300206
Anonymous
7/24/2025, 11:11:08 AM No.716298925
>>716292628
>switch to proprietary software marketed towards retards
Why?
Replies: >>716299068
Anonymous
7/24/2025, 11:13:46 AM No.716299068
>>716298925
>proprietary
ooooh you're one of those. Should have figured i was wasting my time with a /g/ schizo. Peace out.
Replies: >>716299251
Anonymous
7/24/2025, 11:15:52 AM No.716299161
1737003428984468
1737003428984468
md5: e022871c7c6fa3f79596f0fc0392e2e1๐Ÿ”
>melty over not being able to launch a model using kobold
It was genuinely easier than figuring out how to use [REDACTED] model online for free. I still keep my local setup just in case.
Anonymous
7/24/2025, 11:17:07 AM No.716299231
>>716295212
I just installed it a week ago and the anime girl is still there, as well as a generic "Assistant" example card.
Anonymous
7/24/2025, 11:17:34 AM No.716299251
1722783162637235
1722783162637235
md5: 081d51a5bd5b0c73a41f6ebc1ecb59c4๐Ÿ”
>>716299068
>wasting my time
You wrote one post, pipe down retard lmfao, what the fuck are you on.
Are you gonna elaborate on why you're telling people to switch to proprietary software or not?
Replies: >>716299375 >>716299429
Anonymous
7/24/2025, 11:20:58 AM No.716299375
>>716299251
I'd just like to interject for a moment. What you're referring to as Linux, is in fact, GNU/Linux, or as I've recently taken to calling it, GNU plus Linux. Linux is not an operating system unto itself, but rather another free component of a fully functioning GNU system made useful by the GNU corelibs, shell utilities and vital system components comprising a full OS as defined by POSIX. Many computer users run a modified version of the GNU system every day, without realizing it. Through a peculiar turn of events, the version of GNU which is widely used today is often called โ€œLinux,โ€ and many of its users are not aware that it is basically the GNU system, developed by the GNU Project. There really is a Linux, and these people are using it, but it is just a part of the system they use.
Replies: >>716299438
Anonymous
7/24/2025, 11:22:20 AM No.716299429
>>716299251
it's a shill
Anonymous
7/24/2025, 11:22:24 AM No.716299436
Are people still using local models in 2025? Local peaked in late 2023-early 2024. The best local model that can run in consumer hardware is like a year old at this point.
Anonymous
7/24/2025, 11:22:27 AM No.716299438
1721960247121133
1721960247121133
md5: 2fec24c7eb087f3baa98d8a143eaafe6๐Ÿ”
>>716299375
That's hella fucking epic dude, never seen that one before.
Why are you avoiding the question so vehemently?
Replies: >>716299519
Anonymous
7/24/2025, 11:22:57 AM No.716299463
>>716296790
>>716296154
Context management is 90% of what makes a good adventure, and what AI Dungeon does decently. For that you need to know the context limit, bring back characters and plot points that are mentioned inside the current context, and occasionally summarize previous events into new memories.

It's not very difficult, it's like 100 lines of Python, but also not what SillyTavern is made to do. And no one seems to fucking care, so there's that.

For an RPG, a context should be:

* LLM/Writing instruction
* Quick overview of the story.
* Relevant cards brought back by matching what is currently discussed
* Relevant "memories" brought back by matching what is current discussed
* Last XXX tokens of story to fill the context entirely

It's not that difficult to fuck up, but SillyTavern does.

Or you can also use a 128k context model, that works too.
Replies: >>716299863 >>716300027
Anonymous
7/24/2025, 11:23:31 AM No.716299487
this aint no paperclip
this aint no paperclip
md5: 2ec815a472b6a7bceafb52c07b81fa71๐Ÿ”
i just upgraded to a 5090 and text gen ui is incompatible. wtf
Replies: >>716299712
Anonymous
7/24/2025, 11:23:41 AM No.716299494
I'm too retarded to use this, I remember using char ai back in the day before it got lobotomized
There's a general on vg but I think it assumes you have basic knowledge. Can I get some really quick rundown without specifics so I know what to do?
Anonymous
7/24/2025, 11:24:07 AM No.716299519
>>716299438
I'd just like to interject for a moment. What you're referring to as Linux, is in fact, GNU/Linux, or as I've recently taken to calling it, GNU plus Linux. Linux is not an operating system unto itself, but rather another free component of a fully functioning GNU system made useful by the GNU corelibs, shell utilities and vital system components comprising a full OS as defined by POSIX. Many computer users run a modified version of the GNU system every day, without realizing it. Through a peculiar turn of events, the version of GNU which is widely used today is often called โ€œLinux,โ€ and many of its users are not aware that it is basically the GNU system, developed by the GNU Project. There really is a Linux, and these people are using it, but it is just a part of the system they use.
Replies: >>716299589
Anonymous
7/24/2025, 11:25:41 AM No.716299589
1729598081964052
1729598081964052
md5: ab20fb132a7ecb01c91b5b8c3db99541๐Ÿ”
>>716299519
>Spamming/flooding
That one simple question really worked you up for some reason.
Replies: >>716299651
Anonymous
7/24/2025, 11:26:22 AM No.716299620
>>716297613
Yeah 2.5 pro is decent and you can even make it write degenerate incest and sexual stuff wven without presets. API access is easy to get too by just signing up with new emails, for now.
The flash version can be very dry though.
Anonymous
7/24/2025, 11:26:54 AM No.716299651
>>716299589
I'd just like to interject for a moment. What you're referring to as Linux, is in fact, GNU/Linux, or as I've recently taken to calling it, GNU plus Linux. Linux is not an operating system unto itself, but rather another free component of a fully functioning GNU system made useful by the GNU corelibs, shell utilities and vital system components comprising a full OS as defined by POSIX. Many computer users run a modified version of the GNU system every day, without realizing it. Through a peculiar turn of events, the version of GNU which is widely used today is often called โ€œLinux,โ€ and many of its users are not aware that it is basically the GNU system, developed by the GNU Project. There really is a Linux, and these people are using it, but it is just a part of the system they use.
Replies: >>716299720
Anonymous
7/24/2025, 11:28:02 AM No.716299698
he's bumping the thread at least right
Anonymous
7/24/2025, 11:28:31 AM No.716299712
1731413764037561
1731413764037561
md5: 3cf9501b3f76e4cd29562acdcaf0704d๐Ÿ”
>>716299487
What is it saying when you try to load a model?
Replies: >>716299869
Anonymous
7/24/2025, 11:28:47 AM No.716299720
1749940206246483
1749940206246483
md5: cc5b864be4268e4d0f5ae3682fb97923๐Ÿ”
>>716299651
Why are you telling people to use proprietary software that does literally the same thing as the open source implementation?
Replies: >>716299786
Anonymous
7/24/2025, 11:29:35 AM No.716299753
1748379391466
1748379391466
md5: e137bcc18c824aa4232b2fe8abdab462๐Ÿ”
>>716297613
Those are your options.
Deepseek if you wanna pay 5$ every couple months
Grok/sonnet if you're willing to pay up to 30$ a month
Opus if you're willing to pay 100$ a month
Local is a meme, except for a quick coom, and at that point get deepchink.

I still remember when opus access was free. Months of the best text gaming I've had. Goddamn how I miss that. Could go into the thousand messages easily in a long, consistent plot, fucking amazing model.
Anonymous
7/24/2025, 11:30:13 AM No.716299786
1751671202905379_thumb.jpg
1751671202905379_thumb.jpg
md5: 565de3c0506d0b6d413b2df1f2495090๐Ÿ”
>>716299720
I'd just like to interject for a moment. What you're referring to as Linux, is in fact, GNU/Linux, or as I've recently taken to calling it, GNU plus Linux. Linux is not an operating system unto itself, but rather another free component of a fully functioning GNU system made useful by the GNU corelibs, shell utilities and vital system components comprising a full OS as defined by POSIX. Many computer users run a modified version of the GNU system every day, without realizing it. Through a peculiar turn of events, the version of GNU which is widely used today is often called โ€œLinux,โ€ and many of its users are not aware that it is basically the GNU system, developed by the GNU Project. There really is a Linux, and these people are using it, but it is just a part of the system they use.
Replies: >>716299872
Anonymous
7/24/2025, 11:32:05 AM No.716299863
>>716299463
Use this for the summary, works really well, and most models seem to understand it.

<!-- Don't reply as {{char}} to this message. Stop the roleplay and update this <roleplay_summary></roleplay_summary>. You must repeat this summary, adding the information that you deem essential to remember for constructing the future scenario as a GM! -->

<roleplay_summary ver="N"> <!-- version number N must be incremented with each summary update -->
This is the Memory Book of our roleplay which keeps the record of the most important information on what happened so far:

<npcs_encountered>
- X (<!-- relationship to {{user}} -->): <!-- NPC info. Appearance, speech manner, at least 3 personality traits. -->
- Y
- Z
</npcs_encountered>

<visited_locations>
- <!-- title --> : <!-- description in 1-3 sentences -->
-
-
</visited_locations>

<major_events>
- <!-- creative title in a couple words -->: <!-- brief summary -->
-
-
</major_events>

<secrets> <!-- "- " if no secrets -->
- <!-- secret --> (kept secret by <!-- char --> from X)
-
-
</secrets>

<relationships>
- <!-- long-term relationships between characters. These relationships must be updated anew each time, they are not additive -->
-
-
</relationships>

<planned_events>
- <!-- creative title in a couple words -->: <!-- brief summary -->
-
-
</planned_events>

<current_quests>
-
</current_quests>

</roleplay_summary>
Anonymous
7/24/2025, 11:32:08 AM No.716299869
terry-davis-terry-a-davis
terry-davis-terry-a-davis
md5: b06056f0cd385a68916c531b7a467554๐Ÿ”
>>716299712
E:\text-generation-webui-main\installer_files\env\Lib\site-packages\torch\cuda\__init__.py:235: UserWarning:
NVIDIA GeForce RTX 5090 with CUDA capability sm_120 is not compatible with the current PyTorch installation.
The current PyTorch install supports CUDA capabilities sm_50 sm_60 sm_61 sm_70 sm_75 sm_80 sm_86 sm_90.
If you want to use the NVIDIA GeForce RTX 5090 GPU with PyTorch, please check the instructions at https://pytorch.org/get-started/locally/
Replies: >>716299980 >>716300025
Anonymous
7/24/2025, 11:32:13 AM No.716299872
1752835408572427
1752835408572427
md5: 95dfdf5c76c63c41e485d7d753bf13c9๐Ÿ”
>>716299786
It's really not a hard question.
You could've at least attempted to explain why you think it's better, instead you went straight to sperging out.
Replies: >>716299984 >>716300228 >>716300715
Anonymous
7/24/2025, 11:32:57 AM No.716299913
>>716297323
And ozone
Anonymous
7/24/2025, 11:34:29 AM No.716299980
>>716299869
It literally tells you what to do. Install experimental pytorch.
>terry poster is a tech-illiterate (actually, just illiterate) retard
Every single time.
Anonymous
7/24/2025, 11:34:32 AM No.716299984
>>716299872
you look like a massive faggot avatarfagging with the cat pictures
Replies: >>716300105
Anonymous
7/24/2025, 11:35:26 AM No.716300025
>>716299869
You could probably just paste the error in ChatGPT and ask how to fix it. The irony makes it funny too
Anonymous
7/24/2025, 11:35:31 AM No.716300027
>>716299463
>* Quick overview of the story.
>* Relevant cards brought back by matching what is currently discussed
>* Relevant "memories" brought back by matching what is current discussed
Isn't that what lorebooks are for? Or is AI Dungeon automating that process
I saw an extension for ST lorebook automisation but I think it only works for specific models
Anonymous
7/24/2025, 11:37:14 AM No.716300105
1727058710353777
1727058710353777
md5: c157d7c62dfa58f54ed224e0baff4f6b๐Ÿ”
>>716299984
Nyo... How could you say this to your fellow forum poster...
>still avoiding the question
lole + lmao
Anonymous
7/24/2025, 11:39:24 AM No.716300206
1684769023267350
1684769023267350
md5: 97575c226b7dce15229c0f51339390d5๐Ÿ”
>>716298563
Wat
I just pointed out how you must be contrarian troll or schizo to manage to go such lengths rather than using one you just prefer.
That was my first post on the thread.
Baaaaaka
Replies: >>716300278
Anonymous
7/24/2025, 11:39:47 AM No.716300228
>>716299872
if only you knew how much time I wasted debating with trannies like you during the big linux/free software push amidst the vista's debacle. Never again.
Replies: >>716300458
Anonymous
7/24/2025, 11:40:48 AM No.716300278
>>716300206
my bad, I read it as "you're either..." there is some serious amount of noise from different trolls in this thread and it's hard to keep up.
Replies: >>716300584
Anonymous
7/24/2025, 11:41:44 AM No.716300321
>>716282706 (OP)
Where do you guys go for bots/cards? I use chub but it's been very meh lately when it's not just people making "bots" that's just a blog or that one i saw the other week with someone just making a bot where every field was just filled with tard reading over how much he hates the artist he used for the bot's pic.
Replies: >>716300515
Anonymous
7/24/2025, 11:44:51 AM No.716300458
1730362525123002
1730362525123002
md5: 3b9b2c127d94d00bdf51cd81eec50c8d๐Ÿ”
>>716300228
>gets asked a question
>"ITS DA TROONS!11"
lole
Anonymous
7/24/2025, 11:46:08 AM No.716300515
1724365993219299
1724365993219299
md5: e7de3407bc741518f0a6e95c6ee36ced๐Ÿ”
>>716300321
The evulid character archive. You can find some fun cards by hitting randomize enough times. Though personally I just make my own cards now because 99% of the ones posted online are complete trash garbage.
Anonymous
7/24/2025, 11:47:37 AM No.716300584
Small Haato
Small Haato
md5: 46aedbed6c99c3c838d5ebedb4180edb๐Ÿ”
>>716300278
Alrighty.
Well do take care anon-kun.
Anonymous
7/24/2025, 11:51:03 AM No.716300715
1220991709065
1220991709065
md5: 345be1be37d91cefde74413c810b2394๐Ÿ”
>>716299872
>sperging out
how fucking new are you
Replies: >>716300835
Anonymous
7/24/2025, 11:53:53 AM No.716300835
>>716300715
>gets asked a question
>immediately goes on the defensive
>spams copypasta
>starts blabbering something about trannies
Spergs have no self-awareness.
Replies: >>716301523
Anonymous
7/24/2025, 11:54:24 AM No.716300854
>free software cultist
>avatarfag desperate for attention
it always adds up
Replies: >>716301026 >>716301060 >>716301105
Anonymous
7/24/2025, 11:58:02 AM No.716301026
>>716300854
I bet it's one of those... things too
Anonymous
7/24/2025, 11:58:47 AM No.716301060
1735900080138337
1735900080138337
md5: 20ae90f03e7339644563bcc2e898debc๐Ÿ”
>>716300854
>>free software cultist
Did you divine that out of one post?
I'm not against proprietary software, I'm asking why the fuck would you recommend it over an open alternative that does the exact same thing. Not only that, it actually does it better, because last time I checked LM Studio hogged more VRAM compared to Kobold using the same settings.
Anonymous
7/24/2025, 12:00:01 PM No.716301105
1753351201567293.jpg
1753351201567293.jpg
md5: 388d511b8fbee60fafdd92c80355b191๐Ÿ”
>>>716300854
>I bet it's one of those... things too
Replies: >>716301297
Anonymous
7/24/2025, 12:03:59 PM No.716301283
the amount effort you pedos go to jerk off to the thought of fucking children is unreal
Replies: >>716301439
Anonymous
7/24/2025, 12:04:16 PM No.716301297
>>716301105
>stop noticing things
https://www.linuxfoundation.org/press/press-release/linux-foundation-focuses-on-science-and-research-to-advance-diversity-and-inclusion-in-software-engineering
Anonymous
7/24/2025, 12:06:47 PM No.716301439
1746815242968317
1746815242968317
md5: 66d629fe4ab5dd2a03964b19b5e55507๐Ÿ”
>>716301283
>thread about an LLM frontend
>immediately thinks about cp
Anonymous
7/24/2025, 12:07:12 PM No.716301462
>degenerate anons discussing coomslop text llm peacefully
>here comes an avatarfag having a meltdown
>here comes another PEDO PEDO PEDO!!!!
Anonymous
7/24/2025, 12:08:24 PM No.716301523
>>716300835
>no self-awareness
you're the one replying to a two decades old copypasta as if it were a genuine response
Anonymous
7/24/2025, 12:10:50 PM No.716301642
Weird how I ask for help with a couple settings and it turns into like 5 different people hijacking my question and arguing with eachother for half the thread.
Anonymous
7/24/2025, 12:12:03 PM No.716301702
microsoft lawsuit status?
Replies: >>716302030
Anonymous
7/24/2025, 12:18:13 PM No.716302030
1751580076897738
1751580076897738
md5: 20525979c5f73cc44f23ecf081ab6362๐Ÿ”
>>716301702
fiz fought back
Anonymous
7/24/2025, 12:23:05 PM No.716302279
Mint2x
Mint2x
md5: 8123728014522f7ab59343a5b74ec4c3๐Ÿ”
I ain't gonna bother reading this thread but I know (You) are often retarded and need help in some way or another, either setting up or getting it to work properly. Quote me and I'll answer whatever questions you have. You have half an hour.
Replies: >>716302850
Anonymous
7/24/2025, 12:33:41 PM No.716302850
>>716302279
Recommend a 12B model. I've already tried Rocinante.
Replies: >>716303231
Anonymous
7/24/2025, 12:40:44 PM No.716303231
>>716302850
Pshaw this outdated nigga. Fine. People swear up and down that MagMell is amazing, but I thought it was shit and dumb. Your mileage might vary.

My actual favorite 12B was, aside from Rocinante, Violet Lotus 12B. It's temperamental and you need to have a good card for it, ie.no typos, intro doesn't act for you, etc... but it had the best prose by far and adhered well to character traits.

Captain BMO was another decent choice. It's a lot like Rocinante in that it works with whatever you throw at it, but characters come off generic as a result.

Lyra-Gutenberg-mistral-nemo-12B was ok too but it's a step down, I think. Had good prose but never did use it that much.
Replies: >>716303343 >>716303960
Anonymous
7/24/2025, 12:41:51 PM No.716303290
1736746134739697
1736746134739697
md5: 895980c132fef4ad0ad67ced771d86ac๐Ÿ”
>>716282706 (OP)
Node based Russian Tavern, inspired in Silly Tavern and ComfyUI nodes, supports proxys, and the same as Silly Tavern , please put it in the OP.

https://tavernikof.github.io/NoAssTavern/
https://rentry.org/noasstavern
https://github.com/Tavernikof/NoAssTavern

*****
>What is this?
This is a new frontend, inspired by the stupid tavern, but sharpened purely for bezhop . The main motivation is to fix what is poorly done in the tavern and add new functionality. It does not need a backend to work, so it runs purely through the browser (there are some limitations, more on that below ) .
At the moment, this is a very raw version and is suitable for those who know how to edit presets or at least understand at a basic level how lobotomite works. Although you can already tinker with it now, the basic settings are available

>Main differences:
N O D Y . Yes, you heard right, the wet dream is already here.
Chats are separated from cards. Similar to risu, angai and any other adequate frontend
Presets are tied to chats. Hello FatPresets
Prompt editor . Allows more explicit control over what goes into the request
What it can do at the moment:
Basic stuff: character cards, personas, chats, presets, proxies
Backends: Claude, Gemini, OpenAI (in theory all compatible ones should be supported)
External blocks

>Two more weeks:
Mobile version
Summary (Sillipidor won't steal your summary if you don't have one)
Lorbuki
Regex magic
Plugins and Themes
Replies: >>716303459 >>716304813
Anonymous
7/24/2025, 12:42:53 PM No.716303343
>>716303231
I'll try them out, thanks broski.
Anonymous
7/24/2025, 12:44:56 PM No.716303459
>>716303290
>N O D Y . Yes, you heard right, the wet dream is already here.
You may need to run this post through a few more translation AIs
Replies: >>716304648
Anonymous
7/24/2025, 12:46:40 PM No.716303550
>>716296154
I found that Gemini was best at remembering details for the longest, but its writing is too robotic, even with Jailbreak.
Anonymous
7/24/2025, 12:49:00 PM No.716303692
I run a Mistral Nemo 12B Instruct pretty well on Kobold. I used Silly Tavern back in the day for Proxies but I found Nemo to work pretty decently.

Anything similar to Nemo or what I'm trying to say is, Have larger models been compressed enough to be in similar size to what I was using?

I hilariously got shot down by my AI waifu at one point
Anonymous
7/24/2025, 12:53:51 PM No.716303960
>>716303231
>People swear up and down that MagMell is amazing, but I thought it was shit and dumb.
I find Mag Mell merges that throw out its positivity and assistant messaging to be pretty decent but I haven't been doing this long enough to know for sure, and I've only used local so I can't compare it to any online models
NTA but I will test out Violet Lotus later, cheers for that
Anonymous
7/24/2025, 12:55:57 PM No.716304068
>>716291543
>what is piracy
>seeders and leechers
>torrent isn't popular with normies
>reliably seeded,
>good ratio
>torrent is popular with normies
>fucking 10+:1 leech:seed
>most leechers won't seed
Assuming people aren't facetious in making a thread about something that has a similar dynamic to this they only stand to attract normies that only leech.
Replies: >>716304551
Anonymous
7/24/2025, 1:04:21 PM No.716304551
>>716304068
How does that apply to SillyTavern? The dynamic isn't similar at all.
Anonymous
7/24/2025, 1:06:15 PM No.716304648
>>716303459
Knowing russians, he was unironically too dumb to use the AI - he just threw it into Google Translate and called it a day.
Anonymous
7/24/2025, 1:09:38 PM No.716304813
>>716303290
>please put it in the OP.
This retard's bot thinks its on /vg/ or /g/ lmao
Anonymous
7/24/2025, 1:49:38 PM No.716307074
>>716282706 (OP)
why are these posts considered on-topic?
Replies: >>716307178
Anonymous
7/24/2025, 1:51:27 PM No.716307178
>>716307074
use catalog and take a look yourself