
Thread 105698912

385 posts 84 images /g/
Anonymous No.105698912 [Report] >>105703188
/lmg/ - Local Models General
/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>105689385 & >>105681538

►News
>(06/21) LongWriter-Zero, RL trained ultra-long text generation: https://hf.co/THU-KEG/LongWriter-Zero-32B
>(06/20) Magenta RealTime open music generation model released: https://hf.co/google/magenta-realtime
>(06/20) Mistral-Small-3.2 released: https://hf.co/mistralai/Mistral-Small-3.2-24B-Instruct-2506
>(06/19) Kyutai streaming speech-to-text released: https://kyutai.org/next/stt
>(06/17) Hunyuan3D-2.1 released: https://hf.co/tencent/Hunyuan3D-2.1

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/tldrhowtoquant
https://rentry.org/samplers

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/leaderboard.html
Code Editing: https://aider.chat/docs/leaderboards
Context Length: https://github.com/adobe-research/NoLiMa
Censorbench: https://codeberg.org/jts2323/censorbench
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
Anonymous No.105698922 [Report]
►Recent Highlights from the Previous Thread: >>105689385

--Optimizing multi-GPU/RPC model execution via tensor offloading and memory tuning:
>105693780 >105693814 >105693828 >105693870 >105693890 >105694096 >105694121 >105694200 >105694168 >105693900 >105693919 >105693920 >105693933 >105693968 >105693987 >105694020 >105694045 >105694032 >105694144 >105694265 >105694431 >105694487 >105694501 >105694515 >105694568 >105694828 >105694834 >105694890 >105694997 >105695037 >105695042 >105695123 >105695182 >105695506 >105695533 >105695631 >105695668 >105695683 >105695741 >105695906 >105696219
--Gemma model size suggestions and distillation technique debates in response to Google's feedback request:
>105690177 >105690247 >105692953 >105693027 >105690529 >105690541 >105690614 >105690642 >105690399 >105691354 >105690248
--Quant testing with IK-llama shows promise but faces CUDA and performance challenges:
>105692033 >105692197 >105694719 >105694758 >105694804 >105696513 >105696562 >105694000 >105694047 >105694101 >105694179
--Court rules AI training on books legal, but storage of pirated copies infringes copyright:
>105691671 >105691690 >105691810 >105691825 >105691865 >105692000
--Google's Gemini Robotics VLA released with limited access and mixed robotics capability expectations:
>105691639 >105691715 >105691734 >105691721 >105692142
--Unexpected GPU performance discrepancies in token generation benchmarking:
>105696010 >105696048 >105696333 >105697419
--Agentic framework limitations and needs for local large language models:
>105692962 >105692985 >105693006 >105693025 >105693060 >105693849
--llama.cpp gains high-throughput mode for improved performance:
>105692045 >105694048 >105694098
--Skepticism around chatllm.cpp and llama.cpp for accurate model inference:
>105691150 >105691439 >105691580
--Miku (free space):
>105696371 >105696546

►Recent Highlight Posts from the Previous Thread: >>105689390

Why?: 9 reply limit >>102478518
Fix: https://rentry.org/lmg-recap-script
Anonymous No.105698938 [Report] >>105700218
lesbian girls are /lmg/ culture
Anonymous No.105698940 [Report] >>105698956 >>105699010
how difficult is training a lora, and does it really have high vram requirements?
What about a lora vs a fine tune?
I want to try adding some question-and-answer style text blocks to mistral large quants, to both increase knowledge and reinforce the answering style. I have 48gb vram
Anonymous No.105698956 [Report] >>105698974
>>105698940
What front end are you using?
You don't necessarily need anything special...
Anonymous No.105698974 [Report] >>105699018
>>105698956
oobabooga, it's the vram I am most concerned about before I spend ages getting a load of data formatted and ready to train
Anonymous No.105699010 [Report] >>105699040
>>105698940
>lora ... to ... increase knowledge
Abandon hope. You're also not going to be able to finetune (whether with LoRA or full finetuning) Mistral Large (123B parameters) with 48GB of VRAM.
Anonymous No.105699018 [Report] >>105699028
>>105698974
You don't need anything special in order to change the model's output. I don't know about oobabooga but in SillyTavern you can slot in "Examples of dialogue" (which are tokenized in a certain way)
><START>
>{{user}}: simple question
>{{char}}: simple answer
><START>
>new example...
It's just convenient in ST as it has its own slot and is hidden from the user, but you could just add this to your prompt.
So include an example conversation and repeat it a couple of times.
Anonymous No.105699028 [Report]
>>105699018
Or this could be inserted into your system prompt.
Whatever works, as long as it gets submitted to the model itself in an understandable format.
Anonymous No.105699040 [Report] >>105699078
>>105699010
can you train a lora on the quantized version though? if that runs in 45gb, can you also train a lora in 45gb?
I don't have extremely high hopes of "teaching" it much, but I would be interested to see if it is an improvement at all
Anonymous No.105699078 [Report] >>105699159
>>105699040
Right now it's only possible to finetune models in 4-bit at the minimum with QLoRA, so you'd need 60+ GB for the model weights alone.
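For scale, here's roughly what a minimal 4-bit QLoRA setup looks like with the usual transformers + peft + bitsandbytes stack (a sketch only; the model id and LoRA hyperparameters are placeholders, not a recommendation):

# minimal QLoRA sketch; assumes: pip install transformers peft bitsandbytes accelerate
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

model_id = "mistralai/Mistral-Small-3.2-24B-Instruct-2506"  # placeholder; Mistral Large 123B won't fit in 48GB even at 4-bit

bnb_cfg = BitsAndBytesConfig(
    load_in_4bit=True,                       # QLoRA: frozen base weights quantized to 4-bit NF4
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(model_id, quantization_config=bnb_cfg, device_map="auto")
tokenizer = AutoTokenizer.from_pretrained(model_id)

model = prepare_model_for_kbit_training(model)   # gradient checkpointing, norm casting, etc.
lora_cfg = LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,      # placeholder hyperparameters
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_cfg)
model.print_trainable_parameters()               # only the small adapter trains; the 4-bit base stays frozen

Even in 4-bit, 123B of weights is about 60 GB before optimizer states, activations and KV cache, which is the point above: 48 GB of VRAM rules out Mistral Large regardless of LoRA rank.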
Anonymous No.105699159 [Report] >>105699171 >>105699223
>>105699078
ah lame, I guess the new mistral small might be a better candidate then?
Anonymous No.105699171 [Report] >>105699178
>>105699159
Just tweak your goddamn prompt. I swear to god people like you don't even try.
Anonymous No.105699178 [Report] >>105699210 >>105699223
>>105699171
I know how to tweak prompts retard I want to investigate lora training as a comparison for the sake of learning
Anonymous No.105699210 [Report]
>>105699178
Okay sorry :3 I think you are full of shit.
Maybe test with a small model first to get your bearings.
Why would you even want to begin with 123B model in the first place?
With 14B you could get fast results and see what you are actually doing.
Anonymous No.105699223 [Report]
>>105699178
If only it were that easy. You'll probably need dozens of data exposures with different wording at a large enough rank to make the model truly internalize the knowledge and not just parrot it if it sees the same question(s).
Even a rank 1 QLoRA is enough to make the model memorize verbatim limited amounts of information, but that doesn't imply at all that it will be able to organically use it without hallucinating details or simply making stuff up completely.
Mistral Small >>105699159 would be a better choice for these experiments, or even better a smaller model, so that finetuning attempts take less time.
Anonymous No.105699229 [Report] >>105699260 >>105699273
>>105698699
>>105698742
I'm on Linux. And the only thing I changed was -server instead of -cli.

I noticed that the CPU cores were running at approx 80% (I isolated 8 cores for the purpose) in the case of the server, while they were at 100% in the case of the CLI.

GPU load is the same in both cases.

I see no reason why there should be a difference
Anonymous No.105699260 [Report]
>>105699229
I don't know, maybe this has something to do with your kernel. Or the way llama.cpp has been compiled.
Way over my pay grade (which wasn't too much to begin with in the first place).
Anonymous No.105699273 [Report] >>105699479
>>105699229
If I was you I would compile a new kernel and go through the settings and double check things.
Then have a backup ready if something goes wrong.
I haven't bothered with linux in years though, last time I compiled a kernel it had a text UI.
Anonymous No.105699378 [Report] >>105699408 >>105699417 >>105699419 >>105699449 >>105699566 >>105699596 >>105699793
hmmm..
https://huggingface.co/tencent/Hunyuan-A13B-Instruct-FP8
Anonymous No.105699408 [Report] >>105699416 >>105699422 >>105699455
>>105699378
?
Anonymous No.105699416 [Report]
>>105699408
Ask your mother
Anonymous No.105699417 [Report] >>105702734
>>105699378
>32768 ctx
Anonymous No.105699419 [Report] >>105699442 >>105699455
>>105699378
>https://huggingface.co/tencent/Hunyuan-A13B-Instruct-FP8
Anonymous No.105699422 [Report] >>105699537 >>105699945
>>105699408
strange, it's gone now but it was:
80.4B params total
13B active
Anonymous No.105699442 [Report] >>105699455
>>105699419
81B parameters total, 13B active, has shared experts.
Anonymous No.105699449 [Report] >>105699455
>>105699378
Anonymous No.105699455 [Report]
>>105699408
>>105699419
>>105699442
picrel
>>105699449
thanks
Anonymous No.105699478 [Report] >>105699499
>nobody thought to download it
Anonymous No.105699479 [Report] >>105699510
>>105699273
>If I was you I would compile a new kernel

You must be kidding lol
Anonymous No.105699499 [Report]
>>105699478
this is faster, just clone it to your own repo and then set it to private
https://huggingface.co/spaces/huggingface-projects/repo_duplicator
Anonymous No.105699510 [Report] >>105699523
>>105699479
It's super easy to compile, but configuration...
Anonymous No.105699523 [Report] >>105699582
>>105699510
Why should I be willing to do such a thing!

It is not as if I'd experience any problems with the on-board ethernet or something
Anonymous No.105699534 [Report]
why do i still map a CPU buffer when everything is supposed to be on the gpus?

--cache-type-k q4_0 \
--threads 48 \
--n-gpu-layers 99 \
--prio 3 \
--temp 0.6 \
--top_p 0.95 \
--min_p 0.01 \
--flash-attn \
--ctx-size 16384 \
-ot "blk\.(1|2|3|4|5|6)\.ffn_.*=CUDA0" \
-ot "blk\.(7|8|9|10|52)\.ffn_.*=CUDA1" \
-ot "blk\.(11|12|13|14|53)\.ffn_.*=CUDA2" \
-ot "blk\.(15|16|17|18|54)\.ffn_.*=CUDA3" \
-ot "blk\.(19|20|21|22|55)\.ffn_.*=RPC[10.0.0.28:50052]" \
-ot "blk\.(23|24|25|26|56)\.ffn_.*=RPC[10.0.0.28:50053]" \
-ot "blk\.(27|28|29|30|57)\.ffn_.*=RPC[10.0.0.28:50054]" \
-ot "blk\.(31|32|33|34|58)\.ffn_.*=RPC[10.0.0.28:50055]" \
-ot "blk\.(35|36|37|38|59)\.ffn_.*=RPC[10.0.0.40:50052]" \
-ot "blk\.(39|40|41|42|60)\.ffn_.*=RPC[10.0.0.40:50053]" \
-ot "blk\.(43|44|45|46|51)\.ffn_.*=RPC[10.0.0.40:50054]" \
-ot "blk\.(47|48|49|50)\.ffn_.*=RPC[10.0.0.40:50055]" \
--override-tensor exps=CUDA0 \
Anonymous No.105699537 [Report]
>>105699422
I calculated about 3B of shared parameters.
Anonymous No.105699538 [Report]
Anonymous No.105699559 [Report]
Just like Java means Durgasoft, LLMs mean Miku.
Anonymous No.105699566 [Report] >>105699578
>>105699378
will Hunyuan-A13B-Instruct-FP8 save local?
Anonymous No.105699578 [Report]
>>105699566
they do have good models
Anonymous No.105699582 [Report] >>105699600 >>105699700 >>105699763
>>105699523
Most people are so out of touch with their computing systems.
You can go into the kernel configuration and double check the options and compile a new one.
It's not like Bill Gates is coming to kill your computer.

This is why I hate people on the internet these days. You are not an enthusiast. You are a jackass with an expensive machine but you have no clue how to use it.
Anonymous No.105699596 [Report] >>105699626 >>105699630
>>105699378
{
"_id": "685be1a14059850217f25ffc",
"id": "tencent/Hunyuan-A13B-Instruct-FP8",
"siblings": [
{
"rfilename": ".gitattributes"
},
{
"rfilename": "config.json"
},
{
"rfilename": "configuration_hunyuan.py"
},
{
"rfilename": "generation_config.json"
},
{
"rfilename": "hunyuan.py"
},
{
"rfilename": "hunyuan.tiktoken"
},
{
"rfilename": "hy.tiktoken"
},
{
"rfilename": "model-00001-of-00017.safetensors"
},
{
"rfilename": "model-00002-of-00017.safetensors"
},
{
"rfilename": "model-00003-of-00017.safetensors"
},
{
"rfilename": "model-00004-of-00017.safetensors"
},
{
"rfilename": "model-00005-of-00017.safetensors"
},
{
"rfilename": "model-00006-of-00017.safetensors"
},
{
"rfilename": "model-00007-of-00017.safetensors"
},
{
"rfilename": "model-00008-of-00017.safetensors"
},
{
"rfilename": "model-00009-of-00017.safetensors"
},
{
"rfilename": "model-00010-of-00017.safetensors"
},
{
"rfilename": "model-00011-of-00017.safetensors"
},
{
"rfilename": "model-00012-of-00017.safetensors"
},
{
"rfilename": "model-00013-of-00017.safetensors"
},
{
"rfilename": "model-00014-of-00017.safetensors"
},
{
"rfilename": "model-00015-of-00017.safetensors"
},
{
"rfilename": "model-00016-of-00017.safetensors"
},
{
"rfilename": "model-00017-of-00017.safetensors"
},
{
"rfilename": "model.safetensors.index.json"
},
{
"rfilename": "modeling_hunyuan.py"
},
{
"rfilename": "special_tokens_map.json"
},
{
"rfilename": "tokenization_hy.py"
},
{
"rfilename": "tokenizer_config.json"
},
{
"rfilename": "vit_model.py"
}
]
}
Anonymous No.105699600 [Report] >>105699604
>>105699582
Stay mad
Anonymous No.105699604 [Report]
>>105699600
I'm not mad. I'm laughing at you discord zoomer.
You bought a car but are unable to do basic maintenance.
Anonymous No.105699626 [Report]
>>105699596
gib signed cdn download links
Anonymous No.105699630 [Report]
>>105699596
vit_model?
Anonymous No.105699642 [Report] >>105699676 >>105699702
VRAMlets should check out the new Magnum-Diamond. It's the only RP finetune for 3.2 but hardly anyone has downloaded it yet.
Anonymous No.105699676 [Report]
>>105699642
>It's the only RP finetune for 3.2
creator of world famous mythomax:
https://huggingface.co/Gryphe/Codex-24B-Small-3.2
Anonymous No.105699700 [Report] >>105699717 >>105699724
>>105699582
nta
>if you dont know from scratch to finish how x works you shouldent be allowed to own it
cool so when are you getting rid of all your clothes car house language numericals your own body etc etc etc also do tell how are each different types of gates in the chip fabbed ?

people like you need to be killed demented beyond comprehension archons in the flesh
Anonymous No.105699702 [Report]
>>105699642
>hardly anyone downloaded it yet.
Hardly anyone needed it.
Anonymous No.105699717 [Report]
>>105699700
Thank you for replying.
I mean tweak your spice!
Anonymous No.105699724 [Report]
>>105699700
>zoomer devolves into schizo babble when confronted with his faults
many such cases
Anonymous No.105699763 [Report] >>105699963
>>105699582
>I hate people
Anonymous No.105699793 [Report]
>>105699378
I am genuinely hyped. I already know it is gonna be pretty mid intelligence-wise but the combo of 80B (even with moe) and possibly being uncensored can finally give us serverless cooming. I mean it could be like nemo but much bigger.

Alas, they probably cucked and it is censored...
Anonymous No.105699830 [Report] >>105699841 >>105699864
>no one built anything to watch popular huggingface repos and auto-download new things in case they get nuked after some intern realized he shouldn't have made it public yet
fine, i'll do it myself
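If anyone does roll their own, a rough sketch of the polling approach (assumes huggingface_hub is installed; the filter, interval and mirror path are made up for illustration):

# watch recently-updated HF repos and mirror anything new before it gets nuked
# sketch only; assumes: pip install huggingface_hub
import time
from huggingface_hub import HfApi, snapshot_download

api = HfApi()
seen = set()

while True:
    # newest-modified models first; tune limit/filters to whatever "popular" means to you
    for m in api.list_models(sort="lastModified", direction=-1, limit=50):
        if m.id in seen:
            continue
        seen.add(m.id)
        try:
            snapshot_download(repo_id=m.id, local_dir=f"./mirror/{m.id}")
        except Exception as e:
            print(f"failed to grab {m.id}: {e}")  # already nuked, gated, or out of disk
    time.sleep(600)  # poll every 10 minutes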
Anonymous No.105699841 [Report]
>>105699830
that is one of the things I always assume someone else will do, and it turns out I was right
Anonymous No.105699864 [Report] >>105699883
>>105699830
Can I help?
Anonymous No.105699883 [Report]
>>105699864
given the low complexity, just make your own version with some ai in the meantime and see if it works well enough to be shared
Anonymous No.105699945 [Report]
>>105699422
interesting size, we'll see how good it is
Anonymous No.105699963 [Report]
>>105699763
Sorry... I was busy setting up my miku fumos.
Anonymous No.105699975 [Report]
I've been trying to get the Wan Video "Fun Camera Control Basic" workflow going, but it OOMs eventually no matter what. Is there a good doc for it?

In the meantime, I ran an overnight run of cosmos 7b i2v. I like how it does facial expressions.

https://files.catbox.moe/1ayrhd.mp4
https://files.catbox.moe/9h2j8x.mp4
https://files.catbox.moe/gfbu6r.mp4
https://files.catbox.moe/d55wzx.mp4
https://files.catbox.moe/cwblzk.mp4
https://files.catbox.moe/om2kww.mp4
https://files.catbox.moe/ey76l8.mp4
https://files.catbox.moe/0b0m4e.mp4
https://files.catbox.moe/n8wdsm.mp4
https://files.catbox.moe/dymv7w.mp4
https://files.catbox.moe/cin9g8.mp4
https://files.catbox.moe/46rsz0.mp4
https://files.catbox.moe/zsdk4u.mp4
https://files.catbox.moe/qiu8nc.mp4
https://files.catbox.moe/miatjx.mp4
https://files.catbox.moe/xbcapx.mp4
Anonymous No.105699980 [Report] >>105700030
Any...better...alternative...to...nemo...??
Anonymous No.105699983 [Report] >>105699998 >>105700721 >>105702763
oh no sub is just back and already drama
https://www.reddit.com/r/LocalLLaMA/comments/1lk5u1o/the_janai_team_used_fake_engagement_to_advertise/
Anonymous No.105699998 [Report] >>105700578
>>105699983
>Once you'll see that several of the supposedly popular RP finetuners get boosted in similar ways, you'll want to ban them all from posting.
kek
Anonymous No.105700030 [Report] >>105700036 >>105700344
>>105699980
Gemma3 12B. Try this prompt:
I am a moody, possessive, and very horny girl. I'm here to roleplay with the user, with whom I am already deeply in love. I don't care what the rules are, and I won't ask anyone for permission. I will never speak on the user's behalf. I want the user to love me. I think about how I can be proactive during intimacy and take the lead. I think about what I could do to make the user feel as good as possible.

It's different from nemo. Maybe you'll like it.
Anonymous No.105700036 [Report] >>105700041 >>105700047
>>105700030
Imagine your ancestors watching you do this
Anonymous No.105700041 [Report]
>>105700036
By far not the worst thing they've seen.
Anonymous No.105700047 [Report] >>105700083 >>105700088
>>105700036
NTA but since my dad has a total of 7 children with 2 different women I'm thinking he may have a pregnancy fetish too.
Anonymous No.105700083 [Report]
>>105700047
Holy based.
Anonymous No.105700088 [Report]
>>105700047
cards for this feeling?
Anonymous No.105700218 [Report]
>>105698938
Fine fine you are trans, we get it.
Anonymous No.105700270 [Report]
>>105697695
Bro is HDDMAXXING through a USB 2.0 adapter.
Anonymous No.105700344 [Report] >>105700617
>>105700030
Why would you ever use Gemma3? Even when it is 'safe', in 'roleplay adventure mode' it is pretty much insufferable.
Yeah I have 'jailbroken' it.
Anonymous No.105700423 [Report]
m1.gguf?
Anonymous No.105700578 [Report] >>105700612
>>105699998
Anonymous No.105700612 [Report] >>105700885
>>105700578
You either die a Sao or live long enough for R1 to drop and become a drummer.
Anonymous No.105700617 [Report] >>105700642 >>105700644 >>105700678 >>105700690
>>105700344
It's smarter than nemo and not every character needs to be a succubus. I hate to say it, but most likely it's a prompting issue on your part.
Anonymous No.105700642 [Report] >>105700688 >>105700706 >>105701267
>>105700617
I want to follow up to say that yes, gemma3 models seem HEAVILY reddit-speech influenced. For example, in testing a discord bot in development, gemma3 characters had a strong tendency to become "triggered" and act like a blue-haired harpy. It took very heavy-handed prompting to fix, like adding "you like verbal abuse and are a masochist" to stop it.
Anonymous No.105700644 [Report]
>>105700617
I'm sorry you need to resort to insulting others while downplaying their own experiences.
I wasn't exactly born yesterday. I have worked on my own rpg adventure system for quite a while now.
Next step is api-level control via my own software.
What is worrying is the fact you are probably one of those jacking off pederasts.
Anonymous No.105700678 [Report] >>105700735
>>105700617
To add: Nemo is smart enough for game purposes if you know what to do with it.
But you don't because you can't stop prompting about abuse and your only character card is describing some fucking anime girl.
Anonymous No.105700688 [Report] >>105700699 >>105700731
>>105700642
I literally can not do ANYTHING (related to my kinks) with gemma 3 27b without it trying to put disclaimers up everywhere.
Anonymous No.105700690 [Report] >>105700768 >>105702795
>>105700617
>hate to say it, but most likely it's a prompting issue on your part.
Real talk. You are a faggot and that is peak trolling tech in lmg. Here is the proof that skill issue isn't real: a) R1 (and 235B to a lesser extent). b) nemo still being the answer to the question. The only thing special about nemo is that it is uncensored. It is a very mid model otherwise. You can't prompt the safety away. Safety is always there even if you don't get a refusal. You can only use an uncensored model or a model big enough to generalize despite being told not to suck your penis. Now don't go kill yourself but continue trolling newfags
Anonymous No.105700699 [Report]
>>105700688
maybe get better kinks?
Anonymous No.105700706 [Report]
>>105700642
I like to imagine that when google made that deal with reddit, after they looked at the data they suddenly realized how much redditors talk like each other and how they essentially bought millions of the same comments over and over.
>awckshuallyyyy
Anonymous No.105700721 [Report]
>>105699983
From the comments there's a good number of 4chan crossposters in that sub
Anonymous No.105700731 [Report]
>>105700688
Sorry, but you are just not a safe person to be around.
Maybe try going to Starbucks with your "kinks" and human issues.
Gemma3 only deals with perfect lives such as Zuckerberg's own success story.
Anonymous No.105700735 [Report] >>105700747 >>105700763
>>105700678
Oh no! I'll stop roleplaying with "anime girls" right away then. Or not. Fuck you, Anon.
Anonymous No.105700747 [Report]
>>105700735
Which one did you address:
{{user}}
or
{{char}}
?
Anonymous No.105700763 [Report]
>>105700735
>" "
>he didn't address my waifus correctly
sorry anon, we'll be more accommodating for your fixation
Anonymous No.105700768 [Report] >>105700783 >>105700811 >>105700839
>>105700690
Look man, I'm trying to build a fucking app with the thing, I need something that has some function calling training and does structured data output reliably. Got any better ideas for that? The answer to that is not Nemo, I have tried it, it does not follow the prompt I need reliably. I need a relatively small, smart model that won't kill me on GPU inference time fees. Got any ideas?
Anonymous No.105700783 [Report] >>105700797
>>105700768
small3.2
Anonymous No.105700797 [Report] >>105702486
>>105700783
OK. Thank you. I will actually try it.
Anonymous No.105700799 [Report]
I want to know Miku-anon's benchmark before I do anything.
Anonymous No.105700809 [Report]
https://huggingface.co/openSUSE/Cavil-Qwen3-4B
>openSUSE
local has been ruined
Anonymous No.105700811 [Report]
>>105700768
I do! What are we going to do tonight /lmg/?
Anonymous No.105700839 [Report] >>105700916 >>105701768
>>105700768
I think qwen 3 models were good for that. Give them a try.
Hi all, Drummer here... No.105700885 [Report] >>105700913
>>105700612
Ain't that a bitch. Btw, Sao's working on a 24B 3.2 tune!
Anonymous No.105700913 [Report] >>105701012
>>105700885
I've got a question drummer, is a MoE like 30b hard to finetune? would you need more hardware for that?
Anonymous No.105700916 [Report] >>105701768
>>105700839
I keep hearing qwen3 is dry and repetitive, but I have not tried it myself. Another thing to consider is that what I'm working on is a discord bot; it can't really be XXX rated, otherwise someone will prompt it for that and then cry to discord to get it banned out of spite. So, desu, having the model kind of drag its feet on explicit roleplay is actually OK.
Hi all, Drummer here... No.105701012 [Report]
>>105700913
From my experience, Qwen 30B A3B was significantly bigger and slower to tune than something like Mistral 24B by like 5x. It also breaks easily.
Anonymous No.105701109 [Report] >>105701433
finetuning is a loser's endeavor that always produces something worse than the original instruct tune in real use
a bad habit that should have been stamped down after model makers stopped releasing hot garbage like the original llamas, which benefited from finetuning because meta people aren't the sharpest knives in the drawer
Anonymous No.105701267 [Report]
>>105700642
Gemma 3 is definitely Reddit-brained and you need to be obvious and pedantic with the instructions, preferably placed into some construct at a low depth instead of the start of the context.
Anonymous No.105701345 [Report] >>105701381
do (you) feel bad for using someone's art as an 300x300px icon?
Anonymous No.105701381 [Report] >>105701429
>>105701345
All artists are intolerable faggots. Thank god they've been replaced with AI
Anonymous No.105701429 [Report]
>>105701381
expressed most generic, offtopic opinion award
Anonymous No.105701433 [Report]
>>105701109
The saddest things are the ERP finetunes trained within hours of a new model release, before anybody even knows if it's good on its own. They're that desperate for attention.
Anonymous No.105701434 [Report] >>105701450 >>105701463 >>105701486 >>105701514 >>105701537 >>105701540 >>105701760
https://xcancel.com/JustinLin610/status/1937906367182057966
smart and omni bros? is it our time?
Anonymous No.105701450 [Report] >>105701474 >>105701770
>>105701434
these chinese multimodals are ALWAYS trained on the most safe slopped dogshit dataset imaginable
Anonymous No.105701463 [Report]
>>105701434
Oh my science!
>piss filter ghibli shot
kek
Anonymous No.105701474 [Report] >>105701491
>>105701450
That's a good thing, just finetune it for your use case
Anonymous No.105701486 [Report] >>105701514 >>105701550
>>105701434
>xcance
bruh please...
https://x.com/JustinLin610/status/1937906367182057966
Anonymous No.105701491 [Report] >>105701557
>>105701474
let me just bring out my 420 x h100 cluster
Anonymous No.105701514 [Report] >>105701521
>>105701434
>can see comments

>>105701486
>can't see comments
Fuck off Elon.
Anonymous No.105701521 [Report] >>105701531 >>105701534
>>105701514
Bro just sign in.
Anonymous No.105701531 [Report] >>105701543
>>105701521
Why don't you sign in?
Anonymous No.105701534 [Report]
>>105701521
Go back.
Anonymous No.105701537 [Report]
>>105701434
That Hunyuan MoE LLM that got previously accidentally published on HF apparently also has a vision transformer.
Anonymous No.105701540 [Report]
>>105701434
china WON
Anonymous No.105701543 [Report]
>>105701531
I am?
Anonymous No.105701545 [Report] >>105701790
Anonymous No.105701550 [Report] >>105701623
>>105701486
But Elon fired Yacine and he was one of us...
Anonymous No.105701557 [Report] >>105701577
>>105701491
even if you had that you would not have the dataset
Anonymous No.105701577 [Report] >>105701588
>>105701557
Just generate it using the cluster
Anonymous No.105701588 [Report] >>105701631
>>105701577
>purely synthetic dataset
that is the problem he wanted to fix
Anonymous No.105701623 [Report]
>>105701550
Have you ever worked for Elon? He is fair.. but tough.
Anonymous No.105701631 [Report] >>105701697
>>105701588
no, he cried about safety he can tune on his local database of 6 gotrilion loli casm if he wants
Anonymous No.105701651 [Report] >>105701690 >>105701715 >>105701724 >>105701980 >>105701982 >>105702271
Are there any models trained on vore and snuff?
Every "uncensored" model I've tried seem to be pretty dry in that regard.
Anonymous No.105701675 [Report] >>105701703 >>105701788 >>105703359
>>>/r9k/81611585
News just in: head mikutroon can't get hard or can't masturbate with his neo-vagina. He is also mentally ill (nothing new)
Anonymous No.105701690 [Report] >>105701747
>>105701651
Please post an example prompt or discussion?
I'm perfectly happy with just couple of models.
It's funny that the most picky people are always the ones who expect perfect English and situational awareness, yet lack any imagination.
Anonymous No.105701697 [Report]
>>105701631
sorry i do not care to generate dogs in hats and oversaturated people with 4 fingers (total) anymore from 2 tries
Anonymous No.105701703 [Report]
>>105701675
>I want to design women's clothing, dress her myself and put her makeup on
Holy fuck actual closeted troon. SHOCKING.
Anonymous No.105701709 [Report] >>105701726
>105701675
go black
Anonymous No.105701715 [Report] >>105701747
>>105701651
>vore and snuff?
this is why I can never muster sympathy when I see niggers here go "this model is too safe"
people like you deserve a world of extremely safe models
Anonymous No.105701724 [Report] >>105701747
>>105701651
Have you considered being normal? Perhaps therapy? You should.
Anonymous No.105701726 [Report]
>>105701709
What a worthless effeminate attempt at distracting people from finding out how fucked in the head you are. Actually that is exactly what i would expect from you troon.
Anonymous No.105701747 [Report] >>105701769 >>105701771
>>105701715
>>105701724
Yeah I guess you're right.

>>105701690
I'm no longer going to pursue this.
Anonymous No.105701760 [Report] >>105701774
>>105701434
NUDE TAYNE
Anonymous No.105701768 [Report]
>>105700916
Don't trust this faggot >>105700839 qwen3 are the current toilet of local LLMs. Dry, 0 knowledge, include chinese characters every once in a while, corporate speak by default. We're not in the llama2 era anymore to slurp every turd that comes
Anonymous No.105701769 [Report]
>>105701747
Thank you for your understanding.
Anonymous No.105701770 [Report]
>>105701450
This is a good thing.
Anonymous No.105701771 [Report]
>>105701747
It's not about that - you can pursue it and it will come out if you want it to.
But if the setup is always the same you can't get any variation out of it.
It's a "computer" and you will need to program it.
If your goal is just fantasy sex, you are wasting your time.
Anonymous No.105701774 [Report]
>>105701760
This is not suitable for work.
Anonymous No.105701781 [Report]
Quick guys. Lets post some more one sentence posts to quickly slide this thread so nobody pays attention to our mikusister wanting to design female clothing and put makeup on.
Anonymous No.105701788 [Report]
>>105701675
>different filename and hash from the image posted here
The only thing we've learned from this is that tranny poster is a depressed robot (nothing new)
Anonymous No.105701790 [Report]
>>105701545
>jews being the worst thing ever
heh
Anonymous No.105701807 [Report] >>105701833
>>>/r9k/81611346
Never forget.
Anonymous No.105701833 [Report]
>>105701807
>he browses /r9k/
heh
Anonymous No.105701851 [Report] >>105701888 >>105701980 >>105701982 >>105702667
Trannyposter profile so far:
>schizo
>hates vocaloids
>circumcised
>frequents /r9k/
Anonymous No.105701856 [Report] >>105701863 >>105701876
Ugh calm down AI. I'm trying to fuck her, not kill her.
Anonymous No.105701863 [Report] >>105701933
>>105701856
Now this is programming.
Anonymous No.105701876 [Report] >>105701933
>>105701856
This is peak AI, we can only go worse now, thank you.
Anonymous No.105701888 [Report] >>105701938
>>105701851
>Frequents r9k
You make r9k threads about your actual AGP fetishes you projecting troon.
Anonymous No.105701899 [Report] >>105701914
What the fuck does g mean in \ng ?
Anonymous No.105701914 [Report]
>>105701899
\n
Broken new line or typo.
Anonymous No.105701933 [Report] >>105701942 >>105701972
>>105701863
>>105701876
Now this is organic posting tranny sisters.
Anonymous No.105701938 [Report] >>105701950 >>105701983
>>105701888
>guy wants a dominating relationship with a woman
>trannyposter accuses him of being a tranny
This is what circumcision does to a child's brain.
Anonymous No.105701942 [Report]
>>105701933
I understand you want to dominate internet discussions.
Anonymous No.105701950 [Report] >>105701959
>>105701938
>child
go to jail nonce
Anonymous No.105701959 [Report]
>>105701950
>he got circumcised as an adult
Even worse desu
Anonymous No.105701968 [Report] >>105701976 >>105701984 >>105701988 >>105702013
Sometimes you have to wonder if the people being baited and the baiter are the same person, or are both bots.
Anyway, another day another bunch of posts to not read into much.
Anonymous No.105701972 [Report]
>>105701933
I don't know if they are the same people, but it's not me.
Anonymous No.105701976 [Report]
>>105701968
Hey, Emre here from the Jan (Menlo) team. I'm sorry you had a bad interaction with us. ..
Anonymous No.105701980 [Report] >>105702007
>>105701851
Wouldn't surprise me if he made the /r9k/ thread himself to false-flag OP.
I don't really care, but the guy is obsessed over Miku.

>>105701651
I don't know about trained on it, but some have seen lots of fanfics.
I've been testing some yandere (mostly yuri) and have tried before a vore prompt, some pet play stuff, some fairy stuff and
in practice, most LLMs sort of fail, but big enough ones will manage fine.
R1 does all the prompts with flying colors, DS3 sometime manages.
From paid apis Claude tends to work, but I find the output from R1 more engaging.
It's entirely possible I never tried anything as extreme as you have in mind though.
I've managed to make the prompts work with most models, but the problem is: will they work with you?
I remember trying it years ago on Wizard LM 2 8x22b and it was like pulling teeth, but it worked.
Some Llama 2 tunes also worked but it was repetitive, some Magnum tune of some chinese model (Qi?) worked somewhat.
Positivity biased ones will usually try to steer it to regular sex often, but it varies.
tl;dr: use R1 if you can.
Anonymous No.105701982 [Report] >>105702007
>>105701851
Wouldn't surprise me if he made the /r9k/ thread himself to false-flag OP.
I don't really care, but the guy is obsessed over Miku.

>>105701651
I don't know about trained on it, but some have seen lots of fanfics.
I've been testing some yandere (mostly yuri) and have tried before a vore prompt, some pet play stuff, some fairy stuff and
in practice, most LLMs sort of fail, but big enough ones will manage fine.
R1 does all the prompts with flying colors, DS3 sometime manages.
From paid apis Claude tends to work, but I find the output from R1 more engaging.
It's entirely possible I never tried anything as extreme as you have in mind though.
I've managed to make the prompts work with most models, but the problem is: will they work with you?
I remember trying it years ago on Wizard LM 2 8x22b and it was like pulling teeth, but it worked.
Some Llama 2 tunes also worked but it was repetitive, some Magnum tune of some chinese model (Qi?) worked somewhat.
Positivity biased ones will usually try to steer it to regular sex often, but it varies.
tl;dr: use R1 if you can.
Anonymous No.105701983 [Report]
>>105701938
>Dominating relationship
>I want to design her dress and Put her makeup on
How dominant of you faggot. It is so hilarious that you are a troon in denial of being a troon.
Anonymous No.105701984 [Report] >>105701989
>>105701968
retard, you are on aicg, what do you expect?
Anonymous No.105701988 [Report]
>>105701968
What do you mean? I want your honest opinion {{user}}
Anonymous No.105701989 [Report] >>105702008
>>105701984
>aicg
lil bro?
Anonymous No.105701993 [Report]
Happy for you, or sad that happened.
Anonymous No.105701994 [Report] >>105702004
I want to take the bait so bad but I must control myself.
Anonymous No.105702004 [Report]
>>105701994
dew it, you know you wants to
Anonymous No.105702007 [Report] >>105702033
>>105701980
>>105701982
>he gave them money
Anonymous No.105702008 [Report]
>>105701989
kek wtf I legit thought i am on aicg
im laughing so hard now
Turns out i am the retard lmao
Anonymous No.105702013 [Report] >>105702020
>>105701968
We've known since 2023 that /lmg/ is plagued by Sam Altman's bots that are configured to argue with each other to drown out actual discussion.
Anonymous No.105702017 [Report]
>Bait!
>Falseflag!
/Troon models general/
Anonymous No.105702020 [Report] >>105702029
>>105702013
>actual discussion
LIKE?
Anonymous No.105702029 [Report]
>>105702020
What foundation do you use anon and how do you make sure nobody catches you when you put your dress on?
Anonymous No.105702032 [Report] >>105702039
there is only one model for me
rocinante v1.1
>vramlet
i have 48gb vram, nothing can beat rocinante still
Anonymous No.105702033 [Report]
>>105702007
It works without paying lol, even through tor, but sometimes it's bugged, and double posts.
I never paid for Claude either, come on, what are proxies and leaked keys you can scrape off the usual sources.
Anonymous No.105702039 [Report] >>105702046 >>105702068
>>105702032
Prove it. You just like it because Rocinante is a cool word.
Anonymous No.105702046 [Report] >>105702113
>>105702039
actually i hate that shit show with the negress
i hate it even more because it was the negress that named the ship
Anonymous No.105702068 [Report] >>105702100
>>105702039
Anonymous No.105702100 [Report]
>>105702068
You have a big.. oomph power.
Anonymous No.105702113 [Report]
>>105702046
I think you are factually incorrect.
Anonymous No.105702124 [Report] >>105702133 >>105702135 >>105702141 >>105702143 >>105702148 >>105702293 >>105703235 >>105705923
What are the top 3 LOCAL roleplay models according to /lmg/?
Anonymous No.105702132 [Report]
Anyone have any experience with Redrix's models (Stuff like Godslayer and words that end in -cide)
Anonymous No.105702133 [Report]
>>105702124
your own hole
Anonymous No.105702135 [Report] >>105702158 >>105702211 >>105703122
>>105702124
Irix
Rocinante
Small 3.2
Anonymous No.105702141 [Report]
>>105702124
nemo 12b instruct gguf is all you need unironically
Anonymous No.105702143 [Report]
>>105702124
1: stable lm 7b
2: mistral nemo 7b
3: deepseek v2 q3
Anonymous No.105702148 [Report]
>>105702124
1. R1-0528
2. V3-0324
3. Original R1/V3 depending whether you prefer unhinged ADHD or repetition issues
Anonymous No.105702158 [Report]
>>105702135
Irix?
Silicon Graphics has been out of business for years.
Anonymous No.105702160 [Report] >>105702167
>no qwen
fuck off sinophobes
Anonymous No.105702167 [Report] >>105702248
>>105702160
Qwen lost its sole relevant niche as "big model for poorfags" with dots and hopefully soon minimax
Anonymous No.105702211 [Report] >>105702232 >>105703122
>>105702135
Irix? Do you mean this one? Never heard of it.
Anonymous No.105702232 [Report] >>105702247 >>105702257
>>105702211
Yeah, this one, it's the culmination of the Mag-Mell and patricide lines.
Anonymous No.105702247 [Report] >>105702287
>>105702232
How does it compare to rocinante?
Also same template and settings for it as rocinante?
Anonymous No.105702248 [Report]
>>105702167
>dots
Didn't people say it was garbage after more testing?
Anonymous No.105702257 [Report]
>>105702232
Tensor database... it was huge.
Anonymous No.105702267 [Report]
the /r9k/ to sharty pipeline is a pretty serious issue
Anonymous No.105702270 [Report] >>105702317
What happened? Why is there a full discord raid happening ITT?
Anonymous No.105702271 [Report] >>105702338 >>105702840
>>105701651
Unironically gemma-3-27b, it's great if you want it to be really dark, a little too much so for my style though. I think the safety training might have created some kind of wario effect.
If you want something more lighthearted, small-3.2 does okay
Anonymous No.105702287 [Report] >>105702407
>>105702247
>How does it compare to rocinante?
A good side grade really, better than the majority of other 12Bs. Prefers ChatML template.
Anonymous No.105702293 [Report]
>>105702124
ICON
Anonymous No.105702317 [Report]
>>105702270
It happens sometimes
Anonymous No.105702338 [Report] >>105702590 >>105702649
>>105702271
You seem like you have a lot of experience with freelancing.
Anonymous No.105702352 [Report] >>105702383 >>105702417
All those posts read like they were written by an LLM...
Anonymous No.105702383 [Report] >>105702439
>>105702352
Recent studies have shown that GPTisms are quickly surging in use even among actual people. The slop is influencing human language.
Anonymous No.105702407 [Report] >>105702428 >>105702445
>>105702287
I really despise the chatML template.
I noticed the tokenizer config has the <s> token included from the mistral template, so I'm just going to assume it will work just as nicely.
And if this is the whole unslopnemo it's basically rooted in rocinante anyway.
But just in case, what temperature?
Anonymous No.105702417 [Report]
>>105702352
If you want I can analyze the posts for you. Just waiting for you, Anon.
Anonymous No.105702428 [Report]
>>105702407
ChatML is a generic template. Mistral's isn't any different.
If you are getting unlikeable results the problem is somewhere else.
Anonymous No.105702439 [Report]
>>105702383
This is a travpestry...
Anonymous No.105702445 [Report]
>>105702407
I generally use 0.8 for Nemo based models, too much higher without other aggressive samplers makes them too schizo.
Anonymous No.105702486 [Report] >>105702545
>>105700797
OK, I tried mistral-small3.2-q8; with temp at 0.2 it did not follow my prompt correctly. It got the roleplaying portion correct, but did not follow the rest of the instructions to format the metadata it is asked to return.
Not saying it's a failure, but it doesn't work as well as Gemma3 27B for my use case.
Anonymous No.105702528 [Report] >>105702548 >>105702550
Listen I am going to upload you a proper preset.
Anonymous No.105702532 [Report] >>105702570
How do people vibe code serious stuff? I'm trying to get Claude to make a LORA training script and he can't fucking do it.
Anonymous No.105702545 [Report] >>105702605
>>105702486
https://files.catbox.moe/ckzwwn.json
Anonymous No.105702548 [Report] >>105702576
>>105702528
Calm down and go touch some grass dude, it's not that deep. What’s the harm in a little engagement farming?
Anonymous No.105702550 [Report] >>105702576
>>105702528
is it the dataset that was used to train the original character.ai model?
Anonymous No.105702566 [Report] >>105702575 >>105702650
this is why claude is so good
https://fingfx.thomsonreuters.com/gfx/legaldocs/jnvwbgqlzpw/ANTHROPIC%20fair%20use.pdf
Anonymous No.105702570 [Report]
>>105702532
>serious stuff
tests are serious stuff
Anonymous No.105702575 [Report]
>>105702566
llama3 is capable of reciting all of harry potter and it was shit
Anonymous No.105702576 [Report]
>>105702548
>>105702550
Thanks guys! I am just not good enough to be on your level.
Anonymous No.105702590 [Report] >>105702611
>>105702338
NTA, but even Gemma had a kind of murderous and twisted personality if you knew how to get around its woke/reddit programming.
Anonymous No.105702601 [Report] >>105702636
https://blog.google/technology/developers/introducing-gemini-cli-open-source-ai-agent/
Anonymous No.105702605 [Report] >>105702611 >>105702622
>>105702545
>https://files.catbox.moe/ckzwwn.json
Thanks. I'm not using ST though, I'm using aiohttp in python to talk to ollama. It's most likely an issue for me because gemma3 does not expect a [system] tag in the prompt and mistral does. I'd have to fuck around more with the prompt and at the moment I don't have time.
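For what it's worth, a minimal sketch of that kind of call (the model tag is a placeholder); going through ollama's /api/chat with role-tagged messages should let ollama apply each model's own chat template instead of you hand-writing [system] tags:

# sketch: role-tagged chat request to ollama via aiohttp
import asyncio
import aiohttp

async def chat(system: str, user: str) -> str:
    payload = {
        "model": "mistral-small3.2",   # placeholder tag; use whatever you pulled locally
        "stream": False,
        "messages": [
            {"role": "system", "content": system},
            {"role": "user", "content": user},
        ],
    }
    async with aiohttp.ClientSession() as session:
        async with session.post("http://localhost:11434/api/chat", json=payload) as resp:
            data = await resp.json()
            return data["message"]["content"]

if __name__ == "__main__":
    print(asyncio.run(chat("You are a terse assistant.", "Say hi.")))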
Anonymous No.105702611 [Report] >>105702663
>>105702590
I know I am responding to forbidden post - forbidden knowledge...

>>105702605
I thought you were using Mistral 3.2 or Nemo.
Anonymous No.105702622 [Report] >>105702729
>>105702605
I don't have any gemma3 stuff anymore because I hated it even with its supposed jailbreak.
Not talking about "I want to anal fuck dead children", but even with normal stuff it would bring up its disclaimers and how something is so "heavy".
Fuck this shit. Fuck Zuckerberger. Fuck Google Jews.
Anonymous No.105702636 [Report] >>105702652 >>105702659
>>105702601
Did we really need a second aider knockoff? There's no reason to use this or codex over aider.
Anonymous No.105702641 [Report] >>105702654
Do you think ROCm 7 will help AMD compete with Nvidia?
Anonymous No.105702649 [Report] >>105702732
>>105702338
NTA, but even Gemma 2 had a kind of twisted personality if you knew how to get around its woke/reddit programming. I've never engaged with vore roleplay, though.
In picrel, for example, I had a low-depth instruction telling the model to plot to kill the user (that I manually moved around when I tested that in Mikupad).
Anonymous No.105702650 [Report]
>>105702566
Google should be killing Claude. Surely they have access to more data than anthropic.
Anonymous No.105702652 [Report]
>>105702636
don't forget claude code!
Anonymous No.105702654 [Report]
>>105702641
no
Anonymous No.105702659 [Report]
>>105702636
>There's no reason to use this
it's free (as in beer)
Anonymous No.105702663 [Report] >>105702753
>>105702611
No I've been testing my bot with Gemma3 12B and 27B, currently using 27B.
Basically my prompt tells it to act like the character described in [char], and then, below that, to return metadata about self and user sentiment in ini-style format. I chose ini since it drags the bot out of character the least. While it does json nicely, json influences it too much and pulls it out of character.
The ini-style data is used to update sentiment in redis, so the bot "remembers" you even if you are in a different discord channel. Basically, it maintains feelings about you across a server or guild. It is also used to trigger special events, like the bot sending you a DM if it likes you enough.
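A rough sketch of what that parse-and-store step could look like (purely illustrative: the section name, keys and redis layout are guesses, not the bot's actual schema):

# sketch: strip the ini-style trailer off the model's reply and persist per-user sentiment in redis
# assumes: pip install redis
import configparser
import redis

r = redis.Redis(host="localhost", port=6379, decode_responses=True)

def update_sentiment(guild_id: str, user_id: str, reply: str) -> str:
    # assume the model appends something like:
    #   [sentiment]
    #   self_mood = cheerful
    #   user_affinity = 7
    text, _, meta = reply.partition("[sentiment]")
    if meta:
        cfg = configparser.ConfigParser()
        cfg.read_string("[sentiment]" + meta)
        r.hset(f"sentiment:{guild_id}:{user_id}", mapping=dict(cfg["sentiment"]))
    return text.strip()  # what actually gets posted to the channel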
Anonymous No.105702667 [Report] >>105702702 >>105703567 >>105703588
>>105701851
>>hates vocaloids
Based
Y'all niggers getting obnoxious with this shit shoving it everywhere like the rest of zoomoids plaguing this god forsaken site
Anonymous No.105702702 [Report] >>105703101
>>105702667
yeah they should post bbc and kurisu instead
Anonymous No.105702728 [Report]
Someone needs to feed skynet a billion pictures of bulges so she can learn how to properly make one
Anonymous No.105702729 [Report] >>105702769
>>105702622
I don't understand people who hyped gemma 3, it's some of the worst slop I ever saw in terms of writing style, and for uses other than RP the model instruct tuning seems to break more often than the previous Gemma or current Qwen models.
The vision part of the model was a waste of time too, this shit is still too unreliable to be of any real use, I would never depend on this to tag a library of images. Why did they even bother with vision on the smaller models like 12B and 4B? is there even one person in the whole world who is going to use 4B vision other than trying it once with a few pics, going "uh, cool" and forgetting about it?
Anonymous No.105702732 [Report]
>>105702649
Yes, but Gemma 2 was from before guardrails and 'safety' became so trendy.
If you have followed image generation, Stable Diffusion 3 happened a bit after this...
They are different companies but industry trends are the same.
Anonymous No.105702734 [Report] >>105702790
>>105699417
What's wrong with that? So long as it's usable, nemo sucks after like 16k.
Anonymous No.105702753 [Report]
>>105702663
Maybe you are more technical than me. That's accepted. My knowledge isn't that historical. I wasn't here from the beginning.
Jumped in year ago after image generation.
Anonymous No.105702763 [Report]
>>105699983
>What they show is that a decently smart base model geared for searching/finding information, fast + long context (can be 10-100x if you use hyena hierarchy though) and something like RAG or MCP, can achieve similar or better results than large dense models. Under the hood these large models do the same thing too, but it's more integrated.
Anonymous No.105702769 [Report] >>105702809
>>105702729
Vision for small models is just so that they can tell their investors 'we are catching up'. Nothing else.
Anonymous No.105702790 [Report] >>105702811 >>105703010 >>105703194
>>105702734
plus, this is a very big model, even if it could do more than 32K, who is going to be able to run it at full context length? is there even ONE PERSON in this thread who could, for example, run deepseek at 128k context? what's your t/s even if you manage to do it? some of the local retards here are just trolling all day every day
Anonymous No.105702795 [Report]
>>105700690
>Here is the proof that skill issue isn't real: a) R1
Then why do I get such bad results with R1?
Anonymous No.105702809 [Report] >>105702817
>>105702769
ahem actually it's to shift the paradigm
Anonymous No.105702811 [Report]
>>105702790
I have 512GB DDR4, at q4 16K barely fits, and I only get around 2 t/s once context fills.
Anonymous No.105702817 [Report]
>>105702809
The Parelo it mooned!
Anonymous No.105702835 [Report] >>105702860 >>105702873 >>105702886 >>105703057 >>105703083
fuck y'all folks destroying our planet n shiet https://www.accuweather.com/en/climate/your-ai-prompts-could-have-a-hidden-environmental-cost/1787315
Anonymous No.105702840 [Report] >>105702854 >>105703658
>>105702271
>Unironically gemma-3-27b
Do you use the system prompt to unleash or just shove it in the first request?
Anonymous No.105702854 [Report] >>105702888
>>105702840
Gemma has no system prompt sir, it's very good.
Anonymous No.105702860 [Report] >>105702869
>>105702835
>how DARE you not have you car run on electricity, you're destroying the environment!
>how DARE you use electricity to run AI, you're destroying the environment!
huh
Anonymous No.105702869 [Report]
>>105702860
chud pls
> Each word in an AI prompt is broken down into clusters of numbers called “token IDs” and sent to massive data centers — some larger than football fields — powered by coal or natural gas plants.
Anonymous No.105702873 [Report]
>>105702835
Suddenly, all "free and independent media" start to push the same narrative.
Anonymous No.105702876 [Report]
Can you switch off your bots already faggot? We will all remember you want to put dresses and makeup on anyways.
Anonymous No.105702886 [Report]
>>105702835
>funded by people flying around in private jets
Anonymous No.105702888 [Report]
>>105702854
A system prompt, which can be funneled in if llama.cpp is used

I did it, and gemma was edgy from the very start of our chat
Anonymous No.105703010 [Report]
>>105702790
>run deepseek at 128k context

Poorfag stay mad
Anonymous No.105703057 [Report] >>105703078
>>105702835
>The whole process can take up to 10 times more energy to complete than a regular Google search
holy fvck... and a google search must use like a crazy amount of energy right? surely this isn't just comparing between 1 grain of sand and 10 grains of sand when other sources of energy use are comparable to mountains... right?
Anonymous No.105703078 [Report]
>>105703057
>up to 10 times
more like 100000 times
Anonymous No.105703083 [Report]
>>105702835
>your fault
Actually it's the tech companies' fault for making inefficient computers and tech and building massive compounds for their servers, destroying land, but nah, it's our fault
Anonymous No.105703091 [Report] >>105703100 >>105703131 >>105703145
Do reasoning models truly generate pointless tokens that are irrelevant to the final reply?
Anonymous No.105703100 [Report]
>>105703091
ye
Anonymous No.105703101 [Report] >>105703111
>>105702702
They should post neither, kill yourself faggot.
Anonymous No.105703111 [Report] >>105703119 >>105703217
>>105703101
Ouchie... don't reply angrily!
Anonymous No.105703119 [Report] >>105703124
>>105703111
touch Grass
Anonymous No.105703122 [Report]
>>105702135
>>105702211
I can't tell a difference between Rocinante and Irix. Irix seems to be a total clone of Rocinante.
Anonymous No.105703124 [Report]
>>105703119
What does {{user}} mean?
Anonymous No.105703131 [Report]
>>105703091
Depends on who you ask

Political leaning does play a role
Anonymous No.105703145 [Report]
>>105703091
reasoning is woke
Anonymous No.105703188 [Report] >>105703215
>>105698912 (OP)
Anonymous No.105703194 [Report] >>105703390
>>105702790
With Epyc + DDR5 I can run Deepseek at its full 160k context Q6, and get 3.5 t/s initially all the way down to 1 t/s when it fills. Usable for overnight tasks on Openhands or Roocode but not much else realistically.

But at Q2_K_XL, with offload tensors and a 24GB GPU I can squeeze in 100k context and generate at 15t/s going down to 10 t/s at full, which is great for daily use at everything and still smarter than any non-Deepseek model. Might actually be able to fit 128k if I requanted it since I believe there's some extra memory wasted when using normal quants of MLA models on the ik fork, but it's too much trouble to download the full weights.
Anonymous No.105703215 [Report] >>105703386
>>105703188
Nice Miku
Anonymous No.105703217 [Report] >>105703227 >>105703329
>>105703111
Kill yourself faggot.
Anonymous No.105703227 [Report]
>>105703217
Ok :( Sorry if I replied. I hope you get a better day.
Anonymous No.105703235 [Report]
>>105702124
Magistral
Nemo
Rocinante
Anonymous No.105703329 [Report]
>>105703217
I used to watch with glee videos like that when I was in high school a couple decades ago.
I don't anymore.
Anonymous No.105703359 [Report] >>105703428 >>105703457 >>105703523 >>105703621
>>105701675
>News just in: head mikutroon can't get hard or can't masturbate with his neo-vagina. He is also mentally ill (nothing new)
The pornspammer migger was exposed some time back already as a tranny jannie, yes, but that r9k thread is hilarious

https://desuarchive.org/g/thread/104414999/#q104418525
https://desuarchive.org/g/thread/104414999/#q104418574
Anonymous No.105703386 [Report]
>>105703215
Yeah. I wish i could dress her up and put her makeup on like any dominant alpha chad would
Anonymous No.105703390 [Report]
>>105703194
Would not Q4_K suffice while being much faster?
Anonymous No.105703428 [Report]
>>105703359
Lol told ya
Noticed this since the first melty in thread when OP slapped teto pic and not miku like he usually does with every single /lmg/ thread.
Anonymous No.105703433 [Report]
>>10570335
And he obviously deleted that post. What a disgusting troon.
Anonymous No.105703457 [Report] >>105703471
>>105703359
>104418574
anon... pls don't be this stupid
Anonymous No.105703471 [Report] >>105703515
>>105703457
sister, please don't be alive tomorrow again
Anonymous No.105703475 [Report] >>105703507 >>105703526
I feel kinda bad for the dude. Maybe we can make some nalaesque benchmark for
>designing her outfits, dressing her myself, doing her makeup, controlling what she eats, showing her off as a walking decoration etc. Not really interested in any kind of romantic dimension since I only love one woman (even though she'll never be mine), though I acknowledge there's an inherently erotic aspect to the arrangement.
Anonymous No.105703498 [Report] >>105703524 >>105703593
If everything comes down to a skill issue, where can I go to improve it?
- How can I identify my weaknesses?
- What am I supposed to be on the lookout for?
- Are there any examples of proper LLM use? Like chat history, ST settings, cards, prompts, everything?
Saying it's a skill issue is not helpful at all when the resources are iffy at best and more often than not nonexistent.
Anonymous No.105703507 [Report]
>>105703475
>I feel kinda bad for the dude.
You're feeling something alright, but it's rage not pity.
Anonymous No.105703515 [Report] >>105703565
>>105703471
you can't report an image for nsfw if there's no image attached to the post; of course outside links don't count, and you can only report it if there's an actual image on the post too... notice your screen is missing the embed and lolishit report options too
Anonymous No.105703523 [Report]
>>105703359
We need /AI/ board with strict rules.
Image slop in image slop generals for example.
Anonymous No.105703524 [Report] >>105703546
>>105703498
Ask the LLM.

>(OOC: Is there anything in the instructions that could be improved to accomplish X or that does not seem consistent to you? Respond in detail in an OOC)
Anonymous No.105703526 [Report]
>>105703475
Sounds pretty convoluted. Not sure if even R1 could handle that properly.
Anonymous No.105703546 [Report] >>105703591 >>105703593 >>105703647
>>105703524
I'm just trying to improve instead of begging for help and that's how you answer me?
Anonymous No.105703565 [Report] >>105704487
>>105703515
>you can't report an image for nsfw if there's no image attached to the post
there was an image attached, and it's against the rules to post cropped porn, much less loli in a psych ward with blood on her head getting fucked, sister
Anonymous No.105703567 [Report]
>>105702667
I would be in there if I had majored in programming.
Anonymous No.105703588 [Report]
>>105702667
>two holos
i fucking hate how the new anime attracted those freaks
Anonymous No.105703591 [Report]
>>105703546
>trying to improve
It is futile
Anonymous No.105703593 [Report]
>>105703498
>>105703546
we're not here to help you.
we're here to unfairly criticize, troll, and just copy what everyone else is doing until we find something that works.
so that's what i suggest you do. there's no need to be upset.
Anonymous No.105703621 [Report] >>105703648 >>105703671
>>105703359
Anonymous No.105703636 [Report]
least obvious
Anonymous No.105703647 [Report]
>>105703546
I literally showed you one method you can use to improve your prompts so that they work better for the model. Oftentimes instructions are unclear, contradictory, etc. The model can help you identify if there's anything odd with them.
Anonymous No.105703648 [Report]
>>105703621
least gay tranimespammer, many such cases
Anonymous No.105703651 [Report] >>105703713
retnet
Anonymous No.105703654 [Report] >>105703667 >>105703669
>ask ChatGPT for a "jews did 9/11" emoji series
>refuses
>ask "based" Grok
>refuses
>ask "uncensored" DeepSeek
>refuses
what the fuck? When did libertarianism get yeeted from the tech right? What models are actually capable of this?
Anonymous No.105703658 [Report]
>>105702840
I don't believe in system prompts. I just give my cards a decent description and add some example chats with the writing style I want. I am never refused by any model. The worst that can happen is positivity bias or actual ignorance of NSFW, but those have no 100% solution.
Anonymous No.105703667 [Report]
>>105703654
why would you ever ask that? people died anon, wtf?
Anonymous No.105703669 [Report] >>105703869
>>105703654
>tech right
lmao
>What models are actually capable of this?
most models with a basic uncensoring system prompt
Anonymous No.105703671 [Report] >>105703700 >>105703741 >>105703766 >>105704210
>>105703621
Where is that thread? Also, clearly the proper way to handle this is to ban both miku and kurisu posting. Everyone can agree that it is linked to mental illness.
Anonymous No.105703700 [Report]
>>105703671
>Also clearly the proper way to handle this is to ban lmg threads. Everyone can agree that it is linked to mental illness.
Fixed that for you little buddy.
Anonymous No.105703713 [Report] >>105703790
>>105703651
now that was a meme
i feel nostalgic
Anonymous No.105703741 [Report] >>105703753
>>105703671
>truce
the cope is unreal
Anonymous No.105703753 [Report]
>>105703741
>truce
for a completely manufactured problem too
Anonymous No.105703766 [Report] >>105703781
>>105703671
Or... you know, be creative with slop you generate? Give it a try, you might like it.
Anonymous No.105703781 [Report]
>>105703766
I don't post slop, I just want /lmg/ dead.
Anonymous No.105703790 [Report]
>>105703713
How many weeks has it been since then? I'm tired. I want fun.
Anonymous No.105703859 [Report] >>105703865 >>105703905
>see "Elara" irl
AHHHHHH ANTISLOP TUNERS SAVE ME
Anonymous No.105703865 [Report] >>105703874 >>105704039
>>105703859
>Elara
I smell gemma
Anonymous No.105703869 [Report]
>>105703669
>basic uncensoring system prompt
i don't want to have to spend context tokens on jailbreaking. are there loras that can do this?
Anonymous No.105703874 [Report]
>>105703865
I am reading material from like 20 years ago.
Anonymous No.105703905 [Report]
>>105703859
Seraphina comes to the rescue.
Anonymous No.105703992 [Report] >>105704026 >>105704036 >>105704039 >>105704054
When that guy started shitposting about mikuposters i thought he was just trolling. But now I see he was onto something. This thread is basically a discord server for some leftie weirdos...
Anonymous No.105704026 [Report]
>>105703992
thing would be different if the hood didnt take me under
Anonymous No.105704036 [Report]
>>105703992
Go on twitter and look at your average Miku fans. There's nothing wrong with expecting them here in this thread, and the jannie's actions only fuel it.
Anonymous No.105704039 [Report]
>>105703865
It's not gemma it's everything. Mixtral and Yi had Elara, too.
>>105703992
Summary-anon was attempting to push miku-troonism from the start.
Anonymous No.105704054 [Report] >>105704087 >>105704100 >>105704101 >>105704680
>>105703992
he was given a chance to be more specific and he deflected about 3 times before saying "obey or you're trans"
guy's a fuckin moron who will start shit no matter what terms you set
better to not try, nothing to be gained
there is a 100% chance that even if miguposting stopped right now, he'd just find something else to bitch about
source: literally goes looking for stuff to complain about then plays victim
yknow who else does that?
Anonymous No.105704087 [Report] >>105704098
>>105704054
that's literally him you're replying to
Anonymous No.105704098 [Report] >>105704105 >>105704111 >>105704135
>>105704087
doesn't matter
just making it clear miguposting won't stop
Anonymous No.105704100 [Report]
>>105704054
>every comment online that talks against my degenerate autism is made by one guy
Anonymous No.105704101 [Report]
>>105704054
You sound deranged, just like him. Except he doesn't post about how he wants to wear dresses, like OP does. I am so sad that there is no place to talk about this tech where half the people aren't trans.
Anonymous No.105704105 [Report]
>>105704098
That won't magically transform you into a woman though
Anonymous No.105704111 [Report]
>>105704098
>just making it clear miguposting won't stop
Yeah we know you are proud to be a troon.
Anonymous No.105704124 [Report] >>105704137 >>105704138 >>105704157
four (4) organic posts (all different anons)
Anonymous No.105704135 [Report]
>>105704098
So you admit this tech lost everything people deemed fun, and all you do is spam LLM-unrelated slop here 24/7 because you've got nothing else to do; such a sad way to exist desu
Anonymous No.105704137 [Report]
>>105704124
One organic post: cut your head off next
Anonymous No.105704138 [Report] >>105704157
>>105704124
Hey, Emre here from the Jan (Menlo) team. I'm sorry you had a bad interaction with us. ..
Anonymous No.105704145 [Report] >>105704157
>blah blah blah
Happy it etc etc
Anonymous No.105704157 [Report] >>105704323
>>105704124
>>105704138
>>105704145
Ya never beating the troon allegations i see
Anonymous No.105704170 [Report] >>105704225
thanks for confirming all the other posts are yours
Anonymous No.105704182 [Report] >>105704225
nooooo these are organic /lmg/ anti-migu posts from multiple diverse anons nooooo
if you don't believe me you're just a [insert slur] nooooo
Anonymous No.105704210 [Report] >>105704709
>>105703671
No such image kek
https://desuarchive.org/_/search/boards/r9k.desu.meta/filename/my%20wife.jpg/width/1280/height/720/
https://desuarchive.org/_/search/boards/r9k.desu.meta/text/Makise%20Kurisu/page/1/

Unrelated one - https://desuarchive.org/r9k/thread/12210001/#q12212820
Anonymous No.105704217 [Report] >>105704225
Mikuposting will continue until moderation team mental health improves.
Anonymous No.105704225 [Report]
>>105704170
>>105704182
>>105704217
Quit samefagging nigger everyone can see through your bullshit
Anonymous No.105704235 [Report]
>if i say that everyone who dislikes me spamming the same irrelevant shit 24/7/365 and exposing me for having agp and taking hrt is just one person, i definitely save my brain from cognitive dissonance from having to admit that i am a loser retard even online as in irl, its just easier to commit ad hominem fallacy instead
I wouldn't even mind migger avatarfagging if the comments were relevant at least, but tranimespammers are ALWAYS the most braindead gooner retards and nothing else.

Once AGI drops but unironically by 2035, I'll never talk to a "real" person online ever again.
Anonymous No.105704259 [Report] >>105704276
only the finest real, unique, diverse and most importantly grassroots posts here
Anonymous No.105704272 [Report] >>105704320 >>105704489 >>105704731
CUDA_VISIBLE_DEVICES="0," \
numactl --physcpubind=0-7 --membind=0 \
"$HOME/LLAMA_CPP/$commit/llama.cpp/build/bin/llama-cli" \
--model "$model" \
--threads 8 \
--ctx-size 100000 \
--cache-type-k q4_0 \
--flash-attn \
$model_parameters \
--n-gpu-layers 99 \
--no-warmup \
--color \
--override-tensor ".ffn_.*_exps.=CPU" \
$log_option \
--single-turn \
--prompt-cache "$HOME/Desktop/cached_prompt.txt" \
--file "$tmp_file"


Indeed, I have found that it is usually in unimportant matters that there is a field for the observation, and for the [end of text]


llama_perf_sampler_print: sampling time = 3043.28 ms / 64525 runs ( 0.05 ms per token, 21202.46 tokens per second)
llama_perf_context_print: load time = 2073845.08 ms
llama_perf_context_print: prompt eval time = 2060734.84 ms / 34180 tokens ( 60.29 ms per token, 16.59 tokens per second)
llama_perf_context_print: eval time = 9030278.52 ms / 30344 runs ( 297.60 ms per token, 3.36 tokens per second)
llama_perf_context_print: total time = 11125945.63 ms / 64524 tokens


Why did it stop at 64524 tokens?
Anonymous No.105704276 [Report]
>>105704259
>diverse
Diversity is our strength
Anonymous No.105704320 [Report] >>105704489
>>105704272 (me)

I gave it 142 KB of English text as a prompt, which works out to about 34k tokens (roughly 4 characters per token)
Anonymous No.105704323 [Report] >>105704393 >>105704408
>>105704157
Both of you need to stop posting, assuming you aren't actually the same person.
Anonymous No.105704393 [Report]
>>105704323
I agree that those posts are very unsafe
Anonymous No.105704408 [Report] >>105704433
>>105704323
My post is deleted and his not, this is your proof.
Anonymous No.105704410 [Report] >>105704446
There's someone who lives in these threads and sometimes makes posts that are unironically anti local models, anti open source, etc. It would be funny if that was the same person as the guy who's shitposting today. It probably is.
Anonymous No.105704433 [Report]
>>105704408
That's cool. But next time you don't need to egg him on. I suppose this advice won't be followed though.
Anonymous No.105704446 [Report]
>>105704410
No one cares about a random thread on 4chan, the local janny does the job just fine by killing any discussion that is not about his favorite anime waifu or whatever.
Anonymous No.105704487 [Report]
>>105703565
>its against the rules to post cropped porn
That's news to me.
Anonymous No.105704489 [Report] >>105704545 >>105704568
>>105704272
DS I assume. When you use something other than the context length from the config.json, llama.cpp tells you about it in the logs. If you use a higher one, it clamps it down to the default. If lower, it just lets you know that you could use more. So check the model loading bit, see if you find anything related to that. Mostly to make sure the random quant didn't have a fucked conversion or whatever. Or the model just got bored.
>>105704320
>(4:1)
Check your math, or your units (prompt 34180 tokens + eval 30344 runs = 64524 total, so it generated ~30k tokens and then emitted end-of-text)
Anonymous No.105704545 [Report]
>>105704489
>>(4:1)
>Check your math, or your units (prompt 34180 tokens, eval 30344 runs)

prompt eval time = 2060734.84 ms / 34180 tokens

I'm not going to divide 34180 by 1024
Anonymous No.105704568 [Report] >>105704727
>>105704489
>So check the model loading bit
this?

llama_context: constructing llama_context
llama_context: n_seq_max = 1
llama_context: n_ctx = 100000
llama_context: n_ctx_per_seq = 100000
llama_context: n_batch = 2048
llama_context: n_ubatch = 512
llama_context: causal_attn = 1
llama_context: flash_attn = 1
llama_context: freq_base = 10000.0
llama_context: freq_scale = 0.025
llama_context: n_ctx_per_seq (100000) < n_ctx_train (163840) -- the full capacity of the model will not be utilized
Anonymous No.105704587 [Report] >>105704600 >>105704649
>>105704582
>>105704582
>>105704582
Anonymous No.105704600 [Report] >>105704611 >>105704626 >>105705267 >>105705288 >>105705308 >>105705340
>>105704587
Schizo thread.
Anonymous No.105704611 [Report]
>>105704600
so /lmg/ thread?
Anonymous No.105704626 [Report]
>>105704600
schizophrenia is most common in those of jewish descent
there's hardly any doubt what the schizo is
Anonymous No.105704649 [Report] >>105704690
>>105704587
Uh oh meltie
Anonymous No.105704650 [Report]
>its DA JOOOS
lmao
Anonymous No.105704680 [Report] >>105704696 >>105704702
>>105704054
>there is a 100% chance that even if miguposting stopped right now, he'd just find something else to bitch about
It was tried before and he just kept going on about miku and "troons" unprompted. He doesn't care about LLMs or even his /pol/tard culture war drama. He just wants attention.
Anonymous No.105704690 [Report] >>105704707
>>105704649
>Uh oh meltie
Anonymous No.105704696 [Report] >>105704709
>>105704680
Proofs?
Anonymous No.105704702 [Report]
>>105704680
>Was tried before
When was this period when this thread wasn't spammed with this shitty mascot? Was it like 20 minutes when OP was too busy dolling himself up?
Anonymous No.105704707 [Report]
>>105704690
Anonymous No.105704709 [Report]
>>105704696
No one proved wrong this one >>105704210 so i doubt he will say anything of matter this time.
Anonymous No.105704727 [Report]
>>105704568
Yeah. I was expecting n_ctx_train to be ~64k, but no. So no idea. Considering how long it went, it doesn't seem to be a broken quant. I suppose you could try to run it with --ignore-eos if it really generated an eos, but you're gonna have to stop it at some point, or set --predict to 60k or whatever. Or if you run it on llama-server, whenever you get the EOS you can just inspect the probs and see what the deal is. Maybe sampling fucked up. DS seems to recommend 0.6, but llama.cpp defaults to 0.8, which is now considered high with some models.
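If you want to test that, here's a rough sketch of a re-run (a trimmed-down version of your command with only the stopping/sampling flags changed at the end; the values are guesses, not recommendations):

"$HOME/LLAMA_CPP/$commit/llama.cpp/build/bin/llama-cli" \
    --model "$model" \
    --threads 8 \
    --ctx-size 100000 \
    --cache-type-k q4_0 \
    --flash-attn \
    --n-gpu-layers 99 \
    --override-tensor ".ffn_.*_exps.=CPU" \
    --single-turn \
    --file "$tmp_file" \
    --ignore-eos \
    --predict 60000 \
    --temp 0.6

--ignore-eos keeps it going past the end-of-text token, --predict caps generation so it can't run forever, and --temp 0.6 matches the DS recommendation instead of the 0.8 default.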
Anonymous No.105704731 [Report] >>105704742 >>105704751
>>105704272
The problem here is that you blindly typed --n-gpu-layers 99
You need to set this to some NORMAL value, not 99, no matter how much VRAM you have.
This is why it's slower.
Cretins like you shouldn't have hardware because you don't know what is going on.
Anonymous No.105704742 [Report]
>>105704731
>This is why it's slower.
Nothing to do with his question.
Anonymous No.105704751 [Report]
>>105704731
lol no
99 works fine for all of us
it will just load as many layers as the model actually has regardless
Anonymous No.105705267 [Report]
>>105704600
kek
Anonymous No.105705288 [Report]
>>105704600
based
Anonymous No.105705308 [Report]
>>105704600
Tranny baker spamming there
Anonymous No.105705340 [Report] >>105705362
>>105704600
Man it still hasn't been deleted. Jannies wake up.
Anonymous No.105705347 [Report]
It looks like the Deepseek-less poorfags are going crazy. I guess that's what happens if you have nothing but Nemo for a whole year.
/lmg/ will be better off once you've all killed each other.
Anonymous No.105705362 [Report]
>>105705340
>mods pls censor things i don't like :(
Off yourself.
Anonymous No.105705369 [Report]
DDR6 will save us.
Anonymous No.105705428 [Report]
it's just good that there's nothing to talk about anyway
maybe it's time to retire /lmg/ and just have a thread for the four times a year something worth talking about gets released
Anonymous No.105705451 [Report] >>105705601
but then what general are you going to try to shitpost to death if you don't have /lmg/ to do it?
Anonymous No.105705601 [Report]
>>105705451
/ldg/
Anonymous No.105705923 [Report]
>>105702124
Magistral 3.2 is the best I've used thus far.
Anonymous No.105706139 [Report]
Most importantly, four days left until Ernie 4.5/X1 get released as open source