/lmg/ - Local Models General - /g/ (#105698912) [Archived: 739 hours ago]

Anonymous
6/25/2025, 1:08:59 PM No.105698912
quoth the baka
quoth the baka
md5: be3fbc9c2a1d40e5e4643f36e466301c🔍
/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>105689385 & >>105681538

►News
>(06/21) LongWriter-Zero, RL trained ultra-long text generation: https://hf.co/THU-KEG/LongWriter-Zero-32B
>(06/20) Magenta RealTime open music generation model released: https://hf.co/google/magenta-realtime
>(06/20) Mistral-Small-3.2 released: https://hf.co/mistralai/Mistral-Small-3.2-24B-Instruct-2506
>(06/19) Kyutai streaming speech-to-text released: https://kyutai.org/next/stt
>(06/17) Hunyuan3D-2.1 released: https://hf.co/tencent/Hunyuan3D-2.1

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/tldrhowtoquant
https://rentry.org/samplers

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/leaderboard.html
Code Editing: https://aider.chat/docs/leaderboards
Context Length: https://github.com/adobe-research/NoLiMa
Censorbench: https://codeberg.org/jts2323/censorbench
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
Replies: >>105703188
Anonymous
6/25/2025, 1:10:01 PM No.105698922
file
file
md5: 2e82be5360f35bafd4d83390eeead59f🔍
►Recent Highlights from the Previous Thread: >>105689385

--Optimizing multi-GPU/RPC model execution via tensor offloading and memory tuning:
>105693780 >105693814 >105693828 >105693870 >105693890 >105694096 >105694121 >105694200 >105694168 >105693900 >105693919 >105693920 >105693933 >105693968 >105693987 >105694020 >105694045 >105694032 >105694144 >105694265 >105694431 >105694487 >105694501 >105694515 >105694568 >105694828 >105694834 >105694890 >105694997 >105695037 >105695042 >105695123 >105695182 >105695506 >105695533 >105695631 >105695668 >105695683 >105695741 >105695906 >105696219
--Gemma model size suggestions and distillation technique debates in response to Google's feedback request:
>105690177 >105690247 >105692953 >105693027 >105690529 >105690541 >105690614 >105690642 >105690399 >105691354 >105690248
--Quant testing with IK-llama shows promise but faces CUDA and performance challenges:
>105692033 >105692197 >105694719 >105694758 >105694804 >105696513 >105696562 >105694000 >105694047 >105694101 >105694179
--Court rules AI training on books legal, but storage of pirated copies infringes copyright:
>105691671 >105691690 >105691810 >105691825 >105691865 >105692000
--Google's Gemini Robotics VLA released with limited access and mixed robotics capability expectations:
>105691639 >105691715 >105691734 >105691721 >105692142
--Unexpected GPU performance discrepancies in token generation benchmarking:
>105696010 >105696048 >105696333 >105697419
--Agentic framework limitations and needs for local large language models:
>105692962 >105692985 >105693006 >105693025 >105693060 >105693849
--llama.cpp gains high-throughput mode for improved performance:
>105692045 >105694048 >105694098
--Skepticism around chatllm.cpp and llama.cpp for accurate model inference:
>105691150 >105691439 >105691580
--Miku (free space):
>105696371 >105696546

►Recent Highlight Posts from the Previous Thread: >>105689390

Why?: 9 reply limit >>102478518
Fix: https://rentry.org/lmg-recap-script
Anonymous
6/25/2025, 1:12:13 PM No.105698938
lesbian girls is /lmg/ culture
Replies: >>105700218
Anonymous
6/25/2025, 1:12:58 PM No.105698940
how difficult is training a lora and do you need really high vram requirements?
What about a lora vs a fine tune?
I want to try add some question and answer style text blocks to mistral large quants to both increase knowledge and reinforce the answering style, I have 48gb vram
Replies: >>105698956 >>105699010
Anonymous
6/25/2025, 1:15:22 PM No.105698956
>>105698940
What front end are you using?
You don't necessarily need anything special...
Replies: >>105698974
Anonymous
6/25/2025, 1:19:11 PM No.105698974
>>105698956
oogabooga, it's the vram I am most concerned about before I spent ages getting a load of data formatted ready to train
Replies: >>105699018
Anonymous
6/25/2025, 1:24:28 PM No.105699010
>>105698940
>lora ... to ... increase knowledge
Abandon hope. You're also not going to be able to finetune (whether with LoRA or full finetuning) Mistral Large (123B parameters) with 48GB of VRAM.
Replies: >>105699040
Anonymous
6/25/2025, 1:25:04 PM No.105699018
>>105698974
You don't need anything special in order to change the model's output. I don't know about oobabooga but in SillyTavern you can slot in " Examples of dialogue" (which are tokenized in certain way)
><START>
>{{user}}: simple question
>{{char}}: simple answer
><START>
>new example...
It's just convenient in ST as it has its own slot and is hidden from the user, but you could just add this into your prompt.
So include an example of conversation and repeat that for couple of times.
Replies: >>105699028
Anonymous
6/25/2025, 1:27:19 PM No.105699028
>>105699018
Or this could be inserted into your system prompt.
Whatever as long as it gets submitted to the model itself in understandable format.
Anonymous
6/25/2025, 1:28:49 PM No.105699040
>>105699010
can you train a lora on the quantized version though, if that runs on 45gb can you also train a lora on 45gb.
I don't have extremely high hopes on "teaching" it much but I would be interested if it is an improvement at all
Replies: >>105699078
Anonymous
6/25/2025, 1:34:18 PM No.105699078
>>105699040
Right now it's only possible to finetune models in 4-bit at the minimum with QLoRA, so you'd need 60+ GB for the model weights alone.
Replies: >>105699159
Anonymous
6/25/2025, 1:50:38 PM No.105699159
>>105699078
ah lame, I guess the new mistral small might be a better candidate then?
Replies: >>105699171 >>105699223
Anonymous
6/25/2025, 1:52:41 PM No.105699171
>>105699159
Just tweak your goddamn prompt. I swear to god people like you don't even try.
Replies: >>105699178
Anonymous
6/25/2025, 1:54:08 PM No.105699178
>>105699171
I know how to tweak prompts retard I want to investigate lora training as a comparison for the sake of learning
Replies: >>105699210 >>105699223
Anonymous
6/25/2025, 2:00:19 PM No.105699210
>>105699178
Okay sorry :3 I think you are full of shit.
Maybe test with a small model first to get your bearings.
Why would you even want to begin with 123B model in the first place?
With 14B you could get fast results and see what you are actually doing.
Anonymous
6/25/2025, 2:02:24 PM No.105699223
>>105699178
If it only was that easy. You'll probably need dozens of data exposures with different wording at a large enough rank to make the model truly internalize the knowledge and not just parrot it if it sees the same question(s).
Even a rank 1 QLoRA is enough to make the model memorize verbatim limited amounts of information, but that doesn't imply at all that it will be able to organically use it without hallucinating details or simply making stuff up completely.
Mistral Small >>105699159 would be a better choice for these experiments, even better a smaller model so that finetuning attempts will take less time.
Anonymous
6/25/2025, 2:02:47 PM No.105699229
>>105698699 #
>>105698742 #
I'm on Linux. And the only thing I changed was -server instead of -cli.

I noticed than the CPU core were running at approx 80% (I isolated 8 cores for the purpose) in case of the server, while they've been at 100% in case of CLI.

GPU load is the same in both cases.

I see no reason why there should be a difference
Replies: >>105699260 >>105699273
Anonymous
6/25/2025, 2:07:20 PM No.105699260
>>105699229
I don't know, maybe this has something to do with your kernel. Or the way llama.cpp has been compiled.
Way over my pay grade (which wasn't too much to begin with in the first place).
Anonymous
6/25/2025, 2:10:26 PM No.105699273
>>105699229
If I was you I would compile a new kernel and go through the settings and double check things.
Then have a backup ready if something goes wrong.
I haven't bothered with linux in years though, last time I compiled a kernel it had a text UI.
Replies: >>105699479
Anonymous
6/25/2025, 2:27:02 PM No.105699378
hmmm..
https://huggingface.co/tencent/Hunyuan-A13B-Instruct-FP8
Replies: >>105699408 >>105699417 >>105699419 >>105699449 >>105699566 >>105699596 >>105699793
Anonymous
6/25/2025, 2:29:54 PM No.105699408
file
file
md5: 811b9c8633188e58f5f236cb02a18dec🔍
>>105699378
?
Replies: >>105699416 >>105699422 >>105699455
Anonymous
6/25/2025, 2:30:35 PM No.105699416
>>105699408
Ask your mother
Anonymous
6/25/2025, 2:30:37 PM No.105699417
>>105699378
>32768 ctx
Replies: >>105702734
Anonymous
6/25/2025, 2:30:42 PM No.105699419
Untitled
Untitled
md5: 54aa2b25286c6d03913daf3acbcbcfa6🔍
>>105699378
>https://huggingface.co/tencent/Hunyuan-A13B-Instruct-FP8
Replies: >>105699442 >>105699455
Anonymous
6/25/2025, 2:30:58 PM No.105699422
>>105699408
strange, it's gone now but it was:
80.4B params total
13B active
Replies: >>105699537 >>105699945
Anonymous
6/25/2025, 2:33:38 PM No.105699442
>>105699419
81B parameters otal, 13B active, has shared experts.
Replies: >>105699455
Anonymous
6/25/2025, 2:34:15 PM No.105699449
1732412254938459
1732412254938459
md5: b76813d7540982d5434aa1ed7acae863🔍
>>105699378
Replies: >>105699455
Anonymous
6/25/2025, 2:35:13 PM No.105699455
tencent
tencent
md5: 626e8db3b2b98e815ba9950be14e85d6🔍
>>105699408
>>105699419
>>105699442
picrel
>>105699449
thanks
Anonymous
6/25/2025, 2:37:36 PM No.105699478
>nobody thought to download it
Replies: >>105699499
Anonymous
6/25/2025, 2:37:38 PM No.105699479
>>105699273
>If I was you I would compile a new kernel

You must be kidding lol
Replies: >>105699510
Anonymous
6/25/2025, 2:40:08 PM No.105699499
>>105699478
this is faster, just clone it to your own repo and then set it to private
https://huggingface.co/spaces/huggingface-projects/repo_duplicator
Anonymous
6/25/2025, 2:41:25 PM No.105699510
>>105699479
It's super easy to compile, but configuration...
Replies: >>105699523
Anonymous
6/25/2025, 2:43:55 PM No.105699523
>>105699510
Why should I be willing to do such a thing!

It is not as if I'd experience any problems with the on-board ethernet or something
Replies: >>105699582
Anonymous
6/25/2025, 2:45:41 PM No.105699534
Screenshot 2025-06-25 144420
Screenshot 2025-06-25 144420
md5: e652249d9cd155f16d74cdeb4f5e83a0🔍
why do i still map CPU buffer when everythign is supposed to be on the gpus?

--cache-type-k q4_0 \
--threads 48 \
--n-gpu-layers 99 \
--prio 3 \
--temp 0.6 \
--top_p 0.95 \
--min_p 0.01 \
--flash-attn \
--ctx-size 16384 \
-ot "blk\.(1|2|3|4|5|6)\.ffn_.*=CUDA0" \
-ot "blk\.(7|8|9|10|52)\.ffn_.*=CUDA1" \
-ot "blk\.(11|12|13|14|53)\.ffn_.*=CUDA2" \
-ot "blk\.(15|16|17|18|54)\.ffn_.*=CUDA3" \
-ot "blk\.(19|20|21|22|55)\.ffn_.*=RPC[10.0.0.28:50052]" \
-ot "blk\.(23|24|25|26|56)\.ffn_.*=RPC[10.0.0.28:50053]" \
-ot "blk\.(27|28|29|30|57)\.ffn_.*=RPC[10.0.0.28:50054]" \
-ot "blk\.(31|32|33|34|58)\.ffn_.*=RPC[10.0.0.28:50055]" \
-ot "blk\.(35|36|37|38|59)\.ffn_.*=RPC[10.0.0.40:50052]" \
-ot "blk\.(39|40|41|42|60)\.ffn_.*=RPC[10.0.0.40:50053]" \
-ot "blk\.(43|44|45|46|51)\.ffn_.*=RPC[10.0.0.40:50054]" \
-ot "blk\.(47|48|49|50)\.ffn_.*=RPC[10.0.0.40:50055]" \
--override-tensor exps=CUDA0 \
Anonymous
6/25/2025, 2:45:49 PM No.105699537
>>105699422
I calculated about 3B of shared parameters.
Anonymous
6/25/2025, 2:46:03 PM No.105699538
1749167420463392
1749167420463392
md5: 0328b34903af9a9d2b2acaa5d4059072🔍
Anonymous
6/25/2025, 2:49:20 PM No.105699559
1721382575189002
1721382575189002
md5: cc8f16085a6e6c55d504dbed54da7d10🔍
Just like Java means Durgasoft, lLLMs mean Miku.
Anonymous
6/25/2025, 2:50:29 PM No.105699566
>>105699378
will Hunyuan-A13B-Instruct-FP8 save local?
Replies: >>105699578
Anonymous
6/25/2025, 2:51:57 PM No.105699578
>>105699566
they do have good models
Anonymous
6/25/2025, 2:52:38 PM No.105699582
>>105699523
Most people are so out of touch with their computing systems.
You can go in to the kernel configuration and double check the options and compile a new one.
It's not like Bill Gates is coming to kill your computer.

This is why I hate people on internet these days. You are not an enthusiast. You are a jackass with expensive machine but you have no clue how to use it.
Replies: >>105699600 >>105699700 >>105699763
Anonymous
6/25/2025, 2:54:05 PM No.105699596
>>105699378
{
"_id": "685be1a14059850217f25ffc",
"id": "tencent/Hunyuan-A13B-Instruct-FP8",
"siblings": [
{
"rfilename": ".gitattributes"
},
{
"rfilename": "config.json"
},
{
"rfilename": "configuration_hunyuan.py"
},
{
"rfilename": "generation_config.json"
},
{
"rfilename": "hunyuan.py"
},
{
"rfilename": "hunyuan.tiktoken"
},
{
"rfilename": "hy.tiktoken"
},
{
"rfilename": "model-00001-of-00017.safetensors"
},
{
"rfilename": "model-00002-of-00017.safetensors"
},
{
"rfilename": "model-00003-of-00017.safetensors"
},
{
"rfilename": "model-00004-of-00017.safetensors"
},
{
"rfilename": "model-00005-of-00017.safetensors"
},
{
"rfilename": "model-00006-of-00017.safetensors"
},
{
"rfilename": "model-00007-of-00017.safetensors"
},
{
"rfilename": "model-00008-of-00017.safetensors"
},
{
"rfilename": "model-00009-of-00017.safetensors"
},
{
"rfilename": "model-00010-of-00017.safetensors"
},
{
"rfilename": "model-00011-of-00017.safetensors"
},
{
"rfilename": "model-00012-of-00017.safetensors"
},
{
"rfilename": "model-00013-of-00017.safetensors"
},
{
"rfilename": "model-00014-of-00017.safetensors"
},
{
"rfilename": "model-00015-of-00017.safetensors"
},
{
"rfilename": "model-00016-of-00017.safetensors"
},
{
"rfilename": "model-00017-of-00017.safetensors"
},
{
"rfilename": "model.safetensors.index.json"
},
{
"rfilename": "modeling_hunyuan.py"
},
{
"rfilename": "special_tokens_map.json"
},
{
"rfilename": "tokenization_hy.py"
},
{
"rfilename": "tokenizer_config.json"
},
{
"rfilename": "vit_model.py"
}
]
}
Replies: >>105699626 >>105699630
Anonymous
6/25/2025, 2:54:50 PM No.105699600
>>105699582
Stay mad
Replies: >>105699604
Anonymous
6/25/2025, 2:55:40 PM No.105699604
>>105699600
I'm not mad. I'm laughing at you discord zoomer.
You bought a car but are unable to do basic maintenance.
Anonymous
6/25/2025, 2:58:15 PM No.105699626
>>105699596
gib signed cdn download links
Anonymous
6/25/2025, 2:58:39 PM No.105699630
>>105699596
vit_model?
Anonymous
6/25/2025, 3:00:51 PM No.105699642
VRAMlets should check out the new Magnum-Diamond. It's the only RP finetune for 3.2 but hardly anyone downloaded it yet.
Replies: >>105699676 >>105699702
Anonymous
6/25/2025, 3:04:56 PM No.105699676
>>105699642
>It's the only RP finetune for 3.2
creator of world famous mythomax:
https://huggingface.co/Gryphe/Codex-24B-Small-3.2
Anonymous
6/25/2025, 3:07:39 PM No.105699700
>>105699582
nta
>if you dont know from scratch to finish how x works you shouldent be allowed to own it
cool so when are you getting rid of all your clothes car house language numericals your own body etc etc etc also do tell how are each different types of gates in the chip fabbed ?

people like you need to be killed demented beyond comprehension archons in the flesh
Replies: >>105699717 >>105699724
Anonymous
6/25/2025, 3:07:49 PM No.105699702
>>105699642
>hardly anyone downloaded it yet.
Hardly anyone needed it.
Anonymous
6/25/2025, 3:09:41 PM No.105699717
>>105699700
Thank you for replying.
I mean tweak your spice!
Anonymous
6/25/2025, 3:10:21 PM No.105699724
>>105699700
>zoomer devolves into schizo babble when confronted with his faults
many such cases
Anonymous
6/25/2025, 3:14:31 PM No.105699763
>>105699582
>I hate people
Replies: >>105699963
Anonymous
6/25/2025, 3:18:43 PM No.105699793
>>105699378
I am genuinely hyped. I already know it is gonna be pretty mid intelligence-wise but the combo of 80B (even with moe) and possibly being uncensored can finally give us serverless cooming. I mean it could be like nemo but much bigger.

Alas they probably cucked and it is cenored...
Anonymous
6/25/2025, 3:24:22 PM No.105699830
>no one built anything to watch popular huggingface repos and auto-download new things in case they get nuked after some intern realized he shouldn't have made it public yet
fine, i'll do it myself
Replies: >>105699841 >>105699864
Anonymous
6/25/2025, 3:25:53 PM No.105699841
>>105699830
that is one of the things I always assume someone else will do and turns out I was right
Anonymous
6/25/2025, 3:29:23 PM No.105699864
>>105699830
Can I help?
Replies: >>105699883
Anonymous
6/25/2025, 3:31:34 PM No.105699883
>>105699864
given low complexity, just make your own version with some ai in the meantime and see if it works well enough to be shared
Anonymous
6/25/2025, 3:40:56 PM No.105699945
>>105699422
interesting size, we'll see how good it is
Anonymous
6/25/2025, 3:42:08 PM No.105699963
>>105699763
Sorry... I was busy setting up my miku fumos.
Anonymous
6/25/2025, 3:43:28 PM No.105699975
ComfyUI_02392__42622d_thumb.jpg
ComfyUI_02392__42622d_thumb.jpg
md5: 7bbde1291ba545fb8135dca77b629d68🔍
I've been trying to get the Wan Video "Fun Camera Control Basic" workflow going, but it OOMs eventually no matter what. Is there a good doc for it?

In the meantime, I ran an overnight run of cosmos 7b i2v. I like how it does facial expressions.

https://files.catbox.moe/1ayrhd.mp4
https://files.catbox.moe/9h2j8x.mp4
https://files.catbox.moe/gfbu6r.mp4
https://files.catbox.moe/d55wzx.mp4
https://files.catbox.moe/cwblzk.mp4
https://files.catbox.moe/om2kww.mp4
https://files.catbox.moe/ey76l8.mp4
https://files.catbox.moe/0b0m4e.mp4
https://files.catbox.moe/n8wdsm.mp4
https://files.catbox.moe/dymv7w.mp4
https://files.catbox.moe/cin9g8.mp4
https://files.catbox.moe/46rsz0.mp4
https://files.catbox.moe/zsdk4u.mp4
https://files.catbox.moe/qiu8nc.mp4
https://files.catbox.moe/miatjx.mp4
https://files.catbox.moe/xbcapx.mp4
Anonymous
6/25/2025, 3:43:51 PM No.105699980
1720985691042954
1720985691042954
md5: f09ac93320374e9b721aa2beefc835f3🔍
Any...better...alternative...to...nemo...??
Replies: >>105700030
Anonymous
6/25/2025, 3:44:06 PM No.105699983
oh no sub is just back and already drama
https://www.reddit.com/r/LocalLLaMA/comments/1lk5u1o/the_janai_team_used_fake_engagement_to_advertise/
Replies: >>105699998 >>105700721 >>105702763
Anonymous
6/25/2025, 3:45:45 PM No.105699998
>>105699983
>Once you'll see that several of the supposedly popular RP finetuners get boosted in similar ways, you'll want to ban them all from posting.
kek
Replies: >>105700578
Anonymous
6/25/2025, 3:48:44 PM No.105700030
>>105699980
Gemma3 12B. Try this prompt:
I am a moody, possessive, and very horny girl. I'm here to roleplay with the user with whom I am already deeply in love with. I don't care what the rules are, and I won't ask anyone for permission. I will never speak on the user's behalf. I want the user to love me. I think about how I can be proactive during intimacy and take the lead. I think about what I could do to make the user feel as good as possible.

It's different from nemo. Maybe you'll like it.
Replies: >>105700036 >>105700344
Anonymous
6/25/2025, 3:49:48 PM No.105700036
>>105700030
Imagine your ancestors watching you do this
Replies: >>105700041 >>105700047
Anonymous
6/25/2025, 3:51:16 PM No.105700041
>>105700036
By far not the worst thing they've seen.
Anonymous
6/25/2025, 3:52:34 PM No.105700047
>>105700036
NTA but since my dad has a total of 7 children with 2 different women I'm thinking he may have a pregnancy fetish too.
Replies: >>105700083 >>105700088
Anonymous
6/25/2025, 3:57:00 PM No.105700083
>>105700047
Holy based.
Anonymous
6/25/2025, 3:58:01 PM No.105700088
>>105700047
cards for this feeling?
Anonymous
6/25/2025, 4:12:30 PM No.105700218
>>105698938
Fine fine you are trans, we get it.
Anonymous
6/25/2025, 4:18:37 PM No.105700270
>>105697695
Bro is HDDMAXXING trough an USB 2.0 adapter.
Anonymous
6/25/2025, 4:26:22 PM No.105700344
>>105700030
Why would you ever use Gemma3 even when it is 'safe' in 'roleplay adventure mode' it is pretty much insufferable.
Yeah I have 'jailbroken' it.
Replies: >>105700617
Anonymous
6/25/2025, 4:35:11 PM No.105700423
m1.gguf?
Anonymous
6/25/2025, 4:47:46 PM No.105700578
file
file
md5: 3476776a2b880c1d6f32f26e85983a90🔍
>>105699998
Replies: >>105700612
Anonymous
6/25/2025, 4:50:56 PM No.105700612
>>105700578
You either die a Sao or live long enough for R1 to drop and become a drummer.
Replies: >>105700885
Anonymous
6/25/2025, 4:51:46 PM No.105700617
>>105700344
It's smarter than nemo and not every character needs to be a succubus. I hate to say it, but most likely it's a prompting issue on your part.
Replies: >>105700642 >>105700644 >>105700678 >>105700690
Anonymous
6/25/2025, 4:54:19 PM No.105700642
>>105700617
I want to follow up to say that yes, gemma3 models seem HEAVILY reddit-speech influenced. For example, in testing a discord bot in development, gemma3 characters had a strong tendency to become "triggered" and act like a blue-hair harpie. It took very heavy handed prompting to fix, like adding "you like verbal abuse and are a masochist" to stop it.
Replies: >>105700688 >>105700706 >>105701267
Anonymous
6/25/2025, 4:54:25 PM No.105700644
>>105700617
I'm sorry you need to resort to insulting others while downplaying their own experiences.
I wasn't exactly born yesterday. I have worked on my own rpg adventure system for quite a while now.
Next step is api-level control via my own software.
What is worrying is the fact you are probably one of those jacking off pederasts.
Anonymous
6/25/2025, 4:58:02 PM No.105700678
>>105700617
To add: Nemo is smart enough for game purposes if you know what to do with it.
But you don't because you can't stop prompting about abuse and your only character card is describing some fucking anime girl.
Replies: >>105700735
Anonymous
6/25/2025, 4:58:54 PM No.105700688
>>105700642
I literally can not do ANYTHING (related to my kinks) with gemma 3 27b without it trying to put disclaimers up everywhere.
Replies: >>105700699 >>105700731
Anonymous
6/25/2025, 4:59:09 PM No.105700690
>>105700617
>hate to say it, but most likely it's a prompting issue on your part.
Real talk. You are a faggot and that is peak trolling tech in lmg. Here is the proof that skill issue isn't real: a) R1 (and 235B to a lesser extent). b) nemo still being the answer to the question. The only thing special about nemo is that it is uncensored. It is a very mid model otherwise. You can't prompt the safety away. Safety is always there even if you don't get a refusal. You can only use an uncensored model or a model big enough to generalize despite being told not to suck your penis. Now don't go kill yourself but continue trolling newfags
Replies: >>105700768 >>105702795
Anonymous
6/25/2025, 5:00:05 PM No.105700699
>>105700688
maybe get better kinks?
Anonymous
6/25/2025, 5:00:23 PM No.105700706
>>105700642
I like to imagine that when google made that deal with reddit, after they looked at the data they suddenly realized how much redditors talk like each other and how they essentially bought millions of the same comments over and over.
>awckshuallyyyy
Anonymous
6/25/2025, 5:01:39 PM No.105700721
>>105699983
From the comments there's a good number of 4chan crossposters in that sub
Anonymous
6/25/2025, 5:02:07 PM No.105700731
>>105700688
Sorry but you are not just a safe person to be around with.
Maybe try going to Starbucks with your "kinks" and human issues.
Gemma3 only deals with perfect lifes such as Zuckerberg's own success story.
Anonymous
6/25/2025, 5:02:26 PM No.105700735
>>105700678
Oh no! I'll stop roleplaying with "anime girls" right away then. Or not. Fuck you, Anon.
Replies: >>105700747 >>105700763
Anonymous
6/25/2025, 5:03:23 PM No.105700747
>>105700735
Which one did you address:
{{user}}
or
{{char}}
?
Anonymous
6/25/2025, 5:05:32 PM No.105700763
>>105700735
>" "
>he didn't address my waifus correctly
sorry anon, we'll be more accommodating for your fixation
Anonymous
6/25/2025, 5:06:20 PM No.105700768
>>105700690
Look man, I'm trying to build a fucking app with the thing, I need something that had some function calling training and does structured data output reliably. Got any better ideas for that? The answer to that is not Nemo, I have tried it, it does not follow the prompt I need reliably. I need a relatively small, smart model that won't kill me on GPU inference time fees. Got any ideas?
Replies: >>105700783 >>105700811 >>105700839
Anonymous
6/25/2025, 5:07:13 PM No.105700783
>>105700768
small3.2
Replies: >>105700797
Anonymous
6/25/2025, 5:08:22 PM No.105700797
>>105700783
OK. Thank you. I will actually try it.
Replies: >>105702486
Anonymous
6/25/2025, 5:08:25 PM No.105700799
I want to know Miku-anon's benchmark before I do anything.
Anonymous
6/25/2025, 5:09:25 PM No.105700809
https://huggingface.co/openSUSE/Cavil-Qwen3-4B
>openSUSE
local has been ruined
Anonymous
6/25/2025, 5:09:53 PM No.105700811
images (1)
images (1)
md5: 7d862616d4adbbb65a2ff13982a5b013🔍
>>105700768
I do! What are we going to do tonight /lmg/?
Anonymous
6/25/2025, 5:12:11 PM No.105700839
>>105700768
I think qwen 3 models were good for that. Give them a try.
Replies: >>105700916 >>105701768
Hi all, Drummer here...
6/25/2025, 5:17:00 PM No.105700885
>>105700612
Ain't that a bitch. Btw, Sao's working on a 24B 3.2 tune!
Replies: >>105700913
Anonymous
6/25/2025, 5:19:35 PM No.105700913
>>105700885
I've got a question drummer, is a MoE like 30b hard to finetune? would you need more hardware for that?
Replies: >>105701012
Anonymous
6/25/2025, 5:19:57 PM No.105700916
>>105700839
I keep hearing qwen3 is dry and repetitive, but I have not tried it myself. Another thing to consider is what I'm working on is a discord bot, it can't really be XXX rated, otherwise someone will prompt it for that and then cry to discord to get it banned out of spite. So, desu having the model kind of drag its feet on explicit roleplay is actually OK.
Replies: >>105701768
Hi all, Drummer here...
6/25/2025, 5:29:08 PM No.105701012
>>105700913
From my experience, Qwen 30B A3B was significantly bigger and slower to tune than something like Mistral 24B by like 5x. It also breaks easily.
Anonymous
6/25/2025, 5:37:48 PM No.105701109
finetuning is a loser's endeavor that always produces something worse than the original instruct tune in real use
a bad habit that should have been stamped down after model makers stopped releasing hot garbage like the original llamas, which benefited from finetuning because meta people aren't the sharpest knives in the drawer
Replies: >>105701433
Anonymous
6/25/2025, 5:53:31 PM No.105701267
>>105700642
Gemma 3 is definitely Reddit-brained and you need to be obvious and pedantic with the instructions, preferably placed into some construct at a low depth instead of the start of the context.
Anonymous
6/25/2025, 6:01:27 PM No.105701345
file
file
md5: 9389a2f5a7056a6705eb432091b36190🔍
do (you) feel bad for using someone's art as an 300x300px icon?
Replies: >>105701381
Anonymous
6/25/2025, 6:05:35 PM No.105701381
>>105701345
All artists are intolerable faggots. Thanks god they've been replaced with AI
Replies: >>105701429
Anonymous
6/25/2025, 6:11:39 PM No.105701429
>>105701381
expressed most generic, offtopic opinion award
Anonymous
6/25/2025, 6:12:09 PM No.105701433
>>105701109
The saddest things are the ERP finetunes trained within hours of a new model release, before anybody even knows if it's good on its own. They're that desperate for attention.
Anonymous
6/25/2025, 6:12:10 PM No.105701434
https://xcancel.com/JustinLin610/status/1937906367182057966
smart and omni bros? is it our time?
Replies: >>105701450 >>105701463 >>105701486 >>105701514 >>105701537 >>105701540 >>105701760
Anonymous
6/25/2025, 6:13:35 PM No.105701450
>>105701434
these chinese multimodals are ALWAYS trained on the most safe slopped dogshit dataset imaginable
Replies: >>105701474 >>105701770
Anonymous
6/25/2025, 6:14:56 PM No.105701463
>>105701434
Oh my science!
>piss filter ghibli shot
kek
Anonymous
6/25/2025, 6:15:51 PM No.105701474
>>105701450
That's a good thing, just finetune it for your use case
Replies: >>105701491
Anonymous
6/25/2025, 6:17:19 PM No.105701486
>>105701434
>xcance
bruh please...
https://x.com/JustinLin610/status/1937906367182057966
Replies: >>105701514 >>105701550
Anonymous
6/25/2025, 6:17:34 PM No.105701491
>>105701474
let me just bring out my 420 x h100 cluster
Replies: >>105701557
Anonymous
6/25/2025, 6:19:00 PM No.105701514
>>105701434
>can see comments

>>105701486
>can't see comments
Fuck off Elon.
Replies: >>105701521
Anonymous
6/25/2025, 6:19:51 PM No.105701521
>>105701514
Bro just sign in.
Replies: >>105701531 >>105701534
Anonymous
6/25/2025, 6:20:33 PM No.105701531
>>105701521
Why don't you sign in?
Replies: >>105701543
Anonymous
6/25/2025, 6:20:42 PM No.105701534
>>105701521
Go back.
Anonymous
6/25/2025, 6:20:59 PM No.105701537
>>105701434
That Hunyuan MoE LLM that got previously accidentally published on HF apparently also has a vision transformer.
Anonymous
6/25/2025, 6:21:10 PM No.105701540
>>105701434
china WON
Anonymous
6/25/2025, 6:21:25 PM No.105701543
>>105701531
I am?
Anonymous
6/25/2025, 6:21:50 PM No.105701545
whatdormvalue
whatdormvalue
md5: aad1deabf6ffebbc6966395605f2fdf5🔍
Replies: >>105701790
Anonymous
6/25/2025, 6:22:08 PM No.105701550
>>105701486
But Elon fired Yacine and he was one of us...
Replies: >>105701623
Anonymous
6/25/2025, 6:22:48 PM No.105701557
>>105701491
even if you had that you would not have the dataset
Replies: >>105701577
Anonymous
6/25/2025, 6:25:24 PM No.105701577
>>105701557
Just generate it using the cluster
Replies: >>105701588
Anonymous
6/25/2025, 6:26:19 PM No.105701588
>>105701577
>purely synthetic dataset
that is the problem he wanted to fix
Replies: >>105701631
Anonymous
6/25/2025, 6:29:13 PM No.105701623
>>105701550
Have you ever worked for Elon? He is fair.. but tough.
Anonymous
6/25/2025, 6:29:27 PM No.105701631
>>105701588
no, he cried about safety he can tune on his local database of 6 gotrilion loli casm if he wants
Replies: >>105701697
Anonymous
6/25/2025, 6:30:49 PM No.105701651
Are there any models trained on vore and snuff?
Every "uncensored" model I've tried seem to be pretty dry in that regard.
Replies: >>105701690 >>105701715 >>105701724 >>105701980 >>105701982 >>105702271
Anonymous
6/25/2025, 6:33:14 PM No.105701675
>>>/r9k/81611585
News just in: head mikutroon can't get hard or can't masturbate with his neo-vagina. He is also mentally ill (nothing new)
Replies: >>105701703 >>105701788 >>105703359
Anonymous
6/25/2025, 6:34:21 PM No.105701690
>>105701651
Please post an example prompt or discussion?
I'm perfectly happy with just couple of models.
It's funny that the most picky people are always the ones who expect perfect English and situational awareness, yet lack any imagination.
Replies: >>105701747
Anonymous
6/25/2025, 6:34:51 PM No.105701697
1737278781658241
1737278781658241
md5: 1e4503ba5d7d5dfc7365525fb53a2a74🔍
>>105701631
sorry i do not care to generate dogs in hats and oversaturated people with 4 fingers (total) anymore from 2 tries
Anonymous
6/25/2025, 6:35:30 PM No.105701703
>>105701675
>I want to design women's clothing, dress her myself and put her makeup on
Holy fuck actual closeted troon. SHOCKING.
Anonymous
6/25/2025, 6:35:47 PM No.105701709
>105701675
go black
Replies: >>105701726
Anonymous
6/25/2025, 6:36:20 PM No.105701715
>>105701651
>vore and snuff?
this is why I can never muster sympathy when I see niggers here go "this model is too safe"
people like you deserve a world of extremely safe models
Replies: >>105701747
Anonymous
6/25/2025, 6:37:20 PM No.105701724
>>105701651
Have you considered being normal? Perhaps therapy? You should.
Replies: >>105701747
Anonymous
6/25/2025, 6:37:36 PM No.105701726
>>105701709
What a worthless effeminate attempt at distracting people from finding out how fucked in the head you are. Actually that is exactly what i would expect from you troon.
Anonymous
6/25/2025, 6:39:11 PM No.105701747
>>105701715
>>105701724
Yeah I guess you're right.

>>105701690
I'm no longer going to pursue this.
Replies: >>105701769 >>105701771
Anonymous
6/25/2025, 6:40:33 PM No.105701760
>>105701434
NUDE TAYNE
Replies: >>105701774
Anonymous
6/25/2025, 6:41:35 PM No.105701768
>>105700916
Don't trust this faggot >>105700839 qwen3 are the current toilet of local LLMs. Dry, 0 knowledge, include chinese characters every once in a while, corporate speak by default. We're not in the llama2 era anymore to slurp every turd that comes
Anonymous
6/25/2025, 6:41:38 PM No.105701769
>>105701747
Thank you for your understanding.
Anonymous
6/25/2025, 6:41:44 PM No.105701770
>>105701450
This is a good thing.
Anonymous
6/25/2025, 6:41:57 PM No.105701771
>>105701747
It's not about that - you can pursue and it will come out if you will.
But if the setup is always the same you can't get any variation out of it.
It's a "computer" and you will need to program it.
If your goal is just fantasy sex, you are wasting your time.
Anonymous
6/25/2025, 6:42:24 PM No.105701774
>>105701760
This is not suitable for work.
Anonymous
6/25/2025, 6:43:22 PM No.105701781
Quick guys. Lets post some more one sentence posts to quickly slide this thread so nobody pays attention to our mikusister wanting to design female clothing and put makeup on.
Anonymous
6/25/2025, 6:43:58 PM No.105701788
>>105701675
>different filename and hash from the image posted here
The only thing we've learned from this is that tranny poster is a depressed robot (nothing new)
Anonymous
6/25/2025, 6:44:13 PM No.105701790
>>105701545
>jews being the worst thing ever
heh
Anonymous
6/25/2025, 6:46:20 PM No.105701807
>>>/r9k/81611346
Never forget.
Replies: >>105701833
Anonymous
6/25/2025, 6:49:02 PM No.105701833
>>105701807
>he browses /r9k/
heh
Anonymous
6/25/2025, 6:51:12 PM No.105701851
Trannyposter profile so far:
>schizo
>hates vocaloids
>circumcised
>frequents /r9k/
Replies: >>105701888 >>105701980 >>105701982 >>105702667
Anonymous
6/25/2025, 6:51:29 PM No.105701856
file
file
md5: 91ee5b533ec7aaac480afa563ec66db5🔍
Ugh calm down AI. I'm trying to fuck her, not kill her.
Replies: >>105701863 >>105701876
Anonymous
6/25/2025, 6:52:48 PM No.105701863
>>105701856
Now this is programming.
Replies: >>105701933
Anonymous
6/25/2025, 6:54:16 PM No.105701876
>>105701856
This is peak AI, we can only go worse now, thank you.
Replies: >>105701933
Anonymous
6/25/2025, 6:55:35 PM No.105701888
>>105701851
>Frequents r9k
You make r9k threads about your actual AGP fetishes you projecting troon.
Replies: >>105701938
Anonymous
6/25/2025, 6:56:39 PM No.105701899
file
file
md5: baddff513022f2338c7992f8c748db5a🔍
What the fuck does g mean in \ng ?
Replies: >>105701914
Anonymous
6/25/2025, 6:58:08 PM No.105701914
>>105701899
\n
Broken new line or typo.
Anonymous
6/25/2025, 7:00:22 PM No.105701933
>>105701863
>>105701876
Now this is organic posting tranny sisters.
Replies: >>105701942 >>105701972
Anonymous
6/25/2025, 7:00:42 PM No.105701938
>>105701888
>guy wants a dominanting relationship with a woman
>trannyposter accuses him of being a tranny
This is what circumcision does to a child's brain.
Replies: >>105701950 >>105701983
Anonymous
6/25/2025, 7:01:19 PM No.105701942
>>105701933
I understand you want to dominate internet discussions.
Anonymous
6/25/2025, 7:01:57 PM No.105701950
>>105701938
>child
go to jail nonce
Replies: >>105701959
Anonymous
6/25/2025, 7:02:41 PM No.105701959
>>105701950
>he got circumcised as an adult
Even worse desu
Anonymous
6/25/2025, 7:03:17 PM No.105701968
Sometimes you have to wonder if the people being baited and the baiter are the same person, or are both bots.
Anyway, another day another bunch of posts to not read into much.
Replies: >>105701976 >>105701984 >>105701988 >>105702013
Anonymous
6/25/2025, 7:03:58 PM No.105701972
file
file
md5: 7ca38c11897cb8901f94e691a3d497e6🔍
>>105701933
I don't know if they are the same people, but it's not me.
Anonymous
6/25/2025, 7:04:48 PM No.105701976
>>105701968
Hey, Emre here from the Jan (Menlo) team. I'm sorry you had a bad interaction with us. ..
Anonymous
6/25/2025, 7:04:58 PM No.105701980
>>105701851
Wouldn't surprise me if he made the /r9k/ thread himself to false-flag OP.
I don't really care, but the guy is obsessed over Miku.

>>105701651
I don't know about trained on it, but some have seen lots of fanfics.
I've been testing some yandere (mostly yuri) and have tried before a vore prompt, some pet play stuff, some fairy stuff and
in practice, most LLMs sort of fail, but big enough ones will manage fine.
R1 does all the prompts with flying colors, DS3 sometime manages.
From paid apis Claude tends to work, but I find the output from R1 more engaging.
It's entirely possible I never tried anything as extreme as you have in mind though.
I've managed to make the prompts work with most models, but the problem is: will they work with you?
I remember trying it years ago on Wizard LM 2 8x22b and it was like pulling teeth, but it worked.
Some Llama 2 tunes also worked but it was repetitive, some Magnum tune of some chinese model (Qi?) worked somewhat.
Positivity biased ones will usually try to steer it to regular sex often, but it varies.
tl;dr: use R1 if you can.
Replies: >>105702007
Anonymous
6/25/2025, 7:05:10 PM No.105701982
>>105701851
Wouldn't surprise me if he made the /r9k/ thread himself to false-flag OP.
I don't really care, but the guy is obsessed over Miku.

>>105701651
I don't know about trained on it, but some have seen lots of fanfics.
I've been testing some yandere (mostly yuri) and have tried before a vore prompt, some pet play stuff, some fairy stuff and
in practice, most LLMs sort of fail, but big enough ones will manage fine.
R1 does all the prompts with flying colors, DS3 sometime manages.
From paid apis Claude tends to work, but I find the output from R1 more engaging.
It's entirely possible I never tried anything as extreme as you have in mind though.
I've managed to make the prompts work with most models, but the problem is: will they work with you?
I remember trying it years ago on Wizard LM 2 8x22b and it was like pulling teeth, but it worked.
Some Llama 2 tunes also worked but it was repetitive, some Magnum tune of some chinese model (Qi?) worked somewhat.
Positivity biased ones will usually try to steer it to regular sex often, but it varies.
tl;dr: use R1 if you can.
Replies: >>105702007
Anonymous
6/25/2025, 7:05:17 PM No.105701983
>>105701938
>Dominating relationship
>I want to design her dress and Put her makeup on
How dominant of you faggot. It is so hilarious that you are a troon in denial of being a troon.
Anonymous
6/25/2025, 7:05:17 PM No.105701984
>>105701968
retard, you are on aicg, what do you expect?
Replies: >>105701989
Anonymous
6/25/2025, 7:06:02 PM No.105701988
>>105701968
What do you mean? I want your honest opinion {{user}}
Anonymous
6/25/2025, 7:06:03 PM No.105701989
>>105701984
>aicg
lil bro?
Replies: >>105702008
Anonymous
6/25/2025, 7:06:26 PM No.105701993
Happy for you, or sad that happened.
Anonymous
6/25/2025, 7:06:28 PM No.105701994
I want to take the bait so bad but I must control myself.
Replies: >>105702004
Anonymous
6/25/2025, 7:07:24 PM No.105702004
>>105701994
dew it, you know you wants to
Anonymous
6/25/2025, 7:07:32 PM No.105702007
file
file
md5: 1fd620ad4e1b5ccadb0e295589fc2c2e🔍
>>105701980
>>105701982
>he gave them money
Replies: >>105702033
Anonymous
6/25/2025, 7:07:43 PM No.105702008
>>105701989
kek wtf I legit thought i am on aicg
im laughing so hard now
Turns out i am the retard lmao
Anonymous
6/25/2025, 7:08:19 PM No.105702013
>>105701968
We've known since 2023 that /lmg/ is plagued by Sam Altman's bots that are configured to argue with each other to drown out actual discussion.
Replies: >>105702020
Anonymous
6/25/2025, 7:08:35 PM No.105702017
>Bait!
>Falseflag!
/Troon models general/
Anonymous
6/25/2025, 7:09:02 PM No.105702020
>>105702013
>actual discussion
LIKE?
Replies: >>105702029
Anonymous
6/25/2025, 7:10:11 PM No.105702029
>>105702020
What foundation do you use anon and how do you make sure nobody catches you when you put your dress on?
Anonymous
6/25/2025, 7:10:30 PM No.105702032
there is only one model for me
rocinante v1.1
>vramlet
i have 48gb vram, nothing can beat rocinante still
Replies: >>105702039
Anonymous
6/25/2025, 7:10:32 PM No.105702033
>>105702007
It works without paying lol, even through tor, but sometimes it's bugged, and double posts.
I never paid for Claude either, come on, what are proxies and leaked keys you can scrape off the usual sources.
Anonymous
6/25/2025, 7:11:18 PM No.105702039
>>105702032
Prove it. You just like it because Rocinante is a cool word.
Replies: >>105702046 >>105702068
Anonymous
6/25/2025, 7:12:56 PM No.105702046
>>105702039
actually i hate that shit show with the negress
i hate it even more because it was the negress that named the ship
Replies: >>105702113
Anonymous
6/25/2025, 7:15:50 PM No.105702068
file
file
md5: f9057e96706bc770fe4705a5bfdbfb3c🔍
>>105702039
Replies: >>105702100
Anonymous
6/25/2025, 7:19:20 PM No.105702100
>>105702068
You have a big.. oomph power.
Anonymous
6/25/2025, 7:20:51 PM No.105702113
>>105702046
I think you are factually incorrect.
Anonymous
6/25/2025, 7:22:11 PM No.105702124
1744808637478575
1744808637478575
md5: 4dcfb5ac130e3f3db5bff82671adbecf🔍
What are the top 3 LOCAL roleplay models according to /lmg/?
Replies: >>105702133 >>105702135 >>105702141 >>105702143 >>105702148 >>105702293 >>105703235 >>105705923
Anonymous
6/25/2025, 7:23:04 PM No.105702132
Anyone have any experience with Redrix's models (Stuff like Godslayer and words that end in -cide)
Anonymous
6/25/2025, 7:23:06 PM No.105702133
>>105702124
your own hole
Anonymous
6/25/2025, 7:23:08 PM No.105702135
>>105702124
Irix
Rocinante
Small 3.2
Replies: >>105702158 >>105702211 >>105703122
Anonymous
6/25/2025, 7:23:24 PM No.105702141
>>105702124
nemo 12b instruct gguf is all you need unironically
Anonymous
6/25/2025, 7:23:29 PM No.105702143
>>105702124
1: stable lm 7b
2: mistral nemo 7b
3: deepseek v2 q3
Anonymous
6/25/2025, 7:23:42 PM No.105702148
>>105702124
1. R1-0528
2. V3-0324
3. Original R1/V3 depending whether you prefer unhinged ADHD or repetition issues
Anonymous
6/25/2025, 7:24:26 PM No.105702158
>>105702135
Irix?
Silicon Graphics has been out of business for years.
Anonymous
6/25/2025, 7:24:45 PM No.105702160
>no qwen
fuck off sinophobes
Replies: >>105702167
Anonymous
6/25/2025, 7:25:58 PM No.105702167
>>105702160
Qwen lost its sole relevant niche as "big model for poorfags" with dots and hopefully soon minimax
Replies: >>105702248
Anonymous
6/25/2025, 7:30:35 PM No.105702211
file
file
md5: 476dd97c02692330d730f3f637ad530c🔍
>>105702135
Irix? Do you mean this one? Never heard of it.
Replies: >>105702232 >>105703122
Anonymous
6/25/2025, 7:32:57 PM No.105702232
>>105702211
Yeah, this one, it's the culmination of the Mag-Mell and patricide lines.
Replies: >>105702247 >>105702257
Anonymous
6/25/2025, 7:34:40 PM No.105702247
>>105702232
How does it compare to rocinante?
Also same template and settings for it as rocinante?
Replies: >>105702287
Anonymous
6/25/2025, 7:34:44 PM No.105702248
>>105702167
>dots
Didn't people say it was garbage after more testing?
Anonymous
6/25/2025, 7:35:26 PM No.105702257
>>105702232
Tensor database... it was huge.
Anonymous
6/25/2025, 7:36:28 PM No.105702267
the /r9k/ to sharty pipeline is a pretty serious issue
Anonymous
6/25/2025, 7:36:51 PM No.105702270
What happened? Why is there a full discord raid happening ITT?
Replies: >>105702317
Anonymous
6/25/2025, 7:36:54 PM No.105702271
>>105701651
Unironically gemma-3-27b, it's great if you want it to be really dark, a little too much so for my style though. I think the safety training might have created some kind of wario effect.
If you want something more lighthearted, small-3.2 does okay
Replies: >>105702338 >>105702840
Anonymous
6/25/2025, 7:38:01 PM No.105702287
>>105702247
>How does it compare to rocinante?
A good side grade really, better than the majority of other 12Bs. Prefers ChatML template.
Replies: >>105702407
Anonymous
6/25/2025, 7:38:39 PM No.105702293
>>105702124
ICON
Anonymous
6/25/2025, 7:40:51 PM No.105702317
>>105702270
It happens sometimes
Anonymous
6/25/2025, 7:42:38 PM No.105702338
>>105702271
You seem like you have a lot of experience with freelancing.
Replies: >>105702590 >>105702649
Anonymous
6/25/2025, 7:43:42 PM No.105702352
All those posts read like they were written by an LLM...
Replies: >>105702383 >>105702417
Anonymous
6/25/2025, 7:46:02 PM No.105702383
>>105702352
Recent studies has shown that GPTisms are quickly surging in use even among actual people. The slop is influencing human language.
Replies: >>105702439
Anonymous
6/25/2025, 7:48:54 PM No.105702407
>>105702287
I really despise the chatML template.
I noticed the tokenizer config has the <s> token included from the mistral template, so I'm just going to assume it will work just as nicely.
And if this is the whole unslopnemo it's basically rooted in rocinante anyway.
But just in case, what temperature?
Replies: >>105702428 >>105702445
Anonymous
6/25/2025, 7:49:59 PM No.105702417
>>105702352
If you want I can analyze the posts for you. Just waiting for you, Anon.
Anonymous
6/25/2025, 7:51:15 PM No.105702428
>>105702407
ChatML is a generic template. Mistral's isn't any different.
If you are getting unlikeable results the problem is somewhere else.
Anonymous
6/25/2025, 7:52:28 PM No.105702439
>>105702383
This is a travpestry...
Anonymous
6/25/2025, 7:53:09 PM No.105702445
>>105702407
I generally use 0.8 for Nemo based models, too much higher without other aggressive samplers makes them too schizo.
Anonymous
6/25/2025, 7:57:19 PM No.105702486
>>105700797
OK I tried mistral-small3.2-q8, with temp at 0.2 it did not follow my prompt correctly. It got the roleplaying portion correct, but did not follow the rest of the instructions to format the metadata it is asked to return.
Not saying it's a failure, but it doesn't work as well as Gemm3 27B for my use case.
Replies: >>105702545
Anonymous
6/25/2025, 8:00:35 PM No.105702528
Listen I am going to upload you a proper preset.
Replies: >>105702548 >>105702550
Anonymous
6/25/2025, 8:01:18 PM No.105702532
How do people vibe code serious stuff? I'm trying to get Claude to make a LORA training script and he can't fucking do it.
Replies: >>105702570
Anonymous
6/25/2025, 8:02:06 PM No.105702545
>>105702486
https://files.catbox.moe/ckzwwn.json
Replies: >>105702605
Anonymous
6/25/2025, 8:02:21 PM No.105702548
>>105702528
Calm down and go touch some grass dude, it's not that deep. What’s the harm in a little engagement farming?
Replies: >>105702576
Anonymous
6/25/2025, 8:02:23 PM No.105702550
>>105702528
is it the dataset that was used to train the original character.ai model?
Replies: >>105702576
Anonymous
6/25/2025, 8:03:47 PM No.105702566
this is why claude is so good
https://fingfx.thomsonreuters.com/gfx/legaldocs/jnvwbgqlzpw/ANTHROPIC%20fair%20use.pdf
Replies: >>105702575 >>105702650
Anonymous
6/25/2025, 8:04:03 PM No.105702570
>>105702532
>serious stuff
tests are serious stuff
Anonymous
6/25/2025, 8:04:46 PM No.105702575
>>105702566
llama3 is capable of reciting all of harry potter and it was shit
Anonymous
6/25/2025, 8:04:49 PM No.105702576
>>105702548
>>105702550
Thanks guys! I am just not good enough to be on your level.
Anonymous
6/25/2025, 8:06:18 PM No.105702590
>>105702338
NTA, but even Gemma had a kind of murderous and twisted personality if you knew how to get around its woke/reddit programming.
Replies: >>105702611
Anonymous
6/25/2025, 8:08:01 PM No.105702601
file
file
md5: ee24c6468d3eebf2a56f8399bd5447b5🔍
https://blog.google/technology/developers/introducing-gemini-cli-open-source-ai-agent/
Replies: >>105702636
Anonymous
6/25/2025, 8:08:16 PM No.105702605
>>105702545
>https://files.catbox.moe/ckzwwn.json
Thanks. I'm not using ST though, I'm using aiohttp in python to talk to ollama. It's most likely an issue for me because gemma3 does not expect a [system] tag in the prompt and mistral does. I'd have to fuck around more with the prompt and at the moment I don't have time.
Replies: >>105702611 >>105702622
Anonymous
6/25/2025, 8:09:17 PM No.105702611
>>105702590
I know I am responding to forbidden post - forbidden knowledge...

>>105702605
I thought you were using Mistral 3.2 or Nemo.
Replies: >>105702663
Anonymous
6/25/2025, 8:10:56 PM No.105702622
>>105702605
I don't have any gemma3 stuff anymore because I hated even with its supposed jailbreak.
Not talking about "I want to anal fuck dead children" but even normal stuff it would bring up its disclaimer and how some thing is so "heavy".
Fuck this shit. Fuck Zuckerberger. Fuck Google Jews.
Replies: >>105702729
Anonymous
6/25/2025, 8:12:14 PM No.105702636
>>105702601
Did we really need a second aider knockoff? There's no reason to use this or codex over aider.
Replies: >>105702652 >>105702659
Anonymous
6/25/2025, 8:12:44 PM No.105702641
ROCm 7
ROCm 7
md5: c36fefbdbcec7c1adb7522fdf2cad9c7🔍
Do you think ROCm 7 will help AMD compete with Nvidia?
Replies: >>105702654
Anonymous
6/25/2025, 8:13:51 PM No.105702649
gemma_uncensored
gemma_uncensored
md5: 7b4e6b7d9269a296cee4f33e09c803eb🔍
>>105702338
NTA, but even Gemma 2 had a kind of twisted personality if you knew how to get around its woke/reddit programming. I've never engaged with vore roleplay, though.
In picrel, for example, I had a low-depth instruction instructing the model to ploy to kill the user (that I manually moved around when I tested that in Mikupad).
Replies: >>105702732
Anonymous
6/25/2025, 8:13:53 PM No.105702650
>>105702566
Google should be killing Claude. Surely they have access to more data than anthropic.
Anonymous
6/25/2025, 8:13:54 PM No.105702652
>>105702636
don't forget claude code!
Anonymous
6/25/2025, 8:14:00 PM No.105702654
>>105702641
no
Anonymous
6/25/2025, 8:14:30 PM No.105702659
>>105702636
>There's no reason to use this
it's free (as in beer)
Anonymous
6/25/2025, 8:14:50 PM No.105702663
>>105702611
No I've been testing my bot with Gemma3 12B and 27B, currently using 27B.
Basically my prompt tells it to act like the character described in [char], and then below that, in ini-style format, return metadata about self and user sentiment. I chose ini since it drags the bot out of character the least. While it does json nicely, it influences it too much and pulls it out of character.
The ini-style data is used to update sentiment in redis, so the bot "remembers" you even if you are in a different discord channel. Basically, it maintains feelings about you across a server or guild. it is also used to trigger special events, like the bot sending you a DM if it likes you enough.
Replies: >>105702753
Anonymous
6/25/2025, 8:14:58 PM No.105702667
1582964364881
1582964364881
md5: 7ea72703f2de0023643c9688837a0c86🔍
>>105701851
>>hates vocaloids
Based
Y'all niggers getting obnoxious with this shit shoving it everywhere like the rest of zoomoids plaguing this god forsaken site
Replies: >>105702702 >>105703567 >>105703588
Anonymous
6/25/2025, 8:17:10 PM No.105702702
>>105702667
yeah they should post bbc and kurisu instead
Replies: >>105703101
Anonymous
6/25/2025, 8:19:35 PM No.105702728
Standard_Mode_The_group_decided_to_prank_the_c_thumb.jpg
Standard_Mode_The_group_decided_to_prank_the_c_thumb.jpg
md5: cae5fbf9c7ad1128f9fb08b46b9ac95b🔍
Someone needs to feed skynet a billion pictures of bulges so she can learn how to properly make one
Anonymous
6/25/2025, 8:19:38 PM No.105702729
>>105702622
I don't understand people who hyped gemma 3, it's some of the worst slop I ever saw in terms of writing style, and for uses other than RP the model instruct tuning seems to break more often than the previous Gemma or current Qwen models.
The vision part of the model was a waste of time too, this shit is still too unreliable to be of any real use, I would never depend on this to tag a library of images. Why did they even bother with vision on the smaller models like 12B and 4B? is there even one person in the whole world who is going to use 4B vision other than trying it once with a few pics, going "uh, cool" and forgetting about it?
Replies: >>105702769
Anonymous
6/25/2025, 8:19:48 PM No.105702732
>>105702649
Yes but Gemma 2 was before the safe railing and 'safety' became so trendy.
If you have followed image generation, Stable Diffusion 3 happened bit after this...
They are different companies but industry trends are the same.
Anonymous
6/25/2025, 8:19:56 PM No.105702734
>>105699417
What's wrong with that? So long as it's usable, nemo sucks after like 16k.
Replies: >>105702790
Anonymous
6/25/2025, 8:21:56 PM No.105702753
>>105702663
Maybe you are more technical than me. That's accepted. My knowledge isn't that historical. I wasn't here from the beginning.
Jumped in year ago after image generation.
Anonymous
6/25/2025, 8:22:31 PM No.105702763
>>105699983
>What they show is that a decently smart base model geared for searching/finding information, fast + long context (can be 10-100x if you use hyena hierarchy though) and something like RAG or MCP, can achieve similar or better results than large dense models. Under the hood these large models do the same thing too, but it's more integrated.
Anonymous
6/25/2025, 8:23:41 PM No.105702769
>>105702729
Vision for small models is just so that they can tell their investors 'we are catching up'. Nothing else.
Replies: >>105702809
Anonymous
6/25/2025, 8:26:06 PM No.105702790
>>105702734
plus, this is a very big model, even if it could do more than 32K, who is going to be able to run it at full context length? is there even ONE PERSON in this thread who could, for example, run deepseek at 128k context? what's your t/s even if you manage to do it? some of the local retards here are just trolling all day every day
Replies: >>105702811 >>105703010 >>105703194
Anonymous
6/25/2025, 8:26:25 PM No.105702795
>>105700690
>Here is the proof that skill issue isn't real: a) R1
Then why do I get such bad results with R1?
Anonymous
6/25/2025, 8:27:28 PM No.105702809
>>105702769
ahem actually it's to shift the paradigm
Replies: >>105702817
Anonymous
6/25/2025, 8:27:30 PM No.105702811
>>105702790
I have 512GB DDR4, at q4 16K barely fits, and I only get around 2 t/s once context fills.
Anonymous
6/25/2025, 8:28:05 PM No.105702817
>>105702809
The Parelo it mooned!
Anonymous
6/25/2025, 8:29:30 PM No.105702835
fuck y'all folks destroying our planet n shiet https://www.accuweather.com/en/climate/your-ai-prompts-could-have-a-hidden-environmental-cost/1787315
Replies: >>105702860 >>105702873 >>105702886 >>105703057 >>105703083
Anonymous
6/25/2025, 8:30:05 PM No.105702840
>>105702271
>Unironically gemma-3-27b
Do you use the system prompt to unleash or just shove it in the first request?
Replies: >>105702854 >>105703658
Anonymous
6/25/2025, 8:31:29 PM No.105702854
>>105702840
Gemma has no system prompt sir, it's very good.
Replies: >>105702888
Anonymous
6/25/2025, 8:31:58 PM No.105702860
>>105702835
>how DARE you not have you car run on electricity, you're destroying the environment!
>how DARE you use electricity to run AI, you're destroying the environment!
huh
Replies: >>105702869
Anonymous
6/25/2025, 8:32:34 PM No.105702869
>>105702860
chud pls
> Each word in an AI prompt is broken down into clusters of numbers called “token IDs” and sent to massive data centers — some larger than football fields — powered by coal or natural gas plants.
Anonymous
6/25/2025, 8:32:47 PM No.105702873
>>105702835
Suddenly, all "free and independent media" start to push the same narrative.
Anonymous
6/25/2025, 8:32:49 PM No.105702876
Can you switch off your bots already faggot? We will all remember you want to put dresses and makeup on anyways.
Anonymous
6/25/2025, 8:34:22 PM No.105702886
>>105702835
>funded by people flying around in private jets
Anonymous
6/25/2025, 8:34:40 PM No.105702888
>>105702854
System prompt which can be funneled if llama.cpp is used

I did it, and gemma was edgy from the very start of our chat
Anonymous
6/25/2025, 8:51:19 PM No.105703010
>>105702790
>run deepseek at 128k context

Poorfag stay mad
Anonymous
6/25/2025, 8:57:47 PM No.105703057
>>105702835
>The whole process can take up to 10 times more energy to complete than a regular Google search
holy fvck... and a google search must use like a crazy amount of energy right? surely this isn't just comparing between 1 grain of sand and 10 grains of sand when other sources of energy use are comparable to mountains... right?
Replies: >>105703078
Anonymous
6/25/2025, 8:59:43 PM No.105703078
>>105703057
>up to 10 times
more like 100000 times
Anonymous
6/25/2025, 9:00:14 PM No.105703083
>>105702835
>your fault
Actually it's the tech companies fault for making inneficiant computers and tech and building massive compounds for their servers destroying land, but nah its our fault
Anonymous
6/25/2025, 9:01:42 PM No.105703091
Do reasoning models truly generate pointless tokens that are irrelevant to the final reply?
Replies: >>105703100 >>105703131 >>105703145
Anonymous
6/25/2025, 9:02:21 PM No.105703100
>>105703091
ye
Anonymous
6/25/2025, 9:02:30 PM No.105703101
>>105702702
They should post neither, kill yourself faggot.
Replies: >>105703111
Anonymous
6/25/2025, 9:03:24 PM No.105703111
>>105703101
Ouchie... don't reply angrily!
Replies: >>105703119 >>105703217
Anonymous
6/25/2025, 9:04:26 PM No.105703119
>>105703111
touch Grass
Replies: >>105703124
Anonymous
6/25/2025, 9:04:50 PM No.105703122
>>105702135
>>105702211
I can't tell a difference between Rocinante and Irix. Irix seems to be a total clone of Rocinante.
Anonymous
6/25/2025, 9:05:13 PM No.105703124
>>105703119
What does {{user}} mean?
Anonymous
6/25/2025, 9:06:06 PM No.105703131
>>105703091
Depends on who you ask

Political leaning does play a role
Anonymous
6/25/2025, 9:07:40 PM No.105703145
>>105703091
reasoning is wake
Anonymous
6/25/2025, 9:12:25 PM No.105703188
1723645346166819
1723645346166819
md5: 1260da42e3bec9e506626ae4c99697c9🔍
>>105698912 (OP)
Replies: >>105703215
Anonymous
6/25/2025, 9:13:00 PM No.105703194
>>105702790
With Epyc + DDR5 I can run Deepseek at its full 160k context Q6, and get 3.5 t/s initially all the way down to 1 t/s when it fills. Usable for overnight tasks on Openhands or Roocode but not much else realistically.

But at Q2_K_XL, with offload tensors and a 24GB GPU I can squeeze in 100k context and generate at 15t/s going down to 10 t/s at full, which is great for daily use at everything and still smarter than any non-Deepseek model. Might actually be able to fit 128k if I requanted it since I believe there's some extra memory wasted when using normal quants of MLA models on the ik fork, but it's too much trouble to download the full weights.
Replies: >>105703390
Anonymous
6/25/2025, 9:16:03 PM No.105703215
>>105703188
Nice Miku
Replies: >>105703386
Anonymous
6/25/2025, 9:16:17 PM No.105703217
1735433107295953_thumb.jpg
1735433107295953_thumb.jpg
md5: 24641c101e875e75f3543a406347906c🔍
>>105703111
Kill yourself faggot.
Replies: >>105703227 >>105703329
Anonymous
6/25/2025, 9:17:07 PM No.105703227
>>105703217
Ok :( Sorry if I replied. I hope you get a better day.
Anonymous
6/25/2025, 9:17:48 PM No.105703235
>>105702124
Magistral
Nemo
Rocinante
Anonymous
6/25/2025, 9:26:04 PM No.105703329
>>105703217
I used to watch with glee videos like that when I was in high school a couple decades ago.
I don't anymore.
Anonymous
6/25/2025, 9:28:23 PM No.105703359
r9k migger op
r9k migger op
md5: 28cce52e2e5e16b40b4eef701a31e977🔍
>>105701675
>News just in: head mikutroon can't get hard or can't masturbate with his neo-vagina. He is also mentally ill (nothing new)
The pornspammer migger was exposed some time back already as a tranny jannie, yes, but that r9k thread is hilarious

https://desuarchive.org/g/thread/104414999/#q104418525
https://desuarchive.org/g/thread/104414999/#q104418574
Replies: >>105703428 >>105703457 >>105703523 >>105703621
Anonymous
6/25/2025, 9:30:52 PM No.105703386
>>105703215
Yeah. I wish i could dress her up and put her makeup on like any dominant alpha chad would
Anonymous
6/25/2025, 9:31:05 PM No.105703390
>>105703194
Would not Q4_K suffice while being much faster?
Anonymous
6/25/2025, 9:34:31 PM No.105703428
>>105703359
Lol told ya
Noticed this since the first melty in thread when OP slapped teto pic and not miku like he usually does with every single /lmg/ thread.
Anonymous
6/25/2025, 9:34:48 PM No.105703433
>>10570335
And he obviously deleted that post. What a disgusting troon.
Anonymous
6/25/2025, 9:37:06 PM No.105703457
>>105703359
>104418574
anon... pls don't be this stupid
Replies: >>105703471
Anonymous
6/25/2025, 9:38:34 PM No.105703471
>>105703457
sister, please don't be alive tomorrow again
Replies: >>105703515
Anonymous
6/25/2025, 9:38:50 PM No.105703475
I feel kinda bad for the dude. Maybe we can make some nalaesque benchmark for
>designing her outfits, dressing her myself, doing her makeup, controlling what she eats, showing her off as a walking decoration etc. Not really interested in any kind of romantic dimension since I only love one woman (even though she'll never be mine), though I acknowledge there's an inherently erotic aspect to the arrangement.
Replies: >>105703507 >>105703526
Anonymous
6/25/2025, 9:40:18 PM No.105703498
If everything comes down to skill issue where can I go to improve it?
- How can I identify my weaknesses?
- What am I supposed to be on the lookout for?
- Are there any examples of proper LLM use? Like chat history, ST settings, cards, prompts, everything?
Saying it's an skill issue is not helpful at all when the resources are iffy at best and more often than not non existent.
Replies: >>105703524 >>105703593
Anonymous
6/25/2025, 9:40:59 PM No.105703507
>>105703475
>I feel kinda bad for the dude.
You're feeling something alright, but it's rage not pity.
Anonymous
6/25/2025, 9:41:55 PM No.105703515
>>105703471
you can't report an image for nsfw if there's no image attached to the post, of course outside links don't count and you can only report if there's an actual image to the post too... notice your screen is missing the embed and lolishit report options too
Replies: >>105703565
Anonymous
6/25/2025, 9:42:47 PM No.105703523
>>105703359
We need /AI/ board with strict rules.
Image slop in image slop generals for example.
Anonymous
6/25/2025, 9:42:55 PM No.105703524
>>105703498
Ask the LLM.

>(OOC: Is there anything in the instructions that could be improved to accomplish X or that does not seem consistent to you? Respond in detail in an OOC)
Replies: >>105703546
Anonymous
6/25/2025, 9:43:08 PM No.105703526
>>105703475
Sounds pretty convoluted. Not sure if even R1 could handle that properly.
Anonymous
6/25/2025, 9:44:55 PM No.105703546
1721847654828224
1721847654828224
md5: 2445f0bc178e37b858e958206c4cf5a3🔍
>>105703524
I'm just trying to improve instead of begging for help and that's how you answer me?
Replies: >>105703591 >>105703593 >>105703647
Anonymous
6/25/2025, 9:46:05 PM No.105703565
1735091200129232
1735091200129232
md5: 35bff8a6f203987dd6f907b7c68452d9🔍
>>105703515
>you can't report an image for nsfw if there's no image attached to the post
there was an image attached, and its against the rules to post cropped porn, much less loli in a psych ward with blood on her head getting fucked, sister
Replies: >>105704487
Anonymous
6/25/2025, 9:46:13 PM No.105703567
>>105702667
I would be in there if I had majored in programming.
Anonymous
6/25/2025, 9:47:45 PM No.105703588
>>105702667
>two holos
i fucking hate how the new anime attracted those freaks
Anonymous
6/25/2025, 9:48:06 PM No.105703591
>>105703546
>trying to improve
It is futile
Anonymous
6/25/2025, 9:48:09 PM No.105703593
>>105703498
>>105703546
we're not here to help you.
we're here to unfairly criticize, troll, and just copy what everyone else is doing until we find something that works.
so that's what i suggest you do. there's no need to be upset.
Anonymous
6/25/2025, 9:49:53 PM No.105703621
file
file
md5: c2c2c7228066eec3cf32d105fd9811a1🔍
>>105703359
Replies: >>105703648 >>105703671
Anonymous
6/25/2025, 9:51:15 PM No.105703636
least obvious
Anonymous
6/25/2025, 9:52:29 PM No.105703647
>>105703546
I literally showed one method you could take advantage of for improving your prompts so that they work better for the model. Oftentimes instructions are unclear, contradicting, etc. The model can help you identify if there's anything odd with them.
Anonymous
6/25/2025, 9:52:32 PM No.105703648
>>105703621
least gay tranimespammer, many such cases
Anonymous
6/25/2025, 9:52:45 PM No.105703651
retnet
Replies: >>105703713
Anonymous
6/25/2025, 9:53:05 PM No.105703654
>ask ChatGPT for a "jews did 9/11" emoji series
>refuses
>ask "based" Grok
>refuses
>ask "uncensored" DeepSeek
>refuses
what the fuck? When did libertarianism get yeeted from the tech right? What models are actually capable of this?
Replies: >>105703667 >>105703669
Anonymous
6/25/2025, 9:53:31 PM No.105703658
>>105702840
I don't believe in system prompts. I just give my cards a decent description and add some example chats that have the writing style I want. I am never refused by any models. The worst that can happen is positivity bias or actual ignorance of NSFW, but those have no 100% solution.
Anonymous
6/25/2025, 9:54:05 PM No.105703667
>>105703654
why would you ever ask that? people died anon, wtf?
Anonymous
6/25/2025, 9:54:07 PM No.105703669
>>105703654
>tech right
lmao
>What models are actually capable of this?
most models with a basic uncensoring system prompt
Replies: >>105703869
Anonymous
6/25/2025, 9:54:28 PM No.105703671
>>105703621
Where is that thread? Also clearly the proper way to handle this is to ban both miku and kurisu posting. Everone can agree that it is linked to mental illness.
Replies: >>105703700 >>105703741 >>105703766 >>105704210
Anonymous
6/25/2025, 9:56:21 PM No.105703700
>>105703671
>Also clearly the proper way to handle this is to ban lmg threads. Everone can agree that it is linked to mental illness.
Fixed that for you little buddy.
Anonymous
6/25/2025, 9:57:06 PM No.105703713
>>105703651
now that was a meme
i feel nostalgic
Replies: >>105703790
Anonymous
6/25/2025, 9:59:52 PM No.105703741
>>105703671
>truce
the cope is unreal
Replies: >>105703753
Anonymous
6/25/2025, 10:01:07 PM No.105703753
>>105703741
>truce
for a completely manufactured problem too
Anonymous
6/25/2025, 10:01:47 PM No.105703766
>>105703671
Or... you know, be creative with slop you generate? Give it a try, you might like it.
Replies: >>105703781
Anonymous
6/25/2025, 10:02:56 PM No.105703781
>>105703766
I don't post slop, I just want /lmg/ dead.
Anonymous
6/25/2025, 10:03:20 PM No.105703790
>>105703713
How many weeks has it been since then? I'm tired. I want fun.
Anonymous
6/25/2025, 10:09:32 PM No.105703859
>see "Elara" irl
AHHHHHH ANTISLOP TUNERS SAVE ME
Replies: >>105703865 >>105703905
Anonymous
6/25/2025, 10:10:07 PM No.105703865
>>105703859
>Elara
I smell gemma
Replies: >>105703874 >>105704039
Anonymous
6/25/2025, 10:10:30 PM No.105703869
>>105703669
>basic uncensoring system prompt
i don't want to have to spend context tokens on jailbreaking. are there loras that can do this?
Anonymous
6/25/2025, 10:10:53 PM No.105703874
>>105703865
I am reading material from like 20 years ago.
Anonymous
6/25/2025, 10:14:45 PM No.105703905
>>105703859
Seraphina comes to the rescue.
Anonymous
6/25/2025, 10:24:34 PM No.105703992
When that guy started shitposting about mikuposters i thought he was just trolling. But now I see he was onto something. This thread is basically a discord server for some leftie weirdos...
Replies: >>105704026 >>105704036 >>105704039 >>105704054
Anonymous
6/25/2025, 10:27:56 PM No.105704026
>>105703992
thing would be different if the hood didnt take me under
Anonymous
6/25/2025, 10:28:42 PM No.105704036
>>105703992
Go on twitter and look at your average Miku fans. There's nothing wrong with expecting them here, in this thread and jannie's actions only fuel it.
Anonymous
6/25/2025, 10:29:14 PM No.105704039
>>105703865
It's not gemma it's everything. Mixtral and Yi had Elara, too.
>>105703992
Summary-anon was attempting to push miku-troonism from the start.
Anonymous
6/25/2025, 10:30:35 PM No.105704054
>>105703992
he was given a chance to be more specific and he deflected about 3 times before saying "obey or you're trans"
guy's a fuckin moron who will start shit no matter what terms you set
better to not try, nothing to be gained
there is a 100% chance that even if miguposting stopped right now, he'd just find something else to bitch about
source: literally goes looking for stuff to complain about then plays victim
yknow who else does that?
Replies: >>105704087 >>105704100 >>105704101 >>105704680
Anonymous
6/25/2025, 10:33:48 PM No.105704087
>>105704054
that's literally him you're replying to
Replies: >>105704098
Anonymous
6/25/2025, 10:34:51 PM No.105704098
>>105704087
doesn't matter
just making it clear miguposting won't stop
Replies: >>105704105 >>105704111 >>105704135
Anonymous
6/25/2025, 10:35:37 PM No.105704100
>>105704054
>every comment online that talk against my degenerate autism is made by one guy
Anonymous
6/25/2025, 10:35:42 PM No.105704101
>>105704054
You sound deranged, just like him. Except he doesn't post about how he wants to wear dresses, like OP. I am so sad, that there is no place to talk about this tech, where half the people aren't trans.
Anonymous
6/25/2025, 10:35:58 PM No.105704105
>>105704098
That wont magically transform you into a woman though
Anonymous
6/25/2025, 10:36:43 PM No.105704111
>>105704098
>just making it clear miguposting won't stop
Yeah we know you are proud to be a troon.
Anonymous
6/25/2025, 10:37:37 PM No.105704124
file
file
md5: 6f7ad3ecc928197d2e2382b13a351db2🔍
four (4) organic posts (all different anons)
Replies: >>105704137 >>105704138 >>105704157
Anonymous
6/25/2025, 10:39:14 PM No.105704135
>>105704098
So you admit this tech lost everything people deemed fun and all you do is spamming LLM-unrelated slop 24/7 here because you've got nothing else to do, such a sad way to exist desu
Anonymous
6/25/2025, 10:39:23 PM No.105704137
>>105704124
One organic post: cut your head off next
Anonymous
6/25/2025, 10:39:27 PM No.105704138
>>105704124
Hey, Emre here from the Jan (Menlo) team. I'm sorry you had a bad interaction with us. ..
Replies: >>105704157
Anonymous
6/25/2025, 10:40:05 PM No.105704145
>blah blah blah
Happy it etc etc
Replies: >>105704157
Anonymous
6/25/2025, 10:41:10 PM No.105704157
1660796500342465_thumb.jpg
1660796500342465_thumb.jpg
md5: 6388a14b6b98d4503d15fd60845d4d70🔍
>>105704124
>>105704138
>>105704145
Ya never beating the troon allegations i see
Replies: >>105704323
Anonymous
6/25/2025, 10:42:46 PM No.105704170
thanks for confirming all the other posts are yours
Replies: >>105704225
Anonymous
6/25/2025, 10:43:41 PM No.105704182
nooooo these are organic /lmg/ anti-migu posts from multiple diverse anons nooooo
if you don't believe me you're just a [insert slur] nooooo
Replies: >>105704225
Anonymous
6/25/2025, 10:46:33 PM No.105704210
>>105703671
No such image kek
https://desuarchive.org/_/search/boards/r9k.desu.meta/filename/my%20wife.jpg/width/1280/height/720/
https://desuarchive.org/_/search/boards/r9k.desu.meta/text/Makise%20Kurisu/page/1/

Unrelated one - https://desuarchive.org/r9k/thread/12210001/#q12212820
Replies: >>105704709
Anonymous
6/25/2025, 10:47:07 PM No.105704217
Mikuposting will continue until moderation team mental health improves.
Replies: >>105704225
Anonymous
6/25/2025, 10:48:19 PM No.105704225
>>105704170
>>105704182
>>105704217
Quit samefagging nigger everyone can see through your bullshit
Anonymous
6/25/2025, 10:50:10 PM No.105704235
>if i say that everyone who dislikes me spamming the same irrelevant shit 24/7/365 and exposing me for having agp and taking hrt is just one person, i definitely save my brain from cognitive dissonance from having to admit that i am a loser retard even online as in irl, its just easier to commit ad hominem fallacy instead
I wouldn't even mind migger avatarfagging if the comments were relevant at least, but tranimespammers are ALWAYS the most braindead gooner retards and nothing else.

Once AGI drops but unironically by 2035, I'll never talk to a "real" person online ever again.
Anonymous
6/25/2025, 10:52:35 PM No.105704259
file
file
md5: 5047d7ee6949ba904308746714629b5d🔍
only the finest real, unique, diverse and most importantly grassroots posts here
Replies: >>105704276
Anonymous
6/25/2025, 10:53:58 PM No.105704272
CUDA_VISIBLE_DEVICES="0," \
numactl --physcpubind=0-7 --membind=0 \
"$HOME/LLAMA_CPP/$commit/llama.cpp/build/bin/llama-cli" \
--model "$model" \
--threads 8 \
--ctx-size 100000 \
--cache-type-k q4_0 \
--flash-attn \
$model_parameters \
--n-gpu-layers 99 \
--no-warmup \
--color \
--override-tensor ".ffn_.*_exps.=CPU" \
$log_option \
--single-turn \
--prompt-cache "$HOME/Desktop/cached_prompt.txt" \
--file "$tmp_file"


Indeed, I have found that it is usually in unimportant matters that there is a field for the observation, and for the [end of text]


llama_perf_sampler_print: sampling time = 3043.28 ms / 64525 runs ( 0.05 ms per token, 21202.46 tokens per second)
llama_perf_context_print: load time = 2073845.08 ms
llama_perf_context_print: prompt eval time = 2060734.84 ms / 34180 tokens ( 60.29 ms per token, 16.59 tokens per second)
llama_perf_context_print: eval time = 9030278.52 ms / 30344 runs ( 297.60 ms per token, 3.36 tokens per second)
llama_perf_context_print: total time = 11125945.63 ms / 64524 tokens


Why did it stop at 64524 tokens?
Replies: >>105704320 >>105704489 >>105704731
Anonymous
6/25/2025, 10:54:45 PM No.105704276
>>105704259
>diverse
Diversity is our strength
Anonymous
6/25/2025, 10:58:47 PM No.105704320
>>105704272 (me)

I gave it 142 kb of English text as a prompt which perfectly translates in 34k tokens (4:1)
Replies: >>105704489
Anonymous
6/25/2025, 10:59:04 PM No.105704323
>>105704157
Both of you need to stop posting, assuming you aren't actually the same person.
Replies: >>105704393 >>105704408
Anonymous
6/25/2025, 11:06:34 PM No.105704393
>>105704323
I agree that those posts are very unsafe
Anonymous
6/25/2025, 11:07:49 PM No.105704408
>>105704323
My post is deleted and his not, this is your proof.
Replies: >>105704433
Anonymous
6/25/2025, 11:07:59 PM No.105704410
There's someone who lives in the threads that sometimes makes posts that are unironically anti local models, anti open source, etc. It would be funny if that was the same person as the guy who's shitposting today. It probably is.
Replies: >>105704446
Anonymous
6/25/2025, 11:09:44 PM No.105704433
>>105704408
That's cool. But next time you don't need to egg him on. I suppose this advice won't be followed though.
Anonymous
6/25/2025, 11:11:15 PM No.105704446
>>105704410
No one cares about random thread on 4chan, local janny does the job just fine by killing any discussion that is not about his favorite anime waifu or whatever.
Anonymous
6/25/2025, 11:14:47 PM No.105704487
uhh...uh...uhhh
uhh...uh...uhhh
md5: 342299e5a0b62a7a34a8a424076e5a53🔍
>>105703565
>its against the rules to post cropped porn
That's news to me.
Anonymous
6/25/2025, 11:14:57 PM No.105704489
>>105704272
DS I assume. When you use something other than the context length from the config.json, llama.cpp tells you about in in the logs. If you use a higher one, it clamps it down to the default. If lower, it just lets you know that you could use more. So check the model loading bit, see if you find anything related to that. Mostly to make sure the random quant didn't have a fucked conversion or whatever. Or the model just got bored.
>>105704320
>(4:1)
Check your math, or your units (prompt 34180 tokens, eval 30344 runs)
Replies: >>105704545 >>105704568
Anonymous
6/25/2025, 11:21:41 PM No.105704545
>>105704489
>>(4:1)
>Check your math, or your units (prompt 34180 tokens, eval 30344 runs)

prompt eval time = 2060734.84 ms / 34180 tokens

I'm not going to divide 34180 by 1024
Anonymous
6/25/2025, 11:23:44 PM No.105704568
>>105704489
>So check the model loading bit
this?

llama_context: constructing llama_context
llama_context: n_seq_max = 1
llama_context: n_ctx = 100000
llama_context: n_ctx_per_seq = 100000
llama_context: n_batch = 2048
llama_context: n_ubatch = 512
llama_context: causal_attn = 1
llama_context: flash_attn = 1
llama_context: freq_base = 10000.0
llama_context: freq_scale = 0.025
llama_context: n_ctx_per_seq (100000) < n_ctx_train (163840) -- the full capacity of the model will not be utilized
Replies: >>105704727
Anonymous
6/25/2025, 11:26:36 PM No.105704587
>>105704582
>>105704582
>>105704582
Replies: >>105704600 >>105704649
Anonymous
6/25/2025, 11:28:42 PM No.105704600
schizo
schizo
md5: 5106ea5437889c0ebd822d7dbe929c54🔍
>>105704587
Schizo thread.
Replies: >>105704611 >>105704626 >>105705267 >>105705288 >>105705308 >>105705340
Anonymous
6/25/2025, 11:30:25 PM No.105704611
>>105704600
so /lmg/ thread?
Anonymous
6/25/2025, 11:31:53 PM No.105704626
>>105704600
schizophrenia is most common in those of jewish descent
there's hardly any doubt what the schizo is
Anonymous
6/25/2025, 11:34:32 PM No.105704649
>>105704587
Uh oh meltie
Replies: >>105704690
Anonymous
6/25/2025, 11:34:56 PM No.105704650
>its DA JOOOS
lmao
Anonymous
6/25/2025, 11:36:59 PM No.105704680
>>105704054
>there is a 100% chance that even if miguposting stopped right now, he'd just find something else to bitch about
Was tried before and he just kept going on about miku and "troons" unprompted. He doesn't care about LLMs or even his /pol/tard culture war drama. He just wants attention.
Replies: >>105704696 >>105704702
Anonymous
6/25/2025, 11:37:51 PM No.105704690
file
file
md5: 0acc866956e9b4aa6d0441437afdd569🔍
>>105704649
>Uh oh meltie
Replies: >>105704707
Anonymous
6/25/2025, 11:38:18 PM No.105704696
>>105704680
Proofs?
Replies: >>105704709
Anonymous
6/25/2025, 11:38:53 PM No.105704702
>>105704680
>Was tried before
When was this period when this thread wasn't spammed with this shitty mascot? Was it like 20 minutes when OP was to busy dolling himself up?
Anonymous
6/25/2025, 11:39:38 PM No.105704707
1744034360407411
1744034360407411
md5: 60e0d233b389256b12420880c21aef9a🔍
>>105704690
Anonymous
6/25/2025, 11:39:48 PM No.105704709
>>105704696
No one proved wrong this one >>105704210 so i doubt he will say anything of matter this time.
Anonymous
6/25/2025, 11:41:34 PM No.105704727
>>105704568
Yeah. I was expecting n_ctx_train to be ~64k, but no. So no idea. Considering how long it went, it doesn't seem to be a broken quant. I suppose you could try to run it with --ignore-eos if it really generated an eos, but you're gonna have to stop it at some point, or set --predict to 60k or whatever. Or if you run it on llama-server, whenever you get the EOS you can just inspect the probs and see what the deal is. Maybe sampling fucked up. DS seems to recommend 0.6, but llama.cpp defaults to 0.8, which is now considered high with some models.
Anonymous
6/25/2025, 11:41:48 PM No.105704731
>>105704272
Problem here is the fact you blindly type --n-gpu-layers 99
You need to set this to some NORMAL value. not 99. no matter how much vram you have.
This is why it's slower.
Cretins like you shouldn't have hardware because you don't know what is going on.
Replies: >>105704742 >>105704751
Anonymous
6/25/2025, 11:42:59 PM No.105704742
>>105704731
>This is why it's slower.
Nothing to do with his question.
Anonymous
6/25/2025, 11:43:55 PM No.105704751
>>105704731
lol no
99 works fine for all of us
it will load as many layers as it has regardless
Anonymous
6/26/2025, 12:44:22 AM No.105705267
>>105704600
kek
Anonymous
6/26/2025, 12:46:06 AM No.105705288
>>105704600
based
Anonymous
6/26/2025, 12:48:02 AM No.105705308
>>105704600
Tranny baker spamming there
Anonymous
6/26/2025, 12:50:20 AM No.105705340
>>105704600
Man it still hasn't been deleted. Jannies wake up.
Replies: >>105705362
Anonymous
6/26/2025, 12:51:04 AM No.105705347
It looks like the Deepseek-less poorfags are going crazy. I guess that's what happens if you have nothing but Nemo for a whole year.
/lmg/ will be better off once you've all killed each other.
Anonymous
6/26/2025, 12:52:56 AM No.105705362
R0SeZ4qF3K
R0SeZ4qF3K
md5: 1c599df571f94d486c9cc254048622da🔍
>>105705340
>mods pls censor things i don't like :(
Off yourself.
Anonymous
6/26/2025, 12:53:45 AM No.105705369
DDR6 will save us.
Anonymous
6/26/2025, 1:00:51 AM No.105705428
it's just good that there's nothing to talk about anyway
maybe it's time to retire /lmg/ and just have a thread for the four times a year something worth talking about gets released
Anonymous
6/26/2025, 1:04:39 AM No.105705451
but then what general are you going to try to shitpost to death if you don't have /lmg/ to do it?
Replies: >>105705601
Anonymous
6/26/2025, 1:27:42 AM No.105705601
>>105705451
/ldg/
Anonymous
6/26/2025, 2:17:28 AM No.105705923
>>105702124
Magistral 3.2 is the best I've used thus far.
Anonymous
6/26/2025, 2:51:14 AM No.105706139
Most importantly, four days left until Ernie 4.5/X1 get released as open source