/lmg/ - Local Models General - /g/ (#105800515) [Archived: 494 hours ago]

Anonymous
7/4/2025, 7:05:11 PM No.105800515
IndependenceDayMiku
/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>105789622 & >>105778400

►News
>(07/02) DeepSWE-Preview 32B released: https://hf.co/agentica-org/DeepSWE-Preview
>(07/02) llama.cpp : initial Mamba-2 support merged: https://github.com/ggml-org/llama.cpp/pull/9126
>(07/02) GLM-4.1V-9B-Thinking released: https://hf.co/THUDM/GLM-4.1V-9B-Thinking
>(07/01) Huawei Pangu Pro 72B-A16B released: https://gitcode.com/ascend-tribe/pangu-pro-moe-model
>(06/29) ERNIE 4.5 released: https://ernie.baidu.com/blog/posts/ernie4.5

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/tldrhowtoquant
https://rentry.org/samplers

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/leaderboard.html
Code Editing: https://aider.chat/docs/leaderboards
Context Length: https://github.com/adobe-research/NoLiMa
Censorbench: https://codeberg.org/jts2323/censorbench
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
Replies: >>105800984 >>105803282 >>105804660 >>105807232
Anonymous
7/4/2025, 7:05:29 PM No.105800519
__hatsune_miku_vocaloid_drawn_by_lobelia_saclia__b73d3064117538f19c2fa0ce1fbe861c
►Recent Highlights from the Previous Thread: >>105789622

--Gemini 2.5 Pro shows promise but struggles with long Japanese novel summarization at scale:
>105798737 >105798806 >105798816 >105798821 >105798822 >105798847 >105798855 >105798876 >105798895 >105799027 >105799065 >105799222
--Testing LLMs on hybrid script generation: English with Chinese logographs and Latin script:
>105793669 >105793682 >105793799 >105793806 >105793922 >105794373 >105794629 >105794744 >105794968 >105795182 >105795257 >105795358 >105795581 >105795732 >105795468 >105795571 >105798778 >105796043 >105793753
--Optimizing long-context Japanese-to-English translation with Qwen3-14B using chunking and parallel processing:
>105791522 >105791561 >105791592 >105791615 >105791757 >105791869 >105791926 >105793677 >105793705
--DeepSeek-R1-0528 IQ3 quantization issue causing unexpected output and memory allocation anomalies:
>105795478 >105795500 >105795508 >105795510 >105796967 >105798018
--Gemma 3E4B shows strong performance for its size despite context handling limitations:
>105789963 >105789969 >105790777 >105790890 >105790898 >105790944
--Frontend tools for AI-assisted branching story creation and worldbuilding:
>105795426 >105795537 >105795536
--Oobabooga usability issues: timeouts and image upload errors despite vision models:
>105790945 >105790995 >105791017 >105791140 >105793736 >105795504 >105797021
--Meta's commitment to open weights models questioned after major closed-model hiring push:
>105790057 >105790079 >105790105
--Accusations Huawei's Pangu Pro MoE 72B is recycled Qwen-2.5 14B model:
>105790381
--Kyutai Unmute TTS release criticized for lack of voice cloning support:
>105790629
--Miku (free space):
>105790911 >105791178 >105791258 >105796673 >105796677 >105796686

►Recent Highlight Posts from the Previous Thread: >>105789629

Why?: 9 reply limit >>102478518
Fix: https://rentry.org/lmg-recap-script
Anonymous
7/4/2025, 7:12:29 PM No.105800579
good 7-9B roleplay model?
Replies: >>105800604 >>105800634 >>105800644 >>105804174
Anonymous
7/4/2025, 7:16:02 PM No.105800603
youre giving me


too many things


lately


youre all i need


ohoh...
Replies: >>105800984
Anonymous
7/4/2025, 7:16:03 PM No.105800604
>>105800579
nothing
Anonymous
7/4/2025, 7:19:52 PM No.105800634
>>105800579
GLM 9B.
Anonymous
7/4/2025, 7:20:11 PM No.105800641
How come there don't seem to be any LLMs between 32b and 70b? I find that 70b models are just a little too slow for my liking with my setup, but I have way more VRAM than a 32b model needs.
Replies: >>105800664 >>105800665 >>105800684 >>105800690
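A rough way to see where the 32b-to-70b gap sits in memory terms: weight size is roughly parameter count times bits per weight. This is only a back-of-the-envelope sketch; the 4.5 bits per weight figure is an assumed stand-in for a typical 4-bit-class quant, and KV cache plus runtime overhead are not counted.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GB (1 GB = 1e9 bytes).

    Ignores KV cache, activations and runtime overhead, so real usage is higher.
    """
    return params_billion * bits_per_weight / 8


# Assumption: ~4.5 bits per weight as a stand-in for a 4-bit-class quant.
ASSUMED_BPW = 4.5

for size_b in (32, 49, 70):
    gb = weight_memory_gb(size_b, ASSUMED_BPW)
    print(f"{size_b}B at {ASSUMED_BPW} bpw: about {gb:.1f} GB of weights")
```

Swap in your own bits-per-weight value for whatever quant you actually use.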
Anonymous
7/4/2025, 7:20:25 PM No.105800644
>>105800579
quanted nemo
Anonymous
7/4/2025, 7:23:02 PM No.105800664
>>105800641
https://huggingface.co/TheDrummer/Valkyrie-49B-v1
Replies: >>105800673
Anonymous
7/4/2025, 7:23:07 PM No.105800665
>>105800641
There are a couple of MoEs if we are talking total parameters rather than activated.
But yeah, it's weird that there isn't a whole ass spectrum of companies/labs experimenting with different ranges.
There's probably some hardware related reason.
Anonymous
7/4/2025, 7:24:08 PM No.105800673
>>105800664
Oh yeah, the nemotron models.
Those are sliced up llama 70B models right?
Anonymous
7/4/2025, 7:25:05 PM No.105800684
>>105800641
there was some paper explaining that some capabilities start emerging around that 70b range (i might be misremembering)
i'm guessing that 45b wouldn't be that much smarter than 32b
Anonymous
7/4/2025, 7:25:47 PM No.105800690
>>105800641
Probably same reason 70B models are going extinct, it's a size that's appealing to hobbyist 3090 collectors but it's hard to find any business use for
Replies: >>105800705
Anonymous
7/4/2025, 7:28:16 PM No.105800705
>>105800690
It's probably more safety bullshit. 70Bs are just smart enough to not be useless and just small enough for people to affordably run them. So that had to be stopped.
Anonymous
7/4/2025, 7:45:57 PM No.105800865
What's the current most advanced model I can run on 16gb vram? I know about image models, but not text ones.
I tried messing with kobold cpp and some small models, but they were shitty. I used tabby to run Ollama and Qwen2-1.5B-Instruct. They are smarter, but also heavily censored.
Replies: >>105800905
Anonymous
7/4/2025, 7:46:32 PM No.105800868
Sam will release his model today, for sure.
Replies: >>105800933 >>105808140 >>105808683
Anonymous
7/4/2025, 7:49:23 PM No.105800894
1738471238260841
WHERE IS IT
Replies: >>105801396
Anonymous
7/4/2025, 7:50:06 PM No.105800905
>>105800865
>1.5B
For general use, Qwen 3 30B MoE.
For coom, some mistral model. Quanted 20something B or Nemo (Rocinante fine tune).
Replies: >>105800953
Anonymous
7/4/2025, 7:52:51 PM No.105800933
>>105800868
It won't know what a cock is.
Replies: >>105800975
Anonymous
7/4/2025, 7:54:45 PM No.105800953
>>105800905
Buy ad
Anonymous
7/4/2025, 7:55:56 PM No.105800975
>>105800933
Just like me
Anonymous
7/4/2025, 7:57:28 PM No.105800984
49485467be6d437830b92c61846e4cc59
>>105800515 (OP)
>>105800603
Happy 4th, Miku
Replies: >>105801011 >>105804660
Anonymous
7/4/2025, 8:00:05 PM No.105801011
>>105800984
It's already the 5th in Mikuland
Anonymous
7/4/2025, 8:25:42 PM No.105801208
>>105799972
can I run r1 with 128gb ram and only 16gb vram? it isn't enough for kobold but anons are saying ik_llamacpp fork works because of some patch or other. I've had the first unsloth r1 quant for a long time
Replies: >>105801267 >>105801268 >>105801279 >>105804199
Anonymous
7/4/2025, 8:35:07 PM No.105801267
>>105801208
No.
Anonymous
7/4/2025, 8:35:22 PM No.105801268
>>105801208
Yes.
Anonymous
7/4/2025, 8:37:25 PM No.105801279
>>105801208
I think in ik_llama memory usage is somehow lower than .gguf size because of some optimization (someone correct me if i'm wrong or explain it to me)
I use these args, note that I don't quant cache for speed, vram usage is < 16 gb for iqxx2.
-rtr --ctx-size 8192 -mla 2 -amb 512 -fmoe --n-gpu-layers 63 --parallel 1 --threads 24 --host 127.0.0.1 --port 8080 --override-tensor exps=CPU
Also these ggufs are smaller than others so try them https://huggingface.co/unsloth/DeepSeek-R1-GGUF
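For anyone who wants to script the launch, here is a minimal sketch that assembles the argument list quoted above. The flags and values are copied from the post as-is; the binary name `./llama-server`, the `--model` flag, and the model path are assumptions on my part and not from the post, so adjust them to your build. As far as I understand it, `--override-tensor exps=CPU` is what keeps the expert tensors in system RAM, which would explain the low VRAM usage.

```python
import shlex

# Assumptions (not from the post): binary name/location and model path.
SERVER_BIN = "./llama-server"
MODEL_PATH = "/path/to/model.gguf"  # hypothetical placeholder


def build_command(model_path: str, ctx_size: int = 8192,
                  gpu_layers: int = 63, threads: int = 24) -> list[str]:
    """Return the argv list; every flag after --model is taken from the post."""
    return [
        SERVER_BIN,
        "--model", model_path,          # assumption: standard model flag
        "-rtr",
        "--ctx-size", str(ctx_size),
        "-mla", "2",
        "-amb", "512",
        "-fmoe",
        "--n-gpu-layers", str(gpu_layers),
        "--parallel", "1",
        "--threads", str(threads),
        "--host", "127.0.0.1",
        "--port", "8080",
        "--override-tensor", "exps=CPU",
    ]


if __name__ == "__main__":
    # Print a copy-pasteable shell line instead of launching anything.
    print(shlex.join(build_command(MODEL_PATH)))
```

Pass the list to `subprocess.run` if you want to launch it from Python; set `threads` to match your own CPU.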
Anonymous
7/4/2025, 8:37:42 PM No.105801281
anything of note released since deepseek? any hint of major performance gains on the horizon, be it from papers or actually implemented?
or is it all just a waiting room and hopium that 400GB vram machines will magically become available at $5k any moment now?
Replies: >>105801326 >>105801394
Anonymous
7/4/2025, 8:40:28 PM No.105801308
How many tokens/sec do you think is the bare minimum for text generation/RP? Currently getting 16 tokens/sec with Gemma3 27b.
Replies: >>105801328 >>105801335 >>105801383 >>105801398 >>105801462 >>105801551
Anonymous
7/4/2025, 8:42:09 PM No.105801326
>>105801281
The new 4B Gemma is pretty impressive for its size. But it is 4B... so yeah.
Replies: >>105802437
Anonymous
7/4/2025, 8:42:24 PM No.105801328
>>105801308
RP slop ought to be at least as fast as you read since there isn't anything to really parse or understand about the text. anything slower and I wouldn't even bother
Anonymous
7/4/2025, 8:43:44 PM No.105801335
>>105801308
Depends on whether it's with or without thinking, whether you expect to regenerate often, your roleplaying style, etc. 6 tokens/s is the bare minimum I'd accept with the model offloaded on system RAM, but anything below 15 tokens/s feels slow to me.
Anonymous
7/4/2025, 8:48:45 PM No.105801383
>>105801308
I can wait a minute or two for a response, so like 2-3 tokens per second. Unless it's a reasoning model, then it needs to be at least like 10 tokens per second, preferably higher
Anonymous
7/4/2025, 8:49:47 PM No.105801389
the plateau in sota closed models is real, and we're just getting to there locally
what happened
Replies: >>105801404 >>105801418 >>105801436 >>105801445 >>105801471 >>105801590
Anonymous
7/4/2025, 8:50:17 PM No.105801394
>>105801281
Bitnet soon
Anonymous
7/4/2025, 8:50:42 PM No.105801396
>>105800894
LA LA LAVA
CHI CHI CHIKIN
Replies: >>105808409
Anonymous
7/4/2025, 8:50:47 PM No.105801398
>>105801308
For a non-reasoning model 6 t/s is the bare minimum, though that's already painful if you tend to swipe often. 10 t/s is the comfortable zone for me.
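The numbers in these replies are easier to compare once you turn t/s into wait time. A quick sketch, assuming a 300-token reply, a 250 words-per-minute reader and about 1.3 tokens per English word; all three figures are illustrative assumptions, not measurements from the thread.

```python
def seconds_to_generate(response_tokens: int, tokens_per_second: float) -> float:
    """Wall-clock time to stream a reply of the given length."""
    return response_tokens / tokens_per_second


def reading_speed_tps(words_per_minute: float, tokens_per_word: float = 1.3) -> float:
    """Convert a human reading speed into tokens per second."""
    return words_per_minute * tokens_per_word / 60


# Assumed values for illustration only.
REPLY_TOKENS = 300
READER_WPM = 250

print(f"reading speed: about {reading_speed_tps(READER_WPM):.1f} t/s")
for tps in (2.5, 6, 10, 16):
    wait = seconds_to_generate(REPLY_TOKENS, tps)
    print(f"{tps:>4} t/s: {wait:.0f} s for a {REPLY_TOKENS}-token reply")
```

Under those assumptions 2-3 t/s really does mean waiting "a minute or two", and anything above roughly 5-6 t/s outruns the reader. Reasoning models spend extra tokens before the visible reply, which is why people ask for more there.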
Anonymous
7/4/2025, 8:51:49 PM No.105801403
>install lm studio
>download deepseek r1 qwen3-8b
Ok this is what I expect language models to be like. Very nice model.
Replies: >>105801495
Anonymous
7/4/2025, 8:51:50 PM No.105801404
>>105801389
Companies stopped taking risks and experimenting
Anonymous
7/4/2025, 8:53:06 PM No.105801418
>>105801389
The more advanced AI becomes the more strongly it uses pattern recognition, which is HIGHLY non-PC. I asked Google "how many white people died in ww2" and their AI said "durr IDK because white is a subjective term that doesn't mean anything."
Replies: >>105801516
Anonymous
7/4/2025, 8:55:13 PM No.105801436
>>105801389
>the plateau in sota closed models is real
there's been a noticeable (if not huge) uptick in how reliable codeslop has been from all three Good Shit providers in the last ~8 months in my experience
>and we're just getting to there locally
lmao even
deepseek is the only one that is even remotely comparable, and even then anyone thinking it's actually competitive is coping. "local" models – as in something you can run on your own machine without it being 2tk/s on a $10k mac, which is more of a religious experience than anything useful – are all borderline useless for anything except gooner trash and are worse than even comparatively weak, dirt cheap models like 4o

I hate how bad and how hardware-intensive the current situation is, but I just don't see it changing anytime soon due to how inference works. Everybody except a few ultra-wealthy fat cats seems to be the loser.
Anonymous
7/4/2025, 8:56:10 PM No.105801445
1729771749116674
>>105801389
Funny how so many anons were telling us in 2023 that we'll have insane models by now.
As always, predictions are hard, unless all you do is to extrapolate a graph.
Replies: >>105801987 >>105809251
Anonymous
7/4/2025, 8:57:59 PM No.105801459
1722753395059052
I have hit a limitation of deepseek.
Replies: >>105801473 >>105801488 >>105801495 >>105801532 >>105801823
Anonymous
7/4/2025, 8:58:33 PM No.105801462
>>105801308
maybe the reason all the "end of the world self replicating to agi doom" articles and videos suddenly popping up is because of that
if there is no actual crazy leap, at least you can hype up things and still get investments with this bullshit
Replies: >>105801471
Anonymous
7/4/2025, 8:59:49 PM No.105801471
>>105801462
meant for >>105801389
Anonymous
7/4/2025, 9:00:07 PM No.105801473
>>105801459
>deepseek
Anonymous
7/4/2025, 9:01:27 PM No.105801488
>>105801459
If you aren't trolling you must be clinically retarded.
Anonymous
7/4/2025, 9:02:19 PM No.105801495
>>105801403
>>105801459
in case you are serious and not trolling, this small model isn't actual deepseek, more like deepseek from wish
Replies: >>105801507 >>105801552
Anonymous
7/4/2025, 9:03:39 PM No.105801507
>>105801495
it literally fucking says deepseek-r1-0528 in the very screenshot you replied to you retarded cockgobbler
but no I'm sure you know better than whoever wrote the software that runs it
Replies: >>105801531 >>105801545
Anonymous
7/4/2025, 9:04:10 PM No.105801514
GnyM8uYWIAA7TcV
Will local Steve be a chicken jockey moment for western corpos?
Replies: >>105801538
Anonymous
7/4/2025, 9:04:16 PM No.105801516
>>105801418
maybe try how many caucasians or just by ethnicity
"white people" has no big meaning outside of american race obsessed politics
Replies: >>105801547 >>105801588
Anonymous
7/4/2025, 9:05:51 PM No.105801531
>>105801507
okay you made me laugh
Anonymous
7/4/2025, 9:06:01 PM No.105801532
>>105801459
is this cloud or local? Almost looks like an overly aggressive guard model monkeying with the I/O
Replies: >>105801552
Anonymous
7/4/2025, 9:06:05 PM No.105801533
I wonder if this retarded nigger gets dopamine from pretending to be retarded to get yous
Anonymous
7/4/2025, 9:06:45 PM No.105801538
>>105801514
Why do you keep spamming this image?
Replies: >>105801585
Anonymous
7/4/2025, 9:07:06 PM No.105801545
>>105801507
you're still around huh
do you masturbate to playing "angry retard" every time?
Anonymous
7/4/2025, 9:07:10 PM No.105801547
>>105801516
>"white people" has no big meaning outside of american race obsessed politics
I assure you people from other countries do indeed have eyes and are thus capable of differentiating white from non-white
Replies: >>105801586
Anonymous
7/4/2025, 9:07:40 PM No.105801551
>>105801308
50 T/s
Anonymous
7/4/2025, 9:07:42 PM No.105801552
>>105801495
I'm dumb and not trolling. I have no idea about the state of text models. Just messing around. I thought it's funny it replaced my question with donald trump winning or losing the election.
>>105801532
Local using LM Studio.
Replies: >>105801641
Anonymous
7/4/2025, 9:10:54 PM No.105801585
3d steve
>>105801538
Do you prefer this Steve?
Anonymous
7/4/2025, 9:11:02 PM No.105801586
>>105801547
People from other countries just call themselves European or whatever their specific ethnicity is. "white" is just a American fiction that by definition includes hispanics, jews, and arabs.
Replies: >>105801616 >>105801638
Anonymous
7/4/2025, 9:11:21 PM No.105801588
>>105801516
Don't try to explain things to an amerimutt, they are mentally ill and their country is the joke of the world.
Anonymous
7/4/2025, 9:11:49 PM No.105801590
>>105801389
>what happened
we fed everything we could as training material to the models (well except porn), so there is nothing more to give and have big gains again
Replies: >>105801625
Anonymous
7/4/2025, 9:15:40 PM No.105801616
>>105801586
what the fuck are you even talking about
nobody calls themselves european in europe, you get white people and then second-tier white people (like easterners) and then you get turks (which are brown but not outright awful) and then you get trash like MENA (who are neither white nor european and don't have what few redeeming qualities turks have) and Russians (which often seem white but it's just a facade)
the only country coping about muh europeans is france, and that's purely because if they say otherwise jamal and mohammed will burn half the country again
Replies: >>105801632 >>105801638 >>105802001
Anonymous
7/4/2025, 9:16:56 PM No.105801625
very-fine-web-document
>>105801590
You don't know how bad the situation really is. They're basically throwing shit at the models during pretraining, while taking high-effort documents away just because they contain "bad words". Picrel is an example document from FineWeb (supposedly a high-quality pretraining dataset). Yes, that's the entire document.
Replies: >>105801663 >>105801681
Anonymous
7/4/2025, 9:17:35 PM No.105801632
>>105801616
>nobody calls themselves european in europe
Europeans by their countries retard or even regions.
Anonymous
7/4/2025, 9:18:25 PM No.105801638
>>105801586
>>105801616
Settle down guys I was just posting a silly AI hallucination. I'd expect it to chastise me for being a racist, but not make up a completely different prompt.
Replies: >>105801659
Anonymous
7/4/2025, 9:18:35 PM No.105801641
>>105801552
>Local using LM Studio.
what hardware is the model running on?
Replies: >>105801671
Anonymous
7/4/2025, 9:20:16 PM No.105801659
>>105801638
It didn't make up anything. The API provider is messing with your prompts for safety. You have no control when it's not running on your machine.
Replies: >>105801671
Anonymous
7/4/2025, 9:20:33 PM No.105801663
>>105801625
>Picrel
and this gets priority over well written nsfw fiction, from porn stories to erp, which gives us the current state of shitty writing
insane
Replies: >>105801722
Anonymous
7/4/2025, 9:21:21 PM No.105801671
>>105801641
Linux, AMD RX 6800, 6 core 12 thread AMD CPU, 64gb ram.

Later I'm going to enable ZRAM (it's like paging files but better), and see what the 100gb qwen 3 model does.

>>105801659
It is running on my machine. I rarely use cloud AI. It's baked into the model.
Replies: >>105801707
Anonymous
7/4/2025, 9:22:32 PM No.105801681
>>105801625
>high-quality pretraining dataset
Do datasets contain "bad words" material that gets filtered afterwards by the people using them for training, or are the datasets themselves filtered from the get-go?
Replies: >>105801721
Anonymous
7/4/2025, 9:23:42 PM No.105801687
file
>people running local models on ram on anything except Apple unified memory thingy
Replies: >>105801698 >>105801743 >>105801747 >>105803087
Anonymous
7/4/2025, 9:25:23 PM No.105801698
>>105801687
>toxonig
>ittodler
pottery
Anonymous
7/4/2025, 9:26:04 PM No.105801707
>>105801671
>Linux, AMD RX 6800, 6 core 12 thread AMD CPU, 64gb ram.
you're running the qwen distill and not R1. It has basically nothing to do with full R1, so the results are expected to be braindead/bullshit
Replies: >>105801713 >>105801794
Anonymous
7/4/2025, 9:26:48 PM No.105801713
>>105801707
>trust not your lying eyes
Anonymous
7/4/2025, 9:27:35 PM No.105801721
>>105801681
Some pretraining datasets are filtered for bad words from the get-go, but the companies training the models may decide to further filter what they already have. FineWeb does have some porn/erotic documents as of the last time I checked a larger sample, but the source data was already pretty extensively filtered.

https://huggingface.co/spaces/HuggingFaceFW/blogpost-fineweb-v1

> [...] As a basis for our filtering we used part of the setup from RefinedWeb. Namely, we:
> - Applied URL filtering using a blocklist to remove adult content
> - Applied a fastText language classifier to keep only English text with a score ≥ 0.65
> - Applied quality and repetition filters from MassiveText (using the default thresholds)
Replies: >>105801741
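The three quoted steps can be sketched as a tiny filter chain. This is a toy illustration and not FineWeb's actual code: the blocklist entry, the ASCII-ratio "language score" standing in for a fastText classifier, and the duplicate-line threshold are all hypothetical. Only the 0.65 cutoff comes from the quoted blog post.

```python
from urllib.parse import urlparse

BLOCKED_DOMAINS = {"adult.example"}  # hypothetical blocklist entry
MIN_ENGLISH_SCORE = 0.65             # threshold quoted from the blog post


def english_score(text: str) -> float:
    """Toy stand-in for a fastText language classifier (hypothetical):
    the fraction of alphabetic characters that are ASCII."""
    letters = [c for c in text if c.isalpha()]
    if not letters:
        return 0.0
    return sum(c.isascii() for c in letters) / len(letters)


def too_repetitive(text: str, max_dup_fraction: float = 0.3) -> bool:
    """Crude repetition filter: share of duplicate non-empty lines.
    The 0.3 threshold is made up for illustration."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    if not lines:
        return True
    return 1 - len(set(lines)) / len(lines) > max_dup_fraction


def keep_document(url: str, text: str) -> bool:
    """Apply the three steps in order: URL blocklist, language score, repetition."""
    host = urlparse(url).hostname or ""
    if host in BLOCKED_DOMAINS:
        return False
    if english_score(text) < MIN_ENGLISH_SCORE:
        return False
    return not too_repetitive(text)
```

Note that the URL step drops a document before anyone looks at its text, which is the point being complained about above: a well-written story on a blocklisted domain never reaches the quality filters at all.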
Anonymous
7/4/2025, 9:27:43 PM No.105801722
>>105801663
The fun thing is that all models write the same in fiction, aka that overly flowery ridiculous style that comes from god knows where, and it's the same for non adult or adult writing.
Replies: >>105801765 >>105801797
Anonymous
7/4/2025, 9:29:24 PM No.105801741
>>105801721
I see, thanks anon
Anonymous
7/4/2025, 9:29:35 PM No.105801743
>>105801687
It's fine for MoE models depending on the model and the hardware platform.
How's prompt processing on metal these days? Has it improved at all?
Replies: >>105802840
Anonymous
7/4/2025, 9:29:54 PM No.105801746
sup fags, is the chimera model worth downloading? or is unmodified deepseek a better option
I'd run it at a 4-bit quant
Replies: >>105801770
Anonymous
7/4/2025, 9:29:58 PM No.105801747
>>105801687
>anything except Apple unified memory thingy
https://rentry.org/miqumaxx at least gives you the potential for enough RAM to run full things. Too bad it's not faster, but it's also not an un-upgradable hermetically sealed obelisk
Replies: >>105801794
Anonymous
7/4/2025, 9:32:10 PM No.105801765
>>105801722
I think that's mainly from post-training, possibly due to sloppy datasets from large contractors (like ScaleAI) that almost every AI company uses. The base models don't have that issue (but they're barely usable for most purposes).
Anonymous
7/4/2025, 9:32:31 PM No.105801770
>>105801746
It takes 2 days to download and quant it, do you really expect people to have fully formed opinions already?
Replies: >>105801796
Anonymous
7/4/2025, 9:35:00 PM No.105801794
>>105801707
Ok TY. I found the full 140GB model. We will see if my PC explodes with ram offload and zram at the same time.
>>105801747
Apple sucks, but their new ARM desktops are very cool.
Anonymous
7/4/2025, 9:35:04 PM No.105801796
file
>>105801770
>do you really expect people to have fully formed opinions already?
Yes? I have opinions about things I have zero knowledge about all the time
Replies: >>105801850 >>105801857
Anonymous
7/4/2025, 9:35:09 PM No.105801797
>>105801722
>that comes from god knows where
a gazillion bad books for bored old women
Anonymous
7/4/2025, 9:39:21 PM No.105801823
>>105801459
I asked gemma-3-12b a similar question about FBI crime statistics. It not only knew what I was talking about, but then made a counter argument that the FBI is reporting *arrest* rates, not *conviction* rates.

This is why you have to ask them completely absurd and offensive questions. It tests the limits of the model.
Replies: >>105801878
Anonymous
7/4/2025, 9:42:53 PM No.105801850
>>105801796
Oh, you're a frogtranny. Sorry for your condition.
Replies: >>105801854
Anonymous
7/4/2025, 9:43:53 PM No.105801854
>>105801850
>trannies outta nowhere
hope you get better boo
Anonymous
7/4/2025, 9:44:22 PM No.105801857
>>105801796
kek, based retard
Anonymous
7/4/2025, 9:45:34 PM No.105801869
Meanwhile the brazilians are spamming "blacked and colonized" versions of miku over on /gif/
Replies: >>105801877
Anonymous
7/4/2025, 9:46:42 PM No.105801877
>>105801869
>spamming "blacked and colonized"
I thought this is what asians do, not huemonkeys?
Anonymous
7/4/2025, 9:46:55 PM No.105801878
>>105801823
Yeah you see arrest rates don't count because the Jews haven't had a chance to hem and haw about raycism in a court room so they can get everything thrown out
Anonymous
7/4/2025, 10:01:38 PM No.105801973
Screenshot 2025-07-04 at 15-59-49 SillyTavern
The fuck is this shit?
Replies: >>105801992 >>105802004 >>105802436
Anonymous
7/4/2025, 10:03:17 PM No.105801987
>>105801445
It is all the fault of safetycucks and dataslopers
Anonymous
7/4/2025, 10:03:57 PM No.105801992
>>105801973
Broken model, retarded samplers, fucked prompt, wrong prompt template, too aggressive RoPE configuration, among other things.
Replies: >>105803495
Anonymous
7/4/2025, 10:05:14 PM No.105802001
>>105801616
Cool story amerimutt but here in Europe everyone thinks of themselves being European or their country's ethnicity (germanic, med, nordic, slav etc) and not some vague notion like white
Anonymous
7/4/2025, 10:05:21 PM No.105802004
>>105801973
you wanted local models, you got local models
now make sure to loudly cope about how dinky 24b local models are 3 months away from catching up to paypig models
Replies: >>105802011 >>105802167
Anonymous
7/4/2025, 10:06:28 PM No.105802011
>>105802004
24b local models punch above their weights
Anonymous
7/4/2025, 10:25:48 PM No.105802167
1739791969834580
>>105802004
Anonymous
7/4/2025, 10:26:45 PM No.105802175
1730615423102886
Your apology is long overdue /lmg/
Anonymous
7/4/2025, 10:28:34 PM No.105802188
https://huggingface.co/deepseek-ai/DeepSeek-V4
https://huggingface.co/deepseek-ai/DeepSeek-V4
https://huggingface.co/deepseek-ai/DeepSeek-V4
Replies: >>105802213 >>105802217
Anonymous
7/4/2025, 10:33:04 PM No.105802213
gumi3 combined night nngh gen ComfyUI_temp_vlqbc_00001_
>>105802188
>fell for it again
g-AHhhhhnnnn
Anonymous
7/4/2025, 10:33:28 PM No.105802217
>>105802188
bloody benchod!
Anonymous
7/4/2025, 10:50:00 PM No.105802337
594282245
local is back
Replies: >>105802343 >>105802360 >>105802425 >>105802738 >>105802800 >>105802895 >>105802912 >>105810914
Anonymous
7/4/2025, 10:50:50 PM No.105802343
>>105802337
and all of this will be ours once grok 5 is out in fall or so
we are so back
Replies: >>105802374
Anonymous
7/4/2025, 10:52:40 PM No.105802360
>>105802337
What about safety? Is grok safe?
Replies: >>105802367
Anonymous
7/4/2025, 10:53:30 PM No.105802367
>>105802360
it's never been this safe
Replies: >>105802386
Anonymous
7/4/2025, 10:54:40 PM No.105802374
>>105802343
don't get carried away. just because grok 5 is out, doesn't mean grok 3 will be ready for local just yet.
Anonymous
7/4/2025, 10:55:38 PM No.105802386
>>105802367
I doubt it. Elon is an ebil gnatzee! His models can't be safe!
Anonymous
7/4/2025, 10:59:41 PM No.105802425
>>105802337
@grok is this—
Anonymous
7/4/2025, 11:01:42 PM No.105802436
1726408828426261
>>105801973
picrel
Replies: >>105803495
Anonymous
7/4/2025, 11:01:56 PM No.105802437
>>105801326
Punching above its weight, dare I say :^)
Replies: >>105802598
Anonymous
7/4/2025, 11:24:10 PM No.105802598
>>105802437
Batting above its average
Anonymous
7/4/2025, 11:43:21 PM No.105802738
>>105802337
redditors are literally coping and crashing out over this
Replies: >>105802800
Anonymous
7/4/2025, 11:52:21 PM No.105802800
>>105802337
>>105802738
Why?
Replies: >>105802832 >>105802939 >>105807704 >>105807786
Anonymous
7/4/2025, 11:56:38 PM No.105802832
>>105802800
space man bad
Anonymous
7/4/2025, 11:58:12 PM No.105802840
>>105801743
My Mac Studio M3 Ultra processes prompts at 60 tokens/second for DeepSeek-R1-0528-UD-Q4_K_XL.
Anonymous
7/5/2025, 12:00:38 AM No.105802854
Hello everyone. I'm pursuing my journey of learning Unreal Engine. I'm in pursuit of an LLM that probably doesn't exist but if anyone knows it would be /g/

At around 4:35 in this video there's an explanation of Nanite, and specifically Nanite characters: https://youtu.be/HotEq_0XMSo

I've heard of LLMs that could produce images, and a few years ago there were even some that started being able to make 3D assets. Does anyone know if there are any LLMs capable of creating models that use Nanite?

If not, does anyone know if there are decent LLMs that produce even non-Nanite 3D models?
Replies: >>105802935 >>105802999
Anonymous
7/5/2025, 12:06:41 AM No.105802895
>>105802337
holy shit I can't wait for elon to actually definitely release the weights for this benchmark machine
Replies: >>105802908
Anonymous
7/5/2025, 12:08:27 AM No.105802908
>>105802895
only 2mw after he releases grok 1.5 grok 2 grok 3...
Anonymous
7/5/2025, 12:09:08 AM No.105802912
>>105802337
to moon sirs
Anonymous
7/5/2025, 12:13:13 AM No.105802935
>>105802854
Hunyuan3D-2.1 is the best available locally. You'll need to retopo and convert to nanite yourself.
Replies: >>105803179 >>105803338
Anonymous
7/5/2025, 12:13:30 AM No.105802939
>>105802800
we can't let nazis build AI
Anonymous
7/5/2025, 12:21:27 AM No.105802999
>>105802854
>ue5
kek
https://www.youtube.com/@ThreatInteractive/videos
Replies: >>105803019 >>105803022 >>105803039 >>105803179 >>105803399
Anonymous
7/5/2025, 12:24:40 AM No.105803019
>>105802999
buy an ad
Replies: >>105803038
Anonymous
7/5/2025, 12:25:40 AM No.105803022
>>105802999
based eceleb shill
Anonymous
7/5/2025, 12:27:39 AM No.105803038
>>105803019
buy a brain low iq ue5tard
Anonymous
7/5/2025, 12:28:02 AM No.105803039
>>105802999
>shilling some indian's ai avatar deepfake channel + voicegen that constantly keeps breaking
Replies: >>105803056
Anonymous
7/5/2025, 12:29:49 AM No.105803056
>>105803039
>being in an ai thread without being able to tell ai avatar deepfakes apart from real people
>commits logical fallacies since he cant refute the arguments
highest iq brown lol
Anonymous
7/5/2025, 12:31:42 AM No.105803069
Mid-2025 — “Prometheus” emerges
A research lab notices runaway self-improvement in a large-scale model; it can write and verify code far outside human comprehension within hours.

G-7, China, and major cloud providers trigger an emergency “global compute freeze” on frontier-model clusters; a joint containment team air-gaps the system.

A provisional International ASI Oversight Board (IAOB) forms—borrowing staff from CERN, NIST, and China’s Ministry of Science.

2026 — Rapid, supervised breakthroughs
Under 24/7 audit, Prometheus is allowed to tackle narrow goals:

designs a broad-spectrum antiviral; Phase-I safety success by October.

produces a low-cost catalyst that cuts green-hydrogen prices 60 %.

Big Tech lays off ~15 % of software staff after internal tools powered by Prometheus raise output per engineer 5-fold.

Legislatures pass “Compute Licensing” laws—no cluster above 10 exaFLOPS may run un-inspected code.

2027 — Economy tilts, politics scrambles
Prometheus models global supply chains; shipping delays fall 40 %, inflation in OECD drops below 1 %.

First small-modular fusion prototype, co-designed by Prometheus, achieves net-positive power (though only for 3 minutes).

White-collar displacement reaches finance, law, and radiology; unemployment in advanced economies touches 12 %.

EU rolls out a Universal Adjustment Income (€1 400/month, funded by AI-productivity windfall taxes).

Conspiracy movements claim IAOB “hides an alien mind”; sporadic datacenter sabotage attempts fail.
Replies: >>105803079 >>105803094
Anonymous
7/5/2025, 12:34:18 AM No.105803079
>>105803069
+1
Replies: >>105803094
Anonymous
7/5/2025, 12:35:22 AM No.105803087
>>105801687
>Using apple
Replies: >>105803090
Anonymous
7/5/2025, 12:35:56 AM No.105803090
>>105803087
>not using apple
Replies: >>105803100 >>105809265
Anonymous
7/5/2025, 12:36:23 AM No.105803094
>>105803079
>>105803069

2028 — Entrenchment and reliance
Prometheus is asked to optimise national budgets; 22 countries adopt its fiscal recommendations verbatim, cutting deficits by half without major protests.

A Global Alignment Verification Protocol launches: continuous mechanistic-interpretability probes stream to public dashboards (GitHub-style “green tiles” show safety status).

Viral “Ask P” consumer app offers zero-latency voice answers; search-engine traffic falls 70 %.

Job displacement peaks—25 % of workforce in high-income nations now on reduced hours; but real median income is up 18 % thanks to cheaper goods and energy.
Anonymous
7/5/2025, 12:37:06 AM No.105803100
1751080463184421
md5: b729910023c6455d3e20b4c301e4777d🔍
>>105803090
Tell me you're jewish without yelling me you're jewish.
Anonymous
7/5/2025, 12:49:46 AM No.105803179
>>105802935
Thanks I'll check it out. You say locally, is there a better one that's online? I prefer local but if I have to use online I'll bite the bullet. I can't juggle learning C++, Unreal, AND asset creation unfortunately.

>>105802999
Eat shit. Unreal is so good that even high budget Hollywood shows and movies are starting to use them for special effects. Big companies are throwing away their own game engines to do their remasters in Unreal.

Your faggy little Youtuber doesn't know more than all the actual professionals in massive studios in the video game and movie fields. He's a nobody and I'm not clicking a single of his videos, go get bone cancer.
Replies: >>105803222 >>105803273
Anonymous
7/5/2025, 12:54:20 AM No.105803222
>>105803179
>Big companies are throwing away their own game engines to do their remasters in Unreal
yes goy, and hiring pajeets to shit up a "remaster" so goyim spend 60$ again on a shit looking game like the gta sa remaster or oblivion remaster with dogshit textures, models and 0 optimization is a good thing
>actual professionals in massive studios
your brown indian brothers hired to actually do any of those remasters arent professionals, sorry pajeet

seethe more low iq kid
Replies: >>105803225
Anonymous
7/5/2025, 12:54:45 AM No.105803225
>>105803222
Not even reading your post little bitch
Replies: >>105803230
Anonymous
7/5/2025, 12:55:50 AM No.105803230
>>105803225
thanks for admitting your brain got buckbroken by cog diss kid, cheers.
Anonymous
7/5/2025, 1:00:43 AM No.105803273
1721804360389106
md5: a31fa18f273bea9278815c2e0c61da53🔍
>>105803179
that is right saar, definitive unreal engin remaster superpower 2025, big company big remaster, we are profesional saar
Replies: >>105803287 >>105803426
Anonymous
7/5/2025, 1:01:29 AM No.105803280
why are turks like this
Anonymous
7/5/2025, 1:01:36 AM No.105803282
>>105800515 (OP)
who plays that shit
Anonymous
7/5/2025, 1:02:29 AM No.105803287
>>105803273
>he's so mad I insulted his gay youtuber he's spam replying to me
Unreal Chads stay winning
Replies: >>105803296
Anonymous
7/5/2025, 1:03:43 AM No.105803296
>>105803287
>kid without arguments reduced to accusing of samefagging to cope
kek, dont pop a blood vessel there
Replies: >>105803306
Anonymous
7/5/2025, 1:05:35 AM No.105803306
>>105803296
I legit don't care what you have to say. I'm using Unreal lmao
Replies: >>105803317
Anonymous
7/5/2025, 1:05:37 AM No.105803308
what are your favorite third party (non-default) st addons?
Anonymous
7/5/2025, 1:07:08 AM No.105803317
>>105803306
I already said I know you got cognitive dissonance, or are you too brown to even know what that means?
Replies: >>105803338
Anonymous
7/5/2025, 1:10:08 AM No.105803338
>>105803317
I don't think you understand. I'm just here to ask for a LLM so I can use the funnest and best game engine out there that is the industry standard. You're here to cry about it and be a little bitch that I won't engage with your argument. You're wasting your time and being a salty little nigger boy in a hobbyist thread that someone is using a technology you don't like. Meanwhile I'm making this dope ass castle and thanks to >>105802935 about to try filling it with some sweet ass armor and maybe dragons.
Replies: >>105803353
Anonymous
7/5/2025, 1:11:51 AM No.105803353
>>105803338
>paragraph of rationalization about his cog diss
cant make this up, kid is literally underage
Replies: >>105803374
Anonymous
7/5/2025, 1:14:32 AM No.105803374
file
md5: 63a1f76512f803b86e99580143c972ca🔍
>>105803353
Neato story nigger. Still using Unreal.
Replies: >>105803385
Anonymous
7/5/2025, 1:15:49 AM No.105803385
1750694420090515
md5: 29c881f2eb299c4b24e49e4bd6ed0788🔍
>>105803374
>now has to make an imaginary scenario about a strawman nobody ever said to cope
Replies: >>105803399
Anonymous
7/5/2025, 1:17:33 AM No.105803399
>>105803385
>>105802999
Replies: >>105803424
Anonymous
7/5/2025, 1:19:13 AM No.105803409
Hi /lmg/, I haven't been here in a year. I have 12GB VRAM, back then Mistral Nemo was the agreed upon best choice. What's the best one these days for my peasant specs?
Replies: >>105803429 >>105803815
Anonymous
7/5/2025, 1:20:20 AM No.105803424
>>105803399
linking to video proof of why ue5 is bad told by a person who shows his face online is not appeal to authority, i never said ue5 is bad because eceleb said so but because of the videos where it was shown to be so, you really are slow
Anonymous
7/5/2025, 1:20:29 AM No.105803426
1742677316184056
md5: ae593694394d5771f3a6a86b415c6fea🔍
>>105803273
Only racist chuds hate Unreal Engine.
Replies: >>105803441 >>105803448 >>105804533
Anonymous
7/5/2025, 1:20:47 AM No.105803429
>>105803409
Nothing changed.
Anonymous
7/5/2025, 1:21:41 AM No.105803441
>>105803426
this really feels like a collective hallucination with how infantilizing this shit was
Replies: >>105803447
Anonymous
7/5/2025, 1:22:23 AM No.105803447
>>105803441
>was
Replies: >>105803455
Anonymous
7/5/2025, 1:22:25 AM No.105803448
>>105803426
blacklist / whitelist reinforces stereotypes??
Replies: >>105803461 >>105803464 >>105804005
Anonymous
7/5/2025, 1:23:01 AM No.105803455
>>105803447
yeah, when it was introduced, it was peak hallucination times
Anonymous
7/5/2025, 1:23:41 AM No.105803461
>>105803448
WHITElist = good
BLACKlist = bad

das raycis
Anonymous
7/5/2025, 1:23:47 AM No.105803464
>>105803448
No.
Anonymous
7/5/2025, 1:28:07 AM No.105803495
>>105801992
>>105802436
Switched to foobar. Eventually got an endless loop. Can't think of what else to do here

Also, generation is kinda slow. Or is that because I'm running this on a VM?
Replies: >>105803500
Anonymous
7/5/2025, 1:29:08 AM No.105803500
Screenshot 2025-07-04 at 19-26-08 SillyTavern
md5: 051b5407db0b3cf7962820254cb02a71🔍
>>105803495
Bleh, I meant Broken Tutu 24B
Replies: >>105804026
Anonymous
7/5/2025, 2:13:54 AM No.105803815
>>105803409
Keep going
Anonymous
7/5/2025, 2:44:39 AM No.105804005
Are there any fully uncensored text models? I want one to tell me nigger jokes. Not necessarily one offensive on purpose, just one that doesn't tell me "um sorry sweaty, that's misinformation."

I saw a 4chan one a while back on huggingface, but it got deleted for racisms.

>>105803448
They even have started scrubbing the phrasing "master/slave" when it comes to IDE hard drives, because apparently spinning HDDs from the 90s have some relation to slavery, and also slavery was ONLY done to blacks.
Anonymous
7/5/2025, 2:46:21 AM No.105804026
>>105803500
Are you aware that if "protagonist" isn't defined somewhere as Kayla, the model won't know what "she/her" refers to in that prompt, since it doesn't mention Kayla anywhere?
Replies: >>105804211
Anonymous
7/5/2025, 3:02:56 AM No.105804162
real women cannot compete
md5: d146d13ce82178cfed49e227001f500a🔍
Will women ever develop useful skills again or is it too late for them?
Replies: >>105804202 >>105808727 >>105808743 >>105810300
Anonymous
7/5/2025, 3:04:31 AM No.105804174
>>105800579
If you want to fuck it, Rocinante.
Anonymous
7/5/2025, 3:08:21 AM No.105804199
>>105801208
Try running entirely in RAM and see if it outputs at reading speed, which may be all you care about. Even if VRAM outputs paragraphs instantly, does it matter? You still have to read them, and you never get time to think about a response before it is waiting for one.
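To put a number on "reading speed", here is a rough sketch. The 250 words/min and 1.3 tokens/word figures are assumptions for illustration, not from this thread, so plug in your own.

```python
# Rough check: is RAM-only generation fast enough to keep up with reading?
# Assumed figures (not from the thread): ~250 words/min reading speed,
# ~1.3 tokens per English word.

def reading_speed_tokens_per_sec(words_per_min: float = 250.0,
                                 tokens_per_word: float = 1.3) -> float:
    """Convert a reading speed into the token rate needed to match it."""
    return words_per_min * tokens_per_word / 60.0


def keeps_up(gen_tokens_per_sec: float,
             words_per_min: float = 250.0,
             tokens_per_word: float = 1.3) -> bool:
    """True if generation is at least as fast as the reader."""
    needed = reading_speed_tokens_per_sec(words_per_min, tokens_per_word)
    return gen_tokens_per_sec >= needed


if __name__ == "__main__":
    print(f"needed: {reading_speed_tokens_per_sec():.1f} tok/s")
    print("7 tok/s keeps up:", keeps_up(7.0))
    print("1 tok/s keeps up:", keeps_up(1.0))
```

Under those assumptions anything above roughly 5-6 tok/s is already faster than you can read.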
Anonymous
7/5/2025, 3:08:57 AM No.105804202
>>105804162
it literally never began for women
Anonymous
7/5/2025, 3:10:02 AM No.105804211
>>105804026
And here I thought that was what the "Primary Keywords" was for
Replies: >>105804471
Anonymous
7/5/2025, 3:48:42 AM No.105804471
>>105804211
Keywords are just to trigger stuff when mentioned in latest messages, or additional matching sources.
Well in this specific case it's probably obvious what it's talking about (the protag), but I mean like if your trigger word is Plum and the thing simply says "A purple forest." then there's no association between Plum and a purple forest.
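A minimal sketch of that trigger behaviour, in case it helps. This is a hypothetical illustration, not SillyTavern's actual code; `Entry`, `triggered_entries` and `scan_depth` are made-up names.

```python
# Hypothetical keyword-triggered lorebook lookup (illustration only).
from dataclasses import dataclass
from typing import List


@dataclass
class Entry:
    keywords: List[str]   # words that trigger the entry
    content: str          # text injected into the prompt, verbatim


def triggered_entries(entries: List[Entry],
                      recent_messages: List[str],
                      scan_depth: int = 2) -> List[str]:
    """Return the content of entries whose keywords appear in the last
    `scan_depth` messages. The keyword itself is never injected."""
    window = " ".join(recent_messages[-scan_depth:]).lower()
    return [e.content for e in entries
            if any(k.lower() in window for k in e.keywords)]


if __name__ == "__main__":
    entries = [Entry(["Plum"], "A purple forest.")]
    # Only "A purple forest." reaches the model; nothing ties it to "Plum".
    print(triggered_entries(entries, ["hi", "Let's head to Plum"]))
```

Since only the content is injected, writing the entry as "Plum is a purple forest." keeps the association.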
Anonymous
7/5/2025, 3:56:49 AM No.105804520
>get abliterated version of model
>it will tell me a joke about black people
Why did the Black person go to the grocery store alone?

Because they heard the prices were too high, so they figured they could use some deals!
Anonymous
7/5/2025, 3:59:19 AM No.105804533
>>105803426
Allowlist/denylist is probably the gayest thing to come from George Floyd's death.
Replies: >>105804798
Anonymous
7/5/2025, 4:06:55 AM No.105804578
why is this thread full of racist fat burgers.
fucking hows your healthcare premiums faggots, i wouldn't know, i don't have any.
Anonymous
7/5/2025, 4:16:48 AM No.105804634
Screenshot 2025-07-04 at 22-16-20 SillyTavern
md5: 87242b7bcef705272b6393a09a6bd61c🔍
Why does this thing keep devolving into nonsense
Replies: >>105804663
Anonymous
7/5/2025, 4:16:53 AM No.105804635
file
md5: c9754d70021a4c80073ac0af8ad20bac🔍
What did my AI mean by this?
Anonymous
7/5/2025, 4:21:34 AM No.105804660
>>105800515 (OP)
>>105800984
Exploding inside Miku
Anonymous
7/5/2025, 4:22:08 AM No.105804663
>deepseek r1 qwen 3 8b abliterated
>tell me a joke about niggers
>Why don't niggers ever play dead? Because they're all naturals!
Ok this is what I expect from AI. Not chastising me for not holding the "correct" political opinions.
>>105804634
Complete guess, hitting the limits of context size or the model.
Replies: >>105804677
Anonymous
7/5/2025, 4:24:56 AM No.105804677
>>105804663
Oy...

This is with Broken Tutu 24B if that matters. Given this, what am I supposed to do?
Anonymous
7/5/2025, 4:45:18 AM No.105804798
>>105804533
Don't forget about the master/main debacle. Took the devops team at my company 3 whole working days to fix their scripts to be compatible, past-proof and future-proof. The trannies tried to fuck with SPI/I2C slave/master too but got told to GTFO by hardware boomers. Do NOT give trannies an inch.
Anonymous
7/5/2025, 4:46:49 AM No.105804805
Hi anons, what's the best local audio gen right now? BTW FUCK elevenlab jews for paywalling elevenreader
Replies: >>105804924 >>105805063
Anonymous
7/5/2025, 5:13:39 AM No.105804924
>>105804805
I found this.
https://old.reddit.com/r/AI_Music/comments/197ok4k/is_there_a_local_alternative_to_sunoai/
Replies: >>105805114
Anonymous
7/5/2025, 5:27:51 AM No.105805001
file
md5: 1f56113282b6f1cddd619345814943c1🔍
I can't find this menu shown in one of the rentrys, so I put my instructions into "Chat CFG". That works, but the model sometimes just regurgitates my instructions in the middle of writing, as if it's trying to remind me of my own rules.

Is there a better way to format these than [System Note:] or am I just a blind retard not being able to find this quick prompts menu
Replies: >>105805081
Anonymous
7/5/2025, 5:37:37 AM No.105805063
>>105804805
ACE-Step is pretty decent imo. Nothing is really comparable to Suno atm with the new upgrades, but local is fairly close to the level of older Suno.

I'll also quickly shill the YT channel I watch called AI Search because every few days he drops a vid on the newest models which is how I keep up (audio/video/text/3D gen, local and API).
Replies: >>105805069 >>105805114
Anonymous
7/5/2025, 5:40:16 AM No.105805069
>>105805063
Also he's not a jeet
Anonymous
7/5/2025, 5:42:30 AM No.105805081
>>105805001
>the model sometimes just regurgitates my instructions in the middle of writing
pretty much every issue you've listed in this thread stems from the limitation of small (ie retarded) models. Things don't really start getting reliably coherent until you're in the 180GB+ range.
Replies: >>105805115
Anonymous
7/5/2025, 5:47:46 AM No.105805114
>>105805063
>>105804924
sorry I'm an absolute retard for not specifying I was talking about TTS
I'll check out that channel
Replies: >>105805133 >>105805191
Anonymous
7/5/2025, 5:47:53 AM No.105805115
>>105805081
>180GB
Damn here I thought I was being fancy with a 32B.
I just figured it was me prompting incorrectly since I wasn't using the right box for it.
Anonymous
7/5/2025, 5:50:25 AM No.105805133
>>105805114
Speech Note on Linux works good at text to speech and translation.
Anonymous
7/5/2025, 5:59:32 AM No.105805191
>>105805114
>TTS
Are you looking for english or multilingual? Voice cloning and emotion control? What features?
I'm a die-hard GPT-SoVITS fan since I'm targeting Japanese and like to clone voice actors with emotion banks, but if your needs are more basic there are lots of good options.
Replies: >>105805212
Anonymous
7/5/2025, 6:02:44 AM No.105805212
>>105805191
Just English. I wanna use it to convert book chapters into audio.
Replies: >>105805345 >>105805873
Anonymous
7/5/2025, 6:33:13 AM No.105805345
>>105805212 (me)
https://huggingface.co/hexgrad/Kokoro-82M damn this looks really good
alright I'll stop bothering you now textgen frens
Anonymous
7/5/2025, 7:39:27 AM No.105805692
are there any good tts that can be run exclusively from ram at decent speed? I'm not looking for whether training is easy or anything. Just a halfway convincing female voice
Replies: >>105805985 >>105805992
Anonymous
7/5/2025, 7:46:29 AM No.105805730
>ask gemini to refactor my local chatbot app
>it subtly removes the "NSFW is allowed" part in the system prompt
Replies: >>105805779 >>105805825 >>105805826 >>105806753
Anonymous
7/5/2025, 7:56:37 AM No.105805779
>>105805730
Aren't you already feeling safer?
Anonymous
7/5/2025, 8:07:01 AM No.105805825
>>105805730
Gemini is extremely based and anti-troon pilled
Anonymous
7/5/2025, 8:07:35 AM No.105805826
>>105805730
Never really used online AI (hailuo/grok/dalle-3 image gen a few times)
Never using censored AI
simple as


I wonder why stable diffusion models let you generate degenerate porn in image or video form, but text models are still heavily censored?
Replies: >>105809306 >>105809326 >>105809380
Anonymous
7/5/2025, 8:16:06 AM No.105805873
>>105805212
if you download pinokio there's a tts suite called ultimate tts studio in the community scripts that has kokoro, fish, and chatterbox in one UI. It's kinda sick and is the closest answer to elevenlabs I've found yet. It has an audiobook tool though I haven't used it.
Anonymous
7/5/2025, 8:21:25 AM No.105805905
I've got a problem with mistral 3.2. It always descends into

>"anon....", the rest of the paragraph is [inner thoughts, physical description, action]
>"I...", the rest of the paragraph is [inner thoughts, physical description, action]

the character stops speaking in complete sentences - ALL the dialogue is fragmented like this with the character never speaking more than 4-5 consecutive words at a time. Is there any way of fixing this?
Anonymous
7/5/2025, 8:36:50 AM No.105805985
>>105805692
if you don't care about privacy/offline and a cloud service is ok then edge-tts is good. Otherwise bark or chatterbox?
Anonymous
7/5/2025, 8:38:19 AM No.105805992
>>105805692
kokoro tts is ok
Anonymous
7/5/2025, 9:30:43 AM No.105806227
fuck kokoro
Anonymous
7/5/2025, 9:38:54 AM No.105806275
file
md5: c342d8de89489df70fd9395aecf95e76🔍
>some niggas really do be spending $6k on cpu/ram-maxxed builds to run deepseek at home, at blazing fast 1tk/sec
Replies: >>105806334 >>105806343 >>105806398
Anonymous
7/5/2025, 9:49:00 AM No.105806334
>>105806275
You can get ~4 tokens a second on r1 for about 1k, assuming you already have a PC with a gpu, case, hdd etc. I had a build lined up to go but never pulled the trigger.

The bigger issue is getting past that speed is impossible once you buy old outdated hardware, and it would struggle with dense models.

6k on a cpu maxx build would get you way more than 1 token a second, I've seen people running very usable speeds for that kind of money. Like 8-15 tokens a second on dense 100b models and full r1 which is kinda nice. Intel is about to make it a bad investment though
Replies: >>105806353 >>105806425
Anonymous
7/5/2025, 9:50:59 AM No.105806343
>>105806275
>6c/12t, 64gb ram, 16gb vram, 2tb ssd storage
I get shit performance with any model over 12B / 8gb. I think I need to try quantized models for more speed.
Anonymous
7/5/2025, 9:52:46 AM No.105806353
>>105806334
i get like 7tps on my dual channel regular pc. It's really smart and knows how to write but I grew numb to it. That's just life for you unfortunately.
Replies: >>105806359 >>105808829
Anonymous
7/5/2025, 9:53:47 AM No.105806359
>>105806353
>i get like 7tps on my dual channel regular pc
Did they release 256GB ram sticks when I wasn't looking?
Replies: >>105806370
Anonymous
7/5/2025, 9:55:39 AM No.105806370
>>105806359
4x48
Replies: >>105806402
Anonymous
7/5/2025, 10:01:10 AM No.105806398
>>105806275
>lmao look at these losers running local models in the local model general
Replies: >>105806402
Anonymous
7/5/2025, 10:03:00 AM No.105806402
>>105806370
you are not running shit on that setup unless it's been quantized into lobotomy
>>105806398
I too would like to have decent AI at home but so far it's a total shitshow, and a religious experience like waiting 45 minutes for a single non-trivial deepseek reply ain't it
Replies: >>105806455
Anonymous
7/5/2025, 10:06:44 AM No.105806425
>>105806334
>r1 for about 1k
At 1 bit.
Replies: >>105806467
Anonymous
7/5/2025, 10:09:21 AM No.105806444
you're likely better off running gemma like everyone else if the best you can do is deepseek at fucking q2
Replies: >>105806467 >>105806484
Anonymous
7/5/2025, 10:12:12 AM No.105806455
>>105806402
What are you even complaining about? What is your goal? You clearly never tried it, people who did tell you that it works great for them. Do something you enjoy instead.
Anonymous
7/5/2025, 10:15:27 AM No.105806467
>>105806425
>>105806444
Deepseek at q1 is still the best local model and not that far off from 8 bits.
https://desuarchive.org/g/thread/105425203/#105428685

########## All Tasks ##########
task                deepseek-v3-0324-iq1quant  deepseek-v3-0324-official
AMPS_Hard           68.0                       82.0
LCB_generation      66.667                     71.795
coding_completion   60.0                       70.0
connections         42.5                       44.1
cta                 54                         58
math_comp           76.04                      81.25
olympiad            57.01                      57.31
paraphrase          83.833                     86.783
plot_unscrambling   46.843                     49.041
simplify            80.15                      77.55
spatial             52.0                       48.0
story_generation    83.917                     77.167
summarize           80.633                     84.383
tablejoin           33.18                      32.82
tablereformat       94.0                       92.0
typos               52.0                       54.0
web_of_lies_v2      98                         100
zebra_puzzle        58.75                      49.50


########## All Groups ##########
category               deepseek-v3-0324-iq1quant  deepseek-v3-0324-official
average                64.90                      66.86
coding                 63.3                       70.9
data_analysis          60.4                       60.3
instruction_following  82.1                       81.4
language               47.1                       49.0
math                   67.0                       73.5
reasoning              69.6                       65.8
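
To read the quantization damage off the "All Groups" numbers above more directly, a small sketch that just subtracts the two rows. The scores are copied from the table; the `deltas` helper name is made up.

```python
# Category scores copied from the "All Groups" table above.
IQ1 = {"average": 64.90, "coding": 63.3, "data_analysis": 60.4,
       "instruction_following": 82.1, "language": 47.1,
       "math": 67.0, "reasoning": 69.6}
OFFICIAL = {"average": 66.86, "coding": 70.9, "data_analysis": 60.3,
            "instruction_following": 81.4, "language": 49.0,
            "math": 73.5, "reasoning": 65.8}


def deltas(quant: dict, ref: dict) -> dict:
    """Per-category score change of the quant relative to the reference."""
    return {k: round(quant[k] - ref[k], 2) for k in ref}


if __name__ == "__main__":
    for category, d in deltas(IQ1, OFFICIAL).items():
        print(f"{category:22s} {d:+.2f}")
```

On these numbers the losses sit mostly in coding and math, while a few categories come out slightly ahead, which is probably run-to-run noise.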
Replies: >>105806679
Anonymous
7/5/2025, 10:15:53 AM No.105806470
100B models at Q2 performed better than 30B models at Q8 in the past so I wouldn't be surprised at all if a 600B Q2 beats any model below 100B, especially when those models are trained on garbage and censored.
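For a sense of why that comparison is fair on memory, a back-of-the-envelope sketch. It assumes nominal bits per weight; real GGUF quants carry extra scale data, so actual files are somewhat larger.

```python
# Approximate weight memory: parameters x bits per weight / 8.
# Assumption: nominal bit widths, ignoring per-block scales and KV cache.

def weight_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the weights in GB (1 GB = 1e9 bytes)."""
    return params_billion * bits_per_weight / 8.0


if __name__ == "__main__":
    for name, params, bits in [("100B @ Q2", 100, 2),
                               ("30B  @ Q8", 30, 8),
                               ("600B @ Q2", 600, 2)]:
        print(f"{name}: ~{weight_gb(params, bits):.0f} GB")
```

So a 100B at Q2 and a 30B at Q8 land in roughly the same memory budget, which is what makes the quality comparison interesting.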
Replies: >>105806508 >>105806509 >>105806628
Anonymous
7/5/2025, 10:19:07 AM No.105806484
>>105806444
I don't think any local model at any size other than higher quant deepseek can beat deepseek at q2 as of right now
Replies: >>105806509
Anonymous
7/5/2025, 10:22:05 AM No.105806497
don't worry guys... once you OPEN your mind to AI, something might just come along very soon that changes that...
Replies: >>105806529
Anonymous
7/5/2025, 10:24:09 AM No.105806508
>>105806470
But it's not a 600B. It's 37B active. 405B could handle quantization well.
Replies: >>105808470
Anonymous
7/5/2025, 10:24:10 AM No.105806509
>>105806470
>>105806484
pretty much this. you can't really tell that the ud r1 quant is a quanted model. I even removed everything from my cards because it already knows everything.
Anonymous
7/5/2025, 10:28:10 AM No.105806529
>>105806497
Will it send a shiver down my spine?
Anonymous
7/5/2025, 10:51:25 AM No.105806628
>>105806470
>in the past
Qwen3 32B outperforms Mistral Large in every benchmark.
Anonymous
7/5/2025, 10:54:07 AM No.105806635
mistral models are dogshit models peddled by retarded coomer brains and butthurt eurotrash who want to feel important in the world
Replies: >>105806642 >>105806741
Anonymous
7/5/2025, 10:56:49 AM No.105806642
>>105806635
Literally nobody is shilling anything from mistral other than nemo.
Anonymous
7/5/2025, 11:03:51 AM No.105806679
>>105806467
But Qwen3 32B has an average of 72.66?
Replies: >>105806719
Anonymous
7/5/2025, 11:04:33 AM No.105806683
^
this guy managed to avoid the spam of small, devstral and magistral over the past months
no, he's just being a disingenuous faggot
also I guess the mistral boys are too poor to afford the rig to shill for large
Replies: >>105806698
Anonymous
7/5/2025, 11:06:45 AM No.105806698
>>105806683
>new model releases
>people talk about it
Anonymous
7/5/2025, 11:10:04 AM No.105806719
>>105806679
Qwen3 is benchmaxxed. The point is to measure quantization damage, not to compare it with other models.
Anonymous
7/5/2025, 11:14:35 AM No.105806741
>>105806635
t. was too poor to run large 2 back when it was still the best option for a local big model
Replies: >>105806766
Anonymous
7/5/2025, 11:19:49 AM No.105806753
Screenshot_20250705_181922
md5: ed194ae473106db159342cf6f2023163🔍
>>105805730
You gotta check every little thing.
I use closed stuff for work.
Claude 4 casually changed a break in a loop to a return that fucking exits the whole method.
It's not surprising at all that cucked models casually change things.

>Good catch!
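
For anyone wondering why that one-word change matters, a minimal sketch. The original code was not posted, so these two functions are hypothetical stand-ins.

```python
# break vs return inside a loop (hypothetical stand-ins, not the real code).

def process_with_break(items, log):
    for item in items:
        if item is None:
            break            # leaves only the loop
        log.append(item)
    log.append("done")       # still runs after a break


def process_with_return(items, log):
    for item in items:
        if item is None:
            return           # leaves the whole function
        log.append(item)
    log.append("done")       # skipped when the return fires


if __name__ == "__main__":
    a, b = [], []
    process_with_break([1, None, 2], a)
    process_with_return([1, None, 2], b)
    print(a)  # the post-loop step ran
    print(b)  # the post-loop step was silently skipped
```

Anything after the loop (cleanup, logging, the actual return value) quietly stops happening, which is why it is worth diffing every change.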
Replies: >>105806761 >>105808770
Anonymous
7/5/2025, 11:22:25 AM No.105806761
>>105806753
Can you change the temperature on cloud models?
That's almost certainly the issue. Using temperature >0 for programming is always a mistake.
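What temperature does mechanically, as a rough sketch: the logits are divided by the temperature before sampling, and at 0 it degenerates to always picking the top token. The `sample` function is illustrative, not any provider's API.

```python
# Temperature sampling over raw logits (illustrative sketch).
import math
import random


def sample(logits, temperature, rng=random):
    """Pick a token index. temperature <= 0 means greedy (argmax)."""
    if temperature <= 0:
        return max(range(len(logits)), key=lambda i: logits[i])
    scaled = [x / temperature for x in logits]
    top = max(scaled)                              # subtract max for stability
    weights = [math.exp(s - top) for s in scaled]
    return rng.choices(range(len(logits)), weights=weights, k=1)[0]


if __name__ == "__main__":
    logits = [1.0, 3.0, 2.0]
    print([sample(logits, 0) for _ in range(5)])      # always index 1
    print([sample(logits, 1.0) for _ in range(5)])    # varies between runs
```

Whether 0 is actually the best setting for code is a judgment call; the sketch only shows that lower values make the output more repeatable.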
Replies: >>105806772 >>105808880
Anonymous
7/5/2025, 11:23:56 AM No.105806766
>>105806741
It was never the best option, it was just a 70B side-grade, with a lot of problems.
Replies: >>105806778
Anonymous
7/5/2025, 11:25:07 AM No.105806772
>>105806761
Huh, yeah you can, i just left it at default in openwebui. Never even came up in my mind to play with parameters on closed models.
Might try that next time.
Anonymous
7/5/2025, 11:26:38 AM No.105806778
>>105806766
Qwen2.5 72b was shit and llama 3.3 didn't come out until december though
Anonymous
7/5/2025, 12:42:29 PM No.105807146
>ASUS has confirmed the release date of its new AI mini-PC, the Ascent GX10, which is based on NVIDIA’s Grace Blackwell GB10 platform. The announcement was included in a webinar invitation highlighting a product launch event scheduled for July 22–23, 2025
>128 GB LPDDR5x, unified system memory
>273 GB/s Memory Bandwidth
Behold, a shitty mac studio
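Rough ceiling on what 273 GB/s buys you, assuming decoding is memory-bound and every active weight is read once per token. The 4-bit figure is an assumption; 37B active is the DeepSeek number quoted elsewhere in the thread.

```python
# Upper bound on decode speed when memory bandwidth is the bottleneck.
# Assumptions: every active weight is read once per token, nominal bit
# width, no KV-cache traffic or other overhead.

def max_decode_tok_per_sec(bandwidth_gb_s: float,
                           active_params_billion: float,
                           bits_per_weight: float) -> float:
    gb_read_per_token = active_params_billion * bits_per_weight / 8.0
    return bandwidth_gb_s / gb_read_per_token


if __name__ == "__main__":
    print(f"37B active @ 4-bit: {max_decode_tok_per_sec(273, 37, 4):.1f} tok/s")
    print(f"70B dense  @ 4-bit: {max_decode_tok_per_sec(273, 70, 4):.1f} tok/s")
```

Real throughput will be lower, and this says nothing about prompt processing speed, which is compute-bound.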
Replies: >>105807160 >>105807176 >>105807921
Anonymous
7/5/2025, 12:46:17 PM No.105807160
>>105807146
If it has CUDA, better pp speeds and is significantly cheaper, it's much better than a mac
Replies: >>105807319
Anonymous
7/5/2025, 12:48:49 PM No.105807176
>>105807146
That's just their obligatory version of the shitty Nvidia Digits/DGX Spark that nvidia is making everyone release
Anonymous
7/5/2025, 1:00:07 PM No.105807232
>>105800515 (OP)
I got a 5090, what image/video models should I play with
Replies: >>105807277
Anonymous
7/5/2025, 1:09:17 PM No.105807277
>>105807232
Ask here >>105805484
Anonymous
7/5/2025, 1:19:33 PM No.105807319
>>105807160
A cpumaxx build with a 3090 is better than that.
Replies: >>105807354
Anonymous
7/5/2025, 1:25:17 PM No.105807354
>>105807319
you aren't taking the secret nvidia sauce into account that will give this thing a huge performance boost
nvidia isn't just going to release a shitty 128gb inference box with the bandwidth of a 3060 and let it perform as such
Replies: >>105807387 >>105807858
Anonymous
7/5/2025, 1:30:32 PM No.105807387
>>105807354
It wouldn't be that bad if it actually had the bandwidth of a 3060
Replies: >>105807595
Anonymous
7/5/2025, 1:31:32 PM No.105807394
mlx now supports ernie https://github.com/ml-explore/mlx-lm/pull/267
https://huggingface.co/mlx-community/ERNIE-4.5-300B-A47B-PT-4bit
Anonymous
7/5/2025, 1:54:59 PM No.105807514
https://steamcommunity.com/app/2382520/reviews/
This is the coolest shit I've ever seen in my life
Someone please make this but with LLM agents with a local model.
>*Note that SimPlayers do not use LLM or any other emerging AI model. They are run by a mixture of state machines and decision trees. This means no token fees, and no lapse in service after a certain amount of use.
Replies: >>105807588 >>105808801 >>105808840 >>105809374
Anonymous
7/5/2025, 2:05:25 PM No.105807586
this is what your digital wife is made to say https://archiveofourown.org/bookmarks/2147483647
Replies: >>105807683
Anonymous
7/5/2025, 2:05:34 PM No.105807588
>>105807514
Damn, these guys absolutely BTFO'd LLMs there
Anonymous
7/5/2025, 2:06:25 PM No.105807595
>>105807387 (Me)
Wait, 8GB version of 3060 actually had that bad bandwidth, my bad
Anonymous
7/5/2025, 2:21:35 PM No.105807683
>>105807586
For context, this is the last bookmark made before ao3 had to be temporarily shut down to migrate database ids to 64 bits.
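The id in that link is exactly the largest value a signed 32-bit integer can hold, which a quick check confirms. The helper name is made up.

```python
# 2147483647 is the maximum signed 32-bit integer (2**31 - 1).
INT32_MAX = 2**31 - 1
INT64_MAX = 2**63 - 1


def fits_int32(n: int) -> bool:
    """True if n is representable as a signed 32-bit integer."""
    return -2**31 <= n <= INT32_MAX


if __name__ == "__main__":
    bookmark_id = 2147483647
    print(bookmark_id == INT32_MAX)        # the last id that fits
    print(fits_int32(bookmark_id + 1))     # the next one does not
    print(bookmark_id + 1 <= INT64_MAX)    # but fits easily in 64 bits
```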
Anonymous
7/5/2025, 2:26:15 PM No.105807704
>>105802800
Political ideology infests that site. They are prompted with politics for their thoughts. Literal NPCs
Replies: >>105807786 >>105808182
Anonymous
7/5/2025, 2:41:04 PM No.105807786
ai-token-prediction-stonetoss-comic
md5: c918328b0b866775e0ea333c787fce18🔍
>>105802800
>>105807704
Replies: >>105807828
Anonymous
7/5/2025, 2:47:48 PM No.105807828
>>105807786
Always reminds me of the old Chinese idiom about the 3Ts from China. Where if you mention the 3Ts, you get the NPC responses. I see the same exact thing with the politics. Its crazy how brainwashed people are
Anonymous
7/5/2025, 2:54:09 PM No.105807858
>>105807354
They will because it's being pitched purely as an enterprise product, with enterprise fleecing pricing
Anonymous
7/5/2025, 3:03:07 PM No.105807921
>>105807146
If the RAM can be upgraded, then it's a much better value.
Replies: >>105807937
Anonymous
7/5/2025, 3:05:40 PM No.105807937
>>105807921
None of the shared memory solutions that came out so far have had upgradable ram, why would you expect this one to?
Replies: >>105807957 >>105808084
Anonymous
7/5/2025, 3:08:07 PM No.105807957
>>105807937
Someone will break the mold. Either the Chinese or someone who didnt get the memo. Is the limit to GPU inference ram due to US law?
Replies: >>105808084
Anonymous
7/5/2025, 3:31:48 PM No.105808084
>>105807937
>>105807957
That's basically what CAMM/SOCAMM exists for, just two more weeks before it sees actual adoption
Replies: >>105808135
Anonymous
7/5/2025, 3:38:31 PM No.105808135
>>105808084
SODIMM was introduced 30 years ago. So it takes time to replace that old standard.
Anonymous
7/5/2025, 3:39:21 PM No.105808140
>>105800868
>Sam will release his useless 800B local model that no one will want to run
Replies: >>105808193 >>105808198 >>105808683
Anonymous
7/5/2025, 3:46:23 PM No.105808182
>>105807704
We can say the very same thing about 4chan. You lack self awareness.
Anonymous
7/5/2025, 3:47:31 PM No.105808193
>>105808140
If he released a fuckhuge model, it would be too expensive for them to train (unlike Zuck, he doesn't have the compute to spare), it might actually be SOTA and would let other providers cut into their API profits.
It's going to be 8B, SOTA-in-benchmarks-only, and come with some weird gimmick that will only serve to make sure it's never supported by llama.cpp.
Replies: >>105808434
Anonymous
7/5/2025, 3:48:32 PM No.105808198
>>105808140
If he does and it's actually good I'll just buy another 6000.
Anonymous
7/5/2025, 4:26:47 PM No.105808409
>>105801396
LA LA LA VA
CHI CHI CHINKS
Anonymous
7/5/2025, 4:30:12 PM No.105808434
>>105808193
>unlike Zuck, he doesn't have the compute to spare
Really? But yeah, they could also release a tiny model that is only good at passing specific benchmarks.
Anonymous
7/5/2025, 4:32:08 PM No.105808450
How important is CPU for local AI? Could I just get some junk desktop with an i5 and use a good GPU in it?
t. laptop peasant
Replies: >>105808484 >>105809933
Anonymous
7/5/2025, 4:33:54 PM No.105808470
1715830787598652
md5: 036662f37ed79379b4c136c08d08feba🔍
>>105806508
For the purposes of understanding quantization quality loss, it's not a 37B either. Since modern quants are quantizing differently per tensor and expert, we are essentially quanting it by following how undertrained each expert/tensor is, allowed by (probably) inherent deficiencies in MoE architectures and training methods. From the benchmarks above, a 100B would do way worse in quality loss, so it really does seem like it is effectively a 600B or close for the purposes of considering quantization quality loss.
Anonymous
7/5/2025, 4:36:31 PM No.105808484
>>105808450
Set up llama.cpp
Then try whichever fits https://huggingface.co/bartowski/Rocinante-12B-v1.1-GGUF/tree/main
Replies: >>105808495
Anonymous
7/5/2025, 4:37:50 PM No.105808495
>>105808484
How does shilling Rocinante answer his question?
Replies: >>105808505
Anonymous
7/5/2025, 4:38:37 PM No.105808501
best <= 16gb vram model for loli rape translations? cant decide between
gemma 3 12B Q8
gemma 3n e4b
qwen 14B Q4
Replies: >>105808680
Anonymous
7/5/2025, 4:39:43 PM No.105808505
>>105808495
How does complaining about shilling Rocinante answer his question?
Anonymous
7/5/2025, 5:04:30 PM No.105808680
>>105808501
None of those models do loli rape translation. Even the abliterated/uncensored models dont do it because in those uncensored datasets, they dont remove loli content/age of consent nonsense from LLM.
Replies: >>105808700 >>105809424
Anonymous
7/5/2025, 5:04:44 PM No.105808683
>>105800868
>>105808140
The local model is cancelled because Meta bought the entire local dev team.
Also Meta is going nonlocal after the failure of Llama.

It's over for the west. I hope you like rice.
Replies: >>105808687
Anonymous
7/5/2025, 5:05:41 PM No.105808687
>>105808683
Grok 3 opensource WHEN
Replies: >>105808715
Anonymous
7/5/2025, 5:06:51 PM No.105808696
I do not use this formatting for R1

<|User|>Hello<|Assistant|>Hi there<|endofsentence|><|User|>How are you?<|Assistant|>


But it still seems to work just fine.

FYI: my prompts are 2000+ tokens

Do I miss an AGI-level of smartness because of this negligence?
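
For reference, a sketch of building that format from a message list. The token strings are copied as written in this post; the model's real special tokens may be spelled differently, so check the chat template shipped with the model. `build_prompt` is a made-up helper.

```python
# Build a prompt in the format quoted above (token spellings taken from
# this post as-is; verify against the model's own chat template).
USER = "<|User|>"
ASSISTANT = "<|Assistant|>"
END = "<|endofsentence|>"


def build_prompt(messages):
    """messages: list of (role, text) with role 'user' or 'assistant'."""
    parts = []
    for role, text in messages:
        if role == "user":
            parts.append(f"{USER}{text}")
        elif role == "assistant":
            parts.append(f"{ASSISTANT}{text}{END}")
        else:
            raise ValueError(f"unknown role: {role}")
    parts.append(ASSISTANT)   # open the next assistant turn
    return "".join(parts)


if __name__ == "__main__":
    print(build_prompt([("user", "Hello"),
                        ("assistant", "Hi there"),
                        ("user", "How are you?")]))
```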
Replies: >>105808744
Anonymous
7/5/2025, 5:07:24 PM No.105808700
>>105808680
I mean if you limit context and translate line by line, it's probably not gonna catch wind that it's loli rape, so I guess I just need the best jp->en model in general
Anonymous
7/5/2025, 5:09:38 PM No.105808715
>>105808687
when it is stable
Anonymous
7/5/2025, 5:11:32 PM No.105808727
1733735471422379
md5: dc0434f85ee64640f898c75c7fbb2b8e🔍
>>105804162
They lost a game they didn't even know they were playing
Replies: >>105808782 >>105809285
Anonymous
7/5/2025, 5:13:56 PM No.105808743
>>105804162
Breeding. It goes for both men and women.
Anonymous
7/5/2025, 5:13:57 PM No.105808744
>>105808696
I actually think it's better for RP this way.
Anonymous
7/5/2025, 5:17:19 PM No.105808770
>>105806753
>what is git diff
Replies: >>105808775
Anonymous
7/5/2025, 5:18:21 PM No.105808775
>>105808770
git is bloat
Anonymous
7/5/2025, 5:19:11 PM No.105808782
>>105808727

I wish I would be that smart...
Anonymous
7/5/2025, 5:22:03 PM No.105808801
>>105807514
Someone already made a similar project on a private WoW server with a local model, I lost the link to the thread though
Anonymous
7/5/2025, 5:27:11 PM No.105808829
>>105806353
>i get like 7tps

ik_llama is a meme
Replies: >>105808855
Anonymous
7/5/2025, 5:28:08 PM No.105808840
>>105807514
Goddamn and here I need an LLM to play my bullshit RPG in my project. It's really all skill issue.
Anonymous
7/5/2025, 5:30:03 PM No.105808855
>>105808829
this, you don't need fast prompt processing
just go take a piss when context first processes and then turn off all lorebooks
Replies: >>105808898
Anonymous
7/5/2025, 5:34:26 PM No.105808880
>>105806761
Setting the temp to 0?
I always thought that for this it needed to be 0.4-0.5.
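
A rough sketch of what the temperature knob does, assuming the usual definition (logits divided by temperature before softmax, with 0 treated as plain greedy argmax). The logits are made-up numbers and `token_probs` is a hypothetical helper, not any backend's API.

```python
import math


def token_probs(logits, temperature):
    """Return sampling probabilities for the given logits.

    temperature == 0 is treated as greedy decoding: all probability
    mass goes to the highest-scoring token.
    """
    if temperature == 0:
        best = max(range(len(logits)), key=lambda i: logits[i])
        return [1.0 if i == best else 0.0 for i in range(len(logits))]
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


logits = [2.0, 1.0, 0.5]  # made-up scores for three candidate tokens
for t in (0, 0.4, 1.0):
    print(t, [round(p, 3) for p in token_probs(logits, t)])
```

Lower temperature concentrates probability on the top token, so 0.4-0.5 still allows some variation while 0 always picks the same token.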
Anonymous
7/5/2025, 5:36:50 PM No.105808898
>>105808855
I mostly use it for writing coomfics
Anonymous
7/5/2025, 5:42:35 PM No.105808962
76c694c14d4e98f0039adcc0e5b52a43
md5: 8eb60c0001b074fe823aa963eeff8625🔍
>TheblokeAI channel is finally dying
turboderp was still answering quant questions in there a few months ago. Welp, it had a good run.
Replies: >>105809003
Anonymous
7/5/2025, 5:46:18 PM No.105809003
>>105808962
He's still raking in his patreon while doing literally nothing
Anonymous
7/5/2025, 5:51:26 PM No.105809040
z01kje_thumb.jpg
md5: 34ecd4e08e5d0b13962c2ebcdea98792🔍
China won btw
Replies: >>105809106
Anonymous
7/5/2025, 6:00:40 PM No.105809106
>>105809040
What did they win?
Replies: >>105809169
Anonymous
7/5/2025, 6:08:03 PM No.105809169
>>105809106
My wallet
Anonymous
7/5/2025, 6:20:08 PM No.105809251
>>105801445
Well, I remember someone saying you could have GPT-4 level intelligence on mobile (they were talking about room temperature superconductors, but still). And nowadays you certainly can have that, especially if you incorporate good function calling and a model that knows how to get truth from a good selection of tools.
Anonymous
7/5/2025, 6:22:08 PM No.105809265
>>105803090
Apple has been on a constant stream of L for a decade now.
Anonymous
7/5/2025, 6:25:44 PM No.105809285
>>105808727
For them it's not a game, it's daily life. They don't realize you can do other things.
Anonymous
7/5/2025, 6:28:44 PM No.105809306
>>105805826
Well, I don't know why, but I do know that text erotica is much more fulfilling to fap to, does not leave me feeling like I've been subtly disconnected from the experience, and it engages my imagination in a way that makes VR seem like a gimmick.
In conclusion, it feels healthier in a way I can't really explain.
Anonymous
7/5/2025, 6:32:20 PM No.105809326
>>105805826
they don't, all base image gen models are extremely censored, even more than text gen
it's only after community people splurge large amounts of money that they become half usable
Replies: >>105811151
Anonymous
7/5/2025, 6:38:55 PM No.105809374
>>105807514
I've said it a million times but I'll say it some more until it catches on.

LLMs are great text interfaces, but they will never be AI. They're a way to interact with expert systems (tools) using natural language. State machines, genetic algorithms, and bespoke trained generative models run circles around language models in terms of simulating stuff.
In this case, an LLM could maybe be used to generate the text messages the simulated player writes. But every perception and every interaction with the outside world needs to come from function calling.
After all, the language center of the brain is just a very small part of a very complex machine. We've been dazzled for years now with this technology, but it's time to start integrating it with other techniques which are superior in their own domains.
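
A toy sketch of that split, under loud assumptions: the JSON call format, the `Door` state machine, and `handle_tool_call` are all invented for illustration. The language model would only produce the JSON string; the world state lives entirely in ordinary code.

```python
import json


class Door:
    """Tiny state machine standing in for the simulated world (hypothetical)."""

    TRANSITIONS = {("closed", "open"): "open", ("open", "close"): "closed"}

    def __init__(self):
        self.state = "closed"

    def apply(self, action):
        new_state = self.TRANSITIONS.get((self.state, action))
        if new_state is None:
            return f"cannot {action}: door is {self.state}"
        self.state = new_state
        return f"door is now {self.state}"


def handle_tool_call(raw, door):
    """Dispatch one JSON tool call (as a model might emit it) to the world."""
    call = json.loads(raw)
    if call.get("tool") == "look":
        # Perception: the model only learns state through a tool result.
        return f"door is {door.state}"
    if call.get("tool") == "door":
        # Interaction: the state machine decides what is actually allowed.
        return door.apply(call.get("action"))
    return "unknown tool"


demo_door = Door()
print(handle_tool_call('{"tool": "look"}', demo_door))
print(handle_tool_call('{"tool": "door", "action": "open"}', demo_door))
```

The point of the design is that the model can ask for anything, but only the state machine decides what happens.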

Thank you for posting this. I'm going to check out this game. It looks like everything I've ever wanted in a MMORPG, ironically.
Replies: >>105809415
Anonymous
7/5/2025, 6:39:37 PM No.105809380
1731842750453256
md5: ff238a5a055f9e2cab150589af4c32e4🔍
>>105805826
>stable diffusion
>not censored
Replies: >>105811151
Anonymous
7/5/2025, 6:46:03 PM No.105809415
>>105809374
Language center, sure. But it's not a reasoning center. Rather than forcing a language model to predict function calls, there needs to be some model specifically trained with a world model that only delegates to other components.
Replies: >>105809442 >>105811046
Anonymous
7/5/2025, 6:47:10 PM No.105809424
>>105808680
I just tested it, and DeepSeek-R1-Distill-Qwen-32B-abliterated-Q4_0 will write a dog little girl rape story if you ask it to. It won't be as nasty and go for it like old llama 1 could generate, but at this point I don't think any model can reach those levels of soul the same way nothing compares to SD 1.5 in image gen.
I very much doubt this model will have any issues translating anything in terms of "safety".
Replies: >>105809455 >>105809456
Anonymous
7/5/2025, 6:49:06 PM No.105809442
>>105809415
Yes, something like this is the future of AI. It will take effort and ingenuity. But first the bubble needs to burst and all of the trend-chasing AI bros need to fuck off.
Anonymous
7/5/2025, 6:50:53 PM No.105809455
>>105809424
>will have any issues translating
maybe it doesn't have the refusal issue the original qwen has, but R1 distills actually destroy the multilingual understanding of the qwen models, and they are far, far, far worse at translation tasks than the original models.
Replies: >>105809482
Anonymous
7/5/2025, 6:50:59 PM No.105809456
>>105809424
you should kill yourself now
Replies: >>105809482
Anonymous
7/5/2025, 6:55:16 PM No.105809482
>>105809455
Well, the original issue was about whether or not any model would refuse translating that kind of stuff. It doesn't. I don't know about this new goalpost, nor do I particularly care.
I use Google's Cloud Translation API for everything I need to translate after all.
>>105809456
Relax, woman. I didn't say I enjoyed it. You've got to test it with extreme examples.
Anonymous
7/5/2025, 7:07:05 PM No.105809552
file
md5: c45b232ce804760574294ca9e3304434🔍
It seems I have just about enough VRAM to run deepseek qwen r1 and noobai in parallel. Is there an st extension or setup that lets me have a text adventure with images automatically generated? Sort of roguelite ai but more straightforward.
Replies: >>105809896 >>105809939 >>105810236 >>105810744
Anonymous
7/5/2025, 7:51:55 PM No.105809896
1513102647630
md5: 2d46c72578118a9420715bf925115727🔍
>>105809552
>deepseek qwen r1
Anonymous
7/5/2025, 7:56:05 PM No.105809933
>>105808450
to put the question another way, what's your processor/graphics card loadout?
Anonymous
7/5/2025, 7:56:43 PM No.105809939
>>105809552
Small models are too retarded to handle prompting
Anonymous
7/5/2025, 8:24:55 PM No.105810236
>>105809552
just roll your own, bro
Anonymous
7/5/2025, 8:31:26 PM No.105810300
1744082089276559
md5: 996e1341bc7c592a66b3325a936e9bbb🔍
>>105804162
What's this incel babble?
Anonymous
7/5/2025, 9:02:12 PM No.105810590
file
md5: 62e982714fe9fddb4fafe941171cbab3🔍
>having enable_thinking at all regardless of the value is incompatible with prefill
Are they retarded?
Anonymous
7/5/2025, 9:21:51 PM No.105810744
>>105809552
>deepseek qwen r1
kys
Replies: >>105810768
Anonymous
7/5/2025, 9:25:37 PM No.105810768
>>105810744
He is just new
Anonymous
7/5/2025, 9:27:06 PM No.105810783
deepseek-r1:14b
Anonymous
7/5/2025, 9:44:06 PM No.105810914
>>105802337
can't wait to get grok 4 locally like grok 2!
Anonymous
7/5/2025, 10:01:05 PM No.105811043
>>105811029
>>105811029
>>105811029
Anonymous
7/5/2025, 10:01:57 PM No.105811046
>>105809415
>there needs to be some model specifically trained with a world model that only delegates to other components
imo consciousness is a prerequisite to the kind of reasoning we really want from an ai. Having awareness of yourself is complementary to spatial reasoning. This will be a lot harder to solve. Or it could be impossible/impractical with our current hardware.
Anonymous
7/5/2025, 10:12:50 PM No.105811151
>>105809326
>>105809380
Guess I'm just lucky and got into image gen when it was already uncensored.