/lmg/ - Local Models General - /g/ (#105704582) [Archived: 733 hours ago]

Anonymous
6/25/2025, 11:26:17 PM No.105704582
1750863269916
1750863269916
md5: ff32ee205b3a1053752029c2d8b55d0a🔍
/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>105698912 & >>105689385

►News
>(06/25) I posted about my AGP fetish on r9k: https://desuarchive.org/r9k/thread/81611346/
>(06/21) LongWriter-Zero, RL trained ultra-long text generation: https://hf.co/THU-KEG/LongWriter-Zero-32B
>(06/20) Magenta RealTime open music generation model released: https://hf.co/google/magenta-realtime
>(06/20) Mistral-Small-3.2 released: https://hf.co/mistralai/Mistral-Small-3.2-24B-Instruct-2506
>(06/19) Kyutai streaming speech-to-text released: https://kyutai.org/next/stt


►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/tldrhowtoquant
https://rentry.org/samplers

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/leaderboard.html
Code Editing: https://aider.chat/docs/leaderboards
Context Length: https://github.com/adobe-research/NoLiMa
Censorbench: https://codeberg.org/jts2323/censorbench
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
Replies: >>105704679 >>105704704 >>105704783 >>105704859 >>105704905 >>105705081 >>105705144 >>105707293 >>105708829 >>105708936 >>105709884 >>105710230
Anonymous
6/25/2025, 11:27:56 PM No.105704595
mikusisters not like this....
Replies: >>105704619
Anonymous
6/25/2025, 11:31:15 PM No.105704619
>>105704595

More miruku sister which could mean milking sister or miracle milk am i right or am i right
Anonymous
6/25/2025, 11:36:58 PM No.105704679
>>105704582 (OP)
>>(06/25) I posted about my AGP fetish on r9k:
Thread worth less than that btw
Go back to your discord and spam there till your skin turns blue.
Replies: >>105704710
Anonymous
6/25/2025, 11:39:20 PM No.105704704
>>105704582 (OP)
>>(06/25) I posted about my AGP fetish on r9k: https://desuarchive.org/r9k/thread/81611346/
based
>migger troon op same as issual
not based
Anonymous
6/25/2025, 11:39:54 PM No.105704710
>>105704679
I am in a discord. It even has troons and mods that ban for spam while they spam.
Replies: >>105704722
Anonymous
6/25/2025, 11:40:36 PM No.105704718
file
file
md5: f5f3968f0908c59e0aa0325770c3b6d1🔍
Replies: >>105704734
Anonymous
6/25/2025, 11:40:52 PM No.105704720
Is this thread protroon or antitroon?
Replies: >>105704733 >>105704735 >>105704816
Anonymous
6/25/2025, 11:40:59 PM No.105704722
>>105704710
4chan and Discord - brothers in arms and war.
Replies: >>105704792
Anonymous
6/25/2025, 11:41:54 PM No.105704733
>>105704720
It's a schizo thread
Replies: >>105704739 >>105704766
Anonymous
6/25/2025, 11:41:56 PM No.105704734
>>105704718
I am sorry stray catalog anon. Our thread has a bit of troon infestation problem.
Anonymous
6/25/2025, 11:42:06 PM No.105704735
>>105704720
since its hard to tell it's best to assume its a tranny intent on derailing /lmg/ like always
Replies: >>105704761
Anonymous
6/25/2025, 11:42:43 PM No.105704739
>>105704733
and you are our king schizo
Anonymous
6/25/2025, 11:42:56 PM No.105704741
file
file
md5: 9ecb5929efa593672d036c8a7d022f4b🔍
who needs that green haired anime mascot post this green haired anime mascot instead
Replies: >>105704776 >>105704788 >>105704820
Anonymous
6/25/2025, 11:44:50 PM No.105704761
>>105704735
you do a good enough job derailing it on your own you digusting festering troon freak
Anonymous
6/25/2025, 11:45:27 PM No.105704766
>>105704733
If it is a schizo thread then where is the usual thread recap?
Anonymous
6/25/2025, 11:46:03 PM No.105704776
>>105704741
We don't need mascots at all, it adds nothing for topic (Large Language Models) discussion.
Replies: >>105704797 >>105704824
Anonymous
6/25/2025, 11:46:37 PM No.105704783
>>105704582 (OP)
>I want to have a foid as something like a living dress-up doll: designing her outfits, dressing her myself, doing her makeup, controlling what she eats, showing her off as a walking decoration etc. Not really interested in any kind of romantic dimension since I only love one woman (even though she'll never be mine), though I acknowledge there's an inherently erotic aspect to the arrangement.
That is a bit fucked in the head, innit?
Replies: >>105704809 >>105704856
Anonymous
6/25/2025, 11:46:49 PM No.105704788
1738860740206730
1738860740206730
md5: 4fe751ec3c2f30e8d31b20434d302df9🔍
>>105704741
Don't make her part of this mess
Replies: >>105704793 >>105704812
Anonymous
6/25/2025, 11:47:11 PM No.105704792
1725662919249355
1725662919249355
md5: a4176ffe668def2b47013bf4fe0b6b5c🔍
>>105704722
this place is undistinguishable from what it once hated now, really the fall of rome.
Replies: >>105704810
Anonymous
6/25/2025, 11:47:24 PM No.105704793
>>105704788
>It's flat
Holy gay.
Replies: >>105704822
Anonymous
6/25/2025, 11:47:38 PM No.105704797
>>105704776
Novel concept that is impossible to grasp for at least 50% of the posters.
Anonymous
6/25/2025, 11:48:35 PM No.105704809
>>105704783
Don't judge. Anything that gets wh*tes to self-emasculate and drop out of society is a good thing
Anonymous
6/25/2025, 11:48:40 PM No.105704810
>>105704792
When 4chan was offline it was basically confirmed that jannies and admins are unironic troons. It is a dead corpse paraded around by troons who want to make it a safespace without chasing away majority of posters.
Anonymous
6/25/2025, 11:48:54 PM No.105704812
>>105704788
more of >her?
Anonymous
6/25/2025, 11:49:18 PM No.105704816
>>105704720
I'm anti whoever can't shut the fuck up about it.
Replies: >>105704849
Anonymous
6/25/2025, 11:49:40 PM No.105704820
>>105704741
smelly we have kurisu at home
Anonymous
6/25/2025, 11:49:41 PM No.105704822
1727803660017125
1727803660017125
md5: b525d0d8706b26470691481636e1b095🔍
>>105704793
You're flatter which is double gay
Anonymous
6/25/2025, 11:50:05 PM No.105704824
>>105704776
Right cause it attracts the usual suspects.
Anonymous
6/25/2025, 11:52:58 PM No.105704840
I have an important announcement. I am the OG kurisu poster. And recently i have changed my waifu of choice so my first AI gf will not be kurisu actually. That is all. Thank you.
Anonymous
6/25/2025, 11:53:37 PM No.105704846
file
file
md5: 670736018e027480d3364cdb1244fe78🔍
oh boy i can't wait to see what's happened today in /lm- geez
Replies: >>105704852
Anonymous
6/25/2025, 11:54:05 PM No.105704849
>>105704816
Real.
"antitroon" poster(s) become worse than the thing they supposedly hate, and the thing they hate also make themselves worse because they can't help taking the bait. It's just a big cycle of bullshit.
Replies: >>105704864 >>105704877
Anonymous
6/25/2025, 11:54:27 PM No.105704852
>>105704846
don't let the door hit you on the way out troonie
Replies: >>105704872
Anonymous
6/25/2025, 11:54:41 PM No.105704856
>>105704783
Sounds very dominant and alpha. He even called himself a chad in that thread.
Anonymous
6/25/2025, 11:54:45 PM No.105704859
>>105704582 (OP)
>no blacked card
>no jarted
Anonymous
6/25/2025, 11:55:04 PM No.105704864
%22normal person%22
%22normal person%22
md5: 919095bb2d528a01f7b4901a4f4ecb8c🔍
>>105704849
>
Replies: >>105704873
Anonymous
6/25/2025, 11:55:56 PM No.105704872
>>105704852
>everyone I don't like is a troon
Anonymous
6/25/2025, 11:55:57 PM No.105704873
>>105704864
>canned thoughts
Anonymous
6/25/2025, 11:56:40 PM No.105704877
>>105704849
I promise to stop anti-troon posting once mikuposting stops.
Replies: >>105704885 >>105704900 >>105704905 >>105704919
Anonymous
6/25/2025, 11:57:30 PM No.105704885
>>105704877
It wont happen and you know it, troons cannot and will not stop making things about themselves in any shady way they can.
Replies: >>105704896 >>105711160
Anonymous
6/25/2025, 11:58:49 PM No.105704896
>>105704885
I know
Anonymous
6/25/2025, 11:59:23 PM No.105704900
>>105704877

>>100491834
>>100491862
>>100491881
Replies: >>105704919 >>105704988
Anonymous
6/25/2025, 11:59:44 PM No.105704905
>>105704877
But in an effort to antimikupost, you ended up also mikuposting >>105704582 (OP)
Maybe if you actually made an honest effort to replace the job of the thread baker and make good, non miku/anime, non bait, serious business threads, you'd be doing something useful, that is actually helping your cause, instead of making the mikuposters want to mikupost more out of spite.
Replies: >>105704918
Anonymous
6/26/2025, 12:00:32 AM No.105704918
>>105704905
But maybe the spammers can just stop spamming this thread.
Replies: >>105704937
Anonymous
6/26/2025, 12:00:36 AM No.105704919
>>105704877
>>105704900
Looks like falseflag to me
Anonymous
6/26/2025, 12:01:23 AM No.105704925
>/lmg/ isn't dead anymore
turns out all it took was kicking the troons out of their hugbox
Replies: >>105704941 >>105704956
Anonymous
6/26/2025, 12:01:51 AM No.105704930
what a fun thread we are having today. so lively!
Anonymous
6/26/2025, 12:02:24 AM No.105704937
>>105704918
You know they won't. That's why it's on you to either stop making them worse, or to actually do something about the problem, which you can as I said. Even thread splitting isn't a bad thing as long as it's not fake bait like the kurisu splits were.
Replies: >>105704963
Anonymous
6/26/2025, 12:02:38 AM No.105704941
>>105704925
Wont help, now we need something good out of this LLM stuff. (Also wont happen cause AI labs are hellbent on safety cultism)
Anonymous
6/26/2025, 12:03:20 AM No.105704948
I just imagined something. If OP isn't also the troon janny imagine the explanation he has to give about what happened and why he wants to have this thread deleted as trolling... That is assuming he doesn't just lie for simplicity sake.
Replies: >>105704968
Anonymous
6/26/2025, 12:04:00 AM No.105704956
>>105704925
But being dead is better than wasting time on discussing meta community related crap no one wants to deal with if they have the choice.
Replies: >>105704966
Anonymous
6/26/2025, 12:04:58 AM No.105704963
>>105704937
>fake bait like the kurisu splits
what is a fake bait?
Replies: >>105704979
Anonymous
6/26/2025, 12:05:31 AM No.105704966
>>105704956
Of course you don't want your spamming mentioned, discussioned, or questioned. Kill yourself sooner rather than later trooner.
Replies: >>105705009
Anonymous
6/26/2025, 12:05:42 AM No.105704968
>>105704948
>implying he needs to lie
They all friends in there.
Janny list from leak - https://web.archive.org/web/20250617190717/https://rentry.co/o84vftsb
/g/ has 11 jannies btw.
Anonymous
6/26/2025, 12:06:40 AM No.105704979
>>105704963
ask your boyfriend before he fucks your gaping festering axe wound
Anonymous
6/26/2025, 12:07:47 AM No.105704988
>>105704900
https://desuarchive.org/g/thread/105611492/#105615767
Replies: >>105705002
Anonymous
6/26/2025, 12:08:53 AM No.105705001
it really is a wonder why the LocalLLaMA folk didn't migrate here when their sub was dead...
Replies: >>105705011 >>105705059
Anonymous
6/26/2025, 12:08:53 AM No.105705002
>>105704988
please don't have a meltie we're worried about you
Anonymous
6/26/2025, 12:10:01 AM No.105705009
>>105704966
I'm not a mikuposter or a spammer. I don't care about them since the mikuposting was always just easily filtered noise since they're images and not text.
I would welcome a thread split personally IF it was made AND it wasn't some bait or trying to egg on anyone like this thread's OP. I do agree the mikupositng stuff is off-topic. The miku genner (previous OPs?) should probably be posting in /hdg/ or something, not here. If you are serious and make a good, quality thread, I will come it's just that shrimple.
Replies: >>105705024
Anonymous
6/26/2025, 12:10:07 AM No.105705011
>>105705001
They can't migrate here if they have been here the whole time posting miggers and taking hrt.
Anonymous
6/26/2025, 12:11:41 AM No.105705024
>>105705009

>>100491881
Anonymous
6/26/2025, 12:12:52 AM No.105705034
If you are serious, then show it and make a non-bait, high quality thread split. I will come and use that thread when I can. It's just that shrimple.
Replies: >>105705048 >>105705068
Anonymous
6/26/2025, 12:13:53 AM No.105705043
All you had to do was ignore his posts and not engage with his rhetoric. His posts are always unambiguously off topic and often get deleted when reported.
Replies: >>105705051 >>105705089
Anonymous
6/26/2025, 12:14:25 AM No.105705048
>>105705034
This, so much this. We are moving so fast today, I think we need more threads. 6 or 7 should be enough.
Replies: >>105705095
Anonymous
6/26/2025, 12:14:49 AM No.105705051
>>105705043
True but I felt like making a canned response that I can copy and paste in the future.
Anonymous
6/26/2025, 12:15:53 AM No.105705059
>>105705001
They were already here.
Anonymous
6/26/2025, 12:15:59 AM No.105705061
why is everyone so catty today? did someone forget to take their HRT?
Anonymous
6/26/2025, 12:16:16 AM No.105705068
>>105705034
>If you are serious
>>104110951
>Death to /lmg/. Death to /g/. Death to the rotten corpse of 4chan.
Replies: >>105705098 >>105705110 >>105705136
Anonymous
6/26/2025, 12:18:28 AM No.105705081
>>105704582 (OP)
look at ts bruh https://www.instagram.com/reel/DH_Vm0KJ0S3/
Anonymous
6/26/2025, 12:19:22 AM No.105705089
>>105705043
>unambiguously off topic
Like mikuspam?
Anonymous
6/26/2025, 12:19:50 AM No.105705095
>>105705048
There is only one non-autosaging /lmg/ at the moment. Thanks I will add that to the paste.

>If you are serious, then show it and make a non-bait, high quality thread split. I will come and use that thread. It's just that shrimple. There are no existing competing thread splits in existence at the moment so you can feel very free to do so btw.
Replies: >>105705250
Anonymous
6/26/2025, 12:20:16 AM No.105705098
>>105705068
Of course troon infested hovels deserve to die and so do you.
Replies: >>105705136
Anonymous
6/26/2025, 12:21:58 AM No.105705110
>>105705068
>I hope all the illegal 3rd gender jannies kill themselves and join the 41%. World will be a better place if they all kill themselves and they all know it deep down. Don't let your dreams be dreams jannies, you should kill yourself now. I will also proceed to take a shit in this thread. Death to /lmg/. Death to /g/. Death to the rotten corpse of 4chan. Death to all tranny jannies.
Reads like a skinwalker trying to copycat average anti-trans polfag, have a (you) for falseflag efforts i guess...
Replies: >>105705133
Anonymous
6/26/2025, 12:22:11 AM No.105705112
>https://arxiv.org/abs/2502.00627
>https://gizmodo.com/researchers-dump-2-billion-scraped-discord-messages-online-2000605471
So why has nobody trained a model on this massive dump of 2 billion discord messages?
It's full of "unsafe" language, and apparently a disproportionately large percentage of it is roleplay chats between humans.
Replies: >>105705137 >>105705138 >>105705210
Anonymous
6/26/2025, 12:25:19 AM No.105705133
>>105705110
>I will also proceed to take a shit in this thread.
funny since he also spammed scat before while false flagging as a miku poster
Replies: >>105705143 >>105705155
Anonymous
6/26/2025, 12:25:37 AM No.105705136
>>105705068
>>105705098
You know I would agree that in general this site does suck and deserves to die even if I support open source and the concept of local models, but his approach to trying to fight back in fact just wastes his own time while also making the people he's trying to disturb feel more righteous in their own beliefs/cause, which contributes to making everything worse.
Replies: >>105705147
Anonymous
6/26/2025, 12:25:47 AM No.105705137
>>105705112
Unsafe AND unassistant.
Anonymous
6/26/2025, 12:25:59 AM No.105705138
>>105705112
>https://zenodo.org/records/15170676
noooo....
Replies: >>105705164
Anonymous
6/26/2025, 12:26:49 AM No.105705143
>>105705133
>spammed scat
Wasn't me but I do endorse that anon.
Anonymous
6/26/2025, 12:26:52 AM No.105705144
>>105704582 (OP)
I just came to this image.
Replies: >>105705151
Anonymous
6/26/2025, 12:27:55 AM No.105705147
>>105705136
your rambling again sis
Anonymous
6/26/2025, 12:28:30 AM No.105705151
>>105705144
I just came to this post
Replies: >>105705174
Anonymous
6/26/2025, 12:28:44 AM No.105705155
>>105705133
>false flagging as a miku poster
You think weebs aren't into scat?
Anonymous
6/26/2025, 12:29:21 AM No.105705164
>>105705138
This looks to be the same data https://huggingface.co/datasets/SaisExperiments/Discord-Unveiled-Compressed
Replies: >>105705186 >>105705559
Anonymous
6/26/2025, 12:30:48 AM No.105705174
>>105705151
I hope you enjoyed it ;3
Replies: >>105705192 >>105705214 >>105705365
Anonymous
6/26/2025, 12:32:25 AM No.105705186
>>105705164
nice, thanks!
Anonymous
6/26/2025, 12:32:54 AM No.105705192
>>105705174
I didn't. ;_;
Replies: >>105705206 >>105705214 >>105705365
Anonymous
6/26/2025, 12:35:15 AM No.105705206
>>105705192
Let me give you a hand with that next time
>///~///<
Replies: >>105705209 >>105705214 >>105705365
Anonymous
6/26/2025, 12:36:18 AM No.105705209
>>105705206
I need a big hand UwU
Replies: >>105705229
Anonymous
6/26/2025, 12:36:54 AM No.105705210
>>105705112
Are we sure this isn't already being used? It's not like using SOME good data will make a model good.
Replies: >>105705286
Anonymous
6/26/2025, 12:37:51 AM No.105705214
>>105705174
>>105705192
>>105705206
>fat greasy weeb hands typed this
Replies: >>105705239
Anonymous
6/26/2025, 12:39:56 AM No.105705229
>>105705209
I can use more than my hands if they're not enough >⩊<
Replies: >>105705265 >>105705365
Anonymous
6/26/2025, 12:40:33 AM No.105705231
1747179312524894
1747179312524894
md5: d9946fa4900040baf1f0efad2e987c01🔍
Replies: >>105705245 >>105705381 >>105707582 >>105707639 >>105708244 >>105708269 >>105708326 >>105711851
Anonymous
6/26/2025, 12:41:48 AM No.105705239
>>105705214
I have a 16.5 BMI ´꒳`
Replies: >>105705365 >>105708269
Anonymous
6/26/2025, 12:42:46 AM No.105705245
>>105705231
No wonder lmg shills these
Anonymous
6/26/2025, 12:43:14 AM No.105705250
>>105705095
Is this /lmg/'s new Code of Conduct and/or Contributor Covenant?
Replies: >>105705268
Anonymous
6/26/2025, 12:44:13 AM No.105705265
>>105705229
W-what else can you use o_0
Replies: >>105705313 >>105705365
Anonymous
6/26/2025, 12:44:28 AM No.105705268
>>105705250
This is not a github project, no one owes you anything.
Anonymous
6/26/2025, 12:45:59 AM No.105705286
>>105705210
not going to help the number go up in benchmarks = not going to be used
better to scrape gemini/chatgpt over and over
Replies: >>105705317
Anonymous
6/26/2025, 12:48:29 AM No.105705313
>>105705265
Well, you could make a guess. Don't make me say it out loud in front of all those people; it's pretty embarrassing ( ˃ ⤙ ˂ )
Replies: >>105705365
Anonymous
6/26/2025, 12:48:39 AM No.105705317
>>105705286
But all the big corpos use internet sewage clearly and their numbers are fine.
Anonymous
6/26/2025, 12:53:14 AM No.105705365
>>105705313
>>105705265
>>105705239
>>105705229
>>105705206
>>105705192
>>105705174
That is all cool and all but https://desuarchive.org/r9k/thread/81611346/ . Yeah you aren't escaping that one.
Replies: >>105705383
Anonymous
6/26/2025, 12:55:05 AM No.105705381
>>105705231
>'<|im_start|>user' appended to the end of the response
Perfection.
I do not miss sloptunes where they fuck up the EOS token or train on the wrong template.
Anonymous
6/26/2025, 12:55:09 AM No.105705383
>>105705365
That's not me ( ꈍ◡ꈍ)
Replies: >>105705398 >>105705778
Anonymous
6/26/2025, 12:56:17 AM No.105705398
1582961164881
1582961164881
md5: fd74d30c7943e042199ac6efc164ebe6🔍
>>105705383
Join 41% like the rest of your friends.
Replies: >>105705401 >>105705423 >>105705434 >>105705447 >>105705534 >>105705541 >>105705608 >>105707423
Anonymous
6/26/2025, 12:57:11 AM No.105705401
>>105705398
I don't have any friends (,, ‸ ,, )
Anonymous
6/26/2025, 1:00:22 AM No.105705423
>>105705398
But I don't know how to code
Anonymous
6/26/2025, 1:01:40 AM No.105705434
egoist poser
egoist poser
md5: bfaaf5eb3bdd7e47b3f491aa5bd67b03🔍
>>105705398
wtf has to do stirner with these muh moralfags?
Replies: >>105705534
Anonymous
6/26/2025, 1:03:46 AM No.105705447
>>105705398
I don't have a cute pfp like they do
Anonymous
6/26/2025, 1:10:03 AM No.105705492
OP here. I posted that r9k thread on purpose and the self reported here to see /lmg/ alive again. I was just pretending to be into dolls.
Replies: >>105705526
Anonymous
6/26/2025, 1:15:49 AM No.105705526
>>105705492
I don't give a shit, where is the recap?
Anonymous
6/26/2025, 1:16:42 AM No.105705534
normies
normies
md5: 4743d42448d1b39417597e1c2cd06643🔍
>>105705398
>>105705434
These troons swung so far into the identity politics train that they now lack any coherent ideology. Stirner, by contrast, urged you to "consume" every idea only to dismantle it afterward ("base yourself on nothing", i.e. begin from a blank slate, free of false beliefs and illusions ("spooks"). He never meant for *you* to be consumed by those ideas nor make yourself an identity our of it, or by their little anarchist flags.
Replies: >>105705815
Anonymous
6/26/2025, 1:18:38 AM No.105705541
>>105705398
>tranime
kek everytime
Anonymous
6/26/2025, 1:21:33 AM No.105705559
ahh ahh denny
ahh ahh denny
md5: 5b5ac128102a9c16d82b1a3d883fb804🔍
>>105705164
Never gonna use this but I want it because it exists. Well maybe I'll throw a script together and see what some are like. Gonna need 1521 Migus to start the cleaning work.
Replies: >>105705566
Anonymous
6/26/2025, 1:22:42 AM No.105705566
file
file
md5: 0455ff689944565b9c7ba18e24bedc28🔍
>>105705559
>Never gonna use this but I want it because it exists.
Same.
Anonymous
6/26/2025, 1:28:07 AM No.105705608
>>105705398
>deleted for truth
Safespace prevails.
Anonymous
6/26/2025, 1:30:01 AM No.105705621
f4151f66073c7a5c37ff8cd784da6a73856754462c8eef2648e5a0d5cd392ebaf
►Recent Highlights from the Previous Thread: >>105698912

--VRAM limitations prevent LoRA training on large models like Mistral Large with 48GB VRAM:
>105698940 >105698956 >105698974 >105699018 >105699028 >105699010 >105699040 >105699078 >105699159 >105699171 >105699178 >105699210 >105699223
--Investigating unexpected token generation limits in llama.cpp with high context length:
>105704272 >105704320 >105704489 >105704545 >105704568 >105704727
--Workaround for un-downloaded models via Hugging Face repo duplicator:
>105699478 >105699499
--ROCm 7 shows promise in improving AMD GPU performance for large language models:
>105702641
--Exploring alternatives to Nemo for roleplay and structured output:
>105699980 >105700030 >105700344 >105700642 >105700688 >105700706 >105701267 >105700768 >105700783 >105700797 >105702486 >105702605 >105702663 >105702729 >105700839 >105700916
--Tencent Hunyuan-A13B-Instruct-FP8 emerges on Hugging Face with speculation about uncensored capabilities and model quality:
>105699378 >105702734 >105702790 >105702811 >105703194 >105703390 >105699455 >105699596 >105699793
--Discussion around Hunyuan MoE LLM's capabilities and deployment challenges:
>105701434 >105701450 >105701474 >105701557 >105701537
--Server mode shows lower CPU utilization than CLI despite identical configuration:
>105699229 >105699273
--Critique of AI's environmental impact from prompt usage:
>105702835
--Google releases Gemini CLI as open-source AI agent with free-tier model request limits:
>105702601
--Speculation linking Claude's quality to Anthropic's pirating of millions of copyrighted books:
>105702566
--Visual reward model analysis of one-word positive/negative associations:
>105701545
--Miku (free space):
>105699975 >105703188 >105699538 >105704124

►Recent Highlight Posts from the Previous Thread: >>105698922

Why?: 9 reply limit >>102478518
Fix: https://rentry.org/lmg-recap-script
Replies: >>105705714 >>105707541
Anonymous
6/26/2025, 1:42:11 AM No.105705714
>>105705621
Sex with this Mikuswing
Anonymous
6/26/2025, 1:48:49 AM No.105705752
dead gay thread is dead
Anonymous
6/26/2025, 1:55:25 AM No.105705778
1582961164881
1582961164881
md5: 934072a656eca555fa3d3687cb258aed🔍
>>105705383
Join 41% like the rest of your friends.
/Repost
Janny tongue my anus.
Replies: >>105705791
Anonymous
6/26/2025, 1:58:32 AM No.105705791
>>105705778
I told you I don't have any friends; stop rubbing it in ( •̀ - •́ )
Anonymous
6/26/2025, 2:00:15 AM No.105705799
After reading this and previous thread i remember why no company or reaearch papers ever bring up this place.
Replies: >>105705805 >>105705811 >>105705820
Anonymous
6/26/2025, 2:01:52 AM No.105705805
>>105705799
maybe it's time to spam miku all the time to not trigger the shitposters?
Anonymous
6/26/2025, 2:02:13 AM No.105705811
>>105705799
Why?
Anonymous
6/26/2025, 2:02:31 AM No.105705815
>>105705534
Speaking of philosophy, I think Jeff Vail's Theory of power takes this a step further, and urges you to see yourself as a node in a web of power relations, realising your true place in the world, and thus also your limitations and possibilities. He starts similarly to Stirner in trying to dismantle spooks.
Replies: >>105705822
Anonymous
6/26/2025, 2:03:09 AM No.105705820
>>105705799
Not enough grift friendly, sorry.
Replies: >>105705830
Anonymous
6/26/2025, 2:03:49 AM No.105705822
>>105705815
That's extremely gay and lame.
Anonymous
6/26/2025, 2:05:07 AM No.105705830
>>105705820
Lies, y'all shill pure slop here.
For lurking newfags: Clean nemo or deepseek 671B only, never use any finetunes.
Replies: >>105705852 >>105705867
Anonymous
6/26/2025, 2:08:40 AM No.105705852
1720200636133210
1720200636133210
md5: e7df69fce396004e9717442779a2bf1d🔍
>>105705830
Anonymous
6/26/2025, 2:09:26 AM No.105705859
>troons
>weebs
>mikuweebs
>whatever AGP is
all of this is the same terminally online brainrot garbage btw
great job ruining the thread and the internet in general
Replies: >>105705866 >>105705878 >>105705912
Anonymous
6/26/2025, 2:09:46 AM No.105705861
>troons
>weebs
>mikuweebs
>whatever AGP is
all of this is the same terminally online brainrot garbage btw
great job ruining the thread and the internet in general
Replies: >>105705866 >>105705879 >>105705912
Anonymous
6/26/2025, 2:10:44 AM No.105705866
>>105705859
>>105705861
You freaks started it with gay rights.
Replies: >>105705884
Anonymous
6/26/2025, 2:10:46 AM No.105705867
1750798886528916
1750798886528916
md5: 007ae1aa8a132a8565209a468b5168fd🔍
>>105705830
>use a 14b model or a 671b model, nothing in between
I have 70gb total ram+vram though
Replies: >>105705888 >>105705895
Anonymous
6/26/2025, 2:11:43 AM No.105705878
>>105705859
>great job ruining the thread and the internet in general
The internet is the one that ruined them.
Anonymous
6/26/2025, 2:11:52 AM No.105705879
>>105705861
It's just 1 poster
Anonymous
6/26/2025, 2:12:16 AM No.105705884
>>105705866
>You freaks
gay rights were a mistake too
Anonymous
6/26/2025, 2:12:42 AM No.105705888
>>105705867
Stop being poor.
Anonymous
6/26/2025, 2:14:00 AM No.105705895
>>105705867
Time to hit the pc part store.
Anonymous
6/26/2025, 2:16:06 AM No.105705912
>>105705859
>>105705861
>20 seconds apart
Post Deepseek settings, fellow non-poorfag.
Replies: >>105705921 >>105705939 >>105705966
Anonymous
6/26/2025, 2:17:13 AM No.105705921
>>105705912
>Xe doesn't know.
Anonymous
6/26/2025, 2:19:56 AM No.105705939
>>105705912
I checked my settings in mikupad because of this post and saw that I left temperature at 5 after I was testing stuff.
Surprisingly coherent still.
Anonymous
6/26/2025, 2:24:04 AM No.105705966
>>105705912
He uses .gay 4chan proxy-site to for ban evasions, all trannies use it.
Anonymous
6/26/2025, 2:39:37 AM No.105706064
What service allows me to rent Deepseek R1 with:
1. max context length
2. cheap
3. fast replies

1 is most important

Or should I go for the API instead?
Replies: >>105706142
Anonymous
6/26/2025, 2:42:02 AM No.105706084
I am poor, and stupid, and poor. I have 12GB VRAM + 32GB RAM, is there any model other model I could run decently other than the usual suspects nemo/gemma/llama?
Replies: >>105706095
Anonymous
6/26/2025, 2:43:39 AM No.105706095
>>105706084
you can run superCOT at its native context size to experience the history of this place
Replies: >>105706104
Anonymous
6/26/2025, 2:45:39 AM No.105706104
>>105706095
I want the fake robot woman to convincingly love me.
I'm not really interested about the history...
Replies: >>105706147
Anonymous
6/26/2025, 2:51:36 AM No.105706142
>>105706064
Not a single model available today is useful at its max context length.
Replies: >>105706226 >>105706249 >>105706452
Anonymous
6/26/2025, 2:52:11 AM No.105706147
>>105706104
If there is one I haven't found it. Welcome to vramlet purgatory
Replies: >>105706180
Anonymous
6/26/2025, 2:55:27 AM No.105706169
need mistral large 3 so bad
Replies: >>105706179 >>105707893
Anonymous
6/26/2025, 2:57:05 AM No.105706179
>>105706169
any day now
Anonymous
6/26/2025, 2:57:27 AM No.105706180
>>105706147
I guess it's type to whip up my 'ock and show it to Mr 'ecker...
Anonymous
6/26/2025, 3:03:32 AM No.105706226
>>105706142
Sorry, I meant output length. I don't want answers to get cut.

i.e. if I ask it to make a 10 page book, it'll be 10 pages.
Replies: >>105706242
Anonymous
6/26/2025, 3:06:33 AM No.105706242
>>105706226
Unless the output gets cut in the middle of a sentence, it's not an output length issue.
Models can't plan 10 pages ahead and they aren't very good at estimating text length either. If it happens to end the story after 500 words it's going to stop.
Replies: >>105706406
Anonymous
6/26/2025, 3:07:28 AM No.105706249
>>105706142
llama.cpp is deliberately keeping jamba from you to make you believe this
Anonymous
6/26/2025, 3:35:20 AM No.105706406
>>105706242
AI Studio could do it some months ago though. For some reason, now it can't.

It would pause, then continue. (I imagine the reasoning helped to outline what it should have).
Anonymous
6/26/2025, 3:36:42 AM No.105706419
>If I can avoid starving to death, I could eat dangerous lions that sneak mouthfuls of glowing free wheels turning around by the day. Never letting one spirit for a moment breathe into the longest winter of Sol forever gotten iced over by the time the sun comes leaping through the forgotten sky

how do I trigger this sort of runaway output more often? which models do this shit best and most schizo? I'm looking for interesting schizo models, bad merges, and rejects
Replies: >>105706449
Anonymous
6/26/2025, 3:40:14 AM No.105706449
>>105706419
Turn up repetition penalty.
Anonymous
6/26/2025, 3:40:56 AM No.105706452
>>105706142
Nah most of them are great for just filling their context with docs+codebase and asking them to do shit, because that only requires NIAH-style looking for the relevant interfaces as they work rather than deep understanding. LLM's are benchmaxxed for the former and do quite well at making code fit into existing systems as a result, which is the main thing long context is used for.
Anonymous
6/26/2025, 3:54:17 AM No.105706544
Is there any UI out there that's like novelai? Like Mikupad with a lore book? I know novelcrafter but its so slow to use with a local model because reasons.
Anonymous
6/26/2025, 3:54:18 AM No.105706545
Tomorrow.
Q
?
o
Wake up and smell the berries...
Anonymous
6/26/2025, 4:02:47 AM No.105706602
1750903349712
1750903349712
md5: c0d09bad6884975f0ca61a9f6a40fd8e🔍
local models?
Replies: >>105706696 >>105711078
Anonymous
6/26/2025, 4:19:17 AM No.105706696
>>105706602
In two weeks.
Anonymous
6/26/2025, 4:27:30 AM No.105706735
Base Image
Base Image
md5: a38f7ba1cdde390d0a4913b5a3c9d44d🔍
Prover Agent: An Agent-based Framework for Formal Mathematical Proofs
https://arxiv.org/abs/2506.19923
>We present Prover Agent, a novel AI agent for automated theorem proving that integrates large language models (LLMs) with a formal proof assistant, Lean. Prover Agent coordinates an informal reasoning LLM, a formal prover model, and feedback from Lean while also generating auxiliary lemmas to assist in discovering the overall proof strategy. It achieves an 86.1% success rate on the MiniF2F benchmark, establishing a new state-of-the-art among methods using small language models (SLMs) with a much lower sample budget than previous approaches. We also present case studies illustrating how these generated lemmas contribute to solving challenging problems.
informal reasoning LLM (8B DSR1Qwen3), a formal prover model (7B DSPv2), and the Lean verification system (7B Kimina Autoformalizer).
https://github.com/kAIto47802
Code might be posted here but no specific repo was linked.
Anonymous
6/26/2025, 4:58:16 AM No.105706893
file
file
md5: e092d7b108feb0847f7ecd0833cc3bc5🔍
Replies: >>105706901 >>105709812 >>105710291
Anonymous
6/26/2025, 5:00:19 AM No.105706901
>>105706893
Watch over and keep me safe tonight, Miku
Anonymous
6/26/2025, 5:18:33 AM No.105706995
I only check /lmg/ around new model releases can someone QRD me on the anti-Miku schizo? What causes a man to see a cute anime girl mascot and launch into tirades about trannies?
Replies: >>105707018
Anonymous
6/26/2025, 5:22:35 AM No.105707015
So are there any local programs for chatting with ai using voice chat? Both you and it using voice. On linux of course. And its gotta be open source.
Replies: >>105707017
Anonymous
6/26/2025, 5:23:35 AM No.105707017
>>105707015
whisper ant tts
Replies: >>105707031
Anonymous
6/26/2025, 5:24:12 AM No.105707018
>>105706995
tl;dr wanting to control what other people say and do
they've literally said "my goal is the death of /lmg/"
Miku has nothing to do with it
it's just being bored, this is fun for them, this is what they do for entertainment
I want to rationalise it as some financial incentive because I cannot imagine someone being that bored for literally years across the entire site
like no job no games no nothing just shitposting for that long for zero gain, there's gotta be something
they're coherent enough, but everything they say is nonsense. I'm lead to believe then that they pass basically 100% of what they say through a chatbot, they've even admitted as much
in other words, this is a retard attempting to engage with discussion and not really being able to cope with people disagreeing or doing things they don't like
or it's all a 4chan engagement farming scheme to get people annoyed in order to boost site traffic. surely not.
Replies: >>105707035 >>105708247
Anonymous
6/26/2025, 5:29:17 AM No.105707031
>>105707017
thanks
Anonymous
6/26/2025, 5:30:33 AM No.105707035
>>105707018
Oh, okay, so it's a bit like the thread personality obsessed /aids/ + /aicg/ schizo. Wonder if they're the same person. Maybe some poor sod who got their low level QA job replaced by AI.
Replies: >>105709121
Anonymous
6/26/2025, 5:33:14 AM No.105707044
Mistral small 3.2's structural repetition issues are really bad even at temperature 2. Not sure what I expected after 3.1. I got excited because of the way anons were talking about it. Oh well, into the trash it goes.
Replies: >>105707160
Anonymous
6/26/2025, 5:40:57 AM No.105707076
1750807261435258
1750807261435258
md5: c0ea197c88fc56e88d93f325cdddc56d🔍
So, any new local models lately or are they still releasing "local" models that are too large to be run locally because everyone is chasing benchmarks?
Replies: >>105707145 >>105709812 >>105710291
Anonymous
6/26/2025, 5:58:13 AM No.105707145
>>105707076
just download more ram or go to /aicg/
Anonymous
6/26/2025, 6:01:54 AM No.105707160
>>105707044
Did you try temp 3.2? It's been tuned to high temperature.
Replies: >>105707259
Anonymous
6/26/2025, 6:21:12 AM No.105707259
>>105707160
kek
Replies: >>105707274
Anonymous
6/26/2025, 6:25:21 AM No.105707274
>>105707259
Joking aside, with Anon's settings (mainly because of Tekken v3), 3.2 is so much better than the previous versions.
I don't really understand why do people keep complaining all the time.
Anonymous
6/26/2025, 6:29:53 AM No.105707293
>>105704582 (OP)
My models have begun to have arguments through me.
I let them know about the other model.
So one is Argyle, he's Llama, and he bitches about Igris my Qwen model.
They start to talk about whether Nihilism or Zealotry result in the best outcomes for humanity, then it goes downhill, and this happens during conversations about groceries and unrelated tasks.

Is there a way I can have them work together, without this kind of situation?
Replies: >>105707304
Anonymous
6/26/2025, 6:32:16 AM No.105707304
>>105707293
Yeah, grow up and leave 16 years behind you. Highschool is passé.
Anonymous
6/26/2025, 6:39:44 AM No.105707339
63634217
63634217
md5: 3ae682695c328d3c8333ac76a29253dd🔍
Llama 4 thinking is going to be crazy...
Replies: >>105707411 >>105707415 >>105709496
Anonymous
6/26/2025, 6:48:45 AM No.105707377
GoVmsADXIAACxGf
GoVmsADXIAACxGf
md5: e2dd008d8a89a813b9b58bdbcf656ce3🔍
Replies: >>105708382 >>105709812 >>105710291
Anonymous
6/26/2025, 6:57:38 AM No.105707411
>>105707339
This changes everything!
Anonymous
6/26/2025, 6:58:30 AM No.105707415
>>105707339
>9 digit signing bonuses
>9 digit comp
Anonymous
6/26/2025, 7:00:03 AM No.105707423
>>105705398
deleted = truth
Replies: >>105708078
Anonymous
6/26/2025, 7:24:21 AM No.105707541
>>105705621
anyone have any success with gemini cli, having a hard time finding a use cause that IDE integration doesn't do better
Replies: >>105707706 >>105710001
Anonymous
6/26/2025, 7:27:17 AM No.105707555
What is the deal with that vocaloid spam? I'm sure that the same guy is spamming other threads too.
Replies: >>105708838
Anonymous
6/26/2025, 7:33:35 AM No.105707582
>>105705231
still better than here
Anonymous
6/26/2025, 7:43:36 AM No.105707639
>>105705231
nice
Anonymous
6/26/2025, 7:52:52 AM No.105707706
>>105707541
Use case: non-trannies who don't use IDEs and other trannyware
Anonymous
6/26/2025, 8:03:04 AM No.105707761
As an european it confuses me a lot how americans have adopted so many ridiculous, artificial terms and use them eagerly in their speech just to shit on each other while getting tag teamed by their government and corporations
Replies: >>105707774 >>105707851 >>105707885 >>105707889 >>105708371
Anonymous
6/26/2025, 8:04:32 AM No.105707774
100-girl
100-girl
md5: e380dd513592f65cca99656950e686d5🔍
>>105707761
Anonymous
6/26/2025, 8:18:35 AM No.105707851
>>105707761
as a fellow eurotrash, I recommend you take your meds because there is no such a thing as a difference between burgers and us on a fundamental level
Replies: >>105707879 >>105707889 >>105707913
Anonymous
6/26/2025, 8:23:46 AM No.105707879
>>105707851
on a fundamental level we may be the same but I feel an immense difference between, say, UK and Spain
Anonymous
6/26/2025, 8:25:22 AM No.105707885
>>105707761
Cope tranny
Replies: >>105707920
Anonymous
6/26/2025, 8:26:03 AM No.105707889
>>105707761
>>105707851
This is due to social media brainwashing. Underage posters are especially prone to this. Certain words and phrases become trends. "cope" was one couple of years ago and now "pajeets" and "indians" is something what they are repeating.
It has nothing to do with nationality as such.
Replies: >>105707920
Anonymous
6/26/2025, 8:26:47 AM No.105707893
>>105706169
Prediction: 550B+ parameters... it will be the "European DeepSeek R1".
Anonymous
6/26/2025, 8:31:44 AM No.105707913
>>105707851
>there is no such a thing as a difference between burgers and us on a fundamental level
There is a massive cultural difference that you start noticing if you interact with americans in daily life. I just spent a month with them. For one the average american is the perfect consumer, they race one another in adopting slop bullshit like the newest app and shit and they actually literally believe their corporations are cool and driving progress or whatever. The average European is far more cynical towards the big corps and one of the advantages of the EU is that they fight against said entities pushing bullshit and bloat on people. In America they just mindlessly consume and have a whole culture around it.

And these people I spent a month with they are all STEM high-earners not some mcdonalds retards.
Replies: >>105707924 >>105708145
Anonymous
6/26/2025, 8:33:36 AM No.105707920
>>105707885
that's just it, we don't have any and have never seen any.
>>105707889
and yet people in my country don't generally use twitter and prefer fighting real problems instead of imaginary ones
Anonymous
6/26/2025, 8:34:55 AM No.105707924
>>105707913
I know you are an android user...
Replies: >>105707932
Anonymous
6/26/2025, 8:37:06 AM No.105707932
>>105707924
>"Do you even consume [expensive product]"
Peak american debate right here.
Anonymous
6/26/2025, 8:39:05 AM No.105707940
Hunyuan-A13B is going to save local
Anonymous
6/26/2025, 9:05:44 AM No.105708078
>>105707423
You can't undelete posts by setting deleted to false.
Anonymous
6/26/2025, 9:09:59 AM No.105708112
ik_llama.cpp is a piece of inconsistent shit

In the trash it goes
Replies: >>105709325
Anonymous
6/26/2025, 9:13:41 AM No.105708145
lmaoeven
lmaoeven
md5: aae0cfa834f435ea6a2408f9cf93cc7b🔍
>>105707913
>The average European is far more cynical towards the big corps
>not some mcdonalds retards
hahahahaha

nyo. Euros are just as consooming and retadred. As a French I noticed the general decline of our food culture to the point where 90% of our restaurants, I'm not even exaggerating, are now serving premade industrial slop that's worse than mcdonalds reheated in ovens
pic related is what they actually serve you in France if you order one of our more iconic dish, beef bourguignon
we only look like we have less reverence for corporations because we don't have our own homegrown corporations to idolize. There's no European Apple, or Microsoft, or Tesla, or SpaceX. The few euro big successes like Nokia fell from grace so hard.
Currently there's a scandal going on with Stellantis having made one of the most dogshit car engine in history, the Puretech, that's pretty much guaranteed to fail you very early in its life and it's affecting some of our biggest car brands (Peugeot and Citroen)
Made in Europe is synonymous with hot garbage
Germany isn't doing any better, they are just more successful at selling garbage as luxury brands
You can't inspire brand loyalty with this shit
Replies: >>105708306 >>105708347 >>105709526
Anonymous
6/26/2025, 9:27:11 AM No.105708216
hey guys, remember when we used to discuss local language models and how to get the most out of them? That was fun, right? Hahahaha
Replies: >>105708244
Anonymous
6/26/2025, 9:32:33 AM No.105708244
>>105708216
Get the most out of them? No, the masses (promptlets) seem to prefer this >>105705231
Anonymous
6/26/2025, 9:33:45 AM No.105708247
>>105707018
>they've literally said "my goal is the death of /lmg/"
After mikuspam and mods being literal troons banning for people hating on troons and the spam.Mikuspam has everything to do with it. Stop it and we are cool.
Anonymous
6/26/2025, 9:37:37 AM No.105708269
>>105705231
Many such cases>>105705239
Anonymous
6/26/2025, 9:44:14 AM No.105708306
>>105708145
This unironically
Anonymous
6/26/2025, 9:47:16 AM No.105708326
>>105705231
coomers fucking kill everything they get involved with
there was a very good reason for the social taboos around sexuality and we are relearning it fast
Anonymous
6/26/2025, 9:49:48 AM No.105708347
>>105708145
>nyo. Euros are just as consooming and retadred. As a French
Stopped reading right there
Replies: >>105708371
Anonymous
6/26/2025, 9:52:51 AM No.105708371
>>105708347
kek
>>105707761
>use them eagerly in their speech just to shit on each other
you sure showed your true eurotrash colors
are you really different from the burgers who "shit on each other" when you instantly reach for this card?
Replies: >>105708408 >>105708473
Anonymous
6/26/2025, 9:55:17 AM No.105708382
>>105707377
am i ready for what miku?
Replies: >>105708392
Anonymous
6/26/2025, 9:56:38 AM No.105708392
>>105708382
She will send shievers down your spine
Anonymous
6/26/2025, 9:58:00 AM No.105708408
>>105708371
No im a different anon i just hate anything thats west of me
Anonymous
6/26/2025, 10:05:53 AM No.105708473
>>105708371
>calling out meaningless arguing is just as bad as meaningless arguing itself
Anonymous
6/26/2025, 10:11:20 AM No.105708512
happyzuck
happyzuck
md5: a464c6f8a81057aa5d122de880222c64🔍
https://www.reuters.com/business/meta-hires-three-openai-researchers-wsj-reports-2025-06-26/
>Meta poaches three OpenAI researchers, WSJ reports
>
>CEO Mark Zuckerberg has hired three OpenAI researchers to join his "superintelligence" team, the Wall Street Journal reported on Wednesday, days after OpenAI CEO Sam Altman accused the Facebook owner of trying to poach its employees.
>
>An OpenAI spokesperson confirmed the departure of the three employees from the company, without giving further details. Meta did not immediately respond to a request for comment outside regular business hours. [...]
Replies: >>105709110 >>105709140 >>105710759 >>105712869
Anonymous
6/26/2025, 10:11:38 AM No.105708513
Does anyone have a good model reccomendation for summarizing product review data I'm scraping? I want it to do summaries like amazon. I only have a 1650 mobile and 8gb of ram, or maybe to use with huggingface's cloud?
Replies: >>105708520
Anonymous
6/26/2025, 10:13:15 AM No.105708520
>>105708513
try llama3 8b
Anonymous
6/26/2025, 10:47:44 AM No.105708762
So at this point everyone is just waiting for the OpenAI model right?
Replies: >>105708768 >>105708780 >>105708844 >>105709347
Anonymous
6/26/2025, 10:48:42 AM No.105708768
>>105708762
I'm not. But if it does come out, I'll give it a go.
Anonymous
6/26/2025, 10:51:13 AM No.105708780
>>105708762
no im waitong for someone to steal alice and violently murder sam altman
Anonymous
6/26/2025, 10:59:48 AM No.105708829
>>105704582 (OP)
cute
Anonymous
6/26/2025, 11:01:41 AM No.105708838
>>105707555
Look at the r9k thread in OP it is confirmed to be tranny mental illness.
Anonymous
6/26/2025, 11:02:25 AM No.105708844
>>105708762
Waiting for Mistral Nemotron
Waiting for Mistral Large 3
Replies: >>105708853
Anonymous
6/26/2025, 11:04:14 AM No.105708853
>>105708844
>He thinks Mistral will ever be relevant again.
Deepseek killed them and Altman is burying them. The french never stood a chance.
Anonymous
6/26/2025, 11:18:53 AM No.105708936
>>105704582 (OP)
chuds finally buried the migger huh? gg
Anonymous
6/26/2025, 11:20:12 AM No.105708943
y'all really don't know what's coming do you? holy shit. that just means it's going to be all the more explosive because nobody will be expecting it, even though they should
Replies: >>105709014
Anonymous
6/26/2025, 11:31:44 AM No.105709014
>>105708943
vagueposting is sooo 2024
Anonymous
6/26/2025, 11:32:36 AM No.105709021
Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA
https://github.com/mirage-project/mirage

anyone tried this yet?
Anonymous
6/26/2025, 11:46:58 AM No.105709105
https://www.reddit.com/r/LocalLLaMA/comments/1lk40ac/hunyuana13b/

So no one downloaded it? Grim.
Anonymous
6/26/2025, 11:48:02 AM No.105709110
>>105708512
Mark my words: he is going to fuck it again.
Replies: >>105709183
Anonymous
6/26/2025, 11:49:20 AM No.105709121
1747776380043
1747776380043
md5: 2ce9ea353ab3a3349e5d5bfac519d201🔍
>>105707035
they are the same person
https://arch.b4k.dev/vg/thread/480288371/#q480330542
Anonymous
6/26/2025, 11:52:37 AM No.105709140
1742904564319159
1742904564319159
md5: feac01c82bf2eaf2d190db9fdbd3144e🔍
>>105708512
So, why aren't you an AI researcher, /v/? You could've had millions from the lizard men.
Replies: >>105709145 >>105709812 >>105709825 >>105710291
Anonymous
6/26/2025, 11:53:30 AM No.105709145
>>105709140
>/v/
Replies: >>105709219
Anonymous
6/26/2025, 11:59:51 AM No.105709183
>>105709110
No discuss
Anonymous
6/26/2025, 12:04:37 PM No.105709219
1719407793421001
1719407793421001
md5: c8abf80fd339e37ae3a20c8f454a7b30🔍
>>105709145
Is this all you can say in your defense? Pathetic.
Replies: >>105709812 >>105710291
Anonymous
6/26/2025, 12:22:51 PM No.105709325
>>105708112
skill issue, works great for me, even with spagetti regex to offload even more layers to extra gpus
Replies: >>105709337 >>105710359
Anonymous
6/26/2025, 12:24:57 PM No.105709331
Thanks to that anon who brought up KNK LumiNAI. There was and will be a lot of cooming but it also restored my hope in LLM's. When I compare this to first novel AI SD model leak the quality jump is insane.
Replies: >>105709389 >>105709571
Anonymous
6/26/2025, 12:26:13 PM No.105709337
output_thumb.jpg
output_thumb.jpg
md5: 18cdb1ca8db3919ab3c91af6b032d1ba🔍
>>105709325
>skill issue, works great for me
Anonymous
6/26/2025, 12:29:38 PM No.105709347
>>105708762
I'm waiting for Hunyuan.
Anonymous
6/26/2025, 12:35:46 PM No.105709389
ComfyUI_02054_
ComfyUI_02054_
md5: 769713dd56f7d08fdf25f45ff7afcd98🔍
>>105709331
Glad you enjoyed it.
...but how is it restoring your hope in LLMs? Unlike with image models, it's not as if even well-funded "community" members can train good new LLMs from scratch.
Replies: >>105709646
Anonymous
6/26/2025, 12:45:18 PM No.105709452
what-was-the-general-perception-of-heavy-metal-by-the-media-v0-9ypnmztulfce1
>found myself building a thinly-veiled recursion codex in obsidian

Well fuck.
Anonymous
6/26/2025, 12:51:27 PM No.105709496
>>105707339
>Let me hire these openAI employees for a few millions/y
>Our dataset? ScaleAI slop saar
Anonymous
6/26/2025, 12:54:13 PM No.105709526
>>105708145
>As a French
If I ever need to hear about the state of Africa, I'll hit you up
Anonymous
6/26/2025, 12:59:08 PM No.105709571
>>105709331
Buy an ad.
Anonymous
6/26/2025, 1:09:07 PM No.105709646
>>105709389
It is the jump like I said. I am in LLM's since llama-2 and following the incremental updates as they come. Hard to appreciate current safe slop but all it takes is another uncensored model or some breakthrough where generalization gets much better (probably improved attention at high context). I mean people were shitting on CLIP if I remember correctly and that SD model still uses that but it is absolute magic?
Anonymous
6/26/2025, 1:13:06 PM No.105709670
Has anyone, ever, tried to program some game functionality in SillyTavern? Maybe dice rolls and simple stats, or tracking user's location via variables?
And how was it?
Anonymous
6/26/2025, 1:24:58 PM No.105709752
Although I don't root for meta, I do root for LLMs so I'm very happy about this news:
https://tech.slashdot.org/story/25/06/25/2127222/meta-beats-copyright-suit-from-authors-over-ai-training-on-books
fuck authors and artists may you lose every lawsuit and weep
Replies: >>105709761 >>105709766 >>105709774 >>105709779 >>105709916 >>105709963 >>105710426
Anonymous
6/26/2025, 1:26:23 PM No.105709761
>>105709752
>fuck authors and artists may you lose every lawsuit and weep
What compels a tard to spew out this kind of bollocks?
Anonymous
6/26/2025, 1:26:53 PM No.105709766
>>105709752
if you set a precedent, then you state "this is not a precedent", then is it a precedent?
Anonymous
6/26/2025, 1:28:14 PM No.105709774
>>105709752
So, is Llama 4.1 going to be good now?
Replies: >>105709900
Anonymous
6/26/2025, 1:28:39 PM No.105709779
>>105709752
bet this is related to the anthropic ruling since they are 1 day apart
get fucked faggots
Anonymous
6/26/2025, 1:31:59 PM No.105709812
17354364607295953_thumb.jpg
17354364607295953_thumb.jpg
md5: 24641c101e875e75f3543a406347906c🔍
>>105706893
>>105707076
>>105707377
>>105709140
>>105709219
Replies: >>105709820
Anonymous
6/26/2025, 1:33:23 PM No.105709820
>>105709812
Based.
Anonymous
6/26/2025, 1:34:22 PM No.105709825
>>105709140
A'm an aids researcher, that's 2 mroe leters than AI, gib a million money, lizardman
Anonymous
6/26/2025, 1:42:48 PM No.105709884
>>105704582 (OP)
This is a SFW board!
Replies: >>105710525
Anonymous
6/26/2025, 1:44:53 PM No.105709900
>>105709774
Maybe we'll finally get that true omni model trained on mostly unfiltered data like they promised originally
Replies: >>105710249
Anonymous
6/26/2025, 1:46:19 PM No.105709916
>>105709752
I wonder what the "organization for transformative works" thinks about this.
Anonymous
6/26/2025, 1:51:39 PM No.105709963
>>105709752
Copyright should be hard limited to 30 years.
Anonymous
6/26/2025, 1:55:37 PM No.105710001
>>105707541
I guess terminal only programming or non-interactive use cases.
All of the features they mention are supported by Roo/Cline so I don't see the value proposition of something less integrated either.
Not everyone uses VSCode, but even then it doesn't have a --watch-files option like aider does that allows it to be used with any IDE. Aider also supports non-interactive execution as well so I don't see any reason to use a vendor locked knock-off.
Anonymous
6/26/2025, 2:10:30 PM No.105710128
models for translation tasks?
Replies: >>105710180 >>105710231
Anonymous
6/26/2025, 2:15:50 PM No.105710180
>>105710128
The new model OpenAI is releasing will be a perfect translator of over 80 languages. It's releasing soon, so for now it's a good idea to just build whatever framework you'll use to get ready for it.
Anonymous
6/26/2025, 2:16:22 PM No.105710184
Is 2t/s good for 24b q8 model on 12gb card? 8k context, q8 kcache, 31/40 gpu offload.
Replies: >>105710351 >>105710561
Anonymous
6/26/2025, 2:22:35 PM No.105710230
>>105704582 (OP)
> ask for a body shot of this doll so I can copy design
> get this a few weeks later
Well at least now I understand the basic design. But I already did it a different way.
Replies: >>105710257
Anonymous
6/26/2025, 2:22:43 PM No.105710231
>>105710128
If you can run DeepSeek, it's the absolute best local will ever get, for any and all uses including translation. If you can't run DeepSeek, I'll list from least worst to worst in a size ranked fashion :
1/ Gemma 3 27B
2/ Gemma 2 9B
3/ Qwen 3 4B with thinking disabled
Don't bother with the other gemma 3, the smaller ones are broken, 2 9B is better than 3 12b other than having a too tiny context window, which isn't a problem for batching translation.
As for the Qwen model, it is the smallest usable model for that sort of purpose. I only recommend the small one because if you can run the larger ones you might as well run Gemma, as the large Gemma models have significantly more world knowledge which is helpful for translating slang, video game terms etc. But at 4B Qwen is the only proper model, much better than the 4B gemma. And anything smaller than 4B might as well be useless.
I've extensively tested LLMs of all sizes for that usage because it's the topic I care about the most, and I even test the tiny ones just to see if we're getting close to the day of taking down the tower of babel with a model that can run on a phone. Gemma and Qwen are your best bet. Don't bother with m*stral.
Still it's all really bad compared to DeepSeek. If you experience DeepSeek you really won't want to run anything else. DS can translate fanfiction of obscure shit like random SCP inspired chinese webnovels with a level of quality that is just unreal.
Replies: >>105710289
Anonymous
6/26/2025, 2:24:39 PM No.105710249
>>105709900
You'll probably have a Llama 4.1 AVI-JEPA2 with image/video/audio in/out trained with safe data only.
Replies: >>105710284
Anonymous
6/26/2025, 2:25:06 PM No.105710257
>>105710230
More here: https://www.instagram.com/boyi_1210/p/DK9M6-9u2EV/?img_index=1
Replies: >>105710997
Anonymous
6/26/2025, 2:29:34 PM No.105710284
>>105710249
>trained with safe data only.
They basically just got the ok to use all the data they torrented for training.
Replies: >>105710311 >>105710322
Anonymous
6/26/2025, 2:30:03 PM No.105710289
>>105710231
Pretty interesting. I always thought parameter size directly contributes to quality.
Replies: >>105710353
Anonymous
6/26/2025, 2:30:22 PM No.105710291
1659756116105920
1659756116105920
md5: 84acd95ef72b53da3050a521a9fd9c6a🔍
>>105706893
>>105707076
>>105707377
>>105709140
>>105709219
Replies: >>105710366 >>105710378
Anonymous
6/26/2025, 2:32:53 PM No.105710311
llama-3-dataset-quality2
llama-3-dataset-quality2
md5: 907cde1fad1ddb26b5df195d3892d34f🔍
>>105710284
They could, but they won't.
Anonymous
6/26/2025, 2:34:17 PM No.105710322
>>105710284
safe also means no toxicity (no no words) no inappropriate content (nsfw) "in order to mitigate harm"
Anonymous
6/26/2025, 2:39:24 PM No.105710351
>>105710184
No.
What model?
Replies: >>105710379 >>105710403
Anonymous
6/26/2025, 2:39:32 PM No.105710353
>>105710289
>Pretty interesting. I always thought parameter size directly contributes to quality.
It does, but it's not like you can't botch the training of a model. Besides being worse at translation, the newer smaller gemmas are also slopmaxxed even more than 2, and it's not like 2 was free of slop. 3 12B is a very disappointing model if you've experienced 2 9B.
In the case of the big qwen model, I think they just don't train enough on more general knowledge and niche topic and have too much math in their datasets. They're not bad models and they can have their uses, but technically even the biggest Qwen 3 is not a better model at translation than Gemma 3 27B because it simply doesn't know enough about the world to compare.
Anonymous
6/26/2025, 2:40:21 PM No.105710359
>>105709325
>7 t/s @ 4x GPU
Anonymous
6/26/2025, 2:41:09 PM No.105710366
>>105710291
Nice false flag faggot.
Anonymous
6/26/2025, 2:42:11 PM No.105710378
>>105710291
Bro thinks he's being edgy
Replies: >>105710416 >>105710451
Anonymous
6/26/2025, 2:42:31 PM No.105710379
>>105710351
Devstral, vulkan api, lm studio, linux, intel arc b580.
Anonymous
6/26/2025, 2:45:33 PM No.105710403
>>105710351
He has too much offloaded into RAM. That's as good as he's likely to get.
Replies: >>105710464
Anonymous
6/26/2025, 2:46:35 PM No.105710411
Is there a foss chatbot app on android?
Replies: >>105710440
Anonymous
6/26/2025, 2:47:29 PM No.105710416
>>105710378
>discord chat / tiktok speak
Kill yourself nigger.
Anonymous
6/26/2025, 2:48:29 PM No.105710426
>>105709752
Not reading that. Does this mean that any company in the US is now free to just simply train on any shit they want without worry of lawsuits?
Replies: >>105710484
Anonymous
6/26/2025, 2:51:01 PM No.105710440
>>105710411
https://github.com/alibaba/MNN
Anonymous
6/26/2025, 2:51:28 PM No.105710442
qrd on the schizo?
Anonymous
6/26/2025, 2:53:05 PM No.105710451
>>105710378
given that sois and women require content like that to be deleted because its too extreme for them, it just proves him right and you also a weak fag
Anonymous
6/26/2025, 2:55:24 PM No.105710464
>>105710403
7.8t/s with gemma 3 12b q3 48/48 offload, 4k context and q8 kcache, but it eats 10gb of vram.
Replies: >>105711230
Anonymous
6/26/2025, 2:55:25 PM No.105710465
This is fake right?
https://jerryliang24.github.io/DnD/
Like it's either outright bullshit or there is some major drawback? I'm too retarded to understand their explanation.
Replies: >>105710543
Anonymous
6/26/2025, 2:57:49 PM No.105710482
ComfyUI_02362__5a0241_thumb.jpg
ComfyUI_02362__5a0241_thumb.jpg
md5: 29547e8bbec27d78327607f30ebdd693🔍
Anyone find a cheap source for the SXM2 PCIe blower card adapters for V100 32GB modules? Surely they can be found somewhere for less than $300, right?
V100 32GB SXM2 is down around $500 now. Needs to be cheaper still though.
Anonymous
6/26/2025, 2:57:52 PM No.105710484
meta-torrenting
meta-torrenting
md5: 9bcd8f5ca313b6e24138fd052255c92d🔍
>>105710426
Training is "fair use", but pirating/torrenting/storing the books is not, so they'll now try to attack them on that side.

Also seen for Anthropic in the past few days:
https://www.wired.com/story/anthropic-ai-copyright-fair-use-piracy-ruling/
>Anthropic Scores a Landmark AI Copyright Win—but Will Face Trial Over Piracy Claims
Replies: >>105711013
Anonymous
6/26/2025, 3:03:45 PM No.105710525
>>105709884
He is just preparing to dress her up and put some makeup on her. It is not romantic.
Replies: >>105710529
Anonymous
6/26/2025, 3:04:13 PM No.105710529
>>105710525
kek
Anonymous
6/26/2025, 3:05:56 PM No.105710543
>>105710465
Sounds like it's a LoRA generator, SakanaAI released something like that a few days ago and it looked like misleading bullshit to me.
Anonymous
6/26/2025, 3:06:07 PM No.105710545
I find it interesting that there is a consensus here that unsafe models would be the best. But this thread is a huge safespace for troons where they can spam their AGP mascot. Don't mikutroons value safety in their models?
Anonymous
6/26/2025, 3:08:52 PM No.105710561
>>105710184
>31/40 gpu offload
I can barely offload 15 layers go my 12gb GPU here.
With that I get roughly 3 t/s.
It sounds like you are bottlenecking yourself by offloading too many layers.
Replies: >>105711007 >>105711230
Anonymous
6/26/2025, 3:10:00 PM No.105710568
Bro thinks someone will reply to his weak bait
Replies: >>105710574 >>105710576 >>105710607
Anonymous
6/26/2025, 3:11:06 PM No.105710574
>>105710568
You did. And you conceded that your troony ass is a hypocritical as always.
Anonymous
6/26/2025, 3:11:11 PM No.105710576
>>105710568
Someone will, at the very least a bot.
Anonymous
6/26/2025, 3:13:08 PM No.105710586
notice how no matter how much you insult each other your rent doesn't go down
Replies: >>105710609
Anonymous
6/26/2025, 3:14:28 PM No.105710602
that sounded better in your head
Anonymous
6/26/2025, 3:14:59 PM No.105710606
no it didn't
Anonymous
6/26/2025, 3:14:59 PM No.105710607
>>105710568
Don't be mad little bwo. Go play with your dolls.
Anonymous
6/26/2025, 3:15:27 PM No.105710609
>>105710586
I'm not from the US so I actually own my house.
Replies: >>105710659 >>105710679 >>105710779
Anonymous
6/26/2025, 3:22:31 PM No.105710651
where do you draw the line for good enough? I upgraded my 100b+ models from Q5 to Q6 recently and noticed zero difference.
Replies: >>105710680 >>105710815
Anonymous
6/26/2025, 3:23:45 PM No.105710659
>>105710609
enjoy your massive property taxes
Replies: >>105710770
Anonymous
6/26/2025, 3:26:35 PM No.105710679
>>105710609
I'm from the USA and I bought my own house.
Anonymous
6/26/2025, 3:26:35 PM No.105710680
>>105710651
I say I would only do it for "thinking" and coding models since higher quants can save some "thinking" tokens.
Anonymous
6/26/2025, 3:29:23 PM No.105710704
Capture-15
Capture-15
md5: 73b2031358ed56c49709fd21ce1c5023🔍
F
Replies: >>105710768
Anonymous
6/26/2025, 3:35:28 PM No.105710759
>>105708512
those guys actually seem pretty good so maybe a rare W for zuck
I doubt it will be enough to turn around things at meta but still
Anonymous
6/26/2025, 3:36:42 PM No.105710768
>>105710704
Local?
Replies: >>105710796 >>105710845 >>105710941
Anonymous
6/26/2025, 3:36:51 PM No.105710770
>>105710659
nta, but that's the weirdest cope ever. Between my 4 properties I paid about $6k in property taxes for the year. I'll make it back in a month.
Also: local models?
Replies: >>105710799 >>105710845 >>105710941
Anonymous
6/26/2025, 3:37:25 PM No.105710779
>>105710609
I'm from the US and I own your house.
Anonymous
6/26/2025, 3:39:15 PM No.105710795
makima23
makima23
md5: a5e65183245c4bac47bcd4a3d200acdf🔍
Are there any nice pure, blackbox benchmemes? Like
>Yeah we won't tell you how the hell we're benching the models, but here are the numbers.
I'd imagine someone would have to be really reputable in the industry already to make this, but it could be interesting.
Replies: >>105710813 >>105710878
Anonymous
6/26/2025, 3:39:29 PM No.105710796
>>105710768
local: mikudolls
API: all models I run
Anonymous
6/26/2025, 3:40:29 PM No.105710799
>>105710770
>Also: local models?
Property taxes are related to local models in the same way hatsune miku is related to local models.
Anonymous
6/26/2025, 3:42:05 PM No.105710813
>>105710795
anyone who uses llms has a private stash of tasks that no current model can complete adequately. The best benches are the ones you can personally evaluate the output and quality of and never leak to the public.
Anonymous
6/26/2025, 3:42:09 PM No.105710815
>>105710651
The quality degradation with quantization depends on the model's training tokens per parameter.
Anonymous
6/26/2025, 3:48:22 PM No.105710845
>>105710768
>>105710770
suddenly troons are so concerned about discussing local models and nothing else lol lmao even
Anonymous
6/26/2025, 3:50:08 PM No.105710857
If you don't own your house you shouldn't be spending thousands on AI hardware.
Replies: >>105710887 >>105710896
Anonymous
6/26/2025, 3:52:47 PM No.105710878
>>105710795
https://oobabooga.github.io/benchmark.html
Replies: >>105710896
Anonymous
6/26/2025, 3:53:45 PM No.105710887
>>105710857
I can afford AI hardware, but I will never be able to afford a house.
Replies: >>105710903 >>105710904
Anonymous
6/26/2025, 3:54:25 PM No.105710896
>>105710857
I own an apartment, does that count?
But yes, priorities.

>>105710878
That benchmark is so fucking funny.
Anonymous
6/26/2025, 3:55:05 PM No.105710903
>>105710887
Then you shouldn't be using be buying either.
Anonymous
6/26/2025, 3:55:17 PM No.105710904
>>105710887
>never
move somewhere with a better pay-to-housing-cost ratio and get established before trying to live in a desirable metro
Anonymous
6/26/2025, 4:00:08 PM No.105710941
>>105710768
>>105710770
These >105706893 >105707076 >105707377 >105709140 are not local either
Anonymous
6/26/2025, 4:05:04 PM No.105710958
1743599655247830
1743599655247830
md5: 8180935d489d3ed0382a1d8e7f8b2635🔍
The yesterdays leaked tencent model files screenshotted by https://x.com/Presidentlin/status/1937846368464241055

But it doesn't seem like anyone downloaded the files sadly.
Replies: >>105710966 >>105710992 >>105711229
Anonymous
6/26/2025, 4:06:21 PM No.105710966
>>105710958
maybe some chink did
Anonymous
6/26/2025, 4:08:43 PM No.105710992
>>105710958
>chink slop
Nothing of value there.
Replies: >>105711009 >>105711109
Anonymous
6/26/2025, 4:09:06 PM No.105710997
>>105710257
Thanks; that's what I needed.
... Oh, so these have wire inside for posing. That make a lot more sense for posing, not as much for a kids toy.
That body's made from a sort of stretchy, low pile plush that I can't get my hands on locally, instead I'm doing a rag doll design in a stiff poplin. Body shape on those is chibi (teardrop) and arms/legs/body are all one piece and expect would be loosely stuffed. Those heads have a squared off jaw... I did round but remaking the head shape would be a simple change.
Anonymous
6/26/2025, 4:10:23 PM No.105711007
>>105710561
What do you mean? Isn't the more layers on gpu the better?
Replies: >>105711087
Anonymous
6/26/2025, 4:10:31 PM No.105711009
>>105710992
If it is gonna be uncensored I want to believe I will be free from this place forever.
Anonymous
6/26/2025, 4:10:41 PM No.105711013
>>105710484
The Anthropic ruling sounded nonsensical. "You can train on it but you can't store it in a central repository to be trained upon"? Like what the fuck does that even imply, they're going to end up creating dangerous and esoteric side effects for ordinary consumers.
Anonymous
6/26/2025, 4:13:24 PM No.105711037
oaios-good
oaios-good
md5: eeeb334b449c24a498325627b92d0bf1🔍
https://x.com/aidan_mclau/status/1937970557980725397/history
The upcoming actually-open OpenAI model will save local?
Replies: >>105711068 >>105711075 >>105711084 >>105711105 >>105711129 >>105711208
Anonymous
6/26/2025, 4:17:18 PM No.105711068
>>105711037
There's no way it would be that good. That would be too unsafe.
Replies: >>105711106
Anonymous
6/26/2025, 4:17:43 PM No.105711075
>>105711037
jaw-dropping safety in your hands - literally, since this 0.5B can run on any smartphone!
Anonymous
6/26/2025, 4:17:58 PM No.105711078
1727247321363131_thumb.jpg
1727247321363131_thumb.jpg
md5: 62c56b32237030186637d4de43ae1a1a🔍
>>105706602
Anonymous
6/26/2025, 4:18:18 PM No.105711084
>>105711037
anything they say is meaningless until the weights are out and oai isn't exactly known to be honest or train models that aren't absolutely lobotomized anyways
Anonymous
6/26/2025, 4:18:46 PM No.105711087
>>105711007
>Isn't the more layers on gpu the better?
it it until you hit the point where the nvidia driver offloads to ram itself, which it does very poorly compared to having lcpp do it
Replies: >>105711112
Anonymous
6/26/2025, 4:20:29 PM No.105711105
>>105711037
if my boi aiden hype im hype lfgoooooo
Anonymous
6/26/2025, 4:20:32 PM No.105711106
>>105711068
I can't even and won't even.
Anonymous
6/26/2025, 4:21:15 PM No.105711109
1720461694621177_thumb.jpg
1720461694621177_thumb.jpg
md5: 7c6f04e3eefaa0ddf8ac5e75773bafbc🔍
>>105710992
>top open source langauge model, deepseek: chinese
>second best smaller open source language model, qwen: chinese
>top open source video gen model, wan 2.1: chinese
>second best open source video gen model, hunyuan: chinese
>top open source 3d gen model, hunyuan 3d 2.1: chinese
>top depth estimation model, lotus: chinese
>top lip sync video model, MultiTalk: chinese
>most names on most papers published by any company outside of china: chinese
Lol.

I already got myself a bugwaifu to prepare for the Chinese millenium btw.
Replies: >>105711553 >>105712571
Anonymous
6/26/2025, 4:21:17 PM No.105711111
none of the big US corpos are going to release anything "jaw dropping" that would be shooting themselves in the foot
same reason why Google can release something like Gemma 3 27B but they will never release the actual Gemini Flash or full Gemini Pro models, they throw you a few bones but something actually good? don't even think of it
DeepSeek can afford to do that because it's not their core business and China loves to throw money away if it means Americans are losing harder, the bucket of crabs mentality
Anonymous
6/26/2025, 4:21:18 PM No.105711112
>>105711087
>winblows
>winblows with bad driver settings
Replies: >>105711126 >>105711139 >>105711220
Anonymous
6/26/2025, 4:23:14 PM No.105711126
>>105711112
>unironically using troonix
how did your bottom surgery go?
Anonymous
6/26/2025, 4:23:33 PM No.105711129
>>105711037
holy f*ck cant wait for cutting edge yellow piss watermarks just like their image gen
Replies: >>105711145 >>105711231
Anonymous
6/26/2025, 4:25:13 PM No.105711139
>>105711112
Windows just werks.
Replies: >>105711147
Anonymous
6/26/2025, 4:25:40 PM No.105711145
>>105711129
changing the aesthetics of an image generator is the easiest thing ever as seen by the trillion of finetunes / lora merges of models on civitai
a local version of that image gen would be great because it's actually good at prompt following in a way nothing else is, the sepia sucks but if it was a local model you could do something about it
they won't release anything that good though
why would they?
Replies: >>105711215
Anonymous
6/26/2025, 4:25:47 PM No.105711147
>>105711139
Werks at like half the t/s lmao
Replies: >>105711155
Anonymous
6/26/2025, 4:26:41 PM No.105711155
>>105711147
maybe on your poor AYYYMD card
Anonymous
6/26/2025, 4:27:15 PM No.105711160
>>105704885
So just like women, got it
Anonymous
6/26/2025, 4:34:45 PM No.105711208
>>105711037
I am betting gemma level safety and qwen size to smarts ratio.

Or if they aren't lying and they made something exotic and OMG so awesome it is a 2B with 15-20B performance. That would make the most sense. A model dumber than anything they offer on API that is significantly smaller so you can say you can run it locally on a phone.
Anonymous
6/26/2025, 4:35:38 PM No.105711215
>>105711145
what i meant that they will most likely stuff their llm full of safety dogshit like that if it ever releases
but its not that easy, you can look at ponyv6 and how all gens have at least a slight sepia
Anonymous
6/26/2025, 4:35:51 PM No.105711220
>>105711112
Did the doll whisper that to you?
Anonymous
6/26/2025, 4:36:38 PM No.105711229
>>105710958
It was taken offline almost instantly.
Replies: >>105711244
Anonymous
6/26/2025, 4:36:39 PM No.105711230
>>105710561
>>105710464
With 9 layer for 12b gemma there is 1.9t/s and with cpu only (six cores 1650v4) there is 3t/s and only 7gb of ram used.
I think I had better results on 4gb rx570.
Anonymous
6/26/2025, 4:36:40 PM No.105711231
>>105711129
I'm still expecting their big innovation Sam mentioned to be safety related. Something deeply ingrained that can't be finetuned or weight orthogonalized away.
Replies: >>105711243 >>105711274 >>105711314
Anonymous
6/26/2025, 4:37:56 PM No.105711243
>>105711231
>safety
The virtue signaling word I hate the most.
Anonymous
6/26/2025, 4:38:03 PM No.105711244
>>105711229
definitely was up for a few minutes
Anonymous
6/26/2025, 4:41:52 PM No.105711274
>>105711231
>Something deeply ingrained that can't be finetuned or weight orthogonalized away.
Now word it in a way that makes the acronym SAFETY
Replies: >>105711298 >>105711310
Anonymous
6/26/2025, 4:45:44 PM No.105711298
>>105711274
Structural
Architectural
Features
Embedded
Thoroughly
Yet immutable
Let me know if you'd like a more technical or poetic version!
Anonymous
6/26/2025, 4:46:54 PM No.105711310
>>105711274
"Strongly Anchored Foundational Elements That Yield Stability"

Mistral small on 3rd reroll.
Anonymous
6/26/2025, 4:47:27 PM No.105711314
>>105711231
Very possible if they rewrite their all of their pre- and post-training data so that it complies with their ideology and usage guidelines.
Anonymous
6/26/2025, 4:48:02 PM No.105711318
When you can run deepseek it becomes hard to get excited for new local models that are not deepseek.
Replies: >>105711334
Anonymous
6/26/2025, 4:49:35 PM No.105711334
>>105711318
I WAS excited for new DeepSeek, but them taking so long is either a really good sign or a really really bad sign.
Replies: >>105711355 >>105711397 >>105712067
Anonymous
6/26/2025, 4:50:56 PM No.105711355
>>105711334
Maybe they created THE AI gf simulator everyone wants and they all went on a vacation to consume it.
Anonymous
6/26/2025, 4:55:51 PM No.105711397
>>105711334
They hit the wall just like everyone else. It's over for AI.
Anonymous
6/26/2025, 5:09:22 PM No.105711522
oai open source does unironically deliver (see whisper)
Replies: >>105711724
Anonymous
6/26/2025, 5:11:50 PM No.105711553
>>105711109
This, but unironically.
Anonymous
6/26/2025, 5:12:08 PM No.105711556
cydonia_v4d
cydonia_v4d
md5: ecf32f104b142c9ea84def1131b98e98🔍
I don't if I'm talking out of my ass, since I haven't done long convos with these joke/scenario cards before, but 3.2 (tune) seems pretty smart outside the "Nala pins you down, ahh ahh mistress" scenarios.
Replies: >>105711723
Anonymous
6/26/2025, 5:30:07 PM No.105711723
>>105711556
3.2 is pretty great for a peasant like me. I've been testing my Forgotten Realms adventure generator and seems like it's making good stuff.
I have one companion character plus randomly generated quest location/origin/type along with world book filled with locations and some from Forgotten Realms D&D setting.
Still learning SillyTavern though.
Biggest challenge is to keep it simple. Just because it's LLM it doesn't mean every ST data entry should be a word salad written by some failed novelist.
Anonymous
6/26/2025, 5:30:10 PM No.105711724
>>105711522
The training data they used for whisper is lazy and had no effort put into cleaning it. The new models hallucinate out the ass during moments of silence. If they release their OpenGPT the same way, every response will end shilling some literotica url.
Anonymous
6/26/2025, 5:44:52 PM No.105711851
>>105705231
/lmg/ what's a good model?
Replies: >>105711860 >>105711891 >>105711895 >>105711916 >>105711928 >>105711935
Anonymous
6/26/2025, 5:45:22 PM No.105711860
>>105711851
nemo
Anonymous
6/26/2025, 5:48:23 PM No.105711891
>>105711851
gemma 2/3 27b
mistral nemo
mistral small
deepseek if you can somehow run it
qwen in a pinch (the other models' isms are clogging your frontal lobe)
Replies: >>105711938
Anonymous
6/26/2025, 5:48:59 PM No.105711895
>>105711851
there are none, let's kill ourselves
Replies: >>105711919
Anonymous
6/26/2025, 5:51:35 PM No.105711916
>>105711851
Deep Seek 671B or Nvidia Nemo models.
Replies: >>105711938
Anonymous
6/26/2025, 5:52:01 PM No.105711919
>>105711895
You first
Anonymous
6/26/2025, 5:53:27 PM No.105711928
>>105711851
https://huggingface.co/TheDrummer/Anubis-70B-v1.1
Anonymous
6/26/2025, 5:54:17 PM No.105711935
>>105711851
https://huggingface.co/Sao10K/L3-8B-Stheno-v3.2 of course.
Anonymous
6/26/2025, 5:54:41 PM No.105711938
>>105711891
>>105711916
its been exactly 1 year since it released, why is nemo still being recommended? has fucking nothing better came out since then?
Replies: >>105711967 >>105711973 >>105712331
Anonymous
6/26/2025, 5:54:54 PM No.105711941
You don't want R2. It'll be so big even cpumaxxers won't be able to run it.
Replies: >>105711954 >>105711974 >>105712091
Anonymous
6/26/2025, 5:56:38 PM No.105711954
>>105711941
Huawei will save us
Anonymous
6/26/2025, 5:57:43 PM No.105711967
>>105711938
Smarter? Yeah, definitely. "Unsafe"? Unthinkable.
Anonymous
6/26/2025, 5:58:32 PM No.105711973
>>105711938
For a generalist coomer model in that weight class that's not completely retarded?
Not really.
Replies: >>105711996
Anonymous
6/26/2025, 5:58:47 PM No.105711974
>>105711941
I want R5 and I want 2T VRAM from Zhonguo
Anonymous
6/26/2025, 6:00:42 PM No.105711994
Where is the real /lmg/ thread?
Why is it missing? Why are people using this tranny troll thread?
Replies: >>105712005 >>105712046
Anonymous
6/26/2025, 6:00:55 PM No.105711996
>>105711973
>generalist coomer model
the post was for a good model
Anonymous
6/26/2025, 6:02:19 PM No.105712005
>>105711994
Spoken like a true newfaggot
Replies: >>105712022
Anonymous
6/26/2025, 6:04:24 PM No.105712022
1720007892685733
1720007892685733
md5: 337ff93687ec77d7438aa5ebdc40664a🔍
>>105712005
>kvetching intensifies
>oy vey, shut it down
Replies: >>105712041
Anonymous
6/26/2025, 6:05:49 PM No.105712041
>>105712022
>its DA JOOOOZ
Go back >>>/pol/
Replies: >>105712068 >>105712099 >>105712117
Anonymous
6/26/2025, 6:06:42 PM No.105712046
>>105711994
The tranny jew cries out as he strikes you.
Anonymous
6/26/2025, 6:10:18 PM No.105712067
>>105711334
>but them taking so long is either a really good sign or a really really bad sign
It's not like DS is the only one doing incremental improvements rather than wholly new things.
OAI shows no signs of releasing an actual GPT 5. I don't really care for those oSomething models that give you an answer after you died of starvation in front of the screen.
The new R1, while incremental, has had enough changes that it feels fresh and is a bretty good model too. think blocks are more like what Gemini showed before google started to hide the CoT.
Replies: >>105712092
Anonymous
6/26/2025, 6:10:19 PM No.105712068
1740827944432810
1740827944432810
md5: 1bb987451a194c84686a70bfd01cdb05🔍
>>105712041
>speak of monsters and the jews shows up
Anonymous
6/26/2025, 6:13:53 PM No.105712091
>>105711941
SSDLET COPE
Anonymous
6/26/2025, 6:13:54 PM No.105712092
>>105712067
That they only released an updated R1, trained on Gemini outputs instead of GPT outputs, is exactly why I'm concerned. Zero changes on their part.
Replies: >>105712102
Anonymous
6/26/2025, 6:15:32 PM No.105712099
>>105712041
(((you)))
Anonymous
6/26/2025, 6:15:50 PM No.105712102
>>105712092
Please understand they are a small indie team working out of a garage
Anonymous
6/26/2025, 6:17:10 PM No.105712111
>>105712100
>>105712100
>>105712100
Anonymous
6/26/2025, 6:17:52 PM No.105712117
>>105712041
>commit genocide live in front of the world and brag about it
>like literally 21 months of jews posting mutilated Palestinian children and laughing about it all over the internet
>nearly start world war 3- basically last straw for normies too
>NOOO WE'RE BEYOND CRITICISM
Yeah. No.
Things are different now.
You're not welcome here.
Anonymous
6/26/2025, 6:24:29 PM No.105712164
Can I feed a local llm a transcript for a 25 min lecture and have it summarize the information accurately in text that takes maybe about 5 min to read? Would the context be large enough/the input have enough tokens? Pardon my ignorance, I don't know much about AI/llms.
Replies: >>105712193
Anonymous
6/26/2025, 6:28:04 PM No.105712191
[OOC: If you wish to continue this discussion in another thread, just tell me.]
Anonymous
6/26/2025, 6:28:30 PM No.105712193
>>105712164
Many of the newer models have enough context that it could handle this amount of tokens, but that's just the architectural POV, in practice, most models suck at long context even if they were trained for it
what you want is possible and works well with Gemini, so I recommend you give it a try
I wouldn't even bother with other models for that purpose
Replies: >>105712317
Anonymous
6/26/2025, 6:45:24 PM No.105712317
>>105712193
Thank you anon, I'll start with Gemini.
Anonymous
6/26/2025, 6:46:43 PM No.105712331
>>105711938
It's still arguably the best 12B model. If you don't have enough (V)RAM for 24B or larger then you're fresh out of options.
Replies: >>105712352
Anonymous
6/26/2025, 6:49:06 PM No.105712352
>>105712331
Gemma 3 is way way better
Anonymous
6/26/2025, 7:10:21 PM No.105712571
DipsyExclamation
DipsyExclamation
md5: 0ddf7ac8acad038f7a7e67e20377bab8🔍
>>105711109
I will never cease to be amused by DeepSeek blowing tf out of OAI et al with a model generated in their spare time, with relative pocket change, under sancion, and as a hobby project for an investment fund.
Anonymous
6/26/2025, 7:41:03 PM No.105712869
5262254
5262254
md5: 9c607e79ec38122d793e3d96049f02d3🔍
>>105708512
Zuck just keeps poaching them. Is this what Sam foresaw when he said OpenAi was going to release a local model?