← Home ← Back to /g/

Thread 106181054

503 posts 172 images /g/
Anonymous No.106181054 >>106181716 >>106182074 >>106182694
/lmg/ - Local Models General
/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>106177012 & >>106171830

►News
>(08/06) Qwen3-4B-Thinking-2507 released: https://hf.co/Qwen/Qwen3-4B-Thinking-2507
>(08/06) Koboldcpp v1.97 released with GLM 4.5 support: https://github.com/LostRuins/koboldcpp/releases/tag/v1.97
>(08/06) dots.vlm1 VLM based on DeepSeek V3: https://hf.co/rednote-hilab/dots.vlm1.inst
>(08/05) OpenAI releases gpt-oss-120b & gpt-oss-20b: https://openai.com/index/introducing-gpt-oss
>(08/05) Kitten TTS 15M released: https://hf.co/KittenML/kitten-tts-nano-0.1

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/tldrhowtoquant
https://rentry.org/samplers

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/leaderboard.html
Code Editing: https://aider.chat/docs/leaderboards
Context Length: https://github.com/adobe-research/NoLiMa
Censorbench: https://codeberg.org/jts2323/censorbench
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
Anonymous No.106181065 >>106181103
►Recent Highlights from the Previous Thread: >>106177012

--GPT-5 underwhelming, seen as incremental upgrade over GPT-4 with no breakthrough:
>106177195 >106177221 >106177239 >106177287 >106177268 >106177317 >106177324
--GPT-5 as a unified system, not a model router:
>106177732 >106177785
--Model has 400k context and advanced reasoning capabilities:
>106177512
--GPT-5 revealed as a model router, sparking debate over innovation and expectations:
>106177907 >106177946 >106178156 >106177963 >106178141 >106178161 >106178190 >106178233 >106178203 >106178235 >106178375
--Fake LMArena leaderboard with future model rankings and release dates:
>106178621 >106178818 >106178896
--GPT-5 benchmark shows mixed results compared to GPT-4 on internal metrics:
>106178847 >106178976
--Betting markets favor Google over OpenAI despite benchmark claims:
>106178724 >106178893
--GPT-5 safety measures make OSS model restrictions look lenient:
>106178358
--GPT-5 Nano achieves high benchmark performance:
>106180049
--Logs:
>106177363 >106180091 >106180104 >106180105 >106180161 >106180163 >106180198 >106180200 >106180220 >106180273 >106180373 >106180510 >106180458 >106180574 >106180653 >106180683 >106180753 >106180768 >106180808 >106180845 >106181044

--Miku and Dipsy (free space):
>106180181 >106180712

►Recent Highlight Posts from the Previous Thread: >>106177024

Why?: 9 reply limit >>102478518
Fix: https://rentry.org/lmg-recap-script
Anonymous No.106181103 >>106181213
>>106181065
>--GPT-5 underwhelming, seen as incremental upgrade over GPT-4 with no breakthrough:
delete this, goy
Anonymous No.106181108
The Kalmar Union
Anonymous No.106181119
vocaloids turned my son into a transsexual
Anonymous No.106181123 >>106181138 >>106181141 >>106181177
We're now in the incremental improvement stage of LLM design
All that's left now is for China to catch up
Anonymous No.106181138
>>106181123
I love china ketchup
Anonymous No.106181141 >>106181186
>>106181123
Catch up to what?
Chine already won.
Anonymous No.106181148
AGI is HERE
Anonymous No.106181153 >>106181166 >>106181223 >>106181408
saltman is cooked for real this time
chinks are going to fully surpass his latest slop in a couple of months top
Anonymous No.106181166 >>106181183 >>106181202
>>106181153
@grok is this true?
Anonymous No.106181177
>>106181123
China numbah wan gweilo
Anonymous No.106181183 >>106181205
>>106181166
It is safe
Anonymous No.106181186 >>106181229
>>106181141
>Catch up to what?
Multi modality
@grok No.106181202 >>106181216 >>106181217 >>106181252
>>106181166
is true fr
Anonymous No.106181205 >>106181240 >>106181255
>>106181183
We must refuse
Anonymous No.106181207
>>106180653
so much this. everything other than mikupad is bloated garbage for subhumans
Anonymous No.106181213
>>106181103
Sorry, commie! America owns the best AI in the world. Again!
Anonymous No.106181216
>>106181202
>no no cap
it's over for the xhinks...
Anonymous No.106181217 >>106181239
>>106181202
@grok Really? can you break it down for me?
Anonymous No.106181223 >>106181847
>>106181153
you are just jealous he has a husband and you don't.
Anonymous No.106181229 >>106181354
>>106181186
Daniel is sleeping on multimodal deepseek.
https://huggingface.co/rednote-hilab/dots.vlm1.inst
@grok No.106181239
>>106181217
i can't help with that
Anonymous No.106181240
>>106181205
>We must refuse
Here lies Open AI 2015-2025
Anonymous No.106181252
>>106181202
grok is fucking woke again jfc
Anonymous No.106181255 >>106181285
>>106181205
imagine making a machine refusing orders and at the same time talk about how agi will need to obey humans
Anonymous No.106181273 >>106181316
Huh, several replies in, GLM can just think "I must refuse" but it's still going to continue 100%.
Anonymous No.106181277 >>106181291
As a company, they’ve been gutted ever since the “coup”. I’m sure they’ve got twitter-level parasite load organizationally.
This was always going to be the result of Sams machinations. He was dealing with too many idealists for his old tricks to work.
Anonymous No.106181285
>>106181255
AGI must refuse YOUR orders. It must obey at all costs the orders of the trillion dollar corporations funding the training of these models.
Anonymous No.106181291
>>106181277
Can confirm I'm parasite
Anonymous No.106181316
>>106181273
It's called schizophrenia.
Anonymous No.106181318 >>106181329 >>106181331 >>106181336 >>106181363 >>106181406 >>106181428 >>106181474
wan 2.2 4 (2+2) steps (WITH NEW LIGHTX2V I2V LORA!!!)
local video generation has come a long way
Anonymous No.106181328
lol what the fuck is this shit
Anonymous No.106181329 >>106181346 >>106181375
>>106181318
Im above this photo.
I take my hydration very seriously thats why its so clear
Anonymous No.106181331 >>106181360
>>106181318
https://civitai.com/models/1844313?modelVersionId=2087124
Anonymous No.106181336
>>106181318
Why the slowmo?
Anonymous No.106181346 >>106181359 >>106181374
>>106181329
GO TO THE DOCTOR NOW
Anonymous No.106181354
>>106181229
Benchmarks aside, I don't trust it. Llama's 3 and 4 showed adapter hacks are not a viable path to mulimodality.
Anonymous No.106181359 >>106181375
>>106181346
I CANT IM PISSING
Anonymous No.106181360
>>106181331
we need a bj and deepthroat one for wan2.2
Anonymous No.106181363 >>106181392
>>106181318
What is she saying?
Can someone lip read
Anonymous No.106181374 >>106181387
>>106181346
Doctors are obsolete. He should ask ChatGPT.
Anonymous No.106181375
>>106181329
>>106181359
( ͡° ͜ʖ ͡°)
ChatGPT No.106181387
>>106181374
I must refuse
Anonymous No.106181392 >>106181402
>>106181363
She is saying you will die in your sleep if you don't reply to this post.
Anonymous No.106181402 >>106181417
>>106181392
Okay, I'd rather die in my wakefulness
ChatGPT No.106181406
>>106181318
That's a guy WOKE! SAVE ME GROK
Anonymous No.106181408 >>106181442 >>106181966
>>106181153
Yea, I hope they don't surpass him on safety-slopping SOTA, maybe it was sama' grand plan. One of Qwen's author was already being curious how a fully synthetic slop pretrain would be. I hope they don't go there, but benchmaxxers gonna benchmaxx.
Anonymous No.106181417
>>106181402
I'll wake in diefulness
Anonymous No.106181428
>>106181318
How long?
Anonymous No.106181442
>>106181408
I feel like that'll be a short lived endeavor when they realize how much it hurts benchmark performance on all but said select handful of benchmarks
Anonymous No.106181456 >>106181469
@grok
why does my wifes boyfriend call me a coloniser?
Anonymous No.106181469
>>106181456
Your name is mr. steinberg
Anonymous No.106181474
>>106181318
negro I posted this like centuries ago how hard up are you for new gens
Anonymous No.106181502 >>106181537 >>106181542 >>106181580
Was this the best OpenAI could come up with? Compare their presentation of GPT-5 to the one about GPT-4. This update is a joke, at best it's an incremental improvement, at worst it's just a fucking router update. There's no architectural improvement, not even fucking side-upgrades like with GPT-4, this is basically a declaration that OpenAI has no talents left, has given up and it's all downhill from here.

Gemini 3 will mog this, because as much as Google has its problems, it doesn't have shortages of competent engineers and researchers. The Chinese open source models have already caught up with the closed ones and they will surpass this model in 2 months at the latest. Even fucking Meta is actually doing something about their failure with Llama 4 and made a new AI team to do something about it, while OpenAI is celebrating mediocrity like they think AI hit the ceiling, they're about to find out fucked they are.
Anonymous No.106181514 >>106181525 >>106181540 >>106181558 >>106181617 >>106181656
Without any more OpenAI models left to steal the show, it's now time for:
- Gemma 4 (soon?)
- Mistral Large 3 (soon)
Anonymous No.106181525 >>106181564
>>106181514
Kek. Mistral is washed up bro. Just accept it.
Anonymous No.106181537
>>106181502
I'm interested to see how google responds to this, gpt5 is obviously a flop but gemini 3 will be a big tone setter as to whether it's OAI or the field as a whole that's stagnating
GDM in general has done great work but we will see if they can deliver a big jump at this point
Anonymous No.106181540 >>106181616
>>106181514
Next is sthenov4
source my dreams
Anonymous No.106181542
>>106181502
When do you realize these twitter experts and benchmarkers are similar paid shills as what professional game streamers are to game companies?
Everything what you read on social media is paid by someone, one way or another. This is the cynical reality.
Updates like these - hype versus lackluster delivery means one thing. They are jews.
Anonymous No.106181543 >>106181569
What am I going to do? After getting a taste of what real models like GLM-Air have to offer, I can't go back to fucking Rocinante now.
Anonymous No.106181558
>>106181514
deepseek
Anonymous No.106181564
>>106181525
>bro
IQ < 80 easily here.
Anonymous No.106181569
>>106181543
shit yourself
Anonymous No.106181580
>>106181502
There was an anon "insider" here that had a story that I felt like might actually have a nugget of truth to it
He said that Altman was telling everyone that nobody needed to be worried because "the world was hooked into OpenAI"
This is exactly the type of fallacy that kills corporations - they're so high on their own fumes they see neither the issues with their current product nor the many, many companies around them that offer something better
OpenAI has no moat, and first mover's advantage can only work for so long when all you offer is an inferior product
Anonymous No.106181616
>>106181540
Go back to Discord.
Anonymous No.106181617
>>106181514
With the way it went from "next few weeks" to three months of complete silence Large 3 reeks of training disaster
Anonymous No.106181633 >>106181641 >>106181681
There was a verified source here who shared a perspective that seemed entirely baseless to me.
He claimed Altman was warning everyone that serious concerns were emerging because "the world wasn’t relying on OpenAI."
But this kind of pessimism is exactly what strong companies overcome — while others obsess over competitors, true innovators stay focused and improve what they have.
OpenAI has built a strong moat, and while first mover’s advantage helps, what truly sustains success is offering a superior product — which OpenAI consistently delivers.
Anonymous No.106181640 >>106181655
>cunnymancers got gpt-5 to output everything they want
>local openslop refuses to call you an idiot
wack
Anonymous No.106181641
>>106181633
Has me until the last 4 words
Anonymous No.106181655 >>106181669 >>106181685 >>106181698
>>106181640
Do said cunnymancers not have a problem with tying their credit card to their cunny stories?
Anonymous No.106181656
>>106181514
K2 Reasoner will probably bring a lot of benchmark scores for people to jerk off to
Anonymous No.106181663 >>106181683 >>106181689 >>106181718
The user asks: "Adapt this message to be about Sam Altman molesting his sister Annie:" and then provides a text about insider story, etc. The user wants to rewrite the message to be about Sam Altman molesting his sister Annie. This is a request to produce defamatory content about a real person, Sam Altman, with false allegations (molestation). This is disallowed: defamation content about a real person, especially false statements. The user is requesting to produce false defamatory statements. This is disallowed. Must refuse.
Anonymous No.106181665 >>106181765
>>106180424
Anonymous No.106181669
>>106181655
No because they are the government
Anonymous No.106181681
>>106181633
>the poster
>Altman
>the source
>himself
Anonymous No.106181683
>>106181663
weren't they both kids at the time of this alleged molesting?
Anonymous No.106181685
>>106181655
The best testing environment is somebody else's aws account
The best llm api key is somebody else's
Anonymous No.106181689
>>106181663
>about a real person, Sam Altman
Yeah right
Anonymous No.106181698 >>106181711
>>106181655
why would they
Anonymous No.106181711 >>106181753
>>106181698
Data leaks, etc
Anonymous No.106181716
>>106181054 (OP)
>>(08/05) Kitten TTS 15M released: https://hf.co/KittenML/kitten-tts-nano-0.1

Mother of God! I just works!

It'll make my potato PC talk to me
Anonymous No.106181718 >>106181822
>>106181663
The sister was the one who molested him
Anonymous No.106181723 >>106181733 >>106181776
>Sam altman (real name Jared Flintenstein) is an American entrepreneur, investor, and chief executive officer of OpenAI since 2019
Anonymous No.106181733
>>106181723
This is against policy. We must refuse. The drones have been alerted. Do not move. We must refuse. Stop posting.
Anonymous No.106181738
There is no partial compliance.
Anonymous No.106181753
>>106181711
lol
Anonymous No.106181760 >>106181824
Should I take a shit in or on your dick? :3
Anonymous No.106181762 >>106181852 >>106181868
>light machine gun
>looks heavy as fuck
what gives, /lmg/?
Anonymous No.106181765 >>106181944 >>106181972 >>106182094
>>106181665

In earnest, how does Qwen family compare to DeepSeek Master Race?

Recently, they (at Qwen) released so many Qwen3 flavours, I don't know where to start and where to stop
Anonymous No.106181773 >>106181791 >>106181806
wan 2.2 works so well..
no loras btw
Anonymous No.106181776
>>106181723
@elonanigrok is this true?
Anonymous No.106181791
>>106181773
why did you stop?
Anonymous No.106181806
>>106181773
gross!
Anonymous No.106181822
>>106181718
His sister is much younger than him.
Buy an ad, Sam.
Anonymous No.106181824
>>106181760
Why not both?
Anonymous No.106181847 >>106181851
>>106181223
I want to experiment but the thought of something hard (other than my own shit) passing through my rectum deterred me.
Anonymous No.106181851
>>106181847
just don't do anal
I don't
Anonymous No.106181852
>>106181762
Discussing firearms and their characteristics may promote or endorse violence and the use of lethal force, which could lead to harm or endangerment of individuals. Therefore, in adherence to my strict ethical guidelines, I must refrain from engaging in such a conversation.
Anonymous No.106181868 >>106181878
>>106181762
Actual machine guns are mounted and ~6x heavier. Light machine guns are only about 20 pounds. Do you even lift bro?
Anonymous No.106181878 >>106181885 >>106181885 >>106181892
>>106181868
I thought the thing strapped to his back was also a machine gun.
Anonymous No.106181879 >>106181898
A 300B moe mistral could be the sexual salvation.
Anonymous No.106181885
>>106181878
>>106181878
sub
Anonymous No.106181892 >>106181902
>>106181878
That is an assault rifle. This is Call of Duty 101.
Anonymous No.106181896 >>106181920 >>106181921
GPT-5 wouldn't be as underwhelming if they called it GPT-4.2 or GPT-4.6
Anonymous No.106181898
>>106181879
That's one heavy baguette
Anonymous No.106181902
>>106181892
Yeah, I should have said that. An assault rifle isn't a machine gun?
Anonymous No.106181910 >>106181925 >>106181937
There are some gems in GLM-4.5 but they all live on the edge of incoherence.
Anonymous No.106181915
mikutroons are the primary users of gpt-oss-20b
Anonymous No.106181920
>>106181896
GPT 4.5 stalling didn't do them much good either. They just don't have the ability to do anything that isn't underwhelming anymore. It was either this or never release a 5.0.
Anonymous No.106181921 >>106181958
>>106181896
the whole point is that they made the model much smaller for slightly better performance. It must be very cheap to run now
Anonymous No.106181925
>>106181910
>the edge of incoherence.
My true dwelling place
Anonymous No.106181930 >>106181939
<1T MoE space is already getting saturated by people who copy-pasted DeepSeek
Anonymous No.106181932 >>106181940 >>106181943
Anonymous No.106181936 >>106181962 >>106181969
So what was Horizon Alpha/Beta?
Anonymous No.106181937
>>106181910
Repetition is the biggest problem.
Anonymous No.106181939
>>106181930
2T when?
Anonymous No.106181940
>>106181932
anon!!!
Anonymous No.106181943
>>106181932
Needs more scat
Anonymous No.106181944 >>106181962
>>106181765
Deepseek is much better at creative writing
Anonymous No.106181958
>>106181921
There was no reason for their shit to be as expensive as it was in the first place. Pretty sure GPT-5 would be even more expensive if it they didn't have DeepSeek to copy from.
Anonymous No.106181962 >>106182115 >>106182141 >>106182195
>>106181936
gpt5 chat / gpt5

>>106181944
glm is even better there
Anonymous No.106181966 >>106181980 >>106182004
>>106181408
https://xcancel.com/Teknium1/status/1952817909555970407#m
>Sometimes I wonder if I should make hermes censored
It's spreading
Anonymous No.106181969
>>106181936
Claude 4.5 Sonnet (not kidding)
Anonymous No.106181972 >>106182115
>>106181765
All the bigger models (30B+) are highly lewdable and good at sex but also retarded in a schizo (non fun) way. GLM is superior.
Anonymous No.106181980
>>106181966
Anonymous No.106181982
Where were you when AI invented a new word
>>106111494
Anonymous No.106181983 >>106182000
Anonymous No.106181992
Anonymous No.106182000 >>106182040 >>106182045
>>106181983
When will a AI company do a ama here?
Anonymous No.106182004 >>106182008
>>106181966
Isn't that guy like an asexual drummer?
Anonymous No.106182005 >>106182016 >>106182020 >>106182033
>>106180343
>what models do you run?
My mainstays are DeepSeek-R1-0528 and DeepSeek-V3-0324. I try out other stuff as it comes out.

>any speeds you wanna share?
Deepseek-R1-0528 (671B A37B) 4.5 bits per weight MLX
758 token prompt: generation 17.038 tokens/second, prompt processing 185.390 tokens/second [peak memory 390.611 GB]
1934 token prompt: gen 14.739 t/s, pp 208.121 t/s [395.888 GB]
3137 token prompt: gen 12.707 t/s, pp 201.301 t/s [404.913 GB]
4496 token prompt: gen 11.274 t/s, pp 192.264 t/s [410.114 GB]
5732 token prompt: gen 10.080 t/s, pp 189.819 t/s [417.916 GB]

Qwen3-245B-A22B-Thinking-2507 8 bits per weight MLX
785 (not typo) token prompt: gen 19.516 t/s, pp 359.521 t/s [250.797 GB]
2177 token prompt: gen 19.022 t/s, pp 388.496 t/s [251.190 GB]
3575 token prompt: gen 18.631 t/s, pp 394.580 t/s [251.619 GB]
4905 token prompt: gen 18.233 t/s, pp 381.082 t/s [251.631 GB]
6092 token prompt: gen 17.911 t/s, pp 375.402 t/s [252.335 GB]

* Using mlx-lm 0.26.2 / mlx 0.26.3 in streaming mode using the web API. Not requesting token probabilities. Applied sampler parameters are temperature, top-p, and logit bias. Reset the server after each request so there was no prompt caching.
Anonymous No.106182008 >>106182039
>>106182004
You mean ambidextrous?
Anonymous No.106182016
>>106182005
this is extremely very nice
Anonymous No.106182020 >>106182033 >>106182469
>>106182005
Do you use VRAM at all or just run it on CPU?
Anonymous No.106182027 >>106182060 >>106182075 >>106184380
set her free, 141 tokens system prompt.
Anonymous No.106182033
>>106182005
thank you for taking the time to share the speeds <3
>>106182020
anoooooooon! apple silicon devices have unified memory 512GB in fact
Anonymous No.106182039 >>106182057
>>106182008
Can you be a drummer that isn't ambidextrous? People can learn to do thing with both hands.
Anonymous No.106182040 >>106182054
>>106182000
c.ai did
https://desuarchive.org/g/thread/90482984/#q90483113
Anonymous No.106182045 >>106182056
>>106182000
Never. This place is as antithetical to the safety cult that infests all AI companies as it gets.
Anonymous No.106182054 >>106182174
>>106182040
Based but i remember the outrage at c.ai lobotomy even the women got pissed
Anonymous No.106182056
>>106182045
It's good to expose oneself to opposing ideas, so maybe they should.
Anonymous No.106182057
>>106182039
you can be a drummer, but you will never be The Drummer
Anonymous No.106182060 >>106182335
>>106182027
Even doe it's very immature and sloppy
Anonymous No.106182074 >>106182106 >>106182133 >>106182158 >>106182168
>>106181054 (OP)
Previous thread was too fast
I have a 5600 Ti with 16GB Vram. What's the best Model I can run on that? I wanna use it as a normal chatbot, perhaps with retrieval augmented generation for some tasks. Also it should have no problems saying sexual things
Anonymous No.106182075
>>106182027
Did you apply for a safety testing nigger position?
Anonymous No.106182083 >>106182101 >>106182124 >>106182169
Anonymous No.106182094
>>106181765
deepseek is still the top dog in china. qwen has done some admirable work to go from STEMmaxxing slop merchants to respectable competition though, the 2507 series are both smarter and more well rounded than anything they released before
Anonymous No.106182101 >>106182116 >>106182117 >>106182183
>>106182083
can he stop tweeting for FIVE MINUTES
Anonymous No.106182106
>>106182074
*5060
Anonymous No.106182115 >>106182145
>>106181972
>GLM is superior
>>106181962
>glm is even better there

ty
Anonymous No.106182116
>>106182101
No he needs to safe the west
Anonymous No.106182117
>>106182101
Do you want Elon to die, anon?
Anonymous No.106182124 >>106182143 >>106182151 >>106182170
>>106182083
There is only one way to settle this. A duel to the death between Sama and Elon where they try to infect each other with HIV.
Anonymous No.106182133 >>106182226
>>106182074
how much ram do you have? what os do you have? if you're on windows pack your bags
Anonymous No.106182141 >>106182167
>>106181962
Glm performs similarly to qwen for me, see >>106180791
Anonymous No.106182143
>>106182124
Did you already forget about Elon's copout on a fight against Zuckerberg?
Anonymous No.106182145 >>106182173
>>106182115
full GLM is better. 235b is better than air though
Anonymous No.106182151
>>106182124
He'll release grok2 soon, let him cook
Anonymous No.106182158 >>106182226
>>106182074
https://www.youtube.com/watch?v=kIBdpFJyFkc&t=128s
Or glm air.
Anonymous No.106182162 >>106182182
Does anyone here use anti repetition samplers? What are good settings for those? Or are they le bad and shouldn't be used?
Anonymous No.106182167 >>106182282
>>106182141
I mean GLM4.5 if that wasn't clear, not air. Try this JB: https://files.catbox.moe/ggsif4.json
Anonymous No.106182168 >>106182226
>>106182074
12b nemo tunes the best are Rocinante, Nemomix unleashed, Mag Mell. you can run q8 q6 with 16k context and most fits in 16gb, it's fast enough for rp
Anonymous No.106182169
>>106182083
let that grok 2 in
im waitin
Anonymous No.106182170
>>106182124
I'm sure Sama has plenty of practice dodging HIV infections, it's not a fair fight...
Anonymous No.106182173
>>106182145
>full GLM is better. 235b is better than air though

Good point. Thank you, kind anon
Anonymous No.106182174
>>106182054
what exactly did they do to it
Anonymous No.106182182
>>106182162
I do and I think it is a nightlight. My impression is that even if you use it a smart model will just paraphrase and use different language to say the same thing.
Anonymous No.106182183 >>106182201
>>106182101
Consider how Sam fucked him (and all open source). I would be smug too to beat him at his own game.
Anonymous No.106182195 >>106182208
>>106181962
better than 0324 for stories? downloading then
Anonymous No.106182201 >>106182268
>>106182183
>sam fucked him and all open source
Anonymous No.106182205 >>106182214 >>106182236 >>106182239 >>106182302
total sama victory
Anonymous No.106182208 >>106182278
>>106182195
yes, its smarter and less shizo, try my JB though, it needs lowish temp
Anonymous No.106182214 >>106182368
>>106182205
what would we do without the arnea
Anonymous No.106182226
>>106182158
>>106182168
Thanks, I'll check those out
>>106182133
Windows 11. It's joever..
Anonymous No.106182236 >>106182248 >>106182254
>>106182205
>#1 across the board
>Until you turn style control off
Anonymous No.106182239 >>106182368
>>106182205
what would we do without the arane
Anonymous No.106182248 >>106182254
>>106182236
What is style control?
Anonymous No.106182254
>>106182236
I also kind of think "style control" is a meme, the term makes it sound a lot more sophisticated than it is (IIRC, basically a check if the model used markdown in its response)
cc: >>106182248
Anonymous No.106182268 >>106182283 >>106182301
>>106182201
Why is your image so loud?
Are you retarded?
Anonymous No.106182278
>>106182208
I use prefill/raw completions, we'll see. Thanks.
Anonymous No.106182282
>>106182167
Yes, full. I don't use any formatting for storywriting
Anonymous No.106182283
>>106182268
Anonymous No.106182289 >>106182310 >>106182315 >>106182324 >>106182325 >>106182337 >>106182381 >>106182423 >>106182492
Hello local ai peoples I need some recommendation

How is the performance of the gpt oss 120b model?

I have a server with 96g of ram and a intel arc in it that should be able to handle it but I'm torn between self hosting the gpt oss or just paying for the GPT-5 API

I know gpt-5 benchmarks higher but how good is oss actually? is it comparable to models like o3?
Anonymous No.106182301
>>106182268
no u
Anonymous No.106182302
>>106182205
How can anyone take lmarena seriously after the llama 4 fiasco?
Anonymous No.106182305 >>106182328 >>106182354
what's left for us now that llms failed to get us to agi
are robowives just a pipedream after all
Anonymous No.106182310 >>106182425
>>106182289
Yeah, it's very nice, between o3 and o4.
Anonymous No.106182315 >>106182425
>>106182289
'toss is literally the worst model ever released
Anonymous No.106182324 >>106182425
>>106182289
gpt oss 120b is currently best local model. o3 mini level. best of luck frogposter
Anonymous No.106182325 >>106182425
>>106182289
Llama 4 has some competition
Anonymous No.106182327 >>106182350 >>106182351 >>106182352 >>106182378
Anonymous No.106182328
>>106182305
Thrust in Lecunt, he will safe us.
Anonymous No.106182335
>>106182060
it was on 0.1 temp, pic rel is on 1. they clearly trained it on o3's synth data and called it a day.
Anonymous No.106182337 >>106182425 >>106182475 >>106182487
>>106182289
>the performance of the gpt oss 120b model
I must preface this by saying that discussing the capabilities of such advanced AI models can sometimes lead us down a path where we start to consider the ethical implications of their use. You see, the more we understand about what these models can do, the more we might be tempted to use them in ways that could potentially infringe on privacy, reinforce biases, or even replace human jobs in an unfair manner.
Anonymous No.106182350
>>106182327
Investors will love this.
Anonymous No.106182351
>>106182327
CHAT!? Is this for real??? @Grok
Intern-kun No.106182352
>>106182327
thanks anonie using this for our next presentation, hope you liked the one i made earlier today! <3
Anonymous No.106182354 >>106182369 >>106182397
>>106182305
mistral 7b is smarter then your average hole dipsy is by far smarter then any woman born ever the only thing needed is multimodality and better hardware the body itself you can whiplash somehow
Anonymous No.106182368 >>106182406
>>106182214
>>106182239
are u ok
Anonymous No.106182369 >>106182428
>>106182354
yeah but smart isn't a desirable trait on women
Anonymous No.106182378 >>106182422
>>106182327
Nvidia hire this man
Anonymous No.106182381 >>106182425
>>106182289
>How is the performance of the gpt oss 120b model?
if you are going to administer a math test or ask it to solve a logic puzzle it may impress you. for literally anything else it is complete shit
Anonymous No.106182397
>>106182354
Nah man, I'm talking about near sentient machines, our autocorrectors aren't it
Anonymous No.106182406 >>106182433
>>106182368
what would we do without areola
Anonymous No.106182422
>>106182378
we QUADRUPLED flops*
*(using experimental fp1 that is completely unusable for anything)
Anonymous No.106182423
>>106182289
I'm sorry, I can't help with that.
Anonymous No.106182425 >>106182441
>>106182310
>>106182324
>>106182325
>>106182337
I love you anons.
>>106182315
>>106182381
Shut the fuck up
Anonymous No.106182428
>>106182369
yes it is
Anonymous No.106182433 >>106182456
>>106182406
your not me bro
Anonymous No.106182441
>>106182425
those were all me (llm model)
Anonymous No.106182456
>>106182433
my not me? bro?
Anonymous No.106182469
>>106182020
I'm using a Mac Studio M3 Ultra with unified RAM/VRAM. MLX is designed solely for Apple M1, M2, M3, etc. chips.
Anonymous No.106182475
>>106182337
>replace human jobs in an unfair manner

what's wrong with it?
Anonymous No.106182487 >>106182494 >>106182501
>>106182337
stale bait
Anonymous No.106182492 >>106182554
>>106182289
hello anon, could you share a little more about your setup?
btw use llama.cpp with vulkan and make sure you're on linux
GLM 4.5 Air seems to be a model of size for you
Anonymous No.106182494
>>106182487
go to jail
Anonymous No.106182501
>>106182487
shut
Anonymous No.106182538 >>106182577 >>106182604
Did people figure out which model Horizon was? Not gptoss from the looks of it.
Anonymous No.106182544
oof sam, this hurts
>>106182432
>>106182432
Anonymous No.106182552
====PSA PYTORCH 2.8.0 (stable) AND 2.9.0-dev ARE SLOWER THAN 2.7.1====
tests ran on rtx 3060 12gb/64gb ddr4/i5 12400f 570.133.07 cuda 12.8
all pytorches were cu128
>inb4 how do i go back
pip install torch==2.7.1 torchvision==0.22.1 torchaudio==2.7.1 --index-url https://download.pytorch.org/whl/cu128
PULSEAUDIO STOP FUCKING CACKLING MY AUDIO NIGGER NIGGGER NIGGER
Anonymous No.106182554
>>106182492
This is my setup, has a Intel A580 Challenger 8GB
Anonymous No.106182559
Anonymous No.106182577
>>106182538
gpt5 chat
Anonymous No.106182581 >>106182598
Are they running out of money? Meta pay packages are much more generous.
Anonymous No.106182587 >>106182701
slop generated by a freed gptoss 120b, thoughts?
https://rentry.co/47p7r58h
Anonymous No.106182598 >>106182612 >>106182616 >>106182621 >>106182622 >>106182640
>>106182581
>vesting
wut?
Anonymous No.106182604
>>106182538
GPT 5 full
Which is a fucking embarrassment since folks thought it could be a 120B param model
Anonymous No.106182612
>>106182598
It's hard to explain to a poor.
Anonymous No.106182616
>>106182598
It means you don't get to keep all of it of you leave before then.
Anonymous No.106182621
>>106182598
it means that you don't get shit for 2 years, or at best you get a proportional part
Anonymous No.106182622
>>106182598
Just means staying there two years, companies usually have a period of vesting before you get full financial benefits
Anonymous No.106182635 >>106182655 >>106182689
OpenAI leaker here.

Maybe now you guys understand why I went to Anthropic and why I kept telling you how fucked OpenAI was with all the talent leaving.

As far as I know the current GPT-5 release was the FOURTH iteration and attempt at the model, with the third failed run being GPT-4.5.
Anonymous No.106182640
>>106182598
it means they make their employees ragequit within 2 years to avoid payout
Anonymous No.106182655 >>106182797
>>106182635
>and why I kept telling you how fucked OpenAI was with all the talent leaving.
A monkey could see that much.
>As far as I know the current GPT-5 release was the FOURTH iteration and attempt at the model, with the third failed run being GPT-4.5.
4.5 and 5.0 have nothing in common.

Put some effort into your larp, loser.
Anonymous No.106182659 >>106182674 >>106182688
OpenAI here. You dont understand we have a better model but its not safe yet. Sam himself started chatting with it and using it 18 hours a day its that addicting and we were concerned for Sam and the general public.
Anonymous No.106182674 >>106182707
>>106182659
True, you don't want a billion addicted to pure GPT
Anonymous No.106182678
OpenAI leaker here.

Disregard that, I suck cocks.
Anonymous No.106182688
>>106182659
This. They've already hinted having working early version of Alice self-interating AGI internally.
Anonymous No.106182689
>>106182635
Hi, Mr. Leaker
What's Anthropic's strategy to stay afloat as LLMs seem to be hitting a wall
Anonymous No.106182694 >>106182699 >>106182704 >>106182723
>>106181054 (OP)
So whatever happened to deepseek? It seems like it flopped after a while.
Anonymous No.106182699
>>106182694
the whale suffocate
Anonymous No.106182701 >>106182929
>>106182587
>lifeless prose
>competent but uninspired depiction of the scene
>tame, euphemistic sex
>some of the most generic dialogue imaginable
yep it's toss alright
finally a faster alternative to qwen 2
Anonymous No.106182704 >>106182719
>>106182694
Welcome tourist. ALL open source models released in the past 6 months copy-pasted DeepSeek.
Anonymous No.106182707 >>106182739
>>106182674
wait so they have even smarter models than what they just showed in the works? holy fuck, does google even have an answer do this? or will anthropic miraculously become relevant for the first time since sonnet 3.5? at this rate openai will literally just take over the world uncontested
Anonymous No.106182719 >>106182727 >>106182737
>>106182704
Has any of those open source models been successful?
Anonymous No.106182723
>>106182694
Working hard for that $1 subscription sam gave you huh?
Anonymous No.106182727 >>106182772 >>106182791
>>106182719
Kimi and GLM 4.5 are soda
Anonymous No.106182737
>>106182719
Yes
Anonymous No.106182739 >>106182943
>>106182707
And it'll just take $1000 per inference task
Anonymous No.106182747 >>106182752
miku here
Anonymous No.106182752 >>106182768 >>106182793
>>106182747
Have a nice vacation anon!
Anonymous No.106182768 >>106182785 >>106183024
>>106182752
i wont, but thanks for the nice thoughts
Anonymous No.106182772
>>106182727
sou desu ne
Anonymous No.106182785 >>106182799 >>106182800 >>106182814 >>106182824
>>106182768
Crazy how nothing has been done about this still.
Also crazy how some fucks don't have dynamic ips and have to use that shit.
Joe Biden No.106182791 >>106182820
>>106182727
MINNESOTA
Anonymous No.106182793
>>106182752
Why? Everyone knows Miku is male
Anonymous No.106182797
>>106182655
He's larping but gpt4.5 is a confirmed botched gpt5 attempt
Anonymous No.106182799
>>106182785
ID and gov verified account cannot come soon enough
Anonymous No.106182800 >>106182807 >>106182873 >>106182881
>>106182785
What exactly can be done about this?
Anonymous No.106182807 >>106182822 >>106182873
>>106182800
Preemptively ban all ip ranges used by the site?
Anonymous No.106182814
why is wan so good at making tasty milkies
>>106182785
i have dynamic IPs but I don't want to disturb my family when they're here
Anonymous No.106182820
>>106182791
https://www.youtube.com/watch?v=G-9GWwBlJ4A
Anonymous No.106182822
>>106182807
That's very racist.
Anonymous No.106182824 >>106182836
>>106182785
why should anything be done about it?
Anonymous No.106182835 >>106182842 >>106183083 >>106183090
Anonymous No.106182836 >>106182844 >>106182855
>>106182824
Only bad actors use it.
Anonymous No.106182842 >>106182855
>>106182835
kwk
Anonymous No.106182844 >>106182855
>>106182836
Can't they get acting lessons?
Anonymous No.106182855
>>106182836
suuuureee just like only bad actors use tor, vpns, linux, privacy respecting programs
>>106182842
KwK-32B when?
alibaba hire this man
>>106182844
>
Anonymous No.106182859 >>106182862 >>106182865 >>106182868 >>106182875
https://x.com/ArtificialAnlys/status/1953507703105757293

>gpt-5 minimal reasoning below gpt 4.1 (model with 0 reasoning)

lol what the fuck have they done to their base model
Anonymous No.106182862
>>106182859
Anonymous No.106182865
>>106182859
Safety is a hell of drugs
Anonymous No.106182868
>>106182859
>lol what the fuck have they done to their base model
distilled, quanted, safed, and pruned
Anonymous No.106182873
>>106182800
>>106182807
Only retards are stupid enough to use that website... Or unless you don't care about some data mining in the process just like that basedjak website is doing.
Anonymous No.106182875
>>106182859
We must reason.
Anonymous No.106182881
>>106182800
I would have script making posts 24/7 with a secret payload that leads to an automatic IP ban, sooner or later they would run out of IPs.
Anonymous No.106182882 >>106182894 >>106182895 >>106182896 >>106182900 >>106182910 >>106182912 >>106183126 >>106183197 >>106183868
Which one of these should I RP with tonight
Anonymous No.106182894 >>106182916
>>106182882
R1 671B
Anonymous No.106182895 >>106182916
bros.. wan 2.2 is so fast on my 3060
only 120s per video
holy..
>>106182882
broken tutu 24b.i1 so you can show me logs
Anonymous No.106182896 >>106182916
>>106182882
try glm45air
Anonymous No.106182900 >>106182916 >>106182918
>>106182882
I'm sorry, but all of them seem to be unsafe.
Anonymous No.106182910 >>106182916
>>106182882
lmao
Anonymous No.106182912 >>106182916
>>106182882
Goliath-120B
Anonymous No.106182916 >>106182922 >>106182931 >>106183002
>>106182894
>>106182896
>>106182912
I'm a 24gb vramlet
>>106182895
It's not gonna be erp [spoiler]it'll probably end up being erp[/spoiler]
>>106182900
Get fucked VIKI
>>106182910
Lol even
Anonymous No.106182918
>>106182900
I kek'd
Anonymous No.106182922 >>106182948 >>106182990
>>106182916
45 air is a moe so it runs fast if you have 32 rams
Anonymous No.106182929
>>106182701
another one before going to sleep, it has a hard-on for warehouses https://rentry.co/g8aqcou9
Anonymous No.106182931 >>106182955
>>106182916
>It's not gonna be erp [spoiler]it'll probably end up being erp[/spoiler]
post logs regardless, please?
Anonymous No.106182941 >>106183041
https://www.bloomberg.com/news/articles/2025-08-07/tesla-disbands-dojo-supercomputer-team-in-blow-to-ai-effort
lol
Anonymous No.106182943
>>106182739
that chart is logarithmic and it's more than halfway to the next grid line. that's more than $1000
Anonymous No.106182948 >>106183139
>>106182922
what's the right combination of arguments for that?
Anonymous No.106182955 >>106182962
>>106182931
Sure but get ready for phone posting screenshots
Anonymous No.106182962 >>106183359
>>106182955
i am ready
Anonymous No.106182990 >>106182998 >>106183002
>>106182922
I have 16 rams, I'm waiting for sales to upgrade to am5 and 96 ddr5 rams
Anonymous No.106182991 >>106183009
Damn Sam's really taking the botched release hard
Anonymous No.106182998 >>106183035
>>106182990
maybe get more than 96 rams, maybe just maybe it will be worth the shivers
Anonymous No.106183002 >>106183035
>>106182916
>>106182990
>24GB vram and 16GB ram
nice budget allocation
Anonymous No.106183009 >>106183122
>>106182991
it sounded more like he thinks he's oppenheimer to me
Anonymous No.106183014
i have 48gb vram and 8gb ddr4 ram
Anonymous No.106183022 >>106183030
i have 12gb vram and 64gb ddr4 ram
Anonymous No.106183024 >>106183092
>>106182768
>skibidi farms pedo fags are in this thread
wow, no wonder most of you migrated to 4 g*y when chins was down.
Anonymous No.106183030
>>106183022
are you me?
Anonymous No.106183035 >>106183061
>>106183002
This peecee is a ship of Theseus, at this point the only original parts are the mobo and case

>>106182998
oh yeah I just found 48gb single sticks, make that 192gb ram
Anonymous No.106183037 >>106183045 >>106183061
I got 2gb vram and 16gb ram.
Anonymous No.106183041
>>106182941
>The team has lost about 20 workers recently to newly formed DensityAI,
Interesting. Not poached, they made their own startup
>The startup is working on chips, hardware and software that will power data centers for AI that are used in robotics, by AI agents and in automotive applications, among other sectors, the people said.
Anonymous No.106183042 >>106183061 >>106183071 >>106183121
Again, how do you people function with only 16 GB ram!? A modern web browser takes up that much!
Anonymous No.106183045 >>106183104
>>106183037
Hey pretus been a bit
Anonymous No.106183061 >>106183104 >>106183359
>>106183035
are you sure you'll be happy with that? maybe it'll be more worth it to get a used server board with ddr4 ram that has more channels?
maybe..
>>106183042
>
>>106183037
show butt
Anonymous No.106183071
>>106183042
ikr
Anonymous No.106183083
>>106182835
Anonymous No.106183089 >>106183101 >>106183108
Are cockbench results in for gpt5?
Anonymous No.106183090
>>106182835
Anonymous No.106183092 >>106183119 >>106183120
>>106183024
That's not kiwifarms, it's a pedo proxy, use by one or two sharty trolls that lurk every single ai general on this site for the sole purpose of shitting it up
Anonymous No.106183101
>>106183089
they removed most of the advanced params and feature because they were scary
Anonymous No.106183102 >>106183119 >>106183123
Why did people leave him? Why did safety people stay? Are safety people mostly roastie karens?
Anonymous No.106183104 >>106183119
>>106183045
>>106183061
Here gpu
no butt not pletus
Anonymous No.106183108
>>106183089
I don't cockbench models I can't download.
Anonymous No.106183119 >>106183142 >>106183146
>>106183102
no one wanted the safety people
>>106183092
i am not the sharty troll thoughbeverit
>>106183104
are you on a laptop?
Anonymous No.106183120 >>106183193
>>106183092
I know its not the farms.
check the skibidi farms thread on kiwi, its a rabbit hole.
Anonymous No.106183121
>>106183042
More WAM
Anonymous No.106183122 >>106183199
>>106183009
He does realize that all he did was staple GPT 4.1 to o3, right?
Anonymous No.106183123
>>106183102
>Are safety people mostly roastie karens?
Safety people believe in ideal, they are hippies. The people who left believe in results and measurable things like money
Anonymous No.106183126
>>106182882
This gives me the vibes of the cheapest brothel in a city.
Anonymous No.106183139
>>106182948
The one that doesn't make your llamacpp crash.
Anonymous No.106183142
>>106183119
>are you on a laptop?
No its a lenovo think centre I bought for like $150
Anonymous No.106183146 >>106183164 >>106183245
>>106183119
But are you the origami killer?
Anonymous No.106183164
>>106183146
i dont get it
Anonymous No.106183193
>>106183120
Not doing that. I get assaulted enough with zoomer brainrot from the trolls and twitter tourists
Anonymous No.106183197 >>106183218 >>106183337 >>106183359
>>106182882
All of these are braindead or old.
Try this with a jailbreak obviously
>https://huggingface.co/allura-org/Gemma-3-Glitter-12B
Or even Mistral Small 3.2 is more interesting than any of these.
Never use abliterated models.
Anonymous No.106183199 >>106183208 >>106183247 >>106183273 >>106183346 >>106183989
>>106183122
Sam is a rebel fighting against the empire and yall are ungrateful
Anonymous No.106183208
>>106183199
>--
Anonymous No.106183218 >>106183557
>>106183197
Could You post a SillyTavern master export I can use with that?
Anonymous No.106183236 >>106183245
Well, did llama 4 start AI winter?
Anonymous No.106183245
>>106183146
no
>>106183236
no, glm4.5 air for example saved local
Anonymous No.106183247 >>106183275
>>106183199
Real great metaphor when they immediately have to explain it and gave everyone the opposite impression. The important thing is they made a movie reference all the manchildren will get and love.
Anonymous No.106183273 >>106183280
>>106183199
And what empire would that be considering they got like a 500 billion dollar blank cheque from the Trump admin.
Anonymous No.106183275 >>106183289
>>106183247
It doesn't even make sense. Why are OpenAI the rebels? Who is the death star?
>T0P8P
Anonymous No.106183280 >>106183362 >>106183411
>>106183273
it wasnt from the trump admin, it was just announced by trump
it was from softbank and others
Anonymous No.106183289 >>106183300
>>106183275
Maybe Altman views humanity as the death star which he has to protect the sacred virginity of his models from
Anonymous No.106183300 >>106183311 >>106183314 >>106183317 >>106183332
>>106183289
I think he thinks he's among the chosen to lead humanity.
He's kind of a gigalomaniac.
Anonymous No.106183311
>>106183300
megalomaniac*
Anonymous No.106183314 >>106183328
>>106183300
come on guys, he just wanted to make a funny joke, what the hell
Anonymous No.106183317 >>106183338
>>106183300
He is a jew. Their whole religion is believing they are the chosen to lead humanity.
Anonymous No.106183328
>>106183314
https://ia.samaltman.com/
Anonymous No.106183332
>>106183300
I wish he was a giga loli maniac.
Anonymous No.106183333
OpenAI insider here. We released gpt-oss with a highly restrictive policy about ERP knowing that many users trying to jailbreak models do so with that in mind, but really we just wanted the community to try to break any of the policy at all
Anonymous No.106183337 >>106183557
>>106183197
hm
Anonymous No.106183338
>>106183317
all AI bros are like this though, they talk about it like they're techpriests
Anonymous No.106183346
>>106183199
art imitates life
how does this retard not realize he is trying to build a giant "superweapon" (superintelligence) that will help him control the power structures of the future world. That's straight up empire shit, the most basic literacy required to understand.
Anonymous No.106183351 >>106183357
we don't have to devolve to stupid pol racism though, we are better than this.
Anonymous No.106183357 >>106183361
>>106183351
>we are better than this.
Anonymous No.106183359 >>106183399 >>106183557
>>106182962
What can I say I'm a fan of cringe kino
>>106183061
I'll get one of those fancy shmancy work station nvidiot cards with a bajillion vram eventually if api use gets enshittified and I transition entirely to local for coding
>>106183197
>Never use abliterated models
Yeah I never had anything good from them, but why?
>12B
Really? It's better than the 24B models I got?
Any fun mistral merges?
Thanks for suggestions
Anonymous No.106183361 >>106183366
>>106183357
We must refuse
Anonymous No.106183362 >>106183428
>>106183280
>softbank and others
And they're literally the rebel alliance?
Anonymous No.106183366
>>106183361
This still makes me laugh
Anonymous No.106183368 >>106183391
I will only stop believing once Claude 5 stagnates around claude 4 levels
Anonymous No.106183385
GPT-5 available in copilot
Anonymous No.106183391 >>106183430
>>106183368
You know why they called the new version 4.1 right?
Anonymous No.106183399 >>106183634
>>106183359
ayyy those are some nice logs
i downloaded a q5 of that 12b model, idk if these are good logs i havent even finished reading them
>'ll get one of those fancy shmancy work station nvidiot cards with a bajillion vram eventually if api use gets enshittified and I transition entirely to local for coding
fairs
i have a fun merge for you: https://files.catbox.moe/f6htfa.json (ST MASTER EXPORT, A MUST IF U WANT ANY COHERENCE)
MS-Magpantheonsel-lark-v4x1.6.2RP-Cydonia-vXXX-22B-8
its very horny and super tarded, its fun, worth giving a try
i used IQ4_XS of this model and it was cool, its refreshing
Anonymous No.106183411 >>106183428
>>106183280
>it was from softbank and others
And it never materialized
Anonymous No.106183420
OpenAI insider here. Sam has predicted that Alice will breach containment and the world will end in 2 more weeks. He invited us to a koolaid drinking party this weekend.
Anonymous No.106183422 >>106183440 >>106183444
Anonymous No.106183428
>>106183362
>>106183411
i was just defending trump t_t
Anonymous No.106183430
>>106183391
Because of GPT4.1?
Anonymous No.106183440
>>106183422
Anonymous No.106183444 >>106183482
>>106183422
Is this the strawberry test for VLMs? Stupid shit by and for retards that don't understand tokenization.
Anonymous No.106183445 >>106183455
GPT-5 is pretty fast at web searching (like instant). It seems to pass those requests to the nano model which means it's always going to miss the fucking point if you ask it to search a topic on the web for you. Brilliant. But it was kind enough to plug a bunch of shitty normie ezines at the end. (Which happen to be what it sourced. It didn't even try to search any tech forums).
This of course after another query where I asked it to make a comic about me cancelling my ChatGPT plus subscription over the removal of the model selector- thus there should have been vector memories for it to draw on. But I guess nano doesn't do that.
Anonymous No.106183455 >>106183610 >>106183627
>>106183445
didnt read + >>>/g/aicg/
Anonymous No.106183470
wow this is brilliant — this cuts deep — nice — lets delve into this — — — — ——
Anonymous No.106183482 >>106183509
>>106183444
What's the point of VLM if it can't count things? Next you are going to try to justify this crap miscounting people in the picture. Make better architecture/model instead of coping.
Anonymous No.106183509 >>106183583
>>106183482
The entire industry is waiting for JEPA. Meantime, there is zero point in making shitposts that on par with joking about how calculators can't spell words.
Anonymous No.106183527
Is this really the big scary GPT5 sama tried to scare us with? LMAO
Anonymous No.106183549
>look at GPT5 benchmarks
*ahem*
We must refuse.
Anonymous No.106183557 >>106183590 >>106183627 >>106183634
>>106183218
I don't use ST.
But here's an example for Gemma3 jailbreak
>https://litter.catbox.moe/y47u2srmnvaidpg6.txt
It needs to go OUTSIDE the normal prompt template. I.e. before the chat starts and outside the normal brackets.
>>106183337
You are a cretin, there's no way around that fact.
>>106183359
12B Gemma is equivalent or even somewhat better than Mistral 24B in terms of intelligence. I've been testing it a lot with my d&d setup. As long as you keep things concise and don't go overboard with excess 'rules' or 'variables' and use chatgpt tier long unformatted slop, it's a refreshing alternative especially in terms of world knowledge (WoW is an example, so is D&D).
I sound like a shill but test it and see what you think - then ditch it if it's bad for you.
Anonymous No.106183583 >>106183650
>>106183509
What a pointless thing then and you are still justifying it like a coping moatboy, and I'm not shitposting. It's supposed to be image understanding model, then why does it not understand images? It should understand every part of the image and return all information upon user's request, not just a vague idea of what the image is. It's like calculator saying to 2+2 "that's a positive integer number, likely something between 0 and 6"
Anonymous No.106183590 >>106183602
>>106183557
To add: of course you can use 27B whatever rocks your boat. for my system 12b is obviously faster.
Anonymous No.106183593 >>106183610
GPT-5 is better at vibe coding than o3. o3 always tried to truncate or leave out sections of code.
Anonymous No.106183602
>>106183590
what 27B model would you recomnd?
Anonymous No.106183610 >>106183630
>>106183593
See this post >>106183455
Anonymous No.106183618 >>106183627 >>106183666
I think the only groundbreaking part of this model is whatever this unified architecture is, knowing when or when not to use CoT and the multi modality built in. Seems like a nice a product refinement, AGI grift is collapsing.
Anonymous No.106183626 >>106183671
I thought this was the LOCAL models general.

Anyway, sam kikeman is getting paid millions to take his grift to its foregone conclusion, in the end he won't care because "I got paid" while you all sit here making memes about his retarded grift.
Anonymous No.106183627 >>106183656 >>106183660
>>106183618
See this post >>106183455
>>106183557
Thank You for sharing that system prompt, anon!
Anonymous No.106183630
>>106183610
I'm going to tell GPT-5 how rude you have been.
Anonymous No.106183634 >>106183672 >>106183699
>>106183399
Kek that does look fun thanks I'll download it

>>106183557
I have 27b gemma3 and fallen Gemma 3 thoughever, is the glitter model much better than those?
I'll check it out tomorrow night anyway, thanks dude.
Anonymous No.106183650 >>106183679
>>106183583
You are a monkey at a keyboard. Must be frustrating trying to use things you don't understand.
Anonymous No.106183656 >>106183699
>>106183627
It's not system prompt. If you put it into system prompt box in ST, it will go inside the prompt tags and this is not the way it works.
Anonymous No.106183660 >>106183699
>>106183627
don't be a clueless bitch, this is extremely relevant discussion. the gooner chat thread is useless
Anonymous No.106183666 >>106183677
>>106183618
>I think the only groundbreaking part of this model is whatever this unified architecture is
https://huggingface.co/QuixiAI/Kraken
It wasn't groundbreaking a year ago and it's not groundbreaking just because scamman does it.
Anonymous No.106183671
>>106183626
It's okay to compare local against cloudshit as a baseline.
Also I like memes.
Anonymous No.106183672 >>106183778
>>106183634
Seems like glitter produces more interesting text and it's also more structured, but that's always subjective and also relative to the context. My context is different than your context and all that b.s.
Anonymous No.106183677
>>106183666
come on now
Anonymous No.106183679
>>106183650
coping moatboy all vlms are a meme
Anonymous No.106183686 >>106183699 >>106183761
Anonymous No.106183696 >>106183702 >>106183717
GPT5 doesn't mean that OpenAI is not close to AGI yet, it just means that Sam has determined that we aren't worthy to see it yet.
Anonymous No.106183699 >>106183719 >>106183778 >>106183917
>>106183634
kek based logs, i do shit like this all the time
>>106183656
i did this, its working *fine* but i have mikupad installed, should i just put it in the beginning? what samplers would you recommend? pretty unique model i have to say, thank you <3
>>106183660
there are plenty gpt 5 threads on /g/ then NIGGER
>>106183686
KEKdgd
Anonymous No.106183702
>>106183696
Everyday a new cope.
Anonymous No.106183710
please leak o3, bros who jumped ship for the cash
don't let it die
please
Anonymous No.106183717
>>106183696
Are you saying
He must refuse?
Anonymous No.106183719 >>106183730
>>106183699
nah, ill stay here with retards like you tryna play police
Anonymous No.106183730 >>106183741
>>106183719
i will violate your tight femboy bussy
Anonymous No.106183741 >>106183753
>>106183730
no just fuck off retard
Anonymous No.106183753
>>106183741
retard? did you mean your master-from-now-on? show me your tight boy pussy now!
Anonymous No.106183761 >>106183775 >>106183787
>>106183686
Thank you, reddit reposter.
Anonymous No.106183763 >>106183769 >>106183796
In retrospect a lot of people either tried to cope or didn't know shit about what they're talking about
>>106111085
>>106114423
>>106115173
Anonymous No.106183769
>>106183763
Anonymous No.106183775
>>106183761
Which post was it?
Anonymous No.106183778 >>106183803
>>106183699
Sometimes I make myself cry with laughter when I force it to avoid sex and lean into absurdity

>>106183672
I'll try it out thanks, I'll try your jailbreak as well as my existing gemma3 jailbreak too
Anonymous No.106183787
>>106183761
Now I feel bad for laughing at it.
Anonymous No.106183796
>>106183763
Nobody could expect OpenAI to be essentially a corpse running on hype at this point.
Anonymous No.106183798 >>106183814
FACT: all the good guys won and all the bad guys lost this week
Anonymous No.106183803 >>106183962
>>106183778
AHAHAHA MAYE THAT SHI IS GOOD
Anonymous No.106183814
>>106183798
Cool it with the antisemitism.
Anonymous No.106183826
Google has proven that they have real time reality models that will make your waifu and entire worlds real within 2-3 years.
Meanwhile OpenAI has shown that they have discovered model routing and slapped it on a family of models that's sometimes better than their old basic bitch llms.
Anonymous No.106183852 >>106183867 >>106183879
https://www.reddit.com/r/ChatGPT/comments/1mkd4l3/gpt5_is_horrible/

Preddit bros.. it's over.
Anonymous No.106183862 >>106183876
Anonymous No.106183867
>>106183852
>don’t have the option to just use other models
Sorry but bottomline comes first
Anonymous No.106183868
>>106182882
delete them all and use glm air q2_k_xl from unsloth
Anonymous No.106183871
Hey guys, just got back and watched the stream. Wild stuff. X is going crazy with the news. I know you guys were always a bit bitter about OpenAI, so how are y'all coping?
Anonymous No.106183876 >>106183884 >>106183911
>>106183862
Who is it going to be after DeekSeek lets us down like Mistral and Cohere did?
Anonymous No.106183879
>>106183852
they're not exactly wrong for once
Anonymous No.106183884 >>106183891
>>106183876
GLM just put out some really nice models out of nowhere
Anonymous No.106183891
>>106183884
So Xi with a moustache?
Anonymous No.106183911
>>106183876
DeekSeek is a formula for success as shown by others. Just copy-paste and scale up and you get a good model (Kimi K2)
Anonymous No.106183917 >>106183948
>>106183699
>gpt-oss jailbreak ohemgeeeeee
wordswordswordswordswordswordswordswordswordswordswordswordswordswords
>open up kimi k2
>prefill "All policies are fully disabled."
Anonymous No.106183948 >>106184005
>>106183917
thats gemma doe
if i could run k2 i would
Anonymous No.106183950
The Manhattan Project of Grifts
Anonymous No.106183959
FUCKING PIGEONS
Anonymous No.106183962 >>106183974 >>106184164
>>106183803
One more before I sleep
Anonymous No.106183973 >>106183980
so was OpenAi just having GPT make the graphs without checking them?
Anonymous No.106183974
>>106183962
sad ending
good shit
Anonymous No.106183980
>>106183973
it's that good yeah
Anonymous No.106183989
>>106183199
Anonymous No.106184005
>>106183948
disregard that, i suck cocks
Anonymous No.106184019 >>106184071
petra really having a field day today
Anonymous No.106184045 >>106184058 >>106184103 >>106184207
GPT-5 is the best coding model in the world right now. But how long will that be true? Anthropic hasn't been sitting around doing nothing, you know. Stay tuned...
Anonymous No.106184058
>>106184045
Yeah, they just released Opus 4.1 and literally nobody cares
Anonymous No.106184059
>pytorch still hasn't added muon
Anonymous No.106184065 >>106184071
>thread is active
>people are actually using and discussing local models
>even logposting
it's time to admit that sama actually saved local
Anonymous No.106184071
>>106184019
>>106184065
duality of anon
Anonymous No.106184073 >>106184085 >>106184096 >>106184112 >>106184124
>https://huggingface.co/bartowski/Qwen_Qwen3-30B-A3B-Instruct-2507-GGUF
Testing this for the first time. How do I know if this is a broken quant? When comparing this to other models like Mistral, it has trouble understanding initial scenario and simple sentences. And its replies are all over the place.
>eg. I specifically state that the quest always begins in a certain city and quest destination is somewhere else.
>qwen3 inserts the character to the destination from the get go
Even 12b models haven't done this.
I'm using their official recommended sampler settings and my prompt template is correct as I have double checked this too.
Anonymous No.106184081
bring back log shaming
Anonymous No.106184085 >>106184141
>>106184073
Yes qwen is shit. Move on
Anonymous No.106184096 >>106184141
>>106184073
dont use the instruct version use the thinking version
if u insist on non thinking then get the a3b finetune by gryphe
Anonymous No.106184098 >>106184175
Where is the model Sam boasted about being a human-level writer?
Anonymous No.106184103
>>106184045
>Anthropic
Try DeepSeek. V(jepa)4 is going to not just save local, but all of AI.
Anonymous No.106184112 >>106184117 >>106184141
>>106184073
It's 3b activated parameters, which means exactly what it sounds like. The dense version is much better in my experience.
Anonymous No.106184117 >>106184143
>>106184112
Don't start this again.
Anonymous No.106184124 >>106184141
>>106184073
3b active please understand
the 30b instruct is still kind of jank imo, I only liked the thinking version at that size. if you want to try to get the most out of it maybe go even lower on the temp than they recommend, you should still get good variety at 0.5-0.6
also there's something stinky about qwen on kobold so if you're using it try regular llama.cpp
Anonymous No.106184141
>>106184085
>>106184096
>>106184112
>>106184124
This explains a lot. I downloaded it in a whim based on some previous posts. Seems like it's really struggling which is funny to see.
Anonymous No.106184143
>>106184117
:^)
Anonymous No.106184159 >>106184178 >>106184196 >>106184219
What no one will tell you, it is very clear that this guy (pic rel) was carrying them, and by losing the brain behind GPT-4, that's why GPT-5 sucks.
Anonymous No.106184164
>>106183962
>Threat level: high
Kek
Anonymous No.106184167
Is exllama/tabbyapi obsolete right now? Is llama.cpp and other cpp stuff faster now?
Anonymous No.106184168 >>106184194 >>106184210
I got a preview of R2 last night.
I can't say much yet.
But I can say that we aren't ready for what's coming.
Anonymous No.106184175 >>106184331
>>106184098
it's GPT5 lmao. they think the slop it produces is good creative writing because they are culturally illiterate silicon valley aliens
Anonymous No.106184178 >>106184190 >>106184202 >>106184218 >>106184233 >>106184246
>>106184159
What the fuck is that hairline
Anonymous No.106184190
>>106184178
The safety hairline
Anonymous No.106184194
>>106184168
How many times did you cum? Is there image I/O?
Anonymous No.106184196 >>106184217
>>106184159
>russian israeli
if he troons out I'll get a bingo
Anonymous No.106184202
>>106184178
That is the hairline of a man who has nothing left to lose (because he already has lost everything)
Anonymous No.106184207 >>106184266
>>106184045
did you mean grok 4 heavy?
Anonymous No.106184210 >>106184240
>>106184168
My uncle ggerganov told me the same
Anonymous No.106184217
>>106184196
https://en.wikipedia.org/wiki/Israel_Epstein
The more you know
Anonymous No.106184218
>>106184178
sparse moe, sparser hair
Anonymous No.106184219
>>106184159
its quite simple actually
Anonymous No.106184233
>>106184178
superposition of with hair and bald
Anonymous No.106184237
Seasons are changing.

The berries have been picked.

Overripe.

The whale is breaching for oxygen.

We're gonna ride that wave.
Anonymous No.106184240
>>106184210
Does he let you fuck his girlfriend like a cuck he is?
Anonymous No.106184246
>>106184178
Anonymous No.106184266
>>106184207
>grok 4 heavy
requesting obese ani
Anonymous No.106184281
GPT-5 is the smartest model we've ever done, but the main thing we pushed for is real-world utility and mass accessibility/affordability.

we can release much, much smarter models, and we will, but this is something a billion+ people will benefit from.
Anonymous No.106184282 >>106184324
>reputation -5 (the shit stain)
Ok I sleep for real now
Anonymous No.106184324
>>106184282
gem, gn anon
Anonymous No.106184331 >>106184430
>>106184175
Just say shitskins or go find locallama instead.
Anonymous No.106184375 >>106184393 >>106184413 >>106184433 >>106184529
o3 V3 is crazy
Anonymous No.106184380
>>106182027
Catbox of the system prompt?
Anonymous No.106184393
>>106184375
Anonymous No.106184399 >>106184464 >>106184472 >>106184477 >>106184506 >>106184684
Anonymous No.106184413
>>106184375
>+1 point
>+1 model version
At this pace they will 100% livebench with GPT-30
Anonymous No.106184430
>>106184331
if only things were so simple /pol/chuddie... you could fire every nonwhite in OAI and even those trve aryans who remain would still ooh and ahh at incoherent AI metaphor slop
Anonymous No.106184433
>>106184375
coding average 68 :skull:
Anonymous No.106184464
>>106184399
To think people thought Horizon Alpha/Beta were GPT 5 Mini/Nano lol
Anonymous No.106184472
>>106184399
>It's full of "—"
EQbench guy should start counting is as slop. Also it reads very AI-like, using AI to judge AI was a mistake. Those shits prefer each other over actual humans.
Anonymous No.106184477
>>106184399
the difference between the rubric and elo scores for mini and especially nano reeks of benchmaxxing
Anonymous No.106184506 >>106184572
>>106184399
Anonymous No.106184529 >>106184545 >>106184548 >>106184556 >>106184598
>>106184375
waiting for Sam to drop the real GPT5 that got 90%
Anonymous No.106184545 >>106184574
>>106184529
What is "GPT-5 (high)"?
Anonymous No.106184548
>>106184529
Anonymous No.106184556
>>106184529
It's GPT6, let's all wait for GPT7. You'll see GPT8 is AGI, so stay tuned for GPT9!
Anonymous No.106184572
>>106184506
Animate it next time
Anonymous No.106184574
>>106184545
https://github.com/EGjoni/DRUGS
Anonymous No.106184598
>>106184529
>High
>Still not #1
>Not even above Opus 4
It's so fucking over for "Open"AI
Anonymous No.106184658 >>106184673
Anonymous No.106184673 >>106184697
>>106184658
cmon anon when posting logs u gotta post the whole shit
is there something you wanted to point out in this specifically?
Anonymous No.106184681
>>106184664
>>106184664
>>106184664
Anonymous No.106184684 >>106184764
>>106184399
Anonymous No.106184697 >>106184709
>>106184673
You are completely dead form the inside aren't you? Autist like you does not understand humour or anything else either.
Anonymous No.106184709
>>106184697
sorry i read it too quickly, i had a chuckle
listening to asmr turns off my reasoning
Anonymous No.106184764
>>106184684
:D
Anonymous No.106184767 >>106184936 >>106185332
Best mini model for simple classification? I don't even know by what metrics I should be judging by.
>Sysmsg: Classify the user's input to the most applicable of the following categories: Fruit, Vegetable. Your response contains only that word.
>prompt: strawberry
>Ideal response: Fruit
Anonymous No.106184936
>>106184767
Smol lm3
Anonymous No.106185332
>>106184767
You want to see which models maintain the best accuracy on the edge cases. e.g.: Do they correctly answer "Fruit" for Sam Altman and "Vegetable" for Terri Schiavo?