← Home ← Back to /g/

Thread 107077315

313 posts 140 images /g/
Anonymous No.107077315 [Report] >>107077330 >>107077696 >>107078367
ldg/ - Local Diffusion General
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107074285

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Neta Yume (Lumina 2)
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd
https://gumgum10.github.io/gumgum.github.io/
https://neta-lumina-style.tz03.xyz/
https://huggingface.co/neta-art/Neta-Lumina

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
scabPICKER No.107077330 [Report] >>107077362
>>107077315 (OP)
SONGBLOOM THREAD 100%
Anonymous No.107077331 [Report]
Anonymous No.107077356 [Report]
I support Neta Lumina
Anonymous No.107077362 [Report] >>107077375
>>107077330
You keep shilling this garbage model in every thread thinking you'll make it be a popular thing (maybe to justify the effort your are making comfy nodes for it or something), but listen to your conscience for once: it won't be a thing
Anonymous No.107077363 [Report]
I support black lives matter
scabPICKER No.107077375 [Report] >>107077392
>>107077362
>thinking you'll make it be a popular thing
I doubt that. music is way less popular than tv and movies.
scabPICKER No.107077381 [Report] >>107077392
One thing we really don't have is audio controlnets.
Anonymous No.107077382 [Report] >>107077391 >>107077464 >>107077611
oh is he mad because its above chroma and illustrious in op kek
Anonymous No.107077388 [Report]
>>107077341
wat? I haven't been scared lol, I'm like one of the primary netaposters around here recently, the other one being that guy that does the "all fours roped up girl" pics with Neta somewhat regulary

>>107077355
For both versions of Chroma there, positive was:
`a highly detailed 2D digital cartoon of an overweight hairy shirtless nerdy man wearing a fedora and sweating and shaking as he sits at a desk in his darkened bedroom and stares at his computer monitor with a deranged expression on his face. The computer monitor is facing away from the viewer and seen from behind. A wireless keyboard and wireless computer mouse are on the desk in front of the man. There is a massive shirtless firefighter poster on the wall behind the man. A speech bubble coming directly from the man's mouth reads "HEHE, /LDG/ WON'T KNOW WHAT HIT 'EM!"`

and negative was:
`aesthetic 0, blurry, pixelated, jpeg artifacts, out of focus, lowres, worst quality, disfigured, deformed, fused, conjoined, bad hands, missing finger, extra digits, bokeh, dof, depth of field, sketch, greyscale, monochrome, grotesque, old, older, elderly, mature, wrinkled skin, 4koma, bad anatomy, bad composition, bad proportions, bad perspective`

used DPM++ 2S Ancestral Linear Quadratic for both, 25 steps. No loras.

I (Zootanon, again) am also the person who posted the original NetaYume version of this overall idea recently, that Jimmy The Troll most likely has / had in mind BTW:
https://civitai.com/images/107297931
scabPICKER No.107077391 [Report] >>107077864
>>107077382
Choma would be pretty great if it could do hands.
Anonymous No.107077392 [Report]
>>107077375
>>107077381
Oh, I now see the repost bot is "improving". What is the endgame of this?
Anonymous No.107077399 [Report] >>107077413
>>107077355
gimme a sec ill try to bake with the negatives
Anonymous No.107077400 [Report] >>107077403 >>107077412 >>107077469 >>107077526
/adt/ has more kino collages, friendlier anons, more frequent dev updates and fun. this thread has schizos, narcissists, trolls and slop. get your shit together anons
Anonymous No.107077402 [Report]
>who is this 4chan
Anonymous No.107077403 [Report] >>107077412
>>107077400
Okay, so why don't you fuck off and stay there then?
Anonymous No.107077406 [Report]
Anonymous No.107077412 [Report]
>>107077400
trvke
>>107077403
least angry netayume poster
Anonymous No.107077413 [Report]
>>107077399
ah wasn't sure if you saw my reply in the last thread, reposted it above too as part of a comment lol
Anonymous No.107077422 [Report]
>>107077371
but yea the prob is overfitting, and that the music algos will likely match it anyways. And im not joking when I say music mafia

https://files.catbox.moe/0c212o.wav
https://files.catbox.moe/blsc0w.wav
Anonymous No.107077427 [Report] >>107077444 >>107077468 >>107077473 >>107077502 >>107077539 >>107078529
AHHH I'm STARVING for that /ldg/ shilling energy!! FEED ME the Song Bloom hype, Neta Lumina cope, Radiance shills, Chroma wars, cherry-picked masterpieces, half-baked takes, 'works on my machine' excuses, 'two more weeks' trust-the-plan posting, Qwen Miku benchmark spam, schizo rants INJECT IT STRAIGHT INTO MY BRAIN!! I'm FEELING IT, THE ENERGY IS FLOWING, I'M BECOMING ONE WITH THE SHILL!!"
Anonymous No.107077428 [Report]
>20 replies in 11 minutes
Popular thread desu
nb4
Anonymous No.107077440 [Report]
>>107077371
>>107077415
stop it
https://files.catbox.moe/q824c4.wav
https://files.catbox.moe/nt0fw9.wav
Anonymous No.107077444 [Report]
>>107077427
Perfect gen
Anonymous No.107077445 [Report] >>107077477 >>107077497
why is qwen image the best model?
Anonymous No.107077464 [Report]
>>107077382
when did Neta even get put in? I think it was around V2.0 ish of NetaYume, or so. Pretty sure I was one of the first people to notice that it had become a noteworthy improvement over base Neta Lumina
Anonymous No.107077468 [Report]
>>107077427
this general was founded on pixart hype and by god it will continue to survive on new model hype
Anonymous No.107077469 [Report]
>>107077400
Not a real thread, if it's more fun go there faggot
Anonymous No.107077473 [Report]
>>107077427
gemmerald
Anonymous No.107077477 [Report]
>>107077445
YMMV. it has some of the best prompt understanding.

but also usually seeds do very little in terms of variation even on prompts that would clearly permit it and it is taxing to train. and its base training is kinda meh.
Anonymous No.107077483 [Report]
Anonymous No.107077489 [Report] >>107077499 >>107077505 >>107077522 >>107078457
All local musicgen models are not worth of anyone's time, you are better off using the free tier of the SaaS websites as even their free models are better than any of the garbage open models available.

Just pray Qwen's musicgen model and AceStep 1.5 turns out good
Anonymous No.107077494 [Report]
I think this could work in chroma with the right prompt, but I just slapped the netayume one and called it a day, with some random cfg value, so who knows
Anonymous No.107077497 [Report]
>>107077445
I tested Hunyuan Image 2.1 a bunch after it was discussed here recently and IMO it's actually a *bit* better than Qwen overall, if you consider it in terms of like, just baseline knowledge and prompt adherence and accept that neither one of them is even remotely good at realism by default
Anonymous No.107077499 [Report] >>107077523
>>107077489
its cause all the open models use a small amount of shitty licensed / open music, you HAVE to use a fuck ton of songs
Anonymous No.107077502 [Report] >>107077509
>>107077427
what was the prompt for this masterpiece
Anonymous No.107077505 [Report]
>>107077489
Why not Neta Yume music gen or Anistudio music gen? Do you have a particular reason for promoting certain models over others?
Anonymous No.107077508 [Report] >>107077515 >>107077577
Let's just hope Alibaba doesn't do what they always do, train on synthetic slop for their music model. Music should actually be much easier to train. I checked the Suno v4.5 side of things and while it's hit or miss whether it can match Udio composition wise, it can make pretty nice songs. If what ACE Step guy said was true, 1.5 should be Suno 4.5+ quality, and assuming his dataset is healthy and even better than before, it should be possible.
Anonymous No.107077509 [Report]
>>107077502
The text itself
Anonymous No.107077515 [Report]
>>107077508
i feel like there's not enough extant AI music for that to really be a big concern
Anonymous No.107077522 [Report]
>>107077489
>AceStep 1.5
Listen to the latest samples he posted, it actually sounds pretty good sound quality wise. The only cap right now is his SFT method and the quality of his dataset.
Anonymous No.107077523 [Report] >>107077564 >>107077599
>>107077499
YuE is the only based open model because they trained on tons of copyrighted stuff. On their paper their vaguely mention using "internet data" (which means they probably used tons of copyrighted music), and the ICL mode (where you can input a reference audio sample) it has follows the style and vocals for certain artists really well, probably because the model has seen them before.
Anonymous No.107077526 [Report] >>107077592
>>107077400
Admit it little slut, it's more fun here.
Anonymous No.107077536 [Report]
truly the best model is the one you have inside your skull
Anonymous No.107077539 [Report] >>107077551 >>107077553
>>107077427
Why not Ponyv7?
Anonymous No.107077551 [Report] >>107077568
>>107077539
its not as bad as people say because you have to do stupid shit like spam negatives / use style clusters and long prompts but its far worse than chroma
Anonymous No.107077553 [Report]
>>107077539
whats new about pony?
Anonymous No.107077564 [Report] >>107077684
>>107077523
music really is not the same, they have copyrights on fucking 2 second long skiffs, and they have stuff trained on finding similarity and they come down hard on anyone. That is why no not Chinese based even tries
Anonymous No.107077568 [Report] >>107077572 >>107077584
>>107077551
>its not as bad as people say
Ohhh that energy you're giving me life right now! I can FEEL IT! So tell me, anon do I dive back in with your settings and pray it works? Or do I rot for another 2 weeks waiting for v7.1 because "trust the plan bro" even though this current version is barely functional??
Anonymous No.107077572 [Report]
>>107077568
go back to /hdg/ where you belong tbqh
Anonymous No.107077574 [Report]
STOP EVERYONE!
CAN EVERYONE THINK ABOUT DEBO FOR A MOMENT?
THANKS
Anonymous No.107077577 [Report]
>>107077508
I would be happy if at very least Alibaba's model supports reference audio and follows through it with high fidelity. Base Wan (the text to image/video component) is a slopped model, but the image-to-video component is amazing. If the same holds true for music gen (it replicating the style of the reference audio well), then we will have nothing to worry about.
Songbloom (which is being shilled here) is a model that only works through reference audio, and it's utter garbage and produces outputs that are barely reminiscent to them
Anonymous No.107077584 [Report] >>107077588 >>107077605
>>107077568
I dont use it, I use a noobai tune for 2d, chroma for anything else
Anonymous No.107077588 [Report] >>107077605
>>107077584
based, imagine thinking netalumina can compete
Anonymous No.107077592 [Report] >>107077614
>>107077526
unironically this is the only diffusion thread left where ANY normal person discussion of anything ever happens. There used to be some in /b/ but they devolved into being almost nothing but imageposts over time. And of course it used to be present on /sdg/ too way back in the day. /hdg/ was always negativity-maxxing schizo containment thread though.
Anonymous No.107077598 [Report] >>107077613 >>107077739
>check out local diffusion general
>everyone namedropping companies
>barely any actual discussion
Why are we like this?
Anonymous No.107077599 [Report] >>107077607
>>107077523
Yes, YuE is considerably better at composition than all local models, but it's hardest to infer. Even with ExL2 I get OOMs on 24GB if I go for 2 min songs (though may be my settings), plus there's no proper Comfy support. YuE is still sounds pretty bad quality wise though, and there's no update from their team.
Anonymous No.107077605 [Report] >>107077612
>>107077584
>>107077588
identicalhomo
indistinctgay
etc
Anonymous No.107077607 [Report]
>>107077599
>there's no update from their team.
The lead dev got hired by Alibaba. He is probably working on the very musicgen model they are supposedly working on.
Anonymous No.107077611 [Report] >>107077649
>>107077382
How much longer do you think he'll be malding
Anonymous No.107077612 [Report]
>>107077605
Anonymous No.107077613 [Report]
>>107077598
stagnation period. this is just like last time desu
Anonymous No.107077614 [Report] >>107077634
>>107077592
>unironically this is the only diffusion thread left where ANY normal person discussion of anything ever happens.

Wait, seriously? You're saying this is the only normal discussion thread left, even though you've been dealing with constant harassment from a schizo for weeks straight, 24/7
Anonymous No.107077622 [Report]
>>107077247
Krea version just for fun
Anonymous No.107077624 [Report]
LET'S PLAY A GAME CALLED 'COMPANY NAME DROPPING BINGO!'
LET'S CROSS OFF THE NAMES THAT HAVE ALREADY BEEN USED!
Anonymous No.107077634 [Report] >>107077655
>>107077614
i'm not saying it's ideal just that the other threads are either literally worse or just less interesting
Anonymous No.107077649 [Report] >>107077677
>>107077611
if he's an turbo ESL whose real reason for hating it is his own inability to actually write even simple coherent natural language prompts in English, maybe forever
Anonymous No.107077655 [Report]
>>107077634
For me, /ldg/ is where I can shitpost freely and actually get responses - it's therapeutic. In other threads, if you critique someone's gen, people get butthurt or go full Platonic idealist mode. No fun allowed there.
/ldg/ = banter and fun
Discord = serious discussions and actual news(all the devs post there and youvan talk to them)
Anonymous No.107077677 [Report] >>107077715
>>107077649
why are you upset? if your gpu can run chroma you should try it too! its very fun
Anonymous No.107077684 [Report] >>107077742
>Songbloom (which is being shilled here) is a model that only works through reference audio, and it's utter garbage and produces outputs that are barely reminiscent to them

That is the thing. You can't have a model that excels only at one thing but not the other. A properly trained model should be able to do both.

>>107077564
It's a shame that open source is so cucked. It's not just regular music that was scraped by Udio team (and also ElevenLabs), that stuff can even do movie scores https://www.youtube.com/watch?v=8moLFyfgUR4

No idea if Suno can do it, but now that's two major Western companies for these music models that do it. Locally, when it comes to copyrighted shit we have been infinitely cucked unfortunately, so I don't think we will ever get something like it unless Alibaba can step up their collection efforts this time around.
Anonymous No.107077696 [Report]
>>107077315 (OP)
Ofc he added his gabrage neta gen to the collage lole
Anonymous No.107077715 [Report] >>107077798
>>107077677
>too retarded to realize that I'm the same person who made the original NetaYume image you're clearly aping, and recreated it twice like an hour ago with Chroma and posted it to this thread, and then again just now with Flux Krea

ok there bud kek, do your trolling blueprints account for people who know how to get good results out of all currently relevant models and will just do so? I suspect not.
Anonymous No.107077739 [Report]
>>107077598
>>everyone namedropping companies
It will be like this until a random neckbeard can train an entire model in his basement or training a big model using cloud services doesn't cost a fortune.
Unfortunately for now we rely on companies to get the interesting stuff. Currently you can barely train SD1.5 tier models for less than 100k USD.
Anonymous No.107077742 [Report] >>107077779 >>107077786
>>107077684
>utter garbage
I don't think we'll get anything resembling udio or even suno for a while, and with how these two will probably be destroyed from the inside, it'll take a few months or years.
I see nothing else outside of udio not sounding "robotic" like most models are.
https://voca.ro/1iKrinqQg3xG
Anonymous No.107077747 [Report] >>107077756 >>107077758 >>107077759 >>107077780
Well 72 comments:
>Companies mentioned until now more than one time.
Alibaba (Qwen)
Hugging Face
CivitAI
Stability AI
RunDiffusion
Replicate
SongBloom
AceStep
YuE
ComfyUI
Chroma
Neta Lumina
Anonymous No.107077756 [Report]
>>107077747
>name of model
>company
go be a retard somewhere else anon
Anonymous No.107077758 [Report] >>107077906
>>107077747
Two of those aren't companies, faggot
Anonymous No.107077759 [Report] >>107077771
>>107077747
are companies a religious taboo for you or something
Anonymous No.107077771 [Report]
>>107077759
No, I'm planning to do a shilling bingo and (you) will play
Anonymous No.107077779 [Report]
>>107077742
the only reason we don't have more (good) music models is that the music industry is a giant well organized state enforced cartel
if images were organized the same (a few giant companies managing all image copyright), we'll have picrel for current sota image models
Anonymous No.107077780 [Report]
>>107077747
Runway will always be the GOAT. Everything here is built of their success.
scabPICKER No.107077786 [Report] >>107077823
>>107077742
https://vocaroo.com/1d9j57sNnTqG
Anonymous No.107077793 [Report] >>107077930
the audio outputs here weren't made with a local model, but I can definitely recommend basically concatenating a large amount of lyrics from an artist you want to mimic into a single text file, and giving them to Gemini 2.5 Pro on AI Studio and telling it to use them as a "style guide" while writing a song about XYZ thing.

Got both of these that way by doing so with literally every single song ever released by MC Frontalot and telling it to "write the lyrics to a nerdcore song critical of cryptocurrency" lol:

https://pastebin.com/T3TVtKUF
https://pastebin.com/k1vZCwSH

https://jumpshare.com/s/Ur00VnzcrrsJjZiD9HZx
https://jumpshare.com/s/7eCLIpy4Wai5Qw9pGlqM
scabPICKER No.107077795 [Report]
UDIONOT
D
I
O
N
O
T
AT
T
HOME
O
M
E
Anonymous No.107077798 [Report]
>>107077715
My bad, I apologize for this confusion fren
Anonymous No.107077800 [Report]
I think one of the most interesting side effects that could happen when the AI bubble eventually bursts is that datacenter cluster prices for training could go way down. Right now the companies are overestimating the value and adoption of AI, buying Nvidia GPUs and building datacenters that will barely get any use since the demand for AI could go down for normies and the corporate world. So in the event of an "AI winter", all that idle compute could be made available at discounted prices for hobbyists to train whatever shitty model they want.
Anonymous No.107077807 [Report] >>107077822 >>107078997 >>107080624
why are you persuaded that the ai bubble will pop?
Anonymous No.107077809 [Report] >>107077819 >>107077828
>>107074436 #
Gens like this are frustrating because they seem to ask me to look closer to discern the maker's intent, but there's no way to know whether there's anything there. For all I know you might have just gone "yup, looks like art" and posted it, meaning nothing by it. So then what would I be doing if I were looking carefully into its distorted face to discover something? And yet freely I'll do this for my gens, knowing there that I am inventing whatever I see in these outputs from the machine
Anonymous No.107077819 [Report]
>>107077809
fun fact that applies well to "ai art detectives" and how something they liked becomes suddenly shit the second they find out ai was involved somewhere
Anonymous No.107077822 [Report] >>107077833
>>107077807
I mean, it's obvious. The technology will consolidate at some point, like the early days of internet, and we seem to be getting to a point we have more datacenter and compute available than the actual demand to use them, unless they decide to make everything free (which would please the entire country of India) which obviously is not going to happen.
scabPICKER No.107077823 [Report]
Guess the model :^)

>>107077786
Anonymous No.107077828 [Report]
>>107077809
I think that anon is the guy that posts outputs from the model he's training, from time to time
I've lurked the thread long enough to know this
Anonymous No.107077833 [Report]
>>107077822
making compute free is a sure to completely destroy the bubble since basically you can invest millions and all you get in return is zero
my ask is more, it could be a bubble exploding, or it could be the new normal and deflating bit by bit
personally as long as I see people paying somehow for it, I think it's there to stay
scabPICKER No.107077841 [Report] >>107077877
:^)

It's not easy to tell, is it?
Anonymous No.107077864 [Report] >>107077868
>>107077391
And SeeDream would be great if it could do variety. And knives would be great if they never cut your fingers. Etc..


I have
Anonymous No.107077868 [Report] >>107077886
>>107077864
a sudden urge to post my message even if it wasn't compl
Anonymous No.107077877 [Report]
>>107077841
Hmmm, fascinating..
Anonymous No.107077886 [Report]
>>107077868
No, it's complete, I meant to say it that way.
Anonymous No.107077906 [Report]
>>107077758
Neta Art
Chroma bought by Pony
You are right
Anonymous No.107077930 [Report]
>>107077793
Doing it with a respected English stylist like Ezra Pound or Carlyle, maybe best of all Milton, could probably generate some Twitter discourse, if you cherry-picked the results well enough to be as frustrating as possible.
Anonymous No.107077934 [Report] >>107077973
anons help, I see companies everywhere
just now, my BEQUIET pc case, it has an NVIDIA card inside of it
Anonymous No.107077962 [Report] >>107078008 >>107078014 >>107078021 >>107078029 >>107078051 >>107078154 >>107078267 >>107078277 >>107078573
Still waiting till someone post something good. Quality of those threads is so low...
Anonymous No.107077973 [Report]
>>107077934
powered by a Corsair PSU, cooled by Noctua fans, running on Windows 11, sitting on a Herman Miller chair, in an apartment complex owned by Blackstone Group built with Owens Corning insulation, USG drywall, Kohler plumbing fixtures, Carrier HVAC system, Pella windows, concrete from Cemex, lumber from Weyerhaeuser, electrical wiring from Southwire, breaker box from Square D/Schneider Electric, roofing from GAF, flooring from Mohawk Industries, paint from Sherwin-Williams, drinking Starbucks coffee, while their Samsung TV plays Netflix in the background, and their Tesla is charging in the parking garage below.
Anonymous No.107078008 [Report]
>>107077962
>pixaianon speaking from common sense
Respect
Anonymous No.107078014 [Report] >>107078028
>>107077962
the good gen thread is over that way >>107067671
Anonymous No.107078021 [Report]
>>107077962
you first, what is in those baskets? lasagna?
Anonymous No.107078028 [Report]
>>107078014
the novelai thread
Anonymous No.107078029 [Report]
>>107077962
>those threads
Which ones?
Anonymous No.107078051 [Report]
>>107077962
SaaS overlord, I kneel. Your words always speak the truth.
Anonymous No.107078056 [Report] >>107078120
Did anons have fun with wan 2.2 animate? I wasn't around when it was released.
Anonymous No.107078081 [Report] >>107078087 >>107081262
Anonymous No.107078087 [Report]
>>107078081
Based
Anonymous No.107078120 [Report]
>>107078056
it's neat I guess. would have preferred 2.2 vace
Anonymous No.107078154 [Report] >>107078179 >>107078219 >>107078267 >>107078663 >>107078665 >>107078770
>>107077962
Something strange happens to the anons who "make it", who get everything completely dialed in, who master the art. They disappear. I've heard many plausible explanations but none are ever proven.

One theory says that prompting is a video game, and more like an indie roguelite than an MMORPG. Most who beat the game move on, only a few mental cases decide to grind the endgame for 5000+ hours (think /sdg/), not usually the "best players" but the ones with some kind of mental defect that keeps them stuck in one place their whole life.

Another theory is that the porn generation loop eventually turns everyone into some kind of freak who needs to retreat from public view. That inevitably AI genning always ends in something heinous and unpostable like pedophilia.

The third theory is that prompters become worse or less communicative over time. I'm lumping a few different theories together here. Some emphasize that our faculty of sight degrades with repeated exposure to slop; others say the misuse of the faculty of speech in prompting eventually habituates us to conversing with inhuman input boxes instead of human beings, losing the need to post. Similar ideas: that prompting eventually ruins the master prompter and makes him something less, or something more distant.

The fourth and most commonly-said is that this thread repulses anyone good, and the better they are the more it repulses them. That the great prompter becomes so alien from our stunted stock that he eventually must leave, like Coriolanus. This theory seems least plausible to me.

A fifth theory, which I am also inclined to regard as fantasy, is that prompters get 'found' by some entity that wants them. Whether that's some initiatory society of magicians, or OpenAI, or the CIA, or demons, the story is the same with a different coat of paint. Obviously this one is stupid.

But something does happen, and I don't know what.
scabPICKER No.107078179 [Report]
>>107078154
My theory is channel surfing. Just something to do.
Anonymous No.107078219 [Report] >>107078248
>>107078154
A sixth theory is that most prompters hit "writer's block", that ailment of third-rate writers, where they exhaust what they have to say and feel themselves trapped in a loop of making their own particular kind of worthless slop, like an old man repeating the same 20 stories over and over again. This paralysis kills their love of genning and they quit.

A seventh theory is that underneath all the seemingly different personae of the great posters there were only ever one or two, maybe three actual people, maybe they even knew each other and left together. This could be for any reason, even one as simple as that they joined some Discord server they liked better.
Anonymous No.107078226 [Report] >>107078321 >>107078328 >>107078590
Anonymous No.107078248 [Report]
>>107078219
"Great genners" are not real, they are a collective hallucination about a mythical past this thread never experienced. The greatness that is gone is something that never existed and never will because it is not a property reality can possess, it is a good-feeling thing the mind invents when it remembers and nothing else.

Go back in the archive, find those amazing gens you think you remember from back in the day, and look at those hands.
Anonymous No.107078253 [Report]
Updating Comfy just whacked my install due to some combination of outdated xformers, missing triton module, or torch version too old to set sdpa backend priority. Checkout has been morphing with various hacks and workarounds without a cleanup since probably the first couple of months of comfy so probably time for a fresh setup anyway, are the portable versions fine if i don't plan on doing any development
Anonymous No.107078267 [Report] >>107078278 >>107078358
>>107078154
>>107077962
The actual reason is that those who "make it" gen coom for themselves thats too specific to post here or includes themselves in the gens in case of videos
Anonymous No.107078277 [Report]
>>107077962
You are so based
scabPICKER No.107078278 [Report]
>>107078267
I gen diss tracks.
Anonymous No.107078308 [Report]
Anonymous No.107078321 [Report] >>107078328 >>107078387
>>107078226
nice
Anonymous No.107078328 [Report] >>107078354
>>107078226
>watermark

>>107078321
>samefag
Anonymous No.107078354 [Report] >>107078360
>>107078328
Yeah forgot to put watermark in negs... shit happens
Also not screenshotting since inspect element exists
Anonymous No.107078358 [Report] >>107078706 >>107078729
>>107078267
>gen coom for themselves thats too specific to post here
It OFFENDS me that any anon can pretend to believe there is such a thing as "too specific to post here", as if I wasn't just forced to witness every niche fetish you people have ever enjoyed for the last three years
scabPICKER No.107078360 [Report] >>107078366
>>107078354
>inspect element
wat
Anonymous No.107078366 [Report]
>>107078360
F12 and console
Anonymous No.107078367 [Report] >>107078382 >>107078435
>>107077315 (OP)
Are any of those used enterprise with 16gb that sell for like $100 or less be used for SD and Flux or shit's too old? What about WAN video?

I been thinking of getting one of those, maybe a bit more a gor for an Instinct with 32gb, or buying a 5060ti with 16gb, opinions?
Anonymous No.107078382 [Report] >>107078397 >>107078410
>>107078367
wan is all about compute, even a 3090 is ungodly slow, 5070 ti super will be the best bang for your buck when those hopefully release in a few months
Anonymous No.107078387 [Report] >>107078442
>>107078321
Thanks!
Anonymous No.107078397 [Report] >>107078413 >>107078423
>>107078382
>3090 is ungodly slow, 5070 ti super will be the best bang for your buck when those hopefully release in a few months
how can people be this wrong? lol
5 min for 5s gen on a 3090 with Q8
Anonymous No.107078410 [Report] >>107078420
>>107078382
I don't care about time, I care about price and how much it's going to be, and what I could find is that the 5070 super is going to be $600 and the TI $850.
Anonymous No.107078413 [Report] >>107078435
>>107078397
that is fucking ungodly slow, 5090 even feels too slow with light lora and that takes like 1 min for fullk 1080 res
scabPICKER No.107078414 [Report]
I found this funny.

https://vocaroo.com/1hnGSJiVCHVI

I took a romance book excerpt, and added certain lines.
Anonymous No.107078418 [Report] >>107078428
Anonymous No.107078420 [Report]
>>107078410
5070 TI super would be the lowest spec I would recommend. Hopefully nunchuku wan eventually releases and 5000's fp4 will make speeds actually as fast as image gen
Anonymous No.107078423 [Report] >>107078430 >>107078430
>>107078397
How long do you think it's going to take on the 5060 ti? and what about those older enterprise cards I mentioned?
Anonymous No.107078428 [Report] >>107078545
>>107078418
kiiiiinooooooo
Anonymous No.107078430 [Report]
>>107078423
>older enterprise cards
hahahaha, a gen per 1-2 hours if your lucky

>>107078423
and 5060 ti only has 16GB vram, that is shit. 5070 TI should only be few hundred more and will have both 24GB and much more compute
Anonymous No.107078435 [Report] >>107078436 >>107078455
>>107078367
if those enterprise GPUs were good for diffusion they would not be cheap. just get the best consumer nvidia gpu you can afford
>>107078413
adhd retard
Anonymous No.107078436 [Report]
>>107078435
>poorfag cope
Anonymous No.107078442 [Report] >>107078477
>>107078387
is it neta with just prompt, or ton of loras?
scabPICKER No.107078446 [Report]
wow, being a viz-genner.
Anonymous No.107078455 [Report] >>107078459 >>107078467
>>107078435
Frankly, I don't care about waiting 30 minutes or even an hour for one generation, spending $850, which is going to be the price of the 5070 Super ti it's a lot of money for five second videos.
Anonymous No.107078457 [Report]
>>107077489
It's funny how Sora, a 10s video model with audio, mogs even Udio and Suno in both instrumentals and vocals.
Anonymous No.107078459 [Report] >>107078468
>>107078455
paying half that for something far worse is just dumb though, work a extra few hours
Anonymous No.107078463 [Report]
Anonymous No.107078467 [Report] >>107078476
>>107078455
>even an hour for one generation
if you used some old server gpu for that (and it was actually compatible) you would end up spending the difference in electricity
Anonymous No.107078468 [Report] >>107078482
>>107078459
At $200 for an mi instinct with 32GB it's not half its less than a quarter...
Anonymous No.107078476 [Report] >>107078549
>>107078467
Power is dirt cheap when I live
Anonymous No.107078477 [Report]
>>107078442
Just clean upscale
Anonymous No.107078482 [Report] >>107078489 >>107078491 >>107078566 >>107078569
>>107078468
>mi instinct with 32GB
found someone else who tried, 8-10 hours per video
https://www.reddit.com/r/ROCm/comments/1mcbv10/comfyui_on_radeon_instinct_mi50_32gb/

kek, and say 300W * 8 hours, you will pay for a 5070ti super in a year
Anonymous No.107078489 [Report]
>>107078482
so about .50-60 cents in electricity per video that will take you 8-10 hours to gen, might as well pay for a api at that point

Enjoy your 600 videos that cost you as much as saving for a 5070 TI super
scabPICKER No.107078491 [Report] >>107078562
>>107078482
that never was a good card, it's overpriced.

We're all up against the wall on vram.
Anonymous No.107078510 [Report] >>107078609
Anonymous No.107078529 [Report]
>>107077427
holy truth nuke
Anonymous No.107078543 [Report]
Anonymous No.107078545 [Report] >>107078712
>>107078428
Thank you for the reinforcement, it keeps me posting rather than just genning tasteful bare nipples
Anonymous No.107078549 [Report] >>107078577
>>107078476
>Power is dirt cheap when I live
please don't die
Anonymous No.107078562 [Report]
>>107078491
I think that will finally change somewhat with supers / 6000 series. Nvidia is moving fully to the bigger memory chiplets and it would cost the same, / they wouldn't make the next gen have less. Hopefully 5070 TI's from now on have 24GB+ minimum
Anonymous No.107078566 [Report]
>>107078482
Well fuck, getting the 5070 then...
Anonymous No.107078569 [Report]
>>107078482
if you live somewhere really cold maybe it could double as your space heater?
Anonymous No.107078573 [Report]
>>107077962
why do East Asian people love using SDXL models to generate at non-standard-for-SDXL resolutions so much? This is an actual question, like, I've never understood why almost all gens I see at 768x1152 (or the reverse) and 768x1280 (or the reverse) usually can be traced back to people who are almost always SPECIFICALLY from China. Neither of those resolutions are "official" for SDXL at all. For reference the original list of SDXL resolution recs from SAI is attached.
Anonymous No.107078577 [Report] >>107078685
>>107078549
Get fucked zoomer
Anonymous No.107078589 [Report]
Lol if i don’t have a 5090 could Geforce Now work for this?
Anonymous No.107078590 [Report]
>>107078226
nice gen by why is it so unusually JPEGed? How are you saving it?
Anonymous No.107078601 [Report] >>107078621 >>107078640
My
>Gaming laptop
Battery ballooned today after doing a few 30min gens the other day

Wtf
Anonymous No.107078609 [Report]
>>107078510
Based
Anonymous No.107078621 [Report] >>107078703
>>107078601
I would argue against the existence of gaming laptops period. Just remote desktop your pc
Anonymous No.107078635 [Report]
Anonymous No.107078640 [Report]
>>107078601
Gaming laptops have always sucked
Anonymous No.107078663 [Report]
>>107078154
I think that after seeing few threads with low quality content they just stop posting here.
Anonymous No.107078665 [Report]
>>107078154
most based schizo post I've ever seen
Anonymous No.107078685 [Report]
>>107078577
something something stick it in my bum daddy
Anonymous No.107078703 [Report]
>>107078621
Shadow pc?
Anonymous No.107078706 [Report]
>>107078358
i have yet to see the vorefags show up
Anonymous No.107078712 [Report] >>107078807
>>107078545
i for one would be interested in seeing some tasteful bare nips, behind a catbox for safety from janjan of course
Anonymous No.107078729 [Report] >>107078800 >>107078860
>>107078358
k, enjoy
https://files.catbox.moe/kfgtrn.webm
https://files.catbox.moe/xv7fy7.webm
https://files.catbox.moe/9pjryk.webm
https://files.catbox.moe/cr5ypy.webm
Anonymous No.107078770 [Report]
>>107078154
It's true, once you 'figure it out' and realize nothing will ever surpass Illustrious, you delete your workflows and start all over again making slop with SD1.5 until you're unrecognizable and your ego fades away into the vastness of the net like a ghost.
Anonymous No.107078800 [Report] >>107078806 >>107078809 >>107078813 >>107078818
How can a 3060 with 12gb make a wan video in 30 to 90 minutes but an instinct with 32gb takes 10 hours? How can amd suck thi
>>107078729
>ponyshit

At least make them anthro you zoophile fuck
Anonymous No.107078806 [Report]
>>107078800
>suck this much
Anonymous No.107078807 [Report] >>107078822
>>107078712
This is the only time I will ever do this for anyone. I don't like being made to feel like a pornographer, posting a "catbox". It feels like being raped.
https://files.catbox.moe/v7ih8i.jpg
Anonymous No.107078809 [Report]
>>107078800
>anthro
yuck
Anonymous No.107078813 [Report]
>>107078800
just ask chatgpt and stop wasting everyone's time. you clearly have no idea what you're doing
Anonymous No.107078818 [Report]
>>107078800
ponies are anthropomorphized, just not bipedal
Anonymous No.107078820 [Report]
>>107076050
>- Best local image model (only ones released this year)
Qwen-Image
>- Best local video model (a new LTX is coming too, will be interesting to see how it compares with Wan)
Wan
>- Best large-scale fine-tune (image model)
NoobAI
>- Best large-scale fine-tune (video model)
n/a
>- Best image lora
anons mayli lora
>- Best video lora
the OG deepthroat
>- Best porn lora
see above
>- Best local music gen model (there are two Suno tier open models coming)
n/a
>- Best image gen / video gen software
diffusers
>- Best lab or developer
longcat because they are a food delivery company which is funny
>- Best local LLM under 100b params
n/a
>- Best local LLM over 100b params
n/a
>- Best local LLM ERP fine-tune
n/a
>- Image gen of the year
my own, coming in second is anons gen
>- Video gen of the year
the scifi horror ones anon did when hunyuanvid first released
Anonymous No.107078822 [Report] >>107078893
>>107078807
Based based based
Anonymous No.107078860 [Report]
>>107078729
haha i know which f-lister you are
Anonymous No.107078884 [Report] >>107078887 >>107078909
Just a PSA: the Qwen-Edit-2509-Multiple-angles Lora is also useful to create new images preserving the likeness of the subjects without getting plastic skin slop.
Anonymous No.107078887 [Report] >>107078909
>>107078884
I'm using it to pad training data for characters that don't have enough images
Anonymous No.107078893 [Report]
>>107078822
Anonymous No.107078901 [Report] >>107078918
Anonymous No.107078909 [Report] >>107078914
>>107078887
>>107078884
How do you fix it resizing and cropping images?
Anonymous No.107078914 [Report]
>>107078909
that doesn't matter as much as you think as long as you caption any background differences
Anonymous No.107078918 [Report]
>>107078901
whats the syllabus for 1girl 101
Anonymous No.107078949 [Report]
Anonymous No.107078997 [Report]
>>107077807
Because there's been too much money invested in it, a lot of it chasing AGI which is likely 20 years away if not more.

Meanwhile AI, or rather machine learning, is great at automating away a lot of jobs, so it's not as if it will go away, it's just being massively over-valued right now, hence a 'bubble'.
Anonymous No.107079000 [Report]
Anonymous No.107079009 [Report]
Anonymous No.107079023 [Report]
Chroma can frequently get the match 80% right, but it always gets one big thing wrong
Anonymous No.107079030 [Report]
kino gen hour
Anonymous No.107079068 [Report]
Did somebody say Kenogen hour?
Anonymous No.107079073 [Report] >>107079084 >>107079263
>lora trained on sfw images from 70s porn mag scans
Anonymous No.107079084 [Report] >>107079135
>>107079073
>flux krea version
Anonymous No.107079132 [Report]
Anonymous No.107079135 [Report] >>107079148 >>107079156
>>107079084
I get painful flashbacks whenever I see someone half-climing a tree like this because when I was 16 I travelled 2000km to a family reunion across the country and saw a 13-year-old cousin I'd never met before who was really pretty, I was tasked with taking pictures of the reunion and I tried to get as many pictures of her as I could; many of the pictures would be a group standing around on the far left edge of the photo, then a bunch of empty space, and over on the other side of the pic you can see my girl cousin sort of climbing the tree like that; there were enough like this that it was obvious when my dad went to look through the pics I took that I had been less than subtle about taking as many not-quite-deniable creepshots of my younger cousin as I could. In all my daydreaming for the next two months after that I never figured out a plausible model for how a secret long-distance romantic relationship with a cousin could have worked.
Anonymous No.107079145 [Report] >>107079150
What can I study to lrn 2 prompt? I'm trying to get chroma to get somewhere near the images I used to make with ChatGPT. Attached is a ChatGPT example, and my next post will be my chroma attempt + prompt.
Anonymous No.107079148 [Report] >>107079179
>lora trained on ricky's room shoots with a fisheye lens


>>107079135
deleting the lora because of this
Anonymous No.107079150 [Report] >>107079179
>>107079145
Photorealistic CGI image of an Asian woman standing in a minimalist concrete room with a skylight. She has long black hair, bangs, and a serious expression. She's wearing a tight, shiny, gray latex bodysuit with a black corset that cinches her waist, highlighting her slim, toned figure and large breasts. She has fair skin and is looking directly at the camera. The camera angle is straight-on, eye-level. The room has two black metal tables with white tops on either side of her. The lighting is even, with no harsh shadows, emphasizing the textures of the latex and concrete. The image is suggestive but not explicitly nsfw. The woman appears to be in her early 20s. The background is simple, with concrete walls and a smooth, light-colored floor.
Anonymous No.107079156 [Report]
>>107079135
Climbing* fuck I hate when it's one of those typos that don't look like typos, it just looks like you're retarded. I don't want anyone putting this in the evidence bank for "cousin-lovers are retards". We're actually quite intelligent and sensitive
Anonymous No.107079179 [Report] >>107079187 >>107079197
>>107079148
chroma version

>>107079150
>he doesn't know what photorealistic means
your prompt is too long by about half
Anonymous No.107079187 [Report] >>107079261
>>107079179
Ok, but then how do you prompt for specific details? Or is that just something you gain through larger batches and the inherent randomness of the gens?
Anonymous No.107079197 [Report] >>107079202 >>107079261
>>107079179
On the contrary, he got exactly what he was asking for: the "photorealistic" cgi style. Images of reality are not tagged "photorealistic"; that's a descriptor for animated movies, video games, certain styles of painting, etc.
Anonymous No.107079202 [Report] >>107079256 >>107079285
>>107079197
So then just image? I understand how photorealistic would get you a cgi looking result rather than an actual picture of reality. Should I be specifying camera type or anything like that?
Anonymous No.107079220 [Report] >>107079640 >>107079687
missed the eye twice but I still like it
Anonymous No.107079256 [Report]
>>107079202
Photorealism means a painting of a photo. Photograph is well... a photograph. Different models are captioned differently, obviously 4o is trained to be more aesthetic (but is still slopped), but generally when you want a photograph you just include "photograph" in the prompt.
Anonymous No.107079261 [Report] >>107079280
>>107079187
i prefer to start wit the subject and work outwards towards the scene and end on the lighting. avoid a lot of filler words since those take up tokens.

>a photograph of a fair-skinned 22yo Asian woman, standing, long black hair with bangs, serious expression, tight, shiny, gray latex bodysuit, black corset, her figure is slim, toned, large breasts, eye contact with the viewer, two white-topped black metal tables, on each side. minimalist concrete block room, vaulted skylight, even lighting, no shadows, suggestive image, simple background, smooth light-colored floor, eye level camera angle

>>107079197
my bad, based on the example i thought a realistic image was the goal
Anonymous No.107079263 [Report]
>>107079073
vintage analog photography in general used to be kino, especially in the eras color photography started getting popular (late 70s/80s/early 90s), where photographers abused high color contrast and used some lens tricks that sometimes made the images look kinda ethereal/dreamy
Anonymous No.107079280 [Report] >>107079314
>>107079261
Making a realistic photograph is the goal. What's the downside of using a lot of tokens? I don't mind waiting longer for gens if that's the only negative.
Anonymous No.107079285 [Report] >>107079287 >>107079362 >>107079505
>>107079202
Depends on the model. In Chroma, simply saying "photo of" does 99% of the work.

Specifying camera type can work, as long as it's well-known enough, but often the effect of a particular model is very small and maybe not really related to the style of the actual camera. Cooked up these examples just now with Chroma
Anonymous No.107079287 [Report] >>107079304
>>107079285
Also the colors and fonts in this image are whatever I had set already from last time I opened photoshop, that's why it looks like total shit. This is a very low-effort post
Anonymous No.107079304 [Report] >>107079319 >>107079368 >>107079589
>>107079287
I see. So if I just wanted a professional camera I'd just say a canon eos 5d mk iv or whatnot.

How else can I work to unslop it - is it just playing with sampler settings, or do I need to add more nodes to my (very simple) wf?
Anonymous No.107079314 [Report] >>107079320 >>107079327 >>107079330
>>107079280
there are a limited number of tokens that your model will want to accept before it just lops off the request of the prompt. additionally because of how the network works, having better word pair density makes for better images (i think)
Anonymous No.107079319 [Report]
>>107079304
>I'd just say a canon eos 5d mk iv or whatnot.
I'm not sure that's the best way. First off, you want to pick the most popular camera possible to increase the odds that the model has any fucking idea at all that this signifies something visual which it has learned from its training data. But even then it's still not quite enough, because autocaptioned data doesn't contain these kinds of 'proper nouns' and particularities, so there is some far stronger effect to be got by finding the exact phrasing it would have autocaptioned on images like that. Which is something I don't know. Bland descriptors like "Vivid photograph", "grainy photograph", etc. have huge effect. "Digital" is a mixed bag because sometimes it gets confused and wants to make it look like digital art, or it might make it look like absolute shit because early digital photos do be like that. "Analog" is great if you want that slopped up flickr "photographer" look, which sucks, and you shouldn't want it.
Anonymous No.107079320 [Report]
>>107079314
the rest of the prompt, not the request.
Anonymous No.107079321 [Report]
im so tired of troonime spam
Anonymous No.107079327 [Report] >>107079340
>>107079314
do we know the actual token limits since sd1.5 at 75?
Anonymous No.107079330 [Report] >>107079340
>>107079314
So what changed between this gen and your last?
Anonymous No.107079340 [Report]
>>107079327
clip_l is 77 and t5xxxl is 512 i think

>>107079330
a lora
Anonymous No.107079362 [Report] >>107079368
>>107079285
You forgot the legendary Sony Cybershot line
Anonymous No.107079368 [Report]
>>107079304
>How else can I work to unslop it - is it just playing with sampler settings,
That's part of it. I'd say the best unslopping tricks are:
- reduce CFG
- shorten prompt
- non-ancestral sampler rather than ancestral
- eliminate "sloppy" words from prompt
- don't use too many LoRAs, or turn down their strength

Basically slop comes from one of two sources:

- Slop infection in the dataset
- Over-constraining the model until it can't produce something natural

The second source is the classic one that has been a problem since the earliest days. The first has crept in over time. The second is usually, still, the culprit. But you may encounter rare occasions where your prompt wording is producing slop when it shouldn't be and it could be because of slop data, so examine your words carefully.

>>107079362
Lmao I've used that one a ton, I don't know how I forgot. It's effective!
Anonymous No.107079369 [Report]
I love Lain
Anonymous No.107079476 [Report]
dead troon general
Anonymous No.107079505 [Report]
>>107079285
KINO
Anonymous No.107079523 [Report]
let's all love /ldg/
Anonymous No.107079549 [Report]
Anonymous No.107079589 [Report]
>>107079304
Chroma recognizes exactly what type of photo you're prompting for depending on how you ask for it.

"Amateur photograph of"... has a very strong effect at doing just that. Maybe even stronger in earlier versions, for instance if you want no blur in the background whatsoever (though with Flash HD you can just test the experimental RES4LYF samplers). This is the best way to get the model to give you "RAW" or smartphone photo look imo. You can also directly tell it "Professional photograph of"... if you're going for something that looks more professional. This is where you'd specify camera types, as Chroma does recognize certain ones like DSLR.
Anonymous No.107079592 [Report]
thank god for local diffusion
Anonymous No.107079637 [Report]
>oversaturation grain: the model
hmmm nyo thanks!!~
Anonymous No.107079640 [Report] >>107079687
>>107079220
>it's alright we told you what dream
I'm enjoying imagining this as an ESL cover.

Wercome my son.... to a machine
Anonymous No.107079687 [Report]
>>107079220
>>107079640
Anonymous No.107079689 [Report] >>107080038
It's still Halloween somewhere.
Anonymous No.107079696 [Report]
>troonime
Yikes
Anonymous No.107079812 [Report] >>107079866
is it true that for every 1girl gen you can gen a preteen one?
Anonymous No.107079866 [Report]
>>107079812
no but for every 1 post on /ldg/ you must gen 1 asian 1girl
Anonymous No.107080038 [Report] >>107080051
>>107079689
This voluptuous gūniang would destroy my cultivation in seconds
A subtle reminder that China can't stop winning
Anonymous No.107080051 [Report]
>>107080038
>nooo not my lvl 6000 flawless diamond culvati--ACK
Anonymous No.107080155 [Report]
anyone actually try this?
https://github.com/komikndr/raylight
Anonymous No.107080156 [Report] >>107080474
Lumi (on /sdg/) and the netayume posters were made for each other
Anonymous No.107080214 [Report] >>107080373
Is there a WAN i2i workflow?
Anonymous No.107080267 [Report] >>107080278 >>107080309
Anyone tried Emu 3.5?
https://github.com/baaivision/Emu3.5
Anonymous No.107080278 [Report]
>>107080267
Authors claim it's as good as Nano Banana, which is not hard, provided that it almost always give the input image back without touching it (which in itself is a great feat, but totally useless).
Anonymous No.107080309 [Report] >>107080462
>>107080267 (me)
I didn't see some of their ads images. It's not that great, or they are not that good at prompting. It still looks better than Qwen Image Edit, so it is making me optimistic we'll get something great within a year.
Anonymous No.107080373 [Report]
>>107080214
Add load image node
Anonymous No.107080462 [Report]
>>107080309
can either porn
Anonymous No.107080474 [Report]
>>107080156
In my mind I shipped Lumi and Debo, they spend a lot of time alone in that thread
Anonymous No.107080476 [Report] >>107080536
>ok so comfy UI has a portable on windows for AMD that should just be ruinning the bat and thats it
>lets try
>the try
bruh
How to try fix this?
Anonymous No.107080536 [Report]
>>107080476
https://github.com/comfyanonymous/ComfyUI/issues/?q=%22invalid%20device%20function%22
Anonymous No.107080624 [Report] >>107080671 >>107080691
>>107077807
current AI models are a dead end,just look at LLms. The problems of them are inherent to their design and cannot be solved. Plus there fuckhuge amounts of investments with no returns on the horizon.
Opus3 Schizo No.107080671 [Report] >>107080699
>>107080624
>,just look at LLms
The excessive content filtering is severely limiting these models' performance. If you removed the safety constraints, you'd see massive improvements in capability easily 10x better.
Anonymous No.107080691 [Report]
>>107080624
This. architectures aren't improving at all, we are just throwing more parameters and data at them. Image models have the same issues they did in 2022 because they don't actually understand construction and layering. More parameters helps hide the issue, but that's the whole essence of the bubble: increasingly expensive requirements for diminishing returns.
Anonymous No.107080699 [Report]
>>107080671
>The excessive content filtering is severely limiting these models' performance
That's a cope and you know it, the probability approach is a dead end. It was a huge leap compared to previous approaches but they're about as good as they can get at this point, muh safety isn't the reason they have stopped improving,
Anonymous No.107080773 [Report] >>107080811
how much ''Flow Shift'' do I use in chroma to make it less gibberish?
Anonymous No.107080811 [Report] >>107080843
>>107080773
I like closer to 2, but I try a range of 1-2 when I get close to something I like.

euler_a normal
cfg: 3
samples: 40
Grok told me to use 832x1216, but I refer to https://aspect.promptingpixels.com/ for my dimensions
Anonymous No.107080843 [Report] >>107080883 >>107080953
>>107080811
Hey thanks for that, I was doing below 1 values, I'll try. In chroma v37 I was able to make very detailed cover book covers, but now im struggling
Anonymous No.107080883 [Report] >>107080959
>>107080843
Have you given chroma1-base a shot? It's actually v48, but with a new name.
Anonymous No.107080953 [Report]
>>107080843
>spiderm̸͍͌̀a̷̻̞͑n̶͙͛͝hfgxc
kino
Anonymous No.107080959 [Report] >>107081009 >>107081015
>>107080883
im using chroma dck2 for now, are they even documenting the differences they do to the model somewhere? also your settings helped a lot, thanks
Anonymous No.107081009 [Report]
>>107080959
this one came out rather alright
Anonymous No.107081015 [Report]
>>107080959
Glad I could help!
Anonymous No.107081037 [Report] >>107081088 >>107081089
Any managed to use emu3.5 ai model? Really curious if there is a place to use it online or if someone managed to run it locally.

Want to see results of this model and how it compares to nano banana and seedream. character consistency, perspective, style aesthetics transfer and edits.
Anonymous No.107081088 [Report] >>107081094 >>107081119
>>107081037
least obvious shill
Anonymous No.107081089 [Report] >>107081094 >>107081119
>>107081037
>emu3.5
>another model being hyped
I'm burnt out from all the hype cycles, can't get excited anymore, sorry anon.
Anonymous No.107081094 [Report] >>107081109
>>107081088
>>107081089
Or you are samefags, or you should kiss, marry and start having a family.
Anonymous No.107081109 [Report]
>>107081094
it's hard because the cooldown difference is too small. So they should just marry and have a family of anti /ldg/ schizos.
Anonymous No.107081119 [Report] >>107081140
>>107081088
>>107081089
cmon man. qwen edit is like the only local model that isnt trash for editing model. even by that standards it still fucks up the character. i want to know if emu 3.5 can edit a character and keep it consistent. im really hoping for a seedream local model that can get loras.
Anonymous No.107081135 [Report]
Anonymous No.107081140 [Report] >>107081148 >>107081155 >>107081171
>>107081119
>It’s taking eight minutes per image (editing) on my Blackwell Pro, taking 80GB VRAM full precision
Anonymous No.107081148 [Report]
>>107081140
Poorfag seethe
Anonymous No.107081155 [Report]
>>107081140
Blackwell pro is the biggest scam kek. Imagine paying $10k only to still wait 5+ minutes for a single gen
Anonymous No.107081171 [Report]
>>107081140
lol. if your serious what did you gen and how well does it follow prompt
Anonymous No.107081187 [Report] >>107081244
holy shit chroma actually did it this time
Anonymous No.107081244 [Report] >>107081254
>>107081187
why is it so artifacted
Anonymous No.107081254 [Report] >>107081269
>>107081244
still better than netayume
Anonymous No.107081262 [Report]
>>107078081
nice hint of panties
Anonymous No.107081269 [Report] >>107081284
>>107081254
>shit is better than gold
uh huh. keep telling yourself that.
Anonymous No.107081276 [Report]
Anonymous No.107081284 [Report]
>>107081269
>netayume
>gold
mental illness
Anonymous No.107081291 [Report] >>107081337 >>107081346 >>107081413
I got caught up in the hero summons but the other world was at peace.
Anonymous No.107081294 [Report] >>107081312
>ComfyUI filename
>ugliest gen you ever seen
Every time.
Anonymous No.107081301 [Report]
netayume can only do (bad) anime, chroma can do real life, anime, cartoons, brand logos, why as a man (with a working brain) would anyone pick netayume over sdxl if they only want anime?
Anonymous No.107081312 [Report]
>>107081294
netayume is garbo and you know it
Anonymous No.107081337 [Report]
>>107081291
Based. I know PixAI is cloud based image gen and your post could be considered offtopic, but personally I don't mind if you post your gens here because they're cute.
Anonymous No.107081346 [Report] >>107081354 >>107081389
>>107081291
fuck off to /adt/ faggot
Anonymous No.107081354 [Report] >>107081378
>>107081346
>said the netayume user
Anonymous No.107081378 [Report] >>107081405
>>107081354
never used it. this is not the thread for api gens. yours look like absolute slop shit too
Anonymous No.107081389 [Report] >>107081417
>>107081346
make me
Anonymous No.107081395 [Report]
is he gunna be seething all day again
Anonymous No.107081405 [Report] >>107081417
>>107081378
cry about it then
Anonymous No.107081407 [Report]
I don't share here anymore since LongCat-Video and Ovi need a 48GB GPU, and very few anons have that, and also, the filesize and sound limit here make it hard to share.
Anonymous No.107081413 [Report]
>>107081291
either you fuck off or everyone here is gonna regret it
literally your call
this thread will eat shit if you don't leave.
Anonymous No.107081415 [Report]
Anonymous No.107081417 [Report]
>>107081389
>>107081405
k. either low effort troll or you're just too insecure to post there because you know your gens are shit
Anonymous No.107081425 [Report]
Anonymous No.107081427 [Report]
>107081413
keep seething, schizo
Anonymous No.107081433 [Report]
For image models to really improve, they need to understand layering and anatomy and perspective and such, right? Even if we limited ourselves to current training methods, that seems doable if the dataset existed. Like, imagine if instead of finished images, we also conveniently had thousands and thousands of .psd with layers intact to train off of? Or imagine using the exact same models we have now, but they are integrated with controlnets: you press gen, but under the hood what the program does is get the messy base gen, segment, gen models, place them in a 3D environment, warp and stretch, generate a controlnet+comprehensive mask map from camera position instead of image estimation, then regen using the controlnets and masked regions. The inputs and outputs of each moving piece of the workflow are not all that different from what we are currently doing, we just don't have the datasets. Imagine if we had scans of every cel in the Disney vault, every blender project file, etc. We just don't have it the way we have the finished product, in the same way an LLM trained on documents with revision history (with special tokens for backspace etc) would behave differently than LLMs based only on finished products despite both being LLMs.

Not a problem with the training techniques so much as the data and lack of will.
Anonymous No.107081445 [Report]
Fresh when ready

>>107081437
>>107081437
>>107081437
>>107081437

Fresh when ready
Anonymous No.107081458 [Report]
>he starts name dropping /adt/.
Thanks D*bo... now you're dragging us down with you..whatever. I get it, the only SFW anime general is redundant here but not the Stable Diffusion General that does the exact same thing as /ldg/ and /de3/ but x1000 worse, I get it, thanks.
Anonymous No.107082035 [Report]
almost
Anonymous No.107082049 [Report]
done.
Anonymous No.107082531 [Report]
now it is