Discussion of Free and Open Source Text-to-Image/Video Models
Prev:
>>106210147https://rentry.org/ldg-lazy-getting-started-guide
>UIComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.com
https://civitaiarchive.com/
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows
>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe
>WanXhttps://github.com/Wan-Video
2.1: https://rentry.org/wan21kjguide
2.2: https://rentry.org/wan22ldgguide
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y
>Chromahttps://huggingface.co/lodestones/Chroma1-HD/tree/main
Training: https://rentry.org/mvu52t46
>Illustrious1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/
>MiscLocal Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage
>Neighbourshttps://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg>>>/b/degen>>>/b/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
Blessed thread of frenship
qwen is based
chroma is trash
>>106213906oh wait, that's the other one. my bad.
>>106213906Those are really cool
Do you have a hint on how to make characters consistently shut up / keep their mouths closed? Like i gave up making dance videos because characters always talk like there's no tomorrow
>>106213893qwen cant into realism
>>106213935could prompt "their mouth is closed" to ensure they don't talk, otherwise the model decides
>>106213947https://huggingface.co/flymy-ai/qwen-image-realism-lora
>>106213947This can be fixed with loras, it's REALLY huge and slow though, even with speedup loras it will still be slow, and it needs high resolutions to look good which makes things even slower.
does riflexrope work with wan 2.2?
I tried it and basically got a dull painting as a result
is there a nsfw focused guide for WAN genning? how tricky is it? i kind of want to get a 5090 for it, but i heard the base model was censored and needs a bunch of workarounds that only work 10% of the time, or something
comfy should be dragged out on the street and shot
>https://github.com/cocktailpeanut/fluxgym
cant this be adapted for chroma? 12gb and 16gb lora training would be rad
>>106213436>the girl hears a wolf howlYou are an insane retard.
Another stupid question. Does the method of tagging my samples differ when doing the diffusion-pipe method? How exactly is one to tag their data when doing this
>>106214025Go to civitai, enable NSFW and download NSFW loras, they describe how to prompt.
You always roll the dice when genning, but the most popular Wan NSFW loras work well, probably the reason they are popular.
>>106214080It uses a text file of the same name as the image it describes.
So if you have blabla_01.jpg, add a textfile named blabla_01.txt and write the caption for that image in that text file.
>>106213949Just use the 2.1 lightx2v loras, increase the length and learn to prompt
>>106214137Non, I get that anon, I mean like the style of tagging. I remember that it was something like (for training style LORAs) "do not describe style elements at all, just the subject". Is that correct?
>>106214025You can make legendary fapbait with wan 2.2 base as long as you set low expectations when genning genitals. i2v is less affected by this problem
>>106214152are you saying I need to use 2.1 lightx2v do use riflexrope, or I don't need riflexrope to crank the length to 121/129?
>>106214160I've tried both approaches both for Flux and Chroma which are trained on ai generated descriptive text, and I've seen no difference in quality when generating by training on captions like 'a beautiful blonde woman wearing a polka dot bikini and sunglasses on a sunny beach' or something tag based like 'beautiful blonde, woman, polka dot bikini, sunglasses, sunny beach'.
As long as you prompt accordingly, they work equally well in my experience, so basically, caption the way you want to prompt.
>>106213893so far from my testing:
>qwengood at prompt adherence
can understand complex prompt and geometry
can do reflection and mirroring correctly
easier to control camera angle
>chromav50 more color vibrant to v48
many fetish built-in
detailed textures
can work with flux ecosystem like redux
I think chroma might end up in the sdxl situation when sdxl and flux both came out
>>106213893Based
DELETE CHROMA FROM ANCHOR
No matter how the dev shills it here with shit examples without prompt examples, I will not use your shitty, shady thing.
can you go above 720p when genning t2i using wan2.2?
>>106213893qwen can't nsfw
>>106214280nogen still seething
>>106214257Who the hell tests a model so much and makes comparisons on MSpaint and shiฤบ it here without shiwi g specific prompts if not a faggot and loser dev?
>>106214287Any qwen nsfw lora already out?
is this block swap node all i need to do to utilize my RAM as well as VRAM? i have 24gb VRAM and 32gb RAM and i am able to create at most 140 frames. are there any technique i can try to get more frames with the hardware i have? i dont care if it doubles gen times. i know there is a bit of looping when you try to gen longer videos
>>106213893>>106214287Qwen can nsfw there at least 100 loras for that and checkpoints and has 2 week old.
Kill yourself chroma tard
How many loras or anything did your 1 year old """community"""?
>>106214327>Qwen can nsfw there at least 100 loras for thatWhere? On civitai?
>>106211319>>106214312>obvious synthslop lorakek
>>106211319Thanks, I will now download qwen.
>>106214327This is an English board, Rajnesh. If you can't fluently write in English then I'm afraid you'll have to leave.
>>106213949I used normal wan2.2 until 7s/8s without any looping issue. I think it's less of an issue vs wan2.1.
Could someone help me out?
I would like to generate video, in terms of software I know how to do that, but basically I don't have high enough gpu, at only 7.6gb.
Someone suggested I can offload it to dram, of which I have 16GB.
I'm assuming my computer is just too shit, but If this is an option, how would I do this?
>>106214370Generate less frames, for example 33 (2s@16fps) and generate at lower resolutions.
>>106213893im a chroma naysayer but qwen has a bunch of issues too, there really is no good model
>>106214327>Qwen can nsfw there at least 100 lorasbullshit, it doesn't have its own category on civit
>>106214257It's proven qwen outperforms chroma right here
>>106203690That chroma anon faded out like a loser lmao
>>106213947>qwen cant into realism>>106213976>>106213981>immediate lora copelol lmao even
Chroma sucks ass but it's funny that we have multiple schizos here that sperg out any time you mention it. I think I'll make a lot of Chroma gens and keep posting them here.
>thirdie qwen shills are obsessed with windows
there's a satisfying symmetry here
>>106213893Qwen outputs are slopped, noisy and fingerprinted compared to Chroma. Puts even older versions of Chroma to shame. They transferred the fingerprint over from the synthetic slop of Seedream 3.0 and 4o which they used in training.
>>106214436Bro either start posting a model thats more realistic than chroma out of the box or just say you're a vramlet that cant even run it already.
Kid literally built his identity around liking or hating a particular model instead of just using it as a tool. Keeps seething with 0 images posted every time lmao.
>>106214478ok
whatever your fingerprint nonsense means
>>106214478>do color correction video that isn't even visible to the eye>do pixel morph that isn't even visible to the eyefingerprint gone
I just got into lora training, did a crude and quick one and holy shit it seems promising.
I want to train one to generate photorealistic stuff of my ex based on pics + videos(that I will extract snapshots of) OC that I have of her.
Any tips to optimize this and extract the best from the data?
>>106214505A slopped image is not just slop, it is a fingerprint. Aside from that, if you zoom in, you see a very distinct pattern of noise. With 4o based images, it is visible with the naked eye.
I've just installed reForged today and I've started genning for the first time. Are there any models or loras that create good car interior images?
>>106213893we need a chroma'd qwen. is it possible?
>>106214544Probably, if you got the money to train it.
>>106214544only gonna take a gorillion dollars, perpetual 2 weeks of training, and may or may not fail or get obsoleted by the next meme that releases
>>106214579>Probably, if you got the money to train it.No need, Chroma is a community funded tune, nothing prevents the same from happening with Qwen. Question is, whether Qwen is worth it. Chroma is already pretty good at realism. Would a Qwen equivalent learn styles? Otherwise it wouldn't be worth it.
>clip_l.safetensors
Why don't image gen models use LLM to have better understanding of the text prompts?
what's up with qwen and being super insistent on the image regardless of seed? i don't know if this is the 8step lora causing this but when i have a prompt and let it generate 5 more, they are all essentially the same. i have not seen any other model do this.
can someone share a good workflow for qwen?
>>106214765https://comfyanonymous.github.io/ComfyUI_examples/chroma/
>>106214499I have no horse in this race and am not whoever it is that you've convinced yourself I am. You're shadowboxing with anons that don't even exist and you need to close the tab before you fully lose it.
>>106214744It's probably just overtrained.
>>106214782that's.. not qwen
>>106213189don't listen these retards, I got a 7900 XTX for exactly this reason. Nvidia drivers genuinely are bad on Linux, not to mention they're proprietary. the Linux kernel AMDGPU driver is fast and stable. I've been able to run LLMs, SDXL, Lumina, Chroma, SD3.5, and even Qwen Image. Someone else ITT was posting their workflow for running Wan on their XTX too. I haven't tried it yet, but I have no reason to doubt it can work.
Challenges/problems I've had:
>SVDQuant doesn't support AMD (it's not necessary, just a fast compressed format)>SageAttention2 doesn't support AMD (it's a decent optimization)>VAE encode/decode has some kind of perf bug on AMD, substitute it with the Tiled VAE Encode/decode nodes in ComfyUI to improve gen speedsI started out using an RX 6800 back in the sd1.5 days, the experience on AMD used to be a lot buggier back then. While AMD has some quirks for AI still, I have seen a ton of improvement in their drivers and ROCM. I think the AMD FineWine effect is going to start for AI eventually.
>>106214257Chroma not only understands complex prompts, it also varies outputs significantly compared to Qwen.
There is no reason to use Qwen because Chroma does those things better.
>>106214790>best local realism model "sucks ass">no i will not post comparisons or better models>everyone who thinks or shows otherwise it is just a SCHIZO and is just SPERGING OUT out>spergs out without quoting anyone below the discussion where people discussed this very topic>"I have no horse in this race"Never change, NPC retard-kun.
I think I generated a solid sequence from an anime girl image, as long as itโs not showing any privates, itโs good to post on a blue board right? It ended up kinda barby-doll ish anyway.
>>106214544You probably wouldn't need to go as far as Lodestone did with Chroma, but keep in mind that Chroma cost $150k over 6 months. Qwen is over twice the size of that, so it'd be a massive undertaking that runs the risk of being obselete by the time it's done.
>>106214839not that anon but if amd come out with a 48gb card that is at least as fast or as good as a 4090 for ai i will jump ship but until then i'll stick to nvidia
>>106214854It's not my fault you wasted all that money, Lodestone, but I'm sorry you did anyway.
>>106214861man, we'll be stuck with pony being the most used and popular forever, won't we
>>106214422Now generate 10 images with that prompt with both models and see how qwen has no seed diversity, doesnt make it useless, but a lot of people want to find a good prompt and gen a lot of different images with it.
>>106214874>nogen>derails all arguments since he cant engageDefinitely not seething and seprging out there lil bro. Thanks for conceeding lol.
>>106214861Nah, you would. Some ledditor trained a realism LoRA for Qwen, it's about same quality as a Flux realism LoRA. Unless you are completely blind to the capabilities of Chroma, it means you'd have to train Qwen for much more to achieve results equivalent to Chroma.
>>106214877Probably yeah. Diffusion models suffer a massive hardware disadvantage because we can't really effectively split these things across GPUs or run them even somewhat efficiently on CPU. Makes doing anything with any of this stuff more expensive, which means otherwise qualified people don't bother.
>>1062148397900 XTX is only as fast as 3090 in AI while having 33% more raw computing power indicates poor optimization of AMD driver.
AMD also severely limits distro choice since you always want the latest kernel for bugfix which excludes stable distros. Meanwhile I can update to the latest driver on debian stable with nvidia just fine without kernel update.
>>106214899>seprging>conceedingYou can't expect me to take you seriously when you can't even spell, anon.
>>106214908Take a look here
https://www.reddit.com/r/StableDiffusion/comments/1mjys5b/18_qwenimage_realism_lora_samples_first_attempt/
These are pretty much the same quality we got out of Flux in early days.
>>106214862It will be nvidia - $50 as usual.
>>106214932Already accepted your concession when you couldn't engage beyond ad hominems, sorry lil bro. Better luck next thread.
>>106214963Anon, there is no argument. I said that I thought Chroma sucked ass but would use it anyway, and you've spent the rest of the thread having a meltdown because you're convinced that every post you don't like is made by the same person.
Damn the continue video with the 4step lora really cooked it.
>>106214978>because you're convinced that every post you don't like is made by the same personBrutal irony from an NPC. Really can't make these up lol
this workflow gives really good results without killing motion:
10 steps (out of 20) for high noise, cfg 3.5, no speedup loras
5 steps (out of 10) for low noise, cfg 1, lightning lora strength 1
downside: it takes 3x longer to gen
>>106214978>you've spent the rest of the thread having a meltdown because you're convinced that every post you don't like is made by the same person.first time in /ldg/?
>>106215007If you say so, anon.
why are training samples like that
>>106214919>7900 XTX is only as fast as 3090 in AI while having 33% more raw computing power indicates poor optimization of AMD driverabsolutely, that and/or ROCM. But 3090 perf is pretty decent for the price, since it's roughly the same price as a 3090. Driver updates are free, so as AMD is forced to compete more in the AI market, later driver updates will probably unlock that +33% perf.
>AMD also severely limits distro choice since you always want the latest kernel for bugfix which excludes stable distrosYou are using a proprietary driver, that's sad. I'm using Fedora which is a very stable desktop OS and have had no issues with the AMDGPU update cadence.
Nothing against Nvidia users, just FUD being spread that limits people's choices
>>106214978you need to genuinely hide their posts and not respond. they are not only mentally unwell but will troll the fuck out of you if you even give them an inch.
>>106215080That looks so shit, I'm sorry
>>106215099too bad that your opinion doesn't matter. post a better gen. i'll wait.
>>106214979what workflow?
>>106215111>waah nogen nogenI'm not a Michelin star chef but I know a shit sandwich when I see one.
file
md5: 6b5e75769a4b3b67270c8037096ccb27
๐
>>106215010Wouldn't the opposite be better? aka having the main model (first one) be the one with lightv2x and then the second one refining without lightv2x?
Also for cfg I use something like picrel instead.
>>106215134can't hear you over the nogen tears. please try again.
>>106215099it's qwen, it's a little slopped ยฏ\_(ใ)_/ยฏ
>>106215142>Wouldn't the opposite be better?not really because lightx2v kills motion and high noise (1st model) is where all the motion is created
>>106215134>I'm not a Michelin star chef but I know a shit sandwich when I see one.the difference is if you know better you should be able to easily make a better image versus training for years to become a master chef
Is there really still no way to split .gguf diffusion models across multiple GPUs?
Wtf, I can't find comfyui cmd arguments list, I remember there were one large.
file
md5: 039f35b719ad9f9c39ba15eed177f605
๐
this is bullshit, what the hell qwen
Chroma is a disappointment... prove me wrong with a realistic full shot showing correct fingers and toes, good luck.
>>106215203> toesnice try footfag
Who ordered nogens with extra salt today?
>>106215203https://en.wikipedia.org/wiki/Burden_of_proof_(law)
I see no issue with chroma feet ?
>>106214979which "continue video" wf?
>>106215255Look at the texture. Other models could never. The future of local is here.
>>106215255Isn't that... Hunter Biden's niece?
nice... hand is bad though :v
>The scene is set inside a vast, neon-lit server room with towering racks of blinking data servers, cables coiling like serpents across the floor.
>"like" serpents
>literal snakes
>>106215010will try these settings, lightning 2.2 is distorting all my stuff
>>106215312the downsides have having to use prose-like natural language is that sometimes the model will interpret it literally. it fucking sucks
https://civitai.com/models/1855413/artist-stylewlopqwen-image
this is pretty good, qwen loras are going to be awesome. would be nicer to have baked in styles though.
>>106215203Chroma is fine by me but I dont know why anyone cares with wan 2.2 being so good. Wan gets the anatomy right even with complicated placement of multiple subjects. It's decent at doing headphone cables which I've never seen. It's the first model I've used that knows what rimlighting is. Just the first things I usually try and I'm usually disappointed with just werk with wan. It doesn't look like slop either.
What settings are you guys using for Chroma? Does the cfg need to be higher if you're attempting for realism? Is the default Euler and Beta at 26 fine or is a different set up better?
>>106215326chroma doesn't have problem with it
BigASP guy said he was considering finetuning Wan for uncensored T2I. Would be nice if he managed to pull that off.
!!! Qwen can handle tag prompts !!!
Idk if it had tags in its training set, or it's just due to the model's strong prompt understanding.
>Anime drawing with artist style:wlop.
>woman, huge breasts, wide hips, transparent white sundress, bikini under clothes, long blonde hair, wind, sunglasses, looking over classes, looking at viewer, drinking kirin ichiban beer bottle, lying on beach chair, on towel,elbow rest,
>Background:
>beach, cliff, sand, tidal waves, ocean, sunrise.
>>106215371lustify should be coming out with a new model soon too
but as said anon chroma really shine when it comes to anime particulary for hands and feet :v
>>106215384I remember reading him say somewhere that he doesn't really have the money to tune anything but SDXL. Wish he did because I think we've probably reached the limit of what can be done with it.
>>106215354>Chroma is fine by me but I dont know why anyone cares with wan 2.2 being so goodWan is more cinematic oriented than chroma for realistic photograph or social media model type of images of 1girls, which is the main thing people want to gen, and also slower to gen.
>>106215392Thin your paints
>>106215361cfg 3 steps 35
euler beta is great, res_2s bong tangent can be better some times but somewhat more unstable and 2x slower
also regular chroma 48 or 50 without detailed seem best for me for realism although detailed can be good to get a different output for same prompt when genning a lot of images of one prompt you like
>>106215392One style chroma doesnt do better than ilust/noob was always tranime, idk where you heard otherwise
>>106215392>1050x759sorry, too close to the ground zero of 1024x1024.
expect shit gens. only half joking.
>>106214839>Nvidia drivers genuinely are bad on Linuxkek, ALL research and corporate use of Nvidia for AI is done on Linux, the Nvidia drivers on Linux are primed for AI over anything else.
So is there another Chroma version coming? Lodestone is constantly updating the dev repo on huggingface.
>>106212893>>106212877>>106212679>>106212587Thanks for sharing the workflow.
It works well on my 32GB RAM and 12GB 3060 GPU for one frame txtimg.
How can I enhance/upgrade model consistency/quality?
How should I upgrade the Lora, checkpoint, clip, or something else?
>>106215371very nice, currently genitalia in t2i looks hilariously bad
>>106215364flux fusion v2 (schnell finetune) is honestly better
>>106215451Appriate it anon. I'll give that a try. I've been having good luck with v48 so I'll be sticking with that
>>106215380and this is with no LORA. actually a decent style IMO:
>impressionist expressionist painting with thick impasto brush strokes in acrylic paint, slight anime style influence.>>106215488that's not the same usecase as desktop linux. nvidia drivers suck ass on desktop linux and even linux gaming.
>>106215576but qwen fails the armpit hair test really badly, wtf is this
>>106215576>that's not the same usecase as desktop linux. nvidia drivers suck ass on desktop linuxIt's the same driver you moron, there's no separate AI driver, are you retarded ?
>>106215576>nvidia drivers suck ass on desktop linux and even linux gaming.LOL
>>106214862wasn't there some rumors about a 32gb 9070 (xt?)
Qwen is heavily overtrained. you can't really keep a prompt and "try again" because it will spit out the same image regardless, wtf.
After everything going on with LLMs and now the Qwen and Chroma shit, it's become obvious that literally nobody making any of these models has a single fucking clue what they're doing.
>>106215732sure they do, benchmaxxing
>>106214327>100 lorasI only see 14.
https://civitai.com/search/models?modelType=LORA&sortBy=models_v9&query=qwen
>>106215660yeah it's the same driver. nvidia's driver is intended to support linux servers and windows PCs. it's less stable for linux desktop users because it's not as well tested or supported for that environment. screen tearing is a common issue for nvidia on linux.
>>106215732true, the AI crash is coming soon because of this.
>>106215732The tech has only really been around and accessible for a few years. NAI leaked in late 2022, we are not even at the third year anniversary yet. Of course nobody knows what they are doing, everyone is in the process of writing the canon through current mistakes and failures and serendipitous successes.
>>106215732Could be better but Chroma did improve and was good for a lot of things for a long time. And qwen image has good prompt following, although the big thing there will be the image editing model, idk what you expect, multibillion dollar companies like meta and closedai just had brutally embarrassing releases that are essentially completely worthless despite billions of investments.
>>106214187you don't need riflexrope
>>106214327Plenty of anons even here trained loras for chroma to test it out while it was cooking, everyone else waited for the final version, why would anyone train a lora that has to be retrained a few months later, are you that brown and retarded?
>>106215780wan2.2 natively supports 129 frames? i know in wan2.1 the video loops/degrades after 81 frames, hence needing riflexrope.
>>106215794Just look at it and there you have it. Worthless for realism compared to chroma and worthless for everything else, cinematic look, product design or whatever compared to wan.
Is it possible to use Wan 2.2 to generate a 360 degree rotation 100% of the time? (on every seed)
im so tired of this toxic general yet I spend all my time here. I need a new home anons. p.s. fuck you all
>>106215817I love you bro
>>106215815there are a few 360 camera loras for 2.1 could try those i guess
>>106215758>nvidia's driver is intended to support linux serversAll of the AI industry runs on Linux, from research to SAAS, this is the goldmine for NVidia, their Linux driver is top notch for AI
Nobody in the industry or research uses Windows for AI, Microsoft doesn't even release their own AI projects like DeepSpeed for Windows, only for Linux
Can anyone confirm if Chroma v49 gets fine details better than Chroma v50?
>>106215833oh shit you're right! this is exactly what I needed
okay so something happened at the last step of chroma's training which resulted in some weird behavior at the 1024 res mark. blurry pictures, flux chins, faint lines at 512 like it was made of 4 images, etc. as a person who knows very little about this stuff my question is, how will this affect the future finetunes, loras etc? if chroma were to take off and become the new community base like I hope it does, are we gonna see guides on civitai months or years down the line mention not to use 1024x1024? how bad is it?
>>106215757maybe some chinese only platform hosts nsfw loras
well in taiwan or hk, no way they'd do that in clearnet in mainland
>>106215794so safe it was going out of its way to disallow nsfw
>Sampling 1 frames at 904x1600 with 20 steps
Error during model prediction: The size of tensor a (5700) must match the size of tensor b (5650) at non-singleton dimension 1
Why can't I generate anything bigger than 720p on wan t2i?
>>106215835all completely true, that doesn't contradict what I'm saying.
>>106215847also for kontext
>>106215758bud you're lucky you're anonymous because you're as useless as an inflatable dartboard. go learn something and we'll pretend this never happened.
>>106215853>months or years down the linewe wont be using chroma years down the line or if you will llms will research best parameters and set it up for you
>>106215845why dont you just compare it yourself
>>106215845Just avoid v50. v48 was the culmination of the previous epochs, and v49 is that + 1024 training.
Why wont ComfyUI release ram? It just chokes itself until the program crashes.
>>106215910we are still using sdxl tho and its 2 years old
>>106215935does 49 suffers from the same problems as 50?
>>106215963some people will probably use sd1.5 in 2030 still, simply because it's easy to run and has so many loras
file
md5: 514dc55bd749232a816e26d098f1c2c6
๐
>ok liked this seed!
>lets change it a bit
>it changes the color of the ears
how the fuck does this shit work man
wan2.2 take on your tags prompt :3
>>106215989dunno but these eyes are nice
>>106215955Use the Unload All Models custom nodes. I am a 4gb VRAMlet so I organize everything into grouped stages that are connected only through Unload All Models passthroughs, and it works relatively fine.
>>106215935i skipped v49 so i could give v50 an unbiased try before i compared them. it's very different to v48 and earlier.
>>106215963i use 1.4/1.5 a lot for the laughs. it cuhranks out the slop, but it's so satisfying letting go after you've burnt out on being so close yet so far with newer models.
anon can prophesize as much as they want, but they'll never get to set the standards on fun or taste.
>>106215969As another anons have mentioned, it throws a fit when you try to generate at exactly 1024x1024. In my tests, v49 and v50 both generate overly smoothed images, with v50 being especially egregious.
>>106215983Personal AI girlfriends/Onlyfans models will run on SD1.5. SD1.5 1girl is unmatched by all later technologies.
I've been comparing chroma v29 and v50 and it's not flattering for the latter
>>106216018Does 1.4 have any specialized uses over 1.5? I do miss some of the early unexpected outputs from prompts because of how the captions were back then, but I don't know if those were 1.5 or 1.4.
>>106216013I don't understand why there's a need for this custom node snakeoil. Why does the program keep these models in memory for no reason
should I not even try wan 2.2 on 12gb vram?
How do you guys organize your outputs? Feels like it's a pain in the ass to filter out all the bad gens once I'm done for the sitting.
>>106216055works fine Q8 and 32GB RAM (not VRAM)
>>106216077Everything saved in folder by the day. After genning a lot you go through and 1 click delete the images you don't like.
>>106216077Save all the good gens straight from comfyUI to specific folder, could be downloads, etc...
>>106216050In theory, to prevent you from having to reload them constantly, if you reuse the same models for multiple parts of the workflow. For example, let's say you do segmentation early in a workflow, do a denoise round, and then later on you need to segment again to do faces. Keeping the models cached means that, in theory, the overall workflow should take less time. In practice, everyone is constantly running up at the very limit of their available VRAM at all times, so the default behavior doesn't actually help.
The custom node just triggers the inbuilt "unload all models" functionality during the workflow, instead of it being a button you manually press after the final output. Yeah, there are a lot of custom nodes that should be native functionality, and some have undergone that conversion already, so it'll get there when it gets there.
>>106214515What are you training for? I did one of a MILF I know on Chroma and have been I2Ving them; shit is cash money.
Itโs definitely a quality in;quality out deal.
>>106216055you can easily use 2.2 with lightx2v, but the face is the same. i2v is better, because you can use an infinite number of people
can someone share an amazing realistic image from Chroma ? rn it just takes minutes to output shit D:<
why not train a new model based on sd1.5 but with more parameters and natural language + tags?
>>106214515>generate photorealistic stuff of my exso many people ask for that in request threads, and I don't get it
unless she's dead or something
if they separated, why the need to generate stuff for a girl they don't like nor want to hear about
some kind of rage fap session?
>>106216077it all goes straight to my mechanical hdd, has for the past 4 years with no incident
that's also where obsolete checkpoints go, having space is nice
pika1
md5: 4dce15be2924af5d759b5b22ea40dfdb
๐
>>106216039The older, crunchier chromas are honestly really fun because of how schizophrenic the gens can be. v50 just feels like a noisier flux.
>>106216176most men are loser coomers which got left by the girl, so why do you think
>>106216176Imagine brains as big neural nets.
Sometimes, latent space is just weird and has weird connections, because the training is equally weird. Just because someone is an ex does not mean the emotional/sexual connections automatically vanish.
>>106216055It runs fine even on 8gb if you stick to gguf. It will be slow, but just render at low res then upscale.
foid detection warning
foid detection warning
Wow, the nunchaku Chinks are something else
https://github.com/nunchaku-tech/nunchaku/issues/583#issuecomment-3153388671
Abandon Chroma but get to working on support of Qwen overnight. It's not that good, the only thing it does better than Flux is Chink text. Chroma is way more impressive.
>>106216203should I get the low or high noise version?
>>106216196honestly I want nothing to do with my ex, she's like an instant depression thought inducing thing the second I think or see her
no way I'd get aroused by her, and I used to find her beautiful
>>106216200I guess since so many people ask for that
>>106216176Glowies. They post about shit that is not even legal. Ignore those posts.
>>106216117>Yeah, there are a lot of custom nodes that should be native functionality, and some have undergone that conversion already, so it'll get there when it gets there.Hope so. Doesn't make sense to me that 10gb q5 model swap chokes out 16gb vram 32gb ram machine
>>106216232You need both, wan 2.2 loads one first, then swaps out to the second. I suggest q5 if you're vram/ram limited, but q4 is probably okay too.
>>106216277what does ldg heaven look like
>>106216123>What are you training for? Integrating in whatever goon generating workflow I devise based on the OC goon material that we did back in the day. Which is lots of nude pics and lewd videos softcore and hardcore.
I started with a general whole character lora to start playing, but I'm aiming for a dedicated face lora since its coming all mangled in a way adetailer isn't being able to fix(base model lustify sdxl) and a one of her huge tits lora meant to be high fidelity to the source to dial it in better in her gens and also able to apply on other girls and other degenerate flows.
80% of the source content is in 1080p videos so I'm looking into how to extract frames removing the motion blur and making it sharper like still pics.
I'm looking in this for now https://github.com/ckkelvinchan/RealBasicVSR but if you guys know better options you could guide me where to look for.
Video gen stuff seems hugely promising too and but I didn't dabble much into it yet. Its in the plans after perfecting pic loras.
>>106216176Cooming to something you experienced first hand hits different anon. Also a good way to finally put to use the effort into recoding stuff and eternalize that specific experience with fresh content forever. Its not really about having feelings for the girl.
>>106216216> abandon chromasource?
i have been editing together videos of multiple gens, should i upload them to civitai? im not sure how it works. its such a nuisance having to put your prompt and stuff with your gen.
>>106216306It's bait. They never had anybody assigned to chroma.
>>106216299there isn't. ldg is just infinite layers of hell.
>>106216264I see thx
I'll try a regular fp8 checkpoint first and if it doesn't work on 12gb then q6 gguf I suppose
being a vramlet is suffering
>>106215845>>106215935What? v50 is already good
>>106209052>>106208695Unless I am missing something
>>106216002true... just wish it was more faithful to the seed...
will i have to train a lora or what?
>>106216277Are the civitjeets and saasfags in the room with us?
1.
2.
3.
4.
5.
6. VRAMlets
7. Rigger/Migger/Nogens
>>106213893Faggots if chroma guy has success his next model will be a finetune of qwen
>>106216303You need to try on Chroma, thatโs what I did and it learns/retains face and likeness overall better than SDXL.
>>106216327I prefer the v50 output here
>>106216443now do v29, the last good version
>>106214286I did i2v at 1080p and it sometimes kinda worked, if that helps
>>106216443Every single chroma gen looks fried as hell and I'm tired of pretending like it doesn't. At this point the "chroma look" is as instantly recognizable to me as GPT piss filter.
>>106216443v50 feels like it desperately tried to smooth over realism prompts at the expense of literally everything else.
lodestone should be dragged out on the street and shot
>>106216456v29 wouldn't be able to keep up with the gen quality higher res. See
>>106208695 and
>>106209052
>>106214744Hidream does that
qwen team should be dragged out on the street and get taught about making a good dataset
>>106215882Are you using kijai's sampler? You have to make sure your dimensions are divisible by 16.
>>106216048it has an abstract and dreamlike quality that was tuned out of later versions, but mainly the lol factor. i kept my first 1.4 gens, and the way the overall impression is correct but the details are so very wrong still makes me laugh. picrel is a concept that came from that.
Chroma exists for people to gen their fursona getting dicked. Hamstringing it to try and get normies to gen their "1girl most beautiful woman ever big breasts" prompts with it is retardered. Doubly so when you consider how slow and demanding of a model it is.
>>106216311I see plenty of gens posted there with no prompts or settings
>>106216611you're slow and demanding
>>106216506I haven't really noticed that. The outputs look clean and ok.
>>106216584To add: the comfy-core sampler will do any resolution
Anyone have a workflow that's a modified version of the one in the wan22 guide where you insert 2 images where the first image turns into the second image as a video with a prompt?
>>106216584>>106216637This worked, thanks anon!
so what's qwen's story? did they steal chatgpt 4o's content or something?
>>106216705Just Chink things. Use synthetic slop to train model faster, at the expense of the quality of your model. But every normie including Chinks praise it to high heavens. And you're supposed to believe their model is superior.
>>106216622True, but my point stands
>>106216723I genuinely don't understand console wars over free stuff. This feels about as crazy as arguing really hard about wrenches and screwdrivers. Tools are tools.
>>106216764It's a tarpit contrarian engagement bot.
It's designed to disagree with you no matter what.
It's designed to incite tribalism, division, etc.
If you get along with people, you win.
Ignore bad actors, enjoy cool tech.
>>106215990Nice! Please share workflow image catbox
I dont care if my pc explodes
SHARE IT
Anyone here fans of Dalle 3 or are fond of its threads?
I trained a Chroma v50 Dalle 3 lora on a few thousand dalle images, enjoy:
https://files.catbox.moe/l713nm.safetensors
Keep in mind about the sentences used to trigger the styles. The trigger words for styles usually were:
"A digitally manipulated photo"
"A digital artwork"
"A digital illustration"
"A digital painting"
"An impressionist painting"
"An artwork in comic book style"
"A pixel art artwork"
etc
I highly recommend using Chroma Flash (the 10 step Heun sampler one) with this, I actually had -worse- results on vanilla v50
>>106216740>>106216777just wanted to let you know these are really fuckin cool.
more please thanks
>>106216764it's literally just contrarianism for the sake of it, you can see that in most generals too
>>106216740>>106216777I miss landscape general...
>>106216819>Chroma v50 Dalle 3 loraGarbage in, garbage out. But it was nice for artworks.
Dear /ldg/
Filter thos if you want to maintain thread quality:
Chroma
chroma
V1 - to - V50
v1 - to - v50
Ask your LLM of choise to do it for you
>>106216809here you are :3
https://files.catbox.moe/zdvdhi.png
>>106216516Can I request a comparison with an older epoch, maybe 34 or so? Curious to see it.
> >106216873
imagine trying to force a meme
>>106216819early dalle3 was lots of fun, I have similar dataset saved
>>106214327>Qwen can nsfw there at least 100 loras for thatforcing NSFW with a LoRA is never even close to what you get with a model trained with NSFW in base
Anyone who thinks a NSFW LoRA is good enough is Indian
Trying to set SD up on my new rig. What's the most efficient and similar to Automatic1111 for a 9070xt on Arch?
>>106216819is chroma flash worth it ? am using Q8 rn and am just disappointed... >.>
Has anyone figured out how to get the best results out of Chroma yet?
It seems like if you're using it just like Flux (i.e, not porn) then it has similar quality to Flux. But if you try to use it for something Chroma-specific (i.e, porn) then the quality sharply drops.
It's good to have the better prompt comprehension and other advantages that being Flux-based brings, but it feels like modern refined SDXL based models achieve better output quality.
>>106216907If you are trying to gen non-photorealistic images, it is unironically better than the base model in my tests
>>106216809>i dont care if my pc explodes after viewing THIS thread? ME EITHER
BEAHAHAHGAHAHAHA
>>106216879MS PAiNT?
who needs it??
BAWHAHAHAAA
>>106216911so basically flux + a random porn lora would be better?
>>106216764Just like there is no excuse for cost cutting manufacturing practices that result in a cheaper product, there is no excuse for cheaper training practices that result in a cheaper product. If you like cheap products, then that is on you, but in this case they are the reason we are falling behind on many things.
>>106216911The prompting is completely different. You need to either describe the image as a kind of photograph for realism, or as a particular style of art for an illustration. It gets lots of context from angle and lighting instructions. It also helps to include the "style" of the image at both the beginning and end of the prompt.
V50 seems to be worse for illustrations based on what I've seen, so try v48 for that
Simple Steps for /ldg/
Open any 4chan page.
At the top right, click the little Settings icon.
Go to Filters & Highlights tab.
Click Add (to make a new filter).
In โPatternโ box paste:
'''
(?i)(chroma|\bv([1-9]|[1-4][0-9]|50)\b)
'''
Check the box "Regexp" (important! otherwise it wonโt work)
In โTypeโ dropdown choose Comment or "Comment+Subject"
In โActionโ dropdown select Hide.
Close settings โ changes save instantly.
>>1062170062/3 of the thread wpuld be filtered but thats how a schizo mind works
>>106217006how much vram do u have? be honest
Why is Queen suddenly outputting a black image? It was working just fine before. Is it based on the prompt?
>>106216506yeah, v50 is definitely better at detail than v49. Looks like I can ignore anons telling me to use older versions.
>>106217032cumfartui
gotta disable sage and fp16 accumulation
>>106217032remove --use-sage-attention
>>106217006ok but how do i filter the schizos?
BEAHAGAHAHAHA
>>106217025I can run WAN 2.2 and Qwen, this filter is not for VRAMlets which also o recommend to use SDXL and a detail lora
>>106216299>LDG heavenNO SUCH THING
BEAHAHGHAHAHAH
>>106216632I know its just one image but looks at the red lights. Left is realistic, right is slop
>>106217061while Chromaist debate about 49vs50, Wan2.2 T2I inspired from previous post
>>106216764>This feels about as crazy as arguing really hard about wrenches and screwdrivers. Tools are tools.Yeah tools, you know those things where some are vastly better than others
So what resolution for Chroma? 1024 or 1152, or something else?
>>106216907Works as well as it possibly could, but you don't have the negative prompt so you can't schizoprompt in it to stop it from changing styles.
>>106217129but cars are cars... like gpu are gpu .... wtf this nonsense xD ... women are women too ? right xD
>>106217119she is 6\7th the size of a normal human
are you fucking blind or retarded?
My ComfyUI workflow goes:
1. Initial Gen
2. Upscaling
3. ADetailer Nodes
4. Post Processing (currently just https://github.com/SparknightLLC/ComfyUI-ImageAutotone but I'm looking for more if anything can add quality)
Is that the optimal workflow? Are there any meta custom nodes that improve gens?
>>106217159not anon fault if your wife is a fatty x3
so, when are you guys gonna post something GOOD??
bwahahaha
>>106217209Very cool Bayonetta.
>>106217119absolute mogging
sure it's "too professional" looking. But you can see the whole interior and exterior and its all correct. Her pose sucks and nobody would rest their elblow on nothing, but her fucking butt cheek and blue jeans are reflecting in the black leather seat
>>106217237>GOOD?!theres only 55 images and 5% of them are Statler\Waldorf
BEAHAGAHAHGAH
>>106217255new versions seem to have better character knowledge
>>106217119Can it do anime porn though?
>>106217159>>106217269Yeah you should see a girl in person that's not your mom
>>106217209mind to share WF on catbox ?
>>106217299>LDG>girl in personBWAHAHAHAHAH
>>106217319I have to clean it
file
md5: 30523e8fb3d0216dab11d17b167c2e5d
๐
torch compile literally does nothing on my gen times, what am I missing?
>try to use i2v loras
>my results look like complete shit compared to the examples posted on civitai
are those just cherry picked 0.01% RNG rolls? i feel like i'm getting trolled by my gens sometimes
>50 actual images or less
the absolute state of this shitty "general"
its like sex, you spend more time talking about it than actually doing it
>>106217378all loras cause distortions, in a sense that's what they're designed to do but in a specific way
I think it really helps to know how the lora was made, what was in the training data and a good understanding of the checkpoint to guide it in the right direction as implicitly as possible
but yeah there's also a gacha element to it
i think i am going to make the bake
>>106217423shouldn't be a problem since we're at the bump limit due to all the samefags, schizos, and discordtrannies talking to eachother 18 hours a day here right
BERAHAHAH
file
md5: 60b779fc2da716ce4fdba83462d28a3e
๐
>>106217404I'm still doing Chroma comparisons on the HD checkpoint. Trying to see which sampler/scheduler gets some good gens that also upscale decently well. Can't just poop out plots willy nilly. Or well, I could but that's kinda getting spammy, huh?
>>106217416LoRA can actually really fuck a model
>>106217423>statler\waldorf collage would make me laughbut please dont do that just wait for the real baker
>>106217439>spammyposting more than 1-2 images is "spam" according to the antirocket guy
>>106215955$20 says this makes the collage again
>>106217451>wait for the real schizowhy so he can choose only his own "gens" (as per usual?)
BWAHAHAHA
>sliding you ai niggers off the board
fuck all of u
>>106217474so you just dont care that it looks like SHIT?
>>106217484Most gens in this thread look like shit too, so I am just joining the club.
>>106217476tick tick tock tock
bye bye Ai fags
>>106217542>>106217476>page 9die die die
stay off /g/ !!!!!
https://files.catbox.moe/i6eil4.png
as a complete noob when it comes to image gen which one gives you the best realistic results
chrome,wan or qwen?