Discussion of Free and Open Source Text-to-Image/Video Models
Prev:
>>105776972https://rentry.org/ldg-lazy-getting-started-guide
>UISwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, & Upscalershttps://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info
>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe
>WanX (video)Guide: https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1
>ChromaTraining: https://rentry.org/mvu52t46
>Illustrious1girl and beyond: https://rentry.org/comfyui_guide_1girl
Tag explorer: https://tagexplorer.github.io/
>MiscLocal Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate
>Neighbourshttps://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg>>>/b/degen>>>/b/celeb+ai>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
Blessed thread of frenship
>>105782166Do you need to do the math on the likelihood 1 token will appear in 25 images with a random dropout? That's also ignoring the absurdity that prompts aren't just random hodgepodges of words, we are trying write instructions. It's like having a radio and randomly adding static to different parts of the clip and then asking someone to learn English or even better, what they're talking about and even more specifically that Mary's dress is red and she ordered blue. Caption dropout *might* work for tag soups but they're not training on tag soups, they're doing full paragraph + tag soup + metadata. You don't just randomly do caption dropouts without side effects or now the obvious fact: IT IS NOT LEARNING ARTISTS and as I already pointed out, this is a logical outcome.
>>105782220>retarded drama on the previous thread>debo is herethat explains a lot
>>105782233it knows a few furry artists really well at least, might be more about the artist you are wanting just not making it into the 5 million image budger
>>105782240https://www.youtube.com/watch?v=c1HSeVMz_yw
>optimization comes out
>gets forgotten
>>105782248>it knows a few furry artists really well at leastlike who? I wanna test that out, cool that it starts to know some artists at last
>>105782240https://www.youtube.com/watch?v=SHnt6zYURvk
Can anyone who uses Sage try to compare imagegen speeds when using a normal model loader and and when using the MultiGPU loader with RAM set to 0?
>>105782264fluff-kevlar also somewhat works atm
Returning to Chroma and the new AE:
>Lodestone:
>I'm experimenting on calibrating Chroma v38 checkpoint to my newer AE where it has compression and general EQ.
>the AE has different compression level F16C64 vs F32C64 and it has nifty properties where the latent has clear frequency separation. it's going to be interesting!
Here's hoping this won't just be a waste of time and force a rollback, as I understand it this is not used in any of the released checkpoints, but perhaps in v42 ?
>>105782240>>105782282https://www.youtube.com/watch?v=8p3_CWjRkHY
>>105782290>v38dont see why it would end up being a rollback, he is testing on a already released checkpoint
>>105782233>"A young woman named Mary stands in front of a boutique. She's wearing a long red dress with lace trim. Her expression suggests disappointment. A tag on the dress reads 'ordered: blue'. The sidewalk reflects soft afternoon light. Metadata: artist: McCumface3999, style: oil painting, location: downtown street, lighting: natural, color scheme: warm tones, mood: contemplative, keywords: woman, red dress, order mistake, fashion, lace."=
>*"A young woman ***** Mary stands in front ***** a boutique. She's ***** a long ***** dress with ***** trim. Her ***** suggests *****. A ***** on the ***** reads 'ordered: *****'. The sidewalk ***** soft afternoon *****. Metadata: artist: *****, style: ***** painting, location: ***** street, lighting: *****, color scheme: ***** tones, mood: *****, keywords: *****, red dress, ***** mistake, fashion, ****."Hope we didn't drop out anything important that epoch.
>>105782290that looks bad for the 3 of them lol, but what would Chroma gains if the AE gets improved?
>>105782310https://x.com/LodestoneE621/status/1939504778805191097
and maybe this is related?
https://x.com/LodestoneE621/status/1938734989434351796
For the anons trying the Wan nsfw finetune/loras, teacache's settings and the light2x lora don't work well on it anymore, as the finetune strayed too far from the base model
Tried the extracted lora for i2v with different samplers, steps, cfg, but nah, still blurred, most likely because the t2v model strayed too far from the i2v model. I didn't try without teacache though, but it would be a pain to do so
>>105782334>this will scale and shift the default CFG strength 5 into 1.So, guidance distillation?
>>105782322>>105782334Why don't you use xcanceled?
>>105782302If he implements this in upcoming releases and it turns out it's not so good after a few epochs of use, that would mean a rollback.
>>105782348cause im not a weirdo
>>105782354Sorry Mister but this is a discord screenshot.
>>105782360still not as strange as using a "service" whos entire point is to protest something dumb for stupid reasons
is this the discord screenshot comments thread?
>>105782354>muh fast AE that keeps the qualityI've seen that somewhere...
https://nvlabs.github.io/Sana/
>>105782362You are the perfect example of the people who argue here about nothing and in-fight each other.
Less than 70 IQ with no technical knowledge whatsoever.
>>105782354imo he's too obsessed with the speed, first with the "fast" thing that makes the model run on lower steps, now this, can't he just focus on getting a model that doesn't have frankenstein anatomy first, we'll think about the speed later
>>105782290>>105782302this nigga has good intentions but he has fucking squirrel level adhd, instead of just training chroma to v50 and going from there, he's doing distillation, detail experiments, training his own vae and random spaghetti merging 7 checkpoints into one at the same time and releasing another 5 parallel branches
>>105782378go cry on blue cry, bet you use that dying hugbox full of pedos as well
>>105782378>You are the perfect example of the people who argue here about nothing and in-fight each other.says the guy who argues here about nothing
Is there a local AI solution similar to Eleven Labs? I want to transcribe audio books without paying 100 a month
>>105782388its all separate anyways
>>105782400What do you mean? I only wanted to recommend xcanceled instead of direct links of twitter or discord. Seems like you are pretty hostile.
>>105782408"fast" is some schizo distilled bullshit that has been merged into the main model since v29 or so and it introduces flux plastic slopskin back
>>105773833catbox por favor?
>>105782416>xcanceledstop with this meme
>>105782416>I only wanted to recommendrequest denied, if you can't accept a no, you're starting to argue here about nothing, you don't want to sound like a low IQ don't you anon?
i had respect for you but now i'm worried. you're spending 16 hours on 4chan every single day.
>>105782433never meet your heroes
>>105782394Yeah, as much as I like Chroma I have to agree.
Like who the fuck decides to switch to his own experimental AE at 41 of estimated 50 epochs, this guy sure don't like smooth sailing.
That said, if this works well and is an undeniable improvement, he will be hailed as a genius, if not...
>>105782408>its all separate anywaysif only he did the same for fast, I don't mind his experiments, but at some point he changed the whole training process at v29.5, for the worse
Please stop fighting and gen stupid poop!
>>105782426you are crying because you cannot use your centralized accounts?
>>105782408Ok, THAT makes sense.
Just how many branches is he training now ? Who is funding all this, furries ?
>>105782452you are using a website who's entire purpose is crying about another
>>105782463>Just how many branches is he training now ?all those useless branches when he could just focus all his gpus to a single normal one and get this shit done faster, sigh...
>>105782396he is in the hyperbolic time chamber training on the highest gravity settings. be patient, let him cook
>>105782473I think he already has everything calibrated for however many gpu's he's using per instance
>>105782394>this nigga has good intentions but he has fucking squirrel level adhdIdk if he's brave or just crazy, he said he already spent 100k on this project, and yet with all those stakes he's still doing some weird experiments as if the only bad consequence would be a broken glass or something lol
file
md5: 69821ece13314a9a737158ae15123e79
๐
chroma bros..? 50 steps btw
>fr fr twitter and netflix are cool
/ldg/ moments
>>105782524trust the plan anon, he knows what he's doing, the "fast" poison is just kicking in... or something...
>>105782524I have a simple question, I never tested Schnell, were the hands that bad on that one?
okay time for Radial Attention NOW
>>105782524Chroma will never get the small details right. He admitted his database is relatively small and training time is also an issue.
>>105782524Anyone can cherry-pick a gen that looks bad across multiple epochs, it proves jack shit.
>>105782548they are not that bad on either, he is using shitty settings or something
>>105782524grab a workflow from here
https://civitai.com/models/1330309/chroma
>>105782548just try it? it's a nice model. hands do have a chance to come out wrong, it's certainly not always tho.
0
md5: 55429bc66b8934eee0522d412a2ea1ae
๐
>>105782555Only recently was a branch bumped from 512 to 1024, fine detail like fingers will likely look bad on zoomed out people even at epoch 50.
It can most likely be fixed with further fine-tuning, perhaps even loras.
>>105782595It's a matter of further training.
i'm so fucking bad at proooompting
i hate myself
>>105782576>scroll down>first image is two zesty niggas hiding the sausageAnon...
>>105782595What I do remember is that when SDXL 0.9 was published it was already pretty good. SDXL 1.0 was worse in some ways.
Chroma with NAG works well for me for real life stuff, but for illustrations it keeps giving me worse results than regular neg.
>>105782576it's literally that workflow, but the person is not zoomed in portrait 1girl like most gens but a guy dancing in front of a store
i'm not mad, the guy spent like 50k training this and is giving it away for free, but let's not pretend it is what it isn't
0
md5: 7f521f6c0f36ffc77cf501972a987281
๐
>>105782675>but let's not pretend it is what it isn'tthat's too hard anon, you have to pretend that this model has no issue and that everything he's done to the model was the right choice!
ugh
md5: c78d20f9511873f38942422a292f3712
๐
>try to generate oil painting of a woman
>the background and clothing look like an oil painting, but the woman's face and cleavage look realistic because the model was over-trained on photographs of womens' faces and boobs
what's the solution here?
yes I already tried (oil painting:2.0)
>using sage
Why do my gens get worse it/s as it progresses?
>>105782733welcome to 1girl slop hell
>>105782733Looks like a Flux thing.
Try
>oil paint medium, impasto technique, oil painting by some artist, glazing,
I for one like chroma, but I'm the target audience
woops, meant to catbox
I for one like chroma, but I'm the target audience
https://files.catbox.moe/1rg68z.png
>>105782780aww man not my strawberry preserves
>>105782733did you try an oil painting lora and increasing its strength
How do I:
>completely nuke all traces of python
>completely nuke all traces of comfy
I might need to reinstall this entire thing and get portable since this shit is falling apart
>>105782800Do you know what directory is?
>>105782803I wish this was in one directory and not buried in several hidden folders.
>>105782830Bit busy now :/
>>105782837What about using an uninstaller? You clicked Python.exe installer and now you need to click the uninstaller.
>>105782800you are in for a bad time regardless of how you install python
Hello, I'm the guy trying to get 2D loops working. I thought I had it working with a Color Match node, but it was a placebo effect.
Now I'm looking at https://github.com/kazeyori/ComfyUI-QuickImageSequenceProcess to remove the starting/end frames to just bypass the unwanted flashing effect, but unfortunately it doesn't work as stated and you can't define a negative number to remove frames.
Back to the drawing board...
>>105782887how did you get it to work? anons were saying it crashes all the time
>>105782889This is why people like you should only be using smartphones.
>>105782889I know but I installed too many random python packages and custom nodes and now it's all fucky
>>105782910the exact same thing happens with the portable version. Just delete the venv and all the shitty custom nodes you don't use
I'm unironically excited again that we soon might be free from the webshit menace. C chads, rise up
>>105782800which OS are you on? how did you install python and comfy cause there's like a million ways
desu Comfy should've just embraced uv as the idiomatic way, now you've got an army of tech illiterates whose installs are split between native python installations, python venvs, conda, and those weird zipped portable or one click installer shit
good luck troubleshooting all of that at once
>>105782917ok i deleted it and now it says something about missing program files folder?
>>105782933The standalone.
>>105782917Here's hoping he installed the random python packages inside the venv and not system wide...
>>105782962Yeah about that...
>>105782975Samefagging pedo
>>105782983I am posting gens
>>105783004have you considered therapy instead of gens?
>>105783029wow you are a meanie :(
Can I delete the pip and uv cache? There's no system shit inside, right?
>>105782524doomGODS always win
>>105782354The throughput of the 16 channel VAE is like 100ms. It's not noticeable during generation.
>>105783390level of compression seems to be the goal
>>105783393He might as well train a new model on top of 1.6B Sana then it'd be quicker.
>>105783419he apparently found a better way
>>105783441None of his schemes proved to be that good, even his custom model is inferior to Flex which actually is trainable on a 24 GB GPU and has half the inference time.
>>105783448>Flextried it, seemed shit in comparison on top of being censored
>>105783468It's just a smaller Flux with CFG, there is nothing special to it outside of being something someone should be spending $100k to finetune and not a shitty extremely slow model we call Chroma. Flex is just, foundationally, faster than Chroma and it also has CFG.
>>105783476I can't say I care about a slightly faster flux since nunchucku already exists anyways
Any Chinese danbooru artists? Or any that aren't Japanese or Western Cartoon?
>>105783500Well you should care because that also applies to training speed and cost and I assume you're a desperate coomer given you talk about Chroma.
>>105783504Ching BiaoXing is a great one.
>>105783513you know it would cost about as much still right? Retraining a model on anatomy is a huge deal
>>105783522Flex is at least 30% faster than Chroma, so that's 30% faster per step buddy. That would mean *checks notes* we'd be past Epoch 50 now. TURNS OUT THAT MATTERS :D
Why the FUCK does portable not come with node manager??
>using the portable version
>>105783536if you weighed it down with too much stuff it wouldn't be portable now would it
>>105783173Aww, you're so cute and innocent, you butt hurt silly boy! :3
I just want to eat you up! ;p
>>105783530can it do this though?
https://files.catbox.moe/vmda0y.png
also besides nsfw stuff flux is way too locked to a few styles
>>105783521I can't find that one on danbooru :(
>>105783560Yes if he trained using Flex instead of his frakenmodel that is 30% slower?
>>105783569didn't flex release afterwards?
>>105783560y-you want that?
>>105783575Flex is 5 months now so they have similar timing.
has anyone had any success with ltx-video?
>>105783569from his discord the smaller size comes at a cost
>flex is just removing some transformer block>while chroma is notchroma retain 19 MMDiT and 38 DiT blocks
while flex prune the MMDiT down to 5 IIRC
so chroma the model depth is not compromized
>>105782524> 50 stepsah, so you're retarded. got it.
>>105783642that anon is right, you need at least 69420 steps to get decent hands
>>105783581You do know ostris just undistilled it right? lodestone undistilled it and is now retraining anatomy pretty much entirely, and a side effect is it also broke flux's style bias
That takes quite a lot more time to do
>>105783642>>105783645i get pretty good hands at 30-35 on photos
>>105782733>because the model was over-trained on photographs of womens' faces and boobsuse a better model keke
>>105783536What stops you from using
>.\python_embeded\python.exe -m pip install -r .\ComfyUI\custom_nodes\SOMENODE\requirements.txtIt's not that hard.
>>105783597The pruned layers in Flex contributed little to nothing to final output, Flux is an overly bloated model so made his model 30% slower for no reason.
>>105783671lodestone said he removed all redundant layers and that removing any more started to have a negative effect on it
>>105783647Flex is 8B parameters and undistilled, so yes it's basically Chroma except 30% faster and would've been a better starting model and he would've saved 30% on his training and maybe wouldn't have had to resort in his gimmick training strategy to begin with.
>>105782733>>105717894 welcome to local models. redeem the mogao if you want to unlock painting styles
>>105783680Yes, you prune and then train a little to bring back capabilities, the process is well documented and again, Flex did a very good job at it an unlike Chroma is actually fast and requires less VRAM and compute to train. Chroma is just an example of hubris costing money.
>>105783671chroma is slower because it re-introduces CFG/negative prompt, something that flux lacks. flux would be even slower if it had this
>>105783681cept flex came out just recently, is not nearly the same scale nor does it have the same goal... flex still is locked to flux style / subject bias. Good luck doing anything not person standing or the 3 styles it defaults to
>>105783697flex was trained on flux outputs so it's just doubling down on fluxslop. i don't know why people believe flux needs to be de-distilled in the first place. flux dev is perfectly trainable with both finetunes and loras
>>105783704cause a tune big enough to introduce nsfw elements costs tons in compute, illustrious cost like several hundred thousand and it was sdxl, can't expect people wanting to entirely foot that kind of bill out of charity
>>105783704>flux dev is perfectly trainable with both finetunes and lorasmust be why there're so many amazing nsfw loras and finetunes of dev hahahah
>>105783697Flex is 5 months old.
>>105783390the latent is larger -> diffusion model is working with a larger input -> slower generation
sdxl: 3x1024x1024 image -> 4x128x128 latent = 65,536 values
flux: 3x1024x1024 image -> 16x128x128 latent = 262,144 values
>>105783728>must be why there're so many amazing nsfw loras and finetunes of dev hahahahkek
>>105783718the bill balances out in the end, there are no shortcuts. if you look at the early epochs of chroma, it was an incomprehensible mess. it took many epochs on a 3mil+ dataset before it even started being remotely coherent. what chroma accomplished in 40 epochs on de-distilled schnell is doable on 10 epochs of dev.
>>105783732>Flexwas gonna say that was wrong but apparently its flux 2 that just recently released a preview
>>105783733Then you would just use the Sana AE which is significantly smaller than SDX with comparable quality. But it's retarded because that requires a massive finetune for either SDXL or Sana -- so you might as well do a full Sana finetune completely and take advantage of ALL the speed improvements.
>Yes, the images all look like Flux because Flex is just Flux trained on it's own image generation.
https://www.reddit.com/r/StableDiffusion/comments/1k5s2zb/flex2preview_released_by_ostris/
What a idiot... fucking inbreeding a model like that.
>>105783685chroma does it fairly well unless you get unlucky and it tries to force midjourney slop
>>105783748Yeah I'm talking about Flex.1 which is the core 8B model + undistill. Flex.2 is the controlnet model.
>>105783749people tested sana with tiny test tunes, it sucks
>>105783763Given the best audio model available right is Sana-based seems unlikely.
>>105783749the sana ae is NOT comparable quality and sana is a completely unproved model
i made this a while ago but i doubt anything changed with the ae
https://slow.pics/s/DIDxrbQx
>>105783753no wonder flex always looked like ai slop turned to 11
>>105783753Yikes. Flux dev looks like plastic shit because it is trained on output from Flux Pro
Flex is then trained on the even more plastic shit output from Flux dev... ?
But why ?
>>105783776Are you retards even capable of knowing the difference between 1 or 2. Try holding your fingers up.
>JUST FINETUNE SANA IT WILL BE SO HECKING GOOD!!!
file
md5: 2d5295904e2c0c9c4137abbda7b585f3
๐
>>105783774Have you fucking even looked at your own test of SDXL vs Sana AE? SDXL is absolute vomit.
>>105783756>tries to force midjourney slopsurely he tagged the MJ images as such so you can put that shit in the negatives... right? right??
>>10578378111 is 2 1s next to each other
>>105783788yeah now how about you compare it to sana dc-ae
i feel so lost. tried to delete my python folder.
>>105783780datasets are a lost art. nobody knows how to assemble them properly and they read about the wonders of synthetic data from the LLM world and think it applies equally to image models.
>ctrl+f sana
>14 results
are we back
>>105783800Yeah pick the details you want to mangle. SDXL is generally worse on all details and *sometimes* better at faces. Also Sana has multiple AEs so you can do the 16 instead of 32.
help i accidentally made the comfyui image feed max size and now i cant resize it back
how do i unfuck myself?
After testing local anime diffusion, I've decided to stick with NovelAI.
Why?
CivitAI has attractive models with impressive samples and a diverse style database, but there's a major flaw:
SDXL is fundamentally stupid model.
I believe the quality of the dataset and user efforts,through samplers, Lora workflows, etc. put into SDXL is significantly higher, yet they can't compensate for its shortcomings, which is disappointing given the dedication of CivitAI's developers.
Did you know that NovelAI comprehends prose exceptionally well? Even better than Flux!
I can input vague scene descriptions and generate an infinite variety of high quality outputs, all aligned with my prompts.
Pic related its a local image creation. It's quite beautiful, but it lacks a story or any significance behind its beauty.
If there's any beauty, it's in the quality of the checkpoints, which are often passed due to the diligence of the people from CivitAI.
And the AI artist prompts not with prose or a message with meaning, but aims to 'hack' the stupid language of SDXL.
>>105783814>comfyui image feedWhat are the advantages of the image feed over the queue feed?
>>105783807First ask yourself what Flex.2 is. Then ask yourself What Flex.1 is. Then ask why Flex.2 used synthetic images.
There's a reason why sana isn't popular. Its vae might be good but the model is past saving
>>105783830let them cope. pixshart finetunes in 2 more weeks!
>>105783830Sana isn't popular because it has a 32 GB VRAM minimum requirement because Nvidia wanted to sell 5090s. Sana Sprint is pretty cool though.
>>105783823if only they would do a midjourney and release a uncensored version of their video model
>>105783837Yeah in 2 more months maybe Chroma will fix hands and the caption dropout will skip the art tags.
>>105783837pixshart is different than sauna tho
>>105783811kek you dont understand how this shit works
sana cant just swap out the ae without retraining
and if you swap the ae you lose compression meaning you lose speed meaning the whole point of the model is lost
>>105783847chroma does hands well, stop falling for the one guy either trolling or using shitty settings
>>105783825I used image scale node and it seems to fuck up my gens permanently.
>>105782211 (OP)add
>>>/b/ai+parodyto neighbors plox
>>105783857>bro it's your problem is 30% of the time the hands are mangled, you're supposed to gen like 10 times
>>105783857Is Chroma compatible with Forge? Are there any tutorials for using it there or in ComfyUI?
>>105783884I've been using the last 5 or so epoches as my main model and that is not true at all, grab a random WF from here https://civitai.com/models/1330309/chroma
>>105783789>surely he tagged the MJ images as such so you can put that shit in the negatives... right? right??I don't think those have been tagged with MJ or midjourney aesthetic or anything like that, I've tried. You can get the style out with using vintage and retro tags in combination and prompting for realism. You'll get the 1.5 era elongated Midjourney physique with the trademark disgusting yellow tint.
>>105783876stuff like that shouldn't be openly advertised since it's porn pictures of celebs. have your fun but don't be retarded
>>105783894I've literally used it, I know how good and bad it is. It's also absolutely horrible for prompting.
>>105783904post your gens
>>105783876>>105783900>All characters depicted in this thread are fictitious. Any resemblance to a real person, living or dead, is purely coincidental. No pictures in this thread are intended to harm any individual.desu seems like they covered the legal part
chroma v41 vs flux dev pixelwave
>a close up photograph of a pigeon with an afro wig looking straight to the camera.
>>105783907You already know this does nothing, I can cherry pick bad images and you can cherry pick good images.
>>105783914blebbet tier prompt
>>105783907yet you don't have any gens to show either.
chroma v41 vs flux dev pixelwave
>classical Renaissance oil painting of a woman standing in a field with a pigeon sitting on her shoulder
>>105783909doesn't mean they should test it. keep a low profile
>when all my gens with chroma are nsfw
Say all you want about Chroma. It is the only model that can generate feet. Therefore it is the only model that knows about human anatomy. A model that can't or won't is shit, it's that simple.
file
md5: c43534958203eab215804b0c04632e56
๐
>Majestic mountains rise dramatically against a starry night sky, partially shrouded in thick clouds, with a small, rustic building visible at the base.
https://files.catbox.moe/ce681p.mp3
acestep
genned it yesterday, but didn't post it, it's a new seed for Foreigners, and I think it's nice.
chroma knows the important anatomy
https://files.catbox.moe/i4kb64.png
>>105783966Easy!!!!!!!
>Kontext>Modify the image so that the people are wearing casual attire.
Haven't updated in a month or two, did Comfy fix the memory leak? The browser tabs gets to over 1gb over the course of a day's prompting and it becomes very slow.
>>105783973could you, perhaps, be referring specifically to early 2000s candid amateur y2k flip-phone camera photos of asian women's dirty feet complete with vintage 512x noise-artifacts??
>>105783998In fairness: I don't know if it's addons doing this, maybe it's not ComfyUI's fault. I don't have many addons though
file
md5: 69f726ac8e4b8927b72e6e8949f96c60
๐
This image displays a section of urban street art painted on a textured concrete wall, likely captured to document or showcase the artworkโs presence and meaning.
From top to bottom: three red arrows are painted directly on the wall, each pointing downward above one of the three black-and-white stenciled human figures. On the left sits a young child with a distressed or passive expression, legs apart, holding an object in their left hand. In the center stands a slightly older child in a long top with buttons, holding their right hand up to their nose or mouth. On the right, an elderly man with a bald head and beard sits cross-legged, holding a traditional bowed instrument resembling an erhu or similar stringed instrument upright in his left hand, with the bow in his right hand. There is no other visible text. The background wall is grey and worn, with darker degrading spots. On the left edge, there is a vertical dark-colored metal structure with a red and white sticker reading โSTICKER.โ On the top right corner, a protruding metal beam or bracket extends from the wall. The three figures are aligned horizontally across the image, each evenly spaced.
The image may have been shared to spotlight thought-provoking urban art that appears to comment on age, tradition, and possibly hardship or socioeconomic commentary. It is a street art photograph documented in situ with no visible enhancements or edits. The style is stencil graffiti with a limited monochrome palette, except for the red arrows and sticker. The
>>105783973I've seen Chroma do better humans than API solutions, E.G. 4o, Imagen, Mogao (Seedream). Even that other anon that came from the API thread boasting about Reve got BTFO'd quickly. The furry is onto something with his training method. It is SOTA.
also chroma can do some shit only novelai knew before https://files.catbox.moe/qj8twv.png
>>105783980Forever and ever training.
>>105783992Not that anon, but I genuinely tried this before and it sometimes add women's nipples as part of the clothing, kek
>>105783823isnt it possible to run NovelAI or whatever locally? I agree SDXL is dogshit for composing, but Illustrious is sometimes decent, but what works for me is making flux compositions then masking with illustrious, then upscaling with SDXL. all my recent gens follow this process
I know the community didn't want this but here it is. We are going to scale back the expectations for Phlegm v3.0.
We need to be better.
>>105784020aesthetic11, aesthetic10, Digital artwork of a woman sitting on a photocopier machine. She is wearing business casual clothing and is bottomless. Between her feet a slot in the machine is printing a picture of her ass on glass, pussy on glass, and she is smirking at the viewer. The woman is a white and black furred marble fox with medium breasts and an athletic build. The machine has a printer slot at the bottom. The image is by the artist hioshiru. ,no lineart, The image being printed by the machine has only her butt in focus. The printed image is a pussy focus, butt focus, ass on glass image. front view,
file
md5: 2484791c033ffc56d014e5142b8dc938
๐
This image shows a handcrafted greeting card featuring a cartoon girl illustration and decorative floral elements, designed likely for a birthday or celebratory occasion.
At the top left of the card, there is a pink background with a fine dotted texture. The card itself is tri-fold, with patterned panels on the left and right flaps featuring dark blue and white gingham-style checkers. In the center of the left flap is an oval, scalloped-edge cutout containing a cartoon drawing of a girl with brown hair styled in a bun and ponytail, wearing a purple dress accented with darker trim and white details. She has a flower hair clip on the right side of her hair. Below and to the left and right of this central figure are several layered flowers in various shades of purple with white button centers and white ribbon bows. The interior middle panel is partially visible and shows partially obscured cursive text that begins with "Bi", likely "Birthday". On the top right, another arrangement of three purple flowers with green leaves decorates the card, with a shiny, translucent ribbon extending across the card from left to right.
The imageโs purpose is likely to showcase the intricacy and craftsmanship of the handmade greeting card, possibly for promotional or inspirational purposes in a crafting or DIY community.
The style is whimsical and scrapbook-like, characteristic of handmade craft card photography with attention to layout and color coordination.
The image is clean, brightly lit, and in focus, with no visible compression artifacts or defects, indicating
file
md5: 481700a6208e59c1e549328beb113b62
๐
The image is a formal portrait painting of a woman posed in front of a plain dark red background, created to depict her appearance and attire with detailed realism.
At the top center of the painting, the woman wears a headdress featuring a green and gold woven pattern that wraps around her hair, which is secured tightly and styled back. Moving downward, her face is pale with fine detailing in the features, including slightly arched eyebrows, deep-set eyes, a long nose, and a closed, composed mouth. Her expression is neutral and poised. She wears a high-collared white undershirt with visible black embroidery trim on the collar and neckline. Over this, she dons a prominent green dress with voluminous puffed sleeves. These sleeves are separated horizontally, showing an underlying layer of horizontally banded fabric resembling ivory ribbon armor. A green band wraps around the bodice, accentuating her waist. Her hands are folded delicately at her waistline; she has a coppery ring on the index finger of her right hand and another ring on the pinky finger of her left. There are no texts present on the image.
The purpose of the painting appears to be commemorative or aristocratic, created to display the sitterโs status, fashion, and composure, likely commissioned for personal or familial prestige.
The style is of Northern Renaissance portraiture, characterized by high realism, elaborate textile rendering, and solemn expressions typical of formal portrait commissions.
The image shows high fidelity without visible artifacts, with sharp
1
md5: 97f10a52c4080548fc3b76aece16134b
๐
>>105784024this was the initial flux gen
>>105784029Would be nice see datasets used for aesthetic 1 etc
>>105783997actual photo of my current financial situation
>>105783957stay in your containment thread
you are not the sharpest pencil aren't you
>>105784009cool, now anyone can be a gay british vandalizer
file
md5: 02c1e96a8761dc5eabb2a5c724538860
๐
The image is a promotional character artwork depicting Sonic the Hedgehog holding a sword, likely intended for a video game or merchandise.
At the top, Sonic's signature spiked blue hair extends outward in multiple directions, and his large, green eyes are open wide with a determined expression. Just below, his mouth is slightly open, showing a smirk. He wears white gloves, and in his right hand (to the left of the image) he holds a large ornate sword with a metallic blade, intricate cross-guard, and runic-style text engraved on the blade. His left arm (on the right of the image) wears an armored gauntlet with technological-mechanical detailing, including bolts and a black-and-gold motif. In the bottom portion of the image, Sonic's red shoes with white straps and gray soles are visible, positioned in a dynamic and wide stance, emphasizing motion. On the bottom right, there is a stylized blue and black logo with a dragon surrounding a smaller Sonic figure and the word "Sonic" incorporated within. The background is white and featureless.
The image was likely shared to promote or highlight Sonic's appearance in a particular gameโprobably "Sonic and the Black Knight"โshowing him in a fantasy or combat setting with weaponry.
The style is highly rendered and digital, indicative of official game promotional art or box art, using clean lines, rich gradients, and polished lighting.
The image quality is high, with sharp
also, people know they should be using flan_t5 with chroma right? Don't use regular T5, its far worse with it, I've seen people's workflows using that before
Also use the T5 tokenizer options node with min_padding 1 and 3 min_length
>>105784045What do you mean with this command?
file
md5: 0bec760ac326bea200570674020c56ba
๐
>>105784066he is saying he bought a 5090
>>105784002It can do all kinds, that's what gives it sovl.
All you gotta do is tell kontext to fix the hands in your chroma gens and baby you got a stew goin.
>>105784058exit through the giftshop pls sir thank u come again
>>105784062also also, if using tags make sure they match how e621 has them
and regardless of what model you use I suggest using the custom clownshark samplers then use ultimate sd upscale
i love when people quote across threads to tell someone to fuck off
file
md5: 958ee672782b9039fa39766e79086a89
๐
A young man with short, slicked-back hair walks down a brightly lit runway in a black leather jacket, white turtleneck, and black pinstripe pants. The background is a plain, white wall with soft lighting. The image is a high-quality photograph capturing the elegance and sophistication of the fashion show.
--
Not very slick.
>>105784083so i can finally fix all my SOUL diffusion gens from 4 years ago?????????????????? :DDDD
>>105784062>people know they should be using flan_t5 with chroma rightno. never heard this
>>105784077He is not Poor Indian.
file
md5: c0a84531ab831b4c0978f3cff8b5e1f4
๐
>>105784090Chroma 40 Detailed
>>105784083the image is more saturated though, look at the sky it doesn't have that yellow tint anymore
>>105784106>>105784138that might explain some anon's issues with small details like hands
>>105784024I completely agree with you.
I use NovelAI similarly to conserve anlas, utilizing controlnet to lineart the structure of the gens, and I've incorporated these local checkpoints that have great artistic quality. I recognize that the datasets and efforts from creators like Illustrious NoobAI are of much higher standard.
The one thing I truly regret is the lack of a more advanced SDXL model that can work with the checkpoints made by these colleagues at CivitAI.
>>105784135Blue channel is more saturated.
>>105782398ill have to look up this model again that gen is cool
>>105783029>>105783173its just one drunk retard
he does this every thread
file
md5: fba00a2900292267564c546fa80b3aeb
๐
>>105783823>Pic related its a local image creation. It's quite beautiful, but it lacks a story or any significance behind its beauty.You're drinking the koolaid if you thinki you're manifesting your soul by plying the AI slot machine when you hit generate for any AI model.
d
md5: 4412d2d185f35dc75dd1d2f3d84b2252
๐
img
md5: e2decbf981a001eba55c385db2cbdf7d
๐
The only model I've ever held disdain for is Pony but even then I can recognize it was the coom meta for awhile and had a large ecosystem.
>>105784241using flan_t5 now?
>>105784218demonstrably true claim,
check the archives he attacks everyone and calls them pedos until they leave
>>105784244no that's MOGao lel
>>105784245Is there a pastebin? I don't think Debo ever called anyone with names.
>>105784243WHO ARE YOU TALKING TO? WHO ARE YOU REPLYING TO?
CAN YOU GO THE FUCK TO BED YOU DRUNK RETARDED BITCH?@??@?!?
literally NO ONE give a FUCK about your (retarded) opinion
>>105782983>>105784252>pastebintry opening your FAGGY eyes in this very thread
4chan
md5: 2fc1ee00aa99e1e5a9fd6c5a5e638ae0
๐
This is my 4chan. There are many like it, but this one is mine. My 4chan is my best friend. It is my life. I must master it as I must master my life. Without me, my 4chan is useless. Without my 4chan, I am useless. I must fire my 4chan true.
>>105784023I recommend an mspaint scribble where you want the clothing. Kontext is capable of doing the heavy lifting in photoshopping (ie it makes mspaint into enough for photoshopping). But, it's somewhat inconsistent, and I don't know how to make it more reliable. Got a lot of practice to do, and a lot of stupid shit has been dumped in my IRL
>>105784266:3
>'if i die, the seas will silence, the day will turn to night...'
>>105784266ahahahah yay I was hoping someone saw my "this is my model" post
>>105784276You are an active facebook commenter but you rarely create any original content.
>>105784282the janny has access to our cookies and shieeet
this is our doomed future
all is lost now
it is simply
O V E R
anon is quite unwell it seems
>>105784276>>105784312Joking aside, it's all good. Fair games and good hearted banter.
>>105783823all that jibberjabber for a sloppa image that you could have gotten a better\newer\higher resolution carbon-copy version on your oogieboogie civitai (with PONY sdxl no less)
>>105784282No, the facebook guy copied my post.
Use 2 samplers at the same time on every step???
I CANT EVEN RUN THE THING FFS
>>105784385there there, there are always services.
>>105784392not the model dummy the studio its broken
>>105784364>>105784282>facebookFUCKING NORMIES
>>105784395an autistic pedo can get it running but you can't
>>105784408why would you self report like that though
Wow, with the portable comfy, the Krita AI shit managed to connect first try. Truly unbelievable how shit the standalone is, considering googling it leads you to the standalone installer and not portable.
>>105784420electron is garbage is why and they keep wanting to push the garbage grifter shit in front of you to forward the enshitification narritive