← Home ← Back to /g/

Thread 105962656

326 posts 246 images /g/
Anonymous No.105962656 >>105962692 >>105962703 >>105963110 >>105964469
/ldg/ - Local Diffusion General
1girl Awards Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>105956911

https://rentry.org/ldg-lazy-getting-started-guide

>UI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows/home

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX (video)
Guide: https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1

>Chroma
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and beyond: https://rentry.org/comfyui_guide_1girl
Tag explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Anonymous No.105962692 >>105964370
>>105962656 (OP)
cool bake by a cool guy
Anonymous No.105962703 >>105962747 >>105962917 >>105963292 >>105963489 >>105966842
>>105962656 (OP)
Anonymous No.105962722
>>105961964
it sucks dick

>oh its for pros
no it sucks
Anonymous No.105962747 >>105962881
>>105962703
That look of pure bliss on Ash
Anonymous No.105962787 >>105962807
>>105961884
I'm testing this now and it's a massive improvement over NetaLumina. With the natural language prompt understanding of Lumina, it might unironically be on par with Illustrious, at least for some things.

It also kind of validates my earlier claim that NetaLumina was just being trained wrong. They used 50k A100 hours and the quality still looked like shit, this guy used 4k B200 hours and now it actually looks good.
Anonymous No.105962792 >>105962952
ANCHOR
Anonymous No.105962807 >>105962862
>>105962787
Ok and you gens? If you don't share gens then its slop
Anonymous No.105962862 >>105962915
>>105962807
just try it yourself, why would you trust anybody's cherrypicked images?
Anonymous No.105962879 >>105963513
Anonymous No.105962881 >>105963335
>>105962747
Ash is very lucky. The question now is who's next.
Anonymous No.105962887
Anonymous No.105962915
>>105962862
nta but because most of the tunes are so fucking bad you almost cant even cherrypick a good fucking image out of it
Anonymous No.105962917 >>105962932 >>105962949
>>105962703
how did you maintain the drawing effect? i have 3dcg slop effect
Anonymous No.105962932 >>105962969 >>105963005
>>105962917
>how did you maintain the drawing effect?
he didnt, the gen just doesnt have much movement and hes a vramletnigger who cant even afford to interpolate from 16 to 32fps
Anonymous No.105962945 >>105963513
Anonymous No.105962949
>>105962917
It still sort of has a 3D effect, it's just that the composition and character actions make it less obvious.
I'm using the wan2.1 nsfw finetune which offers a decent balance between motion and artstyle retention. No loras (yet).
Anonymous No.105962952 >>105962992
>>105962792

>CHROMA YUME NOOBAI XL

• What it is
A V-prediction diffusion model (NOOBAI XL-VPred lineage) rebuilt across 4 numbered releases. Only runs on tensor.art for now (not Civitai).

• Data & training highlights
– v1.0–3.0: Danbooru 2024, Yande, e621 + Illustrious XL & NOOBAI XL as teachers
– v2.0: +50 k real-life photos
– v3.0: dataset relabeled with GPT-4.5 then manually checked
– v4.0: fresh train on expanded anime/photographic sets (danbooru_newest, gelbooru_full, etc.) + custom captioned data; CLIP, VAE and U-Net retrained from scratch

• Version-to-version behavior
v1.0 – balanced multi-style anime, solid anatomy
v2.0 – much better anatomy & realism, weaker cross-style quality
v3.0 – recovers multi-style capability while keeping realism; prompt accuracy now critical
v4.0 – cleaner anatomy and style faithfulness, resolves v2/3 issues

• Core usage tips (V-Pred specific)
Positive (generic): “masterpiece, best quality, amazing quality”
Negative (generic): “bad quality, worst detail, sketch, censor, simple/transparent background”
Typical hyper-params:
‑ Sampler: Euler a (or Euler Ancestral CFG++ –Simple)
‑ Steps: 20 – 30 (25 – 30 with CFG++ recipe)
‑ CFG: 4 – 6 (1.2 – 1.5 with CFG++ recipe)
‑ Clip-skip: 2

For photoreal / cosplay output add to prompts:
Positive: “realistic, cosplay, real life, photorealistic”
Negative: “illustration, blur, film grain, cartoon, anime coloring, 3D, 2D, unrealistic …”

• Misc.
– All showcase images are straight from the base model (no LoRA / post-processing).
– Author actively seeks feedback to refine future versions.
– Credits: narugo1992, Nyanko, Laxhar Lab, Sennke and others.

https://civitai.com/models/1330192/chromayume-noobai-xl-nai-xl
Anonymous No.105962969 >>105963008
>>105962932
I have 16gb vram and interpolate every single gen, I just prefer 16fps. Continue being a retard at your leisure though.
Anonymous No.105962977 >>105963015
>instead of waiting for the last chroma version so you can at least try to be the first with an ok anime finetune of chroma or something big like that retards spend $ to finetune models to make it look worse than base sdxl
Anonymous No.105962992
>>105962952
>v3.0: dataset relabeled with GPT-4.5 then manually checked
Oh that cost a lot of money
Anonymous No.105963005 >>105963019
>>105962932
why on earth would someone interpolate an anime-style video?
Anonymous No.105963008 >>105963019 >>105963028 >>105963044 >>105965058
>>105962969
>i prefer choppy 16fps slideshows
Maybe with some advanced motion awareness models in the future you could argue you want to emulate anime/movie fps but with the current models you just get choppy af output on a 2D looking scene that actually animated as if it were 3D
Anonymous No.105963015
>>105962977
All local models are a huge source of income for GPU farms. Companies like NovelAI continue to profit, and dreamers continue to dream, spending money on careless finetunes.
Anonymous No.105963018 >>105963043
>closeup with makeup and mascara, wearing choker with the lower case text "/ldg/" sewn onto it
Chroma didn't want to do lower case, but at least it tried with the 'sewn' part
Anonymous No.105963019
>>105963005
>>105963008
Anonymous No.105963028 >>105963084
>>105963008
anime is usually 12fps at most my dude
Anonymous No.105963030
danbooru2024, danbooru_newest-all datasets, e621, e621_newest, gelbooru_full, yande_full sounds cool but the model looks slopped. at least it can still do true blacks desu
Anonymous No.105963043 >>105963107
>>105963018

I cannot wait for the eventual completely pornified chroma finetune... if done right, it's going to be the GOAT model for NSFW.
Anonymous No.105963044
>>105963008
Don't care. I see the two previews and I prefer 16fps to 32fps. It's as simple as that.
Anonymous No.105963084
>>105963028
>tranimetroon cant read, and is dumb
pottery
Anonymous No.105963099
>>105962626
https://files.catbox.moe/twui5f.mp4
It's just the lightx2v workflow from OP adapted to t2v
Not uploaded Peach lora yet
Anonymous No.105963107
>>105963043
As long as someone has the money and the motivation, Chroma is indeed the perfect base model for this, fully permissive licensing, uncensored by default, great quality out of the box, easily made better for 'specific needs' with further finetuning / loras.

I think the 'community' will be all over this model once it reaches final release.
Anonymous No.105963110
>>105962656 (OP)
>ldg 1girl awards
>shoulders much broader than hips

kekd
Anonymous No.105963120 >>105963192
Anonymous No.105963192 >>105963195
>>105963120
I smiled and rethought my life
Anonymous No.105963195 >>105963237
>>105963192
and?
Anonymous No.105963237 >>105963249
>>105963195
i'm still just waiting around to die
Anonymous No.105963249
>>105963237
good lad
don't try to find meaning in chaos
Anonymous No.105963292
>>105962703
>1boy
You need to leave.
Anonymous No.105963335
>>105962881
Please do May next
Anonymous No.105963346
Anonymous No.105963372 >>105964281
>>105961806
>>105961759
what model did you use for the pic?
Anonymous No.105963374 >>105963381 >>105963389 >>105963403
Is the light lora for WAN needed only for vramlets or does it offer any benefits over running just WAN itself?
Anonymous No.105963381 >>105963479
>>105963374
it has nothing to do with ram, you can just gen videos faster. 4-8 steps vs 20-30
Anonymous No.105963389
>>105963374
720p is unbearably slow without it. Doesn't matter if you've got a 5090
Anonymous No.105963403 >>105963413 >>105963479
>>105963374
Its the new meta, fullstop. More than double the speed over the original rentry workflow and pretty much no quality loss.
Anonymous No.105963410
Anonymous No.105963413 >>105966266
>>105963403
where can i read about this new meta
Anonymous No.105963432
Anonymous No.105963450
Anonymous No.105963479 >>105963511
>>105963381
>>105963403
Do I still need to run the new lora on lower strength to not cause slowmo?
Anonymous No.105963482 >>105963585 >>105963604
Anonymous No.105963489 >>105963494
>>105962703
>1boy
I'll never understand these people...You like an anime girl, and what you do is create 1boy too? You guys are crazy.
Anonymous No.105963494 >>105963500 >>105963507 >>105963547
>>105963489
is it gay to prefer to see a futa fuck a female over a 1boy?
Anonymous No.105963500 >>105963547
>>105963494
that's between you and your god
Anonymous No.105963507
>>105963494
extremely gay. gayer than gay sex
Anonymous No.105963511
>>105963479
no
Anonymous No.105963513 >>105963545 >>105965451
>>105962879
>>105962945
model? i mean, you don't even have to indulge me in the actual checkpoints or loras you're using, but some info on your workflow would be nice
those are the best sprites I've seen
Anonymous No.105963516 >>105963613 >>105963809
Anonymous No.105963520
Best model for this prompt?
>Concept art of an Evangelion unit from Neon Genesis Evangelion. It has a slender form with reduced armor. Flexible spine. Elongated head with rows of teeth. Two eyes at the front of the head, two at the side of the head. Digitigrade leg structure. Matte black with green accents. Extremely long tail-whip. Progressive talons on all limbs. In the style of Studio Gainax.
Anonymous No.105963545
>>105963513
Flux + Image Pixelate
Anonymous No.105963547
>>105963500
>>105963494

god here, its fine
Anonymous No.105963575
Blessed thread of frenship
Anonymous No.105963585 >>105963604
>>105963482
Anonymous No.105963591
Anonymous No.105963604 >>105963622 >>105963627 >>105963636
>>105963482
>>105963585
why he depressed now hes jacked af
Anonymous No.105963613 >>105963809
>>105963516
>new weird fetish...
>activated!
Anonymous No.105963622
>>105963604
But his shirt is torn, are you blind ?
Anonymous No.105963627 >>105963644
>>105963604
I am suddenly reminded of East Of Eden, and the brother scene. Fuck.
Anonymous No.105963636 >>105963647
>>105963604
he realized he has wasted his life doing it for free
Anonymous No.105963644 >>105963707
>>105963627
>In "East of Eden," the pivotal "brother scene" involves Cal revealing to Aron that their mother is alive and running a brothel, which leads to Aron's psychological breakdown and enlistment in the army. This act, driven by Cal's jealousy and desire to hurt Aron, has devastating consequences, including Adam's stroke and Aron's tragic fate.

dang....
Anonymous No.105963647
>>105963636
fat version
Anonymous No.105963654 >>105963664 >>105963677
I want all the cables to disconnect baka
Anonymous No.105963664
>>105963654
lol good luck with that
Anonymous No.105963676
Anonymous No.105963677
>>105963654
she already does whatever i ask her to do
Anonymous No.105963687 >>105963756
Anonymous No.105963707
>>105963644
not that one lol. If you dare, read about the one with the brother and sister.
Anonymous No.105963722
what's this guy up to
Anonymous No.105963726 >>105963736
Can I take pictures of real women I know and make them hotter and masturbate with AI then fill up a folder of these videos and accidentally leak it to them so they feel both disgust and that they're not pretty enough since people have to AI them hotter?
Anonymous No.105963733 >>105963739 >>105963767
how difficult is it to train/create my own LoRa? (to make a semi-consistent character)
I'm a monkey that just downloaded comfyUI a day ago and I feel like a caveman discovering fire
I've just been using the templates and plugging in stuff i got off civitAI but I don't know how any of this works outside of that
this stuff runs so much better than text generation on a single gpu it's unreal
Anonymous No.105963736
>>105963726
That's illegal
Anonymous No.105963739 >>105963749
>>105963733
>I'm a monkey that just downloaded comfyUI a day ago
my condolances
Anonymous No.105963741
Anonymous No.105963749
>>105963739
I looked at forge but haven't tried it yet
seems more confusing since it doesn't have the visual-aid little colored boxes and strings?
Anonymous No.105963756
>>105963687
spicy
Anonymous No.105963767 >>105963785
>>105963733
The most obnoxious part is tagging the dataset images. You can use AI + manual QC. I used Onetrainer (since it was the only one that told me what to do) and a dataset of 25 images got baked into a lora in ~30 minutes on a 4070S
Anonymous No.105963768 >>105963781 >>105963813 >>105964218
holy kino
Anonymous No.105963781
>>105963768
Anonymous No.105963785
>>105963767
thanks I'll check it out
Anonymous No.105963788
You fucking robot whore
Anonymous No.105963809 >>105963832 >>105964013
>>105963516
>>105963613
Anonymous No.105963813
>>105963768
reminds me of that initial d beach episode for some reason lol
Anonymous No.105963832
>>105963809
Stop, my penis can only get so erect
Anonymous No.105963901 >>105964391
accidental cinema
Anonymous No.105963936 >>105964111
was my mistake trying to do multiple mask areas at once? should i be just doing them 1 at a time?
Anonymous No.105963937 >>105964213
Anonymous No.105963984 >>105963997
My computers shit, can an anon make this animated slightly? it's retro 80's dark fantasy ish.
Anonymous No.105963997
>>105963984
stand by
Anonymous No.105964013 >>105964048
>>105963809
is this 1girl or 2girl
Anonymous No.105964048
>>105964013
(1girl:2)
Anonymous No.105964051 >>105964097
Even better
Anonymous No.105964097 >>105964129
>>105964051
ur pumping these out fast
how
Anonymous No.105964111 >>105964247 >>105967291 >>105967370 >>105967415
>>105963936
sorry what is the context here?
Anonymous No.105964129
>>105964097
No?
Anonymous No.105964162 >>105966035 >>105966521 >>105966931
>still check sdg sometimes to laugh at how bad their gens are
I don't understand why anyone would waste time and money generating such repetitive and ugly shit
Anonymous No.105964212 >>105966887
Anonymous No.105964213
>>105963937
shes upset because he didnt actually spank her
Anonymous No.105964218
>>105963768
these backgrounds look so good, mind sharing catbox?
Anonymous No.105964236 >>105964271
Anonymous No.105964247
>>105964111
Imagine the sons she would produce
Anonymous No.105964271
>>105964236
i too laugh at vampires
Anonymous No.105964273
Anonymous No.105964281 >>105964569 >>105966041
>>105963372
i didnt use a pic its t2v
Anonymous No.105964343
What's the current tech for consistency between gens? For example, making a character or a scene for several comics or visual novel panels? Making a small image lora, ipadapter, simple promptable designs, sketching out before genning, inpainting and editing afterwards? Has anyone had success in this regard?
Anonymous No.105964346 >>105964398
postcard No.105964370 >>105964386
>>105962692
Blessed thread of frenzone ;3
Anonymous No.105964386
>>105964370
Indeed!
postcārd No.105964391 >>105964923
>>105963901
iMA LET U FINISH BUT
Do don pachi dai oh jou black label was one of the best shmups of all time
Anonymous No.105964398
>>105964346
>You were masturbating to pictures of me, your own mother !?
Anonymous No.105964428 >>105964447 >>105964463 >>105964507
What prompt should I use if I want the girl's boobs to look like this?
>perky breasts
>pointy breasts
Already tried that, didn't achieve the desired effect.
Anonymous No.105964447 >>105964462
>>105964428
Wasn't this called something like triangle tits?
Anonymous No.105964462 >>105964507
>>105964447
I thought it might be "torpedo breasts" so I tried that, but I can never get the desired boobs.
Anonymous No.105964463 >>105964511
>>105964428
Tubular breasts? Cone-shaped breasts? Conical breasts? Swooping breasts? Constricted breasts?
Anonymous No.105964469 >>105964602
>>105962656 (OP)
Top right on catbox?
Anonymous No.105964471 >>105964569
How bad is T2V compared to I2V if I can't use the input images? Do I have to specify all the minute details even harder than with I2V or it gens a total mess?
Anonymous No.105964507
>>105964428
>>105964462
1girl gooning models are trained on datasets of professionally-drawn breasts that look like they have weight and obey gravity, so to make a breast look perfectly triangular, you have to "trick" the AI into giving you unrealistic breasts that an amateur would draw.
>"Durr durr boobs are like pyramids growing out of a girl's chest right? Durr durr I'll draw two triangles"
The art of generating triangle breasts involves tricking the AI into forgetting how to generate "correct" breasts and generate less realistic breasts than it normally would
But if you don't want to go through all that trouble and you just want a hack that gives you a relatively high success rate,
>small breasts, pointy breasts
works because the AI probably will usually not give weight or heft or sag if it's been instructed to generate small breasts.
Anonymous No.105964511
>>105964463
Swooping breasts with puffy nipples FTW
Anonymous No.105964530 >>105964569
>>105961964
This model is kino, you just have to figure out which keywords it recognizes. Once you figure that out there's like zero gambling, it either works or it doesn't.
Anonymous No.105964569 >>105964821
>>105964281
hot, catbox?

>>105964471
t2v is surprisingly good on understanding prompts

>>105964530
gimme an example
Anonymous No.105964602
>>105964469
Top left not animated why
Anonymous No.105964612
>>105957628
please, post/link the uncensored version
Anonymous No.105964796 >>105966079 >>105966767
has 4chan suddenly stopped eating mp4? I keep getting corrupted/unsupported error
Anonymous No.105964814
ok so apparently the site is fucked because it keeps telling me my vid is corrupted and you can't upload it.
Anonymous No.105964821 >>105965249 >>105966502
>>105964569
>gimme an example
I mostly use AI to coom so this is the best I can do on such short notice. There's this shift thing that this model uses that I've just been leaving at 1.0 so it's possible I could get better results messing around with that, dunno yet.
pos: (amateur photo:1.2) of a gigantic bright silver shiny metal robot dinosaur with round (glowing:0.75) purple glass eyes a gaping maw filled with razor sharp teeth, the inside of its mouth is (glowing:0.65) faintly crimson and blood drips from its fangs, trending, epic,
daytime in the park, low-angle view,
neg: low quality, toy, small, fat, furry, anthro, tongue, pupils, drawing, anime, fake, cgi
Anonymous No.105964846
Oh even webms are fucked
Anonymous No.105964923 >>105965106
>>105964391
>mfw Cave still exists
That's impressive, so few of the smaller oldskool japanese arcade game companies are still around these days.
Anonymous No.105965058
>>105963008
did someone say CHOPPY?
PØȘŤĊĄŘĎ No.105965106 >>105965188
>>105964923
M2 shot triggers carrying that weight
I have a few aging arcade pcb
But soon they will fail &
Return to dust ;
Ash to ash
Anonymous No.105965188 >>105965327
>>105965106
There's always MAME, not the same as the real thing, but still.
Anonymous No.105965249
>>105964821
Put the coom in a catbox so anon can see
Anonymous No.105965327
>>105965188
Jamma or bust
Anonymous No.105965451 >>105965947
>>105963513
>captcha SDGX4
This image is a digital pixel art composition featuring a collection of fantasy-themed items. The background is plain white, which contrasts sharply with the detailed, colorful objects arranged in a grid-like pattern. Lines are distinct and emphasized with sharp white or dark lines for contrast. All objects are oriented in a vertical position.

In the left from top to the middle, a large detailed {ornate|ancient|etc} magical {chainmail|plate|full suit} medieval armor with {spiked pauldrons|a family crest|a glowing runic enchantment|multiple layers of armor} and {scuffs|dents|light cracks|heavy cracks|perfect condition} is depicted. Next to it are blah blah blah is depicted.

In the center from top to the middle, a large detailed {ornate|ancient|arcane|simple} magical {steel|iron|copper|gold} {long sword|battle sword} with {simple|bejeweled} pommel, blah blah blah is depicted.

In the bottom row there are two {amulets|necklaces|medallions|pendants} blahblahblah

In the left from middle to bottom a large detailed {round|kite|tower} shield is depicted, {fresh and vibrant|dry and aged|rotten and spoiled} with {fishbone|houndstooth|broken line|tile|harlequin} patterns, with a {wooden|metallic} frame trim.

Additionally, a blahblahblah

All objects share a theme of {leaves, flowers, nature, flora, and the spring season|death, decay, rot, bones, and gore|ice, snow, stones, dwarven runes|led light strips, sci-fi cyberpunk technology|astrology, stars, and the cosmos|ghosts, nostalgia, eerieness, and the supernatural}, with occasional motifs of {angel wings|bat wings|viper tongues|beast claws|phallic symbols}.

The style of the image is classic pixelated pixel art. Shading is dithered.
Anonymous No.105965512
Am I stupid or is installing SageAttention 2.2 using more VRAM than SageAttention 1.0.6? Anyone know what's up?
Anonymous No.105965696
someone please make me a Daria lora
Anonymous No.105965724 >>105965833 >>105965843
How come that Wan text-to-image killed Flux?
Anonymous No.105965763 >>105967154
test are we back
Anonymous No.105965783 >>105965833
Now there are three or four aspiring local models trying to surpass SDXL and subtitle images with prose like Flux,
1- Neta slop
2-illustrouslop
3-RouWeislop
4-Chromaslop
Anonymous No.105965812
Okay, T2V is wild. What the fuck is happening
Anonymous No.105965828 >>105966067
radial attention was snake oil
i award myself the fell for it again award. again.
Anonymous No.105965833 >>105965920
>>105965783

this >>105965724

We did not see the forest for the trees
Anonymous No.105965843 >>105965867
>>105965724
How does that work? 1fps video? Or is there a specific mode for that?
Anonymous No.105965867 >>105965929 >>105967358
>>105965843

this guy has delivered

https://www.youtube.com/watch?v=G1F13R-WpO0
Anonymous No.105965920
>>105965833
Based
Anonymous No.105965929
>>105965867
Can it generate anime girls?
Anonymous No.105965935 >>105966384
almost
Anonymous No.105965947 >>105965966
>>105965451
This ,{a|b|c}, are random prompts right? Can I do that in Forge or do I have to have an extencion?
Anonymous No.105965959 >>105966053 >>105966066 >>105966073
Anonymous No.105965966 >>105965981
>>105965947
It picks randomly from one of them yeah. I think there was a way to have it pull from a database instead of bloating the { } too much, but I forgor.
Anonymous No.105965980
Anonymous No.105965981 >>105965992
>>105965966
Thanks, what you try to recall its called dynamic prompts but i prefer to bloat because I don want to have notepads for that

Can i do what you do in WebUI or did you do it in another UI?
Anonymous No.105965992 >>105966001
>>105965981
No clue. Isn't that just SD prompt syntax? Idk if it is native to comfy or not.
Anonymous No.105965993
nyyyyo my fluxerinosss my futureeeee my fenec fox girl
Anonymous No.105966001
>>105965992
Oh, neat I wil try it!
Thanks nonie
Anonymous No.105966011 >>105966096
Anonymous No.105966035
>>105964162
listen, just let them be contained alright
Anonymous No.105966036 >>105966195
Finally, pistol round AR
Anonymous No.105966041
>>105964281
oh. can i ask for the prompt then?
Anonymous No.105966053 >>105966066
>>105965959
>Wan text to image out of the box
Anonymous No.105966066
>>105966053

The prompt was generated this llama-joycaption-beta-one-hf-llava from this >>105965959 image


>This is a vivid, digitally enhanced photograph of a young Latina woman in a tomato field, captured in the rain. She has medium-dark skin, long black hair with bangs, and dark eyes. Her expression is focused and slightly pensive. She is wearing a wide-brimmed straw hat, blue denim overalls that accentuate her large breasts, and black rubber gloves. Raindrops are visible on her overalls, hair, and hat, adding a wet sheen to her skin. She holds two ripe, red tomatoes in each hand, positioned to the left and right of her upper body. The background shows lush green tomato plants with more tomatoes and blurred trees in the distance, suggesting a rural farm setting. The overcast sky and rain create a moody, almost cinematic atmosphere. The textures are highly detailed, with the denim overalls looking rugged and the tomatoes appearing juicy and fresh. The overall mood of the image conveys a sense of hard work and natural beauty. The digital enhancement adds a slightly surreal, almost hyper-realistic quality to the photograph. The woman's calm, determined demeanor contrasts with the stormy weather, suggesting she is undeterred by the rain.
Anonymous No.105966067 >>105966884
>>105965828
sauce? i had a sinking feeling that was the case
Anonymous No.105966073
>>105965959
someone show this to trump
Anonymous No.105966078 >>105966100 >>105966135 >>105966179 >>105966242
IllusioN-R: a model for making awesome anime-style art. Great for anime characters, waifu, and playful fanservice.
Recommended settings

Resolution: 1024x1024, 1152x896, 896x1152, 864x1152, 1152x864

CLIP skip: 2

Sampler: Euler a

Sampling Steps: 20-40

CFG Scale: 3-7

Hires upscale: 1.5, Hires steps: 20, Hires upscaler: Remacri, Denoising strength: 0.35~0.5

Use Adetailer for face details

https://civitai.com/models/1604942/illusion-r-nsfw-illustrious-xl
Anonymous No.105966079
>>105964796
Try downloading the file with a different browser. I kept getting that issue on Windows 11 with brave for some reason.
Anonymous No.105966096
>>105966011

>This is a highly detailed, hyper-realistic digital photograph of a young woman standing in a narrow, wet alleyway on an overcast day. She is of Latina ethnicity with a light brown skin tone, long straight black hair with bangs, and full lips. Her large, dark eyes are looking directly at the camera with a confident and slightly sultry expression. She has a curvy physique with large breasts, a slim waist, and a tattoo of a butterfly on her chest.

>She is wearing a bright pink, strappy bikini top that is slightly wet, revealing her cleavage, and matching pink sports shorts with a black drawstring. Her midriff is exposed, showing a navel piercing. She is holding two wine bottles in her hands, one in each hand, and both are also wet, suggesting recent exposure to rain.

>The alleyway is narrow with dark gray buildings on either side, featuring closed garage doors and a few scattered items on the wet pavement. The ground is shiny from the rain, reflecting the woman and the surrounding buildings. The sky is overcast, casting a diffuse light over the scene, and the air appears cool and damp. There is a slight sense of urban grunge, contrasting with the vibrant color of her outfit. The overall mood of the photograph is both bold and sensual, with the wet conditions adding a layer of rawness to the image.
Anonymous No.105966100 >>105966128
>>105966078
>absolutely no information on how the model was made
Anonymous No.105966128 >>105966140 >>105966164
>>105966100
"About this version
Better detail
Nicer lighting
Richer effects
More expressive"

Vibe merging
Anonymous No.105966135 >>105966145 >>105966149 >>105966156 >>105966216
>>105966078
>Wan text to image, no loras

>This is a digital anime-style illustration of the popular virtual singer, Hatsune Miku. The art features vibrant, high-contrast colors, primarily in shades of blue, pink, and white. Miku, a young Japanese woman with long, aqua-blue twin-tails, is depicted from the chest up, gazing upward with a serene, slightly dreamy expression. Her large, luminescent blue eyes are prominently featured, reflecting light and adding a sense of depth to her gaze. She wears her signature futuristic black and blue outfit with pink accents, which includes a high-collared shirt and a large, circular headset with a pink and red pattern. The background is a dynamic mix of bright, swirling colors, creating a sense of movement and energy. The texture of the illustration is sharp and bold, with thick, energetic lines that add to the sense of motion and vibrancy. The overall mood of the piece is futuristic and ethereal, capturing Miku's iconic blend of technological and musical elements. The colors and composition emphasize her otherworldly presence and the sense of being transported into a virtual realm. The artist has effectively used digital techniques to create a vivid, almost glowing effect, typical of modern anime art.
Anonymous No.105966140 >>105966168
>>105966128

>trained to copyrighted material scrapped from internet
Anonymous No.105966145
>>105966135
AAAAAIIIIIIIIEEEEEEEE
Anonymous No.105966149
>>105966135
Anonymous No.105966156 >>105966170
>>105966135
Ok wan is the next step for SDXL without Flux bloat

What do we need? LORAS?
Anonymous No.105966164 >>105966189
>>105966128
For what reason do they not disclose what went into the merge and how it was made? Are they profiting from being closed-source somehow?
Anonymous No.105966168 >>105966181
>>105966140
Based?
Anonymous No.105966170 >>105966200
>>105966156

It is essentially the same workflows

Use loras of your choice
Anonymous No.105966179
>>105966078
so sick of this style
Anonymous No.105966181 >>105966241
>>105966168

Me being edgy today
Anonymous No.105966189 >>105966395
>>105966164
90% of checkpoint mergers don't know what they're doing. They simply merge with models they personally like.
Anonymous No.105966195 >>105966206 >>105966225
>>105966036
I forgot to disable a realism lora
Anonymous No.105966200
>>105966170
Im a gooner.
What you are saying to me is that SDXL loras = Wan Loras?
Anonymous No.105966206 >>105966225
>>105966195
>realism lora OFF
Anonymous No.105966216 >>105966298
>>105966135
Could you generate a sexy pic of Ishtar from Fate Grand Order in the style of Ufotable?
Anonymous No.105966225
>>105966206
>first game
>>105966195
>sequel
Anonymous No.105966241 >>105966261
>>105966181
I'm not a cloud friendly zoomer like you, I grew up in the generation of eMule, Ares, and megaupload. Stealing is okay.
Anonymous No.105966242 >>105966258 >>105966273
>>105966078
>pure wan text to image
Anonymous No.105966258 >>105966273 >>105966276 >>105966298
>>105966242
Thanks for trying this model! I hope it meets your creative needs and brings you joy. And don't forget to give a if you like it! ;>

If you enjoy my work, you can support me on Ko-fi or become a patron on Patreon – every little bit helps!
Anonymous No.105966261
>>105966241
>megaupload

I remember the time when rapidshare was a thing
Anonymous No.105966266
>>105963413
Just use the lightx2v workflow in the rentry guide but use the latest lightx2v lora.
Anonymous No.105966273
>>105966242
>>105966258
They make a living out of merge checkpoints, DO NOT USE WAN
Anonymous No.105966276
>>105966258
i'm gonna barf
Anonymous No.105966294
12 gigs vram enough for basic video or should I just give up
Anonymous No.105966298 >>105966309 >>105966345
>>105966258
>Thanks for trying this model!

It is the vanilla Wan2.1 t2v btw

>>105966216
no re-styling loras applied. It catches the description quite good
>This digital anime-style illustration features a young female character with long, flowing black hair adorned with two large bows. Her skin is pale, and she has striking red eyes. She is wearing a revealing black and gold bikini, with intricate gold patterns on the top and bottom. Her bikini top accentuates her moderately sized breasts, and her bikini bottom is high-cut, highlighting her slender, toned legs and flat stomach. She is also wearing black thigh-high stockings with gold accents. The character is winking playfully with her right eye while extending her right hand towards the viewer, as if inviting them to touch or playfully interacting with her. Her left hand rests on her hip.

>The background is a vibrant, partly cloudy sky, with shades of blue and white, giving the impression of a bright, sunny day. The character's hair and bikini contrast sharply against the sky, drawing attention to her figure. The illustration includes a gray border on the left and right sides, framing the character. The overall emotional state of the character appears confident and playful. The art style is typical of anime, with bold lines, bright colors, and exaggerated features. The illustration includes Japanese text in the top left corner, adding an authentic touch to the anime aesthetic. The character's ethnicity is not explicitly specified, but her appearance is typical of a Japanese anime character.
Anonymous No.105966309 >>105966345
>>105966298
>re-run with same prompt
Anonymous No.105966345 >>105966375 >>105966592
>>105966298
>>105966309
Okay, so WAN can generate more consistent images than SDXL. How do we tell the entire SDXL community at Civit AI to stop messing around with SDXL and start using WAN as a checkpoint, Lora, etc.?
Anonymous No.105966356
Someone call the chroma furry and tell him his project just got wanned
Anonymous No.105966375 >>105966592
>>105966345
I think that at this point, SDXL slopper s and civit AI are so deeply involved with SDXL that it would be difficult to switch to another model, even if it were a better one.
Anonymous No.105966384 >>105966394
>>105965935
Anonymous No.105966394
>>105966384
Anonymous No.105966395 >>105966433
>>105966189
>People will say this is a bad thing and be unable to explain why.
Anonymous No.105966433
>>105966395
Let me explain why a checkpoint must be balanced. Same copyrighted character, different scenarios, different posture and lighting, and their style becomes unstable and changes. That is a checkpoint error.
Anonymous No.105966501 >>105966577
Anonymous No.105966502
>>105964821
I guess I will ask joycaption to add some metallic look to the description

>The photograph captures a massive, metallic T-Rex sculpture standing in a dimly lit, eerie outdoor setting. The sculpture's head is the focal point, dominating the frame with its open, menacing mouth. The T-Rex's metallic surface reflects the surrounding light, creating a stark, glossy sheen that enhances its ominous presence. Its large, purple, glowing eyes stand out, adding an unsettling, almost supernatural quality to the creature. The mouth is wide open, revealing sharp, silver teeth and a cavernous, red-lit interior that appears to be illuminated from within, giving it a bloodthirsty, almost alive appearance. The sunlight filtering through the nearby trees casts a harsh, contrasting light on the sculpture's surface, creating sharp shadows that add to the dramatic, foreboding atmosphere. The background includes dark, silhouetted trees, which further emphasize the sculpture's menacing form. The overall composition and use of light and shadow evoke a sense of horror and dread, as if the T-Rex is ready to pounce at any moment. The photograph's high contrast and vivid colors amplify the dramatic, unsettling mood, drawing the viewer into a sense of impending doom.
Anonymous No.105966521
>>105964162
Catbox?
Anonymous No.105966577 >>105966612
>>105966501
>This image depicts a young Caucasian woman standing confidently on a red carpeted stage with rich red curtains in the background. She is wearing a tight, shiny black latex bunny costume, including a strapless leotard that accentuates her curvy, hourglass figure, a black bow tie around her neck, white cuffs on her wrists, and a pair of black bunny ears with green ribbons on top of her head. Her long brown hair is styled in twin pigtails that drape over her shoulders. She has a fair skin tone and large, expressive brown eyes. She is holding a microphone in her right hand, which is raised to shoulder level, and her left hand is resting on her hip, exuding confidence and charisma. Her facial expression is neutral yet slightly alluring, with a subtle smile. The lighting is bright and even, highlighting her figure and the glossy texture of her outfit. In the background, there are two black speaker systems labeled "JBL" positioned on either side of her, adding to the stage setup. The overall mood of the image is sultry and playful, with a focus on the woman's sexuality and stage presence. The image is framed with gray borders, and the texture of the latex costume is highly realistic, emphasizing the tight fit and reflective quality of the material.
Anonymous No.105966592 >>105966643 >>105966912
>>105966375
>>105966345
SDXL will never die because the ability to conjure porn with tags and broken english is the biggest retard crutch on the planet.
Anonymous No.105966612 >>105966661
>>105966577
frame removed
Anonymous No.105966643 >>105967180
>>105966592
yes, SDXL it's the thirld worlders image gen model,
Anonymous No.105966648
https://techcrunch.com/2025/07/18/netflix-starts-using-genai-in-its-shows-and-films
Anonymous No.105966661
>>105966612
She is one of us!

thank you, comrade Xi!
Anonymous No.105966668 >>105966870
Text is so easy with Wan. It just werks
Anonymous No.105966709 >>105966787
Are we suppossed to use the light lora for WAN T2I too? How does the lora rank affect the outcome in still image?
Anonymous No.105966767
>>105964796
Can Wan create new memes?
Anonymous No.105966787 >>105966829 >>105966901
>>105966709
I'm using it

And yes, I believe whatever lora will always collide with the model's logic

>picrel
>This is a digital, line-art drawing of a cartoonish, anthropomorphic character. The character is a white, round-headed figure with a smooth, featureless body, suggesting a minimalist or abstract style. Its facial features are exaggerated and anxious: it has two large, black, almond-shaped eyes that are slightly wide with a hint of desperation, a small, straight black line for a mouth that is slightly open in a worried expression, and a slight upward tilt, giving a look of urgency. The character's right hand is raised, with fingers extended in a waving gesture, and the palm is facing outward, as if pleading for help. The background is a solid, light blue color, which contrasts with the white character and makes it stand out.

>The character's head is tilted slightly to the right, with its eyes and mouth looking slightly upwards, emphasizing its anxious and desperate state. There is a speech bubble emerging from the character's right side, containing the text "LDG halp!" in black, bold letters, adding to the sense of urgency and desperation.

>The drawing uses clean and thick black lines, giving it a bold, cartoonish appearance. The character's overall appearance and the positioning of the eyes, mouth, and hand convey a sense of nervousness and desperation as it seeks help. The lighting is even, with no shadows or gradients, keeping the focus on the simplicity and clarity of the character's expression and gesture.
Anonymous No.105966829
>>105966787
I start to like it

I mean I give joycaption a picture to get its description, then I ask the sane VL model to modify the expression

And it works!
Anonymous No.105966842 >>105967232
>>105962703
Piss off spammer
Anonymous No.105966870 >>105966903
>>105966668
>nooo you must use comfyui , read my tutorials, buy flux api nodes and use a censored model, think about the future, look at thr piciture of my slop fenec fox girl!
Anonymous No.105966880
chroma turbo lora fucking when
Anonymous No.105966884
>>105966067
hopefully it was only the bad implementation: https://www.reddit.com/r/StableDiffusion/comments/1m3pock/holy_speed_balls_it_fast_after_some_config/n3z4bud/

maybe a native node will work better, much like the initial NAG stuff when that was new.
but.. it's not looking good.
Anonymous No.105966887 >>105966907 >>105967053 >>105967167
>>105964212
Wan strikes again
Anonymous No.105966901 >>105966922 >>105966931
>>105966787
Did you try removing all the GPT yapping prose bloat ? And combine it with tags like SDXL and where prose is needed, implement it properly?
Anonymous No.105966903
>>105966870
>you must use comfyui

I apologize Master, but I'm using comfy
Anonymous No.105966907
>>105966887
Compare it with Kontext slop
Anonymous No.105966909 >>105967103
Anonymous No.105966912 >>105966926 >>105967083
>>105966592
sdxl and its finetunes are easy to run and are not cucked with censorship compared to the newer and larger models. In the end of the day, i just want to generate 1girl with ease and transport my gens to wan for i2v.
Anonymous No.105966922 >>105966939
>>105966901
Wan was trained with the yapping
Anonymous No.105966926
>>105966912
Okay, but now think about the people in the third world.
Anonymous No.105966931 >>105966953
>>105966901
>GPT yapping prose bloat

I might try in the future. It's my first weekend with the joycaption model

>https://huggingface.co/mradermacher/llama-joycaption-beta-one-hf-llava-GGUF/tree/main

I guess I can ask it right in the chat

>>105964162
wan is winrar
Anonymous No.105966939 >>105966964
>>105966922
Ok so i must learn2yap?
Anonymous No.105966953 >>105966963
>>105966931
I can assure you that Llama or any local model is trained with GPT slop, so it doesn't matter which model you use.
Anonymous No.105966963
>>105966953
t. disingenuous faggot
Anonymous No.105966964 >>105966990 >>105967021
>>105966939
Use VL models to write the prompt for you
Anonymous No.105966990 >>105966997 >>105967012 >>105967217
>>105966964
i think ai is done for if the future is all ai sloptext for prompting. it has gone full retard where you need an v/llm to describe an image just so u can get a gen that doesn't look shit.
chroma is cool and all but the need to describe everything so disgustingly verbose makes me want to kms
Anonymous No.105966997
>>105966990

I floats my boat, so I go with it
Anonymous No.105967012
>>105966990
THIS time for sure its done lil bro, 100%, 5th billionth time is the charm, any 2 weeks now
Anonymous No.105967016 >>105967042 >>105967055
Does anyone know how to assign variables within variables?

## assign breast size
${breasts=huge breasts}

## adjust breast size
${adjustboobsize=${breasts=small breasts}
}

##
## call variable to reassign breast variable
${adjustboobsize}

## prompt
1girl, solo, ${breasts}


The thing just blows up if I call ${adjustboobsize} and I think I get a syntax error. Anyone know what I'm doing wrong or if it's even possible?
Anonymous No.105967021
>>105966964
Does the joycaption that is run on HF filter freak fetishes and loli?
Anonymous No.105967042 >>105967072
>>105967016
What syntax is that, even? What UI supports this?
Anonymous No.105967053 >>105967181 >>105967318 >>105967337
>>105966887
Based.
Also wan.
Anonymous No.105967055 >>105967072
>>105967016
are you the same guy that asked if you can prompt with color codes? wtf is wrong with you?
Anonymous No.105967072
>>105967042
it's just run of the mil stable diffusion, assuming you have dynamic prompt installed.

>>105967055
Why are you asking what's wrong with me if you're not sure who I am? I'm not that guy nor do I know what color codes is. I just have a list of 8 girls in my prompt randomized and I want to be able to adjust their boob size/body type on the fly.
Anonymous No.105967083
>>105966912
Wan refused to follow the prompt and cover her nipples
Anonymous No.105967103 >>105967138
>>105966909

Wan just can't stop winning

>This image is a surreal and terrifying digital illustration featuring a glowing, skeletal skull with bright pink, menacing eyes at its center. The skull is adorned with a large, coiled snake with intricate, colorful scales that resemble an ancient, mythical pattern. The snake's head is positioned directly above the skull, giving the impression that it is about to consume or possess the skull. The background is a dark, star-filled night sky, filled with swirling clouds of vibrant pink and blue neon lights, adding to the otherworldly and unsettling atmosphere. The bottom of the image is framed by jagged, rocky terrain that looks like a lunar or extraterrestrial landscape. The overall composition is both hypnotic and frightening, with the combination of the glowing eyes, colorful snake, and neon sky creating a sense of unease and cosmic horror. The image is incredibly detailed, with a hyper-realistic texture to the bones and scales, which enhances the eerie and fantastical elements of the scene. The use of neon colors against the dark background adds a surreal, nightmarish quality to the illustration. The entire image conveys a sense of cosmic dread and supernatural menace, making it both visually striking and unsettling.
Anonymous No.105967138
>>105967103

what a friendly smile
Anonymous No.105967154 >>105967169
>>105965763
now turn it into a full feature 3 hour movie by stringing together a bunch of 5 s videos. go!
Anonymous No.105967167
>>105966887
looks heavily baked tho
Anonymous No.105967169 >>105967203
>>105967154
nta

She won't survive 3-hour long ordeal though
Anonymous No.105967180
>>105966643
> yes, SDXL it's the thirld worlders image gen model,
I would guess that's still 1.5 and also clinging to SAAS scraps.
Anonymous No.105967181 >>105967318 >>105967337
>>105967053

I wanned your wan gen once again

>This is a digital anime-style illustration of a young woman with light skin and a slightly flushed complexion, standing in a shallow, clear blue pond. She has short, dark purple hair with pinkish tips, and bright green eyes that are slightly narrowed, giving her a confident and slightly mischievous expression. Her physique is slender yet curvy, with large, prominent breasts that are accentuated by her dark blue, shiny, halter-neck one-piece swimsuit. The swimsuit is tight-fitting and reveals a significant amount of cleavage, with a small keyhole cutout just above her navel. She wears a black choker around her neck.

>The background features a lush, green pondside with large, broad leaves and some rocks partially submerged in the water. The sunlight filters through the foliage, creating a mix of bright and shadowy areas, with light reflections dancing on the water's surface. Small green lily pads can be seen floating in the pond.

>The lighting in the image is bright and natural, highlighting her wet hair and the sheen of her swimsuit. Her pose is relaxed yet slightly forward-leaning, with her hands partially submerged in the water. The overall atmosphere is serene and slightly playful, capturing a moment of calm in a natural setting. The artistic style is crisp and vibrant, with clean lines and a focus on bold, contrasting colors.
Anonymous No.105967198
Anonymous No.105967202
what if the man in the moon didn't want to be there
Anonymous No.105967203
>>105967169
>She won't survive 3-hour long ordeal though

she will if she's eternalized in a LORA
Anonymous No.105967211 >>105967328
Anonymous No.105967217
>>105966990
sounds like over reliance on prompting technique for a different model. i came from stable diffusion, and it took me a while to get how to do it for t5.
my process in chroma has been to generate one image per tag. if the result is coherent, then i know how and that chroma knows that tag, and i'll use it directly. if i get no output, then i figure out how to prompt for the concept longhand.
there are things that chroma knows well, that it has a strong idea about, that it has mixed understanding of, that it doesn't know at all, and that it gets wrong because of bad/poisoned tagging in datasets. not trying to shoehorn it into your old sd-style prompts is the best way to start getting the most out of it.
Anonymous No.105967232 >>105967285
>>105966842
huh? how am i spamming?
Anonymous No.105967248 >>105967302
What joycaption settings do I pick for I2prompt Wan? Just descriptive, since wan can eat normal english?
Anonymous No.105967268 >>105967292
Can someone explain to me what chroma and flux are?

How are they different from using the ComfyUI w/ Illustrious from the rentry guide?
Anonymous No.105967285
>>105967232
kek, i think he was acting pre-emptively, after having (perhaps wrongfully, perhaps thinking you were DEBOAN) identified you as a spammer.
Anonymous No.105967291
>>105964111
God damn, anon, that's pretty hot.
Anonymous No.105967292 >>105967392
>>105967268
Just different models
Anonymous No.105967302
>>105967248

>describe the picture
Anonymous No.105967315
I put the interpolated version here https://files.catbox.moe/8rpwrl.mp4
This one smoothed out really well
Anonymous No.105967318 >>105967337 >>105967347 >>105967384
>>105967181
>>105967053
again, how do we tell the SDXL model creators to stop sloping in SDXL and use superior WAN?
Anonymous No.105967328 >>105967347 >>105967367
>>105967211
I tried...

>In this apocalyptic, digital artwork, a towering, black, crumbling monolith stands as a haunting sentinel amidst a desolate, war-torn landscape. The monolith's jagged, charred surface is illuminated by intense, crackling white lightning that zigzags across a dark, ominous sky filled with swirling, ash-gray clouds. Below, a river of glowing, molten lava cascades down the monolith's base, casting an eerie, fiery red glow that contrasts sharply with the surrounding darkness. The barren ground is littered with scorched, skeletal trees, their blackened branches reaching towards the sky like skeletal fingers. In the background, the horizon is shrouded in a thick, oppressive fog, adding to the scene's sense of foreboding and despair. The overall color palette is dominated by dark grays and blacks, with flashes of white lightning and red lava providing the only bright, striking colors. The image exudes a sense of doom and destruction, capturing the raw, unrelenting power of nature's wrath and the bleak, desolate beauty of a post-apocalyptic world. The edges of the image are framed with a gray border, emphasizing the isolation and severity of the scene. The artwork's digital style is hyper-realistic, with meticulous attention to detail in the textures of the monolith, lava, and charred trees, enhancing the dramatic and unsettling atmosphere.
Anonymous No.105967331 >>105967593
I've been out of the loop for a while, can someone please remind me how to improve faces?
This is just with a prompt line
>1girl, seifuku, sailor collar, neckerchief, school uniform, white hair, blue eyes, smile, looking at viewer
using waiNSFWIllustrious_v140
Do I need some kind of post-processing? My goal is to generate dozens of images with no manual intervention so I don't want to have to inpaint, etc
Anonymous No.105967337 >>105967358 >>105967456
>>105967318
>>105967181
>>105967053
Same anon, I don't have problem in doing it myself, I have to create an Article in civitAI? what workflow, UI, do you use? I will use your images as exaple.
Anonymous No.105967347
>>105967318
>how do we tell the SDXL model creators


it is the job for their therapists

>>105967328
Anonymous No.105967358
>>105967337
>what workflow, UI, do you use?

from this video >>105965867
The workflow is in the description
Anonymous No.105967362 >>105967964
I've been messing with AI for quite some time now, but not at the level you guys seem to operate. I'm not even proficient with prompts as I just describe what I want to see in normal language.

I used Midjourney when it was introduced and I really liked mixing styles together to get something unique.

So 1st question is, is there a benchmark or a list somewhere that evaluate what mainstream models are good at base level for image gen? Like GPT could be good for portrait picture, another model could be good for scenery, another for anime...

2nd question, I'd like to build a small pipeline where I can output a consistant style across different types of subjects (characters, backgrounds, items...). I'm not sure where to start as what you guys seem to do is pretty technical, and also I have an AMD GPU (9070) which I think would block ways to get into it.
When I say pipeline, it would be a fixed pipeline requiring almost no tweaks even if it impacts negatively the result of the pictures slightly (if that's possible as of now).

Any recommendations?
Pic related is a 3D model I generated for Tabletop Simulator where I managed to somewhat create a "pipeline" to obtain consistant results creating several models.
Anonymous No.105967367 >>105967488 >>105967715
>>105967328
Not too far off:
> fantasy illustration, dark fantasy, painting, A blasted wasteland covered in patches of molten ash, where a single obsidian monolith stands at the center of an impact crater. Lightning cracks above, arcing from cloud to ground, drawn to the monolith’s rune-covered surface. Blackened dead trees surround the monolith. The sky above is a turbulent mix of purples and grays, with roiling storm clouds.
Anonymous No.105967369 >>105967397
Do wan loras work with wan fun camera? It's not working for me, I must be doing it wrong. I'm using the example workflow which generates the matplot crap (removed from the output stream). There's a wan lora loader which will connect to the lora input used in the example, but as you can see from this gen, it doesn't work.
Anonymous No.105967370 >>105967379
>>105964111
I might make her look a bit more mature
Anonymous No.105967379
>>105967370
Anonymous No.105967384 >>105967405 >>105967415
>>105967318
I recently asked a lora artist friend of mine if he's tried Wan, he said he did and the results weren't good, so he's sticking to Illustrious for now.

I guess we need SDXL lora artists to start pumping out Wan T2I I2I specific loras. Otherwise the SDXL community won't switch over.
Anonymous No.105967392 >>105967409 >>105967433 >>105967598
>>105967292
Can you elaborate? I see people here talking about chroma all the time. Why use chroma over the tried and tested?
Anonymous No.105967397 >>105967423
>>105967369
Did you try Ani_Wan?
Anonymous No.105967405
>>105967384
I made a tool to help with building loras from video stills:
https://huggingface.co/quarterturn/facesaver

I've used it with my molmo captioner (preferred because it is the least censored) to pretty easily crank out lora training datasets.
Anonymous No.105967409
>>105967392
Because SDXL hits its limits when you try to do anything more complex.
Anonymous No.105967415 >>105967456 >>105967471
>>105967384
>Wan T2I I2I specific loras

My guess is that any existing Wan t2v style lora would work

>>105964111
this shit is just too realistic

>Mommy, please don't punish me! I will eat my broccoli
Anonymous No.105967423 >>105967451 >>105967543
>>105967397
You mean animanon's comfy-replaceemnt? No, give me a link or something.
Anonymous No.105967433
>>105967392
>Why use chroma over the tried and tested?

Chroma is trained by some guy who just wants to know
Anonymous No.105967451 >>105967617
>>105967423
No. Aniwan is an anime fine-tune of wan, not a tranny front-end
Anonymous No.105967456 >>105967473
>>105967337
https://files.catbox.moe/nex1by.png
my wan workflow
the only modification I've made was to add a node so you can pick the resolution in megapixels

>>105967415
made that one on biglust
Anonymous No.105967471 >>105967560 >>105967788
>>105967415
Could you try one of these?
https://huggingface.co/quarterturn/wan-2.1-14b-t2v-bbw
https://huggingface.co/quarterturn/bbwhot
I'm mostly curious if the wan lora is baked enough. I think it is, but I'd like to see other people try it.
Anonymous No.105967473
>>105967456
>made that one on biglust

Mine is made with wan
>This is a photograph of a 37-year-old mature woman with a fair skin tone and dark, short, slightly wavy hair. She has a curvy physique with very large, prominent breasts and wide hips. She is wearing a teal one-piece swimsuit that has the brand name "SUGOI DEKAI" printed in two lines in white on the chest. The swimsuit is high-cut, revealing her thighs and accentuating her hourglass figure. She is standing poolside, leaning slightly back with her hands resting on the pool's edge, which is adorned with small, square, white tiles. Her legs are slightly apart, and she is looking directly at the camera with a neutral expression.

>The background features a clear, bright blue swimming pool with a white metal frame visible in the upper left corner. The pool is surrounded by a sunny, green landscape with tall trees and a clear blue sky. The sunlight is bright, casting sharp shadows and highlighting her pale skin and the texture of her swimsuit. The photograph has a high-contrast, slightly over-saturated style, adding a vivid, almost digital quality to the scene. The woman's confident stance and direct gaze suggest a sense of self-assuredness and calm.
Anonymous No.105967488 >>105967534 >>105967560 >>105967588 >>105967607
>>105967367
we are so back

>This oil painting depicts a serene, enchanted forest at twilight, bathed in soft, ethereal light. The focal point is an ethereal, white-winged angel seated on a luminous, moss-covered rock in the left foreground. The angel's delicate, angelic wings are intricately detailed with fine feathering. His eyes are closed, and he has a serene, focused expression as he tends to a wounded deer lying before him. The angel is gently laying his hands on the deer's side, transferring a glowing, translucent aura of mana that radiates from his fingertips, enveloping the deer in a soft, greenish-blue light.

>The deer, initially lifeless and still, begins to stir under the angel's touch, its eyes opening with a glimmer of renewed life. Surrounding the angel and deer, the forest is dense with dark, shadowy trees, their leaves creating a natural canopy above. Small, twinkling fireflies and a soft, blue-green aurora in the background add a magical, otherworldly atmosphere. A gentle, reflective stream flows through the forest in the background, its surface mirroring the soft lights and colors of the scene. The overall composition exudes a sense of peace, healing, and the gentle magic of nature. The artist's use of light and shadow creates a realistic yet fantastical atmosphere.
Anonymous No.105967534 >>105967623
>>105967488
If you got that in one go that's pretty impressive since it took pixelwave flux some finagling to get her hand to point towards and the spell to actually touch the deer.
Anonymous No.105967543
>>105967423
https://civitai.com/models/1626197?modelVersionId=1840561
Anonymous No.105967555
How do you fix the genital gore issue in wan? What's a good lora + weight?
Anonymous No.105967560 >>105967788
>>105967471
sorry, I'm too retarded to be able to quickly swap GGUF model loader for the loader of .safetensors

>>105967488
Anonymous No.105967570
good lord do i hate comfyui, just work you piece of shit
Anonymous No.105967588
>>105967488
wan is impressive but needs a finetune to add art styles to really be competitive with anime models or chroma.
Anonymous No.105967593
>>105967331
Use adetailer. There's probably an extension or section for it on Forge/Reforge. On Comfy you use the Impact nodes. See https://rentry.org/comfyui_guide_1girl#face-and-hand-detailing
Anonymous No.105967598 >>105967669
>>105967392
Chroma isn't great at anime pictures right now (might change once people start training loras) but it can do some compositions that SDXL can't and it can also make some pretty good looking "amateur photographs" pictures.
Anonymous No.105967607 >>105967634
>>105967488
why'd you get a she when the prompt says he? femboy? tried your prompt on neta lumina with artist tags (@wlop:0.7), (@quasarcake:0.7),
Anonymous No.105967614
Anonymous No.105967617 >>105967664
>>105967451
comfy is the tranny frontend. what do you think comfy does with his Chinese handlers all day?
Anonymous No.105967623 >>105967642
>>105967534
>If you got that in one go
yes, this was the 1st swipe
I'm very much impressed by wan t2i too

>This enchanting aquarelle painting depicts a delicate pixie with vibrant red-orange hair, sitting contemplatively on a moss-covered rock in a magical forest at night. Her ethereal, translucent wings glow with a soft blue light, slightly blurred to create a dreamy effect. She is adorned in a flowing, light blue skirt that cascades gently around her, and she is barefoot, with one leg crossed over the other, her feet resting on the rock.

>The background is a lush, dense forest, shrouded in shadows and illuminated by a glowing, full moon that casts a soft, silvery light over the scene. The moon is positioned in the upper left corner, partially obscured by misty, swirling clouds. Tiny, luminous blue butterflies flutter around her, their presence subtly suggested rather than overtly stated, adding a sense of whimsy and magic to the atmosphere.

>The overall color palette is dominated by deep blues and blacks, with the soft blue and red-orange colors providing a gentle, captivating contrast. The texture of the painting is rich and slightly grainy, with visible brushstrokes that add depth and a sense of movement to the ethereal forest setting. The pixie's serene expression and the gentle, almost melancholic ambiance of the scene evoke a sense of quiet reflection and magical wonder, with a slightly blurred effect that enhances the dreamy, otherworldly quality of the painting.
Anonymous No.105967634
>>105967607
>why'd you get a she when the prompt says he?
idk, might say "male" explicitly
Anonymous No.105967642
>>105967623
Such a waste! Wan was trained in French as well!

>Cette toile impressionniste àquarelle représente une petite fée avec des cheveux vives de roux-orange, assise contemplativement sur une pierre couverte de mousse dans un forest magical pendant la nuit. Ses ailes transparentes et éthérées brillent d'un doux bleu, légèrement floues pour créer un effet de rêve. Elle est vêtue d'une robe enroulée bleue qui s'efface doucement autour d'elle, et elle est pieds nus, avec un pied croisé sur l'autre, ses pieds reposant sur la pierre.

>Le fond est une forêt dense, enveloppée dans des ombres et illuminée par la lune brillante qui projette une lumière argentée sur la scène. La lune est située dans le coin supérieur gauche, partiellement masquée par des nuages mous et mouvementés. Des papillons lumineux bleus s'envolent autour d'elle, leur présence légèrement suggérée plutôt que nettement définie, ajoutant un peu de fantaisie et de magie à l'atmosphère.

>Le palette de couleurs est dominée par des bleus foncés et des noirs, avec les couleurs bleu et roux-orange qui se contrastent doucement. La texture de la peinture est riche et légèrement granuleuse, avec des touches visibles de pinceaux qui ajoutent de la profondeur et un mouvement à l'espace forestier éthéré. L'expression tranquille de la fée et l'ambiance douce et mélancolique de la scène évoquent un sentiment de réflexion paisible et de rêve enchantée, avec un flou qui renforce la qualité rêveuse de la toile.
Anonymous No.105967664 >>105967693 >>105967770
>>105967617
Comfy is a heterosexual man, retard
Anonymous No.105967669 >>105967757
>>105967598
>but it can do some compositions that SDXL can't
Can you elaborate? Or post some examples?
Anonymous No.105967693
>>105967664
Chinese ladyboys are not girlfriends
Anonymous No.105967699 >>105967785
Is it worth using hand loras to fix hands? How does this compare to just masking?

https://civitai.com/models/200255?modelVersionId=1464262
Anonymous No.105967715
>>105967367
lovely gen, gj
Anonymous No.105967740
Anonymous No.105967757
>>105967669
It understands natural language, so you can dictate what goes where in 3D space more easily, and you can describe multiple characters without regional prompting and not have each characteristic be applied to a random character(s).

Pic related is an Illustrious gen that used a Chroma gen as a ControlNet input. Afraid I lost the Chroma gen though.
Anonymous No.105967766
new
>>105967759
>>105967759
>>105967759
Anonymous No.105967767
Launched today

77® Flux Anime

Tool for creating dynamic anime/manga art.
Captures vibrant energy and emotion of Japanese animation.
Versatile for characters, scenes, and illustrations, from 90s nostalgia to modern styles.
Ensures quality, detail, and style.

Sampler: normal-sample
CFG Scale: 3.5-7.5
Steps: 25-30

https://civitai.com/models/1793130/77r-flux-anime
Anonymous No.105967770
>>105967664
Are you mad because they're sexhavers? tfw-mno-gf.jpg?
Anonymous No.105967778
baking, few minutes
Anonymous No.105967785
>>105967699
I haven't tried it but loras aren't free and they will never work around the VAE's limitations.
Anonymous No.105967788
>>105967471
>>105967560

Wait! It is a lora.
Anonymous No.105967801 >>105967808
>>105967797
>>105967797
>>105967797
Anonymous No.105967808
>>105967801
we already had a new thread, this is a duplicate
Anonymous No.105967964
>>105967362
1) people tend to stick with whatever they know, so there's not a lot of broad, systemic testing. when deciding: base model (highest knowledge, generalized output), finetune (partial knowledge, specialized output), and lora (minimal knowledge, discrete output). a finetune takes a base model and squeezes its attention into an narrower band (2d: anime), and a lora teaches a discrete, specific style or concept (3d: game assets: WoW Classic skill effects).
2) I'd vouch for comfy for "pipeline" creation. you can create multiple prompt nodes and toggle between them or combine them, or have "styles" that you save as csv and toggle on or off in chains with lora.
for instance, you can draw a square, group nodes in it, then activate or deactivate everything in that square with a click. if you create multiple squares and assign them different style prompts and loras, you could run them all in a single queue.