← Home ← Back to /g/

Thread 105673353

319 posts 208 images /g/
Anonymous No.105673353 >>105676092 >>105677705
/ldg/ - Local Diffusion General
dfvdfh8bh7df Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>105669256

https://rentry.org/ldg-lazy-getting-started-guide

>UI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Models, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info

>Cook
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX (video)
Guide: https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1

>Chroma
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and beyond: https://rentry.org/comfyui_guide_1girl
Tag explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate

>Neighbors
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/celeb+ai
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Anonymous No.105673389
Anonymous No.105673400
Anonymous No.105673409 >>105673428 >>105673436 >>105673508 >>105676669
When picking a graphics card, I should get as much VRAM as possible, even if it means getting an older card, right?
Anonymous No.105673428
>>105673409
for image gen compute also matters alot so don't go below the 30 series. 40 series and up is ideal
Anonymous No.105673434
diaper status?
Anonymous No.105673436 >>105673474
>>105673409
Anonymous No.105673474 >>105673509
>>105673436
image/video %/5090
This specifically tests generation speeds, right?
Anonymous No.105673508
>>105673409
no, dont buy a p40 or a m40
theyre overpriced now
2-3 years ago a p40 was 200$ max, usable for llms
bandwidth is also important
so yeah get an arc a770 16gb or wait 2 more quarters for the b60 with 24gb
Anonymous No.105673509 >>105673781
>>105673474
Image/video score: 20% VRAM + 80% TFLOPS * Tensor core bonus.
Anonymous No.105673510 >>105673524
Anonymous No.105673514 >>105673568
Can you anons give me feedback for my training settings please? I'm trying to train illustrious.
pastebin.com/pdnQG4fj
Anonymous No.105673519
>>105671804
please post more and post the workflow
Anonymous No.105673524 >>105673526 >>105673539
>>105673510
Pride Month?
Anonymous No.105673526
>>105673524
that's my pubic hair
Anonymous No.105673539 >>105673807
>>105673524
no, trying to get something to work
Anonymous No.105673555
anyone here with a pascal gpu? what are your gen speeds?
Anonymous No.105673568 >>105673702
>>105673514
look into captioning with new nsfw joycaption model inside https://github.com/aisingapore/TagUI, the model is https://huggingface.co/fancyfeast/llama-joycaption-beta-one-hf-llava/tree/main
try out all of the top training software and see which one has best support or best defaults for illust
Anonymous No.105673684
Anonymous No.105673686
Anonymous No.105673702 >>105673735
>>105673568
I don't want to be rude but, what does that have to do with what I asked?
Anonymous No.105673730
Anonymous No.105673735 >>105674198
>>105673702
captions are one of the most important things for creating a good lora so if you are asking for help with training settings you probably dont know the current top captioning model that can help you

and i dont know whats hard to understand about how looking and testing default training settings of top training software can help you in knowing... which training settings are good
Anonymous No.105673742 >>105673888 >>105674084
Anonymous No.105673777
Anonymous No.105673779
>running my cpu at 600mhz
>pc is usable
>5w max, usually at 2.5w idle
intel.. I KNEEL
Anonymous No.105673781 >>105673822
>>105673509
I ran the numbers through this formula and it gives slightly different results. More importantly it says an 8gb 4060 Ti would perform better than a 12gb 3060.
Anonymous No.105673796 >>105673806
This was the most drastic hires fix I've ever seen and the denoise is only at 0.3. What the fuck lol
Anonymous No.105673806 >>105673833
>>105673796
but the images are in 2 completely different locations.. what aas the promtp
Anonymous No.105673807 >>105673832 >>105674899 >>105675743
>>105673539
almost got what I want
>a real life 4K DSLR photo shot on 24mm wide angle lens of a real life storefront and sidewalk in New York city. Multilayered 3D abstract pontilist see-through standing blonde white woman made entirely out of paint wearing a blue dress with visible thick brushstrokes floats in the foreground with a parallax effect. the woman is made of paint. the photoreal background is visible through gaps in the paint.

anyone have thoughts on how to improve the colors/details on the womand while still making her look like paint and keeping the background photoreal?
Anonymous No.105673822 >>105673856
>>105673781
The formula uses 5090% values for calculation, and any card with less than 16GB VRAM other than 3060 12GB isn't considered viable.
Anonymous No.105673832
>>105673807
that kind of prompt really takes me back yerp
Anonymous No.105673833
>>105673806
I think it was due to the cfg settings in conjunction with this model which is different than the model I made the prompt for previously, the hires fix had a much higher setting.
https://files.catbox.moe/gz3j5i.jpg
Anonymous No.105673845 >>105673868 >>105675724
fuck is the problem with this node
Anonymous No.105673856 >>105674207
>>105673822
>and any card with less than 16GB VRAM other than 3060 12GB isn't considered viable.
So... would I get faster or slower generation speeds with a 3060 compared to 4060 Ti. From what I tested in comfy, 4060 Ti 8gb does 4 1000x1000 pics in under a minute.
Anonymous No.105673868
>>105673845
Well you see when you multiply two matrices, the number of columns in the first matrix has to match the number of rows in the second, or the multiplication can't be performed.
Anonymous No.105673888 >>105674553
>>105673742

Is this made off a chroma Lora? Been trying to see if it’s worth trying to train one at this moment
Anonymous No.105673896 >>105674291
wan is a better image model than chroma
Anonymous No.105673937 >>105675321
all i want for Christmas is an RTX PRO 6000
Anonymous No.105673948
all i want for Christmas is for anon to gen kino soul
Anonymous No.105673955
can someone redpill me on pose control especially for wan. is it worth the effort?
Anonymous No.105673984
Anonymous No.105674037 >>105676092
Best lora for egirls?
There are a handful on civitai but most of them seem very old (for pony, etc)
Anonymous No.105674084
>>105673742
now thats a good meme
Anonymous No.105674094
Anonymous No.105674180 >>105674207 >>105674218
>>105673344
I dunno, I was getting some good T2V porn from Hunyuan, and I already have a ton of Lora for it. I just can't seem to get I2V or V2V to work.

How's the lora situation for WAN? didn't civit implode?
Anonymous No.105674198
>>105673735
But I'm not asking for that, I'm not asking for captions, I'm asking for the settings in the pastebin
Anonymous No.105674207 >>105674227
>>105674180
https://civitaiarchive.com/
>>105673856
ideno anon post a workflow
Anonymous No.105674211 >>105674405
Can you anons give me feedback for my training settings please? I'm trying to train illustrious.

pastebin.com/pdnQG4fj
Anonymous No.105674218
>>105674180
>porn
>>>/gif/vdg
is like 90% wan theyd probably know
Anonymous No.105674227 >>105674237 >>105674412
>>105674207
comfy ui with the basic nodes. Just positive and negative prompts.
Anonymous No.105674237
>>105674227
bwo, sdxl?
Anonymous No.105674291
>>105673896
kino
Anonymous No.105674405 >>105674431
>>105674211
You should train the convolutional layers too if you want better likeness.
Anonymous No.105674412 >>105674442
>>105674227
4 1024x1024 images take 58 seconds on 3060 powerlimited to 100W
49 seconds @170W
sdxl based model
Anonymous No.105674431 >>105674559
>>105674405
I've heard about it but I never got into it, can you tell me more please?
I was also planning on using regularization images but I've never done it, I also wanted to check my tags because I still have to stick to the same 225 token limit for all images, so I need to remove ambiguous tags
Anonymous No.105674442
>>105674412
*excluding vae decode and whatever else
Anonymous No.105674550 >>105674570
Anonymous No.105674553
>>105673888
no its regular flux with the Mayli LoRa and then turned into a video with Wan
Anonymous No.105674559 >>105674616
>>105674431
You just set the dimension and alpha for the convolutional layers. I usually set them the same as the network dimension and alpha. You need to choose a non-standard lora type in order to use them.

Regularization images aren't necessary if the model is already familiar with the concept you want to train. In my experiments, what the regularization images did was acting like a combination of a controlnet and img2img, and not at all like what people say online.

If you're training for illustrious, you should use mostly danbooru tags.
Anonymous No.105674569
>"Model, draw X here and there"
>No
>"Model, draw (X:1.5) here and there"
>No, or maybe here but not there, or there but not here
>Have to start spamming gens until X appears in all the correct places
There is something demeaning about transforming electricity into slop so maybe the 35th gen will actually follow the prompt.
Anonymous No.105674570 >>105674581 >>105674594
>>105674550
hey lk anon. is being a lupin count as being a lupin or not?
Anonymous No.105674581
>>105674570
>lupin
*mutant
Anonymous No.105674594
>>105674570
only if you identify as one
Anonymous No.105674616 >>105676099
>>105674559
Aren't those already set up on my training settings?
Anonymous No.105674663 >>105674698 >>105674847 >>105675267 >>105675343 >>105675362 >>105675979
any way to convert the heat from my graphics card to cold air?
Anonymous No.105674698 >>105674709
>>105674663
no. by genning, you are permanently creating entropy and accelerating the heat death of the universe.
Anonymous No.105674709 >>105674780
>>105674698
by generating AI porn we are killing this subhuman shithole planet for good.
how I fucking hate this place.
Anonymous No.105674775 >>105674778
ever have a flux prompt, and the results are so great every time you can't stop genning the same flow to see what it comes up with next. model is fluxfusion
Anonymous No.105674778
>>105674775
pic unrelated?
Anonymous No.105674780 >>105674793 >>105678212
>>105674709
that's just suicidal nihilism, save the planet - end yourself
Anonymous No.105674793 >>105674828
>>105674780
kill the planet and kill everything on it is much better.
otherwise we risk respawning on this cursed rock after death.
so keep those GPUs running, the more the better.
Anonymous No.105674807
Anonymous No.105674822
Anonymous No.105674828 >>105674843
>>105674793
just start with yourself to make sure, do it now!
Anonymous No.105674843 >>105674856
>>105674828
no, I have to use my maximum possible lifespan to destroy this rock in as many ways as humanly possible.
Anonymous No.105674847
>>105674663
stirling engine
Anonymous No.105674856 >>105674869
>>105674843
do it, save the planet, become an hero
Anonymous No.105674869 >>105674917
>>105674856
>save the planet
are you retarded and illiterate or something?
Anonymous No.105674899 >>105674919 >>105675620
>>105673807
that looks looks like midjourney slop. you would be popular on twitter

>lets make this super surreal thing and get 10,000 upvotes
Anonymous No.105674917 >>105674965
>>105674869
now dont be a hypocrite, act on your principles, do it
Anonymous No.105674919 >>105675033
>>105674899
doesn't quite cut it anymore. Sasquatch holiday with veo3 is the meta
Anonymous No.105674965
>>105674917
you have to learn how to read good
Anonymous No.105675033 >>105675102
>>105674919
>veo3
>$124.99 USD / month for 3 months
Are there cheaper ways of getting clout?
Anonymous No.105675098 >>105675327 >>105675542
Is there an ultralytics bbox detector for feet? AI image gen is slowing turning me into a footfag. I don't care about feet but like hands I hate too many toes or fused/mangled toes. I see a few pth files on huggingface but they have no downloads at all hardly and seem suspicious. And I can't find any on civitai or comfyui manager.
Anonymous No.105675102 >>105675113
>>105675033
you only get clout if you have money. welcome to the real world
Anonymous No.105675113
>>105675102
is that why i everything i see at the top of my feed is shit. its almost like chasing clout = reducing quaity
Anonymous No.105675267
>>105674663
this machine is called an air conditioner, hope this helps
Anonymous No.105675321 >>105675408
>>105673937
same but it's like $10k, and if you make under $150k/yr you're literally wasting money because by the time you can afford it, a significantly better card will already be out.
Anonymous No.105675327
>>105675098
no, why not just make one? grab a few hundred pictures of feet and do it
Anonymous No.105675343
>>105674663
>90F all week
>A/C has to run all day to keep my room from overheating

I wished I lived in an extremely cold environment where it's below freezing everyday.
Anonymous No.105675362
>>105674663
Not sure but I think there's probably a way involving a wheel and magnets on each spoke
Anonymous No.105675408
>>105675321
i can't really justify it since my 3090 still kinda does everything i need it to and i bought it used for $700 a couple years ago
idk how that fucker hasn't died yet i don't power limit or any of the memes just straight up abuse non stop
Anonymous No.105675420
Anonymous No.105675431 >>105675447 >>105675462 >>105675511
Do any of you post your gens on other sites like civitai or something?
Anonymous No.105675447
>>105675431
perhaps
Anonymous No.105675455
Anonymous No.105675462
>>105675431
no, i am a lazy fag thats been following llms and image models since 2022 and i did nothing useful with them
never tried to monetize them
almost every day gen and goon yet nothing good
atleast its fun
Anonymous No.105675491 >>105676259 >>105676346
give it to me fr fr no cap is chroma finna fix anatomy, faces and fingers before v50?
Anonymous No.105675499
so uh.. is nag available for chroma yet?
Anonymous No.105675511 >>105675634
>>105675431
i shared a few loras i trained on there and the popularity gave me enough credits to basically train unlimited loras so i don't have to pay for runpod instances at least
Anonymous No.105675515 >>105676057 >>105676092
After I got bored of crucifixion scenes I went back to genning redheads again.
Man. Redheads, huh?
Anonymous No.105675541
Anonymous No.105675542
>>105675098
there's a foot-yolov8l.pt on someone's huggingface if you google it but haven't tried how reliable it is
Anonymous No.105675620
>>105674899
yeah it does look like MJ slop, but it could be improved upon if I can get a consistent result. chroma gets way too much concept bleed here, and doesn't actually do what I'm asking for. I want an expressionist painting where the brushstrokes are floating in midair, while chroma tends towards just making this woman a bunch of paint splashes at best.

no point trying to de-slop and gen a more interesting subject if I can't even get 1girl to render properly.
Anonymous No.105675634
>>105675511
Heh that's what I'm doing, saving buzz for chroma
Anonymous No.105675661 >>105676234
Anonymous No.105675673 >>105675700
how long till we get a local image gen model with gpt levels of following the prompt? "slightly better than flux" prompt adherence ain't enough
Anonymous No.105675700 >>105675775
>>105675673
Just pray one of the chink labs deliver. I think it will happen eventually when they release a new multimodal LLM with image output as well as input, but like 90% of the users in this thread wouldn't be able to run it anyway and nobody would wait 10 minutes to get a 1girl image
Anonymous No.105675704 >>105675783
any new models/developments? where is that kontext thing?

in any case, the light2x wan lora is amazing, you get videos in barely over a minute now.
Anonymous No.105675724
>>105673845
Make sure you're using the correct T5 (that was my problem when I encountered that error).
Anonymous No.105675743 >>105675791 >>105676046 >>105676084
>>105673807
gives strange concept bleed
Anonymous No.105675775 >>105675988
>>105675700
cinks have been using more and more slop data. I wouldn't hype them too much
Anonymous No.105675783
>>105675704
>kontext
cancelled due to a comfyorg faggot
Anonymous No.105675791
>>105675743
definitely, chroma struggles here. I suspect a highly specific prompt could get better results.

Any midjourney anons want to do a comparison? to that or this:
>a real life DSLR photo of a multicolored, multilayered 3D painting of a woman with visible floating brushstrokes with a parallax effect. lines of paint are floating in a dark void. the background is pure black.
Anonymous No.105675876
Anonymous No.105675932 >>105676118 >>105676137
Anonymous No.105675979
>>105674663
Make your body temp hotter than the graphics card and it will feel cold in comparison
Anonymous No.105675988
>>105675775
Regardless, they are the only ones open sourcing cool stuff
Anonymous No.105676046 >>105676059 >>105676084 >>105676141 >>105676899
>>105675743
nah you're just shit at Chroma
stop using whatever utter dogshit sampler it is you're using lmao
and treat it like the not-distilled model it is
you NEED to have a relatively schizo negative with it, there's no way around that in a model mixing content this heavily
Euler Ancestral Beta @ CFG 6.5 is good as a baseline, also
Anonymous No.105676057
>>105675515
https://files.catbox.moe/iu5i55.jpg
https://www.youtube.com/watch?v=DRSvgXoQlBY
Anonymous No.105676059
>>105676046
another one (not how my buildings aren't like Timmy's First Deviantart Upload)
Anonymous No.105676084 >>105676125
>>105676046
>shit at chroma
>posts this sloppa
come one now, >>105675743 is clearly better
Anonymous No.105676092 >>105676349
>>105673353 (OP)
>>>/vp/napt is your neighbor ;3
>>105674037
i like realdream15 but you need specific vae to avoid 'waxy' girls
>>105675515
i like it a lot..
Anonymous No.105676099
>>105674616
They're set to the defaults at 1/1, so they won't do anything. You need to set them to higher values for them to have any effect.
Anonymous No.105676118
>>105675932
kek nice
Anonymous No.105676125
>>105676084
gr8 b8 m8
Anonymous No.105676137
>>105675932
nigga stole my bike
Anonymous No.105676141
>>105676046
you can crank up cfg to 11, won't help with concept bleed
Anonymous No.105676160 >>105677958 >>105677962
(Cherry blossom print)

any other good print or patterns for clothing?

I did a snowflake print for Korra and flame print with Azula
Anonymous No.105676191
'nuff fucking around with 1.3b
not worth the quality drop, tried 960x544 too
this was 110s on a 3060 @100W
for comparison 14B, which is WAY better takes only 200s on a 3060 @100W
Anonymous No.105676213 >>105676238 >>105676382 >>105676391 >>105676482 >>105676510
can anyone recommend a good model that can make males and more serious shit. I am kinda autistic so instead of porn i like to make shit like people fighting and mages casting spells but these models i see on top of civitAI most liked are real bad at this.
Anonymous No.105676234
>>105675661
the coloring/shading is nice. shame it's such a dogshit subject matter.
Anonymous No.105676238
>>105676213
>I am kinda autistic
>instead of porn i like to make shit like people fighting and mages
kek, it's the other way around
Anonymous No.105676259
>>105675491


TEEHEE SURE!!!
Anonymous No.105676278
Anonymous No.105676346
>>105675491
two more weeks
Anonymous No.105676349 >>105676524
>>105676092
byeeeeee
Anonymous No.105676382 >>105677070
>>105676213
Flux
Anonymous No.105676391 >>105677039 >>105677041
>>105676213
sd3.5 large
hidream full
Anonymous No.105676399
Anonymous No.105676417 >>105676459 >>105676622
Anonymous No.105676423
Anonymous No.105676459
>>105676417
i am an earth woman!
and most definitely NOT an alien!
i enjoy all the typical earth-female activities:
such as being a baseball, and eating boys videogames haha :3
nevermind my praying mantis arms these are NORMAL!
Anonymous No.105676482
>>105676213
Midjourney of GPT-image
Anonymous No.105676498 >>105676503
does self-forcing still have slowmotion ?
Anonymous No.105676501
Anonymous No.105676503 >>105676518
>>105676498
yes
Anonymous No.105676510
>>105676213
pixelwave flux is what I use for that though there are multiple options
Anonymous No.105676518 >>105676532
>>105676503
tf is the point of this shit. It's faster to gen but slow movement down .
Anonymous No.105676524 >>105676543
>>105676349
What checkpoint/lora/artist names do you use to get that look?
Anonymous No.105676529 >>105676547 >>105676555
>>105669256
I skipped the bottom line. AceStep version of op image:

https://files.catbox.moe/9r9o7u.mp3

[inst]

[verse]
toon guy with vomit bloodclots out his mouth
vid chink bedroom top down view cleavage dress
vid bounce breast blue haired winged liner wide mouth
four panel purple and red an e mays

[chorus]
Why do we keep posting?
Why do I hit Run?
Why do I fill the q?
What do I anticipate?
Why do we keep posting?
Why do I hit Run?
Why do I fill the q?
What do I anticipate?

[verse]
disassembled robot backdrop orange
animation a lunar volcano
long legs long hair grafitti under bridge
a semite moon and earth in bad shadow


[chorus]
Why do we keep posting?
Why do I hit Run?
Why do I fill the q?
What do I anticipate?
Why do we keep posting?
Why do I hit Run?
Why do I fill the q?
What do I anticipate?

[verse]
jap slap jiggle ass mask long braid with gloves
a silly soldier forgot his flak vest
street view exercise sluts all previous
an e may red lens milk cows handsome chest

{it didn't make it this far:}

[chorus]
Why do we keep posting?
Why do I hit Run?
Why do I fill the q?
What do I anticipate?
Why do we keep posting?
Why do I hit Run?
Why do I fill the q?
What do I anticipate?
Anonymous No.105676532
black hole cmake thinkan

>>105676518
instant gratification but that's really about it
Anonymous No.105676543
>>105676524
its a shidload of fuck bro
and not recommended
its a wobbly stack of 4-5 different lora
(which causes the, originally unintended, paint smear\glitch effect)

'sweatcreamcake' and 'toonbabes' seem to be the main ones needed

>prompt: amazing background, paintswirl background, etc
Anonymous No.105676547
>>105676529
try finding a better seed bro
Anonymous No.105676555 >>105676572
>>105676529
AceStep is terrible
YuE is better, the problem is that it takes 20 minutes for a fucking song
Anonymous No.105676558 >>105676567 >>105676685
jesus fucking christ
Anonymous No.105676564 >>105676572 >>105676593
uh guys

acestep can lay down a solo lol

unfortunately cut short, but it's perfect.

A love song for all the ladies out there (not you, ywnbaw!!!, I mean REAL women -_-):
https://files.catbox.moe/5se7w3.mp3
Anonymous No.105676567
>>105676558
nice goblin hands. you got there, miss
Anonymous No.105676572 >>105676591
>>105676555
Checked, I need to try it!!!! is there an official guide for comfyui? but acestep is laying down serious solos lol like lays it down (sometimes):
>>105676564
Anonymous No.105676591 >>105676633
>>105676572
The versions I checked did not use Comfy
https://github.com/sgsdxzy/YuE-exllamav2
https://github.com/Mozer/YuE-extend

Keep in mind this shit is slow, but I had tons of fun using the ICL feature (using reference audio clips to generate songs)
Anonymous No.105676593
>>105676564
text in case anyone cares.

[inst]

[verse]
I like to lick your sexy toes
Lady with the athlete's feet
I like to taste the tainted twixt
of the web of the sweaty things

To suck and suck on that one big toe
that's bent and has a thick nail
I'd like to love your feet forever
but our elevator's there

[chorus]
Going up your legs
is probably advised
But I'm just down here sucking
on my lady's feet and thighs
Going up your legs
is probably advised
But I'm just down here sucking
on my lady's feet and thighs

[verse]
I like to chew on my lady's toenails
so they're smooth and round
and chomp a bit on the flavor
of the polish she covers them in

she's got some callouses I'll moisten them
with hours of my spit in love
and soon I'll grind them all away
a perfectly spent year and ten months

[chorus]
Going up your legs
is probably advised
But I'm just down here sucking
on my lady's feet and thighs
Going up your legs
is probably advised
But I'm just down here sucking
on my lady's feet and thighs
Anonymous No.105676602 >>105676633
I'm learning a lot about art and painting from fixing all my failed gens. You fix a finger here and there and eventually you're watching tutorials and you realize the shading is fucked up so you start fixing that. Then you notice small anatomical errors that you can paint over. Now I'm starting to just draw and paint my own stuff sometimes. It's way way slower but satisfying in a different way.
Anonymous No.105676622 >>105676632 >>105676639
>>105676417
Anonymous No.105676632
>>105676622
Based.
Anonymous No.105676633 >>105676641
>>105676591
Yeah, I gotta use Comfy, AMD-cel here.

>>105676602
genned:
https://files.catbox.moe/igrmq8.mp3
Anonymous No.105676639
>>105676622
LOL nice
Anonymous No.105676641 >>105676648 >>105676656
>>105676633
>Yeah, I gotta use Comfy, AMD-cel here.
Just use ZLUDA
Anonymous No.105676648 >>105676654
>>105676641
ahem, I am ALSO a Linux-cel
Anonymous No.105676654
>>105676648
It works on Linux
Anonymous No.105676656 >>105676663
>>105676641
>comfyui has a Yue
gonna try it lol
Anonymous No.105676663 >>105676717 >>105676727
>>105676656
If you're gonna try it, I highly recommend using ICL (use a reference audio)
Anonymous No.105676669
>>105673409
If you can swing it, $3K buys you a 4090D with 48GB of VRAM, Ada-gen, and 14K CUDA cores. It's not as fast as a 5090 at inference, but it has 16GB more VRAM, which opens up things like video LoRA training, or using models at their native fp16 for best quality.
Anonymous No.105676685
>>105676558
Anonymous No.105676717 >>105676727 >>105676730
>>105676663
>fetching 3 files
so it's downloading the model?
Anonymous No.105676724
Anonymous No.105676725 >>105676731 >>105676736 >>105676753 >>105676974 >>105677185 >>105677232 >>105677289 >>105677537
Babe wake up, Omnigen 2 got released
https://github.com/VectorSpaceLab/OmniGen2
Anonymous No.105676727
>>105676663
>>105676717
it was SENDING data. no clue why lol. Gotta figure out how to config this thing.
Anonymous No.105676730 >>105676822
>>105676717
If you are using anything else other than the exl2 fork, especially not on Nvidia, prepare to wait forever for a gen
Anonymous No.105676731
>>105676725
>the hat isn't beautiful
wat mean
Anonymous No.105676736 >>105676746 >>105676973
>>105676725
Why did we ignore the first version? I'm sure there was a good reason but I can't remember what it was.
Anonymous No.105676738
Anonymous No.105676744
Anonymous No.105676746
>>105676736
>Why did we ignore the first version?
because it was ass lol
Anonymous No.105676753 >>105676802
>>105676725
Hopefully it is at least faster than BAGEL, I tested that shit local and it takes like 5 minutes to gen on a 3090 and most results were meh
We need a local alternative to Flux Kontext since BFL became greedy jews and won't release the open weights version anytime soon
Anonymous No.105676802
>>105676753
>(((open))) weights
Anonymous No.105676822 >>105676856
>>105676730
changed my mind, too many steps lol

install is jank, looks like hours of tinkering, or days. I'll stick to acestep. SOVL wins again.

Look at the bright side, eventually our low quality diffusions will be easily upscaled.
Anonymous No.105676856 >>105676888
>>105676822
Anonymous No.105676860 >>105676877 >>105677558
Anonymous No.105676863
I don't know how to explain it, but AceStep is kind of like rock guitar distortion.

Like you don't get it now, but this kind of glitch is gonna be YUGE
Anonymous No.105676877
>>105676860
did - did she just spit out a tendy?
Anonymous No.105676883
Anonymous No.105676886 >>105676894
Anonymous No.105676888 >>105676930
>>105676856
Anonymous No.105676894 >>105676945
>>105676886
why is that guy in a sack of watermelons?
Anonymous No.105676899 >>105676904
>>105676046
you are completely missing the point of the prompt. guess i can't blame chroma if even real humans are this retarded
Anonymous No.105676904 >>105676974
>>105676899
Chroma makes cool stuff, but it messes up hands, probably because it's trained to make paws.
Anonymous No.105676928
Anonymous No.105676930
>>105676888
Anonymous No.105676942
I would buy a pmp that had an llm and acestep to generate covers of hits
Anonymous No.105676945
>>105676894
He likes his watermelons tightly secured.
Anonymous No.105676966
Anonymous No.105676973 >>105677060
>>105676736
the same reason we'll ignore this version
Anonymous No.105676974 >>105676983
>>105676725
>26s on a A800 using 17gb
whats a coomsumer equivalent card?

>>105676904
true, you have to put gloves on everything
Anonymous No.105676983
>>105676974
that really looks like paws, I think his furry shit is killing the hands lol
Anonymous No.105676994 >>105677045
Anonymous No.105677017 >>105677023 >>105677045
Anonymous No.105677023
>>105677017
needs visible throat
Anonymous No.105677039 >>105678956
>>105676391
>sd3.5 large

there is no finetunes though?
also, how is it better than flux/chroma? Last i saw wasnt it a failed endeavour?
Anonymous No.105677041 >>105678956
>>105676391
>sd3.5 large
lmaoooooo
Anonymous No.105677045
>>105676994
>>105677017
woud watch that video
Anonymous No.105677060 >>105677093
>>105676973
slop?
Anonymous No.105677070 >>105677081
>>105676382
but isnt flux mainly for realistic shit? I dont think there is anything on flux like illustrious.
Anonymous No.105677081 >>105677124
>>105677070
>but isnt flux mainly for realistic shit?
it's not even good at that, the skin is so plastic, the blur is so intense, the chin is so butted
Anonymous No.105677093 >>105677106
>>105677060
Looking back at the archive, it never got much traction. It also released the same day as (I'm quoting here)
>- Moshi 1
>- SD3.5
>- Allegro
>- Emu1
>- The demo of Sana
Weren't all those also forgotten (or in SD3.5's case, spat upon)?
Anonymous No.105677106
>>105677093
>Weren't all those also forgotten
mochi had a momentum and then hunyuanvideo killed it, they promised mochiHD but it didn't happen
Anonymous No.105677108 >>105678369
Anonymous No.105677124 >>105677135
>>105677081
I hear chrome is better at this... so what am i just doomed since all on sdxl is for coomers?
Anonymous No.105677135
>>105677124
>I hear chrome is better at this.
it has better skin texture, no flux chin anymore, but the anatomy is so fucking garbage, why can't we have nice thing man...
Anonymous No.105677171 >>105677173
>do what chromafag is doing except not with furshit
>????
>profit
Anonymous No.105677173
>>105677171
No one has 50k dollars to spare :(
Anonymous No.105677185 >>105677207
>>105676725
SOVL
Anonymous No.105677186 >>105677314
Chroma's anatomy problems would be fixed if the furfag started training on 1024 with the full cluster right now instead of waiting for the last epochs
But apparently doing this now would be more expensive or something
Anonymous No.105677207 >>105677232 >>105677250 >>105677255 >>105677331
>>105677185
https://huggingface.co/spaces/OmniGen2/OmniGen2
this is pretty good, and it's apache 2.0 licence, I think kontext dev has some serious trouble now lol
Anonymous No.105677212
is pony still king of sdxl anatomy or has illustrious managed to close the gap?
Anonymous No.105677232 >>105677239
>>105677207
>>105676725
In my experience it's not tha good with big changes but I think this is great since it will kinda force kontext devs to release their main model or something very close to it if they plan on their release not being DOA, since the kontext they wanted to release as open weights is gimped garbage
Anonymous No.105677239
>>105677232
kontext will be some distilled garbage with a shit licence, while this one is even able to handle multiple images together like 4o, feels good that BFL is getting pwned on that one, sick and tired of their shenanigans
Anonymous No.105677250 >>105677374
>>105677207
wen comfyui?
Anonymous No.105677255 >>105677331
>>105677207
>https://huggingface.co/spaces/OmniGen2/OmniGen2
>this is pretty good
Not bad indeed.
Anonymous No.105677289 >>105677311
>>105676725
17gb of vram for a 4b diffusion model is brutal, wtf
Anonymous No.105677311 >>105677373
>>105677289
used 3090 chads can't stop winning
Anonymous No.105677314
>>105677186
I doubt 2 epochs will be enough to fix the anatomy
Anonymous No.105677331
>>105677207
>>105677255
>China saved us again
Xi Jinping, my back hurts from kneeling so much :(
Anonymous No.105677373 >>105677472
>>105677311
I really think nobody at AMD knows anything about ai, literally.
Anonymous No.105677374 >>105677429 >>105677433 >>105677458 >>105677601
>>105677250
>wen comfyui?
https://github.com/Yuan-ManX/ComfyUI-OmniGen2
Anonymous No.105677429 >>105677433 >>105677458
>>105677374
since to be working only with that PR
https://github.com/Yuan-ManX/ComfyUI-OmniGen2/pull/2

So you do this
>git clone https://github.com/Yuan-ManX/ComfyUI-OmniGen2.git
>cd ComfyUI-OmniGen2
>git fetch origin pull/2/head:pr-2
>git checkout pr-2
Anonymous No.105677433
>>105677374
>>105677429
nice ty
Anonymous No.105677458 >>105677467 >>105677470
>>105677374
>>105677429
>ModuleNotFoundError: No module named 'flash_attn'
those niggers used flash attention instead of sage? FUCKING WHY?
Anonymous No.105677467 >>105677482
>>105677458
and the official instructions say build it yourself, lel
yeah I think this will fuck your comfy install, just use the gradio interface
Anonymous No.105677470 >>105677476
>>105677458
sage is slightly lossy
AI researchers are doing their shit on fast high vram cloud computers, so they have no interest in a lossy form of attention that makes things faster on consumer hardware
Anonymous No.105677472
>>105677373
I would bet even fucking Intel catching up to Nvidia in AI over AMD
They simply stopped trying, not to mention Lisa Su and Jensen Huang are relatives
Anonymous No.105677475 >>105677492 >>105677504
I have a presumably retarded question. I'm seeing realism based models trained on photography but also saying to use Danbooru tags when prompting. Why do they use anime booru tags for realism?
Anonymous No.105677476
>>105677470
>sage is slightly lossy
flash is more lossy, it doesn't make any sense
Anonymous No.105677482 >>105677484
>>105677467
>and the official instructions say build it yourself, lel
I can feel my day will be a lot of pain...
Anonymous No.105677484 >>105677489
>>105677482
nigga just use one of their wheels lol
https://github.com/Dao-AILab/flash-attention/releases/tag/v2.8.0.post2
Anonymous No.105677489 >>105677500
>>105677484
>linux
more like tongue my anux
Anonymous No.105677492
>>105677475
Tags are not exlusive to tranime images online.
Anonymous No.105677500 >>105677520
>>105677489
you can try those ones
https://huggingface.co/lldacing/flash-attention-windows-wheel/tree/main
Anonymous No.105677504
>>105677475
>anime model gets trained on realism
>only enough to rape the style, not the fact that it works best with tags
something like that
Anonymous No.105677520 >>105677529 >>105677568 >>105677573 >>105677591 >>105677603 >>105677626
>>105677500
ugh... do they even test their shit before shipping it to the public?
Anonymous No.105677529
>>105677520
>researchers
>consumer level testing
oh my sweet summer child
Anonymous No.105677537 >>105677568
>>105676725
From my tests, it's ok for stuff like removing things (subjects, objects and watermarks), adding new things to the image, and making stylistic changes. It's crap for anything else, like making new images featuring the same subject as the reference image etc (it doesn't preserve the identity well and sometimes does not even do what you asked), both BAGEL and Flux Kontext are way better in that regard. It also does not understand the instruction of changing camera angles or making variations of the same image
Anonymous No.105677545
Let's start a rumor the Ayatollah is hiding in Mexico City.
Anonymous No.105677558
>>105676860
lol
Anonymous No.105677568
>>105677520
anyways, there's a lot of working demos in there if you want to test it out
https://github.com/VectorSpaceLab/OmniGen2?tab=readme-ov-file#-gradio-demo
>>105677537
>BAGEL and Flux Kontext are way better in that regard.
we'll get the distilled Flux Kontext dev shit though, I doubt it'll be much better than omnigen desu
Anonymous No.105677573
>>105677520
I'm not even getting that far, I just get the error "prompt has no outputs" even though I set up my workflow just like yours
(and yes I cloned the huggingface model repo to models/OmniGen2/OmniGen2)
Anonymous No.105677591 >>105677603 >>105677626
>>105677520
lol getting this too
what the fuck
Anonymous No.105677601 >>105677603
>>105677374
>torch==2.6.0
i sleep
Anonymous No.105677603 >>105677626 >>105677800
>>105677520
>>105677591
yep they need to fix that shit aswell
https://github.com/Yuan-ManX/ComfyUI-OmniGen2/issues/3
I'll be using their gradio in the meantime
>>105677601
you can remove that on the requirements.txt, it doesn't have to be a specific torch
Anonymous No.105677605 >>105677608
>cannot find a waifu to gen who's obscure enough to be cool but not too obscure that she has too few tagged images
Anonymous No.105677608
>>105677605
Jubilee
Anonymous No.105677612 >>105677690 >>105677696 >>105677706 >>105678956
>Wan2.1 I2V
>Linux Mint 21, RX 6700 XT 12GB VRAM, 32 GB RAM
How cooked am I?

Currently managed to get comfyUI up and running. Having to tweak the default wan workflow to use fp8 checkpoint instead of fp16, image size 400x400, turning VAE decoding tile size all the way down to like 128, splitting the pipeline of saving output latents into files and feeding said latents to VAE decoding into 2 separate workflow. All these to avoid OOM error, which only managed to produce some blurry mangled mess.
Try to Wan2GP and it always gives me segfault with no meaningful way to debug that shit.
Any tips for poor fags like me?
Anonymous No.105677626 >>105677638 >>105677666
>>105677603
>>105677591
>>105677520
go into omnigen's nodes.py and edit line 138

from:
def load_model(self, model_path, dtype):
to:
def load_model(self, model_path, dtype, device):

I did this and now it runs, it's downloading the model files
who knows if it'll work once they download though lol, fully expecting there'll be a new bug when it tries to actually run them
Anonymous No.105677638
>>105677626
based, thanks anon
Anonymous No.105677666 >>105677673
>>105677626 (me)
>fully expecting there'll be a new bug when it tries to actually run them
confirmed:

File "C:\ComfyUI_windows_portable\ComfyUI\custom_nodes\ComfyUI-OmniGen2\nodes.py", line 20, in load_pipeline
if args.scheduler == "dpmsolver":
^^^^
NameError: name 'args' is not defined

Oh well. this one is probably beyond me, a retard
Anonymous No.105677673 >>105677846
>>105677666
>this one is probably beyond me, a retard
ask claude to help you on that one
Anonymous No.105677690 >>105677696
>>105677612
>Any tips for poor fags like me?
Stop being poor
Anonymous No.105677696
>>105677612
>>105677690
>Stop being poor
this
https://www.youtube.com/watch?v=VN6OThlrA6g&t=51s
Anonymous No.105677705 >>105677710 >>105677717
>>105673353 (OP)

damn this shit is easy... most anons using comfy UI?
Anonymous No.105677706
>>105677612
save money for a year
Anonymous No.105677710
>>105677705
blue board anon
Anonymous No.105677717
>>105677705
>most anons using comfy UI?
no https://github.com/FizzleDorf/AniStudio
Anonymous No.105677730 >>105678197
why the fuck am i so bad at genning shit
fuck, i'm so fucking dumb
Anonymous No.105677749 >>105677759
Guess the model
>>>/tv/211857362
Anonymous No.105677759
>>105677749
chroma?
Anonymous No.105677783
>follow guide
>it makes my pc produce hot air

its fucking summer
Anonymous No.105677800 >>105677825
>>105677603
>I'll be using their gradio in the meantime
and of course they didn't specify you need gradio to make this shit work, here's the wheels
https://github.com/woct0rdho/triton-windows/releases/tag/v3.3.1-windows.post19
Anonymous No.105677825 >>105677831 >>105677834
>>105677800
How many malwares have you installed in your lifetime?
Anonymous No.105677831
>>105677825
https://pytorch.org/get-started/locally/
this is my fave malware
Anonymous No.105677834
>>105677825
zero
Anonymous No.105677846 >>105677976
>>105677673
I didn't use any LLM but I fixed a long string of simple variable passing bugs (like six of them) and finally I've run up against a problem with the image encoding backend that seems to require actual torch ML knowledge, so this is as far I can go

This repo is clearly just not finished
Anonymous No.105677958 >>105677998
>>105676160
Mind sharing the prompt or catbox?
And the word "theme" can work well sometimes.
Have you tried strawberry print or strawberry theme?
Anonymous No.105677962 >>105678006
>>105676160
You've got a non-exhaustive list over there:
https://tagexplorer.github.io/#/composition?tagGroupFilter=prints
Anonymous No.105677976 >>105677983
>>105677846
>This repo is clearly just not finished
this chink managed to make it run
https://youtu.be/NQl59lHeqcA?t=188
Anonymous No.105677983 >>105677989
>>105677976
these nodes are different, he's using some other repo or made his own
Anonymous No.105677985
Anonymous No.105677989 >>105677994
>>105677983
it's slightly different yeah, but on his description he's showing the repo we're using, maybe he's using an older version that actually work Idk lol
Anonymous No.105677994 >>105678000 >>105678009
>>105677989
>After the author reproduced the node of this project, there were a lot of errors and bugs. Thanks to my friend UP host @δΈ€ζ£΅ζœ¨ζœ¨ζœ¨ζœ¨ζœ¨ε€΄, after modifying the code for everyone, you can use it! , Thank you for your continuous support. Send the password and it will automatically drop. The password (to the right of the colon) is: δΈ€ζ£΅ζœ¨ζœ¨ζœ¨ζœ¨ζœ¨ε€΄

machine translated by youtube. so yeah he's saying a friend of his fixed all the bugs? I'll try downloading it with this code he gave, I hate trying to use chinese websites though
Anonymous No.105677998
>>105677958
I'll try strawberry next

https://files.catbox.moe/a5s6ey.png

https://files.catbox.moe/zjanzz.png

https://files.catbox.moe/lrnflv.png
Anonymous No.105678000 >>105678016
>>105677994
my "friend" made this custom python wheel desu, it is delicious wheel, you must run it desu
Anonymous No.105678006
>>105677962
thanks, I'm going to use that site
Anonymous No.105678009
>>105677994
lol, it's obvious he fixed the repo but he doesn't share the fixed version anywhere... fuck
Anonymous No.105678016 >>105678031 >>105678086
>>105678000
I can't find any link anyway, probably the translation is off and he just hasn't actually shared his fixed version

how annoying
Anonymous No.105678031 >>105678037 >>105678040 >>105678049
>>105678016
wait nm!

https://github.com/VectorSpaceLab/OmniGen2

it was hiding underneath show more
Anonymous No.105678037
>>105678031
>last commit 6 minutes ago
we're so back
Anonymous No.105678040
>>105678031
gg wp anon
Anonymous No.105678049 >>105678061
>>105678031
wait that's not the comfyui version though?
Anonymous No.105678060 >>105680067
Im in bed and too tired to get up to my 5080 can someone make an image of Bocchi the Rock but all bruised bloody beaten and scared I hate her so much im seething right now. Thanks.
Anonymous No.105678061
>>105678049
yeah sorry I'm half asleep and got too excited by seeing another github repo link. I'm retarded
;_;
we're just gonna have to wait ig
Anonymous No.105678086 >>105678119
>>105678016
>probably the translation is off and he just hasn't actually shared his fixed version
no, the translation is good, on his video he says that he's using the "fixed" version, but at no point he's saying where we should get this shit, that's so RETARDED
https://youtu.be/NQl59lHeqcA?t=71
Anonymous No.105678119
>>105678086
I think you have to download this shit here
https://t.zsxq.com/7F90A
https://t.zsxq.com/b9R9G
fucking chinks... what's wrong with uploading with github?
Anonymous No.105678149 >>105678962
Strawberry Wakfu
20Loras No.105678158
>install comfui for WAN shenanigans
>install pytorch 2.7
>see it several times during updates and installations
>now shows 3.2

God fucking damn it.
Anonymous No.105678196
Hmm, when I take my quality prefix and put it in "Concat Conditioning" a bunch of times, it's improving the image a lot.

Is there a difference between doing this versus increasing the weights of the quality prefix?

(By quality prefix I mean the stuff that contains "masterpiece, amazing quality, best quality, etc")
Anonymous No.105678197
>>105677730
keep doing it an eventually you might get better
20Loras No.105678210 >>105678223
Are you supposed to be stuck on this part for several minutes with wan?
Anonymous No.105678212 >>105678268
>>105674780
Model?
Anonymous No.105678223 >>105678260
>>105678210
look at task manager, if your vram space is being fully filled you're fucked
20Loras No.105678260
>>105678223
It was a common mistake, retardation.
I was running forge in the background hogging all the vram. It works now.
Anonymous No.105678268
>>105678212
too long to be generated, and background is too coherent
it's just a real camgirl video
20Loras No.105678316
Man, it's crazy how the masking works. First wan gen anyway, hooray.

Took 6min to render on a 4090. Normal speed?
Anonymous No.105678369
>>105677108
KEKD
Anonymous No.105678512 >>105678553
https://github.com/VectorSpaceLab/OmniGen2
Finally managed to use the gradio one, it's using 18.2 gb of vram and it took this long on my 3090
>50/50 [02:52<00:00, 3.44s/it]
Anonymous No.105678553
>>105678512
kinda brutal for a 4b model not gonna lie
Anonymous No.105678562 >>105678581
Fresh bread

>>105678558
>>105678558
>>105678558
>>105678558
Anonymous No.105678581
>>105678562
Thumbs up for not racing the text reply limit for once.
Anonymous No.105678956
>>105677041
laugh all you want..
>>105677039
so what? you said you wanted to create mages
>>105677612
use kijai workflow with block swapping, im getting wonderful results on a 3060 12gb on debian 12
Anonymous No.105678962
>>105678149
BASED wakfuck enjoyer
Anonymous No.105680067
>>105678060
>He doesn't run run comfy in listen mode and execute gens from his phone.
ngmi