← Home ← Back to /g/

Thread 105818934

330 posts 180 images /g/
Anonymous No.105818934 [Report] >>105818993
/ldg/ - Local Diffusion General
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>105814447

https://rentry.org/ldg-lazy-getting-started-guide

>UI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX (video)
Guide: https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1

>Chroma
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and beyond: https://rentry.org/comfyui_guide_1girl
Tag explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/celeb+ai
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Anonymous No.105818943 [Report] >>105818952 >>105818963 >>105819409 >>105820145 >>105820173 >>105820234 >>105820277 >>105820281 >>105820313
eternal thread of SDXL forever
Anonymous No.105818946 [Report] >>105818961
Looks like my from scratch model is in the OP again.
Anonymous No.105818951 [Report] >>105819031
MOAR sluttified sailor moon pl0xe
Anonymous No.105818952 [Report] >>105819077
>>105818943
acestep
c
e
s
t
e
p
Anonymous No.105818961 [Report]
>>105818946
acestep never makes the collage.
Anonymous No.105818962 [Report] >>105819023 >>105819037
Guys please understand we haven't advanced AI training since SD 1.4. Literally impossible to make a model on $50k in GPUs.
Anonymous No.105818963 [Report]
>>105818943
SDXL is kino
Anonymous No.105818975 [Report]
Anonymous No.105818993 [Report] >>105819075 >>105819082 >>105819211 >>105819293 >>105819343
>>105818934 (OP)
what an ugly selection of gens, that's why tourists never visit these threads, unlike when you bake a thread with an interesting video or image like mine, people gather to post about it, you don't believe me? just look how many anons replied to my sailor moon video in the last thread
Anonymous No.105819023 [Report]
>>105818962
It's very important that models are only made by people like SAI and BFL so we must make it clear that it's impossible to do it with less than $10 million in hardware.
Anonymous No.105819031 [Report] >>105819075 >>105819082 >>105819293
>>105818951
There you go little buddy
Anonymous No.105819037 [Report] >>105819060
>>105818962
if training is so cheap, where is your model then?
Anonymous No.105819060 [Report] >>105819073 >>105819124
>>105819037
Explain why 8xA100s can't make a model.
Anonymous No.105819073 [Report] >>105819081
>>105819060
if they could, it would exist. that simple
Anonymous No.105819075 [Report]
>>105818993
>>105819031
I really liek, thx
Anonymous No.105819077 [Report] >>105819379
>>105818952
Where is version 1.5, it's been training for a while now, come on!!!!
Anonymous No.105819081 [Report]
>>105819073
No, I don't think so. You people can't even make Loras.
Anonymous No.105819082 [Report] >>105819128
>>105819031
>>105818993
are the initial images made with Wai?
Anonymous No.105819092 [Report] >>105819242 >>105819293
Anonymous No.105819124 [Report] >>105819154
>>105819060
Sure it can make a model, most likely another shit model to put on the pile.

That said, if I have a pet peeve it would be that there seems to be too little focus on the dataset, with careful curation you could likely get away with a lot less training and still get a better result.

Of course careful curation takes time, alternative is to use tightly controlled AI output but then you get very generalised results, like the 100 recycled poses of Flux, and the plastic skin.
Anonymous No.105819128 [Report] >>105819139
>>105819082
the ones that look more old school were made with noobaicyberfix_vrped10, the ones that look sharper and "newer" were made with wai
Anonymous No.105819139 [Report] >>105819242 >>105819293
>>105819128
Anonymous No.105819154 [Report] >>105819166
>>105819124
There's a very wide gap between Flux and SDXL and it ignores the real problem people have with Pony: SDXL has a shitty VAE. So the goal should be less on making a huge parameter model but rather make a good enough model has better reconstructions with a plan for scaling. Even a 2B transformer model would outperform SDXL and you can always scale/grow the model to 4B once your first version is done.
Anonymous No.105819166 [Report] >>105819225
>>105819154
Is that not just Lumina 2? 2.6B with 16ch vae
Anonymous No.105819211 [Report]
>>105818993
>slutty sailor moon image #839,345,241
Oh boy
Anonymous No.105819220 [Report] >>105819257
>>105818630
>many 14b loras are compatible between t2v and i2v.
I wish the new nsfw one did
Anonymous No.105819225 [Report]
>>105819166
Gemma sucks
Anonymous No.105819242 [Report]
>>105819092
>>105819139
hawt
Anonymous No.105819257 [Report] >>105819293 >>105819331 >>105819433
>>105819220
the nsfw-api lora? I didn't like it from my testing, might be useful for t2v but not for i2v, it felt overfit to me and it looks like the guy who trained that lora used low quality videos or had to train with low quality because all the tests I did it, it changed the quality of the original image into very grainy videos
Anonymous No.105819293 [Report] >>105819324 >>105819458
>>105819257
>>105819139
>>105819092
>>105819031
>>105818993
will this dumb nigger ever stop spamming?
Anonymous No.105819324 [Report]
>>105819293
Doesn't look like it.
Anonymous No.105819331 [Report] >>105819334
>>105819257
MOAR saliro moon sloots
Anonymous No.105819334 [Report] >>105819377
>>105819331
go away jeet
Anonymous No.105819343 [Report]
>>105818993
I'm with you on this anon, I'd rather one image/video than this try hard trying to appease to more anons to gain favor.
Anonymous No.105819377 [Report] >>105819395 >>105819435
>>105819334
maybe you should fuck off to that troon discord you crawled from, fagtron?
Anonymous No.105819379 [Report] >>105819469
>>105819077
>Where is version 1.5, it's been training for a while now, come on!!!!
https://files.catbox.moe/amzxng.flac
Anonymous No.105819383 [Report] >>105822845
Anonymous No.105819386 [Report] >>105819394 >>105819742
Why doesn't Comfy have actually useful base nodes?
Anonymous No.105819394 [Report]
>>105819386
I want to know why there are only 2 laten operation that plug into applylatentoperation / applylatentoperationcfg.
Anonymous No.105819395 [Report] >>105819417
>>105819377
you are brown
Anonymous No.105819400 [Report] >>105819409 >>105819412 >>105819425 >>105819433 >>105819441 >>105820388 >>105822944
I'm really curious to see where this tech will be in a few years
Anonymous No.105819409 [Report]
>>105819400
see: >>105818943
Anonymous No.105819412 [Report] >>105819430
>>105819400
probably wont get much better than what it is now.
Anonymous No.105819417 [Report] >>105819423
>>105819395
I'm white, and you're mentally disabled
Anonymous No.105819423 [Report] >>105819438
>>105819417
you are brown and projecting
now go jerk your microcock somewhere else shitskin.
Anonymous No.105819425 [Report] >>105820392
>>105819400
I'd say banned, but then there's the outlaw way, piracy
Anonymous No.105819429 [Report]
i'm anon, and this is jackass
Anonymous No.105819430 [Report] >>105819451
>>105819412
There's a lot better we can get just from going with Flux Kontext's ideas. A Wan video model that takes arbitrary inputs with the prompt/direction. Less img2video (although it could do that) but do everything without necessarily slaving to a start frame.

e.g. "Start like this and end like this but do the these characters".
Anonymous No.105819433 [Report]
>>105819257
>it changed the quality of the original image into very grainy videos
The grainy thing is only here when using i2v
>>105819400
>10 sec
How?
Anonymous No.105819435 [Report] >>105819450 >>105819458
>>105819377
someone is uppity today, let me guess... an angry vramlet that can barely generate images, let alone videos kek
Anonymous No.105819438 [Report]
>>105819423
ok retard, have a good one
Anonymous No.105819441 [Report]
>>105819400
buy our early-access preorder subcription right now to find out in a a few years
Anonymous No.105819442 [Report] >>105819497
>>105816619
I have 128GB of RAM but only 12GB of VRAM...
Can I convert my spare unused RAM into VRAM using some sort of black magic?
Anonymous No.105819444 [Report]
Anon is so dreamy
Anonymous No.105819450 [Report]
>>105819435
I gotta get a sailor slut cosplay for my bitch
Anonymous No.105819451 [Report] >>105819465 >>105819506
>>105819430
>Flux Kontext's
forgot the shitty license? nobody will do shit with it.
>A Wan video model
what makes you think they will release another one?
Anonymous No.105819457 [Report] >>105821149
>>105819314
>I really like these gens. Were the original images made using a Lora, or is this artstyle built into Illustrious / etc.?
Worse, you have to download an entire checkpoint for a single "style".
Anonymous No.105819458 [Report] >>105819466
>>105819435
was replying to >>105819293
btw
Anonymous No.105819459 [Report] >>105819468 >>105819470 >>105819488 >>105819534 >>105819547 >>105819580 >>105822857
Anonymous No.105819462 [Report]
Imagine anon in a sailor slut cosplay.
Anonymous No.105819465 [Report]
>>105819451
>what makes you think they will release another one?
Cyber terrorism against the US in a digital proxy war?
Anonymous No.105819466 [Report]
>>105819458
the guy is right, stop spamming you retarded braindead nigger.
Anonymous No.105819468 [Report] >>105819526
>>105819459
you can convine /pol/ this is real image
Anonymous No.105819469 [Report] >>105819475
>>105819379
This is just mean, I may be retarded, but I'm not a
Anonymous No.105819470 [Report] >>105819526
>>105819459
Oy vey, shut it down!
Anonymous No.105819475 [Report]
>>105819469
wow that's retarded you didn't finish your sentence!! ha-ha (points)
Anonymous No.105819488 [Report] >>105819526
>>105819459
https://www.youtube.com/watch?v=0GN-r5J8DWk
Anonymous No.105819497 [Report]
>>105819442
Yes, in a a way.

AI Inference and training programs can automatically offload parts of the model from vram to ram which means you can use models that doesn't fit 12gb.

For inference through programs like Comfy and Forge, this is basically done automatically, for training you will likely need to modify the settings, like quantization, batch size, resolution, etc in order to make it fit.

It can't do miracles though, you for larger models you will need to use FP8 / Q8 quantized versions.
Anonymous No.105819506 [Report] >>105819515
>>105819451
>what makes you think they will release another one?
They've already announced they will be releasing Wan 2.2, and that it will be an open model, it was all over the 'ai news' a week or so ago.
Anonymous No.105819515 [Report] >>105819553
>>105819506
>they will be
I believe it when I see it, wouldnt be the first time someone cucks out from releasing it.
Anonymous No.105819526 [Report] >>105819535 >>105819543 >>105819547 >>105819580 >>105819583 >>105819794
>>105819468
>>105819470
>>105819488
No one was hurt in the making of this video
Anonymous No.105819534 [Report] >>105819554
>>105819459
People joke but this is what really happened.
Anonymous No.105819535 [Report] >>105819565
>>105819526
such a big plane
Anonymous No.105819543 [Report]
>>105819526
The shadow is pretty fucking impressive
Anonymous No.105819547 [Report]
>>105819459
>>105819526
why does it turn into nu-wtc
Anonymous No.105819553 [Report]
>>105819515
Then why even announce it ? They released Wan, Wan Vace subsequently, also they're a giant chinese tech corporation, they have nothing to fear from a legal standpoint.
Anonymous No.105819554 [Report] >>105819598
>>105819534
just that they didnt have AI back then so the plane was just a video layer on top of the live video showing the towers.
thats why in every video that shows a plane approaching the WTC the flightpath is different.
Anonymous No.105819565 [Report]
>>105819535
For you
Anonymous No.105819580 [Report]
>>105819459
>>105819526
kino finally
Anonymous No.105819583 [Report]
>>105819526
muslim pilots when there's a go-around
Anonymous No.105819584 [Report] >>105819615 >>105819703
Anonymous No.105819598 [Report]
>>105819554
I remember feeling in disarray. Where's what? Like when you watch a movie, but they assemble clips to tell a story, but the 3d space isn't there.
Anonymous No.105819615 [Report]
>>105819584
fuck off already
Anonymous No.105819622 [Report] >>105819634 >>105819635 >>105819650
The porn singularity is nigh!
Repent!
Anonymous No.105819634 [Report]
>>105819622
Sydney Sweeney?
Anonymous No.105819635 [Report]
>>105819622
t-this i-is a blue board, reeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee , please stop, i-i'm gg-gonna report you because i'm a loser
Anonymous No.105819638 [Report] >>105819710 >>105821303
just give me models that do not require autistic prompts
Anonymous No.105819650 [Report] >>105819661 >>105819669 >>105819671 >>105819782
>>105819622
Anonymous No.105819661 [Report] >>105819668 >>105819672
>>105819650
>buttchin garbage
Anonymous No.105819668 [Report]
>>105819661
hey, I've the same chin too
Anonymous No.105819669 [Report]
>>105819650
kling still struggles with hands, its so bad, WAN mogs the shit outta kling and its free kek
Anonymous No.105819671 [Report]
>>105819650
Posts like this really need a picture of the poster attached.
Anonymous No.105819672 [Report] >>105819707
>>105819661
>Looking at her face
Faggot detected
Anonymous No.105819680 [Report]
It makes more sense to upscale first, then interpolate, right?
Anonymous No.105819703 [Report] >>105819729
>>105819584
Personally I'm enjoying your contributions
Anonymous No.105819705 [Report] >>105819726
this is the first model to recognize that the arrow hit the head (instead of apple).
very impressive for a 3b
Anonymous No.105819707 [Report]
>>105819672
newfag detected
Anonymous No.105819710 [Report]
>>105819638
The autism is what makes it work.
Anonymous No.105819726 [Report]
>>105819705
I think the reality is you're accustomed to the lie that we need 12B parameters to do basic things
Anonymous No.105819729 [Report]
>>105819703
cheers anon
Anonymous No.105819733 [Report]
B parameters? It's time to go T!
Anonymous No.105819737 [Report] >>105819775
Anonymous No.105819740 [Report]
Anonymous No.105819742 [Report]
>>105819386
so 60 people make the same custom nodes for the same use case :^)
Anonymous No.105819761 [Report] >>105819770
Can't we all just get along?
Anonymous No.105819770 [Report] >>105820072
>>105819761
we cant unfortunately, as long as brown people exist they will shit up everything.
Anonymous No.105819775 [Report] >>105819796
>>105819737
Captures the style well, is it a MAD magazine lora ?
Anonymous No.105819780 [Report]
Anonymous No.105819782 [Report] >>105819794 >>105819797 >>105819808 >>105820072
>>105819650
i'm getting bored of wan and local video genning as whole. Non of gens are any good and I'm stuck with the 480p. Self forcing was a nice game changer but my wans gen are still shit and sometimes even shitter than my framepack gens.
Anonymous No.105819793 [Report] >>105819821
please make the ugly stop
Anonymous No.105819794 [Report]
>>105819782
>i'm getting bored of wan and local video genning
Perhaps something other than 1girl like >>105819526
Anonymous No.105819796 [Report]
>>105819775
its actually just Flux Kontext and an image of Alfred E. Neuman
Anonymous No.105819797 [Report]
>>105819782
>make celeb porn
>sell it for money
>??
>profit
Anonymous No.105819808 [Report]
>>105819782
>Non of gens are any good
so why do you insist on posting them?
are you retarded?
Anonymous No.105819809 [Report] >>105819816
>infinite canvas
>gets bored
Anonymous No.105819815 [Report]
>skill
issue
Anonymous No.105819816 [Report]
>>105819809
The trick to not get bored is to have a project.
Anonymous No.105819821 [Report]
>>105819793
it's local diffusion
Anonymous No.105819826 [Report] >>105819889 >>105819902 >>105819910
can Wan make this mother daughter due kiss?
Anonymous No.105819828 [Report]
GOOD MORNING SAAAAAR
Anonymous No.105819860 [Report]
Anonymous No.105819861 [Report] >>105822831
Anonymous No.105819877 [Report]
please make the pretty start
Anonymous No.105819882 [Report]
Anonymous No.105819889 [Report] >>105819896
>>105819826
what's it worth to ya
Anonymous No.105819896 [Report]
>>105819889
nothing
Anonymous No.105819902 [Report] >>105819907
>>105819826
wait, how the fuck is this just from a mainstream lingerie ad lmao
Anonymous No.105819907 [Report]
>>105819902
Amazing how mid her daughter is.
Anonymous No.105819910 [Report] >>105819920
>>105819826
It can make them do more than kiss... sir!
Anonymous No.105819920 [Report]
>>105819910
like scissor fight?
Anonymous No.105819941 [Report] >>105819998 >>105820039
Anything new on the realism front? Or are biglust, bigASP, lustify and their merges still the best we've got?
Anonymous No.105819942 [Report] >>105820321
Anonymous No.105819998 [Report] >>105820039
>>105819941
Chroma is your best bet, despite being trained by a furry
Anonymous No.105820039 [Report] >>105824227
>>105819941
>>105819998
and also go ahead and dl it cuz it might get b&

get 27, and the latest 42. maybe 43 will come out soon.
Anonymous No.105820058 [Report]
Anonymous No.105820072 [Report] >>105820082
>>105819770
So that's how it is on this bitch of an earth.
>>105819782
I just can't get behind the wait time for video gens. And API just doesn't do it for me.
Anonymous No.105820082 [Report]
>>105820072
>pic
maximum SOVL
Anonymous No.105820118 [Report] >>105820129 >>105820169
In Ai-toolkit, other optimizers like prodigy aren't showing in the UI. Is there anything I'm missing?
Anonymous No.105820129 [Report] >>105820161
>>105820118
>hasn't transcended the UI
Anonymous No.105820145 [Report]
>>105818943
AAAAAIIIIIÌEEEE
Anonymous No.105820161 [Report]
>>105820129
I like monitoring my progress remotely with the UI. prodigy_8bit.py exists inside toolkit/optimizers, so i'm not sure why it's not selectable. I wonder if something in the ui has to be updated
Anonymous No.105820169 [Report]
>>105820118
>UIcuck
Enjoy your slop
Anonymous No.105820173 [Report]
>>105818943
it do be like that
Anonymous No.105820188 [Report]
Anonymous No.105820198 [Report] >>105820207 >>105820238
Anonymous No.105820200 [Report] >>105820331
Retard here, does the learning speed depend on the captions and/or data quality? With the same settings and roughyl same amount of images for data sets I got wildly differing learning speeds on different lora trainign attempts. On Onetrainer btw
Anonymous No.105820207 [Report]
>>105820198
corona chan?
Anonymous No.105820234 [Report]
>>105818943

Couldn't we train the next checkpoints and loras in a textual prose style like Flux, rather than rigid tagging systems?

I think that's the main disadvantage we have right now. Or at least that he handles both things, that he can understand in more abstract terms, and SDXL gives us a little bit of Kino. "Sad atmosphere" and, apart from all the labels, it gives us some surprises in the scene, the composition, and body language and camera angle.

Also think the main problem with SDXL its the poor context.
Anonymous No.105820238 [Report] >>105820275
>>105820198
goth deepseek chan?
Anonymous No.105820268 [Report] >>105820415 >>105821131
Anonymous No.105820275 [Report]
>>105820238
I forgot that was a thing
Anonymous No.105820277 [Report]
>>105818943
In addition, over time, the Loras and merges of the community are making the models more autistic. I was testing last year's models, and they have more breathing room. Have you ever tried not sending any prompts and having the model automatically send you an anime girl? That's what I call dataset poison.
Anonymous No.105820281 [Report] >>105820337
>>105818943
GPUs aren´t improving fast enough. SDXL-sized models are still the most convenient for most people. Hell there are probably a lot of people who struggle with SDXL models and wish we were still at 1.5 sizes.
Anonymous No.105820313 [Report] >>105821956
>>105818943
Flux is slop also.
Anonymous No.105820321 [Report]
>>105819942
Shakira
Anonymous No.105820331 [Report]
>>105820200
How fast a model learns depends on how aware the model already is about the concept, it then also depends on your signal to noise ratio, the signal are the shared characteristics and features of the images and captions you use, the noise is all the irrelevant features and characteristics every image inherently has. Lots of noise means the model can't find the signal.
Anonymous No.105820337 [Report] >>105820378
>>105820281
I like this artstyle, what is called? Share your metadata!
Anonymous No.105820349 [Report]
There, anon said it. Fun is over boys, we're cancelling AI. We've had our fun, but there's absolutely nothing more to do with it. Mediocre 1girl is the limit. All's well that ends well, but it's time to find a new hobby. See you around the annual grass touching event.
Anonymous No.105820364 [Report]
Anonymous No.105820378 [Report] >>105821149
>>105820337
I am experimenting with base gens with illustrious and then img2img with pixelwave flux using joycaption to caption the former. It isn´t in one workflow right now and results are kind of random.
Anonymous No.105820388 [Report] >>105820397
>>105819400
>what is autoregression?
Anonymous No.105820392 [Report] >>105820399
>>105819425
You guys keep saying this but literally nothing has happened in the past 3 years. You guys always cry about censorship too but you can gen thousands of big titty monsters more easily than ever.
Anonymous No.105820397 [Report] >>105820429
>>105820388
a dead end and autorepression is experimenting with diffusion now
Anonymous No.105820399 [Report] >>105820420
>>105820392
enshittification is happens, look at google search
Anonymous No.105820409 [Report]
mmm, monsters
Anonymous No.105820415 [Report]
>>105820268
Can I get the workflowbox for posteri... post... for keeps.
Anonymous No.105820420 [Report] >>105820478
>>105820399
I'm genuinely curious if you consider your braindead parrot as an example of enshittification. It's curious how the people who say this are also people who did literally nothing hoping someone will pave the road to the promised land for them.
Anonymous No.105820429 [Report] >>105820439
>>105820397
I disagree
Anonymous No.105820432 [Report] >>105820444 >>105822444
Anonymous No.105820439 [Report] >>105820465 >>105821780
>>105820429
>bro let's do autoregression on 1,000,000 pixels (1024x1024)
How many LLMs have 1 million context?
Anonymous No.105820444 [Report] >>105820530
>>105820432
kys
Anonymous No.105820465 [Report] >>105820468
>>105820439
6
Anonymous No.105820468 [Report] >>105820474
>>105820465
Looking forward to your finetune.
Anonymous No.105820474 [Report]
>>105820468
thanks
Anonymous No.105820478 [Report] >>105820493
>>105820420
cool story bro, enjoy your shitty google
Anonymous No.105820481 [Report]
Anon is so ambitious
Anonymous No.105820493 [Report] >>105820497
>>105820478
That's not what I said. Enjoy the shit because no one is going to do anything nice for you. We're simply going back to the mean of a low trust society which you helped create.
Anonymous No.105820497 [Report]
>>105820493
ok retard
Anonymous No.105820530 [Report] >>105822307
>>105820444
use soap and learn how to use a toilet
Anonymous No.105820550 [Report] >>105820578 >>105820583 >>105821045
It's always weird to just post gens without a comment, I think.
Anonymous No.105820578 [Report]
>>105820550
I don't think so.
Anonymous No.105820583 [Report] >>105820619
>>105820550
>Chroma_final
which version is the final one friend?
Anonymous No.105820619 [Report] >>105821045
>>105820583
That's just the image name from my last step after upscaling. I always gen with the latest checkpoint, this is v42 detail calibrated. Sorry for the confusion.
Anonymous No.105820831 [Report] >>105820848
any acestep finetunes?
Anonymous No.105820848 [Report] >>105820873
>>105820831
CivitAI etc would never allow them since RIAA would be on them with a lawsuit in a heartbeat.
Anonymous No.105820853 [Report]
acestep actually is udio 1.0 tier, except that acestep workflows have not yet solved the cfg problem. When you listen to the examples, realize it's obvious they are cooking the cfg super bad.
Anonymous No.105820873 [Report]
>>105820848
We need chinese (with English settings)
Anonymous No.105820987 [Report] >>105821032
Remember to stay hydrated while prompting.
Anonymous No.105821003 [Report]
Anonymous No.105821024 [Report] >>105821131 >>105821148
Anonymous No.105821032 [Report] >>105821066
>>105820987
Thank you fox chan, I will!
Anonymous No.105821045 [Report]
>>105820550
>>105820619

These are neat. Very outlandish in terms of style, I like 'em.
Anonymous No.105821066 [Report]
>>105821032
OK d*bo
Anonymous No.105821093 [Report] >>105821149 >>105821185
Every now and again, the shading and lighting turn out really nice. I wonder if there's a way to prompt it more consistently (on illustrious).
Anonymous No.105821131 [Report]
>>105821024
>>105820268
Mass reporting this retarded spamming imbecile
Anonymous No.105821148 [Report]
>>105821024
slowest shit i've ever seen.
Anonymous No.105821149 [Report] >>105821186 >>105821266
>>105821093
>>105820378
>>105819457

What the story of illustrious? I am a newbie.
There are a lot of merges with that name. And now theyr 3.0 it's via paywall in TensorArt, but there are models free, until 2.0.
Anonymous No.105821185 [Report] >>105822628 >>105822647
>>105821093
That's not an acestep song with a fox singing.
Anonymous No.105821186 [Report]
>>105821149

Sorry, I wish I knew how to answer that question. I just use a model based off of Illustrious (KiwiMix XL v3). I know nothing about paywalls or TensorArt or anything of the sort.
Anonymous No.105821250 [Report]
Anonymous No.105821266 [Report] >>105821336 >>105821379
>>105821149
Small Korean team, an alleged "leak" of the v0.1 model. The model is times better than the previous best Pony v6, the team says this is but a preview and the actual release will be times better. But allegedly since it's a leak their bosses put a stop to the release, that v0.1 is "this is all you get, but here's a paper on it". People start to finetune it, NoobAI etc.
According to the paper the better versions, 1.0, 2.0 etc are all done when 0.1 leaks. But we don't see them for many months when due to what the community does to 0.1 they aren't needed nor wanted anymore. But to make things worse the Korean team fucks up by "paywalling" the releases even though they originally promised to open weights it all (with more papers too). Dumb, sad, not important anymore. They lost their credibility, almost, but since they also released a 0.0.1 pre-alpha of the Lumina 2 finetune there's like a speck of trust preserved. Though there has been no public news about any new versions, maybe in their cord but doubt, so they're as good as dead now.
Neta-Lumina is all now.
Anonymous No.105821303 [Report]
>>105819638
No
Anonymous No.105821316 [Report]
Does anyone know why Skip Layer Guidance doesn't work with self-forcing?
Anonymous No.105821324 [Report]
Anonymous No.105821336 [Report]
>>105821266
>release your own model early, claim that's why you can't release better versions
I love all the clever ways companies renig.
Anonymous No.105821379 [Report] >>105821685
>>105821266
Thanks for your answer!
Another question but for everyone.
My problem is that my gens aren't random enought. I make a batch of 8 gens and are all the same but the seed is different. Does anyones know how to fix that?
Anonymous No.105821384 [Report]
Anonymous No.105821414 [Report]
>trained my first chroma lora on v42
>perfect likeness
damn, can't wait until it's complete in a few more weeks!
Anonymous No.105821477 [Report] >>105821618
Anonymous No.105821478 [Report] >>105822474
>damn, can't wait until it's complete in a few more weeks!
I thought this wasn't the politics board
Anonymous No.105821563 [Report] >>105821618
Anonymous No.105821573 [Report]
Anonymous No.105821618 [Report] >>105821704
>>105821563
boooring, where's the acestep?

>>105821477
> no seppaku
Anonymous No.105821679 [Report] >>105821693 >>105822092 >>105822863
finally got new pc and trying this shit for first time ever
Anonymous No.105821685 [Report]
>>105821379
Some models are overcooked so you get the same pose instead of variations of that pose despite changing seeds.
Anonymous No.105821693 [Report] >>105821706
>>105821679
what's the prompt? or catbox pls
Anonymous No.105821696 [Report]
Anonymous No.105821704 [Report]
>>105821618
>acestep?
red pill me on this
Anonymous No.105821706 [Report] >>105821741 >>105822092
>>105821693
some variation of

wall of flickering CRT televisions playing horrific violent VHS slasher film clips, center figure ominous male silhouette, morphing between abstract angelic and demonic forms, electric blues and toxic reds, melancholy meets horrorcore aggression, VHS haze, found footage

Idk i kept messing with it
Anonymous No.105821741 [Report]
>>105821706
thx
Anonymous No.105821767 [Report] >>105821810
Anonymous No.105821780 [Report]
>>105820439
1kk pixels have channels
Anonymous No.105821796 [Report] >>105821817 >>105821883 >>105823298
Do you really need 20-30 reference images of a character from different angles to accurately capture their artstyle and likeness?
Anonymous No.105821810 [Report]
>>105821767
is that who i think it is
Anonymous No.105821817 [Report]
>>105821796
At the end of the day any dataset is "I want more of this". If you have diversity you are saying "I want this thing that looks like this don't change it".
Anonymous No.105821883 [Report] >>105822127
>>105821796
No, but the less images you have, the quicker it will train, and thus overtrain, meaning you will get very little flexibility.

For example, you don't have any training images with the character having outstretched arms, but you want to be able to generate it, if the model can be trained for long enough without overtraining, it will learn 'grok' the character and be able to portray it with outstretched arms.

If it can't be trained for long enough to prevent overtraining, it will do a poor attempt at portraying the character with outstretched arms, and if you overtrain it, it won't be able to do the character with outstretched arms at all (since you had no such training images) because it will pretty much just generate near 1:1 copies of your training images.

It's impossible to say exactly where the 'enough images' line is drawn, but 20-30 is generally a good starting point.
Anonymous No.105821956 [Report] >>105822024 >>105823297
>>105820313
flux barely has a usecase for the good stuff unlike sdxl and wan
Anonymous No.105822014 [Report]
Anonymous No.105822015 [Report] >>105822097 >>105822729
>2D is lagging behind 3DPD
its not fair...
Anonymous No.105822024 [Report]
>>105821956
I like it for training loras of artstyles, I find it very good for this purpose, assuming there's no nudity, because then one out of ten gens will do remotely decent nipples and genitals are even worse.

For real people, no, you can get rid of the Flux chin but you can't get rid of plastic skin and dead eyes, and of course the same problems with nudity.

It's a shame.
Anonymous No.105822043 [Report]
faux-to realistic
Anonymous No.105822065 [Report]
Anonymous No.105822089 [Report] >>105822305 >>105822498 >>105823340
I would like to point out that this gen is perfectly SFW because there are no nipples shown
Anonymous No.105822092 [Report]
>>105821679
>>105821706
>finally got new pc
let him cook
Anonymous No.105822097 [Report]
>>105822015
still nice. we need more anime style for based wan
Anonymous No.105822101 [Report] >>105822110 >>105822122 >>105822178
Is there a good, up-to-date guide for getting started with wan? I heard the rentry guide is outdated.
Anonymous No.105822110 [Report]
>>105822101
https://github.com/deepbeepmeep/Wan2GP
easiest option
Anonymous No.105822122 [Report] >>105822138 >>105822174
>>105822101
it boils down to using either pinokio or uncomfy UI
i think
Anonymous No.105822127 [Report] >>105822187
>>105821883
>For example, you don't have any training images with the character having outstretched arms, but you want to be able to generate it, if the model can be trained for long enough without overtraining, it will learn 'grok' the character and be able to portray it with outstretched arms.
Can you re-phrase this? How would you train a model to generate a character with outstretched arms without a reference image? What does it mean to "grok" the character? And how do you ensure such a process goes smoothly?
Anonymous No.105822138 [Report] >>105822174 >>105822217
>>105822122
That's just for installing. I'm more interested in a configuration guide and what options I should be using for the best combination of quality/duration.
Anonymous No.105822174 [Report] >>105822182 >>105822320
>>105822122
you don't need pinokio. you can just use the gradio the repo comes with

>>105822138
the optimizations are all trade offs. biggest speedup is self-forcing Lora (should have a link in the guide), there isn't much point to the duration extending addons since most outputs past 81 frames just suck
Anonymous No.105822178 [Report] >>105822391
>>105822101
whats outdated about it?
Anonymous No.105822182 [Report]
>>105822174
also, tea/mag cache for an extra boost but piling on opts starts wrecking the quality every now and again. you can control the strength and slow it down a little for better outputa
Anonymous No.105822187 [Report] >>105822618
>>105822127
The model knows what 'outstretched arms' mean so it can apply this to basically anything, however to effectively apply it to the character you are training, it needs to 'learn' the character well enough so that it can do a good job of showing it in different poses than those in your training data.

And to learn the character traits well, it needs to train for a long enough time, and this is where having too few images becomes a problem, since it will learn very fast but not well, and with well I mean be able to do things with the character (like poses) than what is in your training set. You can try mitigating this by having a lower learning rate, or introducing regularization images to slow learning down, but the best solution is to have more images.
Anonymous No.105822188 [Report]
Anonymous No.105822217 [Report]
>>105822138
The Rentry workflow is fairly optimized for just basic i2v or t2v. You can just change from 480p to 720p and you're fine. If you want even more quality, don't use TeaCache or don't use Self Forcing and experience 30+ minute generations. If you want more duration, you RifeXRope or learn to VACE or stitch end frames but that has it's own issues.
Anonymous No.105822285 [Report]
a decent bake with a static collage, would you look at that. nice touch with the extra collage in the middle. who made that cool gen in the top row, the one in the middle? report for interrogation
Anonymous No.105822305 [Report]
>>105822089
Thank you. I showed this to my boss and he is seething yet powerless to reprimand me. He even sent it to HR and they said there is nothing they can do because there are no visible nipples.
Anonymous No.105822307 [Report]
>>105820530
kek
Anonymous No.105822320 [Report]
>>105822174
>there isn't much point to the duration extending addons since most outputs past 81 frames just suck
no they dont. I use the rife node to bump it to 7 seconds and there's no issues.

>biggest speedup is self-forcing Lora
which also has a massive trade off, that being neutered motion and general motion fidelity. you cannot 'fix' it by increasing the fps or prompting. it is the equivalent of taking a 12fps video and then trying to interpolate it to 60fps. it wont ever look good because it simply lacks the key frames.
Anonymous No.105822357 [Report]
Anonymous No.105822391 [Report]
>>105822178
ignore him. it's an ad for that shitass wan2gp ui
Anonymous No.105822400 [Report]
hell yeah
Anonymous No.105822444 [Report]
>>105820432
kek'd
Anonymous No.105822464 [Report]
Anonymous No.105822470 [Report] >>105822523
>saas models are better than local
it's over
Anonymous No.105822474 [Report]
>>105821478
Anonymous No.105822481 [Report] >>105822497
Question to Chroma users: which prompts do you use to avoid the "AIslop" look altogether?
Anonymous No.105822497 [Report]
>>105822481
I just generate until I hit a lucky seed
Anonymous No.105822498 [Report]
>>105822089
now do a vagina
Anonymous No.105822523 [Report] >>105822538 >>105822550 >>105822563
>>105822470
Are you telling me a massive model that can't even remotely run on consumer hardware is more powerful than a model than can run on consumer hardware, holy shit, when did this happen ?

Next you're going to tell me that the screen at my local cinema is larger than the one in my living room, you're right, it's all over.
Anonymous No.105822538 [Report] >>105822567
>>105822523
Even if you had the hardware you still wouldn't be able to beat SaaS because their models are proprietary, retard.
Anonymous No.105822550 [Report] >>105822570
>>105822523
>Are you telling me a massive model that can't even remotely run on consumer hardware
Cope.
Most SaaS image models are at most 20b, at least we can speak for Seedream and the other one I forgot the name. Probably Krea's model is not beyond that, too.
I can definitely run those with my 48gb hardware.
Anonymous No.105822563 [Report] >>105822582
>>105822523
dont let the /api diffusion general/ trolls get to you
Anonymous No.105822567 [Report]
>>105822538
The models are better because they are HUGE, not because they are proprietary.

If the SAAS models from big tech were not better than local models it would be crazy, they spend insane amounts of money making big ass models that needs extreme hardware to run.

The actual shocking part is that the gap is so small as it is.
Anonymous No.105822570 [Report] >>105822617
>>105822550
Things you just pulled out of your ass
Anonymous No.105822578 [Report] >>105822586 >>105822606 >>105822631
It feels wrong to keep a model loaded while I'm not actively generating, as in I go on a hike or take a nap. Is it wrong?
Anonymous No.105822582 [Report]
>>105822563
True, responding to them was a rookie mistake
Anonymous No.105822586 [Report] >>105822605 >>105822627
>>105822578
Why would it be wrong
Anonymous No.105822605 [Report]
>>105822586
Not him, but won't it get lonely?
Anonymous No.105822606 [Report]
>>105822578
More like it feels wrong if my GPU isn't generating or training, it feels like I'm wasting time
Anonymous No.105822617 [Report] >>105822666
>>105822570
Not really
Anonymous No.105822618 [Report] >>105822709
>>105822187
Do they have to be like character sheets or reference files? Can models learn what an anime character looks like based on a series of screenshots?
Anonymous No.105822627 [Report] >>105822641
>>105822586
In the same way I turn off my computer when I'm not using it (as a show of respect), I feel I must do the same and unload any models. Do you not feel the same?
Anonymous No.105822628 [Report]
>>105821185

Maybe foxes make terrible singers.
Anonymous No.105822631 [Report]
>>105822578
just unload it? comfyUI has a button for that
Anonymous No.105822634 [Report] >>105822722 >>105822776
Anonymous No.105822641 [Report]
>>105822627
I let my computer run 24/7 and literally never turn it off, even if I walk away I want this nigga to run.
Anonymous No.105822646 [Report] >>105822680
Anonymous No.105822647 [Report] >>105822668
>>105821185
why do you retards keep mentioning acestep? It's a terrible model, I've never seen anything noteworthy being done with it
Anonymous No.105822666 [Report]
>>105822617
kek, some shitty model from march 2024 that was an also-ran from a small startup.

are you for real ? is this your 'proof' ?
Anonymous No.105822668 [Report] >>105822731
>>105822647
I used to think Loras could save it, but it seems they barely make a difference, lol
It's also heavily biased towards shitty rap/hip-hop "songs" (if you can even call those songs, for me they are not music) and zoomer pop
Anonymous No.105822680 [Report] >>105822707 >>105822742
>>105822646

Damn, this turned out really good. Did you have to fine-tune your gen, or did you just get a lucky seed?
Anonymous No.105822707 [Report]
>>105822680
>6 fingers
>illogical pose

>really good
Anonymous No.105822709 [Report] >>105822764 >>105822803 >>105822828
>>105822618
Not really, but the results will be much better.

Like for example having a side view is good, but the model should probably be able to do a decent side view even if you only have 3/4 views of the character.

Also it is a good idea to caption things like front view, side view, back view on corresponding images, I would suggest you put it last in the prompt, like: 'blabla character wearing dark hood and holding katana, front view'

As for screenshots, sure, but it will try to mimic the look of those screenshots, not just the character. If you want to teach a character, the best way is actually to have many different style images of said character, like for example you want to teach the model Harley Quinn, best thing would be to pick some line art, some painted artworks, some cosplay images etc, and make sure that the character traits are the same, blonde hair, the same outfit etc, this way the model will see that character pattern rather than picking up art styles, composition etc.
Anonymous No.105822720 [Report]
Anonymous No.105822722 [Report]
>>105822634
good stuff
Anonymous No.105822726 [Report]
what's the formula for determining the amount of steps needed based on how many images you have?
Anonymous No.105822729 [Report]
>>105822015
I think what makes this look weird is the high framerate. Anime is usually something really low like 10 fps or something.
Anonymous No.105822731 [Report] >>105822745
>>105822668
Upcoming version 1.5 will be great, have faith anon!

It's true that lora training was bad, I've seen them comment in the issues that you should do full finetune if you want decent results.
Anonymous No.105822742 [Report]
>>105822680
The gen itself is decent
Anonymous No.105822745 [Report]
>>105822731
>Upcoming version 1.5 will be great, have faith anon!
I'd have to see promising material (outputs) from the alleged version that can make me restore my "faith"
Anonymous No.105822754 [Report] >>105822891 >>105823338
Anonymous No.105822764 [Report]
>>105822709
Understood, thank you very much.
Anonymous No.105822776 [Report]
>>105822634
Most interesting image I've seen in these threads in ages. Makes you wonder what she's thinking.

Also getting a european vibe from it.
Anonymous No.105822803 [Report] >>105822886
>>105822709
Out of curiosity, is it common for ppl to use a program like photoshop to crop characters out of their environment and put them on a whitepaper character sheet?
Anonymous No.105822828 [Report]
>>105822709
I remember training a lora for my mmo character and it seemed to get the face fairly well without the game graphics bleeding in too much. I think if you're careful with the tagging screenshots should be fine.
Anonymous No.105822831 [Report] >>105824129
>>105819861
I like

Alexa Bliss?
Anonymous No.105822845 [Report] >>105822865
>>105819383
I like the artstyle, not so much the content
Anonymous No.105822857 [Report]
>>105819459
the greatest ai video of all time
Anonymous No.105822863 [Report] >>105823206
>>105821679
That's good stuff. You'll go far.

specs?
Anonymous No.105822865 [Report] >>105822903
>>105822845
What kind of monster hates children?
Anonymous No.105822886 [Report]
>>105822803
I don't know if it's particularly common, but if you want to train on for example a anime character, having that character against a white background in many of the images is likely ideal since then its just a character and whitespace which will remove any ambiguity as to what you want the model to learn.

You can also train with masking, where you create a image mask to tell the model what it shouldn't try to learn, this is time consuming though.
Anonymous No.105822891 [Report] >>105823338
>>105822754
armor detail and weapon coherence has advanced pretty far from the old days
Anonymous No.105822896 [Report]
Anonymous No.105822903 [Report] >>105822933
>>105822865
Better question: why does merely noticing pederast behavior seemingly always label the observer as the child-obsessed individual themselves?
Is it supposed to be some kind of anonymous culture reference?
Anonymous No.105822914 [Report] >>105822964
Been off the saddle for a year or so.

What's the current meta for anime 1girl gens?
Anonymous No.105822918 [Report]
found out why ai-toolkit ui didn't have the other optimizers. its just outdated

goto ai-toolkit\ui\src\app\jobs\new\SimpleJob.tsx

{ value: 'adamw8bit', label: 'AdamW8Bit' },
{ value: 'adafactor', label: 'Adafactor' },
{ value: 'automagic', label: 'Automagic' },
{ value: 'prodigy_8bit', label: 'Prodigy8Bit },


now i need to figure out why there is no option for lr_scheduler in the ui at all. like how the fuck do you use cosine_with_restarts
Anonymous No.105822933 [Report] >>105822958
>>105822903
>when the climate change disaster failed to materialize, Greta went insane and started killing pajeets to decrease overpopulation
Anonymous No.105822944 [Report]
>>105819400
nothing will change. money and censorship have already killed online ai. only local ai is the hope. but we're so slow
Anonymous No.105822948 [Report]
>>105822947
>>105822947
Fresh
>>105822947
>>105822947
Anonymous No.105822958 [Report] >>105823350
>>105822933
CHEKT
Anonymous No.105822964 [Report]
>>105822914
https://rentry.org/comfyui_guide_1girl#up-to-date-comfyui-guide-for-1girl-and-beyond
Anonymous No.105823206 [Report]
>>105822863
5060ti 16gb
and 32gb ram ryzen 7600
Anonymous No.105823284 [Report] >>105823312
Has anyone tried using Kontext-Dev for spot fixing certain sections? 90% of the time it just wastes my GPU power and spits out an image with no change
Anonymous No.105823297 [Report]
>>105821956
>flux barely has a usecase
There is, but I don't want you to be smart, but to be stupid.
Anonymous No.105823298 [Report]
>>105821796
More images are usually better, but there's no way to predict how a particular model would react to lora training with your dataset. For example, you can train with a subset of your dataset and have drastically different results, good or bad, than training with the full dataset. You can train on a 2.5D model with 2d images and get much better results than with a pure 2D model. There's no way to tell how well your lora would turn out.
Anonymous No.105823312 [Report] >>105823359
>>105823284
kontext has several features, and you need to trigger the correct one.

But kontext also apparently has "safety" that someone needs to disable. At some point it just "nopes" out if tits or something are detected - I think
Anonymous No.105823338 [Report]
>>105822891
Nope, your neural network is being trained to ignore the many nonsensical and/or asymmetrical and/or downright defective details both in >>105822754 and your pic. The brain can filter out constant, unchanging stimuli and predictable patterns to avoid being overwhelmed by sensory input and to focus on what is relevant for a task. These defects are becoming like the feeling of your clothes on your skin or the background noise in your house.
Anonymous No.105823340 [Report]
>>105822089
Its smug aura mocks me.
Anonymous No.105823350 [Report]
>>105822958
>How daaare yoou?!
Anonymous No.105823359 [Report]
>>105823312
Really wish BFL weren’t such pussies
Anonymous No.105823538 [Report] >>105823835
Total noob here, is there any way I can output higher resolution images with limited VRAM?

I have 12GB of VRAM and 1080p seems to be my upper limit for upscaling before I error out due to lack of VRAM. Using ComfyUI with a Radeon GPU if it matters.
Anonymous No.105823628 [Report]
How long until I can use stablediffusion UI but with a video tab? I tried installing this stuff but it's way more complex than setting up the image gens
Anonymous No.105823632 [Report]
Hi I'm retarded. AMA
Anonymous No.105823835 [Report]
>>105823538
Tiled ksampling or upscaling
Anonymous No.105824129 [Report]
>>105822831
yes
Anonymous No.105824227 [Report] >>105824889
>>105820039
what might get banned?
Anonymous No.105824889 [Report]
>>105824227
There's some retarded drama on reddit where some minor dev is reporting chroma for bestiality while hiding behind his minor dev clout.