Thread 106556603

316 posts 122 images /g/
Anonymous No.106556603 >>106556614 >>106556673 >>106558444 >>106559150
/ldg/ - Local Diffusion General
26b Edition

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106553794

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2122326
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Anonymous No.106556614 >>106556620 >>106556648
>>106556603 (OP)
Weak highlights today
Anonymous No.106556620
>>106556614
which one was yours?
Anonymous No.106556648 >>106556655
>>106556614
>Weak highlights today
HunyuanImage and SPRO turned out to be nothingburgers, so there's nothing to showcase recently
Anonymous No.106556655 >>106556669
>>106556648
SPRO is a good technique. too bad they used it on a distilled model
Anonymous No.106556669
>>106556655
they'll use it on Qwen Image soon, we'll see if the method is really the ultimate unslopper
Anonymous No.106556670 >>106558946
I have an obscure problem

I'm trying to gen women with black skin, like actual black skin not african brown skin. And whenever my workflow goes to the next sampler/pass they go from black to brown. I've tried different prompts but no changes have worked. One interesting thing about the problem is even though the sample preview looks like black skin on my high pass, the first frame is brown skin.
Anonymous No.106556673 >>106556819 >>106557257
>>106556603 (OP)
>instagirl grifter slop lora reddit images in the highlights

you can't go any lower than that
Anonymous No.106556694 >>106556708
So, now that the dust has settled. just how censored is Wan 2.2?
Anonymous No.106556700
localchads we will rise again
Anonymous No.106556708 >>106556737
>>106556694
it's too censored...
https://files.catbox.moe/xzc8j0.mp4
Anonymous No.106556729 >>106556796
Worst collage EVER
Anonymous No.106556737 >>106556776 >>106556789
>>106556708
now run this without the lora
Anonymous No.106556747 >>106556751
hi anon,
is qwen inpainting model out?
Anonymous No.106556751 >>106556757
>>106556747
no
Anonymous No.106556757
>>106556751
thank you sir
Anonymous No.106556775
Blessed thread of frenship
Anonymous No.106556776 >>106556792
>>106556737
>Oh you can nail down nails with a hammer easily? Now do it with only your hands, heh.
Smartest brown
Anonymous No.106556789
>>106556737
why do you not want to use a lora? if it works it works
Anonymous No.106556792 >>106556829
>>106556776
you know what he meant in the first place when he first said it, you disengenuous fucking retard.
Anonymous No.106556796
>>106556729
which one was yours?
Anonymous No.106556801 >>106556835
>Use my civitslop lora SIR its trained off best henti SEX and GOODEST CAPTIONS
Anonymous No.106556819
>>106556673
ugliest collage set in memory, op really must be that guy
Anonymous No.106556829
>>106556792
If someone can train a lora on nothing but porn using a basic local gpu, and the model just gets everything because it wasn't lobotomized like a lot of other models, then it's not censored.
Anonymous No.106556835
>>106556801
You forgot to attach your own image showcasing true art?
20Loras No.106556841 >>106556861
Kek, I wanted to see how this turned out. Taking the last frame of the gen and just keep on going.
Even with a reference image it shits itself extremely quickly.
Anonymous No.106556861 >>106556902
>>106556841
def a problem somewhere in the workflow to have that big color shift
Anonymous No.106556877 >>106557053
okay what's the catch local gen? Is my pc a ticking bomb?
Anonymous No.106556894 >>106557088 >>106557111
Does anyone use the clown samplers with wan 2.2?
Picrel is what I use, is it what you'd recommend?
(I don't use lightx2v)

It's quite slow compared to what I was using before (unipc)
20Loras No.106556902
>>106556861
Oh that was just a shitty mediaplayer screenshot I shoved back in, I just wanted to test the seamless part. I'm sure I can do much better if I try.

Saw someone automating the FFLF and having new prompts for each one stacked up.
Anonymous No.106556908
What one anon brought up before is why wan 2.2 uses so much RAM. Isn't it just supposed to load the model into VRAM? Is it really so demanding that it instantly overflows the GPU and loads into RAM?
Anonymous No.106557053 >>106557059 >>106557075
>>106556877
>what's the catch local gen?
If you want good outputs you need to have the skill
Anonymous No.106557059
>>106557053
oh yeah true that. but having a lot of fun now
Anonymous No.106557074 >>106557673 >>106557965 >>106558369
2B cant spell for shit
Anonymous No.106557075 >>106557084 >>106557131
>>106557053
>you need to have the skill
the skill that allows me to buy a 5090?
Anonymous No.106557084 >>106557099
>>106557075
>he thinks better card = better gens automatically
oof...
Anonymous No.106557088 >>106557203
>>106556894
unipc sux

how is your ui different
Anonymous No.106557099
>>106557084
faster prompting with a 5090 allows me to polish my dick while i read the thesaurus. checkmate
Anonymous No.106557111
>>106556894
I think clownsharkbatwing whateverthefuck the author goes by is kinda schizo. I stopped using his nodes months ago.
Anonymous No.106557112 >>106557322
would i need to add a lora to make boobs bigger than what is being genned? or add it to the prompt?
Anonymous No.106557115
how big of a dataset do you guys go for?
Anonymous No.106557131 >>106557139 >>106557210 >>106557229 >>106557250
For chroma, 30 steps is the safest conservative count for me for consistent coherency and composition, and 10 steps on high res seems to be fine with a style lora to keep everything stable. This model is still a bully on hardware, even a 5090
>>106557075
Honestly you needed foresight to get that thing, I knew recent events but if I wasn't so tuned in my instincts would have told me to wait. If you're a burger things are fucked

Posting comparisons for first and second pass now
Anonymous No.106557139 >>106557229
>>106557131
first pass
Anonymous No.106557203
>>106557088
>how is your ui different
Anonymous No.106557210 >>106557229 >>106557233
>>106557131
Just 10 steps? Are you using only a second ksampler or some other nodes?
Anonymous No.106557229 >>106557233
>>106557131
>>106557139
>>106557210
can you please stop seeking attention? you post the most boring slop in the general, blog post about shit nobody cares about and publicly pat yourself on the back for validation. you are so much worse than ani, comfy and even debo. just stfu already
Anonymous No.106557233 >>106557248 >>106557369
>>106557210
>Nodes
No just the webui high res fix
>>106557229
Meds?
Anonymous No.106557248 >>106557269
>>106557233
>Meds?
what, you can't find them? sad, you really need to take them
Anonymous No.106557250 >>106557256 >>106557269
>>106557131
that reminds me about the other day when an anon was saying how he was doing 100 steps for a chroma gen with another 30 on top for a hires fix and then he was complaining about the 10+ minute gen times. kek
Anonymous No.106557256 >>106557397
>>106557250
you are actually replying to that very retard lmao
Anonymous No.106557257
>>106556673
how do we cope
Anonymous No.106557269 >>106557276 >>106557322 >>106557546
>>106557248
4 u
>>106557250
I was that anon, the problem is without a strong style anchor you do need higher steps to reduce style swing. Also the lower the step count, the more volatile the composition
Anonymous No.106557276
>>106557269
your ugly hag found your meds. it's bed time faggot
Anonymous No.106557280
Oh the loser crew is seething because they have nothing going on in the containment zone.
Anonymous No.106557299
Video gen on a 5090:
- using fp16_accumulation -> 19min
- not using fp16_accumulation -> 19min
wtf
Anonymous No.106557314
Anonymous No.106557322 >>106557333
>>106557112
literally just add weights, start with (huge breasts:1.5) and try changing the number

>>106557269
this is upscaled? something is horribly wrong with your workflow
Anonymous No.106557330
>106557322
He's getting desperate because him and his crew kept getting rejected during the time I was gone
Anonymous No.106557333 >>106557351
>>106557322
>(huge breasts:1.5)
thanks man will try! is there anywhere to find the tags that work best with gens?
Anonymous No.106557346 >>106557358
so does going over 5 secs shoot up gen time exponentially? it doesn't increase linearly? 5 secs was 4 mins and then i went to 8 secs and it's now 12 minutes to gen
Anonymous No.106557351
>>106557333
danbooru or any doujin hoster. test each tag individually or in isolation to ensure prompt adherence
Anonymous No.106557358 >>106557388
>>106557346
yes, u just answered your own question
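
For reference, the two timings quoted above (5 secs in 4 min, 8 secs in 12 min) can be turned into a rough scaling exponent. This is only a sketch: two data points cannot tell a power law from a true exponential, and the function name is made up for illustration. It assumes gen time grows like length to some power k, which is what you would expect if full self-attention (roughly quadratic in token count) dominates.

```python
import math

def scaling_exponent(len_a, time_a, len_b, time_b):
    """Exponent k such that time ~ length**k passes through both measurements."""
    return math.log(time_b / time_a) / math.log(len_b / len_a)

# Numbers from the post: 5 s clip -> 4 min, 8 s clip -> 12 min.
k = scaling_exponent(5, 4, 8, 12)
print(round(k, 2))  # 2.34

# What linear and quadratic scaling would have predicted for the 8 s clip.
print(round(4 * (8 / 5), 2))       # 6.4 (minutes, linear)
print(round(4 * (8 / 5) ** 2, 2))  # 10.24 (minutes, quadratic)
```

So 1.6x the length cost 3x the time, a bit worse than quadratic. The extra could plausibly come from the longer clip spilling out of VRAM, but that part is a guess.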
Anonymous No.106557361 >>106557368 >>106557371
--use-sage-attention

thoughts?
Anonymous No.106557368
>>106557361
yes
Anonymous No.106557369
>>106557233
based, how is chroma on Neo?
Anonymous No.106557371 >>106557413 >>106557489
>>106557361
why is it a fucking fag flag?
Anonymous No.106557380
Are the Neta Lumina derivatives good yet
Anonymous No.106557388
>>106557358
Just wanted to make sure I wasn't fucking up somewhere, today i went from 5 hour gen nightmare fuel to getting help from a kind anon and 5 minute gens.
Anonymous No.106557397
>>106557256
Anonymous No.106557413 >>106557474 >>106557477
>>106557371
would you like to reiterate your question so you don't sound like a third worlder?
Anonymous No.106557424 >>106557435 >>106557475
Man VAE really is cancer. One more decode in the WF and it deleted all the warmth the color had
Anonymous No.106557435
>>106557424
orig pre-upscale
Anonymous No.106557452 >>106557458 >>106557467
Hello, I hate Comfy
Anonymous No.106557458
>>106557452
Hello I hate comfy too, let's get married!
Anonymous No.106557467 >>106557506
>>106557452
Same. i cant install neo sadly
Anonymous No.106557474
>>106557413
why is it a flag in the flag append of the program that is used to launch comfyui with bat files in the windows?
Anonymous No.106557475 >>106557487
>>106557424
>Man VAE really is cancer.
yes, that's why I want lodestone to succeed
Anonymous No.106557477
>>106557413
why is it a homosexual tranny command flag instead of a node I can use at runtime?
Anonymous No.106557487
>>106557475
It needs some speed-ups first. Using it as a daily driver is madness. Might be decent as an upscaler tho.
Anonymous No.106557489 >>106557505
>>106557371
>why is it a fucking fag flag?
you can use kj's node to activate it instead >>106555496
Anonymous No.106557505 >>106557511 >>106557523
>>106557489
this breaks qwen you stupid nigger. pay attention
Anonymous No.106557506 >>106557965
>>106557467
why, let me help you
Anonymous No.106557511 >>106557531
>>106557505
it doesn't you retarded monkey, you have 4 options and that one doesn't give the black image, you're so fucking dumb, think before opening your trash mouth
Anonymous No.106557520 >>106557526
How do i make the dude dark-skinned?
Anonymous No.106557523
>>106557505
nope, works on my machine
Anonymous No.106557526 >>106557545
>>106557520
"dark skinned male"?
are you retarded?
Anonymous No.106557531 >>106557538
>>106557511
retard. it doesn't actually work. kijais sage attn node does NOTHING.
try it. turn the flag off then generate an image with kijai set to auto then to disabled. no difference.
now please uninstall your OS and drown your computer.
Anonymous No.106557538
>>106557531
>it doesn't actually work.
absolute skill issue, it works fine on my machine
Anonymous No.106557539 >>106557544 >>106557568 >>106557703
I FUCKING HATE COMFY DEVS SO MUCH IT'S UNREAL. THIS FUCKING SHIT IS MORE A DEBUGGING SIMULATOR THAN A FUCKING IMAGE GENERATOR. I'M SO FUCKING SICK OF IT
Anonymous No.106557544
>>106557539
haha yeah i know right?
so, did you catch that game last night?
Anonymous No.106557545 >>106557563 >>106557569
>>106557526
I tried that and then it consistently does this. a dude humping at the side.
https://files.catbox.moe/pe9axb.mp4
Anonymous No.106557546
>>106557269
it's a fake catjak or he started doing H
Anonymous No.106557563 >>106557585
>>106557545
your lora or prompts are fucked
Anonymous No.106557568
>>106557539
Haha, imagine working with that UI every day!
The more time passes, the more convinced I am that Comfy is more for AI hobbyists than for people who work with AI.
Anonymous No.106557569 >>106557585
>>106557545
what lora? that means the lora isn't trained in black dudes
Anonymous No.106557585 >>106557593 >>106557615 >>106557618 >>106557678
>>106557563
>>106557569
this is the prompt and the lora is https://civitai.com/models/1923528/sex-fov-slider-wan-22
Ohhh so I'd have to find an actual dark-skinned one. rip. this is what i put in the prompt
Anonymous No.106557593 >>106557615 >>106557626
>>106557585
Write actual descriptive sentences, wan is about natural language, it doesn't understand the (xxx:1.5) syntax.
Anonymous No.106557615 >>106557626
>>106557585
what >>106557593 wrote, but also use synonyms, repetitions, etc in the prompt you want
Anonymous No.106557618 >>106557626
>>106557585
if you're doing video gen, weights don't work as other anon said, you have to describe it instead. i thought you were doing SDXL with my original response
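
To make the difference concrete: the (text:1.5) emphasis syntax is something the UI parses for SDXL-style prompting, while a natural-language model like wan just sees the parentheses and numbers as text. A minimal sketch of stripping the weights out of an old prompt before rewriting it as sentences is below. The regex and the function name are illustrative assumptions, not how any particular UI implements its parser, and it ignores nested or escaped parentheses.

```python
import re

# Matches simple emphasis groups such as "(red scarf:1.5)".
# Assumption: no nesting and no escaped parentheses.
WEIGHTED = re.compile(r"\(([^():]+):([0-9]*\.?[0-9]+)\)")

def split_weights(prompt):
    """Return (prompt_without_weights, [(text, weight), ...])."""
    weights = [(m.group(1).strip(), float(m.group(2)))
               for m in WEIGHTED.finditer(prompt)]
    plain = WEIGHTED.sub(lambda m: m.group(1).strip(), prompt)
    return plain, weights

plain, weights = split_weights("a man in a (red scarf:1.5), (night:0.8) street")
print(plain)    # a man in a red scarf, night street
print(weights)  # [('red scarf', 1.5), ('night', 0.8)]
```

The weights list tells you which ideas were being emphasized, so for video gen you can describe those parts in more detail (or repeat them with synonyms) instead of attaching a number.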
Anonymous No.106557626 >>106557649
>>106557593
>>106557615
Oh okay I'll try that.
>>106557618
Ah I should have specified, Yeah I'm doing video gen on wan 2.2
Anonymous No.106557649
>>106557626
cucked by brown penis again frfr ong
Anonymous No.106557657
What too much denoise does to a mf
Anonymous No.106557673 >>106558063
>>106557074
are you post-processing these to add the noise and aliasing?
Anonymous No.106557678 >>106557684
>>106557585
you should give up. right now :D
ur brain too smol
Anonymous No.106557684
>>106557678
grrrr it is!! .___. but still have to try.
Anonymous No.106557703 >>106557770
>>106557539
You can always use Diffusers
Anonymous No.106557770
>>106557703
fuck python in general desu
Anonymous No.106557799
I HATE SNAKES
SNAKES IN MY WALL
SNAKES IN MY PIPES
Anonymous No.106557881
#comfy killed the hype
Anonymous No.106557893 >>106557909 >>106557914 >>106557938 >>106557945 >>106557982 >>106558000
why hasnt anon posted this level of kinosoul with seedream
https://xcancel.com/fofrAI/status/1966142589289329015
Anonymous No.106557909 >>106558037
>>106557893
because this is the local thread. get out shill
Anonymous No.106557914
>>106556332
>I still have yet to see an image that is better than what we can achieve locally.
there you go >>106557893
Anonymous No.106557936 >>106557969
For those using a 5090 on linux: do not upgrade from 575.57.08 (or below) to 575.64.03. At least in my case, the vram usage has gone up. Before, I could fit wan2.2 fp8 entirely in vram; now I also have to send parts of it to ram for it to work.
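
If you want to check where your install sits relative to those two versions, a small sketch is below. The helper names are hypothetical and the "last good" cutoff only encodes this one report, not anything confirmed by NVIDIA. The installed version string can be read with `nvidia-smi --query-gpu=driver_version --format=csv,noheader`.

```python
def parse_version(v):
    """Turn '575.57.08' into (575, 57, 8) so versions compare numerically."""
    return tuple(int(part) for part in v.strip().split("."))

# Versions taken from the post above; the cutoff is anecdotal.
LAST_GOOD = parse_version("575.57.08")
REPORTED_BAD = parse_version("575.64.03")

def at_or_below_last_good(installed):
    """True if the installed driver is no newer than the last version reported fine."""
    return parse_version(installed) <= LAST_GOOD

print(at_or_below_last_good("575.57.08"))  # True
print(at_or_below_last_good("575.64.03"))  # False
```

Comparing tuples of ints avoids the trap of comparing the raw strings, where "575.9.0" would sort after "575.57.08".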
Anonymous No.106557938 >>106557982
>>106557893
it looks like the Wall of Fayth from FFX, beautiful
https://www.youtube.com/watch?v=WBjbY1dwO_Q
Anonymous No.106557945 >>106558419
>>106557893
>>>/g/adt
Anonymous No.106557965 >>106558063
>>106557074
>>106557506
I think it's because there's a lot of conflicts with Easy Comfy installation.
Anonymous No.106557969 >>106557981
>>106557936
Is that why I'm getting fedora shutting down my konsole webui session?
I thought it was because of the model but I never had that problem with flux and now I'm using 64GB of actual ram during generation after 8 or so hours
Anonymous No.106557981
>>106557969
No idea, I just got OOM after OOM.
Anonymous No.106557982 >>106557985
>>106557893
>>106557938
Bigger version.
Anonymous No.106557985
>>106557982
this is actually fucking impressive, the anatomy is on point
Anonymous No.106558000 >>106558009
>>106557893
This does look really good, but I think you can make similar stuff on local. Most people just aren't interested in that kind of art.
Anonymous No.106558009 >>106558102
>>106558000
>I think you can make similar stuff on local
prove it
Anonymous No.106558019 >>106558466
Anyone uses that? What parameters do you use?
Anonymous No.106558037 >>106558069
>>106557909
my point was why werent the cloud shills posting outputs at that level when they were deep in their shill campaign in this thread
Anonymous No.106558063 >>106558216
>>106557965
>>106557673
Missed this. Yes, I have sharpen and noise nodes.
Anonymous No.106558069 >>106558086 >>106558120 >>106559775
>>106558037
It's a bitter circle of anons that have been doing this for years, it could be anything they will try to lower the thread. Look at the threads they come from and you will understand why
Anonymous No.106558086
>>106558069
> dom female
> wearing leash

o-oh my
Anonymous No.106558102
>>106558009
Not gonna lie, I got no idea how to prompt it. You got any prompts?
Anonymous No.106558116 >>106558124
Are there any nodes that just extract the raw prompt text and not add a bunch of json mess and shit? I can't link these to the prompt window. The SDprompt reader didn't work either.
Anonymous No.106558120 >>106558153
>>106558069
try it with 1000 steps. it should be so much better
Anonymous No.106558124 >>106558172
>>106558116
yes
https://github.com/BigStationW/ComfyUi-Load-Image-And-Display-Prompt-Metadata
Anonymous No.106558153 >>106558158
>>106558120
You don't have a rig that can run chroma so you sit here and seethe lol
Anonymous No.106558158
>>106558153
flawless seethe logic anon. well done
Anonymous No.106558172
>>106558124
Fucking finally. I hate the raw metadata shit everything else has.
Anonymous No.106558196 >>106558218
is there any way to use flux kontext or qwen edit to inpaint a lewd feature into a photo?
Anonymous No.106558216
>>106558063
cool, it works well for these, especially the misato earlier
Anonymous No.106558218 >>106558223
>>106558196
can you even use loras on the edit models?
Anonymous No.106558223
>>106558218
yes
Anonymous No.106558253 >>106558268 >>106558301 >>106558436
To clarify, can I crudely photoshop a lewd feature over features and ask qwen or kontext to blend it seamlessly?
Anonymous No.106558268 >>106558301
>>106558253
>qwen or kontext
for sure no for kontext, they explicitly do everything in their power to fight anything nsfw
Anonymous No.106558301 >>106558318 >>106558330 >>106558331
>>106558253
>To clarify, can I crudely photoshop a lewd feature over features and ask qwen or kontext to blend it seamlessly?
You can use inpaint with any uncensored checkpoint to do that, sdxl finetunes work

>>106558268
>for sure no for kontext, they explicitly do everything in their power to fight anything nsfw
Doesn't it require lora trained with image pairs? massive pain in the ass
Anonymous No.106558318 >>106558327 >>106558368 >>106558507
>>106558301
I'm going to need a cat box for this. just, wow
Anonymous No.106558319 >>106558394 >>106559786
CPU got fried Chroma bros. I'm back.
https://files.catbox.moe/muhruw.png

Btw these are basically my first try with Chroma HD Flash. Truly is a blessed model.
Anonymous No.106558327
>>106558318
You couldn't make that image even if he gave it to you
Anonymous No.106558330
>>106558301
Thanks, haven't had much luck w/ sdxl inpainting.
Anonymous No.106558331
>>106558301
>Doesn't it require lora trained with image pairs? massive pain in the ass
It's not really clear yet. Training qwen loras in general is a pain, so there's still a lot of testing to do. Image pairs are the proper method, but it also looks like training the concept alone can sometimes be enough to teach it the necessary information. It's almost impossible to get A-B pairs for many contexts, so hopefully that will work out.
Anonymous No.106558332
Why do people recommend using the wan 2.1 lightx2v lora on wan 2.2? All it does is harm fidelity
Anonymous No.106558368
Civitai added Chroma tag

>>106558318
I'll upload the lora
Anonymous No.106558369 >>106558446
>>106557074
alright buddy
nice try
now have kaine jizz "hussy" over the pages of weiss...
Anonymous No.106558374
finally blessed
Anonymous No.106558379
Anonymous No.106558380 >>106558390 >>106558426
Holy fuck chroma base is so much better than HD for upscaling. How tf can a so called HD model suck at fine details so much?
Anonymous No.106558390 >>106558397
>>106558380
So you can post a reaction image and not a gen displaying that?
Suspicious
Anonymous No.106558394 >>106558492
>>106558319
In case you have a VR headset: 3DSVR-0438
Anonymous No.106558397
>>106558390
I am currently genning only diaper porn so I can't post it here.
Anonymous No.106558419 >>106558433
>>106557945
>implying pedowaifu slop is soulful
Chair and rope
Anonymous No.106558426 >>106558437 >>106558699
>>106558380
>How tf can a so called HD model suck at fine details so much?

Try Chroma HD Flash. It handles 2k just fine.
Anonymous No.106558427 >>106558431 >>106558983
What do I use to start captioning videos? Does local even have an option for that?
Anonymous No.106558431 >>106558455
>>106558427
Type lazy captions by hand.
Anonymous No.106558433 >>106558483
>>106558419
>obese vomit hag lover being toxic to randos
please just take your meds
Anonymous No.106558436
>>106558253
>that question
>that image
This nigga trynna add penises to the women, isn’t he.
Anonymous No.106558437 >>106558492
>>106558426
>flash
Distill doesn't have the same outputs as base. You pay for the speed somehow.
Anonymous No.106558444
>>106556603 (OP)
Oh wow he did a release https://huggingface.co/lodestones/Chroma1-Radiance
Anonymous No.106558446 >>106558477
>>106558369
>kaine jizz "hussy" over the pages of weiss
tf you mean?
Anonymous No.106558455 >>106558460
>>106558431
This is an AI general bruh why would I do that?
Anonymous No.106558460
>>106558455
Garbage in garbage out.
Anonymous No.106558465
holy shit seedream is insane. china absolutely destroyed local
Anonymous No.106558466 >>106558604
>>106558019
>PerpNegAdaptiveGuider
>CFG = 3.6
>cfg_start_pct = 0.25
It's unlikely to help, but that's what I settled on.
Anonymous No.106558477 >>106558841
>>106558446
Kainé right, is called a hussy by Weiss in the events prior to the webm.
Kainé loses control, grabs the stupid book throws his ass to the god damn ground, whips it out, strokes and releases all over his leather bound cover, his "face" so to speak and spells out the word hussy in jizz.
It's symbolic, it represents the trauma that Weiss feels and internalises with humour and the fine line of anger that kainé walks.

idk what emil does, that dude's a gay little skeleton, he probably plays wth his boner.
Anonymous No.106558483
>>106558433
You keep exposing yourself by thinking you are talking to the same person retard, by the way sei shoujo is not western you disabled retard.
I was going to tell you that in the other thread but all I asked for is for anons to make chroma loras.
Anonymous No.106558485 >>106559355
What image-to-video models can I run on 4060 with 8GB vram?
Anonymous No.106558492 >>106558568 >>106558578 >>106558694 >>106559298 >>106559905
>>106558394
>3DSVR-0438
Kino

>>106558437
In case of the Flash experiment? I don't know how, but it's somehow very close to convergence. It's like a completely fixed v48. The default one (which I'm currently using) messes with prompt following, though. There's a way to fix that: the delta weight mixed with the HD weight is pretty strong at 2k and still preserves the prompt following of the original.
Anonymous No.106558507 >>106558593 >>106558617
>>106558318
https://civitai.com/models/1948914/chroma-lora-tsukasa-jun-style
Anonymous No.106558538 >>106558547
does anyone know of a fag Discord group that works collectively on short films (2 minutes or longer)?
im really keen to do something together
Anonymous No.106558547
>>106558538
Ask reddit
Anonymous No.106558553 >>106558562 >>106558732
Can anyone help a guy with a shitty 2060 mobile make some videos? I already have comfyUI installed.
Anonymous No.106558562 >>106558569
>>106558553
You're priced out of this subsection unless you're packing 16 or more vram and a modern card
Anonymous No.106558566
>prompt for penis sniffing
>keep getting fellatio
have to get a lora for fuckin' everything man
Anonymous No.106558568 >>106558607
>>106558492
Basically what would potentially take a bunch of tries/seeds on regular Chroma versions you get on the first or second try with Flash.
Anonymous No.106558569 >>106558573
>>106558562
Shit. Thanks anon. What about normal image generation?
Anonymous No.106558573 >>106558687
>>106558569
struggle bus but you might be able to do light XL?
Anonymous No.106558578 >>106558699
>>106558492
I'm gonna give flash a try. Do you mind sharing your prompt?
Anonymous No.106558593
>>106558507
tyvm anonie!
Anonymous No.106558604
>>106558466
Thanks!
I'm not sure I'll use it, it completely fucks my outputs for some reason.
Anonymous No.106558607
>>106558568
yummy ol like teacher, nice
Anonymous No.106558617
>>106558507
TY man
Anonymous No.106558687
>>106558573
is there any json ready to use for that?
Anonymous No.106558694 >>106558886
>>106558492
please flash her panties.
Anonymous No.106558699 >>106559074
>>106558578
>Amateur photograph of a beautiful Japanese female idol woman sits on a stage chair, performing with an acoustic guitar. She is wearing a white off-the-shoulder top and a short, vibrant yellow miniskirt with her panties slightly visible. With a focused expression, she looks down at her guitar while a microphone stands ready in front of her, suggesting she is singing as well. The surrounding stage equipment indicates she is at a live outdoor concert or festival.

>>106558426
Same but without mention of panties
Anonymous No.106558732 >>106558744
>>106558553
Not a chance. Consider renting time from a cloud GPU provider. Once you get ComfyUI set up you can basically one-click deploy and have a top of the line GPU for a whole day for like $10 or something. Might be even less.
Anonymous No.106558744 >>106558768 >>106558773
>>106558732
Can you suggest a non-scam provider?
Anonymous No.106558768
>>106558744
if you are asking these types of questions you are in for a world of pain trying to figure this shit out. I've used RunPod a few times for training loras, it was fine. You still need to install comfy, download the models/loras (through jupyter), then you can get to fucking around with comfy. good luck lol
Anonymous No.106558773 >>106558810
>>106558744
runpod, vast.ai, tensordock off the top of my head. just remember that you'll be running on other people's hardware, so don't be uploading/genning stuff that would get you in any trouble
Anonymous No.106558810
>>106558773
Thanks for the heads up, I guess I'll save up some to buy a nice GPU in the future.
Anonymous No.106558841
>>106558477
ahh ok
Anonymous No.106558842 >>106558855 >>106558930
local lost
https://blog.comfy.org/p/seedream-40-now-available-in-comfyui
Anonymous No.106558855 >>106558860
>>106558842
hey buddy, we also got qwen-edit controlnet masks today, we still in it
Anonymous No.106558860
>>106558855
>qwen-edit controlnet masks
*yawn*
Anonymous No.106558876
>chroma: 512x512
>seedream: 4096x4096
sigh…
Anonymous No.106558881
>Qwen-edit
>controlnet
do people really?
Anonymous No.106558886 >>106561775
>>106558694
Anonymous No.106558890
>nano
>banana
hehe my penis is bigger
Anonymous No.106558893
Norway will be the salvation
noisy outputs (TM) No.106558902
Chrome?
Pooma more like
Anonymous No.106558920 >>106559004 >>106559020 >>106559093 >>106559136 >>106559378
Babe wake up, a new video model got released
https://huggingface.co/bytedance-research/HuMo
https://phantom-video.github.io/HuMo/
Anonymous No.106558930 >>106558976
>>106558842
SAAS shills really working overtime lately, /ldg/ threads are like anudda shoah
Anonymous No.106558946
>>106556670
you might want to find some images of fully black skinned characters and give them to Qwen or gemini and see how they caption the image

prompting for "dark skin" or "black skin" often results in the same issues as prompting for "young" does: it can mean a lot of different things in different contexts, so the model doesn't really know what to actually do with the token
Anonymous No.106558976 >>106559005
>>106558930
No it's just a small group praying for us all to fall. We have so many eyes on this thread when we shouldn't
Anonymous No.106558983
>>106558427
local has an option technically because you can use Qwen, but you really should use Gemini Pro to make captions for videos/images, that's what the chinese do lol
Anonymous No.106559004 >>106559025
>>106558920
>HuMo-17B
>VideoGen from Text-Image - Customize character appearance, clothing, makeup, props, and scenes using text prompts combined with reference images.
>VideoGen from Text-Audio - Generate audio-synchronized videos solely from text and audio inputs, removing the need for image references and enabling greater creative freedom.
>VideoGen from Text-Image-Audio - Achieve the higher level of customization and control by combining text, image, and audio guidance.
>The model is trained on 97-frame videos at 25 FPS. Generating video longer than 97 frames may degrade the performance. We will provide a new checkpoint for longer generation.
Hum.
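
Quick arithmetic on what that training length means in seconds. The 97 frames at 25 FPS figure is from the quoted model card; the 81 frames at 16 FPS comparison is an assumption about typical Wan settings, not something stated in this thread. The function name is just for illustration.

```python
def clip_seconds(frames, fps):
    """Playback duration of a clip in seconds."""
    return frames / fps

# HuMo, per the quoted model card: 97 frames at 25 FPS.
print(clip_seconds(97, 25))            # 3.88

# Assumed typical Wan settings for comparison: 81 frames at 16 FPS.
print(round(clip_seconds(81, 16), 2))  # 5.06
```

So the trained clip length is a bit under 4 seconds of footage, just at a higher native framerate.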
Anonymous No.106559005 >>106559009 >>106559082
>>106558976
>No it's just a small group praying for us all to fall. We have so many eyes on this thread when we shouldn't
anyone who supports uncensored video/image diffusion supports a pedo bar. no normalfag on the planet thinks you should be able to ai generate literally anything. you have to be a radical libertarian/cypherpunk to think that's ok
Anonymous No.106559009 >>106559025
>>106559005
That's an odd thing to post when those types moved to the anime thread. We don't want them either and have always wanted them out. They can stay there and act like that
Anonymous No.106559020 >>106559055
>>106558920
>still locked to 5 second

i'll pass
Anonymous No.106559025 >>106559029
>>106559004
this is a bytedance release so there's no way its going to be good. it's nice to see that we're almost in the post-character LoRA era where I just need one yearbook photo to blackmail the fathers of schoolchildren systemically

>>106559009
>We don't want them either
you do not make the rules on what types of local diffusion are allowed here
Anonymous No.106559029 >>106559055
>>106559025
Of course and they eventually left because they were mocked and reported for 3 years
Anonymous No.106559055 >>106559119
>>106559020
we don't actually know if it's "locked" the same way wan is until we can test it with more than 97 frames.
also don't discount the framerate increase. there's no need to interpolate anymore. if a similar self-forcing and lora ecosystem shows up, or if it's better for porn/lewds, or if it somehow inferences faster, then it will have its place.

>>106559029
well, the anime pedophiles did maybe. i still think that thread is just one schizo samefagging though
Anonymous No.106559074
>>106558699
Thanks a ton anon.
Anonymous No.106559077 >>106559085 >>106559110
if it weren't for drooling tards like this, wan 2.1 nunchaku would of been here by now
Anonymous No.106559082 >>106559110
>>106559005
Pure thought police nonsense argument

Ai generated images / video of 'uncensored' nature not shared with anyone can be nothing more than 'thought crimes'
Anonymous No.106559085 >>106559160
>>106559077
>wan 2.1 nunchaku
wan 2.2 exists you know? kek
Anonymous No.106559093 >>106559101 >>106559110 >>106559117 >>106559151
>>106558920
This is a wan 2.2 tune you poopoo head.
Anonymous No.106559101
>>106559093
>poopoo head
you must be 18 to post here
Anonymous No.106559110
>>106559077
>would of

>>106559082
>Ai generated images / video of 'uncensored' nature not shared with anyone can be nothing more than 'thought crimes'
agreed, but discussion about them fundamentally doesn't matter because they're not "real". never understood why people ever discussed the legality/unlegality of simple possession, possession of literally anything on the planet is legal if you're not retarded. the only time it becomes relevant is when its "shared" anyways e.g. when the police find it or you "shared" knowledge of the existence of your possession of the item in the process of obtaining it

>>106559093
i'd refer to Pony as a "new model" even though its a tune of SDXL
Anonymous No.106559117 >>106559130
>>106559093
>this 17b model (HuMo) is a finetune of a 14b model (Wan)
you're so fucking retarded
Anonymous No.106559119 >>106559145
>>106559055
https://github.com/Phantom-video/HuMo
>huggingface-cli download Wan-AI/Wan2.1-T2V-1.3B

It's based on Wan, however they did say...

>The model is trained on 97-frame videos at 25 FPS. Generating video longer than 97 frames may degrade the performance. We will provide a new checkpoint for longer generation.

Many have said this before and we still don't have proper long gen apart from tricks, so I wouldn't hold my breath
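
For a sense of what "97 frames at 25 FPS" means in clip length, here is a rough sketch of the arithmetic. The 97-frame / 25 FPS figures come from the quoted README line; the 81-frame / 16 FPS figures for Wan are an assumption based on its commonly used defaults, and `clip_seconds` is a hypothetical helper name, not part of either project.

```python
def clip_seconds(frames: int, fps: float) -> float:
    """Duration in seconds of a clip with the given frame count and frame rate."""
    return frames / fps


# HuMo figures as quoted from its README: 97 frames at 25 FPS.
humo = clip_seconds(97, 25)

# Assumed Wan defaults (not stated in this thread): 81 frames at 16 FPS.
wan = clip_seconds(81, 16)

print(f"HuMo: {humo:.2f} s, Wan (assumed defaults): {wan:.2f} s")
```

Under those assumptions the trained window is a bit under 4 seconds, so the higher frame rate buys smoother motion rather than a longer clip.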
Anonymous No.106559123
Does this error mean anything?
the Gradio thing. I'm using wan2gp 2.2 I2V, the gen is working but i keep seeing that error on each new video. do i just ignore it?
Anonymous No.106559130 >>106559141 >>106559145 >>106559148 >>106559158
>>106559117
You are the absolute retard moron nigger down syndrome mongoloid that cannot even read the model's project before sucking his own nigger dick in forchink.
Anonymous No.106559136
>>106558920
>wan2.1
LOL
Anonymous No.106559141
>>106559130
>debo is so mad
nice
Anonymous No.106559145
>>106559119
we can do more tricks with more framerate though. no need for a 4fps lora, now we can do 6fps which is 50% more information. also the fact that they fixed framerate with a tune at all sounds very very impressive to me but i dont know the technicals that well

>>106559130
damn dude at least i dont write posts like this yet
Anonymous No.106559148
>>106559130
What kind of mental illness is this?
Anonymous No.106559150 >>106559161 >>106559203
>>106556603 (OP)
What image-to-text tools are people using these days to generate prompts from images? Having a hard time finding one that ... actually works.
Anonymous No.106559151
>>106559093
shut up doodoo head
Anonymous No.106559158
>>106559130
gonna run a t2v with this prompt one sec
Anonymous No.106559160 >>106559173
>>106559085
Just because 2.2 exists doesn't mean the new shiny object should get priority. 2.1 is still great. See another picrel, this is the 4th time a new model got in the way of wan so you'll be lucky to see any 2.2 advancements, never mind 2.1
Anonymous No.106559161
>>106559150
Gemini, joycaption
Anonymous No.106559170
ooo-eee-ooo
Anonymous No.106559173 >>106559262
>>106559160
Honestly it's just because these people are so fucking scatter brained.
We should have gotten wan with lora support, and same with qwen.
Anonymous No.106559197
what did he mean by this
Anonymous No.106559203 >>106559234 >>106559624
>>106559150
i would use gemini or qwen for sfw, joycaption for nsfw, and modifying outputs from the aforementioned ones or just writing it yourself if you're doing something illegal
Anonymous No.106559234 >>106559239
>>106559203
Grok 4 can caption nsfw fine, no need for jailbreaking etc...
Anonymous No.106559239 >>106559248
>>106559234
Does it have a local version?
Anonymous No.106559244
Anonymous No.106559248 >>106559253
>>106559239
No but just like Gemini it can be used via API
Anonymous No.106559253 >>106559295
>>106559248
Ok, but I'm not sending my freak porn pics to Elon
Anonymous No.106559262 >>106559293 >>106559312 >>106559827
>>106559173
Save this image when they decide to do all of these new models for reference, kek
Anonymous No.106559293
>>106559262
how do u englihrs?
anyway I fucking love I can gen in 8s THANKS CHINKAMAN, I wonder on the 5000 series how faster it is compared to 4000
Anonymous No.106559295
>>106559253
thats the trade offer
you get captions, he gets your freak porn pics
nothing in this world is free. you literally don't matter btw so unless this is literally child rape (not even real people you know, or real kids, i mean full on nudity or rape) stop making your life more difficult
Anonymous No.106559298 >>106559442
>>106558492
she is cute, her song 01001100 01101111 01110110 01100101 00100000 01000010 01100001 01110100 00100000 01010011 01101111 01110101 01110000 is a banger.
Anonymous No.106559312
>>106559262
sad it's not out yet
Anonymous No.106559355
>>106558485
Check previous thread for anon's wan2gp workflow catbox
Anonymous No.106559378 >>106559418 >>106559519
>>106558920
>open source
>from bytedance
Anonymous No.106559418 >>106559566
>>106559378
I think Alibaba is pushing envelope on these guys, Hunyuan even released a non-distilled version of their model because of them, when they always used to release distilled ones.
Anonymous No.106559439 >>106559483 >>106559775
The high res upscale is killing me on this model. 10 steps takes 5 minutes on my 5090
Anonymous No.106559442 >>106559464
>>106559298
Her headliner
Anonymous No.106559449 >>106559682
i had 0 expectations but am still disappointed
Anonymous No.106559464
>>106559442
I recognize that face.
Anonymous No.106559467 >>106559480 >>106559592 >>106559786
Can the guy with the iryna zarutska lora share it please?
Anonymous No.106559470 >>106559563
Is Chroma1-Flash similar to the old "turbo" SDXL models?
Anonymous No.106559480 >>106559508 >>106559549
>>106559467
how did he even train a lora for that don't you need at least 6 images absolute minimum
Anonymous No.106559483 >>106559500 >>106559501
>>106559439
>heun
>40 steps
>10cfg
not sure if shitposting
Anonymous No.106559500 >>106559511
>>106559483
i believe it because 40 steps used to be a popular flux number
Anonymous No.106559501 >>106559775
>>106559483
You don't explore new models?
Anonymous No.106559508 >>106559535
>>106559480
1 image is the minimum
Anonymous No.106559511 >>106559535 >>106559537
>>106559500
Heun is a lowstep sampler tho. It's actually one of the slowest overall. And how 10cfg doesn't fry his pics is beyond me.
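
A toy sketch of why Heun tends to be slow per step but usable at low step counts: it is a second-order method that evaluates the model twice per step, where Euler evaluates it once. This runs on a simple test ODE (dy/dt = -y), not on a diffusion model; `CountedModel`, `euler_step`, `heun_step` and `integrate` are hypothetical names for illustration, and real sampler implementations in any given UI may differ in detail.

```python
import math


class CountedModel:
    """Toy 'model': dy/dt = -y, counting how many times it is evaluated."""

    def __init__(self):
        self.calls = 0

    def __call__(self, t, y):
        self.calls += 1
        return -y


def euler_step(f, t, y, h):
    # One evaluation per step.
    return y + h * f(t, y)


def heun_step(f, t, y, h):
    # Two evaluations per step: slope at the start, then at the Euler prediction.
    k1 = f(t, y)
    k2 = f(t + h, y + h * k1)
    return y + 0.5 * h * (k1 + k2)


def integrate(step, steps, t_end=1.0, y0=1.0):
    """Integrate from t=0 to t_end; return (final y, number of model evaluations)."""
    f = CountedModel()
    h = t_end / steps
    t, y = 0.0, y0
    for _ in range(steps):
        y = step(f, t, y, h)
        t += h
    return y, f.calls


exact = math.exp(-1.0)
for name, step, steps in [("euler", euler_step, 10),
                          ("heun", heun_step, 10),
                          ("heun", heun_step, 5)]:
    y, calls = integrate(step, steps)
    print(f"{name:5s} steps={steps:2d} evals={calls:2d} abs_err={abs(y - exact):.5f}")
```

On this toy problem Heun at 5 steps uses the same number of evaluations as Euler at 10 and lands closer to the exact answer, which is the usual argument for running it at low step counts instead of 40.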
Anonymous No.106559519 >>106559561
>>106559378
bytedance only open sources slop so the localpiggies have something to eat. Their actual good video model is seedance
Anonymous No.106559535
>>106559508
but won't 1 image just create that one image whenever you use it

>>106559511
>Heun is a lowstep sampler tho. It's actually one of the slowest overall.
oh right samplers can have different speeds i completely forgot about that because i have been using euler for the last 3 years and have literally never found a need or desire to switch from it, even on video

nevermind i now think it was a shitpost
Anonymous No.106559537 >>106559619
>>106559511
I use dynamic thresholding why wouldn't you?
Anonymous No.106559549 >>106559597
>>106559480
Just a quick image search shows at least 8 different images of her in various poses from her social media postings, you could probably find at least 20+ if you really wanted to
Anonymous No.106559561
>>106559519
Such a huge corp like that does not have much to gain from closed source models. In fact they could profit much more from open sourcing them, like Alibaba (providing infrastructure for inference). A shame.
Anonymous No.106559563 >>106559604
>>106559470
It's made for low steps and overbakes easily, so pretty much.
Anonymous No.106559566 >>106559603
>>106559418
>I think Alibaba is pushing envelope on these guys
Alibaba is in the unique position where they are a national champion with ZERO skin in the social media game unlike Tencent/ByteDance so they don't care about conflicts of interest releasing a local image/video model. Since they're the Chinese Amazon with AliCloud they also probably have a fundamental interest in releasing the best models so chinese people/researchers try them out on their cloud
Anonymous No.106559590
Qwen has the potential to beat bytedance but they need to sort out their datasets. hunyuan is complete slop thoughever
Anonymous No.106559592 >>106559608
>>106559467
Iryna Zarutska qwen LoRA https://gofile.io/d/6IZUNy
Anonymous No.106559597
>>106559549
I forgot how easy it is to find images of anyone on the internet because I deliberately checked out from social media as soon as I graduated high school with no regrets


I'm excited for 15 years in the future when the AI automatically creates loras for all faces of the cute girls I pass by on my morning commute from the cameras in my smart glasses while I sleep and injects them into my dreams

oh wait my brain already does that for me
Anonymous No.106559603 >>106559640
>>106559566
>they also probably have a fundamental interest in releasing the best models so chinese people/researchers try them out on their cloud
Then why don't Amazon and Google do the same?

Your argument makes no sense.
Anonymous No.106559604 >>106559775 >>106560085
>>106559563
Seems kinda slopped but I'll keep experimenting with the settings. Really need to be able to run chroma faster to make it useful.
Anonymous No.106559608 >>106559786 >>106559850
>>106559592
this would be a nice moment to share the training data as well since its a relatively simple lora and would be educationally useful
Anonymous No.106559611 >>106559616
Anonymous No.106559616
>>106559611
Lovely
Anonymous No.106559619 >>106559622
>>106559537
ok what does any of this mean?
Anonymous No.106559622
>>106559619
I don't use comfy UI
Anonymous No.106559624 >>106559640
>>106559203
>gemini or qwen for sfw, joycaption for nsfw,
I wouldn't really know what to do to be illegal. Does that make it more likely to happen?

Anyway. Does Joycaption intentionally go out of it's way to skew things to Nsfw? Because overall I like using this with more range and options. But if I feed it pictures of wizards I don't want it constantly telling me they're wearing dicks for hats.
Anonymous No.106559634
Stop being fucking poor holy shit
Anonymous No.106559640 >>106559720
>>106559603
Amazon completely missed AI somehow while focusing on Alexa so they just sell pickaxes, and Google DOES do that shit. Show me one of their open source models as good as their cloud models. And Google plugs their Colab and Compute Cloud every time they mention using and running inference on Gemma

>>106559624
>Does Joycaption intentionally go out of it's way to skew things to Nsfw?
Just try it out. You can kind of direct how you want your images captioned by prompting it
https://huggingface.co/spaces/FiditeNemini/joy-caption-beta-one
Anonymous No.106559648
Anonymous No.106559682
>>106559449
Is that John Carmack
Anonymous No.106559691
Anything new in the i2v world besides what's in this guide?
https://rentry.org/wan22ldgguide
Anonymous No.106559694
Is there a setting for network/neuron dropout in OneTrainer?
Anonymous No.106559713
I see Radiance got its own repo. Did he give up on that?
Anonymous No.106559720 >>106559771 >>106559784
>>106559640
>Amazon completely missed AI somehow
They're raking in money from their AI cloud, if your argument held any water they'd be releasing SOTA models to use
>and Google DOES do that shit
They do practically nothing compared to Alibaba etc, practically all their AI stuff is proprietary, western big tech in general are keeping their AI stuff proprietary

Even the White House has called this out and said they need to release open models else the US will lose influence

You keep bending yourself into a pretzel to defend western companies proprietary AI strategy, it's pathetic

The western big tech SHOULD be the ones providing open free models, instead you need to turn to China for that
Anonymous No.106559723 >>106559822
Anonymous No.106559763
please I need someone to generate a compilation video of different Star Wars characters that are turned indian
so SAAR WARS
pew pew thanks
Anonymous No.106559771 >>106559793
>>106559720
>They're raking in money from their AI cloud,
yeah that's what i said

>They do practically nothing compared to Alibaba etc, practically all their AI stuff is proprietary, western big tech in general are keeping their AI stuff proprietary
china is more locked down in general at the cultural level. since neither of us live in china or work in the chinese tech space, neither of us are authorities to claim one way or the other

the rest of your points i agree with and are just limitations of capitalism. when the only metric that matters is profitability these are the examples of priorities you get. nothing you can do other than nationalizing the company and running into state capitalism issues. you don't get to have your free market cake and eat the state capitalist one too (but actually WE do get the second cake, because china is using AI as part of an asymmetric [dis]information campaign against the West)
Anonymous No.106559775 >>106559861
>>106558069
>>106559604
>>106559501
>>106559439
I can do this in SDXL in 40 seconds wtf its this?
also
>>>/g/adt/
Anonymous No.106559784 >>106559875
>>106559720
nta, western companies are fairly hell bent on making money by any means necessary. I can't see the proposal to release sota models for free going over well in an investor meeting. China also has the added backing/blessing from the government to release these to undermine american tech industry influence. I assume if the AI bubble ever does pop in america, china will probably close up too. Not defending american corpos (fuck them) just giving perspective.
Anonymous No.106559786 >>106559885
>>106558319
ty for pointing that out. All the schizos negged me into not trying it somehow

>>106559467
Iryna Chroma1-HD https://files.catbox.moe/fsvmpl.zip

>>106559608
>a nice moment to share the training data as well
You're welcome for the weights anon
Anonymous No.106559793
>>106559771
>asymmetric [dis]information campaign
? Explain the disinformation here
Anonymous No.106559816
old guy just chillen
Anonymous No.106559821 >>106559828 >>106559854
China is not releasing open models as part of some "asymmetric warfare campaign". This is a meme. They are doing it, mostly, because no one would pay attention to their models otherwise, even in China. In this regard they are behaving a bit wiser than American companies, in that there is no reason to be secretive about the weights if there's no money to be made off of said weights. Deepseek is an exception. They really are believers in open source. Whether they are allowed to remain as such will be interesting to see.
Anonymous No.106559822 >>106559909
>>106559723
is that cringe-acle
Anonymous No.106559827
>>106559262
Hey hey let's go! Kenka suru
Taisetsu na mono wo protect my balls.Let's qwenimage love!
Anonymous No.106559828
>>106559821
>believers in open source
If they were believers the models would be uncensored
Anonymous No.106559850
>>106559608
you can easily find that anywhere if you need examples
Anonymous No.106559854 >>106559880
>>106559821
Are chatgpt and stuff even allowed in china? If not I don't see how they would even care about being noticed, they rule the market there. Not like releasing free models for the rest of the world is helping them earn money lol
Anonymous No.106559855 >>106560400
Fresh when ready

>>106559851
>>106559851
>>106559851
>>106559851
Anonymous No.106559861
>>106559775
You're not a bright one I get it
Anonymous No.106559875
>>106559784
>to release these to undermine american tech industry influence
This I fully agree with, it's not altruism that fuels the China open model releases, but it doesn't matter, what matters is that the western big tech want to lock down AI as a proprietary service, for monetary and control reasons, and China is the one that is propelling open models forward at an impressive pace.

We'll have to see if this can drag western big tech kicking and screaming into releasing SOTA open models (well, not 'OpenAI', they never will), the best outcome would be an open model prestige race between them and China, here's hoping. Sadly it's more likely western big tech will lean on lawmakers to have Chinese open models banned.
Anonymous No.106559880
>>106559854
I know claude is banned (by Anthropic, not China). Not sure about chatgpt. A lot of Chinese use claude anyway, getting around the block in various ways. Chinese companies would like their models to be used by the rest of the world, just like they want any product they make to be broadly popular everywhere if they can help it. They can't really make any money yet so they just release it for free instead as they build their capabilities.
Anonymous No.106559885
>>106559786
Np anon. Remember there will always be anti-Chroma schizos here and there, they do not see Chroma for what it is. While not perfect, it's a model with a lot of potential. Further finetuning could fix remaining imperfections like faces in background in pic rel.
Anonymous No.106559905
>>106558492
>the delta weight mixed with the HD weight is pretty strong at 2k and still preserves prompt following of original.

how do you mix those models? or do you just add a second sampler?
Anonymous No.106559909
>>106559822
yes
Anonymous No.106560085
>>106559604
Yeah, it plastics up the image easily. Some anons have said it's better for anime stuff, but I haven't tried that.
Anonymous No.106560156 >>106560400
Is this the technical/training thread?
Anonymous No.106560400
>>106560156
this is now the previous one but yes >>106559855
Anonymous No.106561775
>>106558886
thx