← Home ← Back to /g/

Thread 106317608

326 posts 210 images /g/
Anonymous No.106317608 >>106317930 >>106318112
/ldg/ - Local Diffusion General
Discussion of Free and Open Source Text-to-Image/Video Models

Prev:>>106312195

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI:https://github.com/comfyanonymous/ComfyUI
SwarmUI:https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic:https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next:https://github.com/vladmandic/sdnext
Wan2GP:https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://github.com/Wan-Video
2.1:https://rentry.org/wan21kjguide
2.2:https://rentry.org/wan22ldgguide
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-HD/tree/main
Training:https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond:https://rentry.org/comfyui_guide_1girl
Tag Explorer:https://tagexplorer.github.io/

>Misc
Local Model Meta:https://rentry.org/localmodelsmeta
Share Metadata:https://catbox.moe|https://litterbox.catbox.moe/
Img2Prompt:https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers:https://stable-diffusion-art.com/samplers/
Txt2Img Plugin:https://github.com/Acly/krita-ai-diffusion
Archive:https://rentry.org/sdg-link
Bakery:https://rentry.org/ldgcollage

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Anonymous No.106317618 >>106317694
Is there a good model/Lora for makeup removal?
Anonymous No.106317650 >>106317815
>>106317610
Uncropping/reframing (ie outpainting with natural language) is one of the selling points of the model, and it's normal to expect a generative model to be able to generate new content. No one is expecting it to know the face of the original model, but it should have a plausible human head.
Anonymous No.106317671
Poor Major Kusanagi turned into a pygmy by Qwen-Image-Edit attempting to fill in the rest of her body.
Anonymous No.106317685 >>106319521
Anonymous No.106317694
>>106317618
https://civitai.com/models/1859952
Anonymous No.106317695 >>106317745
the man in the blue shirt is standing in front of a suit store. The store has a sign that says "BIG GUYS" in neon text. The store windows have business suits on display. The store is in New York City. Keep his facial expression and face the same.

it was just a low quality jpg headshot so it did pretty well.
Anonymous No.106317745
>>106317695
the man in the blue shirt is sitting on a Boeing passenger jet drinking a beer. Keep his facial expression and face the same.
Anonymous No.106317747
Anonymous No.106317773
a big beer for you.
Anonymous No.106317815 >>106318056
>>106317650
>plausible human head
I only accept them if they have buttchins
Anonymous No.106317819 >>106317824 >>106318724
Blessed thread of frenship
Anonymous No.106317824 >>106318724
>>106317819
syke
Anonymous No.106317830
Anonymous No.106317831
>>106316876
>is there any point in wan in using cfg > 1 for the low noise model? It seems to be that once the high noise has done the basic job, cfg can be kept at 1 for a faster result.
I just did that, went in i2v with cfg 5 then 1 in low noise, and the result is perfectly fine while it was way faster.
Anonymous No.106317847 >>106318208
Anonymous No.106317856 >>106318007 >>106318125
Anonymous No.106317878
A journey of 35 steps begins with a single iteration.
Anonymous No.106317892 >>106317985
I asked in the old thread but runpod won’t cancel you for genning goon material right? Google colab used to. Can’t afford a 50 card but runpod credits maybe. And it’s not even about illegal shit chuds ChatGPT gets mad if I even talk about titties, big tech is hypersensitive about boobs
Anonymous No.106317894
the man in the blue shirt is wearing a sombrero and is eating tacos in a mexican taco restaurant. keep his expression the same.
Anonymous No.106317930 >>106317989 >>106318368
>>106317608 (OP)
>>106302341
>Prompt executed in 10.95 seconds

chroma 49 lora trained with TREAD can go down to CFG 1 no problem. This is 20 steps OSS.

Add this branch for support https://github.com/feffy380/diffusion-pipe/tree/tread
Anonymous No.106317935
Anonymous No.106317941
Anonymous No.106317958 >>106317972 >>106317978 >>106318105
I just started messing around with Qwen image edit and I'm having a lot of difficulty making small edits without changing the whole image. Kontext (supposedly) did best with explicit instructions to change nothing other than what was requested, but I'm not seeing that make much of a difference with Qwen. The handful of examples they provided didn't include that, either. Has anybody figured out the trick to keeping Qwen from fucking up the whole image and turning everybody slightly Chinese?
Anonymous No.106317972
>>106317958
>and turning everybody slightly Chinese?
lmao
Anonymous No.106317978 >>106318039
>>106317958
No, it seems somewhat random so far. Sometimes it keeps parts of a body exactly the same while altering others, resulting in weird proportions. Other times it overhauls the whole image unnecessarily. Telling it "not" to do things has the classic problem, and negative prompt causes rng results. More testing definitely needed.
Anonymous No.106317985
>>106317892
dont use runpod use vast ai its cheaper and datacenters dont care unless you're committing real crimes off that server
Anonymous No.106317989 >>106318024 >>106318368
>>106317930
What is TREAD ? Is there a paper somewhere ?
Anonymous No.106318004
https://files.catbox.moe/k5wea4.json

multi image qwen edit, works like 2 image kontext I guess?
Anonymous No.106318007
>>106317856
Lol
Anonymous No.106318009 >>106318013
Anonymous No.106318013
>>106318009
wut
Anonymous No.106318024 >>106318057 >>106318236
>>106317989
>TREAD: Token Routing for Efficient Architecture-agnostic Diffusion Training
https://arxiv.org/abs/2501.04765
Anonymous No.106318039
>>106317978
Too bad. I heard you could mitigate it by encoding the image and passing it as the latent but I don't really see a difference. What are your impressions on the lightning lora? I've only just started using it and so far the quality drop isn't very noticeable for me.
Anonymous No.106318056
>>106317815
>buttchins
I always wondered if people outside of here noticed it too for flux gens
Anonymous No.106318057 >>106318075 >>106318236
>>106318024
That sounds unbelievably good, and the resulting lora just works with programs like Comfy ?
Anonymous No.106318075 >>106318082
>>106318057
nobody trains with comfy except retarded redditors
Anonymous No.106318079
GenJam when?
Anonymous No.106318082 >>106318095
>>106318075
I'm talking about inference, of the resulting lora
Anonymous No.106318095
>>106318082
probably in a diffusers format so yeah most likely or requires little change
Anonymous No.106318105 >>106318145
>>106317958

It also seems to rubberify everything. It was either intentional to keep the outputs from looking too believable, or it was cheaper/easier to train it on all slop.
Anonymous No.106318112
>>106317608 (OP)
local nano banana when
Anonymous No.106318125
>>106317856
Earthworm Jim??
Anonymous No.106318145 >>106318256
>>106318105
Yeah, I haven't used a lot of base qwen because I noticed how rubbery the people looked. I suspect that is just carrying over to the image edit version. I'd be more inclined to believe it was due to too many extreme filters (asians love them) and slop in the dataset.

It's aggravating because sometimes it really will keep the while image the same and only change the detail you wanted. It's just that the odds of that happening and the detail also looking good are very slim.
Anonymous No.106318157
Anonymous No.106318169 >>106318191 >>106318219
remove the hair from the anime girl and give her short blonde hair.
Anonymous No.106318191
>>106318169
QEI seems good at some narrow fixed-function tasks. Replacing hair/clothes/objects is one of them. Prompting it is like searching for unknown activation keywords.
Anonymous No.106318208
>>106317847
>asian woman
nuclear grade anti-white weapon of mass (genetic) destruction right there
Anonymous No.106318219
>>106318169
remove the hair from the anime girl and give her red drill hair that is curly.

not quite drills but cute.
Anonymous No.106318236
>>106318057
>>106318024 (You)
>just works with programs like Comfy ?
Yup
Anonymous No.106318254 >>106318262
the japanese girl is holding a sign that says "LDG", and is standing in front of several desktop computers in a computer lab.

actually impressive it got the swimsuit cutout right based on a sliver of skin.
Anonymous No.106318256
>>106318145

It's weird too since qwen almost has the opposite problem of following the prompt *too* closely. The seed variety is non-existent.
Anonymous No.106318262 >>106318295
>>106318254
the japanese girl is wearing a white racing suit with "LDG' on the front of it, while standing in a garage at a race track with a sports car in it.
Anonymous No.106318295
>>106318262
diff girl, slightly diff prompt (unzipped suit)

the japanese girl with large breasts is wearing an unzipped white racing suit with "LDG' on the front of it, while standing in a garage at a race track with a sports car in it. keep her expression the same.
Anonymous No.106318299 >>106318397
Anonymous No.106318309
Add Hatsune Miku between and behind the 2 existing characters matching the simple style

not bad first attempt at qwen edit
Anonymous No.106318332 >>106318574
>candyhorn
Anonymous No.106318335 >>106318418
Anonymous No.106318368 >>106318388 >>106318401 >>106320054
qwen image hilariously doesn't know what Unitree robots look like, but with qwen edit it doesn't matter. This shit is mindblowing. fuck kontext btw

>>106317930
>>106317989
with TREAD and EQ-VAE, did we just get a 259x speedup in training that also happens to unfuck anatomy??

how many more 10x speedups do we need before we can bake our own unslopped Qwen Image at home from scratch?
Anonymous No.106318388 >>106320054
>>106318368
>with TREAD and EQ-VAE, did we just get a 259x speedup in training that also happens to unfuck anatomy??
I sounds to good to be true, TREAD alone sounds insanely good, praying it's not another snake oil...
Anonymous No.106318397 >>106318885
>>106318299
GOATed
Anonymous No.106318401 >>106318614
>>106318368

The only thing mindblowing is the blatant shilling.
Anonymous No.106318404 >>106318423
give the woman a black leather jacket:
Anonymous No.106318418
>>106318335
>No gag reflex
Superpower for women
Anonymous No.106318423 >>106318450
>>106318404
the woman is wearing a shiny black latex outfit. keep her expression the same.

yep, better results than kontext for sure.
Anonymous No.106318450
>>106318423
this time lets try a multi edit prompt:

the woman is wearing a white dress and is painting a picture of anime style Miku Hatsune, in an art gallery. keep her expression the same.
Anonymous No.106318451 >>106318471 >>106318523 >>106318614
also chromas pixel space experiment seems to be working

we might be training our own base models on consumer hardware soon
Anonymous No.106318459
ah good, quen edit is good at isolating smaller elements too.

replace the man on the left with Miku Hatsune in the same artstyle.
Anonymous No.106318471 >>106318516
>>106318451
Explain this to me like I am retarded
Anonymous No.106318498 >>106318679
What's the best sampler to use with Chroma? Heun?
Anonymous No.106318516 >>106318534 >>106318619
>>106318471
you no longer need a vae, therefore you don't lose quality of having to shittily reconstruct the image by compressing / decompressing them, also it will be much much much faster / cheaper to train due to not having to train a vae

https://arxiv.org/abs/2507.23268
Anonymous No.106318523
>>106318451
Wake me up when it's finished. My hope was already spent on Ostris's experiments.
Anonymous No.106318534 >>106318546
>>106318516
Now explain to me like I am a loli
Anonymous No.106318536
Are there nodes that allow me to store the text encodings locally and reload them so that I only have to load the diffusion model and the precalculated encodings? This would be particularly useful for the edit models, where I often use the same prompts, and especially for qwen image :3
Anonymous No.106318541 >>106318614
in comfy ui, can I queue up multiple prompts in a row and change the image, prompt text, and so on? like, will it remember the changes for each new prompt in the queue? or does it just go off of the most recent settings?
i wanna i2v a batch of images
Anonymous No.106318546
>>106318534
take that shit to /adt/ faggot, we're team shota here
Anonymous No.106318553
Anonymous No.106318574 >>106318580 >>106318603
>>106318332
>he's actually making pony into saas
Anonymous No.106318580
>>106318574
they always said they would make it paid only for like a month before releasing it
Anonymous No.106318596
hopefully pixel space and thread work out. That + wan + nunchunku fp4 training will be the golden age.
Anonymous No.106318603
>>106318574
It actually blows my mind that people still shill for pony after... shit how long has it been?
Anonymous No.106318612
the only partially converged images i care about are from bigma anon desu
Anonymous No.106318614
>>106318401
I wish I was getting Xi bux, idk how. Any Alibaba shills ITT to explain how I can join the PLA shill team?

>>106318451
isn't this only for inference, or can this work for training too?

>>106318541
yeah it has a job queue
Anonymous No.106318619
>>106318516
Wait no vae means infinite img2img passes right?

Also 14B takes way too long to gen anything lol, and it's not even that much better aside from prompt adherence
Anonymous No.106318634
Just did a few tests. Is there any advantage to using the fp16 of qwen-edit? For image generation it's better, but for editing seems like fp8 + lightning 8 is good enough.
Anonymous No.106318639 >>106318648 >>106318671
replace the character with green hair, with Miku Hatsune.

neat, it replaced Terra just fine, even though she's a small sprite.
Anonymous No.106318648 >>106318655
>>106318639
Can you replace "Reflect ring" with giant dildo?
Anonymous No.106318655
>>106318648
not on a blue board
Anonymous No.106318671
>>106318639
replace the character with green hair, with Miku Hatsune as a small pixelized sprite in the same proportions.

there we go
Anonymous No.106318679
>>106318498

Euler is usually fine but definitely is inclined to switching between realism/illustration depending on the prompt. ddpm and gradient seem to favor realism alongside huen, but the latter is much slower.
Anonymous No.106318698 >>106318704
replace the pink hair girl with Miku Hatsune.

did better than kontext did when I tried this, also the source image is super low res (like 250 pixels wide)
Anonymous No.106318704 >>106318725 >>106318757
>>106318698
Ask it to remaster / upscale the image (more useful than garbage miku tests)
STATLER + (or) Waldorf & Company. No.106318724
>>106317819
>>106317824
>the duality of man
bwahuehuehue
Anonymous No.106318725 >>106318762 >>106318922
>>106318704
Everything is more useful than this retard’s β€œMiku tests”, that’s just his gay excuse for spamming his shitty fetish character
Anonymous No.106318742 >>106318757
pretty good

just as a note, kontext might be better at copying text styles, but not as good at manipulating images or figures. I couldnt change the cyberpunk logo into the same style text easily like kontext did.
Anonymous No.106318757 >>106318768
>>106318742
Will you do it or not? >>106318704
No one cares the model has ability to replace people with Miku, we already understood that given your spam
Do actual tests people want to see instead
Anonymous No.106318762
>>106318725
HOT. ;)
Anonymous No.106318768
>>106318757
did "upscale the image" but im using the speed lora so it may not work as intended

the original is very potato quality. (200 pixels)
Anonymous No.106318771 >>106318807
>Getting austically butthurt over Miku
Typical chroma poster.
Anonymous No.106318772 >>106318784 >>106318825
the babby who claps and MUST share every output regardless if it's redundant info has something important to tell us while sharing the most boring shit possible on par with emad dog and sunglasses tier garbage
Anonymous No.106318784 >>106318816
>>106318772
None of these threads ever reach the image cap. Why are you acting like it's some valuable resource?
Anonymous No.106318800 >>106318820
fox with hat :D :D
Anonymous No.106318807 >>106318837
>>106318771
I am one of the anons who called out the Miku spam and in fact I am training a Qwen lora as we speak, lol
Anonymous No.106318816
>>106318784
we used to have standards. the collage isn't something to strive for but a personal goal to contribute something worthy of it. doing small experiments is fine but every single little thing as a separate post and boring outputs? nigga compile all your findings in a post or two. making the thread a realtime blog of trial and error is a buzz kill
Anonymous No.106318820
>>106318800
Punting this shit head across the neighborhood
Anonymous No.106318823
Gayditor pajeets are seething again fot both, qwen chads and miku, kek
Anonymous No.106318825
>>106318772
A nogen acting like he's a janitor for this place is by far the biggest loser in the thread, btw
Anonymous No.106318827 >>106318848
change the image to a pencil drawing:
Anonymous No.106318837 >>106318868
>>106318807
>I am training a Qwen lora
I have also trained a Qwen lora. Being able to run a training script doesn't give you magical authority over who posts what in a thread that never even hits the image cap.
Anonymous No.106318841
man i WONDER where all this seethe came from. just outta nowhere, huh.
Anonymous No.106318848
>>106318827
Looks more like a bad filter desu.
Anonymous No.106318856 >>106318860 >>106319506
generate a 3d polygon version of the character.

cool, got a low poly 3d game type of 2B.
Anonymous No.106318860 >>106318873 >>106319506
>>106318856
this time with polygon omitted: just 3D
Anonymous No.106318868 >>106318880 >>106318887
>>106318837
Moving goalposts after you said people against Mikutrannyposting are "chroma posters"?

This shit is getting more obnoxious than the Rocketanon OC spam, and that says something. The Rocketanon at least had some originality and made original characters (even if they all used the same prompt and pose), lol
Anonymous No.106318873 >>106319506
>>106318860
show the character from behind:
Anonymous No.106318880
>>106318868
I never said you weren't a chroma poster?
Anonymous No.106318885
>>106318397
ty
Anonymous No.106318887 >>106318900
>>106318868
Honestly while mikuposting is gay, it’s not nearly as bad as the faggot on Sdg who uses that ugly retarded witch character as his avatar. He’s even less interesting or original than mikuposter, and that’s saying something.
Anonymous No.106318892
how organic
Anonymous No.106318899 >>106318910
the green cartoon frog is standing on a beach and holding a glass of orange juice. the frog is wearing a blue shirt and red shorts.

qwen edit can do pepes from a source too.
Anonymous No.106318900 >>106319014
>>106318887
This is a Miku hobby wanted or not
Anonymous No.106318903 >>106318918 >>106318949
I went ahead and counted 7. SEVEN images in this thread with miku in it. Truly deranged and out of control. Not an overreaction personal autistic vendetta at all to call out the Mikus.
Anonymous No.106318910
>>106318899
the green cartoon frog is on a fishing boat near the beach, holding a fishing rod. the frog is wearing a blue shirt and red shorts.
Anonymous No.106318914
in comfyui it looks like you can just insert loras into the text encoder, but it has no weights. what method do you use to influence weights on lora tags that are in the text encode node?
Anonymous No.106318918
>>106318903
15% seems pretty high. You would want to lose 15% or your cock, would you?
Anonymous No.106318922
>>106318725
HOT.
Anonymous No.106318926
Anonymous No.106318932
Anonymous No.106318933 >>106318976
the green cartoon frog is on a fishing boat near the beach, holding a fishing rod. the frog is wearing a blue shirt and red shorts, and white running shoes.

actually did really well considering the source is just a face and 1 hand, and has no other anatomy visible.
Anonymous No.106318949 >>106318968 >>106318985
>>106318903
are you new?
miku is literally everywhere in all llm and ai threads in g
Anonymous No.106318968
>>106318949
thanks to one or two severely mentally ill faggots
Anonymous No.106318970
I like miku stop being a fag
Anonymous No.106318976 >>106318984
>>106318933
the green cartoon frog is standing in a living room. the living room has a large window showing a large explosion outside and tall skyscrapers on fire, in New York City. the frog is wearing a blue shirt and red shorts, and white running shoes. the frog has his hand on his chin.

too smug to care.
Anonymous No.106318982
MIKU RULES!
GO MIKU GO!!
Anonymous No.106318984
>>106318976
Anonymous No.106318985
>>106318949
I can't tell if this is an endorsement or denouncement of the Mikus. Miku has been in all the AI threads since they began.
Anonymous No.106318988
Literally anything is better than 1girl giant boob giant ass slop. If anything we need more Miku.
Anonymous No.106318991
WITHOUT MIKU LDG WOULD BE NOTHING
I ONLY COME HERE TO SAVE RARE MIKU
Anonymous No.106318993 >>106319002 >>106319020
is qwen edit worth my time as 16gb vramlet?
Anonymous No.106319002
>>106318993
I am using Q8 with a 4080 just fine right now, so yes
Anonymous No.106319004
there is nothing else that I care about more in this entire world than miku
Anonymous No.106319007
dude, honestly, miku ROCKS!
Anonymous No.106319012
i LOVE migu
Anonymous No.106319014
>>106318900
HELL YA !
Anonymous No.106319019
I think the Miku hater should just leave desu.
Anonymous No.106319020
>>106318993
u not gonna post the hot-glue version???
Anonymous No.106319028
>singular miku hater theory
Anonymous No.106319033
[Chorus: Pamela Long (w/ The Notorious M.i.K.U.)]
MiKu, MiKU, MiGU, can't you see?
Sometimes your words just hypnotize me
And I just love your flashy ways
Guess that's why they broke, and you're so paid
MiGU, MiKU, MIKU, can't you see? (Uh-huh)
Sometimes your words just hypnotize me (Hypnotize)
And I just love your flashy ways (Uh-huh)
Guess that's why they broke, and you're so paid (Ha)
Anonymous No.106319040 >>106319061 >>106319902
change the location to a desert with a large forest in the background, keep the character in green in the image.

neat, even the new stuff remains in the pixelized style from the source (kings quest)
Anonymous No.106319046 >>106319079 >>106319328
is gwen edit good?
I heard people said it's mid frfr
Anonymous No.106319049
>MIKU MIKU MIKU MIKU MIKU
Anonymous No.106319056
Now that I've stopped trying to get Qwen to make a minor pose adjustment and started using it for lighting and background changes I've decided it's pretty great. I just wish it worked for minor pose adjustments.
Anonymous No.106319061 >>106319095 >>106319902
>>106319040
change the location to a desert with a spooky forest with a haunted house in the background, keep the character in green in the image.

even with a single screenshot you can make some cool pixel art bgs with this edit model desu, so many applications
Anonymous No.106319069
Migger is a big girl
[spoiler]4U[/spoiler]
Anonymous No.106319071 >>106319080
Beef: squashed
Anonymous No.106319078
will miku let me fuck her small tits?
Anonymous No.106319079 >>106319237
>>106319046
it's not as fun as qwen image, it doesn't obey prompts as closely. slower too. seems to be the best of its kind though, better than kontext or omnigen2. LORAs will be crazy.
Anonymous No.106319080
>>106319071
should have used the bulge groe lora on him
Anonymous No.106319081
anon video request
a pov video somewhere in a supermarket where the viewer puts on glasses, after which the scene is stylized in anime style and karens are catgirls now.

its as if he is using a real-time diffusion model embed in his glasses
Anonymous No.106319093
ooo-eee-ooo
Anonymous No.106319095 >>106319109 >>106319902
>>106319061
change the location to a cyberpunk city with neon lights at night, the character in green is walking into a store on the left with a sign saying "LDG mats" over the door.

almost
Anonymous No.106319109 >>106319114 >>106319902
>>106319095
change the location to a bridge very high over the ocean, with a forest below it, the character in green is standing on the bridge.

I swear i've seen an island similar to this in some lucasarts game.
Anonymous No.106319114
>>106319109
>lucasarts
more like lucasFARTS LOL
Anonymous No.106319119 >>106319123 >>106319129 >>106319139 >>106319266
I just don't understand how you can be alive today and not completely amazed at this technology, let alone be anti AI
Anonymous No.106319123
>>106319119
people who went to school for art are mad technology makes art accessible to more people.
Anonymous No.106319128 >>106319144
Would you listen to Mon Papy?
Anonymous No.106319129
>>106319119
Political ideology and brainwashing really. Certain political group has been programmed to hate technology as part of anti-corporation, anti-private company, thing. This push happened early 2010s when journos decided that they will only push negative stories about tech companies.
Anonymous No.106319139
>>106319119
>be anti AI
Anti AI people are pretty deranged. There are legitimate arguments to had about them replacing entry level work but even those arguments seem to come second to the idea that people can generate images and videos with minimal talent. Its a complete misplacement of priorities. Look in places like game development. When an indie solo developer asks what they can do instead of using AI people pose the most insane solutions.
Anonymous No.106319144 >>106319181
>>106319128
they twan?
oh my goodness!
Anonymous No.106319150 >>106319191
change the location to a large castle floating in the sky high above an island with a large city, the character in green is in the large city looking up at the castle.

you can conceptualize so much stuff in a few seconds, in any style, technology is pretty cool.
Anonymous No.106319181
>>106319144
Starring famous pop-star Eveny Irontiar!
Anonymous No.106319191
>>106319150
change the location to a grass field near a medieval castle, where a fire breathing dragon is shooting fire at hundreds of medieval soldiers who are running towards the dragon. The character in green is watching the battle unfold nearby.

what makes AI neat is you can do all this stuff just with ideas and sometimes (with edit models) a source. with qwen edit for example my lora is basically the old game screenshot.
Anonymous No.106319192
>The woman has lowered her feet and is now wearing shoes.
Anonymous No.106319202
Can we save and load text embeddings so that we don't have to constantly load the text model? I would prefer to create a series of prompt embeddings and switch between them rather than always having to load Qwen 2.5.
Especially with edit models, at some point you have a few that you always use.
Anonymous No.106319222
The woman is now turned and sitting at her computer. Her skirt is longer.
Anonymous No.106319234 >>106319246 >>106319452
Why did /adt/ ultimately succeed?
Anonymous No.106319237
>>106319079
I just tried web version. And it changes the face too much. Is that normal?
Anonymous No.106319238
how do i generate breast expansion ai videos?
Anonymous No.106319242
What is your most over-engineered text to image workflow, and what does it accomplish?
Anonymous No.106319246 >>106319257 >>106319270
>>106319234
Same reason free to play online games do.
Catered to vramlets.
Anonymous No.106319257 >>106319267 >>106319270
>>106319246
So why have you been seething about it?
Anonymous No.106319266 >>106319280 >>106319309
>>106319119
most anti-AI people are shitty artists who can't get some bucks for their ugly scribbles anymore
Anonymous No.106319267 >>106319274
>>106319257
Me? Seething? I tell We've had like zero SDXL outputs since the thread was made. I'm estatic.
γƒγ‚Ήγƒˆγ‚«γƒΌγƒ‰ !!FH+LSJVkIY9 No.106319270
>>106319246
>>106319257
you KNOW why ;3
Anonymous No.106319274 >>106319291
>>106319267
nah you have been seething
Anonymous No.106319280 >>106319305
>>106319266
What pisses me off about them their supposed entitlement to my money. Like if I generate an image for a fraction of cent on my own GPU I'm actually stealing an entire commission from the artist who saw it directly.
Anonymous No.106319291 >>106319297
>>106319274
I questioned it initially, but I came around to the idea when I saw the benefits. It wasn't seething.
Anonymous No.106319297 >>106319307
>>106319291
You are seething this very second, as much as you're trying to be discreet.
Anonymous No.106319305 >>106319322
>>106319280
Anonymous No.106319307 >>106319337
>>106319297
I think you'll find I'm as cool an early October evening breeze with the sound of crickets chirping peacefully. Completely calm and content.
Anonymous No.106319309 >>106319377 >>106319423
>>106319266
https://rdrama.net/post/353213/oh-as-an-artist-you-are
Anonymous No.106319322 >>106319359
>>106319305
Who actually pays for this? I must know. Like what function does paying 20 bucks for an image in that style serve? You can't even jerk off to it because they want do nsfw.
Anonymous No.106319328
>>106319046
mid is about right.
It does specific things well (the ones they promote).
It seems to do very poorly on requests outside its training set, or if you phrase things too differently from the training captions.
I think it has the potential to be good.
Anonymous No.106319331
So last night, I plugged Qwen edit into the kontext workflow and changed the necessary nodes. I thought it was kind of shit.
After trying it again today. I actually think it's pretty good.
Anonymous No.106319337
>>106319307
>I think you'll find I'm as cool
Nah, you're clearly seething.
Anonymous No.106319353
difference between 4 and 8 step lora, quality-wise?
Anonymous No.106319359 >>106319379
>>106319322
they approach other twittards/redditors in a 'friendly way' and then keep pestering these people to buy their shitty art, if you don't, they start major drama and throw all kinds of issues in their lives onto you so you can feel pity
Anonymous No.106319377 >>106319400 >>106319422 >>106319570
>>106319309
stuff like this annoys me, I can do traditional art better than this, they have limited themselves by hating AI instead of wanting to transform what they can do already with inpainting or controlnet work
Anonymous No.106319379 >>106319410
>>106319359
So they're the twitter equivalent of the hobo that cleans your windshield at a stop light?
Anonymous No.106319387
I think kontext is better at copying fonts/text styles/text edits. But maybe my settings with the lora aren't ideal yet. it could copy the cyberpunk font style fine, this can't so far.
Anonymous No.106319400 >>106319414
>>106319377
god you know this person goes on endlessly about how their art was stolen to train AI despite it's only use being "bad quality, child's drawing, score 0"
Anonymous No.106319410
>>106319379
exactly, kek
Anonymous No.106319414
>>106319400
It was Greg Rutkowski
Anonymous No.106319419
I'm in the media industry and everyone loves ML tools. It's really only these tumblr/discord friends that complain so much about the artists' plight.
STATLER + (or) Waldorf & Company. No.106319422
>>106319377
BEAHAGHAHAH
Anonymous No.106319423
>>106319309
The fuck is that website? But someone made a good point. A good portion of people who seethe at AI art are often people who do fan art or iterations of the work of others. I can see why AI would bother them as it kind of horns itself in on their niche.
Anonymous No.106319427 >>106319464
>load up Qwen edit
>prompt: her arms are raised above her head
>she is built like a linebacker in every output
I just want cute pits.
Anonymous No.106319432 >>106319447 >>106319455 >>106319486 >>106319512
wow qwen edit is freaking slow. kontext nunchaku >>>
STATLER + (or) Waldorf & Company. No.106319447 >>106319451 >>106319486
>>106319432
HOLY FUCKING SLOPPA
Anonymous No.106319451
>>106319447
oh I know
Anonymous No.106319452 >>106319462
>>106319234
>2 day old thread
Anonymous No.106319455
>>106319432
use the 8 step lora and 8 steps/cfg 1
Anonymous No.106319462 >>106319465
>>106319452
race race RACE!
TALK TO YOURSELF UNTIL THE BUMP LIMIT IS HIT!!!!!!!!!!
Anonymous No.106319464 >>106319482
>>106319427
Edit the image canvas size first to give it a taller aspect ratio. The QIE model tries too hard to fill the canvas and will sometimes squish the proportions of the subject / turn them into dwarves if it fills the frame better.
Anonymous No.106319465 >>106319477
>>106319462
you need help non fren
Anonymous No.106319477
>>106319465
just let him seethe
Anonymous No.106319482
>>106319464
I'll give that a shot.
γƒγ‚Ήγƒˆγ‚«γƒΌγƒ‰ !!FH+LSJVkIY9 No.106319486 >>106319491
>>106319447
rude. i liked goosebumps in the mid 90s ;c
>>106319432
Statler & Waldorf No.106319491 >>106319498
>>106319486
only problem is you were 27 and ESL in 1997 (still are)
brahafahaha
Anonymous No.106319493
Why is lilbro changing his name and replying to himself though
γƒγ‚Ήγƒˆγ‚«γƒΌγƒ‰ !!FH+LSJVkIY9 No.106319498
>>106319491
rude.
Anonymous No.106319506
>>106318856
>>106318860
>>106318873
That's pretty good.
Anonymous No.106319508 >>106319533 >>106319584
replace the text "cyberpunk" with "cyberjank".

didnt work on qwen edit. BUT kontext does it. 20 steps cfg 1.0.

however, qwen is better at non-text edits and can do more overall seemingly. so unless you want edits that match the text style, qwen is better? unless someone can replicate this with qwen edit and post their settings.
Anonymous No.106319512
>>106319432
awktually once I freed some vram qwen came through with this slop much quicker. less realistic and baby face but least hands better.
Anonymous No.106319521 >>106319531
>>106317685
you could gen it better by just cropping out the art, upscaling it, then re-adding the frame. You wouldn't even have an overhead because you're currently animating the entire card
Anonymous No.106319531 >>106319608
>>106319521
Could probably all be done in comfyui too. comfyui has been a surprisingly good simple video editor for me.
Anonymous No.106319533 >>106319625
>>106319508
source image here.

can anyone replicate the text on qwen edit?
Anonymous No.106319544 >>106319549
should cfg be 1.0 with the 8 step lora? or 2.5?
Anonymous No.106319549
>>106319544
1
Anonymous No.106319554 >>106319560 >>106319569 >>106319583 >>106319619
Wake me up when v51 is out. Censored models are useless to me.
Anonymous No.106319560
>>106319554
Seems like it'll be relatively soon.
Anonymous No.106319569 >>106319619
>>106319554
JUST
TWO
MORE
EPOCHS
Anonymous No.106319570
>>106319377
Bewilderment over consent amuses me. They should know the guys using AI wouldn't even ask for sexual consent let alone training on someone's art.
Anonymous No.106319583
>>106319554
this is your brain on PORN
Anonymous No.106319584
>>106319508
do cybernigger
I remember this used to be a fun game on /v/
Anonymous No.106319608
>>106319531
Definitely, the only thing I wish I had in comfy was the ability to draw arbitrary colors onto images for color based region masking
Anonymous No.106319619 >>106319633 >>106319634 >>106319716
>>106319554
>>106319569
Seems like we are entering the peak schizo arc with how we are getting quants of random debug versions.
Anonymous No.106319625 >>106319639 >>106319707
>>106319533
I tried and got nothing. I don't think it can recognize the "cyberpunk" title or any text that deviates too much from common fonts.
Anonymous No.106319633
>>106319619
Coming soon, schizo merge with v48 and v50
Anonymous No.106319634
>>106319619

Silveroxides picking up the slack.
Anonymous No.106319639 >>106319703
>>106319625
miku anon would have got it
Anonymous No.106319655 >>106319665
the japanese girl with large breasts is standing up. keep her expression the same.

even the watermark is intact (could prompt it out but still)
Anonymous No.106319665 >>106319685 >>106319713
>>106319655
the japanese girl with large breasts is standing up. The view is from behind her, and she is showing her ass and looking back at the camera. keep her expression the same.

ass is the easy way to get figures to turn. again, qwen wins this round
Anonymous No.106319685
>>106319665
holy slop hair
Anonymous No.106319703
>>106319639
MIKU MILKY MIKU!
WHERE R U
Anonymous No.106319707
>>106319625
I tried to. I think you're right about cyberpunk being too stylized to read.
Anonymous No.106319713 >>106319747 >>106319842
>>106319665
the japanese girl is standing up and wearing a business suit and short black skirt, with black high heels. keep her expression the same.

see, no hobbit women, scaling actually works better with quen edit.
Anonymous No.106319716 >>106319734
>>106319619

Love all the work that has gone into this but just a tiny bit more transparency would make a world of difference.
Anonymous No.106319734 >>106319854
>>106319716
I'm 90% sure the guy doesn't document anything he does and just feels out settings and hits trains.
Anonymous No.106319747 >>106319752
>>106319713
if only it was still the same girl...
Anonymous No.106319752
>>106319747
if you really want 1:1 in the case it's off, you could just use reactor to faceswap.
Anonymous No.106319774
the japanese woman is wearing a white bikini.

pretty neat that it handled multiple layers/objects without much issue. no need to change it, but just a test.
Anonymous No.106319775 >>106319782
Anonymous No.106319782 >>106319785
>>106319775
which model and what settings? I could only do it with kontext.
Anonymous No.106319785 >>106319794
>>106319782
Qwen.
>Replace the yellow text in the center of the image with "CyberMiku"
Anonymous No.106319794 >>106319798
>>106319785
how many steps? lora or no lora?
Anonymous No.106319797 >>106319855 >>106319873
Anonymous No.106319798 >>106319804 >>106319836
>>106319794
20 steps cfg 3. Q8. No LoRA.
Anonymous No.106319804
>>106319798
ok, thanks
Anonymous No.106319809 >>106319845
damnit i only noticed the 4 toes after posting
Anonymous No.106319836 >>106319847
>>106319798
default comfy workflow? I got this with 20 steps 2.5cfg:
Anonymous No.106319837 >>106319893
anyone use an A6000 Pro for genning?
Anonymous No.106319842 >>106319850
>>106319713
You get midgets from QIE if your aspect ratio is too short.
Here's a midget with a 1:1 ratio image.
Anonymous No.106319845
>>106319809
gettin all wet over hea
Anonymous No.106319846 >>106319852
I've never used this shit before. How can I create nudes/deepfakes on a local client with no internet access?
Anonymous No.106319847 >>106319868
>>106319836
https://files.catbox.moe/cx435s.png
Anonymous No.106319850
>>106319842
Here's a not-midget with the canvas pre-scaled to 3:2.
Anonymous No.106319852 >>106319867
>>106319846
Glowie glowie I don't knowie. Go away 'cause we won't showie.
Anonymous No.106319854
>>106319734

Trying to get information out of lode goes like this

>testing
>in progress
>*incomprehensible jargon*
>like lmao
>*goes afk*
Anonymous No.106319855
>>106319797
what is this weird bullshit
do art the clown walking in on gen z boss plzzz
Anonymous No.106319867 >>106319887
>>106319852
I'm not a glownigger, I just want to create nudes of my coworkers and friends without having to upload the images to sketchy sites
Anonymous No.106319868 >>106319876 >>106319896
>>106319847
ah, im using the scaled clip, maybe thats causing the text issue.

yours has: quen_2.5_vl_7b

mine was using: qwen_2.5_vl_7b_fp8_scaled.safetensors
Anonymous No.106319873 >>106320034
>>106319797
I love your plowsows, I just wish you would do them without the weird creep shit angle you’re set on
Anonymous No.106319876
>>106319868
I'm telling you boys. Never skimp on the text encoder.
Anonymous No.106319887 >>106319892
>>106319867
I told you, (not) glowie. I don't know. Go away.
Anonymous No.106319892
>>106319887
what a faggot
Anonymous No.106319893
>>106319837
at first i thought you said gaming instead of genning and was going to ask why the fuck you did that
Anonymous No.106319896
>>106319868
but with scaled in that same workflow I get this

where do you even get the non scaled one?
Anonymous No.106319902
>>106319040
>>106319061
>>106319095
>>106319109
this is cool
Anonymous No.106319908 >>106320123
i hate how my worst slop repies on X go viral and get likes, while my best slop is never seen by anyone. its almost like upvotes are based on what people see rather than what is good
Anonymous No.106319948 >>106319952
replace the skirts on the Japanese women with a white bikini.

kinda funny how you can turn any image into a gravure shoot now.
Anonymous No.106319952 >>106319958 >>106319959
>>106319948
No it's no. It's horrifying.
Anonymous No.106319958 >>106319969 >>106319973
>>106319952
how so? the ability to smart edits with text is something people with millions of hours in photoshop can't do efficiently. lewds are just a test, this has billions of applications.
Anonymous No.106319959 >>106319973
>>106319952
>horrifying
Woman moment
Anonymous No.106319969
>>106319958
like this:

the Japanese women are wearing a mexican sombrero and mexican poncho, and eating tacos off a white plate.

how would you do this in 33 seconds otherwise?
Anonymous No.106319973
>>106319958
>>106319959
jebaited.
Anonymous No.106320034 >>106320163
>>106319873
>I just wish you would do them without the weird creep shit angle you’re set on
How else do you get butt closeups when they're standing if it's not at waist level
Anonymous No.106320039 >>106320052
I haven't been around for a few threads. Did the anon testing t5_xxl vs flan for Chroma ever share his results?
Anonymous No.106320052 >>106320059
>>106320039
The thread collectively decided that Chroma and Qwen are bad and that we're moving back to SD1.5. Sorry, anon.
Anonymous No.106320054 >>106320100
>>106318368
>>106318388
wouldn't that make a full booru finetune of qwen or chroma very economically viable? how many h100 hours has chroma used already?

rowei:
>Spent compute - over 8k hours of H100 (apart from research and fail attempts)
https://civitai.com/models/950531/rouwei

could the next gen of finetunes be better and cheaper than rowei?
Anonymous No.106320059
>>106320052

Damn, should've known.
Anonymous No.106320100
>>106320054
It sounds too good to be true but nothing will happen either way if nobody actually tests it and tries it out.
Anonymous No.106320120 >>106320151
After all why not. Why shouldnt it work?
Anonymous No.106320123
>>106319908
same but for my slop that I post on civitai it's always some low effort slop that gets a lot of likes and the stuff I'm satisfied with gets like 1 or 2
Anonymous No.106320151 >>106320194
>>106320120
update. Outputs were garbage.
Anonymous No.106320153
I wanted to like swarm and wan2gp
But at the end of the day comfy is just better
Anonymous No.106320163 >>106320327 >>106320334
>>106320034
I mean angle as in the theme, not literally the camera angle. The camera angles are fine, close ups of cake. But the little kids always around in them kinda sus bruh
Anonymous No.106320194
>>106320151
post one for laughs?
Anonymous No.106320205 >>106320224
Anonymous No.106320224 >>106320234 >>106320266
>>106320205
kontext seems to handle text way better, qwen edit is hit and miss with various types of fonts.

but it's better overall, idk why text is diff
Anonymous No.106320230
seems pretty cool.. though it really doesn't like "CyberSecks" as a word, it just straight won't do that
Anonymous No.106320234
>>106320224
A lot of the text I've seen from qwen has been basic legible font for benchmaxxing.
Anonymous No.106320240 >>106320244
replace the anime girls clothes with a white bikini. keep her expression and pose the same.

works for animu
Anonymous No.106320244 >>106320262
>>106320240
Thank you for indeed confirming that it works for anime. We were all very curious.
Anonymous No.106320262
>>106320244
glad you were, so here is another rei!
Anonymous No.106320266
>>106320224
probably because kontext was trained on a lot more text challenges
but qwen is better because it doesn't basically anything, it only make mistakes but doesn't just do nothing
Anonymous No.106320268 >>106320278 >>106320281 >>106320302
Qwen Edit: Hot or not? I'm leaning towards hot. It feels like a more capable Kontext and cooperates. But it also seems to rely heavily on know exactly what words the model wants to perform the edit.
Anonymous No.106320270 >>106320302
Anonymous No.106320278
>>106320268
it's very good and much better at various tasks, including human characters/reposing/changing clothes/etc.
Anonymous No.106320281
>>106320268
honestly it's what Kontext should've been, both models have specific magic words you need to know, I think realistically you need to have an assistant LLM
Anonymous No.106320292
Another thing I noticed is it doesn't slowly fry the image between edits.
Anonymous No.106320302 >>106320329
>>106320268
it can actually undress people including showing tits, and that's without LORAs or finetunes.

>>106320270
based
Anonymous No.106320313 >>106320338
mario kart: middle east

In the background two tall skyscrapers are on fire, with smoke emitting from the top of the building. change the background behind the dirt road to a middle east city in the desert.
Anonymous No.106320321 >>106320326
im still stuck on sdxl, i dont know how to prompt all of your fancy new models
Anonymous No.106320326 >>106320350
>>106320321
I think you'd fit in better in /adg/
Anonymous No.106320327
>>106320163
>the little kids always around in them kinda sus bruh
smaller hands are better for hard to reach places
and more importantly you can pay them less
Anonymous No.106320329
>>106320302
I want her to yell at my balls.
Anonymous No.106320330
make a pixel art version of the image in the style of a nintendo 8-bit game.
Anonymous No.106320332
>>106320331
>>106320331
>>106320331
>>106320331
>>106320331
Anonymous No.106320334 >>106320457
>>106320163
lol this guy is gay
Anonymous No.106320338
>>106320313
kek
Anonymous No.106320350 >>106320669
>>106320326
i didnt think /a/ had a thread
Anonymous No.106320457 >>106320527
>>106320334
>not wanting to see little boys means you are gay
Never change 4chin
Anonymous No.106320527
>>106320457
>not wanting to see little boys because of your insecurity at your sexual reaction means you are gay
Correct. Nice attempt at lying by omission though.
Anonymous No.106320669
>>106320350
isn't /a/ hostile towards image gen?
Anonymous No.106321747
Goddamn, I got hooked on AI video gen. This shit is some serious black magic.