Discussion of Free and Open Source Text-to-Image/Video Models
Prev:
>>106133377https://rentry.org/ldg-lazy-getting-started-guide
>UISwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows
>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe
>WanXhttps://github.com/Wan-Video
2.2 Guide: https://rentry.org/wan22ldgguide
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y
>Chromahttps://huggingface.co/lodestones/Chroma1-Base/tree/main
Training: https://rentry.org/mvu52t46
>Illustrious1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/
>MiscLocal Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage
>Neighbourshttps://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg>>>/b/degen>>>/b/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
What's the best one for coomer animations of anime girls?
Wretched thread of foe-boat
When is Chroma v49 expected? It's been a week.
Blessed thread of frenship
Reminder that Lightx2v doesn't actually work for 2.2. You're just completely destroying the quality by using it and might as well use 2.1.
hey anons, been playing with wan2.2 via comfy and so far its incredible, but the 16fps limit is annoying for output - what frame interpolation flows do you have for AI interp? for example, 16fps input then doubling for 32fps output?
was seeing people mention topaz, but not OS and heard it wasnt the best, others mention RIFE which seems to be the best OS solution available atm, any info/sources appreciated
>>1061377012.1 is balls comapred to 2.2
>>106137701not entirely untrue but you do get better prompt adherence and motion with 2.2
>>106137285The eye pic looks like from the reddit slop collage
>>106137701well 2.2 version is up but current loras needs to be fixed to work with comfy, Kijai said he is working on it
>>1061377112.2 can gen at 24fps
>He doesn't know Owen
Lmao
>>106137711Would be a very DSP thing for the gun to jam kek
>>106137677It is being trained on 1024p images, which reportedly is several times slower, even though lodestone is now using a subset of the dataset
Will likely take another week
>>106137741what?
>>106137748it doesn't gen at any fps. the fps is arbitrarily set by the user
>>106137711you can try gimm-vfi
https://files.catbox.moe/npdu7e.mp4
>>106137749clive owen?
owen wilson?
micheal owen?
file
md5: 06a4fcd463a1e8bb87c34ab1f213677a
๐
bwo... 20B.. thats the biggest shit yet..
>Qwen
>artificial intelligence chatbot developed by Alibaba Cloud
Reminder for those not in last thread
>>106137738I like how normalfags (or whoever that chink is) get all hot and bothered over shit like "close up of an eye, abstract"
>>106137749UN Owen was her??!?!!
>>106137788The weights aren't even released
https://huggingface.co/Qwen/models
>>106137805broken atm though, wait for kijai to upload ones with fixed alpha
file
md5: 7a7e2cc354d01fb13523eca17e6af2d8
๐
>>106137788hidream for comparison is 17b
>>106137805https://huggingface.co/lightx2v/Wan2.2-Lightning/tree/main/Wan2.2-T2V-A14B-4steps-lora-rank64-V1
https://huggingface.co/lightx2v/Wan2.2-Lightning/tree/main/Wan2.2-T2V-A14B-4steps-lora-rank64-V1
https://huggingface.co/lightx2v/Wan2.2-Lightning/tree/main/Wan2.2-T2V-A14B-4steps-lora-rank64-V1
AHHHHHHHHHHHHHHHHHHHHHHHHHH ITS FUCKING REAL
>>106137738>>106137796It's literally just a pic to go along with his vague hype post, you braindead snarky redditarts. What do you think he should've put there, a photo of your mother's anus in 8k?
>>106137677As someone who got tricked into sucking furry dick and thinking that model was going to be good as it is, I'm over it. It's 48 epochs in and still kinda dogshit for what I want it for, which is realistic NSFW. It needs a finetune AGAIN on top of that to save that model.
I ended up going back to a biglust setup where I basically just use chroma as a facedetailer and might try and use it for base comp but IDK.
>https://github.com/Yuan-ManX/ComfyUI-ThinkSound
>check requirements.txt
what the fuck
this thing is a nightmare, and does not work with python 3.12. why the fuck does it require so much shit? it will 100% break your comfy
>>106137838wtf are you using that beats it for nsfw realism?
You guys ready to spin for genjam3's theme?
>>106137825bigger doesn't mean everyone is going to adopt it. the thread is very vocal about the param creep. if it's slow and locks up your hardware to make reddit slop it just isn't worth it.
>>106137805Hopefully we get a new AIO with that, and hopefully this time someone bothers making a GGUF for AIO
>>106137849the aio is shit
>>106137788The code is already pushed to Diffusers. It's a 20B dense model with a fairly standard architecture, not that different from Flux or Wan. Text encoder is Qwen-2.5-VL-7B. VRAMlets eternally BTFO'd.
>>106137829No need to get upset anon. I dunno, he shoulda used a cool image not slop
https://huggingface.co/Kijai/WanVideo_comfy/tree/main/Wan22-Lightning
>>106137847genjam3
genjam3
genjam3
>>106137864>>106137825Why is the second half the size? Which one to get?
>>106137860you can run Qwen2.5 VL 7B on a 4gb gpu doe
>>106137825>>106137825>>106137805Do they work? Can they be dropped into kijai's workflow?
>>106137864Seksikรคs ja hieno mies!
>>106137780catbox didn't work for some reason but anyway
https://github.com/kijai/ComfyUI-GIMM-VFI
>>106137825waiting on i2v
>>106137847Did you finally remove the e-mail collector thingy anon?
>>106137870if you are using comfy you will need kijais versions, if you are using lightxv2 then use the first ones
STOP.
There's a new image model? Can I download and run it right now?
>>106137857Better than autistically waiting two solid minutes for a model to swap and get like 5% quality improvement with a chance of OOM
While also not being compatible with the MultiGPU node
file
md5: 9df79e69d708e1d07c204b98228e76ad
๐
WHAT DOES THIS MEAN BROS
WHAT DOES THIS MEAN BROS
WHAT DOES THIS MEAN BROS
WHAT DOES THIS MEAN BROS
>>106137893oh you're poor and don't have an nvme. k
>>106137906you dont need to do that with
>>106137864
>>106137906You're too beta to understand, don't worry about it.
>>106137884Emails were never "collected". I initially made it a requirement to submit themes to prevent abuse but removed it.
I am rolling again from the same pool of themes, minus genjam2 themes.
>>106137914I have nvme and two 24gb cards (48gb)
>>106137927comfy persona music again please
SHOULD I PUT BOTH LORA WEIGHTS AT 1.00 WITH KIJAIS LIGHTX2V ???
>inb4 anime and schizo wins
>>106137954yes, how many times do you have to be told
>>106137918
>>106137955GenJam2 themes are excluded from the next roll
>>106137955what themes did you submit anon
>>106137927Can we add something along the lines of "comfortable animated juliens"?
>>106137973If you vote before I start the stream sure.
lcm or unipc or euler for light?
>>106137930
unironic question has ani ever delivered anything of substance
>>106137838lmaoo
tell me you're a promptlet without saying you're a promptlet
what, is anything more than 1girl, standing too much for you?
>>106137973>comfortable animated juliensKek
Imagine a thread full of animated juliens
>>106137701Doesn't work? It definitely speeds it up. Maybe it doesn't look as good, and I feel like setting CFG really low makes it not follow the prompt as well, but damn, 5 minutes vs 30... I'll take the 5 minute route.
T2V is frustrating. You get a short test, it looks great, you increase the frames, it looks totally different.
here is new 2.2 lightning with 4 steps total
spinning the genjam wheel very soon, last chance to get your vote in
https://forms.gle/WWpjwXw5tCgFSJbP9
why is this shit so ass? both loras at 1.00 and unipc sampler
wan 2.1 lightx2v with 1/3 weight is better..
so is lightx2v broken or not? Should I replace the loras in kijai's workflow?
>>106137825distill bros ww@
>>106138028you know its a T2V lora right? You will need to wait on I2V
>>106138023new lightning? where lora.
>>106138024You will not get my mail address (and also never catch schizo anon)
>>106138048open a private tab or incognito window
even worse with lcm
>>106138042the wan 2.1 t2v lora worked just as well as the i2v lora (with wan 2.1)
>>106137988I'm just not a fan of mutated feet/hands, genitals, backgrounds... could go on and on.
another with lightning, and yea its only T2V atm
>>106138046disregard, am retard.
>>106138061>the wan 2.1 t2v lora worked just as well as the i2v lora (with wan 2.1)no it did not, it massively hurt motion / prompt following
>>106137864>T2VI2V when THOUGHBEIT???????
>Return type mismatch between linked nodes: scheduler
how do you setup a scheduler selector? I want to reuse the same scheduler for all my node.
wait the new lora is 4 step, does this mean I have to go 2 high 2 low or 4 high 4 low? right now im at 3 high 3 low
>>106138069cool slow motion. is that just a bad prompt or on purpose.
>>106138082https://files.catbox.moe/wf0s5h.mp4
no it did NOT hurt as much as this one
see vid/workflow^
>>106138083>THOUGHBEITzillenial here. i'm drawing the line in the fuckin sand. your slang is SHIT
>>106137829Are you just mad because LMG had the same reaction keke
>>106137260>>106137289
>>106137825huggingface co/lightx2v/Wan2.1-I2V-14B-720P-StepDistill-CfgDistill-Lightx2v/commit/2739d76
this commit was also two hours ago
>>1061381100kb file
cool video.
euler
bruhs... whats happening ITS FUCKING OVER ITS NEVER BEEN THIS OVER ITS FUCKING JOEVER
THIS SHIT IS WORSE THAN WAN 2.1 T2V LORA
>>106138111It's not slang, it's retarded.
>>106136824And did you see the ~25% supposed speedup?
>>106138096nevermind, figured it out.
https://github.com/ClownsharkBatwing/RES4LYF/issues/142
WARNING: IF YOU USE RES4LYF, IT WILL BREAK ALL YOUR SCHEDULE SELECTORS.
my gen are so unique and personally identifying that often i do not post them
few understand this burden
>>1061381310.125 strength, you fat fuck
file
md5: c6f1d56d17fe8e80283ccfd95ff9add2
๐
>>106138147>https://github.com/ClownsharkBatwingThis shit was cool initially but then I realized I don't need all of it.
>>106138157anons told me to use 1.00 strength i donloaded it from kijai repo im using kijais workflow
>>106138153gonna try using it at higher values
>>106138164yeah i didn't really notice it being better than uni_pc/etc anyway so im disabling it until it's fixed.
>>106138170> kijaideserved
native is simply better.
>>106138170ignore him, its 1.0 with kijais, in fact I think that is too low
wheel
md5: d8bd73d89908e19a24e17529910febd0
๐
It begins...
https://www.youtube.com/watch?v=T0RppYr3_oI
Choosing three categories for genjam3 in 10 minutes. First spin at xx:25.
>>106138183the regular loras dont really work with comfy retard
>>106138153can you show a comparison of it without lightx2v?
also do you mind posting the script you use to compare videos? thanks
ITS OVER
also just found out that only one model had been using torch compile kek, with both models on torch compile my gen time went from 170s to 160s
2.1 lightx
>2d anime style. a magical girl in iridescent silvery robe. she holds up a bismuth crystal and it tranforms into a magic wand with a cybernetic aesthetic
>>106138194fuck this royalty free music play something by peter gabriel
actually i take that back after hearing the kino lyrics but still play some gabriel
>>106138204nta
https://github.com/BigStationW/Compare-pictures-and-videos
>>106138211this looks like complete dogshit.
>>1061382132.2
both on kijai's wf. I'll check native too
I'll re-run this one with the 2.2 lightning lora and see if it looks less fuzzy.
>>106138225yea i tried unipc lcm and euler so far
all 2+2 steps, weight 1.00 and kijai workflow
>>106138213>>106138226why the slow motion anon?
>>106138211Have you considered that nobody wants to see your failed gens?
Works good enough with I2V too, did 4 + 4 steps, no other loras added, using kijai's workflow
>>106137650 (OP)Yo can some of you who's good at the prompting shit make something in this style but better? (Perchance.) Would be funny to see what you lot can do.
>>106138245that's what lightx does. it kills the motion.
>>106138249>4+4 stepsBRUH
huggingface co/lightx2v/Wan2.2-Lightning/discussions/3#6890c92d54a8b9ff771c8a88
>Kijai
>27 minutes ago
>They didn't include the alpha in the weights and instead have hardcoded alpha 8 in their inference code, this means you should use these with strength alpha / rank, which is 0.125.
https://huggingface.co/lightx2v/Wan2.2-Lightning/tree/main/Wan2.2-T2V-A14B-4steps-lora-rank64-V1
we cant stop winning anons.
>>106138256this is just some generic anime style
Welp, I got ROCm working under WSL2 for RX 9070 XT, but it BSOD's after first txt2image generation. Speed is 2.3Th/s - idk if it's good or not? Anyways, fuck it all I'm not bothering anymore.
>>106138262wow it's nearly like that one anon was right.
this shit is brand new, let's give it a few hours for all the shit bugs to get ironed out
>>106138260bruh what nigga
4 more minutes until genjam 3
>>106138194
>>106138272>kijai says to put 0.125 str on both lorasbtw
>>106137788>guidance_scale 0.0>inference_steps 4Oh great another ultra distill slop model.
Jamming very hard right now
>>1061382804+4 steps is too much
wan2.1 lightx2v was made for 4 steps
Let's take a moment to appreciate how Alibaba is BTFO of western kikes, during this entire year.
1 - New open weights LLMs competitive to the SOTA proprietary ones, especially in coding, in several different sizes (covering different setups). They always had the best text embedding models too (if you're into rag)
2 - A local video model as good as the API-only ones
3 - Now a big 20b open image model that will probably "know" a lot (given that in the demo they prompted for an specific artist, Edward Hopper)
That's why OpenAI, Anthropic & co were crying to daddy government to ban chinese AI and restrict the sales of retail GPUs
I think those smaller labs like Midjourney, Luma, Runway, Black Forest Labs & co will get extra fucked lol, some will end up just becoming API providers for open weights models instead
>>106138280supposed to be 2+2 but i doubt it matters if you double the steps
>>106138257if you use it wrong. correct. there's a reason it was updated a while ago on 2.1 to not be as bad on motion.
>>106138274Sort of yeah the idea though is not done much with. Can't run AI on my end to do shit with it. Perhaps you can improve from it rather than keeping generic using some specific art style that fits it?
>>106138285im assuming the same instructions apply to his 600mb versions too
1 more minute
>>106138194
>>106138297its 4 steps per stage
>>106138297but 2+2 produced this garbage. I get it's the T2V lora, but gen still took only 140s
>>106138302then all these anons must be using it wrong because i'm seeing slow motion in all of them. then again I never liked lightx2 even in 2.1.
>>106138311Put timer into video
>>106138285NO HE DID NOT, he said that for the original ones, he said to use 1.0 with his fixed ones
>>106138324I will at some point
>>106138299There's also huge security risks for Chinese AI because you really think even as a fucking Chinese Wumao that they WON'T be using AI as a way to sniff out people's data? LOL LMAO EVEN. Getting tiktok to the army bases and having their user data leak every detail? Whoops, remember that? LOOOL.
>>106138299Ok Junyang release the weights and we'll see
>>106138312kijai didnt [rape] correct the poster in the lightx2v 2.2 issue so ur lying
>>106138330> try both versions> both just produce blurry nonsense ok.mp4
>>106138330Kijai
28 points 22 minutes ago*
Great work from the Lightx2v team once again!
There's bit of an issue with these weights: they are missing alpha keys and they are using alpha 8 in their inference code. This means for the intended 1.0 strength you need to use alpha / rank, which is 0.125.
I added the alpha keys and also saved as fp16 since that's what we use mostly in Comfy anyway:
https://huggingface.co/Kijai/WanVideo_comfy/tree/main/Wan22-Lightning
Edit: to clarify, strength 1.0 with these = 0.125 in the original.
here is 5 + 5 steps with new lora, use the lora at 2.0 weight on both, 1.0 is too low
882
md5: d28673b87f2270e651bbf9e207fa71c1
๐
Category 1. Can't escape anime stuff.
Next roll at xx:29
>>106138299>I think those smaller labs like Midjourney, Luma, Runway, Black Forest Labs & co will get extra fucked lolWhich makes me think they may lobby hard to get local banned due to the risks "unsafe content", deepfake scams and cp. The currently EU boomers would happily approve it
WHO THE FUCK IS YOIMIYA GET THAT SHIT OUT OF HERE I HATE THIS FUCKING GENJAM SHITTY THEMES FUCKY OU
>>106138350oh, he made his own update
>>106138299>Now a big 20b open image model that will probably "know" a lotIs it natively nsfw?
>>106138249>Works good enoughlooks like dogshit. use a normal workflow before saying something works. You're not comparing it to anything.
4bfgh
md5: 16e54f5bcb3a5bf85d57947a0887615b
๐
>>106138360>big butt selfie
>>106138350here, fuck the troll, and 1.0 is still too weak imo
>>106138366https://huggingface.co/Kijai/WanVideo_comfy/tree/main/Wan22-Lightning
ITS GENSHIT REEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
>>106138371Wan is also made by Alibaba and knows nipples + areolas, butts and pubic hair, perhaps there is hope
tried the new 2.2 lightning lora (kijai)
same result as just using the 2.1 light2xv + fastwan lora kek
>>106138395remember you can be creative and not literally have to include the character
>yt thumbnail
fuck well looks like i have to give chroma a try now
ytt2
md5: e8bddef71e0d79d81308860ce1a4d43c
๐
>>106138360Category 2: Youtube Thumbnail
>>106138397It doesn't know much freak fetish content unlike chroma. And loras are much harder to make for it.
might need higher shift as well, still testing
Next roll at 00:33 (2 minutes)
>>106138360
>Too early to say best settings, but minimum that works is 2+2 for 4 total steps, but it does seem better with more steps.
>>106138416ok im just going to inject a fake txt2img workflow into a real youtube thumbnail of goymira and submit that fuck this gay shit
also, you might not need the low noise one
file
md5: 10633a4f6e4268310ae1d6487eafe357
๐
ITS FUCKING OVER ITS OVER
8STEP NIGGERS GET BURNED AT THE CROSS
>>106138446? even 5+5 cfg 1 steps is like 8x as fast and without
>>106138416Category 3: Starry Skies.
GenJam 3 Theme:
Yomiya
Youtube Thumbnail
Starry Skies
>>106138417If you are asking "will it do pens in vagene?" then the answer is no, but there is a chance it will know basic female anatomy, especially since there are lots of "SFW" online content featuring nudity like paintings and "tasteful art" stuff
12 shift, 6 + 6 looks good so far, not sure if 2.0 weight is a little too much though
>>10613846010 total steps would bump the time from 160s to 400s minimum on a 3060
its FUCKING OVER
>>106138473STOP INCREASING THE FUCKING STEPS STOP IT FUCKCKKKKKKKKK
>>106138454when i used 6 back in 2.1 it seemed like the sweet spot, more details than 4 and motion not jittery like 8
>>106138473kijai wip workflow doesnt use shift I dont think, try that too (testing now)
>>106138446Kijai has the same authority in videos as comfy in image generation, aka , he spends too much time coding instead of generating, so take his word with a grain of salt
>>106138478you do know 1.0 cfg halfs the time a step takes right?
So 12 1.0 cfg is like 6 steps with cfg and without light you need at least 30
Also doing what I did with 2.1 before and having the first high noise step with 0 light weight / 4 cfg is good
>>106138481It does. It's in the sampler
videofags are insufferable
damn.. this is a better result
2+2 steps
1.0 weight low 3.0 weight high
>>106138489i know but the wan2.1 lora was working well enough already at 4 total steps (with 2.2)
>>106138501I get some noise in the face with default + 1.0 str on the new loras
GenJam 3 stars now. Submit your gens, jammers:
https://forms.gle/hWs19H4vTGTdwARq8
The themes are:
Yomiya
Youtube thumbnail
Starry Skies
i loaded kijai's 2.2 workflow. disabled torch compile. genned a video with the old 2.1 loras. genned another video with the new 2.2 lightning loras set to 1.0, same seed. the motion on the new loras is blurry
new loras, kijai workflow, high/low at 1.0, 3/3 steps:
will try 3 str high next like the previous setup
Hopefully the Qwen image release will tone down the shitty video spam a little
>>106138504It's a mental disorder. And they're here to stay so enjoy :)
>>106138465I've just looked at those options for the first time and they're all dogshit
what's wrong with you
you need 3 wheels, a subject, an activity and an enviroment or theme
Had to write a script to check for gpu_1x_gh200 availability
Got one now, precomputing more embeds then starting another run.
More people need to try distilling stuff to fight the API SaaS menace
>>106138509> using t2v lightning lora with i2vso you're like a special kind of stupid yeah?
something like this may be best to give each model 1 step without lora / cfg
>>106138549>what's wrong with you
>>1061385413.0 high lora, got a transparent todd, will retry
>>106138565but don't take my word for it, check your engagement trend, how is it looking?
>>106138559calm down fag. wan t2v and i2v loras are interchangeable to variable effect
>>106138559before, t2v lightx2 worked just fine with wan i2v 480p/720p.
>>106138356What are the vram/gpu requirements to run that locally?
>>106138575Don't really care.
>>106138559it just worked with wan 2.1
https://litter.catbox.moe/1epxaubih8zj0nfa.mp4
>>10613858316GB Vram + 64GB ram bare minimum
>>106138586I'll remember that when you're issuing your reminders to participate
>>10613858312gb vram + 64gb ram
>>106138601I'm sure you will sperg.
>>106138594>>106138602So, impossible to run on an m4 laptop with 32gb?
>>106138580yeah. i can see that. it works so well.
run very hard against a wall.
there we go, that is with my 1 step disable each WF
>destroys prompt adherence
>destroys motion
>makes everything look like 1fps slow motion
>yep, it's working good!
why are people like this?
>>106138613rot in hell and die
Guess Ill try 2 steps without lol, guessing only 1 without was too jarring for high noise, that or shift 12 was too much for that
WERE FUCKING BACK ITS FUCKING WORKING 3.0 3.0 2+2
>>106138607>m4lol
>laptoplmao
>>106138621even the 2.1 lightx2v lora worked better than the 2.2 one kek
>>106138624tiny lil babby bitch.
>>106138624I think you have it backwards...
trying again with shift 5 instead, I think that was the issue there
forgot to scrub all of the previous prompt, came out better strangely
a
md5: c9b9f4283353611a48e5af6f3e592b2d
๐
noooo stop having fun with videos you guys
sdxl 1girls is all we need bros amiright?
>>106138160it's safetensor format not pickled
Even though lightning is a t2v lora it still worked
>>106138628> that quality degradation in the movemntanon.. it's actually over. look how her arm blurs and smears the image.
that is not a good result.
Is the lightx team working on a version for 2.2 or did they give up?
>>106138638Yep, mental illness.
>>106138652did you not bother reading the thread?
2.1 i2v 480p at 3.0/1.0 str (kijai workflow) still works best, but let people figure out this new t2v one/settings
the "older" one is still an i2v specific lora, so it should be better for motion/etc. new one might be great for t2v though.
Look I'm just going to stay on the 2.1 lightx2v until the i2v version comes out. Simple as.
>>106138638In a few hours it's going to be 1girls made on Qwen Image 20b
havent gotten a single t2v gen thats not slow motion with new 2.2 lightning
2.2 + 2.1 lightx2v is still better
>>106138652its made, just trying to dial in settings cause it works differently with the 2 stage
ITS FUCKING HAPPENING ITS HAPPENING
IT FUCKING WORKS WERE BACK AHHHHHHHHHHHHHHHHHHHHH
>>106138648how many steps? weights?
Either 3 strength or shift 12 is too much because I'm getting duplications in pics.
>>106138673Not in this general.
Does even know what anime is?
>>106138701No one cares about your tranny general, fuck off
>>106138689quality and motion is good now but im getting artifacts, that means 3.0 is too high for 2nd step, gonna try 0, 2, 1... now
>>106138708Take your meds schizo
ITS FUCKING OVER ITS OVER
THEYRE NOT GETTING NAKED FUCKING BITCH NIGGER
until there is 2.2 i2v, 2.1 i2v lora/setup still seems better
>>106138725>16 stepsits over.. still thanks anon
https://www.reddit.com/r/LocalLLaMA/comments/1mhhctd/meet_qwenimage/
>>106138628I don't get why this makes anime fags seethe so much
>>106138742>https://www.reddit.com/r/LocalLLaMA/comments/1mhhctd/meet_qwenimage/kinda looks like slop
https://huggingface.co/Qwen/Qwen-Image
bench
md5: 00efb25e24a39d56db1877e908aa9c1a
๐
https://huggingface.co/Qwen/Qwen-Image
>image edit
HOLY SHITTT AHHHHHHHHHH
>>106138742>ghiblisloplol
>>106138660Yeah I meant i2v version, it's confusing which one got released.
When will we get an image model with Soul
>>106138770You just unreleased it with this powerful mockery, bravo anon
s1
md5: 541cdaba4d737b0332eea8d35dff2518
๐
>>106138780It's called Qwen-Image
>>106138734Yeah I dunno maybe you can use less. That's just what I had settled on for lightx2v. It doesn't look fried at 8.
comfy fp8 workflow for qwen when?
s2
md5: 434e3223630020ee59c572f9da24566e
๐
oh wow, this looks incredible, qwen with wan and now this
>>106138769guess we'll have to wait for a quant model
>>106138804https://huggingface.co/spaces/Qwen/Qwen-Image
qwen has been fucking killing it lately, text, video and now image sotas and fucking apache 2.0
>>106138813I guess we'll have to see if it's got built in refusals and censorship like Kontext. Hopefully it's as unsafe as Wan :)
1
md5: 00fa7bc5c65b0b00ebc5339eb977f526
๐
>>106138742>But Qwen-Image doesnโt just create or editโit understands.
>>106138813>https://www.reddit.com/r/LocalLLaMA/comments/1mhhctd/meet_qwenimage/QWEN LORA TRAINING WHEN????
>>106138786>https://huggingface.co/Qwen/Qwen-ImageNice. Waiting for the combined file version to be made, then I'll try that in comfy.
>>106138831>built in refusals and censorship like KontextStill crazy that they continue releasing dead end """safety""" obsessed models while the competition doesn't give a shit about it.
Every day I wish AI was a thing in 90s/00s with the less safety obsessed culture.
>>106138809usually a few weeks before it's out
>>106138858We'll have to see, Qwen is on the "safer" side. I bet it doesn't do nsfw
>>106138858UMM SIRE THINK OF THE CHILDREN MMKAY
So it's confirmed using the 2.1 lora with 2.2 currently produces better results than the 2.2 lora?
Where's the day 0 ComfyUI support???
wow very natural qwen posting guys
shill harder
god i fucking hate everyone here
>>106138833but does it understand "uncensor this image"
If using I2V then yea, big shocker but the I2V lora works better
>people are getting excited seeing new models, wow crazy must be some conspiracy
meds
>>106138877neither chroma dev or qwen devs are trying to shill to diaper fetish pedo freaks hate to break it to you
>>106138877>what the fuck they are discussing a new diffusion model in the diffusion generalpills
>>106138894>diaper fetishplease, this isn't deviantart
>Furthermore, the NSFW Filter is applied to
exclude content containing sexual, violent, or other offensive material.
Of course it would have been stupid to think otherwise
>>106138877>no you shouldn't be excited about anything that isn't SDXL
>>106138894Chroma can do diapers naturally tho
file
md5: 3c295607c48357dbd8e5040d9fadae85
๐
ITS FUCKING OVER QWEN 20B IS KEKED
>>106138934every model is that sadly, its what finetuning is for, look at what chroma did to barbie doll flux
wow very natural qwen posting guys
shill harder
god i fucking hate everyone here
>>106138821Took 10 minutes and it errored out kek
>>106138934And into the trash. Call me when an uncucked tune comes out
>>106138946fuck off to the sharty faggot
>>106138934Wan (and even HunyuanVideo) say the exact same thing in their technical reports. We have no idea how aggressive the filter is.
>>106138934its so annoying that its filtered out of models, I get its hard to get funding when you can generate gore and tits but come on, life is NSFW
>>106138946fuck off to the sharty faggot
>>106138934wan:
https://arxiv.org/pdf/2503.20314
>Through our internal safety assessment model, we systematically evaluateand filter inappropriate content based on computed NSFW scores in all training data.
And yet.
>>106138958It also depends of how the filter is trained. The ideal scenario is the dataset is uncensored and they used a neural network to steer to safe. This is easy to remove with finetuning because you can steer the model back to do doing nsfw which it has knowledge of.
>>106138944i want to see you finetune a 20b model
>>106139209qwen image is not distilled meaning it will be much much faster to train, it will be tons cheaper than training flux
>>106139224baseless copium
>>106139283I see you've never tried training schell before, sounds like your a coping vramlet
>>106139297i see people making claims how a model will surely be saved by a finetune (made by someone other than them of course) every single release
>>106139315sdxl was saved by pony then illustrous, then flux was saved by chroma. Are you really just a coping vramlet still running 1.5?
>>106138275Ok so I guess the issue is adrenalin driver on windows side, this requires a version that hasn't been released yet. Damn.
>>106139333and you know how many models there were other than sdxl and flux?
hidream, hunyuan, sana, sd3, sd2, some russian models i dont even remember anymore
every time
if you think only sdxl and flux exist and magically got immediately finetuned then you must be new
>>106139435why the fuck would anyone train SD with their license and it being much worse than flux, same with he others, all worse than the alternatives