Discussion of Free and Open Source Text-to-Image Models
Prev:
>>105618255https://rentry.org/ldg-lazy-getting-started-guide
>UISwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
>Models, LoRAs, & Upscalershttps://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info
>Cookhttps://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe
>ChromaTraining: https://rentry.org/mvu52t46
>WanX (video)https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1
>MiscShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Archive: https://rentry.org/sdg-link
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate
Local Model Meta: https://rentry.org/localmodelsmeta
>Neighborshttps://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg>>>/b/degen>>>/b/celeb+ai>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
>>105621430 (OP)>no ani in collage>no anistudio in OP
h
md5: 5874f4921b4ce874b5cbaa2922006ff1
🔍
>>105621370don't use teacache. like at all. use mag cache as it is not only faster but also doesn't rape quality so hard.
anons, what do these three settings do? any of them noteworthy for speed/quality?
having played with the flowmatcher for a few gens now: it seems okay for the most part. i could swear it has some quality loss compared to lcm/simple sched but could be placebo.
Don't engage.
>Maintain Thread Quality
>previous on page 5 before new bake
ldg has fallen
so is this new video stuff snake oil or nah?
absolutely no respect to the anon trying to get rid of comfy. you guys don't deserve him
>>105621519far from it, this shit is amazing
>>105621430 (OP)You missed out this one, despite being "boring" it's a decent showcase of camera movement and motion, and the physics of the box is impressive
uh oh the disgusting and untalented nigger is here everyone
544
md5: e5690ea1077389a507d467d56e285886
🔍
ok
>>105621571it's an error with torch compile, remove that and try again
this thead is ranfaggot's safe space
>3 minutes to generate a 5-second video with a resolution of 480x832
>the 1girl's arms keep phasing in and out of existence
Should I feel blessed, or can I do much, much better than this?
>>105621593Don't engage.
>Maintain Thread Qualityhttps://rentry.org/debo
>>105621581oh dear it's that schizo time again
>>105621593Just ignore him he has nothing left
>>105621594lean into it and prompt her turning invisible
>>105621519nag + mag cache and the self force lora are genuinely good. quality is amazing and gen times are down to less than 2 minutes for 1024x640 t2v
>105621604
looking at your images it's more like you don't have anything left or ever had anything in the first place
Blessed thread of frenship
>>105621547> the box even jostles the cablekino, aside from the quality
>>105621580Still nothing. Do I need to set the ram offload to some specifc value? I just randomly picked 10.
>>105621595Your schizo pastebin doesn't work here.
>>105621661we can't help you if we don't see what's goin on, show a screen of your workflow
>>105621634Yeah, even though Wan does not understand a lot of concepts or has trouble following some of the prompts, it has a great grasp of physics and worldly things, especially in "realistic" inputs
>>105621681The basic unchanged i2v from https://rentry.org/wan21kjguide. Only changed the ram oofload value.
>>105621691and what gpu do you have? we really need more info
>>105621571>windows portable
029
md5: 161260f76526469db3775e4ff40a0ef4
🔍
>>1056217034070 super. Running the Q8_0 wan, since I was told I can offload it to ram.
>>105621571Busted install and/or venv. Reinstall, follow the rentry guide, make sure you run the .bat installer too.
>>105621748This was a separate installed instance just for the wan. Everything was by the book in the rentry.
>>105621519snake oil, the only people who are excited are vramlets who have never been able to gen a video using wan at its full force, so they don't know how good wan really is compared to that shitty distill lora, a real WAN user can tell its results are clearly better to those causvid loras but not close enough to match wan at its full potential
>>105621766>https://github.com/woct0rdho/triton-windows/releases/download/v3.0.0-windows.post1/python_3.12.7_include_libs.zipPaste that into ComfyUI_windows_portable\python_embeded
Then restart it. Tell me if that fixes it.
>>105621766>https://github.com/woct0rdho/triton-windows/releases/download/v3.0.0-windows.post1/python_3.12.7_include_libs.zipExtract the two directories inside of it into ComfyUI_windows_portable\python_embeded
Then restart it. Tell me if that fixes it.
>>105621519>add realistic twerk lora>gen>she barely moves her ass>remove lora, gen normally>shaking her ass like her life depends on itIt's fucked for anything but the simplest shit. Anything with physics and lots of movement? Trash
>>105621838And by 'remove lora', I mean the lightx2v one
Does running a lower quant for 720p give better results than a higher quant at 480p?
I'm getting oom with Q8 720p for Wan, but not with the 480p model, so I was wondering if it's worth using a lower 720p quant if it works better
i2v with 16gb vramlet
>>105621845post your wf and settings. it's probably something wrong there.
i use the rentry wf with the lora at strength 1~0.7 and LCM with 4 steps and everything has been working fine.
don't forget the patch model order node because without that you will indeed not get motion
>>10562177848gb here, I've been using Wan since day one, and I am telling you that you are full of shit
It's better to simply perform several gacha gens with the lightx2v lora and you'll get like 1 good gen out of 3 (so about 6 minutes) instead of waiting for 15 minutes for a single good gen and keeping track of it to abort it if the preview looks bad
It's not like the base Wan model is an amazing flawless model to begin with
>>105621838of course you fucking brainlet, this is essentially almost a new model
you'd have to retrain the loras using a merged base with self force
I need a uncensored vision model for ollama comfy usage, quick.
>>105621913well you could use ..
> quickok not with that attitude mister.
>>105621801Yeah this was it. Thanks.
>>105621934How the fuck strong is she, goddamn.
>>105621911clearly you don't know what you're talking about because WAN full motion mogs anything that that distill lora can generate, is not comparable
>>105621944That .zip should have been downloaded and autoextracted when you ran the .bat file. Either you didn't run the .bat file or it fucked up for some reason. If you did run it, I updated it a couple of hours ago, did you use that version or an older one? Ie, did it give you the two GPU options when you ran it?
>>105621911>you'd have to retrain the loras using a merged base with self forcenot entirely true. i have yet to find a lora that doesn't work well. in most cases it comes down to playing with the lora weights if at all.
which lora doesn't work for you?
>>105621965>.bat fileThe auto_installer? No, but it threw me errors for the disc folder not being in PATH (I added it manually later) so maybe something borked there.
>>105621930it was meant to be a lighthearted joke. do you understand jokes, anon?
wanvids
md5: 1188f6bba6fa048684da65cdaf182a07
🔍
>>105621951I have made hundreds of wan videos so far (before self-forcing), so yes, I do have some authority here
Yes, base Wan without distillation or performance tricks is obviously better, but not "it's worth it waiting 15 more minutes" better, you will have better luck just gachaing with outputs in the same timeslot
file
md5: 609b080013bcebaed9bde461553219f3
🔍
stfu
md5: 43f8800793254252947c4098aa5305f9
🔍
>>105621995if you have to wait 15 minutes for a video then you're a fucking newfag, I can tell you dont really dont know shit
>>105621992honestly all the constant trolling and shizoposting has jaded me, even with 99% of the posts hidden.
you can try jocaptionbeta, it is fully uncensored. though idk if it works with the ollama stuff.
https://github.com/silveroxides/joycaption_comfyui/tree/rework
the rep i linked has seed control and a bunch of stuff that is better than the base version.
it's been fun using that in a workflow that feeds the text directly into flux/chroma. it can also output booru tags instead etc. fun stuff.
>>105622004what do you mean?
>>105622004Show us your card and how long are you taking to generate a video, with the number of steps
I have spent this whole morning comparing a custom Skyreels V2 t2v lora with and without lightx2v. 30 steps euler CFG, vs 4 steps euler no cfg with lightx2v. I have a workflow that generates 2 videos with the same seed for the 2 cases.
I can't consistently tell a difference. The gens are different with the same seed, but one is not noticeably better than the other. If you gave me a blind test I would be no better than random chance at saying which is which. Actually fucking insane. At the resolution and length I'm using it's 30 seconds vs 8 minutes to gen. Some anons are saying that it destroys quality and motion, but I'm not seeing it. Maybe my own lora is why, idk.
>>105622039>Some anons are saying that it destroys quality and motionsome anons are also just wrong or trolling.
the self-forcing lora has singlehandedly made me run wan 24/7 again.
>>105622029better show some examples before talking trash, with >muh authority
while I agree that this new distill lora is better than those previous causvid loras, full WAN is still just better, the motion and everything it add is just better there is not comparison yet
>>105622073you did not answer my question
john11
md5: 34b5f749ddae916f6ad5a7c3dd144418
🔍
>>1056220416000 years ago, newfriend
>>105622073a 720p version with the distill lora, I agree its good, but if I generated the same 720p video without the lora it would look even better, and thats because I've genned tons of 720p full WAN videos, the quality is amazing is just the waiting time that kills the buzz
>>105622073are you implying that this video looks good or something
If I want the video to point the gun at me is it "aim at user" or "aim at camera"? or does it have a specific prompt?
ABF1A732
md5: 16d9df32667392212e3d89bcb3d61fe9
🔍
>>105622048No. You might just not have an eye for detail. Using the self-forcing lora 100% fucks the motion. it's not even debatable.
>>105622114i've had good results with "viewer".
"user" obviously makes no sense.
>>105622106>overwhelming arrogance>no artistic capabilities whatsoeverVery common in these threads.
>>105622013100% understand what you mean. I've been trying to stay friendly and help other anons here for a long time but it is just not a nice place. hardly get anything back either, other than weirdos taking my gens and turning them into cum-animations. anyways, thank you
>>105622147>eye for detailI wouldn't say anything.
>>105622073werent you the anon who said those vids were Sora or some SaaS gens and trolled the thread about how local sucks
>>105621838because it is trash, everything these clowns use to speed things up is trash.
This however.
https://civitai.com/models/1678575/wan21fusionx-the-lora?modelVersionId=1899873
combined with this
https://huggingface.co/Kijai/WanVideo_comfy/blob/main/Wan21_T2V_14B_lightx2v_cfg_step_distill_lora_rank32.safetensors
cfg 1
4 steps
lcm
works really well, and other loras work fine with it.
>>105622110>are you implyingclearly the post talks about motion and added details, why are you so dumb
>>105622189This lora(FusionX) has MPS baked in which changes the faces. I'm not using that shit.
also if you're still using the wrapper nodes then you are a fucking retard, same goes for using teacache or any other crap which degrades the quality and motion. I don't give a fuck if it can enable gen times of 70 seconds, its all fucking trash.
>>105622206Alright, calm down and have another drink, Ivan. Don't want you to start calling people cunts and then crashing out with 80's nostaglia music videos.
>>105622204yes. you shouldn't use that.
the lightx2v lora is more than enough and doesn't fuck with loras.
>>105622204it does not change faces, at least I've not seen that using the iv2 version of the lora...
thank you rentryanon for the fast workflow
>the camera rotates and follows her as she sleds down the snowy mountain. she kicks her feet in glee and moves her arms. her hair and clothing flows in the wind.
>>105622206honestly we are all retarded for propping up a shitty python app that gets worse as time goes on but has the latest midels
>slow motion
I'VE BECOME SO NUMB
I CAN'T FEEL YOU THERE
FUCKING SHOOT ME YOU DUMB HOE.
>>105622217Many users have already complained about it. I seriously can't trust you people when it comes to quality if you can't tell when something is fucking your outputs.
>>105622213Just pulling anons away from that snake oil salesman, you can just tell by all the idiots trying to shill their workflows that it reeks of excrement.
>>105622233jesus what is going on there. your wf seems messed up that it's breaking that badly. give me a few minutes and i'll try with that image
D270CC3E
md5: ab7e59110cf1b8309822b6ad266087ff
🔍
>>105622228That's my only real complaint so far0
>>105622228>slow motioni've seen that myself happening with this fusionx lora and it was suggested to place slow motion into the negative prompt.
>>105622222>quints>but it's a shitty cumfart getchecked but dissapointing
>>105622242>give me a few minutes and i'll try with that imagerun it through a small amount of blur or wan will fuck it because wan don't like working with its own frames for some reason. it only needs a very subtle amount of blur not enough to be noticeable to the naked eye, just trust me on this because gen from a wan frame tends to over saturate any new frames.
>>105622206Teacache for up to 0.19 speeds up the gen by a lot for almost non-noticable degradation, if you are using full Wan Q8 that is.
>>105621778This
>>105621911>It's better to simply perform several gacha gens with the lightx2v lora and you'll get like 1 good gen out of 3 (so about 6 minutes) instead of waiting for 15 minutes for a single good gen and keeping track of it to abort it if the preview looks badEvery single sped up version of Wan has stiffer motion, more plastic looking texture, and usually more oversaturated colors or whatever you would call it, alongside of more grainy/shittier quality motion that warps instead of smooth movement.
I don't disagree that these advacements are good enough for memes and very simple things like
>>105622222but even there you see how her hand warps frame to frame for example instead of smoothly moving. This is a far cry away from full Q8 Wan that doesn't do that.
It's also especially evident in the details of the eyes when you look at any realistic gen too.
>>105622242>>105622286Here's the orig. Post the setting and prompt. I'll try with your since I have no clue.
>the winds blows hard. the girl's hair and cape flow in the wind. white petals detach and fly across the field. trees sway in the wind and leaves fly into the sky.
>>105622286what the fuck are you talking about. not only are you a promptlet but you have no idea what you are saying. i took the very first img of your video and did this first try.
yes, the gun looks off but it is no where near as broken as your gen.
also, for double insult: i'm using 720p with a 420p res because fuck the system.
>>105622324holy shit nice
Any tips on getting good, coherent hands?
Is the issue that im using Pony?
Is Illustrious significantly better at that?
>>105622311>>105622324ok now that i got that out my system:
USE THE RENTRY WF. it's what i used in that gen (+ the new light2v lora so i don't die of old age).
the prompt was: "a cute girl wearing overalls and holding a gun turns towards the viewer and then aims the gun at the viewer. we can see down the bore of the pistol.".
what many tards don't understand is that wan has a very specific way of prompting. you have to talk to it like it is a retarded child, like: "and then x, and then y" etc. if you prompt like it's SD you will get cancer in return.
>>105622348shitmixes are the way to go for both models to fix hands. get noob or some derivative. pony is obsolete
>>105622324was that you who put the pause in the video before she turns to the camera and points the gun at it? I haven't seen any of these models do something like that before
>>105622351Ok, I'll try with that
Wan2.1-T2V-14B-StepDistill-CfgDistill works at 4 steps, yes, but 8 are preferred. Anyone actually try 8 for the LoRA? Its based off and extracted from it.
>>105622348what the other anon said, upgrade to illustrious or n00b
file
md5: cada60f0bcafb670610253cc88a288b4
🔍
> picreli fucking LOVE the light2v speed lora. i dropped wan ages ago because i couldnt justify the long times but this shit genuinely slaps now.
>>105622362no, complete random chance. i was going to add "she smiles smugly at the viewer" but that might fuck with the way she aims.
you have to understand that every single thing you put in the prompt it will try to add into the video. so the smugness might suddenly completely fuck with her turning pose or aiming. i have several TB of gens to back this up. you really do have to be turbo autismo when prompting.
>>105622382huh. interesting point. haven't tried 8 because 4 seemed fine. give me a min to try it
>>105622355appreciate the help but can you elaborate
whats a shitmix?
huh, first time im hearing about noob
whats the difference between illustrious and noob, is one better for porn?
light2v + NAG + flowmatchsigmas is amazing, this takes less than 1 min on a 4090: https://files.catbox.moe/7kak88.mp4
>>105622411https://rentry.org/localmodelsmeta#image-generation-models-anime
My advice? Start with wainsfw. Far more newb friendly. Its in the list
>>105622420>this takes less than 1 min on a 4090We can tell
>>105622445its not that far from base wan now and it takes fucking 50 seconds to gen instead of 8 mins.
https://files.catbox.moe/jgo9de.mp4
>>105622351Too bad the gun is fucked, but good scene. Can you post your guider/scheduler values pls?
been out for 3 days wtf happened, why is there video everywhere, did they finally figure out something for vramlets?
so best quality/speed is just nag + magcache no distill?
>>105622485can you though? I'll use one more step
>>105621447Thread is healing
>>105622491>did they finally figure out something for vramlets?no its just quicker now :(
>>105622420what is flowmatchsigmas?
alright i ran that girl aiming the gun img through flowmatch with 4 and 8 steps. where/how can i combine them (where both play next to each other) for comparisons sake.
>>105622488yeah gimme a min, i'll clean up my wf (it's full of a billion wildcards) and post it. also, the way the camera focusses on the barrel there is dope as fuck
>>105622348illustrious is staring to get some alright anime real models but still pony is better in that department, as for hands just make sure your using adetailer, nagetive embeddings and negative prompts. illustrious is not really an upgrade in the hand issue.
>>105622494No, teacache and slg is still king
>the girl's hair and cape flow in the wind. flowers bend in the wind and white petals detach and fly across the field. trees bend in the wind, branches sway and leaves fly into the sky.
>720p 8 steps
>>105622522oh shit, that one's nice. the parallax effect there.. unf.
>>105622471thanks
loras are not compatible between sdxl/pony and illustrions and/or noob, are they?
one of the reasons why i havent upgraded yet is because i have a huge pony lora library
>>105622509https://github.com/BigStationW/flowmatch_scheduler-comfyui
>>105622530Not really, no. Same base model, sdxl, but radically different finetunes.
any videofriends tried video gen on AMD?
I've avoided video due to mid quality and long gen times so far, but now I'm tempted to try. 7900 XTX... the VRAM should be enough right
Can I not allocate more than 24 gigs of sytem ram? I still have some unused RAM that I could thro at it (if it even does anything)
>>105622509>https://rentry.org/wan21kjguide#lightx2v-nag-huge-speed-increaseBottom of this section
Finally got vid generation working!
>>105622519>hands just make sure your using adetailer,im using adetailer for faces but i can never quite get it working for hands, they still come out mangled.
any trick to it? im using hand_yolov8n
anything better maybe?
whats your fav denoise strength for hands?
or maybe i should switch to a particular model for it?
ok well i can't remember or find how to concat two videos so seperate posts it is.
here's the i2v with the image posted above. prompt was: "a cute girl wearing overalls and holding a gun turns towards the viewer and then aims the gun at the viewer. we can see down the bore of the pistol."
with flowmatch and 4 steps, using NAG.
Gen time: 191 seconds.
notice how she shoots the gun even though i never prompted it lmao
can wan do breast milking in 2d?
>>1056224951:07
https://files.catbox.moe/ph5tjh.mp4
gonna see what injecting some latent noise does now, hopefully get some more movement
>>105622584same settings, same seed same everything but with 4 steps instead.
honestly 8 steps seems like a massive waste of time but needs a larger sample size.
>>105622553Sure, just hit start and go to bed. By the morning you'd probably see it's about halfway done.
>>105622572honestly easier than going through noodle shit and the shitty manager.
>The June NVIDIA Studio Driver provides optimal support for the latest new creative applications and updates including the arrival of the Stable Diffusion 3.5 update which adds TensorRT and FP8 support, improving performance by 70% and reducing VRAM consumption by 40%.
stablechads...ww@
>>105622602forgot to add, gen time was 108 seconds
https://github.com/aorav/ComfyUI-sdxl_NAG
https://imgsli.com/MzkwMDA1
>>105622616I tried to get it working on Comfy for waaay too long. Gave up and went Pinokio. Worked right outta the box!
>Control the motion of anything without extra prompting! Free tool to create controls
https://whatdreamscost.github.io/Spline-Path-Control/
>It's essentially a mix between kijai's spline node and the create shape on path node, but easier to use with extra functionality like the ability to change the speed of each spline and more.
>It's pretty straightforward - you add splines, anchors, change speeds, and export as a webm to connect to your control.
>If anyone didn't know you can easily use this to control the movement of anything (camera movement, objects, humans etc) without any extra prompting. No need to try and find the perfect prompt or seed when you can just control it with a few splines.
>So is there anyway we can integrate this with comfyui for local generation?
>Of course! That's what it's for. If you connect the exported video to a VACE control it will control the motion.
>I used the same background image as the reference for VACE (using a VACE i2v workflow) to generate these videos, but you can you also use this without a reference image and let it control whatever your prompt!
>Just make sure the dimensions are the same as the control video though, if your using a reference image.
https://www.reddit.com/r/StableDiffusion/comments/1lddjkx/control_the_motion_of_anything_without_extra/
>>105622636"all" it does is generate a video for VACE to use as the control.
still very nice though.
>>105622620why are these retards still doing updates for SD3.5. Both AMD and Nvidia are doing this. Do they have ANYONE in the office who actually uses image gen?
>>105622584>>105622602there is more nuance to the movement in the 8 step version, but can you use flowmatch with 8 steps?
>the girl turns around. a man on a red motorcycle appears at the top of the stairs and drives down the stairs bouncing up and down
>>105622651i mean, i used flowmatch with 8 steps but i suppose you're asking if it works from a technical standpoint? no idea.
>>105622626with recommended settings at cfg 7
the negative is just "bad anatomy, low quality, text"
https://imgsli.com/MzkwMDA2
>>105622622>>105622666add it to the snake-oil list
I suggest using inject latent noise at like 0.5 between wanimagetovideo and the sampler. It adds a ton extra movement to the scene. Still testing values.
>>105622039It doesn't destroy it but if you can't tell there's a loss in quality you need your eyes checked. It's worth it though, it just comes with a cost.
are any particular finetunes of whatever base model markedly better or worse for things like hands and feet?
or is it pretty much the same no matter the finetune?
>>105621456How do I install? Do I just reroute around teacache to the node that's in comfy or do I need to follow the install instructions on GitHub?
>>105622577anon there more adetailer models besides those 10 you get with the extension
https://huggingface.co/Anzhc/Anzhcs_YOLOs/tree/main
https://huggingface.co/Bingsu/adetailer/tree/main
the denoiser is 0.4 which is default for adetailer, make sure resolution 1024x1024 of sdxl model.
https://x.com/viccpoes/status/1934983545233277428
>>105622722if you have teacache and magcache installed you have to uninstall teacache as they have conflicting files and it will give you the wrong node. i have no idea what you mean with installation. just do what it says on their git if you haven't installed it yet.
https://github.com/Zehong-Ma/ComfyUI-MagCache
>>105622561I'm trying this, but now my GPU isn't utilised fully. Keeps fluctuating between and 100 and the entire thing is 10 degrees cooler so
>>105622740>we built Krea 1 to maximize artistic exploration and avoid the "AI look" that most image models have.Krea 1 generates detailed skin textures, dramatic camera angles, cinematic lighting, and hundreds of expressive styles.
it also supports style references and custom trainings, learn more in this thread
>>105622750it looks like they are gonna open source it, that is the point dumbass
Holy fuck the NAG cut my gen time in half.
>>105622764and where are you getting that info from? they are a paid service site and i couldn't find any info on it.
>>105622772one of the head people is teasing should we open-source? on their twitter
>>105622778considering they collaborated with Black Forest -"maybe we release, maybe we don't"- Labs, i'll hold my breath. it's just ad bait.
same with reve. was a fun model but it died very fast because it's a paid service.
>>105622789>was a fun model but it died very fast because it's a paid service.I think companies are seeing that, they need the community
>>105622666>>105622622I tried this and it feels like it's doing nothing?
>>105622770AAARRRGHH FUCK FUCK I'M BLEEDING OH SHIT ARRGHHHH
>>105622770damn that's a pretty clean gen anon.
>>105622798Do you have some examples with noob vpred/epsilon and illu?
>>105622740>supports style referencesSo you wouldn't need to train a style lora anymore?
>>105622819you could already do style transfers with ipadapter and the union controlnet model with SDXL. will be interesting to see to what degree you can style transfer though.
>>105622622Is this in custom node manager or do I need to git clone that..?
>>105622740>A surreal aerial view of flat, geometric farmland over which five perfect cube-shaped clouds hover, casting soft square shadows onto the grid below. Artistic rendering method: digital 3D compositing, photorealistic rendering, visual genre: surreal landscape, style: hyperclean minimalism, vibe: contemplative, mathematical, dreamlike, degree of visual complexity: medium due to geometric repetition and visual precision, visually adjacent artists: René Magritte (surrealism), James Turrell (spatial form), Simon Stålenhag (digital surrealism), color palette: jade green, pale ochre, muted tan, sky blue whites, soft shadows in cool grays, exposure and contrast: evenly lit, daylight balance, low dynamic range with soft shadows, camera settings: aerial perspective, wide-angle lens (~24–35mm equivalent), high altitude shot, orthographic-like viewpoint, no visible artifacts. The entire ground plane consists of uniformly divided agricultural fields, rendered as a patchwork quilt of rectangles in shades of green, tan, and muted gold, aligned in perfect Cartesian rows, some fields with visible crop texture, others solid. Five cuboid clouds float slightly above the land—each a pristine white cotton-like cube with fuzzy, rounded corners—centered in a diagonal arrangement from bottom left to top right. The largest cube is near bottom left, medium-sized ones stagger toward the center and right, and the smallest cube is topmost left and bottom right, all casting equally square shadows directly below, giving a disorienting, impossible geometry that feels physically real yet cognitively implausible. The land stretches endlessly beyond the frame in every direction.Do I need an art degree to prompt with this thing?
how long before I can make seasonal anime from my computer?
>>105622843> em dash in promptman why. so it's very likely they also trained on ai slop
>prompt: she does a tik tok dance with fast and dramatic movements
>does a pose
>>105622854>1080p 20 minutes fileHow much vram do you have?
>>105622858the images they show are the least ai arty I have seen though
>>105622854when we get a full video editor with gen controls. comfy is advertised as it can do anything in a node but it's just too tedious to noodle and hop back and forth between apps. comfy is single execution too so it's extremely clunky adding filters and transitions
>>105622861Look at her go!
>>105622915looks like link decided to become a tranny
>>105622861where is the motion?
>wan2gp absolutely mogging cumfarts fags
What's the most popular model for anime rn
>>105623005popular doesn't mean good. wai gets shilled here like crazy but i prefer literally any illust 2.0 shitmix
>>105620683https://news.mit.edu/2019/answer-life-universe-and-everything-sum-three-cubes-mathematics-0910
what are the chances..
>>105621571time to install linux
>>105622963it's artoria my nigga
>>105623005for nsfw wai
for artfagging plant/milk/bubble/insert-trendy-artshitter-term-here mix
>>105623036that looks nothing like saber
like at all
i mean holy shit if you think that's her you have far more shit taste than i previously imagined.
>>105623049https://civitai.com/models/653582/artoria-pendragon-lancer-fategrand-order-sdxl-lora-pony-diffusion
>>105623051so this..is the power..of the 4step distilled lora..interesting
> take cumshot lora
> make everyday household items or unrelated objects spurt liquid
my sides have yet to be recovered
>he drops his cigarette. the cigarette falls onto his pants. he panics as the cigarette burns.
>>105623088>Ponyfound the poroblen
>>105623103I had the perfect cum gen until the last 2 seconds when cum started coming out her eyes.
>>105623138same. it also likes to leak out of their mouth and tits like some very odd eldritch abomination. usually have to hope on RNG or lower lora strength
>>105623094Can I not use the lora with more than 4 steps? 6 and 8 turned out way worse.
>>105623126Oddly appropriate for the setting.
>the man puts the cigarette in his mouth. he grabs the liquor bottle and pours into the glass cup
the future is auto-cloaking liquor bottles
>>105623245>bottle manifests out of thin air
>>105623254Jensen asked for this
what's the favoured interpolation model, finally willing to give it a go
hope this is the right thread this time
>>105623254Hey we can all dream of a utopia bro
The slow motion is giving me motion sickness.
>>105623282rude works well for realistic if the clip isn't doing crazy fast movements. not much really changes from years ago unless you got access to proprietary interp models. anime/cartoons just look like shit with it so don't bother
>>105623311It's horrible. Yes, it's nice you can generate videos in a few minutes but there's still a long way to go before it's actually usable over default WAN.
>>105623325okay that was top of my list to try, just wondered if I was missing anything
>>105622910probably because of the lora used, lora training fags need to understand wan is 16 fps and not 30+ fps but what ever.
If I can easily spot if a gen used the distilled lora, then that's not good.
>>105623136pony has it problems especially adherence to complex pose prompts but frankly its better than illustrious in photorealism, realism, 3d, semi-real and natural lighting. Creativitij and pornmaster are decent illustrious models but they don't have the look pony models have that i desire.
>>105623374>photorealism, realism, 3d, semi-real and natural lightingchroma exists and it btfo both those models
>>105623374>dherence to complex pose promptsi use openpose often and pony works far better with it than illustrious in my experience
The fact that Pony V7 isn't out yet is shocking, and it doesn't even seem to be getting particularly close. Going with shit ass Auraflow as a base model was such a shitty decision from the jump, and as it goes it looks worse and worse.
I'm in the beta for it now, and the gens are absolute dog shit if you want it to look anything like a known character/style. Illustrious runs circles around it with anime, Chroma runs circles around it with realism.
What do they even do at this point? Sell the company and hope someone else fixes it for them? Restart the whole process? Secretly, I believe that's exactly what happened, and they don't want anyone to know. We were told it was going to be releasing late last year/early this year. We're now halfway through and it's barely cooked
>>105623458leak it faggot
>i-it works only thru le 'cord bot..go back then, !
>>105623458post your own gens from it i need a laugh
>>105623471Works through their Fictional.ai site now if you got an invite
>>105623374and what model is that? cyberpony?
>>105623430I'm yet to an see an interesting gen from chroma that actually proves it's a step up from sdxl finetunes, all i see that gets posted is are a bunch of goofy miku memes, retro gaming graphics and camera filter photographs. Post something or share collage of decent gens.
just train a chroma or illustrious lora on greasy pony gens
>>105623544sdxl has too much plastic skin shit for doing realism is the point. semi-realistic is a cope meme to excuse AI sloppa effect
>semi-realistic is a cope meme to excuse AI sloppa effect
at least somebody said it out loud. fucking hate those kinds of gens. too uncanny
>>105623458It's a fast moving world and he genuinely thought he had exclusive rights to decent models.
If he's still hashing artist tags then it's dead in the water.
>>105623458That's what happens when retards don't want to listen and spam "BUT FLUX CANT BE EASILY TRAINED BRO".
The ego didn't want to admit that auratrash was shit and just switch to flux asap. Now it's vaporware and too late to do anything, and seemingly they don't have enough money or whatever to just drop the trash model and instantly move on to training something else.
IMO the best course of action would be to wait for Chroma to finish training soon and then use that.
>>105623590>If he's still hashing artist tags then it's dead in the water.the artist tags are just being removed altogether
>>105623594giving lodestone compute was the best play he made. at least he was useful for something
>>105622798I think you are supposed to use your normal negative prompt the normal way and then NAG is for targeting specific things. When I do that it is seems more effective than putting the term in the negative prompt.
>>105623458>Going with shit ass AuraflowYeah, the idea was to sell commercial use of it, which meant the Flux dev license was not going to work, but as Chroma shows, Flux Schnell is fine as a base model and it has a fully permissive license.
I've only seen a couple of images from Pony 7, the last was a hamburger, which looked ok but nothing amazing.
Eventually he will have to release it in whatever state it's in, so we'll see.
>>105623508creativitji, its a illustrious model. The pony i usually use are reanima, 3x3mixx, spicypizza, sensualmindxccentric and cyberrealisticpony.
>>105623566This, same with Flux dev, with 'realistic' loras etc you can get decent results, but 90% of the gens will still look 'plastic' in skin and off in eye reflections, Chroma is much better here.
For art, both SDXL (and finetunes) and Flux work great, but I still get more variation with Chroma, even if it lacks the styles I want (which can be solved with loras).
>civitai doesnt let me search for loli models
>tensorart is filled with paywalls
when will some anon save us
>>105621838try this..
>combine singularity's twerk v2 + real twerk lora both at 1 strength>loras causvid 14b v2 at 0.5 str + accvid at 1 str>use NAG leave neg prompt empty>disable teacache, torchcompile and skip layer guidance>uni_pc, 6 steps, 1 cfg, simple scheduler>480 x 600 resolution image for testing or to shave off more gen time>regular wan2.1 i2v Q6 480p ggufwith these settings, I typically get 8 out of 10. speed and quality even beats teacache. this took A LOT of testing.
>>105623805https://civitai.com/models/1506082/cunnystyle
>>105623773sdxl is good for what it is if you accept its shortcomings (background logic, always a pleasure) and you can get pretty far with the right settings. comparing that to chroma is like comparing an old rusted 1984 fiat to a 2025 ferrari prototype (with 2 naked furries sitting inside).
>>105623566realism is an actual photograph, everything else is whatever
file
md5: b3ed026c64dd4a8f2b46d50bf2a1804e
🔍
>>105623856FUCKKKKKKKKKKKKKKKK
>>105623878what kills sdxl for me is just the shitty old 4 channel vae
if someone made an SDXL that used the flux vae I'd probably switch to it due to sdxl's superior art style and artist knowledge. but I think changing vaes requires full retraining from scratch
>>105623887https://gofile.io/d/VC03kG
It's down there. I forgor gofile can't direct links sry.
>>105623914>>105623893uh.. thank you anon
nta but thank you
>>105623805tensor art is a den of glowies.
they have actual illegal paid shit on there. i really would avoid it.
>>105623894yeah pipedream. sdxl got the artist knowledge, flux has the resolution, well fuck. hello chroma.
>>105623954really? who even runs that shit
>>105623954glowies? maybe
illegal shit? meds
>not getting your anime girl to draw for you
ishygddt
>>105623566show what your precious flux finetune can do that is better cyberrealistic sdxl models can't do.
feet guy, you are summoned for another time loop.
>>105623544>>105623999Imagine being so blinded by slop that your forget what real images of humans look like. Chroma doesn't produce real images, but it's the only AI that can produce anything resembling that. That you refuse to see that must be bait.
>>105623998very fucking cute
>>105624028I don't think he forgets I think he simply enjoys that style. I've refrained from shitting on anons like that because I believe there's not much one can do to dissuade another from not enjoying a style. I could be wrong though.
>common *** samefag sighted
shameful display
reminder to eat your greens fellas
>>105623998where's her penis??
>>105624053coleslaw is a pretty good way to work vegetables into a typical american diet
>>105623983Tensor has loras of famous sexy children, from Instagram "account run by mom" types all the way to darknet legends
They're all shit, but still
>inb4 not illegal Yeah but that's what the anon was referring to.
>>105624077Coleslaw is junk food if you eat more than 2 spoons of it
>>105624048I mean it's a neat style, sure. There are interesting SDXL photoreal tunes. It definitely is capable of photorealism to an extent. But it is a very basic model with limited capabilities. Outside of creating neat aesthetic images (of which Chroma already does a much better job out of the box, with the right kws and with no tunes needed), or photos that are very close to the training set for the specific model in question, there is no much use for them.
SDXL is also an absolutely horrendous base for anime. I'm not saying it currently sucks, but it could've been so much better. It's a miracle that IL/Noob exist.
>>105624091they were on civit until the recent purge
>>105624113Thanks for the (You) but I'm not sure what that has to do with anything. Didn't you see my inb4? People call t2v gens of 11 years olds in bikinis CSAM in this thread what did you expect
>>105624127i call it gross and creepy
>>105624152thats exactly what it is. like those uber creepy gens from that fuck (who is now in the sdg thread I believe) with his sameface 10 year olds with that fucked up retard smile. and always so friendly.
>waiter, what's this woman doing in my soup
>"the backstroke, sir"
also, let me get 2.5 soft boiled eggs please
>>105624091a proper salad is always better
d14-oj
md5: 93ca36c9c208c47810cdfce7ca3659bf
🔍
orange juice has a healthy reputation but it has about as much sugar as any soda or beer
>>105624279the gun pointing seems a little dragged out from the past two threads
>>105623894soon https://xcancel.com/ostrisai/status/1934947964209824113?t=qJeWgl3Jsf8ziEsqzntOJQ&s=19
>>105624242taiwanDollLikeness_v20
Did anyone do a magcache vs teacache comparaison? I don't see much talk about it beside an anon here, so I would guess it's another snake oil?
YKYYyVCl
md5: c7dceef03d57e3d4622885f8edee5dc0
🔍
>>105623894It's called SD3.5-Medium and it's art. When you call any SDXL good you never mean summer 2023 with the refiner. Similarly 3.5m could be made great, and unlike Flux or Flux-based it isn't cutting away 50% of the genners that are us 8gb vramlets. And it has 1.4MP base resolution. Now if only it didn't go fucking ignored by everyone. Never gonna forgive.
>>105624407i'd oil that snake
>>105623594Flow models are hard to train. Should've done a Pixart derivative, he could've done a from scratch model with 16-channels and be done.
>>105624407this is a very nice gen
>>105624419In the defense of the community, SD 3.5M is nowhere near as good as it could've been either. When looking at Flux dev, schnell, SD3. 3.5M, etc.. What we are looking at is strength of base models. What we know from SD3/3.5M is that it is not nearly as good as it should be. Does not work as advertised is an underestatement. I mean, it does not just lack anatomy knowledge. It has a skewed anatomy knowledge. To the extent trying to fix this issue would be a complete waste of time.
>>105624419>When you call any SDXL good you never mean summer 2023 with the refiner.NTA XL0.1 no refiner was kino and anon thought I was wrong for thinking it would become the meta heh
>>105624480The problem with SD 3.5M is it's unstable just like Flux. Any amount of training makes the model fail.
>>105624480they both didn't live up to the hype because they were made by the same people
i think i've hit the bottom of the barrel with this concept
>>105622622So this is just supercharged negative prompt?
>>105623594The distinction between Flux Dev and Flux Schnell is lost on most anons unfortunately
making tommy vercetti look gay is as simple as making the flowers on his shirt large and adding saturation
>>105624523You can go deeper
>>105624490Nah, Flux only had a distillation issue. The distilled Flux Schnell model is still leagues ahead of what Stability gave us. Note how good out of the box Flux is with hands, etc... SD3/3.5 is as if someone took a model that was working, and messed with the weights so that humans got mangled. SD 3.5's issues are just very inorganic. They are not the result of normal training. Notably, SDXL despite being considerably weaker does not exhibit any of the issues that SD3.5 exhibits.
>>105624523Make a french girl with a saucisson
>>105624480what ui's besides comfyui are compatible with sd2, stable diffusion cascade, sd 3, 3.5 and 3.5 large?
>>105624591That said, Flux is not perfect. It was definitely censored with styles. But it's better in every other way that matters, such as prompt following (this is the main one that makes up for the lack of styles which can be added in other ways), and text.
Behold
md5: 3a7c18fe7709b51ee4f8a050d1fd78b4
🔍
>>105624536Flux Dev 16p is almost required. All of the "Flux can draw hands" stuff is assuming you're using Flux Dev 16p, and 1024x1024. You get monster claws about as often as Illustrious/Pony if you go with Schn/8p
>>105621934pissbuttfag looks like THAT??
neuro
md5: ecdca5de14ab38da4b46bbbce877c427
🔍
>>105623374Problem with Pony is that has less concepts and variety than Illustrious. However, it's got scat. I'll give it that.
>>105622791>I think companies are seeing that, they need the communityI like the fact we have this much power, if krea open source their models and it's a good one, everyone will talk about it and it'll be on the map and relevant