Discussion of Free and Open Source Text-to-Image/Video Models
Prev:
>>105987645https://rentry.org/ldg-lazy-getting-started-guide
>UISwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows/home
>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe
>WanX (video)Guide: https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1
>ChromaTraining: https://rentry.org/mvu52t46
>Illustrious1girl and beyond: https://rentry.org/comfyui_guide_1girl
Tag explorer: https://tagexplorer.github.io/
>MiscLocal Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate
>Neighbourshttps://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg>>>/b/degen>>>/b/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
Blessed thread of frenship
>>105992261i like the frenzone! i liek frens! :3
>>105992273>>105992160shiiiet only 1\3 so is it a push or loss? :o
>>105992273That bikini top is working hard
>>105992296now do Gotou Hinata.
>>105992296>Why do I get these weird blue bar artifacts on my 'checking for breast cancer' gens ?
why did he change his name and reply to himself
man rocketgurlp*** really is a deep gooner
>>105992340Seek help, far away from here
Remember to submit your gens to /ldg/ GenJam.
https://forms.gle/ZQMNMTaxGxAZZTAD8
Theme is technology
>>105992335thanks for reposting.
more neta lumina testing... it struggles with a simple prompt that noob does a great job at. even after some prompt re-engineering. bad hands and anatomy.
https://files.catbox.moe/ihmwnt.png
Anyone want to criticize this prompt? Clearly, looking at the samples here https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd, the model is capable of better. but it seems like you have to prompt it a special autistic way to get it to work, and I'm failing.
>>105991800this benchmark is outdated and people should stop posting it. the 7900 XTX is around a 3090 in perf, not a 4060 Ti lmao. I'm getting 1it/s at standard resolutions in Lumina rn, with no optimizations (eg FlashAttention) enabled. this pic took 30 seconds for 30 steps.
>>105992393what no pussy ever does to a mf
Correct me if I'm wrong but now since radial attention implemented sage 2 would using lightx2v and this https://github.com/dvlab-research/Jenga bring gen times down to under 10 seconds? The jenga wan example shows "24s" for that generation.
>tfw gening videos will be faster than images
Tomoko jabber-gen from my wan t2v lora and the i2v fun camera control workflow adapted to t2v.
Workflow is here: https://files.catbox.moe/3k4vy0.png
>>105992428More like GlowJam lmaooo
>>105992428do I have to use the glowform?
>>105992456>Wan 1.3b - 24s>Wan 14b (not tested yet)>radial attention not fucking even implemented yetyeah just 2 more weeks... and then it'll have some bullshit where the quality comes out like shit no matter what
>>105992767>Censoredthen dont bother posting, post the uncensored with a catbox link.
fucking retard.
>>105992784You are being very rude.
>>105992784i dont think you are in a position to be making unreasonable demands ;3
>>105992795dont act like a retard next time.
>>105992828>dont act like a retard next time.I'm not the one sperging out over a censored wan video though.
>>105992840stop shitting up the thread nigger
>>105992848I think someone who gets overly aggro about a simple bar is doing a lot more to "shit up the thread" than me.
You need to calm down, my friend.
>>105990285>>105990654I managed to get pretty consistent VHS-like gens with:
>Analog VHS camcorder footage still with Chromatic aberration, overexposure and interlacing artifactsPicrel is:
>Analog VHS camcorder footage still with Chromatic aberration, overexposure and interlacing artifacts, a female dressed as Asuka Langley from evangelion sitting in a wooden chair at an outdoors garden. She is wearing a dark red plugsuit.
>>105992856you need to stop posting garbage my nigger.
>>105992795who let lil bro outside his hugbox?
>>105992874Hmm, I don't think it's garbage. Seeing an spacker like you losing his marbles is also pretty funny.
>>105992874Go away retard, only one messing up this thread is you
I thought local diffusion general was supposed to be the "smart" one between itself and its sister stable diffusion general.
105992940
touch grass retard lil nigga
>>105992613beahagahagahahah
That samefagging is super stealthy wow
Julien should take notes
>chroma v46
he finally fixed chroma. i can finally using a fresh version
>>105992784Alright, I take your advice. Instead of censor bars, I will instead leverage the humble crop.
Better?
>>105992767Wtf anime doesn't look like shit now. I mean it still looks a little 3D but it's not eye cancer anymore
>>105993244Anon, Ani_Wan is a base model. It has T2V and I2V versions.
>>105992767Looks good. Always hated the side mouth skin flap or the smallest line for a nose in animes. One could argue its to save animators time but its just a lazy practice that makes everything look like forgettable slop. Now with this technology, aint no excuse to cut corners
>>105993066Better still, just post a catbox link.
>>105993205anything more complicated than doing nothing turns into chaos since this fag keeps posting the same ones over and over again
How well does ani wan work with the various loras out there for wan?
>>10599354190% of a current day anime episode has less motion than that.
>>105992432Procyon is not outdated but it does run on Windows and ROCm isn't available there so they used ONNX to benchmark it and not any other hacks like DirectML or ZLUDA. I assume once AMD actually released ROCm officially, UL will update the benchmark to use it.
>>105993541he will hate it no matter what, daily reminder ;3
>>105993583I have found it works well for getting dynamic motion out of the characters, but it changes the eyes to generic anime eyes too often.
>>105993269nice use of perspective & floral<3
>>105992870im sorry but no, just NO.
Are guns also censored in WAN? It keeps giving them the toy gun red tip.
>>105993959they're devs are chinese, censorship runs through their blood. To get around it you have to find or train a lora.
am i the only one tryin 2 create the cupping hands shit for more realistic photos of women? even tho this ones kind of mid the workflow is the best i've gotten. i also tried openpose controlnets and a mix of the two processes and didn't see much progress. i'm intending on trying this with flux later on but this is using wainsfw as a base for the prompt adherence and then a ton of other things to convert it to a reasonable facsimile of a human being
>>105993274by i2v I mean vanilla wan. oh come on, i thought this was a nerd thread...
>>105994094Then say vanilla wan, anon. All you did was indicate you're a newbie who needs to be correctly informed.
can anyone gen a man in manacles, hanging inside of a torture chamber, with a dominatrix babe in black leather holding a whip standing next to him? i cant get the proper output
>>105994094https://www.youtube.com/watch?v=LqT9hPE2WCU
>>105993982>censorship runs through their bloodYet way less censored that western models, like those from BFL
if you want uncensored chinese models you just need to upload a synthetic dataset of 512x512 SD1.5 gens for them to train on!
>>105994226That is true, but it's stupid in this case because they're providing a local model for people to download and run on their own machines. Why did they tune their model to show red jam in place of genitals? Or have girls transform into creepy mandolls when context got too lewd?
>>105994215>i cant get the proper outputwith what? llm might help with prompt
>>105994258flux. its easy to do women in chains, but i cant do a bound man
>>105993643oh my mistake, I forgot sites like tom's shartware pander to windows goyslaves.
>>105994251>but it's stupid in this case because they're providing a local model for people to download and run on their own machinesSo do BFL, yet they censor theirs 10 times more, what's your point ?
Never saw the red jam, just the model not knowing about genitals because it hasn't been trained on them, Flux dev actively mutilates nipples, which is a bitch to train away.
>>105994309The benchmark, as in Procyon AI benchmark for Stable Diffusion and Text generation both uses ONNX on AMD. No mainstream tech publication is deep enough in the weeds to tell the difference.
>>105994248>>105994226>muh censorshipif you cant outsmart the gooks there is no hope for u
>>105994311>whataboutismStop trying to deflect attention away from your shithole's authoritarian censorship Zhang.
so what's new for imagegen, muppets? did anything important happen after sage attention 2?
>>105994311>Never saw the red jammust say, im quite... jelly
>>105994105not my fault if you can't use your brain, because of burgers. i will not insist with fake nerds. also, i'm not asked help, i just asked which model
>>105994374>ask the question "ani_wan or i2v???">someone politely and correctly points out those are two different things>accuse said person of not using their brainYou are simply a moron, anon.
>>105994351VAE EQ experiments
any loras for crossing legs in a seated position?
>>105992296>>105992767What loras are you using? Are you genning at 12 fps to get the more authentic animation movement? Also based Oreimo genner
>>105994476Those two webms were generated using the Ani_Wan checkpoint without loras. Ani_Wan can work well so long as it doesn't change the eyes, which is very artstyle-dependent. Works well with Oreimo.
>>105994491Interesting. Might be worth the download then. Any quirks you've found with Ani_Wan so far? Things to avoid?
>>105994348>China of all places releases models that are way less censored than those of the westNoooo, shut up
>>105994294chroma can do it, but it requires wrangling
>>105994500It's very motion-happy. Characters a more dynamic than other checkpoints, but sometimes it goes a little overboard. They might look slippery at times.
As usual, make sure you take measures to keep the camera under control.
>>105994515You're not selling anyone on your smelly shithole, Zhang. Being less censored is not an excuse for tuning in gore to replace genitals.
>>105994215sure
I got you champ
>>105994389I'll just hide the anim posts, and your message first.i hate this animation style anyway
>>105994534>but sometimes it goes a little overboardBy that, think something along the lines of webm related. A lot of people liked this one tho.
>>105994552Next time try not to make a complete retard out of yourself anon. Try to appreciate advice and friendly corrections give to you.
>>105994534I've searched for ani_wan but I can't find any writing on it, is it based off of VACE or just a t2v / i2v model?
any idea on any loras or prompt i can use to make this work, been trying to generate this asshole for a while but the right card and hand won't come out right
my prompt
"pov, you can see a woman with a facial scar on a table sitting, a hand appears infront of the camera and places an ace on the table, she looks at it the ace placed on the table by the man, she frowns and sighs,"
JUST SHUT UP ALREADY. STOP FUCKING YAPPING. GOD DAMN. JUST KEEP YOUR FUCKING MOUTH SHUT! SHUT THE FUCK UP!
>>105994515t. jellyfag
>>105994561https://civitai.com/models/1626197?modelVersionId=1852433
Try both I2V_new and I2V_old. Some ppl prefer the old version.
>>105994571Thanks, I will pass until there's something with controlnet functionality like VACE though
Check it out. Anon here asks:
>>105993244>aniwan or i2v?He was then politely corrected by a second poster:
>>105993274>Anon, Ani_Wan is a base model. It has T2V and I2V versions.The first poster then proceeded to have a salty angry fit:
>>105994094>by i2v I mean vanilla wan. oh come on, i thought this was a nerd thread...>>105994374>not my fault if you can't use your brainSome people really can't handle being corrected.
>>105994541Your shilling for censored western models is overflowing, you start complaining about chinese censorship without even mentioning the MUCH WORSE western censorship, and then start attacking the moment it's brought up.
Go back to /sdg/ and shill your SAAS AI
>>105994596It's pretty obvious i2v was referring to wan i2v. The akshually response was unnecessary
hm... computer... yes computer gen... hm...
>>105994553no they didn't
>>105994605>Your shilling for censored western modelsQuite the imagination you have there Zhang. I threw some shade on Wan for censoring genitals with gore and you've responded by screeching about muh western models, and now you're accusing me of being a shill. Hilarious. You stupid fuck chinese just can't think outside your nationalistic programming can you?
>>105994615There's no need to pretend to be someone else to defend your poor behavior, anon. Of course it wasn't clear. Ani_wan itself has i2v and t2v models.
Just swallow your pride, take the L and move on.
>>105994622I know you're salty after being exposed for being a moron, but the replies on the /a/ thread a couple of days ago spoke for themself.
>>105994633bro your posts are actual fuckin aids on god
When did it stop being about the Coom?
>>105994635Don't dig yourself into a hole next time, my unintelligent friend.
>>105994625>Anyone who disagrees is a chinese state agentThe desperation that sets in when you have no arguments and there's no downvote button
>>105994638baker is an autistic sperg
he fights with everyone
>>105994615its just the ldg schizo ignore report move on
the thread gets nuked almost everyday with banwaves and mass deletions/insta-archive etc
>>105994629You failed to use basic context clues to infer what anon was talking about. Take the L
>>105994642Does Ani_Wan still prefer being genned at 16fps?
>>105994649You are getting waaaay to defensive over Wan's censorship and your homeland, Liu Bei
>>105994679just read the fucking page. jesus christ
>>105994679It's what I use. Rendering at 12fps just makes a choppier version of the 16fps version. No point in chasing that native 12fps anime-like motion until a checkpoint or lora is trained for it. The 3DCG look is fine for now.
>>105994666source is his own schizo singular baker theory
>>105994690anime varies in frame-rate which is why it's fucked up movement. it goes up to 24 fps (23.8 to be more precise) compared to american cartoon standard 16fps
>>105994652>the thread gets nuked almost everyday with banwaves and mass deletions/insta-archive etcNo it doesn't. Way to literally just confirm that you have schizophrenia.
>>105994712>good morning incident never happenedyou stick out like a sore thumb and are actively sabotaging your own own bakes/threads
just chill out man
You are literally obsessed you need to seek medical help
>>105994658he is obsessed "with winning"
He is neurotic, co-dependent, compulsive and, obsessive compulsive
>>105994746I'm not the OP, my mentally unstable friend.
Consider the following:
https://desuarchive.org/g/search/subject/ldg/
The only threads that have been deleted are a couple of duplicates. A far cry from "the thread gets nuked almost everyday".
This is what happens when you don't take your medication, sperg-kun.
is nvidia planning to compete with the 4090D 48GBs or just wait for stock to run dry? $3200 vs $4300 for the RTX Pro 5000 48GB is damn compelling.
5090s are back down to 25% over MSRP and will likely only dip one more time before the holidays. if a 24GB 3090 equivalent refresh for the 50-series is released, it'd make the Pro 4000 24GB redundant unless they gimped it somehow. it couldn't go below $1000 without the memory bandwidth being crippled or being made x4 or something, and it wouldn't make sense (to me) to make it a "gamer" card and target a higher cost, either, unless its generative performance was stellar, which would kind of be abandoning the current 5090.
5090ti 48GB on the horizon?
>>105994772rad
this 2 img workflow works pretty good, todd for a demo:
https://gofile.io/d/faahF1
using the schnell lora at 8 steps for quicker gens as well
>>105994877>radJudge Holden
>>105994897and the output as it is:
>>105994990>stable cascadeerm... young indeed
>>105995122lmao. why is it the most coherent gen in the comparison though
>>105994897the man is holding the pink hair anime character, standing on a beach. The man is wearing a straw hat. keep their expressions the same.
>>105994793I do like the look of these, but that posture will cause back problems, just sayin
>>105995442the man is holding a framed photo of the teal hair anime girl, while standing on a beach. The man is wearing a straw hat. keep their expressions the same.
with a diff 2nd image.
>>105995442the man is at a horse racing track in Japan. He is holding a framed photo of the pink hair anime girl with his right hand. keep their expressions the same. Behind him, there is a horse race going on on an oval shape track made of grass.
one more todd test:
the man is sitting at a computer in an office, on the screen is an image of the pink hair anime girl. keep their expressions the same. the view is from behind the chair the man is sitting on, at his computer.
>>105995536this time one image with the second image nodes bypassed
The white hair anime girl is holding a beer bottle that says "LDG gen juice" on the label.
Change the text from "Steam" to "LDG". Change the pink hair anime girl to Miku Hatsune.
kek it actually replaced steam-chan pretty well
how do I use flux kontext + depth/canny control net? I want to change the view angle of a scene by giving it a new depth or canny map while maintaining the same scene appearance. is it even possible?
>>105995647just prompt directions or camera angles, kontext can do the side profile of a front image for example
>>105995644Replace the sand and water on the right with a cliff, with trees below it.
>>105992428wait you want anons to submit instead of just posting here?
sus af ngl
>>105995671no, I want to control the exact geometry from depth or canny
>>105995647for example
show a side view of the anime girl on the scooter. keep her expression the same.
face can be touched up with adetailer but it usually is fine.
I failed to gen videos yesterday, short attention span gave up.
Heres my issue, what kind of errors am I looking for to know if its a VRAM issue versus something else? I am on the low end, I don't really care about gen time tho desu I usually leave this shit on when I'm at work or asleep.
>>105995687*also this prompt was specific to the girl, you could do the entire scene too, also wan has 360 loras you could try as well.
>>105995695has anyone got a very basic but optimized t2v workflow i can use and try, so i can go from there?
all the guides and shit lead me to all kinds of bs nodes and downloads and pages and its overwhelming
The man in the middle of the image wearing a black trenchcoat and black sunglasses is sitting at a PC and typing. Change the text from "Deus Ex" to "LDG".
are iterations of chroma just more training data? a new version every 4 days seems like very fast release cadence, can people actually tell the difference?
>>1059958042 image, this time with urara:
replace the man in the middle of the image with the pink hair anime girl holding a plushie of an anime girl. Change the text from "Deus Ex" to "UmaMusume".
>>105994897this workflow works well, for 1 image just bypass the 2nd image node and the linked vae and kontextimagescale nodes.
>>105995832this time with miku:
replace the man in the middle of the image with the teal hair anime girl. Change the text from "Deus Ex" to "Miku Hatsune". Keep her expression and pose the same.
neat thing about kontext is it's doing inpainting essentially but is aware of other elements and doesn't overwrite them, note the eidos and ion storm logos are untouched.
Is this right? Should I be using this? I cant find this exact lora that the guide is using.
>>105995859thats fine but there is an i2v one that has much better motion and quality, even though the t2v one was good:
https://huggingface.co/lightx2v/Wan2.1-I2V-14B-480P-StepDistill-CfgDistill-Lightx2v/tree/main/loras
use this one
>>105995820>are iterations of chroma just more training data? a new version every 4 days seems like very fast release cadence, can people actually tell the difference?It just goes through the same training data. Not much difference between epochs. I think itโs starting to pick up on some artists now, but that might just be sleep deprived delusion.
>>105992428inpaint chads rise UP
>>105995870i give up
i cant do it
it doesnt work
chatgpt cant help me either
is animatediff less vram intenstive? idk if thats the issue
>>105996047anon use the rentry workflow in the OP, it's good. use a GGUF node with the Q8 wan models (480 or 720p)
https://rentry.org/wan21kjguide#lightx2v-nag-huge-speed-increase
use that lora in this workflow, ez pz
>>105996066iv tried
it doesnt work
>>105995849why are you still shilling kontext after everyone dropped it?
>>105996071what doesnt work? specifically what node is red or what is the issue
>>105996076I dont care what "everyone" does, it's a very fun model like wan or noobai.
>>105996071https://github.com/deepbeepmeep/Wan2GP
don't use comfy
>>105996084im certainly not attached to comfy, was it a mistake listening to ppl in here that if i wanted to try i2v i should use comfy?
>>105995452beagagahahagah
https://github.com/deepbeepmeep/Wan2GP
this can do wan + multitalk too, apparently?
>>1059964082 img workflow from before, "the two characters are shaking hands. keep their expressions the same."
>kontext+redux
>direct the source to kontext i2i with low denoise
>use a detailed reference img for redux
>prompt to maintain the same refined detail
TIL you can enhance img detail like this
redux is truly a hidden gem
>>105996475Somehow I've never even heard of Redux
got this so far. now i need to inpaint the dominatrix somehow
Trying to install ComfyUI Zluda for my 9070. Spend all night till 4AM yesterday. Deleted everything and now doing it again.
First install could not detect the GPU. I'm trying the newest version and if it does not work I GIVE UP.
>>105996502I use it primarily for chaining style transfers
>>105996525Return it and get a 5070 Ti 16GB
>>105996537I can't I'm a gamer with a responsible budget. Aka I'm poor.
>>1059965555060 Ti 16GB then, will be adequate for anything image-wise and passable for video
i should start inpainting
i will start inpainting
>>105996575try lanpaint
it's decent
>>105996575look up "Inpaint Crop (Improved)" node. It's very fast and has context knob
Anyone know why in forge/reforge when I do an upscale of an image it uses my CPU instead of the GPU and how to fix it?
This is what it shows:
>tiled upscale (CPU Composite)
e2 f5 tts is pretty good for voice cloning. just messing with it on the pinokio app (same as github, just has a frontend/gui on this)
https://voca.ro/1lXcQgrbnssy
>>105996660baneposting the navy seals pasta: even better
https://voca.ro/1OxE592hta8P
>>105996660Does it work for jp too?
I tried a few models but never got any good results with them
>>105996681It's good quality, possibly the best overall local image quality we've had directly from the base model.
But it is too slow and too hardware demanding, meaning it was DOA, practically nobody is going to use it, practically nobody is going to make loras / finetunes for it.
lmao, ace-step is actually great, not suno tier but it's free
enjoy a song I call "ACK!", with the country preset (there are multiple). it's actually pretty catchy. and I just made 8 lines of lyrics.
what a time to be alive, anons.
https://voca.ro/1nGhT5WFbxJ7
>>105996711it should be fine, can clone any source audio
>>105996718sdxl people said the same when flux came out
>>105996720using the pop music preset, same lyrics:
https://voca.ro/134ZdfsX9mB6
>>105996739metal preset. i'm fucking dying, how does this even work, music generation but it's coherent and actually works with whatever you type.
https://voca.ro/1bPGnKTvsniD
How do I decensor images?
I tried masking pixelated area, but that always produces artifacts, or, if I add too much noise, ai will try to draw a whole new picture in that small area.
I need ai to consider whole picture, while only editing a small part of it.
>>105996754jazz preset. classy as fuck:
https://voca.ro/1gEe5cuZkMu7
>>105996732The speed difference between SDXL and Flux is less than Flux and HiDream.
But it's up to you, you can run it if you want, obviously given how little interest there's been from the community, most people are saying no thanks, it's just too slow and demanding. HiDream has been out for at least three months by now.
>>105996778and this time I added [chorus] to the second 4 lines. still jazz. added more saxophone to the prompt:
classy!
https://voca.ro/11zcLZIjfieW
>>105996720>>105996739It's not too bad but you can really tell the instruments are low quality and noisy.
one more this time EDM preset:
also, this is random, you can use a song for audio 2 audio too. lmao
https://voca.ro/18ltViG8YP52
https://github.com/ClownsharkBatwing/RES4LYF
has anyone tried this v2v editing?
Whatโs a free nsfw gen that can do simple stuff from my phone when Iโm taking a shit and bored
Gentlemen, the AMD pipeline is on. Feels like it's gonna break on its own.
used lm studio and llama 3.1 uncensored to generate lyrics for an ubisoft song with the proper formatting in the style of a 4chan post, and used ace-step to make a song with the "country" preset. kino. actually music you can drink 20 beers to:
https://voca.ro/1jJeIOlK2HT8
file
md5: a0c9c1c0e5c6c17d901a0a044c137bc0
๐
why is midjourney so soulful
how is it that their model can produce so many great styles without any loras
Scam Altman runs into a critic (e2-f5-tts model/gradio UI):
https://voca.ro/18xJM5BBJ7Z4
so far, the only text to speech model that clones as good as elevenlabs (which is a paid service).
>>105997412and now, with a made up convo with a simple LLM:
https://voca.ro/1klCcmOLsLv0
kek
>>105994877If you're thinking of a 4090D 48GB, totally go for it. I have one. It solves so many problems - no more dicking around with quants and offloading and other bullshit, you can finally just use the native image and video models. It's just one gen back so you can still benefit from fp8 and sageattention.
With teacache and sageattention2 this video was done in 320 seconds. The card is power-limited to 350W too.
Go for it.
>>1059974442/2:
https://voca.ro/112Aaw9sXUtg
>>105997455where can I get one? what if it bricks?
>>105997541https://www.c2-computer.com/products/new-parallel-nvidia-rtx-4090d-48gb-gddr6-256-bit-gpu-blower-edition
The page is wrong, it's the same bus width as a regular 4090. There's a warranty. If it dies you ship it back to them. I've been genning non-stop on mine, it's solid.
One word of warning, I did have to pay $100 in customs fees, first time I ever had to pay. Not a big deal, but still, I wasn't expecting it.
what is chroma-unlocked-v46-flash.safetensors now?
>>105996910I think you can run sd 1.5-tier stuff locally on a high-end iphone with drawthings. Gonna kill your battery, though.
fun fact, the kontext clothes remover loras (anime and realistic) are good for clothes too. sometimes it wont let you clothes swap if the model thinks it is nsfw.
>>105995709it's in the op
>>105997643lora off, same prompt "give the girl a white bikini".
so even for sfw gens, a nsfw lora still has use!
file
md5: 8426bbecb7147e80814d8e572e4c5530
๐
haha yeah man, i'm 100% gonna credit you for your
*checks notes*
lora
lmao
man the FAST lora made my video gens from 80 minutes to 20, it's a whole other world, now I can fucking slop FASTER
>>105997232how'd you get this picture of my daughter?
>>105997589no problems with drivers or anything? how's the quality difference between fp8 and bf16 for wan?
>>105997758>prompt>keep getting patreon logo / artist signature in the gen>add stuff to negs:>artist name, logo, signature, text, watermark>the signature remains, but the image quality gets worseIs this because any image with an artist signature on it is usually high quality, so prompting to remove signatures has the side-effect of lowering the gen's quality?
>>105997604>chroma-unlocked-v46-flashOh sweet, thanks for noticing! Downloading now. Set CFG to 1 because it's obviously a distilled Chroma
>>105997813remove it with kontext, kontext can even remove watermarks on stock image sites.
>>105997813I didn't mean to reply to that anon's post by the way, that was an accident
Change the text "Quake" to "LDG". Change the red logo in the middle to a cartoon version of a computer monitor.
what is this kontext nigger schizoing about?
>>105997818if you do this cut out the top left corner in gimp after you fix it and paste it on the original image because kontext fries images
>>105997835he has turbo autism, don't bother
>>105997867https://huggingface.co/lodestones/Chroma/resolve/main/chroma-unlocked-v46-flash.safetensors
>>105995789This is the original animation tho, no?
does chroma take flux loras?
Bit fried but I liked it. If you can guess the game you get absolutely nothing.
>>105997975looks like megaman but DORK SOLS
>>105997813exatly, also I avoid using negative tags at all because of the same thing, is faster . Instead of put as negative "bad hands", I prefer to put "good hands" as positive, this is my logic in image gen promtps and the quality is better than ever. Another example, instead of "blurry" or "lowres" I putemphasis on "sharp" and "high res"
Negative prompts are slop and bloat
>>105997974some do, some don't
>>105997974There's this https://github.com/EnragedAntelope/Flux-ChromaLoraConversion
Let's say I want to build a PC only for meme gen, I would not need a beefy CPU right? Just lots of RAM (DDR4 or DDR5?) and a recent Nvidia GPU?
>>105998145you need as much VRAM as possible.
so a 5090, if you cant afford a 5090 you probably shouldnt bother with AI.
>>105994551this is mistress dena with the seeker of truth
>>105997994Turrican, actually. But I can see the megaman thing.
>>105997813i usually describe cfg as how tightly you're gripping the wheel to force your car to follow a track, but that's obviously as a function of the track (prompts).
when you have too many prompts and directions, a high cfg will try to satisfy all parts of the prompts more exactly, which over and under develops the output (high-cfg burn). lowering the cfg gives it leeway to follow the prompt less rigidly, so you end up with more best-effort intent than explicit direction following.
tl;dr is yes, negative prompts subtract more than just the thing, and enough of them with high cfg will degrade your gens. you can try lowering cfg to help, but few negs is best.
>>105998145not super beefy (unless you're trying to CPU gen), but fast. and as other anon said, VRAM is what matters most, but DDR5 has more bandwidth than DDR4, so if paired with fast NVMe storage, things that load into system RAM will happen faster.
>>105996525do you need zluda
isnt 9070 has rocm native support on windows
>>105998264I managed to get it run, but the tutorial I followed said I needed it.
Apparently it's less hassle on linux, but I'm not sure.
>>105998307>less hassle on linux
>>105997974There's this https://github.com/EnragedAntelope/SDXL-Wantxt2imgLoraConversion
Is there a way to make seed compatibility betwenn Comfy and WebUI?
>>105998371>https://github.com/ltdrdata/ComfyUI-Inspire-Pack
>>105997870>>105997816so why are there 3 versions and which one is best?
>>105998416fragmentation
whatever one you like is the "best"
>>105998003>Negative prompts are slop and bloatdemonstrably false
>>105998464but only bc of the subject matter at hand
file
md5: 078cd1076c0294dcf4873b43b0a495fd
๐
>make breaking changes to your nodes
>don't bother updating your own example workflows
>nodes look correct when opening workflows just spit out useless errors
>literally just have to drop a new instance of the node to make it work
fuck you for wasting 10 minutes of my life
Anyoen made comparisons between aniwan and anisora?
>>105998475>he cant into comfy
>>105998365kek
>>105998416>why are there 3 versionsNo additional label is trained on 512px for now (1024 soon), detail is 512 merged with a 1024 trained on an older 512, fast is distilled something and you can 1CFG
>which one is best?fast at 1CFG for smol prompts and LoRA usage. I find long prompts gives it flux-like distortions. For long prompts 512px and detail versions, and also both are fine for LoRA training
>>105998477both are just yapfest trash, mouths move constantly no matter what you prompt
newfag here, I have wan2.1 + lightx2v running, but I wanted to gen anime now... how the fuck do I do it?
>>105998496forgot to specify, im using the i2v 480p Q6 GGUF (16gb ramlet)
>>105998037that's cool but is it work converting until chroma is done training?
>>105998484i expect a minimal amount of competence from developers
but i suppose this is too much to expect from a community of pedophiles
>>105998496watch a 28+ minute tutorial on youtube filmed (horribly) by an indian man
>>105998508whoops I thought you were a legitimate poster with a legit concern it's just you again
>>105998507>that's cool but is it work converting until chroma is done training?Only if you want to see a result and don't need validation from other people
>>105998519the quality of code produced by pedophiles is a legitimate concern
>>105998529>im too retarded to understand comfyui therefore they are all pedofilesplease stop posting
>>105998511with a workflow on his patreon
file
md5: 1441356605faea3e2ae90f4c9cd07cba
๐
>>105998536which one of these nodes works?
maybe your pedosenses can detect it
>>105998529Is it because it's obfuscated to prevent the unforgiving eyes?
someone bake a real thread
which thread to we use???
>>105998713the one not made by a tripfag
>>105998606fake thread
>>105998713the one without the rocketslop
>>105998539>with a workflow on his patreonThis. Its not just indians but most of the youshitters do this. You can only access the workflow either behind a paywall or you have to join their shitty list. I stopped watching comfyui youtube vids for over a month now.