How Do We Feel About This Edition
Discussion of Free and Open Source Text-to-Image/Video Models
Prev:
>>105689724https://rentry.org/ldg-lazy-getting-started-guide
>UISwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Models, LoRAs, & Upscalershttps://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info
>Cookhttps://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe
>WanX (video)Guide: https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1
>ChromaTraining: https://rentry.org/mvu52t46
>Illustrious1girl and beyond: https://rentry.org/comfyui_guide_1girl
Tag explorer: https://tagexplorer.github.io/
>MiscLocal Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate
>Neighborshttps://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg>>>/b/degen>>>/b/celeb+ai>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
Blessed thread of frenship
>>105695074;c
>>105695065 (OP)>>>/vp/naptis your neighbor<3
>>105695053>I'm hoping we'll see more strong image models be released, Chroma has the most promise out of those in the pipeline, but as always we won't know what it can really do until it has a lot of lora and perhaps further finetunes, just like with SDXL and Flux.If a model can do video really well then images are trivial no?
xxx
md5: 436a70960f3269cbdfaa76aed534c358
๐
I think Chroma 512x512 is still better than the detailed one
>>105695311curios
have you ever found a way to make characters "shut up" when using klingai?
i tried it and they just always talk or sing and it's driving me crazy
>>105695440i use wan mostly
kling i only spam when my throwaways reup for their monthly credits kek
adjust your negative prompt fields;
a weird quirk with wanNSFW is the ladies always end up puking :o
SaaSfags are so much ahead of local right now... Fucking unfair
>>105695354Why exactly? (genuinely curious)
poal
md5: 2b59cc415d7025742c3d519966b4d78d
๐
Doomers lost
>>105695494Chroma is best for realism NSFW already.
>>105695450challenge failed
>>105695543huh? i just showed you one where she isn't 'yapping'
Is there a way in Comfy to basically say if SAM doesn't find any hands skip hand detailer etc.?
I wish there was a site like Pixiv but for AI art where it pulled prompt metadata automatically and you can search by any combination of tags to see the highest rated art people made with them
>>105695585Somehow the omnibus nodes like FaceDetailer manage to do that.
>>105695639Literally most Chroma gens have good hands. That's like a vramlet misconception or something.
>>105695628https://aibooru.online ?
>>105695628Any site like that would be flooded by low effort slop, since AI opened the gates for every troglodyte to express themselves.
>>105695781Clicked on a random pic and the aI metadata column was empty. It either doesn't pull it automatically or people strip it.
>>105695800Unless you have a site that uses only on-site generation there is no way most people won't strip metadata just for privacy.
>>105695781This is pretty good but the problem is the danbooru interface is more designed for reposting rather than self-uploading, it needs pixiv features like following accounts, having a recommended feed, sorting by popularity by default, etc
>>105695628CivitAI is the closest and there really isn't much else as much as that hurts to say :/
If you just want an AI booru just scrape tags of categories you like and just make your own site with randomly generated images.
>>105695857she is PRETTY <3
>>105695861>>105695628i hate civit
tensor isn't much better sadly
i only save what i love
most of it comes from 4chinz kek
>>105694845>Pytorch issue. Downgrade to 2.7.1 or 2.7.0.I'm already on 2.7.1, though.
We don't have any IPAdapter Face stuff for Chroma yet, do we?
>>105696159No, since there's no point in training any ipadapter or controlnet for it until its finished, given the model keeps diverging each epoch. Ie, a controlnet/ipadapter trained on epoch 39 will work progressively worse as the epochs go along
>chroma v50 rolls out
>it can't draw hands at all
>it can draw perfect paws and dog dicks
>lodestone declares it a great success
Can any anons send tips on how to write good Wan prompts or LLM prompts to write good Wan prompts?
https://huggingface.co/NSFW-API/NSFW_Wan_14b
>no model card
hmm
>>105696417Was just about to post this. The author's made a lot of other NSFW LoRA's for Wan, so it's probably legit.
I assume the full model is just the 1.3GB NSFW LoRA merged into the model, right?
What's the difference between T2V and I2V if you generate the image first with AI?
>>105696417It's probably just a merge of several porn loras
No one has the budget to do a full fine-tune of wan, especially if it's for pornographic purposes, unless a rich furfag takes interest like what happened to Chroma
>>105696417Someone please test and tell us if this can at least do dicks out of the box
>>105696417perhaps it's similar to https://huggingface.co/NSFW-API/NSFW_Wan_1.3b ? Anyone tried this?
>>105696454So it's an actual finetune, albeit a limited one.
>>105696159>>105696261there is a pulid for chroma
https://github.com/PaoloC68/ComfyUI-PuLID-Flux-Chroma
>>105696454>Mixed Dataset: Instead of separate phases, the new run was trained on a mixed dataset of 30k video clips and 20k still images simultaneously. will that be enough?
>>105696454You can see what it's been trained to do here
https://huggingface.co/NSFW-API/NSFW_Wan_1.3b/raw/main/prompting-guide.json
>>105696429You have way more control over the intial composition if you generate an image first. You can even edit it or inpaint it.
>>105696480so much gay and furry stuff
>>105696417>>105696454it's not finished though, he went for 20 epochs on the 1.3b, so we probably have to also wait for the 20 epochs to be finished on the 14b before trying it, and there's no quants yet so...
>>105696523>so much gay and furry stuffonly furries are rich in this current economy or what? kek
>>105696498Is there any reason to use the T2V model?
jesus christ what are the system requirements for Chroma? This is the only model I've been unable to run for my page file being too small.
>>105696577Get more system ram ya bum. There's no excuse to have below 64gb these days given how cheap it is
>>105696573if you're too lazy to make an image and then go for I2V, going for T2V is just more convenient
>>105696417Tested the LoRA. It's still underbaked and too early to tell, it has that issue where it gives women labias that look like ballsacks. I had the same thing happen when I trained a female anatomy LoRA, it took time to bake it out. Also, there's lots of dick body horror. I mean a LOT. So yeah, needs more time.
>WAI-NSFW-illustrious-SDXL https://civitai.com/models/827184?modelVersionId=1761560
>130k downloads
what the hell
https://www.nbcnews.com/tech/tech-news/federal-judge-rules-copyrighted-books-are-fair-use-ai-training-rcna214766
>Federal judge rules copyrighted books are fair use for AI training
yayyy
But, the judge ruled, AI companies shouldnโt be pirating the books theyโre training on.
kek
Is the state of local good or bad right now
>>105696668Video, decent.
Image, stagnant.
Audio, obsolete.
>>105696668>>105696673>Audio, obsolete.that's the worst part, I want udio at home ;-;
>>105696673that krea open sourcing is coming any day though... r-right?
>>105696675The only ones who can do it are the Chinese, since western tech companies are scared shitless of the small hats who run the record industry
loo min ah too i believe in yoo
>>105696676localsissies, they took advantage of our free advertisment power to shill their API model again, THEY CANT GET AWAY WITH THIS
>krea status : api
>sage2++ status : rugpulled
>chroma status : cooked
>everything else status : copium
it's over...
abscess tooth is driving me nuts. at least I'm almost through this shitty build system hell
>>105696703Go to see dentist dumbass
>>105696703>abscess toothdo Canadian dentists perform executions instead of regular dental work or am I missing something?
>>105696729>be canadian>sore tooth>can't afford a dentist>they recommend assisted suicide instead
>>105696716I did. next open slot for a root canal is july 31st
>>105696729sadly no
>>105696732find the nearest canadian suicide booth to unsubscribe
>>105696744Wtf, can't an abscess be dangerous?
>>105696750yes but I have antibiotics and a stinky toothpaste to hold it off. it's just so goddamn hard to concentrate with the lingering pain
>>105696757They would nuke your body with antibiotics BEFORE doing the treatment? Very cool!
>>105696773nta but possibly dying because he didn't take them is a retarded way to go
>>105696757get well soon ani. you'll find a way to code through the pain
>>105696697you forgot
>kontext dev status: in the cucking stage
>>105696673>Image, stagnant.I would say there's been a lot of progress recently on getting stable results for multiple subjects.
what if instagram was made in the 50's
>>105696687facts, and I hope the US will be aware that they'll lose the most important technological race of the 21th century if they keep acting like faggoted cucks, the us is really puritain, but I think their ego are too high to not make adjustments and compete with China again
>>105697007looks like a drag queen
>>105697030a woman looks like a man pretending to be a woman
well you're not wrong
>>105697030she looks like an old woman who hasn't eaten enough desu
was generating and suddenly saw this face, it legit scared me
>>105697078is this supposed to be an actual celebrity?
>>105697086no, it's nobody you'd know
realised I'm 1girling again
I'm just impressed because this char lora was made entirely with 15 year old webcam footage stills
is mmaudio comfyui broken for anyone else?
>>105697170Worked lik 2 week ago for me with some fix for another problem that someone solved in github issues page, but mmaudio is shit anyway, along with everything like it
>>105697170don't bother with this shit, it's ass
>>105697170>>105697188ooh is that the thing that makes sound effects
it's not great
You guys should play today's Wordle. I promise this isn't off-topic!
>>105697269not gonna, may as well spill
>>105697282I'll just post a link with the answer to not spoil directly https://www.today.com/life/todays-wordle-hints-answer-puzzle-1467-june-25-rcna214850
>>105697158What's the gen time for wan image?
>>105697269Wordle 1,466 4/6
pretty aight lol
>>105697395that took 2~ min with teacache
help with making better videos please. 8GB peasant here.
What LORAs are good for the facefuck stuff people as for on /r/? Is it possible to achieve results like those with 8GB?
I can GGUF Q5 pretty easily, or use i2V 480 for sizes like this which are not too big (320x480)
What's the best way to get those results?
Better workflow/prompting? Or are they only possible at higher resolutions?
>>105697577Doing literally anything with WAN needs a LoRA for the specific pose. Just search on civ and take the top result.
>>105696703Not medical advice, but some anon mentioned before that colloidal silver can soothe the pain quickly.
>>105697589is it fine to mix LORAs whiich were not trained on the same model
EG can i use models trained on 720 for 480, or mix 720 and 480 models and use them both on 480 at the same time
I have some charactersvim training but both of them have specific hair accessories and I want the model to learn both of them. But at generations if I want to generate the hair clips, it blends the two of them, I can't manage to generate the one I want.
How do you tag it or what do you do?
I'm having problema in my training where some characters have other characters stuff in the generation.
>>105697833since im tech illiterate I use paint.net and draw things manually, then i smooth it out with low denoise mask
>>105697833Use unique tokens for the clips would be my approach.
>>105697833I would crop those images to make sure it learns the accessory
Are there any decent realistic pony or illustrous models yet? Or do they still have all those 2.5Disms that make them immediately recognizeable?
>>105697078>from crackwhore to tradwife/ldg/ provides me with yet another fetish
What's generally the approach when images are smaller than model's native resolution, rescaling to native? In this case lora learns the blurriness. What's the tag to explain this blurriness?
>>105698166You don't use blurry images in lora datasets. Images smaller than native resolution isn't a big deal.
>>105697482It takes 40 seconds with lightx2v lora at 8 steps for Wan to gen a 720p image.
>>105698202I use onetrainer. There is an override resolution parameter in the image options tab, and a training resolution parameter in training options. Should I disable the first and set the second to 1024?
>>105698166Avoid training on images smaller than the training resolution, whatever snakeoil solution someone proposes isn't worth it.
If you NEED said image, upscale it, even using base Flux with a low (~0.1) denoise should be good enough.
>>105698259alright, thanks
>>105697999desu if possible look to flux for this
pony/il/noob are trained disproportionately on these kinds of images so you're kind of asking to undo that here
>>105697947What kind of tokens? Is it better to tag "hat", "special hat", "sp3c14l hat" or "sp3c14l"?
I've been training some models and the results are quite good, but one problem I'm having is that tags blend into each other: if I'm training many characters and one has one hat and the other has other hat, even if I tag the hats differently, sometimes while generating one hat, the other hats blends into it; I can't completely separate concepts.
Some are correctly generated and others are a mix of different clothing and I don't know what or why is this happening and how to have more control about it.
prompted a beatrix potter image but forgot I had left a Frank Frazetta lora enabled
look at those guns
>>105698307I haven't done this yet myself so I can't give you a working recipe. However, what I plan to do next with my character lora after the 1st version is to introduce unique tokens for design elements that seem to be tricky. With my character token being char, I would then go for char_hat, char_necklace, etc. possibly while also training the text encoder.
>>105696703>abscess tooth is driving me nutsBased
>>105696716
>>105698213make the cig bigger
>>105698644nice
>SwarmUi sucks
>Comfy sucks worse
>Forge dead
>reforge dead
what's left
>>105698696Forge isn't dead, just yesterday it got Chroma support.
>>105698696Bespoke Python scripts in Jupyter Notebooks.
Does anyone know how to use the new Train Lora node in ComfyUI?
I am not sure how to set the caption for each image in the dataset, do I just apply a global caption?
touchรฉ wanx, I never said the umbrella was open
get on the goddamn bridge already
any interesting samplers to try with chroma aside from euler?
why is bemused so aesthetic
am I a sadist?
>>105695861It's great that a site like CivitAI exists, unfortunately it's the worst constructed site I've seen in a decade at least.
Crazy slow and buggy, retarded navigation, it's like someone deliberately went out of their way to make something extremely bad.
>>105698901dpmpp_2m and sgm_uniform for better skin I've been told.
>>105698951making fictional digital women cry isn't a crime yet, enjoy it.
>>105698951>Men are generating bemused women, misogyny!
oof the bleed
>>105699012I never thought that people would get up in arms about how video game characters dress, that seemed ridiculous too at one point but here we are
>>105698213Nice gen
>>105698836>>105698951I tried wan T2I before and it was kinda meh and slow. These are pretty decent.
>>105699035I know it's not fantastic quality but wan knows so much obscure shit, and the anatomy is always spot on, and the vae is workable. You never see the detail it's capable of in 480p videos
>>105699031unimportant people who wanted to feel relevant in a world that doesn't care about them, and the companies that wanted to turn them into brand slaves.
a tale as old as marketing.
>>105699031Oh, I have no doubt that could be a headline in a real article, the 4th wave feminism at full force, whoring yourself on OF = female empowerment, sexy women in entertainment = female exploitation.
>>105699031wait does that work in reverse? if you have a lora of a dude and prompt for a girl does it girlify the dude?
>>105699093why do you ask
>>105699052>but wan knows so much obscure shitHave fun lol
>>105699070This looks cool, but what the hell is going on in that image?
584
md5: efcff6f23b4b35beeb8bca39024436d9
๐
>>105699112The solid colors streaks are an interaction between the classic noise and the shader noise. I have no clue what it actually does, but the AI seems to know how to integrate it into the pic.
>>105699148it's literally just img2img with shader noise
>>105699093it's more fun to swap races
Is there a list with all command like args for ForgeUI somewhere? The one on the forge github page doesn't seem to be complete.
Simple tag based prompt comparison between
chroma-unlocked-v27-Q8_0.gguf
chroma-unlocked-v39-Q8_0.gguf
chroma-unlocked-v39-detail-calibrated-Q8_0.gguf
Result
https://files.catbox.moe/qi0pr2.webm
>>105699268>plastic botox asian vs plastic botox asianwhat are we supposed to see here?
>>105698818just use a trainer ui
>>105698901>>105698990unipc seems to be great for better skin for 1/5 gens but much shittier for 4/5 lol
>>105699334I am just curious about the new node, and if it works well it might even replace some of the other UI's.
>>105698818Don't think it's working yet, they showed a img input but the actual build doesn't have any, and instructions about caption isn't clear either
>>105699437well the answer is nobody is using it because it's just going to over complicate training. you'd think they'd have docs
I've been using Easy Diffusion for a while now and tried ComfyUI recently. It's harder to manage Loras with their extension and honestly just think Easy is better. Is there any benefit whatsoever to using Comfy? Does it produce better images all else being equal?
jannies gave me 3 day ban from on all boards for posting convenient censorship involving 1girl steam censor gen. wtf.
>>105699467>>105699469I see, but well still, it would be a nice addition once they work more on it.
>>105699647it's not really handy to change the lora json format again which just makes using other trainers more of a fucking pain. I'm not going to like trawling through json manually for training settings in another app
>>105699052Yea, it's also good for genning decent weapons and making them to be properly held
>>105699614Nice gen. What model? Im just getting into local and am planning on setting up comfyui today for the first time.
What's the threads consensus on best model for local video gen and local image gen (NSFW for both)
>>105699871this one is less overcooked.
https://github.com/FreedomIntelligence/ShareGPT-4o-Image
Janus has been finetuned with 4o images lol
>>105699961Imagine the yellow tint
>>105699871>making them to be properly held
>>105699961doesn't nintendo own that mario font?
i still have some preference for framepack. waiting for a wan2gp update to add wan2.1 i2v fusionX model.
>>105699961>trained with pisswhy? they could've trained it with kontext pro, it's better than 4o at this point
>>105699922use a gradio webui like a1111, forge or reforge instead of node base system like comfyui. comfy is for advanced stuff.
https://civitai.com/models/1398870/creativitij
https://civitai.com/models/1046064/ilustreal
https://civitai.com/models/1710752/uncani-sfwnsfw?modelVersionId=1935932
https://files.catbox.moe/bs7x57.png
>>105696673why has image stagnated so much, anyway?
>>105700316getting better in one direction makes it worse in another, okay at most things is optimal
>>105700316the more a task is perfected the less gains there are at the cost of more work, you can and could already do basically everything you wanted with image gen models, the next frontier is still waiting for a proper natively multimodal and unified modalities ai architecture
https://huggingface.co/KBlueLeaf/TIPO-500M-ft
Are prompt optimizers a snakeoil?
Pixelwave is still awesome.
>>105700316I'd say Flux Kontext is a pretty significant development.
>>105700352if only there were diffusion benchmarks that models could be tuned to cheese, then we'd see improvements
>>105699614>posting convenient censorshipWhat is this supposed to mean ?
>Chroma knows what Converse All-Stars are and gets them right in every situation regardless of the prompt or settings
>Chroma has no fucking clue what an iMac G3 is and literally never gets it right or even close regardless of how it's prompted or what settings are used (it's the most recognizable computer of all time)
I'm going to lose it.
>>105700352isn't that how all generative systems work so far?
>>105700398>you can and could already do basically everything you wanted with image gen modelsi guess, but for someone just jumping in it's a pain in the ass to wade through 10,000 models, color smears, goo-faces and still get something worse than shitposting at bing/china/whatever to get an image past their filters. you'd think there'd be more of an effort for easier setup and plug and play stuff, or at least refinements of the guides that ARE like that that actually explain what's going on so people don't get lost. text gen stuff is super easy to set up now in comparison and that wacks me out
all i want is 1-click 1girls without porcelain skin and broken collarbones WITHOUT 12 separate loras by the same goddamn author
>>105700410>I'd say Flux Kontext is a pretty significant development.if you talk about kontext pro, it's not local, if you talk about dev, we still haven't in our hands so we don't know how shit it will be
>>105700476>gets them right in every situationChucks don't have the star on the outside of the heel. Cute chubster though.
>>105700480Fair enough, but based on the difference between flux.pro and flux.dev I presume It'll be worth using(assuming they actually release it).
>will still need a lora for styles
>will still need a lora for characters
>will still need another finetune for anime
>
>>105700479>text gen stuff is super easy to set up nowFor vramletniggers maybe, otherwise you are fucking with ik_lcpp vs lcpp with 30 cmd arguments to test the unsloth/ubergam quants and hope to god they didnt fuck up while quanting them for the 15th time in a row
Then you have to read the model cards for new models to see the recommended specific sampling settings and other templates, assuming the lcpp/ik_lcpp didnt fuck up the implementations of the new model architectures and ST didnt fuck up the templates for those new models, of course
But anyway, text models for vramlets are easier to set up since you are using trash models from months ago that dont need dynamic quants, same as if someone were to only download forge and use flux, yes its all gonna work out of the box
And also, image gen isnt that hard either beyond the most bleeding edge comfyui nodes for papers/code that came out on that same day where you have to manually install packages, otherwise its as simple as going through civit, finding an image or model you like, copying the workflow and using it, illustrious/noob for cartoon/anime style, chroma for realism and thats it
But yes, there are too many things changing with image models at every stage of their use unlike with LLM's that simply no one can keep up with and make an up to date guide that needs to be modified on sometimes a daily basis, when you pair that with the fact that open source models want to give you the control instead of locking you into the "hypercinematic and vibrant" style of the proprietary models when someone prompts a basic 1 sentence prompt
ugh that moment where you get home from work, running gens all day, only to find you fatfingered the denoise on the first ksampler to 0.3 so everything is just brown noise
Thanks, but it personally ruins it for me if the computer is incorrect.
>>105700488Thanks, but it personally ruins it for me if the computer is incorrect.
>>105700602>For vramletniggers maybeOnly vramlets use llama.cpp because they want to offload to RAM.
>>105700631that's prompt fatigue, man
cuz if you prompt hard it is going to happen
>>1057007511-2x3090 + 128-256gb+ ram is already enough for 4-7+t/s with deepseek for 1-1.5k$ as opposed to spending x10 that amount of money on a rack of gpus that you cant even upgrade easily when bigger models come out leading you to a huge performance loss when you start swapping, and nobody is going to spend money on 500gb vram
>>105700870In other words, you're a vramlet.
>>105700656if you increase the steps to 50+, details like keys and cables usually resolve.
also, the more attention it tries to apply to the computer, the worse it seems to get right now. i prompt for eras and features, not models, and then negate whatever problems arise.
>>105700917i also downloaded that girl. never seen the anime but i like her almond eyes
>>105698901quite a few samplers work with chroma. to get you started: euler, dpm adaptive, dpmpp_2m, ipndm, deis, ddim, uni_pc_bh2, gradient estimation (and probably many more). from my observations some samplers are better at producing a certain style.
if you don't have those yet, you can check out https://github.com/ClownsharkBatwing/RES4LYF for a ton of new samplers, supposedly cutting edge shit. you also may want to look at the descriptions for some of those samplers on that github (under sampler settings).
matteo did a comparison of samplers for flux, still somewhat valid for chroma.
https://www.youtube.com/watch?v=WtmKyqi_aFM
>>105698307Ok I really don't understand this, a piece of clothing that only appears in one image, the Lora is able to copy it perfectly, but a weapon that appears in many images is struggling with. Can anyone help me?
>>105701188the most generic image ever fucking created BRAVO
>>105701049Thanks. I'll give that a go. I guess it's not super precious to me that the computer is an iMac, but it's a shame I can't prompt for one if I want one, even when inpainting.
Anyone have an idea yet what the best low-step samplers are for Chroma? I've mostly been doing 40-step gens with a euler_A (using only about 0.3 eta noise), but I want to do some rapid genning down in the 20-ish range and I don't know what samplers get the best results down there.
>>105701202I say it all the time, the generations in this general are garbage
>>105701201more regularization would probably help.
for instance, if a model has only the faintest idea what a gun is, more work needs to be done to teach it what it both is and isn't.
imagine you're talking to someone who is arguing with you that an AR is an assault rifle, and you're trying to explain the difference to them. you'll probably have to tell them a lot about what an assault rifle isn't using examples of things they kind of know but don't really understand before they even acknowledge that there might be a distinction.
clothing styles, though, are platonic ideals. if the universe were destroyed and recreated, they're the only thing that would be the same. the machines know this and don't fuck around with it.
>>105701320euler/beta is decent. I get 1.8s/it at 1024x1024. 18 steps is my personal floor, but as low as 10 can be enough for a rough idea.
>>105701085Is the artist for this Shinkiro?
>>105698836>>105698213>>105697158>>105697078Nice faces. Good to see gens that arent
>anime eyes>straight eyebrows>pig snout>diamond alien head>caked plastic doll>happa margo robbie flux face
>>105701519Oh and this one too is good
>>105701085
>>105701399>EulerShit, I was hoping there was something that outperformed it at low steps, the way uni_pc did in the SD1.5 days
>>105696466The examples don't look so promising, but thanks
Do I need to install the python shit for visual studio ot use some custom nodes that need VS or is the base enough?
>>105701499yes, and the characters mixed are kuna mashiro and minami mitsuko
I was in the middle of genning smegma snorting and got bored, came up with this lovely piece.
>>105701399That's what I was asking before, but I have never done regularization. I don't know what to do. Are there any general regularization databases? I'm using illustrious 01
>>105701399Say I wanna copy an anime sword, it isn't really a normal sword
wan + lora is so fast on 480p.
error
md5: 135d5d275a608d8fb3f281e5665b9f97
๐
I <3 comfy
>>105702060looks nice
>>105701673Can you do Susano'o?
>Sakurai is pro AI
do you think he's using ComfyUi? kek
Any news on Flux Kontext Dev (the open weights version)? Have they said anything about it? I heard some people saying it was bad, where it was released?
>>105702149>wan + lora is so fast on 480p.that's why I'm using 720p now, it's a bit longer but still fast and the quality is insanely good
144
md5: e9d48001d0809aec2455fcdfc8671e83
๐
Anyone knows what could be problem? I have vs studio installed and vswhere.exe is where it needs to be, but it still fails.
>>105702332The throne I post gens from
>>105702332The only way to prevent pajeets from shitting the streets
>>105702275>SG editor notes kek what pussies. genai is the future, luddites.
>>105702596>genai is the future, luddites.this, if AI means a small team (or even a single dude) can make a PS2 scale game, I'm all for it
>>105702747that's already possible
I did it !
I generate 3 seconds video in just 2 hours. 25 minutes, and 14 seconds with my GTX 1080 !!
>>105702074>>105702081example: https://github.com/bmaltais/kohya_ss/discussions/2056
if you're trying to train a certain sword, you'll give it a class of "sword", include your differentiator (ex- "animuSword") in its tags, and then include regularization images that contain some but not all elements of your sword to teach it that not all one-handed, scabbarded, or epee-style swords are animuSwords.
illustrious trains the same as sdxl, so you should be able to use other sword datasets for that. just make sure to match up the tags.
There's a gigabyte 5090 for sale at my local best buy for $2998. Should I sell my 4090 and buy it bros....?
Is smell the next conquest? How would AI smell work? Surely you would need to buy some form of liquid, kek. Imagine
>>105702908Just need to figure out the base components of every smell. Surely everything you'd ever want to smell can be made from different amounts of 3 or 4 chemicals.
>>105702779absolutely not, you're tripping
>>105702908I would pay bis cheques to be able to make **** smell "my ass in the summer" digitally
>>105702867now that 5090's are in-stock everywhere I have been debating this myself but I can't bring myself to buy one. Maybe when some sick workflow shows up that needs 32GB of ram or if gigabyte actually makes some of those custom loop waterblock cards I'll pick one up, but until then i'm good with my 4090.
>>105702957I keep thinking I can sell my 4090 for more than I paid for it and when all is said and done I'd be about $1200 cash out of pocket for the 5090. But it looka like prices are very very slowly trending down so idk.
>>105702908>>105702946I can already see it now...
>people paying AromAI $45 monthly for the "Dirty Sanchez" cartridge
hands
md5: 5a9d17c86452ada25088203084c1154d
๐
Do you think Chroma hands now are better now? I just added e621 and furry into negative, set CFG to 5
About 512x512, I don't know how to say, but it's honestly SOVL meme. There is something about large reso models that made me disliking them. Maybe too much details can ruin imagination
>>105702288yeah, 720p is really quick now with the lora too, gonna try the q8 gguf.
>>105703214also for testing outputs, 800x448 is very fast with 720p. same with 480p at 600x480
>>105702149please catbox. I need a new workflow to learn from, ty.
>>105703257use the rentry i2v with the light2x lora from OP, same workflow I use.
>>105702908>next conquest?Brainwave to img. We can already do this with dreams and brainwave to sound to some extent. We're close.
success
pink hair anime dog jumps high in the air.
> sageattn2++ is "early access"
what the fuck is this shit? why?
>>105703314ok, thanks for the directions.
>>105703332>he doesn't have fake chinese lab creds
800x448, 720p wan q8, around 60-70s on a 4080. ty speed light2x lora.
>>105703332>what the fuck is this shit? why?they took advantage of the local kekmunity and rugpulled everything for financial gain
>>105703169>e621 and furry into negativeive never thought to put "e621" in the negs instead of a multitude of furry tags. did he actually tag all e621 with that?
>>105703363>800x448, 720p wanI'm surprised it can render such low res, 800x448 is a res made for 480p usually
>>105703332Are you going to be the one to implement it in [inference engine]? If not then there's little reason to care. I'm sure [notorious coder] is already working on it.
>>105703332It has commercial relevance for inference, I would not be surprised if it goes commercial completely at some stage. A lot of applications quickly supplanted its use of other forms of attention for Sage Attention with how much of a loss you take on accuracy which is nearly none and at worse, slightly less than 1-2%. Sage Attention 3 is where I don't know if people will go for that, the losses are a magnitude higher at 14-15%
>>105703314there's no "rentry i2v" in the OP anon
>>105703394yep, it's the "now or never moment", I think they know they won't find any improvement anymore so they gatekeep their last secret sauce to themselves, a lot of companies will be happy to pay for some code that improve their inference time by 30% with no noticable quality loss
>>105703388Would just fuck off and rope already, why do you even post this shit?? You have a containment board, thread and god knows what else.
>>105703388keep posting just to make that faggot seethe
I have a dream that one day local won't be a laughing stock of a community
>>105703382with wan 480p I get good outputs with 600x480, and pretty fast: you can scale one side down for more speed at the expense of some quality potentially.
pink hair anime dog eats a cheeseburger from McDonalds.
112s, 720p wan q8: but a lot of that was the interpolation step. I'd say 80-90s for this gen non interpolated.
>>105703388>he un ironically boughted ayymd
>>105703395the wan one:
https://rentry.org/wan21kjguide
>>105703456how? closedAI is charging people for a few prompts a day and with censorship. open source won with wan/noobAI.
>>105703332So are all larger models before public release, sd14, sd15, SDXL, Flux etc. Whatever you're releasing, it's good to get feedback from people with technical expertize to catch any major problems before it's out in the wild.
>>105703462better doro mcdonalds:
>>105703453You're just as bad as them, you too will get your comeuppance someday
>>105703466its fine for most ai stuff nshitia crashes all the time on loonix when trying to play vidya or normal desktop usage so its not worth it as a general purpose card
I just checked the wan workflow from the rentry anon https://rentry.org/wan21kjguide and noticed the UnetLoaderGGUFDisTorchMultiGPU now has the option to use_other_vram
Does this mean that basically multi-gpu is solved? can we launch 40-80GB models into 2,3 or 4 3090's?
Also I set the virtual_vram at 0.0 and it's working, with a single 3090 and wanI2V720p model, it used to require up to 10GB extra with loras etc. Does this now have "automatic" fallback into system RAM like the comfy's native workflows always had?
>>105703517>nshitia crashes all the time on loonix when trying to play vidya or normal desktop usageSkill issue.
>>105703388You have a fork at least.
https://github.com/EmbeddedLLM/SageAttention-rocm
The main issue is I want more than 2 players to not get into the same situation the general GPU market for gaming has gotten into. So where Intel is more appropriate and they are MIA everywhere, even for basic stuff like Flash Attention.
>>105703540its not automatic you can oom the virtual vram uses system ram i normally set it to 10gb
>>105703517Unironically not on my system. Must have been quite awhile since you last tried it.
>>105703577oh nice i didn't know about this will try it out next time i use wan thanks
>>105703608a couple years i bought a 3090ti for stable diffusion but had so much crashing i seethed and sold it after a couple months kek put me off nvidia completely or id get a 50 series card im hoping amd or intel releases a high vram card
>>105703378idk, I just invent it myself, lmao
but maybe...
>>105695065 (OP)How to cure my Proooooooooompt Addiction