Discussion of Free and Open Source Text-to-Image/Video Models
Prev: >>105857606
https://rentry.org/ldg-lazy-getting-started-guide
>UI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows/home
>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe
>WanX (video)
Guide: https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1
>Chroma
Training: https://rentry.org/mvu52t46
>Illustrious
1girl and beyond: https://rentry.org/comfyui_guide_1girl
Tag explorer: https://tagexplorer.github.io/
>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate
>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/celeb+ai
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg
>Local Text
>>>/g/lmg
>Maintain Thread Quality
https://rentry.org/debo
>>105866229 (OP)
i love that some farting shit actually made it into the collage.
Death thread
Stagnate hobby
Why did you make another general if the other one is still on page 2?
Can anyone explain how these tensors work in vae?
For SD 1.5 and SDXL models:
[[1, 4, 160, 106]] for compressed and for output [[1, 1280, 848, 3]].
I fully understand the output; batch size, height, width, color channel.
As for the compressed image; 1 is the batch size and 160 and 106 are compressed resolutions (divided by 8). But I can't make sense of "4". (3 colors plus alpha, maybe?)
It gets even more weird when you have SD3, SD3.5 and Flux with [[1, 16, 160, 106]]. Why did it become 16 now?
The VAEs of video models like hunyuan and WAN have [[1, 16, 1, 160, 106]], I assume the 1 in the middle is frame-number (I am testing images) (though interesting that it still outputs [[1, 1280, 848, 3]]) but I still don't know what 4 and 16 are.
Also do the "4"s in SD 1.5 and SDXL represent the same thing, considering that forcing 1.5 VAE on SDXL doesn't return great results? (Pic related)
currently using WAI-NSFW-illustrious-SDXL (v14) with loras. is there anything that's a direct upgrade i should be trying out? got an rtx 3080 with 12gb of vram, and it seems like this model is pretty flexible. i heard about lumina and that it might support those loras so that seems like a logical next step to try stuff out.
>>105866414
4 -> 4 channel latent image
16 -> 16 channel latent image
They are not normal images, just lossy compressed representations of the original image, something like a .zip but losing part of the original data on every compression/decompression process.
See:
https://gist.github.com/madebyollin/ff6aeadf27b2edbc51d05d5f97a595d9
give me a status update on ldg
Blessed thread of frenship
>>105866551
Oh I see, these are "feature maps" that correspond to arbitrary shit, got it. I ought to do a deep dive reading about VAEs.
So the SDXL VAE encodes different arbitrary information than the 1.5 VAE expects, hence the gibberish outputs.
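For anyone who wants the shape arithmetic from the question spelled out, here is a minimal Python sketch. It assumes an 8x spatial downscale (which matches 1280x848 -> 160x106 above) and, for video VAEs, a 4x temporal compression where the first frame gets its own latent frame; that temporal factor is an assumption about Wan/Hunyuan-style VAEs, not something stated in this thread. The function names are made up for illustration.

```python
def latent_shape(height, width, latent_channels, frames=None,
                 batch=1, spatial_factor=8, temporal_factor=4):
    """Return the latent tensor shape a VAE encoder would produce.

    latent_channels is 4 for SD 1.5 / SDXL and 16 for SD3 / Flux style VAEs.
    Pass frames=N for a video VAE (assumed layout: batch, channels, time, h, w).
    """
    if height % spatial_factor or width % spatial_factor:
        raise ValueError("height and width must be multiples of spatial_factor")
    h, w = height // spatial_factor, width // spatial_factor
    if frames is None:
        return (batch, latent_channels, h, w)
    # Assumption: the first frame is kept, the rest are grouped by temporal_factor.
    t = 1 + (frames - 1) // temporal_factor
    return (batch, latent_channels, t, h, w)


def compression_ratio(height, width, latent_channels, spatial_factor=8):
    """How many RGB values are squeezed into each latent value."""
    pixels = height * width * 3
    latents = latent_channels * (height // spatial_factor) * (width // spatial_factor)
    return pixels / latents


print(latent_shape(1280, 848, 4))              # SD 1.5 / SDXL
print(latent_shape(1280, 848, 16))             # SD3 / Flux
print(latent_shape(1280, 848, 16, frames=1))   # video VAE fed a single image
print(compression_ratio(1280, 848, 4), compression_ratio(1280, 848, 16))
```

The channel count is just how many learned feature maps the VAE keeps per latent position, so a 16-channel latent stores four times as much per position as a 4-channel one. Two VAEs with the same latent shape can still assign different meanings to their channels, which is consistent with the 1.5 VAE producing garbage on SDXL latents.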
Anyone know what was used for these? It kinda cleaned up the pixelated camera footage which I like
>>105866680Looks like a sora/chatgpt gen to me
>>105866680
>piss filter
4o
calling guy who bought ugly $4000 computer with a 5090. I think it'd be interesting to see how long of a wan video you can gen before it just totally loses coherence. like crank the length past 161 and see how far it goes
>>105866516
Not really, anon, WAI Illust is pretty solid. At this point you just have to try different checkpoints to see which base style goes well with your favorite loras/artists. If you are navigating through civitai you can try: Hassaku, Prefect Illustrious, One Obsession, Hyphoria and Raehoshi just to name a few.
>>105866750
anyone can swap to ram and generate past 161 with a gpu 1/10th the price, the problem is the video mostly loops after that and doesn't continue properly, because it's a limitation of the way wan was trained, you simply need to chain generations
>>105866996so THIS is the power of pornmasterPro_noobV3VAE
so why does chroma have shit fingers still? its fixable but still... dataset?
>baker is the dog poster
Yikes
>>105867099>>105867094quite obvious at this point
>>105866624why post this if it's not true?
why baker?
>>105867084
Chroma is a continual pretrain, not a finetune. The goal is to get rid of all the slop Flux introduced while adding more knowledge.
Once Chroma is finished training, that's when the finetuning begins to fix the fingers and stuff. Knowing the community, it will probably be slopped and overfitted again though.
>>105866516
if you are willing to dedicate time to fiddle with the model, pick loras, use things like cfg skip: https://civitai.com/models/833294/noobai-xl-nai-xl
base noob offers the best flexibility
easier choices:
https://civitai.com/models/1301670/291h
https://civitai.com/models/1201815?modelVersionId=1412644
wai is terrible and should only be used by absolute beginners or civitai genners, I hate that it gets shilled here all the time now, it's basically the training wheels of models
does svdquant flux limit me in any way compared to q8 flux?
>>105866401he HAS to control his precious bake, even if it means posting way too early
he is on 4chan 17 hours a day
>>105866609flooded with AI generated images
>>105866609status:
https://suno.com/s/RcDPuxktDnO2J9eq
>>105866498Super clean, love it, catbox?
>>105866780
>hey anon its great to finally meet u
>u dont really seem to look like your profile pics
>your hobbies are Ai image generation software technologies?
>yeah i think i better go this is really giving me liverking vs joe rogan vibes
I wish there was some autist who gave in-depth analysis and reviews of every single checkpoint on Civitai. Closest thing is that person who posts the grid with Link and Zelda kissing but that doesn't say much other than the fact that 90% of the models are some variation of generic anime.
>>105866609>one schizoid baker & malicious intent>splinter general will disappear with him once hes deported or in jail for posting pizza
>>105866252>collage>its only chosen by one autist every time>he actively fucks over posters every thread by omitting them intentionallycringe
>>105866609status????????
>>105866328best gen in thread is an anime b/w turd coming out of a rectum
>>105867243at least he has refined taste in farts.
>>105866966Are miners going to buy all the 5070 Ti Super 24GB's?
>>105867190pretty certainly not
>>105867221
>u dont really seem to look like your profile pics
>mfw my profile picture
>>105867380BWAHAHAHAHAHAHA
>>105867084
>its fixable but still... dataset?
You're thinking very small if your concern is hands. If all Chroma had to worry about was hands it would be done by now. It does a decent job with hands.
Is this normal? I'm following the rentry guide, I'm up to the Control Net part, and I'm trying to install comfyui_controlnet_aux from the Custom Node Manager and it says it has 60/65 conflicts.
I haven't installed any nodes besides the ones from the guide.
>>105867561just install everything required for any workflow, if theres an error then look it up and thats it, bathe in dependency and python hell until we get AGI which will clean it up for you and debloat 70gb of your comfyui install in 2027
>>105867561
>I haven't installed any nodes besides the ones from the guide
you must be on some kind of pre-configured install with some custom nodes, but it's OK, if the nodes have the same name they're likely copypasted and do the same thing.
>>105867561
>Is this normal
absolutely
>>105867561>>105867622In short you can just go ahead and install.
If you have any problem just uninstall ComfyUI-tbox and that "FLUX PRO" node pack that probably came with whatever container/preset you're using.
>>105863390update: managed to gen this dumb moving avatar, looks pretty ok too.
>>105867639>>105867622for what it's worth i installed via pinokio
>>105867750Conflicts just mean the nodes have identical names and you'll only be able to use one of the dupe nodes. In most cases this doesn't matter because the nodes with identical names are identical.
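A rough illustration of what the manager's "conflicts" count means, as a Python sketch. It assumes each node pack registers its nodes under plain string names and that a conflict is simply the same name appearing in more than one pack. The pack names come from this thread; the node names and the `find_conflicts` helper are hypothetical, and this is not how the manager is actually implemented.

```python
from collections import defaultdict


def find_conflicts(packs):
    """Map each duplicated node name to the list of packs that define it.

    packs: dict of pack name -> iterable of node names (hypothetical data).
    """
    owners = defaultdict(list)
    for pack_name, node_names in packs.items():
        for node in node_names:
            owners[node].append(pack_name)
    # Only names claimed by more than one pack count as conflicts.
    return {node: names for node, names in owners.items() if len(names) > 1}


# Hypothetical node lists, for illustration only.
installed = {
    "comfyui_controlnet_aux": ["CannyEdgePreprocessor", "DepthPreprocessor"],
    "ComfyUI-tbox": ["CannyEdgePreprocessor", "ImageResize"],
}
print(find_conflicts(installed))
```

Only one pack can own a given name at load time, so the duplicate is shadowed rather than broken. That is harmless when the two nodes are copies of each other, as the replies above say is usually the case.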
guys you better dl everything you want because I'm gonna pull and bankrupt the whole thing
>>105867156rookie numbers honestly
>>105866229 (OP)v disappointed moot didnt make the catalog
do better!
>>105867137
Is it going to be done by v50?
>Once Chroma is finished training, that's when the finetuning begins
Chroma is the only finetune we got for flux, what makes you think we will have more? Even if by some miracle someone had the funds to do a Chroma finetune on weebshit for example, they would have to convert all the danbooru tags into NL, with no guarantee it would work.
I mean, look at Chroma, we are almost done with the training and yet it doesn't know any artist style
i can't gen stuff for shit so instead i just use actual art that people have made and rape it with img2img because for some reason img2img is some magic shit but i never get satisfactory results with text
how evil am i for turning peoples' hard-drawn art into AI sloppa porn?
>>105867874
https://desuarchive.org/g/thread/105866229/#q105867874
It really is all so tiresome
So regarding the guide, when using regional prompting and specifying 2girls in both region prompts, I have a question:
What if it's a 1girl, 1boy image? Or 2girls, 2boys? Do I specify that in both prompts too?
>>105868162having to use external third-party archive sites is just the norm at this point
Why are stable diffusion models capped at 6.67GB?
liverking vs joe rogan vibes
>>105868175
>Regarding the guide
my professional advice is not to click on any of that trash
>>105868175Use comfy couple, although it's still a bit janky with the outputs. I found nothing better for getting consistent character interactions between defined characters, although it can still be janky and force you to inpaint.
>>105868162>>105868187its happening on multiple boards though?
>>105868190(((they))) dont want to make it too obvious with the masonic symbolism of rounding to 0.01gb lower
>>105868276prob true
>>105868271its just one faggot mass reporting ignore
>>105868291
No I just meant like why don't they make them bigger, my gpu can handle more
>>105868291Because there are too many 8gb vramlets who are holding us all back. In the meantime use q8 chroma https://huggingface.co/silveroxides/Chroma-GGUF/tree/main
>>105868162might as well add the archive link every thread what a schizophrenic
>>105868299I've used chroma fp16 and I do find it superior. What I miss is the lazer focus models like illustrious and noobai have when it comes to making 2D smut
>>105868232
>couple
What if I want more than a couple of characters? Looking for something more versatile.
>>105868321
krita ai plugin with regional prompting is the final frontier until we get better text 2 image editing
>>105866229 (OP)fetch the paper doggy
>>105868323Then inpainting is probably your only real tool.
>>105868330I'll check this out anon thanks
>>105868337why is italy so horny ALL the time and how do i get in on it
>>105868349well firstly, I'd recommend cutting down your time on 4chan from 17+ hours to something a bit more reasonable
>>105868175Yes, you do. You only reduce the number of tagged girls/boys if you start getting extra characters.
>no need to inpaint if it was sloppa all along
>>105868337
>GOODBYE MALE
>The war of the sexes seen through the American presidential elections (and a somewhat perverse little game). Female hegemony underway, man besieged in the trenches of eroticism
>By Antonio Gurrado
>There is a recipe for happiness, proposed by a German philosopher in the 19th century, which recommends choosing a woman who knows how to cook, abstaining from reading novels, and spending Sundays outdoors.
>A few decades later, the French developed their own recipe for happiness, which consisted of the freedom to wear no underwear, especially under a mini-skirt.
>Today, there's another one, Japanese this time, that calls for dressing up as characters from anime or video games and engaging in ambiguous roleplay involving suggestive positions, whips, and domination.
>The erotic imagination has gone full circle: now men are no longer the ones who dominate but the ones who are dominated.
>(Caption under the title image)
>BANG BANG - In the world of hentai, it's women who dominate, not men.
lmao what the fuck
>>105868371hes right you know
>>105868201this, skip this guide shit and use anistudio instead, no guide required
>>105868377the baker is an autistic dumbass
>>105868201
>my professional advice
And who are you? Can I see your credentials?
>>105868386>>105868377prob him attacking comfy dev
grim
>>105868387i made a ui that has an actually good license and doesn't need a phd to use
>>105868387since I'm probably the only one here with an electrical engineering degree, no.
>>105868397I'm the only one here with an 8 figure valuation. I'm not interested in your degree, just wanted to confirm you have no capability to substantiate your statement of "don't read the guides".
>>105868407>I am emotionally invested in the guides I typed here for that do nothing but confuse low-level shittersbeahahah
Why does mental illness look like this?
>>105868414
>>105868422>>105868407i have billions! gajillions!!
beahahahahahahah
you do not truly understand a subject if you can't explain it to a five-year-old
>>105868299
>>105868291
More like 40xH100lets who are holding us all back, not being able to train larger models in acceptable time.
>splinter general baker does nothing but drive users away
>longest thread yet was this last one
its almost dead boys
my favorite part was when he report-bombed his own thread because people said "good morning" & then the janitors archived everything
Okay, chroma is clearly matching and surpassing noob in creativity for 2d stuff as long as you're okay with no artist tags. Neat.
>>105868447he is a fat autistic retard
Who can't even type a proper guide to help redditors
>>105868448why are you still using comfyui? we have better alternatives now
/ldg/ was doomed the moment it decided to not include the only hope we had for a non-shit ui in the op
>>105866609status: v stinky, v bisgustin
>>105868505i've personally been using anistudio and it's much comfier than cumstain
>>105868513>just install this totally legit glowie software lel
>>105868504a 60 line diffusers script?
>>105868504>non shit uiPlease do say
>>105868504which one is that?
>>105868532>>105868538>>105868544https://github.com/FizzleDorf/AniStudio
>>105868546I heard this guy is some sort of opsec guru
>>105868546>it's the mentallly attenttionwhore trannoidLMAAAAAAAAAAAOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO
im really v sorry but this "general" is a total waste
>>105868571so true xister!
we should pack it up and join our fellow worthless avatartrannies in the 'cord
>>105866531>schiit audiobased
>>105868601please clean your house
If I want to train a LoRA that mimics the aesthetics of LotR how should I do it?
Take loads of screenshots in 1024x1024 resolution and train a concept lora?
>>105868600you are mentally ill, stop following me
How does Swarm compare to ComfyUI and Forge?
>>105868546Comfy made fun of you for your shitty wrapper btw
>autistic Baker continues his tirade
>>105868627
>i want to train a lord
a lora
no one here has even the slightest technical capability to help you do such a thing, you are better off blindly stabbing at search engines or begging redditors to spoonfeed
Is FramePack only useful for single characters dancing? When will we be able to generate 1-minute videos of two characters interacting?
>>105868627That's basically what I did to make a lora based on the movie Heavy Metal. But that was a more obvious style lora and what you are wanting is a little different I suppose.
>>105868627Yes, you should try.
>>105868689sauce(on dat metal)
>>105868689she is cooking her fuckin arm!! :(
>>105868638hes right u know
>>105868706Unfortunately I lost it, along with most of my 1.5/early sdxl stuff when my old laptop died. There are other heavy metal loras though that people have made out there.
>>105868714I prefer to think that she lost her right arm past the elbow.
>>105868486>why are you still using comfyuiI'm mainly using swarmui, it's just native swarm chroma outputs are shittier than comfy + detail daemon
>>105868486its called FUN
I do it for FUN
>>105867150just tried out noob with some loras and it's pretty damn expressive. it has a far more painterly feeling to it compared to wai. thanks.
>>105868627High quality dataset + proper tagging. Everything depends on the checkpoint you are training on.
>>105868668Stop projecting.
>>105866498>>105866711>>105866733dead fish eyes
slop face
stiff body
2025 its easy to do this kind of slop
>trani """stealth""" shilling again
>>105868956careful now, else he'll samefag on you to damage control
>>105867561
>he didn't pay the ComfyUI subscription
Anon, to use ControlNet you have to pay
>>105868984Stop spreading misinformation, ComfyUI is free.
>>105868414>confuse low-level shitters basado
Hi, I need Loras tips. Found some paid AItools and non-Windows software. Any recommended YouTube channels or experts?
>>105869000*free and bloated
>>105869086use this to caption your images
https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
where do I get a flux kontext lora dataset from so I get an idea how the images are tagged?
>>105869102
Thanks! The issue is that generous anons give me info slowly, so I can't see the bigger picture of how to do it.
"Get images or video clips (img1.jpg, img2.jpg)
Write captions (img1.txt, img2.txt)
Use diffusion-pipe or ai-toolkit"
An anon told me this yesterday. How do I merge your info with his?
>>105869139create the link to generate captions and then copy paste them in img1.txt and so on
>>105869139
>https://chatgpt.com/
>How do I train Lora using tool x?
You can copy+paste github pages there if it gets too difficult.
>>105869102This is a meme model? The caption is wrong and it's an AIgenerated SFW image. I prefer using free Corpo models from LMarena.
>>105866314kek, anon delivered
>>105869182>>105869139>>105869185>>105869102Ok so in my "How2make a Lora" backpack I have this info:
*************************
Get images or video clips (img1.jpg, img2.jpg)
Write captions (img1.txt, img2.txt)
Use diffusion-pipe or ai-toolkit
*********************
use this to caption your images
https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
********************
create the link to generate captions and then copy paste them in img1.txt and so on
******************
>https://chatgpt.com/
>How do I train Lora using tool x?
You can copy+paste github pages there if it gets too difficult.
********************
Is there really no rentry, tutorial, YouTuber, or famous LoRA maker here? Just scattered info?
I won't rent a GPU or capture images if the Lora is going to fail due to misinformation
>>105869098
The bloat is hard to notice.
>>105869235
>joy-caption-beta-one
taggui is a good gui for mass captioning images that has an option for that model https://github.com/jhc13/taggui
Does anyone know why wan2.1 isn't working for me? I had it installed and working before, but now for some reason it returns this error saying "Torch not compiled with CUDA enabled". I've tried a clean install and everything through pinokio.
>>105867561normal. you can create alternate venvs if you have hard dependency conflicts, so it's at least better than nexus mods.
>>105869271https://github.com/comfyanonymous/ComfyUI?tab=readme-ov-file#troubleshooting
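That error usually means the torch in that environment is a CPU-only build. A quick way to check, as a minimal sketch: run the snippet below with the same Python interpreter the install uses (for a pinokio install that means its bundled environment, which is an assumption about how that installer is laid out). `cuda_status` is a made-up helper name; the torch attributes it reads are standard.

```python
def cuda_status():
    """Describe whether the installed torch build can see a CUDA GPU."""
    try:
        import torch
    except ImportError:
        return "torch is not installed in this environment"
    if torch.version.cuda is None:
        # CPU-only wheels report no CUDA version at all.
        return f"torch {torch.__version__} is a build without CUDA support"
    if not torch.cuda.is_available():
        return (f"torch {torch.__version__} was built for CUDA {torch.version.cuda}, "
                "but no usable GPU or driver was found")
    return (f"torch {torch.__version__} with CUDA {torch.version.cuda}: "
            f"{torch.cuda.get_device_name(0)}")


print(cuda_status())
```

If it reports a build without CUDA support, something reinstalled torch from the default index, and reinstalling the CUDA wheel as described in the linked troubleshooting section is the likely fix.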
>>105866314>There, now your bonzai tree can give you handjobsWhat a time to be alive
What's the benefit of using
>FluxKontextImageScale
if any
bhira strikes again, his replies are hidden because of the downvotes but he basically admitted that he reported tensorart to payment processors
https://www.reddit.com/r/StableDiffusion/comments/1lx164s/tensorart_no_longer_allowing_nudity_or_celebrity/
>>105869475someone should take the old yeller out
>surely nothing bad will happen if we centralize everything again
rant: theres 10 billion noobai and illustrious merges on civitai and they all perform about the same
>>105869475
he's got a point tbqh
>>105869475
>Owing to mandatory regulations from credit card organisations
Yes
>and regulatory authorities
Lie, there are no laws against hosting celebrity loras and/or pornographic content, it's only the payment processors who are making demands
Civitai still has porn, their new payment processor told them to choose celebrities or porn
Tensor.art's payment processor is likely VISA/Mastercard because they demand both celebrity and porn to be banned
Tensor.art is like 90% 'exclusive' content that you can only run on the site, so I've never bothered checking their content, but I can only assume it's vast majority porn, which would mean it is dead.
>>105869475Yeah he's off his rocker. It's not even about deepfaking or nsfw, he's just content policing what he doesn't like.
At first I thought it was just him going after celebrity and deepfaking things which is whatevs, but contacting these sites with the payment processor angle is much deeper. I wouldn't be surprised if he was behind the Civit stuff too.
Upload all your stuff on torrent trackers for fuck's sake
>>105869522No, the attacks on open models have nothing to do with porn and celebrities, it's all about control and killing open models, (((they))) will still come for them even if there's no porn or celebrities.
He is retarded or just full of shit.
Just move everything to china
>>105869522He's literally just gaslighting and virtue signaling, dude is insane. Notice that it's just him, and he's not trying to work with the overall community to decide what should or shouldn't be acceptable.
He's even trying to say models are illegal which is just absurd.
>>105869475These redditors are unhinged, you should be on a special list because you generate nsfw ai content
>>105869547I mean, yes, piracy is as rampant as ever, games, movies, tv shows, apps, often pirated same day as release and spread throughout the world.
Meanwhile these files aren't even illegal, it's fully legal to download and share celebrity and porn loras / finetunes, the only thing that is illegal is to share sexually explicit deepfakes of real people online without their permission.
All you need is a simple web-page with magnet links, a seedbox somewhere which makes sure there's at least one seed for every torrent, the vast majority of these files are in the ~300mb range.
>>105869604>it's better to watch real people having sex, feeding the abusive porn industry, than just generating harmless porn locallyActual mental illness
>>105869604I doubt this is even a real person, just some bot or paid industry shill
>>105869636it's just BFL weaponizing the same indignant autism you see in the likes of the rocketpedo or cumfart.
>>105869235A lot of people make a living by doing Loras on commission, so obviously here they won't teach you anything. You have to try it for yourself.
>>105869666I pity the poor bastard who makes a living from it
pocket change sure, but not a living
>>105869604probably a sour onlyfans whore who lost their simps income to AI.
How does Wan2GP compare to Wan+ComfyUI? Is it like Forge vs Comfy?
>>105869687>>105869677yup. you're getting spanked
>>105869547Zoomers don't know how to pirate, they were born in the age of Steam and Netflix. It's over.
They don't have a reward system in their brains or the pirate spirit of feeling good about giving things away for free. They need buzz, likes, retweets, tensor art coins, hashtags, Discord turbo. It's over.
>>105869666
It's free alcohol + computer upgrades. "How to make lora" is just such a stupid question that I'm not personally going to waste time answering it when there are LLMs that can easily help with basics.
>>105869733
>its so easy that any LLM can answer you.
That's a lie. Asking an LLM is synonymous with doing it and failing, then searching in forums to find out where I went wrong and wasting time doing it again.
If you don't say how to do it, it's because you literally make a living doing that and part of your income comes from Lora commissions.
Come on! Tell me once and for all, what's your Discord, Tensor, or Civit account? That way I can contact you privately and you can make me a LORA.
The fact that there's no torrent of all the celeb loras out there really makes me sick
>>105869235I'm the guy who gave you:
>Get images or video clips (img1.jpg, img2.jpg)
It's not misinformation, making Loras isn't hard per se but you will have to practice.
Because besides having:
dog1.jpg (your dog)
dog2.jpg (This is a white dog named "Spot", he has curly white fur and a large spot around his right eye, he is sitting on the grass at a barbeque)
You have to recognize the patterns and features of your dataset and understand what you're attempting to teach the AI which doesn't have a brain. If your dataset is poorly designed (and there is no way to teach you this, you have to do trial and error) you might teach your model wrong things like "Spot only is seen with blue collars so make sure there's a blue collar in every picture" or "Spot is never with people, so never put people in pictures".
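Before renting a GPU it is worth checking that the folder really follows the img1.jpg / img1.txt pairing described in this thread. A minimal Python sketch, assuming captions sit next to the images with the same base name; the extension list and function names are illustrative, and individual trainers may accept other layouts.

```python
from pathlib import Path

# Assumed set of image extensions; adjust to whatever your trainer accepts.
IMAGE_EXTS = {".jpg", ".jpeg", ".png", ".webp"}


def missing_captions(filenames):
    """Return the image files that have no caption .txt with the same base name."""
    names = [Path(f) for f in filenames]
    captions = {p.stem for p in names if p.suffix.lower() == ".txt"}
    return sorted(
        p.name for p in names
        if p.suffix.lower() in IMAGE_EXTS and p.stem not in captions
    )


def check_dataset(folder):
    """Scan one dataset folder (non-recursive) for uncaptioned images."""
    return missing_captions(p.name for p in Path(folder).iterdir() if p.is_file())


print(missing_captions(["img1.jpg", "img1.txt", "img2.jpg"]))
```

This only catches missing files. Whether the captions describe the right things (collar color, whether people are present, and so on) is still the trial-and-error part.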
>>105869766No single torrent, but you can download practically all the celebrities here: https://civitasbay.org/
>>105869475damn it's over no more celebs ever again
>>105869805she's underage my dude
Been experimenting with 4fps with Wan Loras and it seems like a way to cheat to longer videos. You could probably do start/end key frames across the sequence to restore the lost frames. diffusion-pipe seems buggy with low fps though, so you have to make the clips 24fps (basically fast forward) and then set 4fps at gen time.
>>105869475what an absolute lolcow faggot trying to sabotage open source ai with his antics. what does this faggot get out of doing this shit?
What "run_nvidia_gpu.bat"? This is the first time in the guide "run_nvidia_gpu.bat" is mentioned.
>>105869843Lack of attention and control in real life mixed with a moral crusade from someone with with a guilty conscience.
>>105869787>>105869795>>105869800not so fast
https://old.reddit.com/r/StableDiffusion/comments/1lx164s/tensorart_no_longer_allowing_nudity_or_celebrity/n2iocqr/
>goon to genenerated blue archive porn
>use post nut clarity to work on sfw project
>>105869778Or if you specify "black spot", it will gen any other color often even if "black spot" is prompted.
What is people's aversion to just making a fucking torrent tracker for models? Or even just a website with magnet links and gallery images?
>>105869840Interesting, more info?
>>105869890I don't know of a single country where it is illegal to download/share celebrity or porn loras, it's perfectly legal in the US.
Maybe it's illegal in the middle east ?
>>105869910
Depends on the model, the smarter models like Flux and Wan will correctly associate features as long as they're captioned. If Spot's fur color, spot color and collar color are always mentioned and labeled the model is more likely to not make it an intrinsic feature.
>>105869778Thanks for the info. In other words, I have to spend time and money to create a good Lora. And I can do this by renting a GPU and trying different things until I get the result I want. And I would spend the same amount of money on a pay to go service. Also, my Lora won't be compatible after new structural image generation model is released.
I am starting to think that this hobby is for dead-end people who have money and free time but no future in sight. Each one reinventing their own personal wheel.
Thank you for your honest information, anon. <3
>>105869925I mean there's not much more information than that, instead of doing 24 or 16fps videos you make the clips 4fps and fill up the maximum frame count you can train on for a single clip, I'm still experimenting but it seems to work just fine. I'm also experimenting with doing the same but for converting an image gallery photoshoot into a similar 4fps clip, this also seems to work. My first experiment was simply doing 3 images as a slideshow and Wan was able to figure it out and extrapolate to basically doing a gallery at 81 frames.
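The "keep every nth frame" version of this can be sketched in a few lines of Python. It assumes uniform subsampling from the source fps down to 4fps and an optional cap on clip length; `subsample_indices` is a made-up helper, not part of diffusion-pipe or any Wan tooling.

```python
def subsample_indices(num_frames, src_fps, dst_fps, max_frames=None):
    """Pick which source frames to keep when dropping from src_fps to dst_fps."""
    if dst_fps <= 0 or src_fps < dst_fps:
        raise ValueError("need 0 < dst_fps <= src_fps")
    step = src_fps / dst_fps  # e.g. 24 / 4 = keep one frame in six
    indices = []
    i = 0
    while True:
        idx = round(i * step)
        if idx >= num_frames:
            break
        indices.append(idx)
        i += 1
    if max_frames is not None:
        # Trim to the longest clip the trainer can take.
        indices = indices[:max_frames]
    return indices


# Two seconds of 24fps footage reduced to 4fps.
print(subsample_indices(48, 24, 4))
```

Writing the kept frames back out as a 24fps file gives the "fast forward" clip described above, and hand-picked key frames could be swapped in by replacing the returned list.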
>>105869692
wan2gp is comfier than comfyui, noob friendly with its ui, has multiple optimization profiles to select and automatically downloads the model for you from huggingface.
>>105869869It's included in the ComfyUI windows portable version, you jackass.
>>105869840
>>105869972
Btw do you drop everything but every nth frame, or select frames some other way? Because I think for such low fps the frames should contain motion peaks, like if you always wanted to capture the leftmost and rightmost positions of a swaying pendulum.
i want to use subgraphs for comfy, but half my workflow uses cg everywhere node, which I don't think works with it yet. damn
>>105869930
I was trying to train a lora for a character on illu and put the color of the outfit in the captions, but gens came out with other colors. And it made me wonder, because if you prompt for a character the model knows, it will gen colors correctly despite colors being specified in the dataset.
>>105870055
I typically never caption colors, only 'bright', 'light' and 'dark' (for example, 'a woman is wearing a dark hat and a bright dress'), the model knows colors extremely well and using them in your own captions when training just seems to confuse it.
>>105869941
What kind of conclusion is that? AI art is no different from any other creative hobby, it's like indie game development, music production, or painting. People pour time, effort, and money into their passions without expecting the world to revolve around them.
You say "no future in sight" like everything has to be utilitarian or part of a collective. Are you upset people aren't making LoRAs for you? Should I hand over my game source code too? Not everything is a communal project.
LoRAs are personal, creative, technical projects based on the creator's interests, like a painting, a song, or a game concept. I made many VR game projects before I got into AI and those never left my computer either. And despite what you're implying, they're not even that expensive. Renting GPU time costs about $5-$10 for most LoRAs. Or you can buy a GPU for $1-2k, and that setup will serve you well for 3-5 years (3090s still kicking by the way), with electricity as your only ongoing cost. Compare that to fishing, music gear, or even traditional art supplies which have up-front costs (often bigger than the cost of an AI computer) and ongoing costs.
People make things because they enjoy it, not for your validation. If you're only in this for shortcuts or handouts, maybe this isn't the right hobby for you.
>>105869941you got serious issues bud
>>105870091But why does this happen? Should models drop colors from their dataset captions during training eventually too?
>>105870000
You could hand pick your key frames but that seems like work. Wan already has a very robust training set so it's likely to understand any implied motion, and it fills in the gaps and extrapolates missing data on its own. You're really just teaching/reminding it of the "fast forward" concept. I think the most important thing is the start frame, which should be high quality especially if you're planning on img2vid.
Local is an absolute joke. Censored garbage now. Pathetic
>>105870055You may need to train for longer. The model might start outputting the character but it doesn't mean it's done. As the model trains it gets better at generalization as it properly associates the captions with the features of the image.
>>105869941>Also, my Lora won't be compatible after new structural image generation model is released.
True, but it's not a major problem since it doesn't take an eternity to train a lora. I've been able to train good Flux loras of ~50 images in ~8 hours on my second machine, which has a 3060.
Secondly, it's not as if new viable models come out very often. Right now there are basically two widely used models, SDXL and Flux. Chroma seems poised to be a strong contender as well, but that depends fully on community adoption.
>but no future in sight???
If anything it is the one technology that will end 'the future' of practically every other interest.
>>105870166No it's not, local is the only thing that is not censored, and will remain so.
I can generate, train, anything I want, no restrictions. Meanwhile everything non-local, which includes Civitai and Tensor.art, as they both push hard on using their SaaS to generate, becomes increasingly censored by the payment processors.
Does anyone know how to install checkpoints in a pinokio wan2GP installation? I downloaded this from civitai:
https://civitai.com/models/1626197?modelVersionId=1852433
I put it in /wan.git/app/ckpts/, but it still won't appear in the UI under the big drop-down menu in the top middle.
I get that wan2GP allows you to download checkpoints directly from within the UI but how do I download and use custom ones?
Sometimes I dream of a world where I could finetune Flux/Chroma on my own machine instead of renting a cloud GPU.
>>105870201Not to mention datasets are reusable across model architectures. Everyone does img1.jpg/img1.txt pairs, so at worst it's just rerunning the training on the new model, which takes a couple of hours.
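For anyone who wants to sanity-check a folder of img1.jpg/img1.txt pairs before reusing it on a new model, here is a minimal sketch. It assumes a flat directory where each caption shares the image's filename stem; the function name `pair_dataset` and the extension list are illustrative, not taken from any particular trainer.

```python
from pathlib import Path

# Assumed set of image extensions; adjust to what your dataset actually contains.
IMAGE_EXTS = {".jpg", ".jpeg", ".png", ".webp"}


def pair_dataset(root):
    """Pair each image in `root` with its same-stem .txt caption.

    Returns (pairs, missing): pairs is a list of (image_name, caption_text),
    missing is a list of image names that have no caption file.
    """
    pairs, missing = [], []
    for image in sorted(Path(root).iterdir()):
        if image.suffix.lower() not in IMAGE_EXTS:
            continue  # skip captions and any other non-image files
        caption = image.with_suffix(".txt")
        if caption.is_file():
            pairs.append((image.name, caption.read_text(encoding="utf-8").strip()))
        else:
            missing.append(image.name)
    return pairs, missing
```

Anything that ends up in `missing` needs a caption written before the folder is handed to a trainer.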
>>105870252hee I've been using the same images since 1.5
tags had to change tho, took me a while to settle on what to use
>>105870251That world exists and it starts with leaving your computer and getting a job.
>>105870251if I won the lottery and got millions of dollars, I'd 100% build a room dedicated to ai training. like 10x h100's all running 24/7 complete with proper cooling, a back up generator and so on. it'd be glorious.
>>105870285kill yourself bootlicking retard
>>105870285I already get paid six figures to post on /g/ - /ldg/ - Local Diffusion General
>>105870308nothing is more bootlicking than someone that yearns for corporation made and designed AI GPUs, models and tooling but then acts like a socialist
You aren't truly local if you're buying a GPU instead of making processors in your garage from your own homegrown silicon dies.
>>105870328if you don't have a job it's your mom's garage and you lose it when she dies
>>105870347>having to provide myself food, water, shelter is fascist, people should work to give me those things for free!
Yeah you're in for a bad 50 years when mommy dies.
>>105870339>and you lose it when she dies
But inheritance.
if you train a character lora on images of two people, will the result be an amalgam of the two?
>>105870364You think NEET anon with a single mother has a financially literate parent? Assuming she isn't a renter she probably has a home equity loan and reverse mortgage.
>>105870251Don't believe the chroma hype, its a slow model that I still have to figure it out whats good for, I have yet to see a good looking women come out of that model, it seems like it was trained on tranny porn
>>105870369yes, it'd be the same as using two character loras.
>>105870378I know, but Flux is plastic-skin crippleware and I'm getting tired of using SDXL.
>>105870397The model is under your nose and it's Wan.
wan is the most plastic unimpressive crap available tho
>>105870397SDXL outputs better women that chroma, chroma really idk whats good for, its a slow as fuck model, is not good at generating hentai/anime content (noob and its shitmixes get better results), the only thing I would say that chroma has its prompt adherence, but whats the point? to generate memes? I mean if you could use it as part of a workflow, like generating an initial image with chroma and then re-use those gens with SDXL/noob, but you can't since its a heavy vram model you would only get OOM errors and instead of taking 10 seconds to generate an image (like using SDXL) you take like 2 minutes with chroma, is not worth it, I've been testing chroma since like v27 and I still have to find its point, I've been in their official discord for a month, they have a channel where they share their generations and they are all ugly, literally looks like tranny porn
>>105870477Chroma with a character lora is actually wild desu. It's nice to get that increased likeness and the photoreal textures that chroma can produce.
It's going to meet everyone's expectations when the model is fully finished and someone runs a porn finetune on it as a base akin to biglust/biglove... that's when everyone will start switching over.
I quit lads
it's no fun struggling with 4 GB of VRAM
makes me wanna cry
I'll return when I buy a 4090 or 5090 once I get a job
>>105870470everything is plastic then and it's amazing you think SDXL is good
from here it smells like *sniff sniff* 8 GB of VRAM
>>105870477this is the kind of gens they are showcasing
>>105870477Show an impressive SDXL image that isn't 1girl standing.
So has anyone figured out what bghira's issue is yet
>>105870505you know how expensive a chroma finetune is gonna be? To train a decent lora it takes 8 hours, I can't imagine a full finetune
I think they should just take all these big models and simply make them smaller. Not sure why anyone hasn't thought of that yet.
>>105870538Living in a third world country
>>105870537I got a near likeness with a standard 2 hour training... it shouldn't be THAT expensive if the base model is there. It'll cost a pretty penny, but nothing that's unobtainable.
>>105870521That happens when there are no fag disablers
>>105870528first define impressive, second, I already said that chroma has a good prompt adherence, but whats the point if you for example want to generate a beautiful woman, chroma is unable to generate that
>>105870531Mentally ill, by his own admission woke, probably having some life crisis, will 110% transition
>>105870549dat y u cut off your dick? loool
>>105870509Don't give up bro, i have fun generating slop with a gtx 1650 (still takes two whole minutes and some to generate slop with 15 steps)
>>105870531Projecting own guilt on everyone. If cops investigate anyone it should be that guy.
>>105870562>a beautiful woman, chroma is unable to generate that
I think chroma is great for pretty 1girl gens
>A single deranged central american theythem is able to cripple local image gen because he's angry that other porn slop models are getting more attention than his
>tensor art banned nudity models
>>105870562Something that shows more than just 1girl standing, e.g. an interactive scene, medium shot. Also "beautiful woman" means many things. Take a real image of a woman you like and then use Joycaption to write a caption, then prompt Chroma with that caption and give the results.
>>105870509If you just want to gen and make single concept loras, a 5060 ti 16gb is good enough, 4090 / 5090 is if you are really DEEP into this stuff.
>>105870604Given what people here said was posted there I'm shocked they weren't raided by the FBI or Interpol. Getting their money cut off is the least of their potential problems.
>>105870591If you have a sand castle it's not going to last.
>>105868371>November 3, 2016 ....
>now men are no longer the ones who dominate but the ones who are dominated
>Female hegemony underway
TOO SOON! LMAO, they thought Hillary had already won the election and stopped pretending that they didn't hate men.
>>105870604They say 'temporarily', which means they are probably looking to change payment processor to whatever Civitai is using, which allows porn but not in conjunction with offering celebrities.
If Tensor.art loses porn it shuts down, just like Civitai would.
I'm making nudifies in flux kontext, and everything is perfect except nipples, they are blurry and kinda wonky 70% of the time. Does anyone know how to fix it?
>>105870639there was a breast fix lora
someone link it
>>105870583thats cute but SDXL girls are still better looking, I can see for example some issues with your gen, the limbs are too long (look at her left arm, thats from the furry content in the chroma dataset), also chroma has a big issue with breasts, they mostly look like shit, they look like bad implants or saggy tits, no in between, I think the dataset of female images of chroma is very limited, also, chroma women tend to have squared jaws, there are no round faces even if you try to prompt it, chroma is biased to give them a very masculine look
Can I use sketch/depth/canny controlnets with Chroma?
>>105870624Aren't they based in Vietnam or something?
>>105870591local has always been about bending over backwards for ethical censorship. been like that since day 1 with SAI refusing to release 1.5. funnily enough it was SaaS company Runway who decided that 1.5 should be released open weights. SaaS understands quality, local just basks in slop until they get run over by licenses and payment processors.
>>105870680The NAI leak \ 'ack was probably responsible for the continued momentum after the 1.5 leak
>>105870509
4060ti 16gb or 5060ti 16gb is solid for sdxl. I remember getting OOMs with just 8gb vram.
>>105870531he's just attention seeking faggot. Most likely a paid shill from either the deepstate, feds or openai/google to sabotage open source ai and Chinese ai models. Notice lately antis are keen to attacking open source ai than closed source api.
>>105870166sorry, I didn't realize local is censored now. I'll go delete my collection of uncensored local models to stay in accordance with this new mandate