No Model Card Edition
Discussion of Free and Open Source Text-to-Image/Video Models
Prev: >>105695065
https://rentry.org/ldg-lazy-getting-started-guide
>UI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Models, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info
>Cook
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe
>WanX (video)
Guide: https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1
>Chroma
Training: https://rentry.org/mvu52t46
>Illustrious
1girl and beyond: https://rentry.org/comfyui_guide_1girl
Tag explorer: https://tagexplorer.github.io/
>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate
>Neighbors
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/celeb+ai
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg
>Local Text
>>>/g/lmg
>Maintain Thread Quality
https://rentry.org/debo
Blessed thread of frenship
I just checked the wan workflow from the rentry anon https://rentry.org/wan21kjguide and noticed the UnetLoaderGGUFDisTorchMultiGPU now has the option to use_other_vram
Does this mean that multi-gpu is basically solved? Can we load 40-80GB models across 2, 3 or 4 3090s?
Also I set virtual_vram to 0.0 and it's working with a single 3090 and the Wan I2V 720p model; it used to require up to 10GB extra with loras etc. Does this now have "automatic" fallback to system RAM like comfy's native workflows always had?
>>105703675
>Does this mean that basically multi-gpu is solved?
always was, this shit has been there for months at this point
>>105703675
with wan q8 I use 10gb of virtual vram, I have a 4080 with 16 physical, so you can use larger models with it. If you have enough vram (you have 24) then there is no need.
oddly enough the virtual vram doesnt slow it down much if at all for me.
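A rough way to sanity-check numbers like these is to estimate the size of the weights and see how much cannot fit. This is only a back-of-envelope sketch: the 14e9 parameter count (taking "14B" at face value), the `reserved_gib` headroom for activations and other models, and the function names are all assumptions for illustration. The actual DisTorch node decides its own split.

```python
# Back-of-envelope estimate of weight size and how much has to spill out of VRAM.
# Assumed, not taken from the thread: parameter count and reserved headroom.

BITS_PER_WEIGHT = {
    "fp16": 16.0,
    # Q8_0 stores 32 int8 weights plus one fp16 scale per block:
    # 34 bytes per 32 weights = 8.5 bits per weight.
    "q8_0": 8.5,
}


def weight_gib(n_params: float, fmt: str) -> float:
    """Size of the weights alone, in GiB."""
    return n_params * BITS_PER_WEIGHT[fmt] / 8 / 2**30


def offload_gib(n_params: float, fmt: str, vram_gib: float, reserved_gib: float) -> float:
    """How many GiB of weights do not fit once `reserved_gib` is set aside."""
    usable = max(vram_gib - reserved_gib, 0.0)
    return max(weight_gib(n_params, fmt) - usable, 0.0)


if __name__ == "__main__":
    for fmt in ("q8_0", "fp16"):
        size = weight_gib(14e9, fmt)
        spill = offload_gib(14e9, fmt, vram_gib=16.0, reserved_gib=8.0)
        print(f"{fmt}: ~{size:.1f} GiB of weights, ~{spill:.1f} GiB to offload on a 16 GiB card")
```

The reserved headroom is the part that moves with resolution and frame count, which is why the same card can need a bigger virtual VRAM setting at 1280x720 than at a smaller size.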
wan 720p isnt that much slower than 480p with the lora, which is neat
using a smaller res to test/for speed but it's a little over a minute...pretty cool imo
>>105703735
>wan 720p isnt that much slower than 480p with the lora, which is neat
which is normal, you haven't changed the resolution. only the resolution and the number of frames change the speed of inference
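To put rough numbers on that: what the transformer processes is a token sequence whose length depends only on width, height and frame count, not on which checkpoint is loaded. The sketch below assumes the commonly cited Wan 2.1 layout (a VAE compressing 8x spatially and 4x temporally, followed by 2x2 patches). Those factors and the function names are assumptions, not something stated in the thread.

```python
# Token count for a video diffusion transformer under an ASSUMED layout:
# VAE stride 8 (spatial) and 4 (temporal), then 2x2 spatial patches.

def video_tokens(width: int, height: int, frames: int,
                 vae_spatial: int = 8, vae_temporal: int = 4, patch: int = 2) -> int:
    latent_frames = (frames - 1) // vae_temporal + 1
    tokens_per_frame = (height // (vae_spatial * patch)) * (width // (vae_spatial * patch))
    return latent_frames * tokens_per_frame


def relative_attention_cost(a: int, b: int) -> float:
    """Dense attention scales with the square of the sequence length."""
    return (b / a) ** 2


if __name__ == "__main__":
    small = video_tokens(832, 480, 81)
    large = video_tokens(1280, 720, 81)
    print(small, large)
    print(f"dense attention cost ratio: {relative_attention_cost(small, large):.2f}x")
```

Running the 720p checkpoint at a 480p-class resolution gives the same token count as the 480p checkpoint, so similar speed is expected.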
>>105703751usually with 480p wan 600x480 works fine, im just testing diff resolutions now. but it's still super fast with the new lora.
>>105703688
>always was
>been there for month[s]
does someone else have problems with the wan workflow from the rentry not saving the videos in the output folder? i always have to change the prefix to something else, the default prefix doesn't work
idk I think 480p wan works faster and the quality is good at the default preset
>>105703577
I think Intel already has FlashAttention or some kind of optimized kernel built into IPEX, because if not, I don't think I could run Chroma BF16 on an A770 at 7it/s
At some point when I upgraded to new beta driver, the speed and VRAM usage suddenly became worse, so I reverted back to an old driver. So I think Intel already had some optimization built in their drivers already
>>105703931
7s/it, typo lol
>>105703930
that's because the 720p is not intended to run on a 480p res, you have to get 720 minimum on either the height or width
>>105703517F U C K O F F
crawl back to whatever kindergarden bush you came from
>>105703930the quality looks good at the intended resolution, whoa no way
>>105703975why would you link that here, retard?
>>105704024what is your problem? hm? fucker
>>105703966
how much memory does 720p wan need at 1280x720? got oom even with 10gb virtual ram (using a 4080/16gb)
>>105704056
>how much memory does 720p wan need at 1280x720?
a lot, that's why I go for 720x720
>>105703975Based
>>105704024Nta because that little freak and his nonce defenders have been posting in these generals now.
its the 25th
where is SageAttention2++
https://github.com/thu-ml/SpargeAttn
>SpargeAttn based on SageAttention2++ will be released around June 25.
>SpargeAttn based on SageAttention2++ will be released around June 25.
>SpargeAttn based on SageAttention2++ will be released around June 25.
been out of the loop, what is chroma? Is it based on flux? is it on civitAI? Will it work with webui forge?
>>105704075
I changed it from 10 to 16 virtual and now no oom, so we'll see how this works. Yeah with 480p wan I like 600x480 for speed with minimal quality loss.
>>105704090
chroma is a model based on flux
https://huggingface.co/lodestones/Chroma
yes it's on civitai
https://civitai.com/models/1338204
yes forge supports it as of this commit
https://github.com/lllyasviel/stable-diffusion-webui-forge/commit/963e7643f07dca155de5de2f617cc17adc2aee4d
>>105704090
A large finetune of Flux Schnell, it is currently still training, at epoch 39 of an estimated 50.
Every trained epoch is released, so you can try it out while it is unfinished.
Support for Forge was merged ~two days ago.
https://huggingface.co/lodestones/Chroma
>>105703931
So I simplified what they actually did but I have done research on this. They put an optimized version of standard SDPA into stock PyTorch 2.7, which piggybacks off their oneDNN library for SYCL etc. It uses a very generic version of SDPA built with OpenCL.
This is where the speed is coming from, on top of BF16 support in the Alchemist architecture, which is how they get it fairly fast. However, that is not Flash Attention and it could go even faster if they had that. They did have a pull request in the main Flash Attention repo, but they closed it.
https://github.com/Dao-AILab/flash-attention/pull/1528
And that only supports Battlemage and Ponte Vecchio since it relies on some lower level primitives Alchemist is missing. And obviously as I mentioned earlier, MIA on other attention types.
slightly smaller width, seems to work decent
>>105704166I do notice the eyes in particular are clearer compared to the 480p model, among other things, with the interpolated output (this isnt) it's even smoother as well
>>105704089
what? I think you sent the wrong link, this is SpargeAttn, not SageAttention
>>105704150So much better than Darth Plagueis
>>105704186
>I do notice the eyes in particular are clearer compared to the 480p model, among other things
of course, more pixels = better details
>>105704166interpolated with a new gen, smooth
>>105703975thanks for the you
asian girl puts a can down on a table.
720p is fast with the lora, scaled down a bit I can go back to 10 virtual vram with a 4080 and no oom error, plus slightly faster.
>>105704356er...it was at 16 still. but multigpu is flexible anyway. 10 seems fine now, no oom. so I have to bump it higher at full 720p default res.
I remember back when I used the 720p model, it had a slightly better prompt adherence than the 480p model. Can anyone confirm if that's still the case with the distillation lora?
>>105704089
https://github.com/thu-ml/SpargeAttn?tab=readme-ov-file#project-updates
>SpargeAttn based on SageAttention2++ will be released around June 25.
So basically if you choose a sparsity of 0 you get the OG SA2++?
girl holding can dives into a pool
I'm trying to just use the fp16 version of wan instead of the gguf but it breaks in the rentry workflow. All I did was add a unetloader node and connect it where the unetgguf node was connected. However, the gens get all jacked up. Is there something else I need to do?
>>105703169
>e621 and furry into negative
>>105703378
LOL, I tried this and it actually works. So far in my testing, it almost always makes significant improvements to anatomy and background details.
PUT E621 AND FURRY IN NEGATIVE. the furry has been training on too much furry data, chroma is literally inserting furry mutations everywhere.
>>105704463
grab the gguf q8
https://huggingface.co/city96/Wan2.1-I2V-14B-720P-gguf
>>105704543
I was intentionally trying to use the fp16, but your link helped me since there are fp16 ggufs too, thanks!
anime dog flies high in the air
wan gave doro wings:
You wouldn't... you know... a mushroom, would you?
AI boomer here. Is Waifu2X still considered a relatively "modern" upscaler for anime-style images or is it stone age tech today ?
>>105704672nice, is that VACE with force enforcing lora?
>>105704693definitely not.
couldn't remember the names but i've seen a ton of shit that completly mogs it since.
a girl jumps into a swimming pool
not bad, interpolation on the 720p gens takes much longer I notice, this is the regular output.
>>105703501 (OP)totally missed that thomas jefferson webm last thread
>>105704778Sir that's Benjamin Franklin.
the ball necklace lora cant handle my waifu's big neck
If the depth controlnet has trouble properly transferring the pose, should I cut out the background or raise brightness/contrast?
a girl sits at a desk with a computer.
amazing, it's a stellar blade image and she seems to be playing a gacha like nikke (made by the devs)
>>105704669WOULD.
would you a big titty succubus?
a girl dives off a cliff into a swimming pool.
AND out
>>105704471what's this art style again?
>>105704976She's fat as fuck so maybe Baroque?
>>105704889>>105704975damn these are really cool
>>105704090
>>105704136
I used this gguf with forge and it works but my gen looks like this on a simple prompt
I'm clearly doing something wrong
>>105705006this is chroma btw
a girl dives off a cliff into a swimming pool.
pretty good!
>>105705023interpolated output:
when you think about it, AI waifus are a real life succubus.
>>105704987kek. no, but that reminds me to do some Rubens gens.
>Masterpiece very dark French Neoclassical orientalist oil painting on canvas by Jean-Auguste-Dominique Ingres with visible brushstrokes and Craquelure.
a girl gets into a red sports car and drives away in the desert.
>>105705006Yeah it should look a bit better than that. Let me know your settings or catbox the gen so I can take a look and see if I can spot what's missing
>>105704796making out with sally hemmings??
>>105704863shouldnt you be using open pose or something, not depth?
>>105705104Openpose is for the premade stick figures
pouring one out into an invisible glass
>>105704898Depends on my survival chances. Ah, fuck it, WOULD.
Would you a marble statue, however?
>>105705173what's better, regular or the detailed chroma version? i'd assume detailed but maybe not
>>105705119open pose creates a stick figure which can be used to pose the character. you said you want the pose transferred, and that's what it does?
>>105705207Yes, but if I already have an image, I can just use depth or canny
node based interfaces will age like milk
>>105705271houdini is doing fine anon
>>105705271it's honestly the complete mismanagement of community nodes that poison the well of workflows. they refuse to put redundant and outdated shit to rest
For my fellow 80's/90's kids.
>>105705173nope. hard, cold, and dry, sounds awful.
here's Rubens.
>>105704898don't they steal your soul if you fuck them
I'm horny but not eternal damnation horny
I have nothing to add to this conversation so here's this
>>105705376just for that, the game over of banjo kazooie was worth it kek
>>105705242I really don't know what you want. Are you saying you only want the pose or everything else in the image, hence the depth CN?
anime girl miku hatsune dives into a swimming pool.
anime girl with blue eyes punches a hole in the wall to her left.
>>105705204I'm using the detail calibrated version, but I haven't done any testing whatsoever. I just know that it werks.
>>105705497Very kino of it to hallucinate an instant hole drilling guitar
>>105704736>>105705033Looks like she gave up on life once she hit the water
Ah yeas, four leaf clover
finally, death stranding honest edition.
would you a squid?
>>105705562kek
>>105705006
>>105705073
ah shit had to afk, here's the gen though
https://files.catbox.moe/p2mhpe.png
If anyone can chroma this and post the result that would be nice.
bumped res a bit, amazon man wont be stopped:
its so hard to make women with large hooked noses
okay, NOW it's a success with a sorta proper logo.
>>105705611small hooked nose enjoyer here I hate liking things that will never be tagged, or arent tagged in english like seiza and wariza even though they are common poses
>>105705611make a lora for it? find a bunch of images of women with large hooked noses.
>>105704987The only thing she baroque is the scale
forge, reforge, or classic for chroma?
>>105705680Judging by this guy's pics, you need to use comfy:
>>105705006
>>105705593
>cfg scale 1.0
Go to 4.
This is the gen with cfg set at 1 and my negatives.
>>105705700
Same seed, prompt etc with CFG at 4 (also upscaled because I forgot to disable it, lol!)
Also, your prompt is... not great for chroma. You need to kinda tell it a bedtime story for a decent gen most of the time.
Of course the silicone demon gives a robot the correct hand.
>>105705593
I set the cfg to 3.5 (recommended for chroma, iirc)
I also set steps to 26 (same)
Negatives: "aesthetic 0, aesthetic 1, e621, furry, low quality, 3D, render, drawing,"
I also used comfyui, just because I'm used to that with chroma at this point.
Not sure if adetailer is messing with it either. You'll also want to experiment with a more detailed prompt.
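For anyone wondering why cfg 1.0 makes the negatives useless: in the usual classifier-free guidance formula the negative prompt only enters through the difference between the two predictions, and that difference cancels at scale 1. A minimal sketch on plain lists of floats follows. The function name is made up for illustration and this is the textbook formula, not any particular UI's code.

```python
# Textbook classifier-free guidance on plain Python lists.
# `cond` is the prediction for the positive prompt, `uncond` the prediction for
# the negative (or empty) prompt. Names are hypothetical.

def cfg_combine(cond, uncond, scale):
    """Return uncond + scale * (cond - uncond), elementwise."""
    return [u + scale * (c - u) for c, u in zip(cond, uncond)]


if __name__ == "__main__":
    cond = [1.0, 2.0]
    uncond = [0.5, -1.0]
    print(cfg_combine(cond, uncond, 1.0))  # identical to cond: the negative has no effect
    print(cfg_combine(cond, uncond, 4.0))  # pushed away from the negative prediction
```

At scale 1 many UIs typically skip the second prediction altogether, which is also why gens at cfg 1 run roughly twice as fast per step.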
https://github.com/comfyanonymous/ComfyUI/pull/8669
>get omnigen2
>put image of girl
>use prompt "make her naked"
Enjoy gooners, this one is for you.
>>105705700
>>105705709
Oh thanks anon. I got old advice that flux used distilled cfg over cfg and didn't like negative prompts.
>>105705748Is this gen or just inpainting model?
>>105705754It's like GPT 4o without censorship.
>>105705748but the model looks like shit from a butt desu
>>105705765The model looks extremely good, you are most likely not using it correctly.
>>105705765nta but yeah, I tested omnigen pretty extensively on my 3090 with their official repo and it was shit
constantly doesn't do what you ask until you try 5 or 6 different seeds, alters faces, changes random stuff etc
it's barely any less shit than the first omnigen. and txt2img is as slopped as Cosmos2B, while being much slower
how is NIGGIDY NAG for chroma?
>>105705748When can i expect this in AniStudio?
>>105705817two more weeks
>>105705748it's working for me
https://files.catbox.moe/fkf3y5.png
>>105705748I never tried the official implementation but I'm using this one now. It appears to be completely uncensored? Can do breasts, nipples, pussy, dick (kind of). Cocks look a little bit weird but it's clearly like 90% of the way there. It can even almost do POV missionary sex. Downside is that the textures and lighting are completely slopped but that can be finetuned away.
>>105705968actually, though it's working for me, it does have a plastic skin problem. and tendency to change faces a bit. surprisingly good anatomy, nudity, sex organs, even seems like you can add in sex with other people with enough prompt engineering. It struggles a bit with oral.
>>105705984Strange that you neglected to include a catbox.
>>105706004It's in ComfyUI now, just download the model and try it yourself you lazy fuck
>>105705918Are you mocking ani?
>>105703533:/
>>105703501 (OP)
>Neighbors
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/vp/napt
v disappointed w\ the 'community' here
v disappointed w\the 'innocent bakers' here
v disappointed w\ the 'harmless thread blesser'
v disappointed w\ the thread quality in the last 4 months (possibly longer)
where's a good vace workflow
>>105706158she looks SO sad wow
>>105706186she does look like she regrets something
but what?
stop shitting up /napt/ asshole ;c
two men get in a chopper and fly away
interesting.
>>105706158
>v disappointed w\ the 'community' here
>v disappointed w\the 'innocent bakers' here
>v disappointed w\ the 'harmless thread blesser'
>v disappointed w\ the thread quality in the last 4 months (possibly longer)
Why?
>>105704160
Then it's pretty hard, optimized attentions such as xformers or SageAttention are very catered to NVIDIA specifics. I think FlashAttention is fairly platform independent though?
Anyway we shouldn't hope for community efforts for such things as Sage, because most of them only think about NVIDIA NVIDIA.
And Intel is pretty slow to implement things not related to LLM, well...
>>105706019please take care of my daughter she likes milkshakes and pringles
>>105706251(self) appointed thread authority, you gain this ability (delusion) by posting the same image for a year
>>105706199>>105706186im posting sadgirls until you update your neighbors list
NO FUN ALLOWED!
>>105706268ask me for my workflow again
get mad again that you dont know how to use comfy kek
get mad at the lora stack i chose again
get mad at rgal-posting (yet again)
saltmining is tough work, but i must continue
>>105706268what is authoritarian about asking to be on the neighbors list kek
>>105706277Can you make her fart?
>>105706301v nice img2img :3
Watch out for tight underwear
chroma is kino.
>>105705578any video frends want to try animating this one, pretty please?
>>105706380Yes most of my gens will be a femboy or chick's with dicks. I mean I have regular stuff.
>>105706361Some times you just know
>>105706358Not local, post this garbage in the /aco/ thread or something.
>>105706398>Yes most of my gens will be a femboy or chick's with dicks. I mean I have regular stuff.that's pretty gay
a man with sunglasses walks to a computer and starts typing.
well, I guess it works.
>>105706426it's like those youtube channels from the 00s with the fancy graphic then the video starts and it's a goofy looking dweeby mofo
What is the secret to stop the slow motion shit?
>>105706426second try: JC does laptop tutorials
>>105706426if you add "stable camera" you wont have faggy jumpcuts in your wanvideos
bye.
>>105706453>4:3 ROG laptop
>>105706438Try adding "slow motion" to negative and "movement is fast" to positive?
>>105706453 >>105706426I want a deus ex review from these guys like the old days
>>105706473I already tried that genius, not working
now this is a transition.
>>105706460thats the only R0G aspect ratio i would consider though
>120+ fps with black-frame insertion >4:3 retro vidya mobile \bst\W O U L D
>>105703501 (OP)Thank you for baking this thread, anon.
>>105703533Thank you for blessing this thread, anon.
As a pokemon fan, I used to like Team Rocket fanart a lot, I thought it was cool.
This faggot is not only shitting up the threads with his terrible and slopped 1girls, but is also misrepresenting what team rocket grunts are supposed to be
>>105706488Lower lora strength?
imagine unironically complaining about "slop" on \LDG\
aka local diarrhea general
>not supposed to be that!
clearly unfamiliar with true canon
educate yourself.
not my job to give lectures to fill in the gaps in your tiktok\doomscroll fried zero attentionspan zoomer brain
>drunk\obsessed\schizoid no-genposter continues shidding\farding as per usual
ew.
>>105706488
>>105706539
>"neg: slow motion, static, still, incomplete, frozen, freeze-frame"
As opposed to every other gen in this thread, yours look like shit that could be made on fucking SD1.5. That's the clearest definition of slop.
You also NEVER, EVER, even post the slightest effort in your avatarfagging, like making the Rocket girls take any action outside of just posing, or making the scene interesting, adding pokemon and pokeballs in it etc.
>>105706576>yours look like shit that could be made on fucking SD1.5well guess why? local models are still shit
if you could read you would see that i reply\help people every thread anonkun
wanvid can be finicky
>>105706231looks like a screenshot from youtube spot on
>>105706604As long as you are not a vramlet or a mentally handicapped gooner like some of the users and the rocketnigger, that statement is thoroughly incorrect. You can do complex and interesting scenes with Chroma and Flux just fine, as shown time and again in several of the threads.
>>105706644v yucky fetish
>>105706629So the anon screaming about the sloppa being bad do i indulge them or just ignore?
where are the pixartsexuals nowadays
>>105706713Have a vampire
>>105706718what's the tldr of pixart models? and hunyuan dit for that matter? why no adoption?
>>105706424
>satisfaction
>>105706453
can it emulate the old unreal engine style?
>>105706705
>screaming
how do you scream on 4chan
>>105706003
>it does have a plastic skin problem
Have you tried upscaling using Chroma?
>>105706830tencent fucked us on licencing and Alibaba basically got them btfo. pixart was always a meme
bugged animations in this game
>>105706968He wanted lemon lime
Wan can't seem to make sense of the background and keeps wanting to change it.
>>105707013
add description of bg to your prompt + "stable camera"
or cutout in photoshop & move her to a bg of your own :3
>>105706858>le scramzshidpoast daily saying the same crap? ;3
>>105707128like the guy who asks op to add veepee?
>>105706858i2i a face using the goatse pic
In A1111 I remember in img2img there was a control for how much noise was injected into the source image essentially to control how much the generated image can diverge from the original - how do I set that up in comfy?
>>105707313
just look for the denoise setting (e.g. on the KSampler node)
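For reference, the A1111 slider and the ComfyUI denoise value mean roughly the same thing but count steps differently. The sketch below is written from memory of how the two UIs have behaved by default and should be treated as an assumption, not as their actual code. The function names are made up.

```python
# How "denoising strength" maps to sampler steps, as an ASSUMED simplification.
# A1111-style default: keep the schedule length, only run the tail of it.
# ComfyUI-style: stretch the schedule so the requested number of steps all run.

def a1111_style(steps: int, denoise: float) -> dict:
    run = int(min(denoise, 0.999) * steps)
    return {"schedule_len": steps, "steps_run": run}


def comfy_style(steps: int, denoise: float) -> dict:
    if denoise <= 0.0:
        # Nothing to denoise: the input image passes through unchanged.
        return {"schedule_len": 0, "steps_run": 0}
    total = int(steps / denoise)
    return {"schedule_len": total, "steps_run": steps}


if __name__ == "__main__":
    for d in (0.3, 0.5, 0.75, 1.0):
        print(d, a1111_style(20, d), comfy_style(20, d))
```

Either way, a lower value starts from a less noisy version of the source image, so the result stays closer to it.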
>>105707313You can't. Comfy is pos software and its dev is basically a money hungry jew.
>>105707305nice gag clip for a youtube video
>>105707231no, like no-gen complainer/saltmine shitposter schizoids
What am I doing wrong with chroma?
Photorealism: check
Aesthetics: check
Detailed backgrounds: check
anatomy: kick rocks
anyone know wtf this is - (*bias): last dimension must be contiguous
>>105707372already in my negative prompt
>>105707364try increasing cfg and steps
>>105707401Wow the first coherent one.
>>105707364
>>105707400
Might also want to try an ancestral sampler if you're not doing so already. I often see euler ancestral being able to fix bad anatomy during generation.
>>105707425
I use euler, does the sampler type matter?
>>105707400maybe the spaghetti girls are slightly better
man with sunglasses gets onto a black helicopter.
well, it's still true.
>>105707364
Try using res multistep for higher quality than Euler, or heun or dpm 2 for highest quality. Use at least 30 steps as opposed to 20, the more the better. Remember the model is not done training yet, and it attempts to be very dynamic, which is good, but that comes at the cost of mistakes. Turn image previews on so you can cancel obviously bad gens ahead of time.
>>105707441
Yes. See here for example (from OP): https://stable-diffusion-art.com/samplers/#Ancestral_samplers
It's not explained too well, admittedly, but basically the added randomness at each step gives the model more opportunity to fix weirdness during generation - at least in my experience specifically with Chroma and photorealistic generations.
If you have a disembodied leg somewhere, other samplers might be forced to just lock in and go with it, whereas with an ancestral sampler, there is more of a chance it can turn into a towel or something. Do some experiments with and without ancestral would be my recommendation.
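A minimal sketch of what "added randomness at each step" means, using the classic k-diffusion style formulation on a single float instead of an image tensor. Flow-matching models like Flux and Chroma are usually sampled with a different ancestral variant, so treat this as an illustration of the idea only. The scalar `denoised` stands in for the model's prediction.

```python
import math
import random


def ancestral_split(sigma_from: float, sigma_to: float, eta: float = 1.0):
    """Split the step to sigma_to into a deterministic part (sigma_down)
    and a fresh-noise part (sigma_up), with sigma_down^2 + sigma_up^2 = sigma_to^2."""
    if sigma_to == 0.0:
        return 0.0, 0.0
    sigma_up = min(
        sigma_to,
        eta * math.sqrt(sigma_to**2 * (sigma_from**2 - sigma_to**2) / sigma_from**2),
    )
    sigma_down = math.sqrt(sigma_to**2 - sigma_up**2)
    return sigma_down, sigma_up


def euler_step(x: float, denoised: float, sigma: float, sigma_next: float) -> float:
    d = (x - denoised) / sigma          # direction pointing towards the noise
    return x + d * (sigma_next - sigma)


def euler_ancestral_step(x, denoised, sigma, sigma_next, rng, eta=1.0):
    sigma_down, sigma_up = ancestral_split(sigma, sigma_next, eta)
    x = euler_step(x, denoised, sigma, sigma_down)   # overshoot below sigma_next
    return x + rng.gauss(0.0, 1.0) * sigma_up        # then add new noise back


if __name__ == "__main__":
    rng = random.Random(0)
    print(euler_step(3.0, 1.0, 10.0, 5.0))
    print(euler_ancestral_step(3.0, 1.0, 10.0, 5.0, rng))
```

Plain Euler keeps whatever noise pattern it started with, while the ancestral step replaces part of it every step, which is where the "second chances" come from. It is also why ancestral samplers do not converge to a fixed image as steps increase.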
>>105707400
>>105707456
Increasing cfg is bad advice. Results in overfitted slopped Flux look, not sure that is "better" anatomy.
>>105707495
>Increasing cfg is bad advice
experimenting with 0.1 increases/decreases works pretty often, you dont have to go overboard
>>105707484ah fuck me I forgot euler a exists
My CFG is 4.5, default from Chroma workflow, never felt need to experiment at least for photorealism cause it's perfect, but feel free to play with it
is using --listen argument and prompting with my phone over the internet safe?
>>105707523Oh wow ldg sure has all this technical knowledge...
>>105707537If you need to ask: no
But you have already compromised your system because you're a tard normie, so please go ahead.
>>105707549Oh wow ldg sure has all this technical knowledge...
>>105707539>>105707562Oh wow ldg sure has all this technical knowledge...
bout to make a big stinky
>>105707484
I just don't use Euler to avoid mistakes. In my case I've found it to be inferior to res multistep or dpmpp 2m. If I swap to Euler I always find the entire image quality to be significantly worse and never better.
>>105707368
NVM i found the problem, had to downgrade pytorch
>>105706299If you need to ask: no
But you have already compromised your system because you're a tard normie, so please go ahead.
https://ace-step.github.io/
Looking unlikely these guys will open source their next improved model. Looks like we will never get a local Udio tier model because no one wants to mess with record companies.
>>105707616
>next improved model
Where? And is it actually Udio tier or just another generic chinese pop rap dataset?
Why did you tell anon to increase CFG :(
>>105707631
>Where? And is it actually Udio tier or just another generic chinese pop rap dataset?
It's not really out. But those guys can easily make the model Udio tier, and I don't think it'd be local because they wouldn't have an incentive to release it. They are a small Chinese lab and there are millions to be made from such a model behind API.
>>105707655That's a shame. Basically the only hope for a true competitor to Udio is some random autist with a training rig and nothing to lose, which I doubt there are any out there.
>have one parameter which can go up or down
Normies are crying about it.
>>105707636INCREASE CFG. HIGHER. not under 6.9
>>105707581euler is ok, just needs more steps. but yeah dpmpp2m and other samplers seem to be better. really depends on the content tho
what distilled cfg do you guys use for chroma
>>105706231reminds me of a carmen san diego cutscene
>OmniGen2
>Finally getting better results than Closed AI 4o at home.
Sweet. Finally I can generate my obscure waifus from 1 single picture then train LoRas some more from synthetic data.
https://github.com/VectorSpaceLab/OmniGen2
>>105707305if you follow aoe2, this actually looks like something that would appear on memb's channel kek
>>105707313there are about 100 different ways to inject noise into an img2img process. need to be more specific.
>>105707731how fast is it?
done for the night catboxing this one just in case someone wants the prompt
https://files.catbox.moe/bw99dn.png
>>105707755~16 seconds, 4090
>>105707787
>Harpuea
What kind of a username is this?
lightx2v is only good for animated wallpapers and fake live2d
>>105707787is it censored?
>>105707787nice. so, according to my precise calculations, around 25 secs on my african space force 3090
>>105707833Not cencored. Nipples and pussy shows.
>>105707840What model is this? Box please?
>>105707848https://files.catbox.moe/7ddgvl.png
a man with sunglasses turns around and starts running far away.
>>105707787
>~16 seconds, 4090
wtf, takes me 4 min on my 3090, how??
>>105707939Flash attention and Triton installed?
>>105707942yes, I'll test the native Comfy one soon and see how much faster it is with sage
>>105707616>Looks like we will never get a local Udio tier model because no one wants to mess with record companies.yeah, the music industry is worse than the mafia, even the chinks are terrified about copyright when it's about music
>>105707731Can omnigen for instance fatten up a skinny girl? Asking for a friend
https://github.com/mit-han-lab/radial-attention
>We present Radial Attention, a sparse attention mechanism with O(nlogn) computational complexity. Radial Attention accelerates pre-trained HunyuanVideo by 1.9× at its default video length while maintaining comparable video quality. When generating 4× longer videos, it reduces tuning costs by up to 4.4× and speeds up inference by up to 3.7× versus dense attention.
ok now we're talking
>>105707968CANNOT and WILL NOT
>>105707968just gen something tasty, why bother. seriously.
t. dipping into chubby after having exploited.. everything else.
>>105707973
>accelerates pre-trained HunyuanVideo
It seems lots of researchers turned their heads to Hunyuan when it first released and spent the last few months with it, huh? Too bad that model was only relevant for like 3 months
On a side note, going forward, we will probably be receiving papers in the coming months using Wan as the base
>>105708002yep, it takes time to make such a project so I'm not surprised one bit, at least they're not shilling fucking Sora mini you know what I mean kek
To this day, I haven't seen any Wan gen nearly as soulful as this. This was Skyreels I think. Higher base fps makes quite a difference at times
>>105708016Wan as comparison
https://github.com/thu-ml/SageAttention/issues/190
I'm glad that those fuckers are getting shitstormed lol
>>105708044>Why are you all locking it down pretending like this is Flux Kontext with risk to public safety????Meanwhile in chink's world:
>Here, have our Omnigen 2, it can do nudity have fun!
>>105708034y aint it the same starting frame tho
>>105708016>>105708034I prefer Wan's one, the structure is better it has less AI weirdness
>>105708044
https://github.com/thu-ml/SageAttention/issues/190#issuecomment-3003693626
>the license in the gated HF repo seems to be unchanged. Weird.
https://huggingface.co/jt-zhang/SageAttention2_plus
>apache 2.0 licence
you know what this means? the first mf who gets their hands on the code can post it on github without much issues
>>105708104a sign of THE MAN to dive deeper into chubby huh. aw man. chubby koreans are tasty.
>>105708090Yeah, I forced a frame trying to get results as close as possible to the other one (also because I liked that smile better, heh)
But here is one with the same starting frame as the other
>>105707731
>better results than Closed AI 4o
obviously no pissfilter and vagene is amazing but does it also surpass it in other areas?
>>105708162comparing it to 4o is useless, the new SOTA model to compare with is Kontext pro
>>105707973
Wait so youre saying, longer videos + faster gen times all in one? Oh, keeping a fat eye on this one. This is the news I come here for
>>105708192
>Wait so youre saying, longer videos + faster gen times all in one?
yeah, the speedup over dense attention gets bigger the longer the video is (and the higher the resolution) because the attention layer is not quadratic anymore, if they managed to keep good quality on it that's a big deal
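A quick illustration of why the gap widens with length, comparing how dense O(n^2) attention and an O(n log n) scheme scale when the token count grows. This ignores constant factors and everything outside attention, so it says nothing about real wall-clock numbers (the measured figures are the ones quoted from the repo). The function names are made up.

```python
import math


def dense_cost(n: int) -> float:
    return float(n) * n


def nlogn_cost(n: int) -> float:
    return n * math.log2(n)


def growth(cost, n: int, factor: int) -> float:
    """How many times more work the same scheme does when the sequence gets `factor` times longer."""
    return cost(n * factor) / cost(n)


if __name__ == "__main__":
    n = 2**15  # an arbitrary example sequence length
    for factor in (2, 4, 8):
        print(f"{factor}x longer: dense {growth(dense_cost, n, factor):.1f}x work, "
              f"n log n {growth(nlogn_cost, n, factor):.1f}x work")
```

So a 4x longer sequence costs 16x under dense attention but only a bit over 4x under n log n scaling, before any quality trade-offs.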
>>105707973The code's already there, right? Can't kijai or comfy integrate it into a node themselves?
>>105707973zased, that'll fix the issue with riflexrope being a piece of shit that likes to loop
>>105708183
4o is still ranked above gpt-image. this is like saying base flux beats dall-e kek.
>>105708218
>4o is still ranked above gpt-image.
how? it changes the image (instead of editing it like kontext pro) and that piss filter is awful
>>105705748
>https://github.com/comfyanonymous/ComfyUI/pull/8669
how can we make multiple image input comfy? your workflow doesn't have that