Discussion of Free and Open Source Text-to-Image/Video Models
Prev:
>>106190450https://rentry.org/ldg-lazy-getting-started-guide
>UISwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.com
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows
>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe
>WanXhttps://github.com/Wan-Video
2.2 Guide: https://rentry.org/wan22ldgguide
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y
>Chromahttps://huggingface.co/lodestones/Chroma1-Base/tree/main
Training: https://rentry.org/mvu52t46
>Illustrious1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/
>MiscLocal Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage
>Neighbourshttps://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg>>>/b/degen>>>/b/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
>>106193870 (OP)b l e s s e d
t h r e a d
o f
f r e n z o n e ;3
>>106193885stop reportbombing my posts
i generated that cutie for a user here
how can he find her is she is balate?!
>https://files.catbox.moe/a1qv37.mp4
fingers and emma seem ok with annealed
>>106193958real schizos never stop
Yet another ComfyUI option that isn't default set to this making the ui everything but comfy
>>106193983stop what? mass-reporting? acting in bad faith? LARPing?
being an all around nuisance?
what do they never stop? (besides fapping)
enlighten me
anime girl opens a box of pizza and eats a slice.
>>106194008and max ui fps is limited to 120 at max, lmao
>>106194012its not supposed to be that serious mate
but he MAKES it this way;
now i HAVE to be here
because he doesn't
want it to happen;
its a mexican -
s t a n d o f f
;3
>>106194012schizoing... and... posting mikus i guess...
>>106194026why did you post "https://desuarchive.org/g/thread/106190450/"
i dun get it
nta btw
>>106193934it sadly seems gover ;c
if we get tardposters corrupting the categories (intentionally) it ruins it for everyone else
gen jam should be ONE topic
and one chosen by the wheel
>>106194021And also no matter what the limit is, the gpu seems to work ~5% harder if there is a limit on rendering the ui even at 30fps rather than if its uncapped at 165hz/fps
>>106194040>post got report-bombed>i didnt even notice until he asked for 'catbox' :c
file
md5: f39e202d02d5fa114144281350ecc4e8
๐
>>106194056i dont get it, why are you posting a desuarchive link of previous op inside previous op?
can i get a /ldg/ discord welfare check? these threads are getting slow...
file
md5: 8fe8007e429fb0b264ae90ff47d672d0
๐
>>106194081>>106194074so you can see which posts were removed anon
>my pizza is cold, you bitch!
>>106194140one post that he didnt ask catbox for was removed aswell
are you a femboy?
>>106194181hello, i do not want my posts removed when they are on topic; if that happens you will get a desulink.
thank you for playing.
goodbye ;3
>good luck figuring out which collage-bait im cooking up this time muehehee
>>106194153better output:
>>106194206c-c-chroma does all t-this? anon-sama? ;_;
please do mai shiranui
>>106194217are you a femboy
>>106194202>>good luck figuring out which collage-bait im cooking up this timeglad to see the therapy is working :)
After testing it Chroma actually works well and stabilizes if you're using the flash lora.
Reposting
>>106194334Here's seed in question (same problem with every seed I tried on that prompt)
https://files.catbox.moe/lvkdi9.png
>>106194345I'm not aware of that
I haven't tried the annealed version yet
>when you see a white person in the UK:
>>106194340Sadiq needs a part and parcel sign to go with this.
>>106194074not much of what he does makes sense, best to just ignore
a man with a robotic arm lights a cigarette and smokes it.
so is radial attention snakeshit or does it work
>>106194447where are her organs
>>106194506where did it come from, where did it go?
how does he get so much smoke from a cig
whats the difference between chroma annealed and flash-heun?
Fake VACE
>Fake VACE
https://huggingface.co/CCP6/FakeVace2.2
>>106194550because vaping was probably also in the training data
>>106194140this is a good example of what they're taking about
https://desuarchive.org/g/thread/105895325/
>>106194519It does work but it's got annoying resolution restrictions.
>>106194574>vibe coded>2 days ago>no example outputsdownloading now
>the jokes are over, now it's your turn
>>106194012you have decided to be a bad doggo
The default settings for Diffusion pipe may be incorrect for Chroma v50, loras are broken when training Chroma v50 (HD)
Any ideas?
>>106194730it's good he doesn't shoot the camera because then there'd be no footage
I don't tend to notice it as much anymore but there's a lot of wayward lavalier mics in videos
>>106194710kek, I'm just waiting for real vace. wan 2.2 fun is already out, quality looks decent but its still massive in file size
>>106194380>>106194363No luck with annealed. I'll reword the prompt to see if it improves
https://civitai.com/models/1598938?modelVersionId=1967502
Forgecucks? Now you don't have excuse!
a man holds up a sign saying "LDG".
>>106194878but it's just custom nodes that I have to bloat even more? who does this actually help if it just adds to the amount of shit that can potentially break in comfy?
How do you prompt for camera angles in Chroma? Any examples of how I should do it? Right now it's just ignoring anything I tell it to do in that regard.
>>106194931a man drinks a beer from a beer can on his desk.
neat, it worked
>>106194950STOP USING FORGE RIGHT NOW!
YOU ARE NOT COLLABORATING!
a man throws his PC monitor on the floor.
>after using chroma from months i hear that you should use "aesthetic 11" in positive and "aesthetic 0" in negative
'night and happy latent space dreams
>>106194877Okay, so from what I understand the Chroma dev shrank the dataset size for this version. Since now if I remove the neg entirely, I get a very fuzzy looking image for that prompt even when asking for a photograph. So my guess is that I need a much stronger neg (but I tried a bunch of new words and it barely improved), and the v50/v50 annealed are strongly biased towards drawings. Here is one of my usual footfag prompts I test with no changes in neg. Yeah, that is exactly what is happening.
>>106195017I doubt it matters by might as well try
hm. either I suddenly can't prompt for shit or there's something wrong with the final chromas. i've already swapped to the new vae.
>>106194883>>106194837>>106194583>oh yes, Iodestones, I'm going to download your shitty model to do the same thing SDXL does, but it takes 10 times longer!
>>106195097>new vaeThere's a new vae for it?
>>106195087wow, left has the "chroma look" that people have been so optimistic about, while right looks like it wants to converge on the standard blown out slop look. eww.
>>106194290no, that's really cool
enjoying fluid motion, interactions, shades so much
>>106195087Can you give me the deets on that gen so I can play around with it? Catbox would be cool, too.
The v50 version looks rather disappointing.
I'm still running the scheduler/sampler tests right now and there's some promise in there, for sure, though.
don't make me do a single vote survey per ip to see who uses chroma...
>>106195087Is this normal v48 or v48-detail-calibrated?
the man holds up a plate with a McDonalds cheeseburger and McDonalds fries on it.
reviewbrah could AI generate his videos and just collect the money desu
just bought 128GB of RAM
that means tomorrow they will release Wan 2.3 which requires 256GB
>>106195087I find that v49 gives me the best results, which kind of makes sense since it's the one trained at 1024 resolution without any merges against other highres experiments.
>>106195087You guys still don't get it do you? The classic "Chroma look" of the left image that some people like, is solely due to the fact that the model was trained at 512 res but you're genning at 1024. This manifests as essentially additional noise added at all stages of the diffusion process. In the low noise timesteps, the model diffuses this additional noise into realistic-looking texture rather than something that looks overly smooth.
The right is the "correct" version. It matches what most of the images in the training data look like. The same process that is making the left image look realistic also makes it fuck up anatomy and have random details in the image that don't make sense. You just prefer the look it has on the textures, without realizing all the other aspects of the image that it's hurting.
Chroma v50 has much better anatomy and more consistent details than any previous version, because it's actually trained at the resolution people gen at. The solution to textures that are too smooth is a carefully curated aesthetic finetune that avoids any slopped looking training images.
>>106195147why does it matter tho
>>106195231NTA but can you post a better comparison showing that v50 is in fact superior
>>106195209the cheeseburger is smaller than it seems!
>>106195141Sure, here's gen anon
https://files.catbox.moe/bip9ro.png
Keep in mind the fake skin issue shows up other places in other prompts as well, but those are isolated cases (pic rel), I've never seen it this bad. I've tried that footfag prompt consistently across every Chroma version and this is the first time I've seen a slopped result.
I migrated my 3090fe to a 5090fe, and the difference in speed is simply insane.
For the following conditions on wan2.2 i2v (fp8) :
3090@260W
5090@460W
720p/113 frames/4+5 steps/cfg 1
3090: ~25 min
5090: ~6 min
Power limiting the 5090 at 80% gives me 93% of the performances from my tests. So I just limited to that, it's summer and the nvidia melting cable thing is still in my mind.
I also did some tests for block swapping in wan2.2 (fp8) text to image.
211.67 seconds 30 blocks swapped
200.14 seconds 10 blocks swapped
193.55 seconds 0 blocks swapped
The difference is pretty small, surprising to see the model not being fully in vram having this little of an impact.
I guess it means I can have 40 swapped blocks + long 161 frames videos for example on wan without having to get a 48GB+ card once the repetition problem is solved.
>>106195264post v48/v50 version of the Filipino slut
>>106195264Thanks brother. Appreciate it.
I'm at 12/17 sampler/scheduler plots now and I'll run some tests on your setup afterwards.
>>106195231From my understanding, that would imply that we could get a similar result genning at a bigger resolution again, due to the additional noise we'll be gaining again, right?
the man takes a slice of pizza and eats it.
using the 2.2 i2v lora, pretty good desu
>>106195312very nice, thanks for posting these tests
crazy speedup
..however
>460wdang, my whole rig consumes that much
>I guess it means I can have 40 swapped blocks + long 161 frames videos for example on wan without having to get a 48GB+ card once the repetition problem is solved.have you tried radial attention?
>>106195264Nta
Hey man, try this workflow instead, slight difference but I had success with having less of the grain with it I think
https://files.catbox.moe/d03j58.json
>>106195231OK, so why does the left image obviously have much more detail? Look at the hair!
>>106195312but the fp8 quality sacrifice is large, q8 is almost full quality
I finally found out why my prompt had so little effect on 2.2 image to video: setting the cfg to >1 solved the problem, suddenly my prompts were having quite the impact.
I thought this wasn't a requirement anymore but it is, which is very annoying since it almost doubles my gen times.
>>106195327this time, less impressed:
the man throws the pizza out the window behind him, and shakes his head in disgust.
>>106195312just wait till jenga, radial attention and sage 3 fixed drops, speed be nutty
>>106195390NTA but I always thought FP8 had more quality than Q8. Is it the other way around?
Even though it can sometimes want to gen cartoony images, there is some body horror, and there's a little gacha like with all other models, Chroma was always good enough for a lot of things, picrel v48.
From initial testing the new versions do seem to be better too but I'm genning something else now so I can't test directly.
>>106195406works on my machine
>>106195357>very nice, thanks for posting these testsNo problem, it's the stuff I wish more anons posted so I might as well do it.
>have you tried radial attention?No, is it supposed to solve that?
>>106195390I don't think it really matters when quite the sacrifice was made when I started using lightx2v.
And I'm using scaled fp8 which is like q8 and close to fp16 from what I understand.
>>106195315She's meant to be Japanese. Anyways, I think Chroma v48 does slop quite a few of those as well, as do other versions (though I recall it happening less on v36), but every now and then there's realistic looking gen.
That being said, with v50 I've not gotten an unslopped result with that prompt. Here's catbox to check performance across different seeds yourself
https://files.catbox.moe/qu35uz.png
why do image to video first frames always look so bad/weird?
>>106195223Trying v49 now
>>106195420With fp8 scaled there's no difference in quality, with normal fp8, q8 is better quality, but it will always be slower than fp8 / fp8 scaled
adf
md5: 99a524a7fff4d456761b44e2b3f6310b
๐
I like to prototype larger prompts with changing parameters, but I found the solutions for wildcards too complicated for quick use. I made this myself, but are there any decent solutions that make it just as easy for me with more functionality?
>>106195455>scaled fp8 which is like q8Would like to see that comparison
>>106195420picrel
>>106195419Yeah but hopefully the quality won't be too much.
I know sageattention 3 speedup is constrained by looking noticeably worse compared to sage2++.
>>106195464Try v49, it's the version trained at 1024 without any weird merges
>>106195486it is good and it is faster
see vidrel
>>106195455>No, is it supposed to solve that?yeah
>>106195486That's not fp8 scaled, that's an old comparison using plain fp8 from ages ago you downloaded from reddit, I know, I read the post back then as well
>>106195467I wish I knew, maybe it's the video creation shitting up something?
Either way, the first frame always look blurry to me to.
Poll:
https://poal.me/z3ek04
https://poal.me/z3ek04
https://poal.me/z3ek04
https://poal.me/z3ek04
https://poal.me/z3ek04
since you're talking about chroma, how do I get a realistic photo orc? I can only get it in 2d or 3d.
file
md5: 6289fa362ad8baa2ad3547e0f5c4a57b
๐
>>106195539Be really specific about it being a photo, about what camera is being used to take the photo, about how the photo was taken, etc. Do something like:
>A candid amateur photo taken with a Sony Cybershot>A high-quality digital photo taken with a Fujifilm X100V using a Fujinon 23mm f-2 lens
the man puts down the pizza on a table and covers his face with his palms.
>my day is ruined
>>106195539Add photo, film still etc to your prompt, and if needed you could add things like: sketch, drawing, illustration, painting, art, cartoon
to your negatives, there is a LOT of control to be had with negatives
>>106195486>>106195509>>106195455Seems like scaled still takes a hit, but might be ok depending on what you want to gen
https://www.reddit.com/r/StableDiffusion/comments/1gc0wj8/sd35_large_fp8_scaled_vs_sd_35_large_q8_0_running/
>Wanna jump with me anon ?
>>106195510great gen, whats the prompt/lora for the digi photo look?
>>106195464Alright, v49 is downloading, but before that here's this prompt
>Amateur photograph of a stunning Japanese alt emo idol with noodle cup in her hand, she is in her couch, day time, bright, she is on fours, view from the front of her face, her soles are up in the airHere v50 does look a little sharper, but I'm not sure if being more polished is what you want, after all Qwen images look clean as well but they are more slopped.
the man closes the pizza box. The top of the box says "LDG pizza" with a logo of Miku Hatsune on the box.
almost
>>106195734holy esl what a prompt
>>106195505OK thanks anon.
>>106195734Alright, slightly altered the prompt to add panties to make it SFW. Much better results on this prompt overall, but win seems to go to v48. v49 is not bad though, will try other prompts
>>106195775i was talking about FP8_scaled btw, ggufs are slower
>>106195766great gen anon
>>106195231>>106195316I think he is right. Changed the resolution from 1024 and the flux gens seem to have stopped.
>>106195788Well, v49 is v48 + one epoch of 1024 resolution training
v50 and v50 annealed are frankenstein merges with other high resolution training tests, they might be better for some things, but I haven't come across any yet
>>106195788It seems that subsampling the dataset obliterated the organic look Chroma had
>W-who are you ? And what do you mean with /ldg/ ?
>>106195734your prompt is shit.
shit goes in - shit comes out
>>106195788try higher than 1024 res for the 49/50
is unipc still the recommended for wan2.2?
>>106195773>>106195986bros wtf am i supposed to be prompting??
i dont want to pay for grok just to gen with chroma
>>106196009try lcm and see which u like better
>>106196018you can literally use chatgpt for free you iliterate twat.
> in her couchIN her couch?! what.
> is on fourson what fours? it's "she is on all fours" but this still doesn't make sense because you say her feet are up, retard.
honestly. 99% of issues is always the person prompting, not the model.
>>106196043>>106196018can you show that anon how you properly prompted for a jap woman
>>106196030Why did you enlarge it ? I can see the fucking pixels
>>106196052though the iliteracy and obvious laziness make sense, given he's a footfag
>>106196056just use any free llm, chroma needs lots of yapping i think
>Amateur snapshot of a striking Japanese alt-emo idol with vibrant dyed hair and dramatic makeup. She's positioned on all fours on a cozy couch during bright daytime lighting, with her face directly facing the camera and soles of her feet prominently visible in the air. Her delicate hands clutch a steaming instant noodle cup with chopsticks. Soft natural light streams through windows, creating gentle highlights on her porcelain skin. Background features cozy domestic setting with throw pillows and blankets. Style: candid amateur photography with shallow depth of field, slightly overexposed bright tones, and intimate framing. Emphasis on youthful energy, kawaii aesthetic meets alternative fashion, and playful domestic moment. High-detail capture, authentic snapshot qualitycopypaste this prompt https://github.com/QwenLM/Qwen-Image/blob/45bccb917f87c40fb557a4f0d5bff8350047332c/src/examples/tools/prompt_utils.py#L42
>>106196116you might have the tism
Prompt
>Amateur photograph, a stunning Japanese female cosplaying as sailor, sitting in front of a restaurant at night, she is holding a drink with left hand, and food with right hand, she is candidly laughingneg
>3D, render, drawing, vintage, bokeh https://files.catbox.moe/qbpnxq.png
With that, I conclude v48 is best for photorealism, yeah.
>>106195986I know it's shit, but no amount of prompt engineering is going to fix the sloppiness, look at
>>106194380 it was VLM generated, and the output looks even worse with no negs.
>>106196030>>106195788>>106195734>>106195665All this talk about Chroma gens reinforces my argument that what you're proposing with your blatant shilling doesn't bring anything new to the local ecosystem or NSFW. All these images can be created with 10 times better quality in SDXL with any checkpoint finetuned for realism, much more quickly and easily and less heavier.
Stop wasting time with this crap.
>>106196130This could be do with SD 1.5
>>106196138Then use your SDXL, nobody is stopping you
Better yet, post your SDXL gens to convince people
Just stop bitching about Chroma like an absolute mental case
>>106196138Go anon.
Show me your SDXL versions. I will make it easier for you, I will not post a catbox that shows your obvious lying.
>>106196138tell me more about this sdxl
>>106196129you definitely have sickle cell anemia
file
md5: 35e089ea399e62b4d01eba13a9996a6c
๐
Fix v50 lora. My dataset sucks though.
>>106196153I mean, if anon can really do this with SDXL or 1.5, I want to see it.
file
md5: 1dcd50484a0a29b82c98ae7e7fc8d0af
๐
>>106196185Whoops wrong gen. Meant to post this one.
>>106196185it has that 90's tv movie vibe going
>>106196163Not explaining myself to you, backed by entire AI sites. You're wasting your time, faggot. Any thing you share about new chroma version are worst and worst. You are lost, unable to fix your product.
>>106196185>>106196203w-whut does she look like in leggings or perhaps panties tho
Some of my chroma 50 gens come out super blurry, what's up with that? Never had that before.
>>106196225Your whole life is focused on Chroma failing, like how sad is your existence ? You can just not use it, how can you be this obsessed ?
Imagine caring about what models a random anon may or may not use
>>106196230are you using a gguf from the reddit post? some of them had that issue and got deleted later
>>106196246Because he can't run Chroma. He's never posted a gen either. He's just a troll
>>106194984his muscles were just smoke
he is fat as fuck
>>106196246You are shilling a product, dev, expect reviews good and bad. And here is a bad one(or mayne a real one not made by you), deal with it. The quality is dropping, your posts show it and for every fix you attempt ten things worsen. You are lost, you don't know what to do to improve your product.
I appreciate chroma and qwen but the anime models simply have more sovl than them, even the most random shitmix
lodestone should be dragged out on the street and shot
>>106196324That's a very compelling example, anon.
>>106196316kek, I'm not the 'dev' of Chroma, your mind is fractured, in fact I'm in no way affiliated with Chroma or any other model, unless you count visiting their discord channels
>>106195860Didn't entirely eliminate it, but it made it very less likely that you do get an organic looking gen.
>>106196364In fact, if you train a realism LoRA, you probably wouldn't notice this issue.
file
md5: e5736197670aebf7b205e9cb319f6d41
๐
>>106196348arguing here is pointless. there are far too many mentally unstable people here to hold an actual conversation or god forbid a real argument that doesn't devolve into "nu uh".
i will continue to use chroma just to make the idiots suffer.
lodestone it is time for the qwen finetune please
Can anyone explain the process or the workload of integrating image loras into a wan node tree?
I am running wan2.2 i2v, but sometimes a character in the input image has their eyes closed, which means their eye information is lost. When this is the case, I adjust the prompt to make sure their eyes remain closed for the whole video.
Assuming there was a good quality illustrious lora for the character in question, could i integrate it and somehow let wan know that a specific character from the input image should inherit lora data from a specific lora? Maybe something like how region mapping + controlnet works with masking, except for i2v videos.
what I remember most from the SD to HD transition was how many actors' careers didn't survive it. brad pitt's craggy face didn't hold up well, while catherine zeta jones's uninteresting perfection became a showcase display for the benefits of the new medium.
the results I'm getting with v50 remind me of that. what looked pretty good two epochs ago with the 512px beer goggles on now highlights every liverspot and cheesy toenail. it's like the "gross-ups" from ren & stimpy.
I'm haven't cracked myself up this much since chroma first launched. what a fucking hoot.
>That's a man, baby
Would you anon, knowing it's a dude ?
>>106195464good testing anon, I'll stick with v48
how do you anons manage to use lightx2v without destroying obvious details and motion like hair moving in wan2.2? or even faces for that matter
I can't find a way to make it less horrible in impact
what's the deal with this qinglong business? I usually use sigmoid if I need fine details
I wish testing this stuff properly didn't take hours to days
>>106194984the more he is animated the more he looks like george michael.
file
md5: 85c6bcc5bac70c655e3cdd933d8448ef
๐
what a SLVT
>>106196575Wholesome chungus!
>>106196575this is a children's board oh noes
"can your sd1.5 model do that"
>>106196638that's just your average anons feet
>loving wan
>think gens are smooth af
>best of the best
>trying to rip pic off insta for img2prompt
>notice its ai genned, view desc
>#midjourney
>havent seen that in a while, lets see where its at now
>new video model
>absolutely gorgeous and best ai vids ive seen so far
bleak for local
> A hyper-realistic POV shot, grainy imperfect 2000s CCTV footage, from a ceiling-mounted security camera inside a brightly lit grocery store. A large male cat head is looking at a female customer. Fluorescent halogen lights cast cold bluish tones.The camera is stationary, with a subtle VHS-style time-stamp overlay like 'CAMERA 02 2009/04/17 14:32:07'. No dramatic music, just low ambient hum. Aspect ratio 4:5 vertical. Color grading should match old security footage with muted, slightly greenish hues.
>>106196722Nah Hailuo 02 is superior to that slop. It's the best video model ever made
https://youtu.be/5yI9wEys2dc?t=251
>>106196792wow you werent kidding, this can seriously be used professionally wtf
>>106196819Yeah, China shitted on Veo 3 a while back, it's wild
https://xcancel.com/WuxiaRocks/status/1935298213613027521#m
>>106196832> june 18ththe chinks are fucking cracked wtf?? the reflections and physics are perfect AND has sound????
>>106196792the chinaman is on FIRE, goddamn! when local?
>>106196893a redditor faggot already did it. both this and the other are completely shit
Chroma is a subpar model for realism and styles. It has deformed feet and hands, poor proportions, and zero community support. Why use a mediocre model when you can get better results faster with embeddings or any SDXL model? Tags are better.
>>106196792but the boring minimax, doesn't even sell infinite usage for Hailuo 2. I definitely prefer the free wan,kek
when will the wan chinks fix the overtalking problem? things getting a little weird now.
https://litter.catbox.moe/h33yt9eyfauh53fu.mp4
Is there a way to replicate the dynamic calculation of SNR to switch from the HIGH wan model to the LOW one like the official repo does?
>>106196997A trick I've been trying is do a vace vid 2 vid gen with multitalk but give it a silent audio clip. Still sucks you gotta do that though.
The flan-t5xxl changes things too much
>>106196348Your product has no market, no target, no audience, no strengths, and is below average.
Here and out there, people use SDXL and WAN. Furries and bronies stick with it too.
Shilling here and replying to my posts shows your failure and your product's failure.
Call me what you want, I'm behind a screen and until proven, you're a Chroma dev
>>106196832Is something close to this even possible in wan 2.2? Does it knows what a skate grind is?
>>106197052It's a finetune, it is supposed to do that. It's like saying Pony/Illustrious changes things too much from SDXL. Well, duh?
>>106197061Wan 2.2 is not even Veo 2 tier.
>>106196906Yeah I just got done testing them. Absolute garbage. Wan low noise v2v unironically better.
go let your chatbot do something else for a bit anon
>>106197065meant to reply to this, wan2.2 t2v
>>106196792Hailou 02 test
any way to get Wan2.2 to stop generating a png along with my mp4? i'm using kijai's workflow from the rentry guide
>skip a few threads
>seeing lightx2v lora update
>come back
>every clip same shitty slow-mo
When it ends bros?
>>106197186chinks make great foundational models. they make the shittiest optimizations however
>chroma-unlocked-v50.safetensors
is this the final version?
>>106197186just make it run at a higher fps lol
>>106197202we know it works but it looks like shit
>>106197200unfortunately yes
https://huggingface.co/lodestones/Chroma/discussions/100
>struggling with v50
joever
https://huggingface.co/lodestones/Chroma/discussions/99
>yeah v50 is the final "base" model, speed model will come after
>>106197221pls... just two more epochs
>>106197251what processing did you do on this anon? Looks great, good length too. what was original output and what was it frame interp'd to?
where is the chroma v50 fp8 version?
>>106193870 (OP)is there any LLM AI model that doesnt need a GPU to do this? i have a shitty PC with no gucci video card why cant i just make pics with the CPU?
>>106197248>https://huggingface.co/lodestones/Chroma/discussions/100>literal who promptlet complaining This thread gets enough of that already. Who cares.
Man v50 really just released with just some general attention, the hype completely died out. Props for him actually fronting money but man, I really still feel model creators muck around too much with training and just keep going despite irredeemable mistakes because of sunk cost fallacy in the thousands of dollars.
>>106197265this is just wan 2.2 i2v, i took the last frame of the first gen, genned again and just concatenated the two videos
frame interpolation with FILM VFI node
playing around with img2img with wan2.2 and llm prompt interrogation from original image
>>106197285If it hadn't released around the same time as Wan 2.2 or Qwen then it would've gotten more attention.
where is the chroma v50 nunchaku version
>>106195231unless the dataset is full of synthsloppa, there is no way the image on the right is its distribution
Is 2x24GB enough to train a Chroma lora?
>>106197288wow, couldnt even see the seam, when ive tried prior it would always have slightly different camera movement which would make it a blatant splice
>>106197285>body horror anatomy still present with more than 1 subject or non-close up shots>v50 somehow more slopped with plastic skin and ultra bokehi don't know what i expected
>>106197275https://huggingface.co/Clybius/Chroma-fp8-scaled/blob/main/v50/chroma-unlocked-v50_float8_e4m3fn_learned_svd.safetensors
>>106197311A single 24gb gpu is enough
>>106197323svd?? qUANT?!?!?!
>>106197291original always on left, right is wan upscale
> 0.45 denoise, anime tokens taken out of prompt
Maybe I'm missing something since other people keep swearing that v50 is perfect, but the only way I can get Chroma to be functional and not give me extreme body horror or artifacts is by using the Flash models/loras. But then you don't have a negative prompt so the fuck's the point?
>>106197316yeah the seam is there but hard to spot if you're not looking for it. i think adding (shaky selfie cam:1.2) to the prompt helped in this instance
but otherwise it's a bog-standard wan gen, with the 2.2 lightning loras at 1 strength and 3+3 steps
does chroma work with flux lora and flux redux? these are the reason why flux ecosystem is so good
>>106197346preciate the info anon, thanks
>>106197354I've seen people say they were able to get both working, yeah. I think it kind of depends on the lora
What should I type into the positive prompt if I want to gen images like this one?
Is the Chroma 1-HD just renamed V50?
>>106197367they really fucked up with the constant talking thing, it ruins a lot of videos
>>106197339is left a gen?
>>106197339>>106197416screencap from Evangelion
>>106197367did you overcompress this or is that the normal output?
>>106197380strange. no one talks in my realistic gens. i'll try anime
>>106197339why even try this? there are far better image upscalers. This seems like an exercise in retardation.
>>106197460not an upscale, its just 1:1 img2img with wan and basic denoising, just curious to see how it handles it
>>106197472ignore the retard anon keep going
>>106197472going to try for prompt only style changes now, probably realism <-> anime to start
>>106197472Boo, nta, but I thought that was prompt interrogation.
>>106197453yes only happens in anime or cartoon
>>106194386I want to know more