
Thread 106628570

348 posts 192 images /g/
Anonymous No.106628570 [Report] >>106631184
/ldg/ - Local Diffusion General
Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106625151

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
AniStudio: https://github.com/FizzleDorf/AniStudio

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2122326
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Anonymous No.106628594 [Report] >>106628643 >>106628652 >>106629247 >>106629715 >>106629894
Seedream thread
Anonymous No.106628597 [Report] >>106628619 >>106628694
hello, im a newbabretardfuckingidiotmongoloid who just started messing with onetrainer, is this the base model for sdxl i should download?

https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0/tree/main
Anonymous No.106628619 [Report] >>106628637
>>106628597
i wish you the best of luck on your quest, newbabretardfuckingidiotmongoloid !
Anonymous No.106628622 [Report] >>106628704
what nag node do i use
Anonymous No.106628637 [Report]
>>106628619
but you didnt answer the question AAAAAIIIIIIIIIIIIIEEEEEEEEEEEEEEEEE

i'll figure it out ;)
Anonymous No.106628640 [Report] >>106628650 >>106628669 >>106628671 >>106628679
Total ComfyCloud API Node Victory
Anonymous No.106628642 [Report] >>106628651 >>106628744 >>106628953
Anonymous No.106628643 [Report]
>>106628594
this scares the chroma foot faggot kek
Anonymous No.106628650 [Report] >>106628665
>>106628640
How does a 14B model need that much memory to run?
Anonymous No.106628651 [Report]
>>106628642
computer, add a sonichu medalion without changing the rest of the image
Anonymous No.106628652 [Report]
>>106628594
from the thumbnail, i thought it was a giant ribbed dildo.
Anonymous No.106628662 [Report] >>106628851 >>106628953
For the lulz here's all 68 images from previous in a single collage.
Anonymous No.106628665 [Report]
>>106628650
VRAM requirements increase to widen the SaaS moat. Do not let those filthy localhoards cross!
Anonymous No.106628669 [Report] >>106628680
>>106628640
>64 × 180 GB = 11,520 GB
is this a joke or something?
Anonymous No.106628671 [Report] >>106628680
>>106628640
>5B is 20 gigs
>14B needs 11TB
??
Anonymous No.106628679 [Report]
>>106628640
I guess he means to serve every request? Because otherwise this makes zero sense.
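
Back-of-envelope for the numbers quoted above: weight memory is roughly parameter count times bytes per parameter. This is only a sketch that counts weights (no activations, text encoder, VAE, or serving replicas). `weight_memory_gb` is a made-up helper name, and the precisions are assumptions, not something stated in the screenshot.

```python
def weight_memory_gb(params_billion: float, bytes_per_param: float) -> float:
    """Approximate memory for model weights alone, in GB (1 GB = 1e9 bytes)."""
    # billions of parameters * bytes each = billions of bytes = GB
    return params_billion * bytes_per_param


# 14B parameters at a few common precisions (assumed, not from the thread)
for name, nbytes in [("fp32", 4), ("bf16", 2), ("fp8", 1)]:
    print(f"14B @ {name}: {weight_memory_gb(14, nbytes):.0f} GB")

# The quoted figure: 64 units of 180 GB each (presumably GPUs)
cluster_gb = 64 * 180
print(f"cluster total: {cluster_gb:,} GB")
```

Under these assumptions 14B in bf16 is about 28 GB of weights, so 11,520 GB only makes sense as an aggregate across a whole serving cluster, not as a per-model requirement.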
Anonymous No.106628680 [Report]
>>106628669
>>106628671
Just stop thinking about it, you cant run it regardless so lets all just calm down and subscribe to ComfyUI API.
Anonymous No.106628691 [Report] >>106628696 >>106628953
Anonymous No.106628694 [Report] >>106628732
>>106628597
If you pick the SDXL preset, the field will be filled in automatically, and the first time you start training it will download the model automatically
Anonymous No.106628696 [Report]
>>106628691
I mean poland and serbia are really white, but no one want to go there lol
Anonymous No.106628704 [Report] >>106628754
>>106628622
none theyre all snake oil
Anonymous No.106628732 [Report]
>>106628694
i already downloaded every file here

https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0/tree/main

so i'll find out soon enough
Anonymous No.106628733 [Report] >>106628953
Anonymous No.106628744 [Report] >>106628775
>>106628642
Is this the banana
Anonymous No.106628754 [Report] >>106629421 >>106629457
>>106628704
i installed it and it's not, it works as advertised. what i'm not understanding now is why you would use it instead of using one of the speed loras that let you use cfg>1, since nag basically doubles gen times. maybe i'm missing something though
Anonymous No.106628775 [Report]
>>106628744
nope qwen + this https://civitai.com/models/1934100/anime-to-realism?modelVersionId=2189067
Anonymous No.106628814 [Report] >>106630898
Anonymous No.106628827 [Report] >>106628865 >>106631043 >>106632660 >>106632818
https://huggingface.co/fredconex/SongBloom-Safetensors
https://github.com/fredconex/ComfyUI-SongBloom
we got Suno at home?
https://files.catbox.moe/96i90x.flac
https://files.catbox.moe/olajtj.flac
Anonymous No.106628851 [Report]
>>106628662
Lovely
Anonymous No.106628863 [Report] >>106628938
Can you train the same lora with same settings and datasets and get different results or does retraining do nothing?
Anonymous No.106628865 [Report] >>106628873
>>106628827
Not bad, from these samples it's better than Ace-Step 1.0 (will 1.5 ever be released?)

Wonder what the range of music is, and most importantly if it can be effectively finetuned with other music
Anonymous No.106628873 [Report] >>106628895
>>106628865
>Wonder what the range of music is
you can put a real music in there and it'll remix it, I find it fun to play with
Anonymous No.106628895 [Report] >>106628911
>>106628873
Does it require music input ?
Anonymous No.106628911 [Report]
>>106628895
it's not mandatory
Anonymous No.106628925 [Report]
Anonymous No.106628926 [Report] >>106629032
Anonymous No.106628931 [Report] >>106628963 >>106628975
hunyuan image looks better now in comfy

https://github.com/comfyanonymous/ComfyUI/pull/9882
Anonymous No.106628938 [Report] >>106628950
>>106628863
If you are talking about the same model, a training run with the same dataset will make pretty much the same lora at the same epoch.
Anonymous No.106628944 [Report]
does nag not work with chroma flash?
Anonymous No.106628950 [Report]
>>106628938
Yeah same model. So it's just about resolution and scheduler?
ポストカード !!FH+LSJVkIY9 No.106628953 [Report] >>106632894
>>106628733
thank god
someone finally trained a lora that i actually WANT (sure feels like forever ;D)
>>106628642
>he makes posts like this but gets mad at migusama ;3
>>106628662
n e a t !
>>106628691
ew!
Anonymous No.106628963 [Report]
>>106628931
>hey look Comfy I fixed your implementation of that model!
I bet 20 dollars he won't merge it, again
https://github.com/comfyanonymous/ComfyUI/pull/7965
Anonymous No.106628975 [Report] >>106629816
>>106628931
impressive, that mf saved HunyuanImage
Anonymous No.106628982 [Report] >>106629041
more and more people realizing comfyui is no longer about local models
Anonymous No.106628999 [Report] >>106629012 >>106629023
The man in the red tie raises his arms, and the various trays of fast food on the table in front of him float in the air.

behold my power!
Anonymous No.106629000 [Report] >>106629036 >>106629082 >>106629088 >>106629092 >>106629135
https://xcancel.com/LodestoneE621/status/1968687032605065528#m
>Peak GPU mem: 16,139 1,736 MB (on dummy mlp forward pass)
>Speed ratio: 0.99× (compute & comms perfectly interleaved)
GET OUT!
Anonymous No.106629012 [Report]
>>106628999
I don't know how he managed to stay healthy after spending 80 years of his fatass life eating only McDonald's.
Anonymous No.106629023 [Report] >>106629046 >>106629053
>>106628999
that looks like something i shat out with sd1.5 circa 2023 what are you DOING nigger?
Anonymous No.106629032 [Report]
>>106628926
Nice Huke style.
Anonymous No.106629036 [Report]
>>106629000
So what’s the obvious drawback he is choosing to ignore? Because we know from chroma that there were many
Anonymous No.106629041 [Report] >>106629050
>>106628982
No, it's only you, since all local models are supported, often within hours, and it even supports local models that aren't even close to being finished training

There is a LOT to complain about when it comes to Comfy, but it's not local model support, which is stellar

Go lie somewhere else
Anonymous No.106629046 [Report] >>106629071
>>106629023
Sorry meant this for rocketpajeet
Anonymous No.106629050 [Report]
>>106629041
truth nuke, and I say this as someone who don't really like this autistic bitch
Anonymous No.106629053 [Report] >>106629071 >>106629084 >>106629095 >>106629129
>>106629023
the source image isn't very good/high quality.
Anonymous No.106629071 [Report]
>>106629046
didn't notice him, pretend i quoted him too.

>>106629053
wan can take images from the 1900s and turn them into (masterpiece:1.5), its just (You)
Anonymous No.106629080 [Report]
after the botched implementation of hunyuan image and chroma, more and more people are switching to better UIs where output quality comes before model quantity
Anonymous No.106629082 [Report]
>>106629000
Has the furry solved the 'too little vram' problem ?

Big if true
Anonymous No.106629084 [Report]
>>106629053
McDonald Trump
Anonymous No.106629088 [Report] >>106629131
>>106629000
as much as I question the furry's decisions, he honestly seems like a good researcher
Anonymous No.106629092 [Report]
>>106629000
How is this gonna work when even DDR5 is glacially slow compared to gpu vram?
ポストカード !!FH+LSJVkIY9 No.106629095 [Report] >>106629139
>>106629053
the amount of salt this originally caused will always be so funny to me
Anonymous No.106629129 [Report]
>>106629053
So this is it. This is the true power of Americans. My god..
Anonymous No.106629131 [Report] >>106629134 >>106629136 >>106629180 >>106629285 >>106629815
>>106629088
He's not a fucking researcher, he just half-bakes shit he saw in a paper or that some guy on discord forwarded to him and never explains his reasoning before autistically moving on to the next snakeoil.

Why am I the only person who sees this?
Anonymous No.106629134 [Report]
>>106629131
>Why am I the only person who sees this?
you aren't, haven't you noticed the amount of seething everytime he made a retarded move on chroma's training? kek
Anonymous No.106629135 [Report] >>106629220
>>106629000
isnt this what flash attention does? you can only move things from ram->vram so fast, i dont see how this will work
Anonymous No.106629136 [Report]
>>106629131
You aren't
Anonymous No.106629139 [Report] >>106629186
>>106629095
It's funny because I don't know anyone in the entire history of ever who didn't love going to McDonalds after playing sports as a kid. Hell, even as an adult.
Nobody wants to be sucking down on weird french shit after a big game.
Anonymous No.106629142 [Report] >>106629163 >>106629252
https://github.com/comfyanonymous/ComfyUI/pull/9898
>Reduce Peak WAN inference VRAM usage
>The first git commit alone improves performance some and the second further increases it. I standardized on 1024x1024 for the image size and varied the frames. Before the changes the maximum number of frames it can handle is 49 and this increases it to 65 for my setup.
based
Anonymous No.106629163 [Report] >>106629172
>>106629142
>WAN2.2 I2V 14B Q4_K_S GGUF + lightx2v 4steps LoRA (based on video_wan2_2_14B_i2v template) 1024x1024x61frames video generation
>wan q4
>61 frames
grim, also is 1024x1024 one of the "officially" listed resolutions? i dont think so
Anonymous No.106629172 [Report]
>>106629163
>grim, also is 1024x1024 one of the "officially" listed resolutions? i dont think so
it's not, I guess he just wanted to do a test
Anonymous No.106629180 [Report] >>106629196
>>106629131
>he just half-bakes shit he saw in a paper or that some guy on discord forwarded to him and never explains his reasoning before autistically moving on to the next snakeoil
that's what most professional researchers do
ポストカード !!FH+LSJVkIY9 No.106629186 [Report] >>106629211 >>106629222 >>106632116
>>106629139
fav timeline: finding out the supersize me guy gained weight and was destroying himself from being a wastoid\boozer not from eggmcmuffs

>free unlimited mcdonalds when they first launched the app
>i ate so much mcdond i should be dead
>i lost around 3lbs kek
eggmcmuff has real egg >;3


good, simple, pure, fun times
Anonymous No.106629193 [Report] >>106629206
what does that have to do with image generation? nb4 spergout
Anonymous No.106629196 [Report]
>>106629180
They also document their shit.
ポストカード !!FH+LSJVkIY9 No.106629206 [Report] >>106632116
>>106629193
the images generated were about\of mcdond and the time trump was being cheeky

its too bad about the face detail from far away
i would imagine things will finetune\tighten in the next few quarters\months
Anonymous No.106629211 [Report] >>106629239
>>106629186
>supersize me guy
The guy was an absolute fraud.
Tbh, I think McDonald's gets way too bad a reputation for no real reason. Their chicken McNuggies are as close as you get to bare basic "Human food" and I don't mean that in a bad way.
Anonymous No.106629212 [Report] >>106629786
Anonymous No.106629220 [Report] >>106629232 >>106629243
>>106629135
No, Flash attention is all about keeping things in the GPU as much as possible

This is about being as efficient as possible when you have to offload parts of a model to ram. This is not a new optimization concept, it's been around in most trainers and inference tools for quite a while; the difference is that this claims to have near zero overhead

If the claims hold up, this would be enormous
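
A toy timing model of why "near zero overhead" depends on overlap. It assumes every layer has the same compute time and the same RAM-to-VRAM transfer time, and that the next layer can be prefetched while the current one runs. `step_time` and all the numbers are hypothetical, not taken from the linked post.

```python
def step_time(n_layers: int, compute_s: int, transfer_s: int, overlap: bool) -> int:
    """Total time for one forward pass over offloaded layers (arbitrary time units)."""
    if not overlap:
        # Load a layer, run it, load the next one, run it, and so on.
        return n_layers * (compute_s + transfer_s)
    # With prefetching: the first layer must arrive before anything runs,
    # then each layer's compute hides the transfer of the next layer,
    # and the last layer only has compute left.
    return transfer_s + (n_layers - 1) * max(compute_s, transfer_s) + compute_s


all_in_vram = 10 * 2                                # no transfers at all
serial = step_time(10, 2, 1, overlap=False)         # transfers block compute
pipelined = step_time(10, 2, 1, overlap=True)       # transfer faster than compute
bandwidth_bound = step_time(10, 1, 2, overlap=True) # transfer slower than compute

print(all_in_vram, serial, pipelined, bandwidth_bound)
```

In this model the overhead only vanishes when each transfer is no slower than the compute it hides behind. Once the transfer is the slower side, the pass is bound by RAM bandwidth no matter how well it is interleaved.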
Anonymous No.106629222 [Report] >>106629233
>>106629186
You cannot pay me enough to try an egg mcmuffin
Anonymous No.106629232 [Report]
>>106629220
>If the claims hold up, this would be enormous
like, he's using a paper to make this node or something?
ポストカード !!FH+LSJVkIY9 No.106629233 [Report] >>106629352 >>106632116
>>106629222
you can make one at home with a cookie cutter and egg and 2 slices of cheddar...
surely you eat breakfast sandwiches anon ;3
ポストカード !!FH+LSJVkIY9 No.106629239 [Report] >>106629268 >>106632116
>>106629211
>vegan girlfriend guy was a drunk fraud
WOW hahahah
Anonymous No.106629241 [Report]
Anonymous No.106629243 [Report] >>106629272
>>106629220
>No, Flash attention is all about keeping things in the GPU as much as possible
>This is about being as efficient as possible when you have to offload parts of a model to ram
These are not different, the highest efficiency possible IS to keep everything in vram as much as possible while offloading when you need to. Given the ~10x speed difference between gpu and ram, and the big cost of moving data from one to the other, this can't really be anything new; i doubt it's even a better FA
Anonymous No.106629247 [Report]
>>106628594
Miyazaki chill
Anonymous No.106629252 [Report] >>106629261
>>106629142
just tested it, went from 22.3gb of usage to 21.7gb, it's not much but I'll take it
Anonymous No.106629261 [Report] >>106629269
>>106629252
speed diff?
Anonymous No.106629268 [Report]
>>106629239
bigot
Anonymous No.106629269 [Report]
>>106629261
it's the same, but now I can make it slightly faster by offloading less to the ram I guess
Anonymous No.106629272 [Report] >>106629315
>>106629243
>These are not different
Yes they are, Flash Attention optimizations are ALL about optimizing WITHIN the GPU. It has the benefit of fitting more into vram, but it has no strategy whatsoever for when it doesn't fit into vram

Offloading optimizations are specifically for when it doesn't fit into vram

So no, they are very different
Anonymous No.106629285 [Report] >>106629349
>>106629131
Anyone who has ever trained the exact same NSFW concept dataset in a lora at both 512x512 and 1024x1024 on Flux for the same number of epochs is aware that the rate of anatomy errors will always be drastically higher with the 512x512 one unless you actually inference at 512x512. Chroma simply didn't get anywhere remotely close to enough training at 1024x1024.
Anonymous No.106629303 [Report] >>106629367
Anonymous No.106629315 [Report]
>>106629272
You're right, I must have confused FA with some other tech I read about a long time ago
Anonymous No.106629327 [Report] >>106629330 >>106629360 >>106629404 >>106629639
>ACK!
Anonymous No.106629330 [Report]
>>106629327
BASED
Anonymous No.106629349 [Report]
>>106629285
More HD training would have been exponentially more expensive, but I'm not sure it's that. The HD version is fucky and slops prompts, but the anatomy is noticeably better than Base. Base is still better because you can fix a fucked hand with more steps and better prompting.
Anonymous No.106629351 [Report] >>106629369
Is there an ideal resolution ratio I should set for WAN videos to not come out fuzzy as shit or otherwise lose their shit in Comfy gens?
Anonymous No.106629352 [Report] >>106629639
>>106629233
I'll lose the cheese and double-side fry the egg
also streaky bacon
cheese and egg, especially cheddar, do not mix imho
Anonymous No.106629353 [Report]
>slow mo video
Anonymous No.106629360 [Report]
>>106629327
fk yeah
Anonymous No.106629367 [Report]
>>106629303
Flux Krea full or fp8?
Anonymous No.106629369 [Report]
>>106629351
just dont use anything below q8 and it shouldnt be a big problem, but anyway, 720x1280 or 1280x720
Anonymous No.106629373 [Report] >>106629411 >>106629639
wan is slowly making the troon more female
Anonymous No.106629403 [Report]
i take back what i said many threads ago, i in fact love chroma again and just had a workflow skill issue. not only that but chroma is really good at inpainting.

i would post an example but my gens are too strong for you, proompter.
Anonymous No.106629404 [Report] >>106629639
>>106629327
Anonymous No.106629411 [Report]
>>106629373
kek, it's true

based Wan
Anonymous No.106629421 [Report]
>>106628754
>since nag basically doubles gen times
no it doesn't??
Anonymous No.106629457 [Report] >>106629465
>>106628754
NAG is a buffed neg prompt, not a cfg1 hack.
Anonymous No.106629465 [Report] >>106629469
>>106629457
>not a cfg1 hack.
it works for models with cfg 1 though
Anonymous No.106629469 [Report]
>>106629465
working with=/=enabling
Anonymous No.106629559 [Report] >>106629584 >>106629678
>SRPO is 1st on the trending page
>HunyuanImage isn't even on the list
I hope Tencent is gonna learn from that and will give us kino next time
Anonymous No.106629584 [Report]
>>106629559
SRPO is relevant only because of the unfucking method. It's literally just flux without it.
Anonymous No.106629620 [Report] >>106629639 >>106629717
Anonymous No.106629626 [Report] >>106630049
ポストカード !!FH+LSJVkIY9 No.106629639 [Report] >>106629641
>>106629620
they used to be SO mean to me for showing panties\chonies ;3
>>106629404
>>106629373
>>106629327
the image saddens me every time i gaze upon it.,..
>>106629352
im more of a softboiled\poached kinda guy personally hehe
Anonymous No.106629641 [Report] >>106629658 >>106629689
>>106629639
can you leave
Anonymous No.106629658 [Report]
>>106629641
Can you ? Someone who offers absolutely nothing to these threads
Anonymous No.106629678 [Report]
>>106629559
Yes I will take your leaderboard b8
ポストカード !!FH+LSJVkIY9 No.106629689 [Report] >>106629712 >>106632116
>>106629641
sunset complete friend
i'll miss ya <333

>love one another
>read the gospel daily
THE
END
D R A W S
N E A R . U S. A L L .
Anonymous No.106629712 [Report]
>>106629689
no :(
Anonymous No.106629715 [Report]
>>106628594
Based SaaS API node Enjoyer
As our master and Sensei Comfy, we adapt and know how to enjoy our local SaaS technology.

Welcome to the future, welcome to:
/sldg/ - SaaS Local Diffusion General
Anonymous No.106629717 [Report]
>>106629620
really nice style
Anonymous No.106629786 [Report]
>>106629212
badass
Anonymous No.106629815 [Report]
>>106629131
You are a NAZI
You are a PEDO
You are a SCHIZO
You have to KYS
You are the cancer of /ldg/
Anonymous No.106629816 [Report]
>>106628975
To be clear, right is only less slop. It is disingenuous to call it trvesovl.
Anonymous No.106629854 [Report] >>106629865 >>106629949
be the change you want in the world, post milfs
Anonymous No.106629865 [Report]
>>106629854
Are you sure?
Now
Right now?
Is it time?
Anonymous No.106629894 [Report] >>106629929
>>106628594
Giga based

Two days ago I moved to /sdg/ much better thread quality, there are schizos but at least it's more diffusion oriented, this thread is like a church or sect. Example threads >>106627772 >>106627168 >>106624229

There are already various anons from here dual posting.
Anonymous No.106629902 [Report]
Anonymous No.106629903 [Report] >>106629942
Anonymous No.106629912 [Report]
Anonymous No.106629917 [Report] >>106629925
>106629894
>random colorful nonsense pictures yay :D
the absolute state
Anonymous No.106629925 [Report]
>>106629917
brilliant
Anonymous No.106629929 [Report] >>106629941
>>106629894
Yes, you're absolutely right... /sdg/ is much healthier and more fun
/sdg/ discuss local and cloud without discriminating and with a more open mind
Anonymous No.106629932 [Report] >>106629950
Anonymous No.106629940 [Report] >>106629983
the man holds up a white sign saying "BUY SKYRIM, OR ELSE!"
Anonymous No.106629941 [Report] >>106629961
>>106629929
What happens is that they actually love AI diffusion and technology in general. They're not seething faggots who censor people, call them schizos, or shill trash models like Chroma just because it's local.
Anonymous No.106629942 [Report]
>>106629903
you should try some of these with sneedream
Anonymous No.106629949 [Report] >>106629955 >>106629959 >>106630006
>>106629854
Anonymous No.106629950 [Report] >>106629966
>>106629932
Kino. Prompt?
Anonymous No.106629955 [Report] >>106630006
>>106629949
oof... those seams and the green line
Anonymous No.106629959 [Report] >>106629968 >>106630001
>>106629949
whut model? lumina?
Anonymous No.106629961 [Report]
>>106629941
Yes I'm /sdg/ now an anon is genning a workflow that mixes API nodes and Qwen image editor. It's great what you can achieve mixing local and cloud technology. And most importantly without ideological prejudices.
Anonymous No.106629966 [Report] >>106629976
>>106629950
by Ansel Adams, , orbit
dawn on Titan
Steps: 28, Sampler: Euler, Schedule type: Simple, CFG scale: 1.1, Distilled CFG Scale: 3.5, Seed: 3797686900, Size: 1472x712, Model hash: 4610115bb0, Model: flux1-dev, Version: f2.0.1v1.10.1-previous-669-gdfdcbab6, Module 1: ae, Module 2: clip_l, Module 3: t5xxl_fp16
Anonymous No.106629968 [Report] >>106630420
>>106629959
Anonymous No.106629971 [Report]
tfw you find a new artist to train on
Anonymous No.106629976 [Report]
>>106629966
ty
Anonymous No.106629983 [Report] >>106629997 >>106630008
>get new gpu
>ecstatic that it just works >>106629940 in reforge
>jump to comfyui after needing to wait a little while longer with the hype building up

>errors out the fucking anal cunt over how i tried to go about installing sageattention/triton

>finally get around them(?)

>now errors out the wazoonus about ??? in every wan2.2 workflow

i now understand why anons are driven to schizophrenia over the mere thought of a new UI this is unusable dogshit
Anonymous No.106629997 [Report] >>106630004 >>106630008
>>106629983
You forget the most important thing, people gate keep their workflows and don't help
Anonymous No.106630001 [Report]
>>106629959
Chroma testing my lora epochs
Anonymous No.106630004 [Report] >>106630022
>>106629997
Have you tried asking for help in /sdg/?
Anonymous No.106630006 [Report] >>106630016 >>106630070 >>106630083 >>106631655
>>106629955
>>106629949
yeah, protip. if you have the VRAM, just don't use ultimate upscale. use tile or blur controlnet and upscale directly in the intended resolution.
I'm feeling generous so here's the workflow for this, the workflow works for any art style if you use the right model and tweak accordingly:
https://files.catbox.moe/s9qkzc.png
Anonymous No.106630008 [Report]
>>106629983
like HEH how does it get this busted?!

>>106629997
there's plenty of already working workflows out there, its niggerware comfyui or trannyware python that decides to waste a bit more of your limited lifespan
Anonymous No.106630016 [Report]
>>106630006
>blur controlnet
thats a new one for me thanks anon
Anonymous No.106630022 [Report] >>106630047
>>106630004
No, never, in fact I've never gone, they told me they're more oriented towards cloud and SaaS fagging.
Anonymous No.106630047 [Report]
>>106630022
Oh no! They told you wrong anon! Actually we're super versatile here! We have anons generating with Chroma >>106629622 also anons generating anime with WAI then with See Dream >>106629729 and some animate their videos with WAN! >>106624803

We have really fun here!
Guess who else is here? >>106629141
Anonymous No.106630049 [Report] >>106630073
>>106629626
How many images do you usually like to have for your LoRAs?
Anonymous No.106630054 [Report]
just to be clear thats not a wan gen kek its anidiff
Anonymous No.106630070 [Report] >>106630083
>>106630006
>error in image
Catbox dead again?
Anonymous No.106630073 [Report]
>>106630049
Anon you like anime? Look the same person is here!! >>106620604 maybe you can ask him the same question there!
Anonymous No.106630075 [Report] >>106630130
Anonymous No.106630083 [Report] >>106630177 >>106631655
>>106630070
>>106630006
shit, here https://litter.catbox.moe/55sskg5uohf7vjok.png
Anonymous No.106630117 [Report]
Anonymous No.106630130 [Report] >>106630140
>>106630075
outstanding vistas
Anonymous No.106630140 [Report] >>106630248
>>106630130
He is also in /sdg/!
Anonymous No.106630146 [Report] >>106630921
Anonymous No.106630149 [Report] >>106630155 >>106630166 >>106630208 >>106630212 >>106630231 >>106630275
Wan-Animate page up.

>https://humanaigc.github.io/wan-animate/
Anonymous No.106630152 [Report]
Anonymous No.106630155 [Report]
>>106630149
animatorbros.............................................................its over
Anonymous No.106630166 [Report] >>106630214 >>106630256
>>106630149
...
Anonymous No.106630177 [Report] >>106630193 >>106632457
>>106630083
I don't use comfy
Anonymous No.106630193 [Report] >>106630198
>>106630177
lol, have fun with broken gradio trash then
Anonymous No.106630198 [Report]
>>106630193
Meds onigai
Anonymous No.106630204 [Report] >>106630220 >>106631843
So prompting in French sort of works for Chroma, but you need to set your cfg ~40 to 60% higher than normal and the results will not be as good for the most part, plus it still needs a little bit of English guidance in there to not turn out like a messy SD1.5 gen. This is obviously not what Chroma was trained to do and is not really an optimal prompting strategy, but it was fun to try. Apologies to anyone who actually speaks French if they should happen to read my very translator-plus-chatbot-assisted prompt, I ofc do not speak French

>Photo floue du plus beau décolleté de ma cousine Hélène, 20 ans, à la maison de campagne du Lac Léman, 2001, debout au bord du lac. Elle est canon avec des seins incroyables en maillot de bain deux-pièces! [hot panting emoji] cute college girl up close
Anonymous No.106630208 [Report] >>106630311 >>106630743
>>106630149
>https://huggingface.co/Wan-AI/Wan2.2-Animate-14B
Now it's out.
Anonymous No.106630212 [Report]
>>106630149
imagine what anons will do with this. taking videos of themselves doing disgusting things
Anonymous No.106630214 [Report]
>>106630166
wasnt trained on unique ghibli style, thats def the worst vid in the otherwise very impressive examples
Anonymous No.106630219 [Report]
the man on the right shoots a black pistol at the man on the left, who falls to the floor. the blue text at the bottom is unchanged.
Anonymous No.106630220 [Report]
>>106630204
my french gf
Anonymous No.106630223 [Report]
Holyshit I dont believe it, is that radial attention getting real updates?
Anonymous No.106630231 [Report]
>>106630149
Huh, this looks kind of good? No "fun" in the name either.
Anonymous No.106630237 [Report]
t minus 24 hours until someone recreates goatse but with an anime character
Anonymous No.106630238 [Report] >>106632457
Need to update the denoising on high res fix to preserve the style
Anonymous No.106630248 [Report]
>>106630140
No, I'm not.
Anonymous No.106630256 [Report]
>>106630166
looks good
Anonymous No.106630275 [Report]
>>106630149
Animators on suicide watch
Anonymous No.106630277 [Report]
Anonymous No.106630278 [Report] >>106630402
xibros... are we tired from winning?
Anonymous No.106630283 [Report] >>106632457
Anonymous No.106630311 [Report] >>106630371
>>106630208
>dual clip
At least it's not dual model again. Thank god.
Anonymous No.106630315 [Report] >>106630368 >>106632929
Anonymous No.106630339 [Report] >>106633709
Anonymous No.106630344 [Report] >>106630359
The first person to post a gen from Wan animate will get SO many (You)s.
Anonymous No.106630359 [Report]
>>106630344
1 for each python dep and cuda allocation error he manually has to hardcode the fix for with qwen-code first until the codebase works probably
Anonymous No.106630368 [Report] >>106630438
>>106630315
top gen
Anonymous No.106630371 [Report]
>>106630311
>At least it's not dual model again. Thank god.
I think that's because they made that from wan 2.1
Anonymous No.106630387 [Report]
If I wanted to make a lora for comics/manga paneling, do I need to caption the scenes in the panels or would the layout and number of panels be enough?
Anonymous No.106630402 [Report]
>>106630278
too bad anime is still 3dcg. but the potential is great now
Anonymous No.106630403 [Report]
gguf, motherfucker, where is it?
Anonymous No.106630420 [Report] >>106631583
>>106629968
Anonymous No.106630438 [Report]
>>106630368
ty
Anonymous No.106630449 [Report]
Anonymous No.106630459 [Report] >>106630550
time to wait for a shitty comfy implementation, a shitty slowass lightning lora, so its another month for decent gens
Anonymous No.106630488 [Report]
Anonymous No.106630510 [Report]
https://files.catbox.moe/6hsk2p.png
Anonymous No.106630513 [Report]
Anonymous No.106630550 [Report]
>>106630459
Kij will do it first.
Anonymous No.106630569 [Report] >>106630605
imagine the upgraded rocketgirl dances
Anonymous No.106630605 [Report] >>106630632 >>106630689
>>106630569
porn parodies will be hilarious. fuck the meme dances
Anonymous No.106630607 [Report]
Anonymous No.106630617 [Report]
Anonymous No.106630632 [Report] >>106630730
>>106630605
porn parodies? how about full blown hentai with perfect fluid animation now??
Anonymous No.106630670 [Report]
How do you upscale your videos?
Is it possible to upscale with denoise?
Anonymous No.106630689 [Report]
>>106630605
>fuck the meme dances
na i accept everything, the potential to do funny shit is endless if you have the imagination.
Anonymous No.106630693 [Report]
>tfw 8gb Radeon RX 6600
Anonymous No.106630702 [Report] >>106630746 >>106630748 >>106630778 >>106630790 >>106631100
is this true?????
https://files.catbox.moe/154lqh.mp4
Anonymous No.106630730 [Report]
>>106630632
no it still sucks ass at anime and even the animated stuff lacks the exaggeration expected from even western cartoons. very soulless and uncanny.
Anonymous No.106630743 [Report]
>>106630208
Kijai got access to it before comfy lmao, oh oh no no API sisters?
Anonymous No.106630746 [Report]
>>106630702
These takes used to be hot perhaps two or so years ago. "It's ability" to "extrapolate" is more mature than what he portrays but I just realized I'm replying to a post that uses multiple question marks in a row so I will stop
Anonymous No.106630748 [Report] >>106630771
>>106630702
Yes, it's true, but I think he's describing a scenario where the use case for this tech is weakest.
Anonymous No.106630771 [Report]
>>106630748
To follow up from this: I wonder whether the strongest case for AI video will end up being like AI images, where touching up manual edits to give them that hard-to-fake 'texture' of reality can enormously enhance an image with very little effort. Maybe someone will 95% mock up some cheap CGI for a scene and then do some light passes with some kind of "video inpainting" model to 'take off the rough edges'.
Anonymous No.106630778 [Report]
>>106630702
the idea that it can only generate what it was trained on comes from a lack of experience desu
Anonymous No.106630790 [Report] >>106630799
>>106630702
not to mention the carbon footprint!
Anonymous No.106630799 [Report]
>>106630790
Every time I hit generate on my AI generation software my GPU asks me to feed it a tall glass of fresh water
Anonymous No.106630849 [Report]
>Gens 1 girl

https://www.youtube.com/watch?v=2sBFkfcB8WA
Anonymous No.106630880 [Report] >>106630891 >>106630913 >>106630931
so does comfy just not add new models anymore and just lets these researchers make garbage nodes?
Anonymous No.106630891 [Report]
>>106630880
Comfy adds tons of new models to API nodes
Anonymous No.106630898 [Report]
>>106628814
always liked your work.
Anonymous No.106630913 [Report] >>106630942
>>106630880
He usually gets the models first so he has time to implement to native, but seems like wananimate went to Kijai first lmao
Anonymous No.106630921 [Report]
>>106630146
nice
Anonymous No.106630931 [Report]
>>106630880
too busy impregnating chink spy gf
Anonymous No.106630942 [Report]
>>106630913
sounds great, any links if i don't know where kijai posts?
Anonymous No.106631043 [Report]
>>106628827
Nice. How much vram does this require to run?
Anonymous No.106631100 [Report]
>>106630702
>video of astronaut riding giant lizard on mars
No, it in fact knows how to extrapolate unknown information from data, that's literally the point of generalization. And anyone would know this after training even a basic LoRA on a concept the model knows nothing about, for example, you can train a LoRA only on anime images and magically the model can create photorealistic images of that character too, same facial structure and everything. Qwen Image is particularly good at that.
Anonymous No.106631125 [Report] >>106631140 >>106631390
The future
Anonymous No.106631140 [Report] >>106631390
>>106631125
Could this model in theory turn white western roasties into beautiful delicate asian women?
Anonymous No.106631184 [Report]
>>106628570 (OP)
My waifu!
Anonymous No.106631189 [Report]
God I feel so comfy bros
Anonymous No.106631197 [Report]
Comfortably accelerating my workflows with API nodes, strategically tapping in to the most powerful SaaS to deliver maximum results
Anonymous No.106631211 [Report] >>106631230 >>106631280
I'm so proud of the boys
Anonymous No.106631230 [Report] >>106631331
>>106631211
why can't comfy use a toilet like a regular person?
Anonymous No.106631237 [Report] >>106631324
Anonymous No.106631258 [Report] >>106631263 >>106631265 >>106631266 >>106631306 >>106631343 >>106631881
what is this?
https://github.com/lodestone-rock/RamTorch
Anonymous No.106631263 [Report]
>>106631258
That is it. I'm sick of this guy. His assets should be seized and his access to a computer should be limited by a court of law.
Anonymous No.106631265 [Report] >>106631294
>>106631258
Code created by one of the most genius minds of the furry community
Anonymous No.106631266 [Report]
>>106631258
wtf I love chroma now
Anonymous No.106631280 [Report]
>>106631211
Why would they censor their faces? We know what they all look like lel
Anonymous No.106631290 [Report] >>106631316 >>106632835
Am I able to generate video with ComfyUI using generally the same settings as I do with images?
Anonymous No.106631294 [Report] >>106631311
>>106631265
race condition? do i need to be white or brown to use this?
Anonymous No.106631306 [Report]
>>106631258
Already discussed earlier in this thread
Anonymous No.106631311 [Report]
>>106631294
it's just two 20-line wrappers around torch, vibecoded too from what I can see, picrel
Anonymous No.106631316 [Report] >>106631321
>>106631290
What the fuck is this stupid question?
Anonymous No.106631321 [Report] >>106631360
>>106631316
I'm retarded
Anonymous No.106631324 [Report] >>106631534
>>106631237
but can you make it look real and not like a render tho
Anonymous No.106631331 [Report]
>>106631230
Easy to use litterbox while vibecoding
Anonymous No.106631343 [Report] >>106632693
>>106631258
Kohya:
>RamTorch is really amazing. I thought it might be possible to completely replace block swap, but it seems that block swap still has advantages in cases where the transfer time is longer than the computation time, such as image generation (because it can transfer blocks that are far away). However, it looks very promising for training (with large batch sizes) and video generation.

So, perhaps not an improvement for image generation, but sounds like a big optimization for training and video generation
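To put rough numbers on the transfer-vs-compute point, here's a toy timing sketch. Big caveats: this is NOT how RamTorch or block swap are actually implemented, the function name and all the timings are made up for illustration, and it only models a simple "copy the next block while the current one computes" scheme with two GPU buffers. It doesn't capture the "transfer blocks that are far away" advantage Kohya mentions.

```python
def streamed_time(n_blocks, transfer, compute):
    """Total time to run n_blocks whose weights are streamed from system RAM.

    Toy assumptions (hypothetical, not taken from any real library):
    - every block takes `transfer` time units to copy and `compute` to run
    - one copy engine, one compute stream, two weight buffers on the GPU,
      so block i+1 can be copied while block i is computing
    """
    if n_blocks <= 0:
        return 0.0
    transfer_done = []
    compute_done = []
    for i in range(n_blocks):
        # The copy engine is busy until the previous copy has finished.
        prev_transfer = transfer_done[i - 1] if i >= 1 else 0.0
        # The buffer being reused is free once block i-2 has finished computing.
        buffer_free = compute_done[i - 2] if i >= 2 else 0.0
        transfer_done.append(max(prev_transfer, buffer_free) + transfer)
        # Compute needs both the previous block done and this block's weights.
        prev_compute = compute_done[i - 1] if i >= 1 else 0.0
        compute_done.append(max(prev_compute, transfer_done[i]) + compute)
    return compute_done[-1]


if __name__ == "__main__":
    n = 40
    for label, t, c in [("transfer-bound", 2.0, 1.0), ("compute-bound", 1.0, 4.0)]:
        resident = n * c  # everything already in VRAM, no streaming at all
        print(f"{label}: streamed={streamed_time(n, t, c)} resident={resident}")
```

With these made-up numbers the transfer-bound case takes 81 units against 40 with everything resident, so the copies dominate. The compute-bound case takes 161 against 160, so the streaming is almost entirely hidden, which lines up with the "promising for training with large batch sizes and video" part.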
Anonymous No.106631353 [Report]
Anonymous No.106631360 [Report] >>106631630
>>106631321
but my question stands, can I do it?
Anonymous No.106631388 [Report]
Anonymous No.106631390 [Report] >>106631440
>>106631125
>>106631140
>Could this model in theory turn white western roasties into beautiful delicate asian women?
Remember when people were discussing replacing the ugly black Little Mermaid with a white redhead? I think it's literally possible to do it on a single 5090 at 720p. Assuming a moderate amount of gacha for consistency and you could probably make a minute of progress per day

Will try this out myself soonish since I'm burnt out from video gooning. I want to turn the orcs from Lord of the rings into Muslims but I'll practice on The Little nogmaid first
Anonymous No.106631440 [Report] >>106631475 >>106632014
>>106631390
How does it work with multiple people I wonder. How does it know who to replace?
Anonymous No.106631475 [Report] >>106631636
>>106631440
might have to keyframe a mask around the subject... should be quick with blender?
Anonymous No.106631534 [Report] >>106631773 >>106631861
>>106631324
I prefer the 3dcg blender style over the anime hyperrealistic or photorealistic that I've been doing for ages.
Anonymous No.106631583 [Report]
>>106630420
nice
Anonymous No.106631630 [Report] >>106631638
>>106631360
I don't even know what your question means. Same settings? Have you even tried video yet?
Anonymous No.106631636 [Report]
>>106631475
idk why I'd use blender for that when comfy has plenty of tools for it.
Anonymous No.106631638 [Report] >>106631648
>>106631630
actually I ran into a new issue where I can't launch it because I got this after updating and I'm not sure how to fix it because I tried installing python again and it didn't work

AssertionError: Torch not compiled with CUDA enabled
Anonymous No.106631645 [Report] >>106631698
Anonymous No.106631648 [Report] >>106631649
>>106631638
You downloaded the wrong pytorch.
Anonymous No.106631649 [Report] >>106631678
>>106631648
all I did was run the script that ComfyUI came with
Anonymous No.106631655 [Report]
>>106630006
>>106630083
>404 twice
Guess I will never know what a non Ultimate Upscale upscale workflow looks like
Anonymous No.106631678 [Report] >>106631682
>>106631649
Okay, what is your GPU and python version?
Anonymous No.106631682 [Report] >>106631700
>>106631678
RTX 4070, I'm unsure of my python version, I'll upgrade to the newest one right now and see if I can replicate it.
Anonymous No.106631698 [Report] >>106631711 >>106631881
>>106631645
https://huggingface.co/Comfy-Org/Wan_2.2_ComfyUI_Repackaged/blob/main/split_files/diffusion_models/wan2.2_animate_14B_bf16.safetensors
https://huggingface.co/Kijai/WanVideo_comfy_fp8_scaled/tree/main/Wan22Animate
if anybody needs em. i wanna play with it but no time yet.
Anonymous No.106631700 [Report] >>106631703
>>106631682
https://pytorch.org/get-started/locally/

Make sure you get the right pytorch version for the cuda and python version you are using.
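If you want to see what you actually have installed before reinstalling anything, here's a minimal sketch. The helper name `diagnose_torch` is made up for this post; the torch attributes it reads (`torch.__version__`, `torch.version.cuda`, `torch.cuda.is_available()`) are real. Run it with the same Python that ComfyUI launches with, otherwise you're checking the wrong environment.

```python
import importlib
import platform


def diagnose_torch():
    """Collect the facts needed to pick the right PyTorch build."""
    info = {
        "python": platform.python_version(),
        "torch": None,
        "built_with_cuda": None,
        "cuda_available": False,
    }
    try:
        torch = importlib.import_module("torch")
    except (ImportError, OSError):
        # torch is missing or failed to load in this environment
        return info
    info["torch"] = str(torch.__version__)
    # torch.version.cuda is None on CPU-only builds, which is the usual cause of
    # "AssertionError: Torch not compiled with CUDA enabled".
    info["built_with_cuda"] = torch.version.cuda
    info["cuda_available"] = bool(torch.cuda.is_available())
    return info


if __name__ == "__main__":
    for key, value in diagnose_torch().items():
        print(f"{key}: {value}")
```

If `built_with_cuda` comes back as None, you most likely have a CPU-only wheel and need the CUDA build from the page above. If it shows a CUDA version but `cuda_available` is False, the problem is more likely the driver than the wheel.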
Anonymous No.106631703 [Report] >>106631709
>>106631700
thanks for the help anon, I feel retarded but I want to get it to work
Anonymous No.106631709 [Report] >>106631777
>>106631703
We've all been through it.
Anonymous No.106631711 [Report] >>106631946
>>106631698
Thanks but I'm waiting for GGUF
Anonymous No.106631750 [Report]
So many new models, I have to buy new SSD
Anonymous No.106631773 [Report] >>106631837
>>106631534
moar

how do you keep the environment so stable as it spins?
Anonymous No.106631777 [Report]
>>106631709
Unfortunately didn't work, I'll have to try to figure it out tomorrow since it's 4:30am here, thanks again for sending some links though
Anonymous No.106631812 [Report] >>106631852 >>106631881 >>106632018
Anonymous No.106631837 [Report]
>>106631773
I'm at work right now, don't remember the entire prompt off the top of my head. It was a 360 camera orbit around the character showing left, back, right and front, with a description of the environment on each viewing side. I'll post prompts and gens later in the day when I'm free.
Anonymous No.106631843 [Report] >>106631865
>>106630204
I wonder if writing in 2 languages back to back (the same thing) would help in any way.
Anonymous No.106631852 [Report] >>106631874
>>106631812
this is a whole new level of slop
Anonymous No.106631861 [Report]
>>106631534
modern lara croft style in an alternate universe
Anonymous No.106631865 [Report] >>106631909
>>106631843
I tried something similar to that (concat on two prompts saying the same thing, one in English) but the English was by far the stronger prompt and took over the image, plus it got the sort of look that gens get when the prompt gets too long.
Anonymous No.106631874 [Report]
>>106631852
I can't even get it to work again because the face mask node shat itself.
Anonymous No.106631881 [Report]
>>106631258
Anyone tried this yet?

>>106631698
>cries in 16gb vram

Somehow, looks better than VACE, can't wait for gguf

>>106631812
Kek
Anonymous No.106631909 [Report]
>>106631865
Oh well, it was worth trying I guess.
Anonymous No.106631946 [Report] >>106632835 >>106632843
>>106631711
https://huggingface.co/Kijai/WanVideo_comfy_GGUF/tree/main/Wan22Animate
Anonymous No.106631957 [Report] >>106632018
Anonymous No.106631963 [Report] >>106632184
kijai farting out those commits, hopefully it'll be mostly decent in a few hours.
Anonymous No.106632014 [Report] >>106632018
>>106631440
>How does it work with multiple people I wonder. How does it know who to replace?
It'll either replace both people, or whoever is uglier. This isn't my first rodeo with Chinese trained models I can already predict their behavior
Anonymous No.106632018 [Report]
>>106632014
I got my answer. You need to select and mask the subject you want to change.

>>106631957
>>106631812
Anonymous No.106632051 [Report]
I can already tell from my tries so far that it's way better than fun vace.
Anonymous No.106632116 [Report] >>106632228
>106628953
>106629095
>>106629186
>>106629206
>>106629233
>>106629239
>>106629689
hang yourself
Anonymous No.106632163 [Report] >>106632169
Here's what happens when the masking fails to recognize a body.
Anonymous No.106632169 [Report]
>>106632163
i hate horsies
Anonymous No.106632177 [Report]
Sex with Teio.
Anonymous No.106632184 [Report] >>106632195
>>106631963
Still early day in Finland, more to come
Anonymous No.106632195 [Report]
>>106632184
sounds good, couple more hours and i can finally load it up for some sloppa.
Anonymous No.106632227 [Report]
Anonymous No.106632228 [Report]
>>106632116
he's worse than the deebster by far
And the worst part is he chooses to act like this when he can just have a normal conversation which is somehow more gross and faggy than actual faggotry. I wish furshit was contained to /trash/ and blatant pokeshit was contained to /vp/ or at least the /v*/ family similar to how MLP content is contained to /mlp/
Anonymous No.106632437 [Report] >>106632445 >>106632473 >>106632627 >>106632730
anyone migrated to CUDA 13.0? does it work?
Anonymous No.106632445 [Report] >>106632464 >>106632473
>>106632437
Broke everything last time I tried
Anonymous No.106632457 [Report]
>>106630283
>>106630238
>>106630177

Hi, I see you post about Chroma quite often. I wanted to ask you, based on your experience, if you could give me an objective and tangible overview of the positive aspects of Chroma compared to SDXL besides text.
Thanks!
Anonymous No.106632464 [Report] >>106632473
>>106632445
thanks anon, will avoid that hassle then
Anonymous No.106632473 [Report] >>106632627
>>106632437
>>106632445
>>106632464
There are now cuda 13.0 versions of torch (for about 2 weeks now), maybe worth looking into.
Anonymous No.106632627 [Report]
>>106632437
i've been using it for a while with >>106632473 but i didn't test to see if anything really changed nor have i noticed.
mainly just did it for the autism of being up to date with the newest releases.
Anonymous No.106632660 [Report] >>106632669
>>106628827
i get a remixing of the music but it won't do the FUSTERCLUCK text. is this censored...?
Anonymous No.106632669 [Report]
>>106632660
>is this censored...?
oh shit, if it is it's a fucking DOA model
Anonymous No.106632693 [Report]
>>106631343
if kohya sees the potential in it then it means that it's the real deal
Anonymous No.106632712 [Report] >>106632722
where wan animate workflow
where node
Anonymous No.106632722 [Report]
>>106632712
Kij has both.
Anonymous No.106632730 [Report] >>106632734
>>106632437
ymmv
Anonymous No.106632734 [Report] >>106632747
>>106632730
did you notice any improvement on speed/vram usage?
Anonymous No.106632747 [Report]
>>106632734
nothing impressive in my use-case
Anonymous No.106632762 [Report] >>106632767 >>106632783 >>106632795 >>106632863
https://github.com/comfyanonymous/ComfyUI/pull/9939
Wan animate is on comfy native btw
Anonymous No.106632767 [Report]
>>106632762
But it requires updating
Anonymous No.106632783 [Report]
>>106632762
nice, comfy was awake already
Anonymous No.106632795 [Report] >>106632805
>>106632762
oh thank fuck.
nothing against kijai nodes but.. nah
Anonymous No.106632805 [Report]
>>106632795
yeah, this
Anonymous No.106632818 [Report] >>106632830
>>106628827
Pretty good from the tests I've done but not really seeing any way to control the voice, even just a gender option would be nice.
Anonymous No.106632819 [Report] >>106632843
> no fp16 for wan animate
i'm sorry for what i'm about to do to my gpu
Anonymous No.106632830 [Report] >>106633306
>>106632818
>Pretty good from the tests I've done
show some good beats anon
Anonymous No.106632835 [Report]
>>106631290
just use one of the video workflows - included or from the internet

>>106631946
praise KJ boss
Anonymous No.106632843 [Report]
>>106632819
Anon, did you see >>106631946 ?

Well yes that's not FP16 either but it will probably not be bad.
Anonymous No.106632863 [Report]
>>106632762
What's the workflow?
Anonymous No.106632892 [Report] >>106632903 >>106632927 >>106632951
what is bro waffling about? i swear to god Ai brings out the clinically insane https://huggingface.co/Wan-AI/Wan2.2-Animate-14B/discussions/1
Anonymous No.106632894 [Report]
>>106628953
his gens really do have a whimsical quality to them
Anonymous No.106632903 [Report] >>106632934 >>106633107
>>106632892
>i swear to god Ai brings out the clinically insane
yeah, I love this hobby but it has a lot of insane people in there, look at /sdg/ it's basically an asylum
Anonymous No.106632927 [Report]
>>106632892
i'd also like an AI to replace all of what he mentioned with AI so it does it by itself
Anonymous No.106632929 [Report]
>>106630315
tits too small
Anonymous No.106632934 [Report]
>>106632903
>look at /sdg/ it's basically an asylum
i feel that way when i look at 4chan as a whole desu but it's a familiar asylum compared to reddit
Anonymous No.106632951 [Report]
>>106632892
to be fair well sure, people would like controlnets and so on

it's just not a feature you can stick on by changing three weights in the model and sticking on "the controlnet that justwerks"
Anonymous No.106633081 [Report] >>106633102 >>106633107
My hopes are quickly fading.
Anonymous No.106633102 [Report]
>>106633081
it doesn't seem to be able to keep the lighting on par with the original video, meh, I'll pass on that one, still in the Wan 2.5 waiting room
Anonymous No.106633107 [Report] >>106633111
>>106632903
computer literacy has even gone down, despite so many people using computers daily almost all day

>>106633081
hopes of what?
Anonymous No.106633111 [Report] >>106633240
>>106633107
Animate being good.
Anonymous No.106633124 [Report]
>inb4 trani bake
20Loras No.106633166 [Report]
>tfw you get an amazing result but it was in your 480p testing phase
Anonymous No.106633213 [Report]
new
>>106633210
>>106633210
>>106633210
>>106633210
Anonymous No.106633240 [Report]
>>106633111
what doesn't work tho, the camera / scene type prompts? or the motion referencing?

IIRC these were the main features, right?
Anonymous No.106633306 [Report]
>>106632830
This was one of the better ones I got. I'm trying to include voice in the input file to try and control the gender of the singer, seems to cause it to make way more errors though.

https://files.catbox.moe/wbfx2c.mp3
Anonymous No.106633325 [Report]
so it uses a video reference + image, it's like a vace model then?
Anonymous No.106633709 [Report]
>>106630339
Box?