← Home ← Back to /g/

Thread 106497264

361 posts 186 images /g/
Anonymous No.106497264 >>106499300 >>106501126
/ldg/ - Local Diffusion General
Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106494102

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://rentry.org/wan22ldgguide
https://github.com/Wan-Video
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2122326
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Anonymous No.106497283 >>106497298
All round town anons ask me if I know of the "blessed thread" and do you know what I tell them
Anonymous No.106497293 >>106497311 >>106497316
Comfy needs WebUI style workflow or node improvements for users. Swarm and SDNext aren't solutions. People want Gradio.
Anonymous No.106497298 >>106497375
>>106497283
Please tell me
Anonymous No.106497311
>>106497293
people dont know what they want until you give it to them desu
Anonymous No.106497316 >>106497325 >>106497345 >>106497486
>>106497293
ComfyTip:
Group reusable nodes using Nest Node Builder.
Load/ungroup them when needed instead of adding individual nodes.
Save time with saved nested nodes (ControlNet, Prompt, Upscale, etc.).
Export as JSON to transfer/share.
ComfyUI is faster and more convenient than Forge once a workflow is finalized, trust me.
Anonymous No.106497324
Blessed thread of frenship
Anonymous No.106497325 >>106497335 >>106498151
>>106497316
Comfy It's trash, a big pile of ugly frustrating trash, and why the hell is it called comfy ui? it anything but comfy its awful. I'm will never using it again, fuck !
Anonymous No.106497334 >>106497346 >>106497351 >>106497358 >>106497384 >>106497451
Graphical UIs suck. Why can't we write code directly?
A basic workflow is less than 10 lines of code:
text_encoder = load_text_encoder("te.safetensor")
model = load_model("model.safetensor")
vae = load_vae("vae.safetensor")
model.load_lora("lora.safetensor")

latent = new_latent(width=1024, height=1024)
latent = latent.ksampler(steps=20, cfg=4, seed=12345)
image = vae.decode(latent)
image.save("out.png")


Ability to write functions make abstraction possible:
latent = new_latent(width=1024, height=1024)
latent = my_custom_first_pass_sample(latent, steps=20, cfg=4)
latent = my_custom_hires_fix(latent, steps=30, cfg=4, denoise=0.7)
latent = my_custom_adetailer(latent, ...)

No more spaghetti. No more janky node to workaround node UI deficiencies like reroute, switch, number to string to number, numeric operations on numbers, etc.

Not to mention with loop statements it's easy to make custom XYZ plots, which node UI just can't do.

With a real programming language, it has the power of both forge-style UI and node-based UI do while also better than both in flexibility and cleanness.

Pair it with a Jupyter notebook style UI, you can nicely iterate and inspect results anywhere in the middle of generation and nicely iterate until you get a good result.
Anonymous No.106497335
>>106497325
>Comfy It's trash
Can I push a big orange button to convert a "comfy" workflow to python or some real scripting language and hack that?
Anonymous No.106497345 >>106497355
>>106497316
What is killing me on my Comfy are the updates that and other shenanigans that keeps breaking constantly , like, the project is great but it's so fragile and takes a misplaced period to almost make you pc explode.
Anonymous No.106497346
>>106497334
Easy peasy, lemon squeezey ! What imports do I need?
Anonymous No.106497351
>>106497334
this is the best
Anonymous No.106497355
>>106497345
Seriously, they should have guideline like Forge and enforce them before allowing them to be integrated into WebUI. Seriously, that's what is keeping me from using it as my main stable diffusion ui.
Anonymous No.106497358
>>106497334
the only thing i want to write is a prompt
Anonymous No.106497369
Anonymous No.106497375 >>106497451
>>106497298
that gap man
THAT FUCKING GAP
it does things to me
Anonymous No.106497384
>>106497334
do it faggot
Anonymous No.106497385 >>106497391 >>106497392 >>106497411 >>106497440
After all this time trying comfy, I still absolutley hate it's fking guts. I tried, I learned, I made mistakes, I studied, I failed, I learned again. Debugging and debugging and debugging... I'm so sick of it. I hated it from my first git clone up until now, with my last right click delete of the repository. I have been using A1111, reForge, and Forge as my daily before Comfy. I tried Invoke, foocus, and SwarmUI. Comfy is at the bottom. I don't just not enjoy it, it is a huge nightmare everytime I start it. I wanted something simple, plug n play, push power button and grab a controller, type of ui. Comfy is not only 'not it' for me, it is the epitome of what I hate in life.

Why do I hate it so much? Here's some back ground if you care. When I studied to do IT 14 years ago I had a choice to choose my specialty. I had to learn everything from networking, desktop, database, server, etc... Guess which specialties I ACTIVELY avoided? Database and coding/dev. The professors would suggest once every month to do it. I refused with deep annoyance at them. I dropped out of Visual Basic class because I couldn't stand it. I purposely cut my Linux courses because I hated command line, I still do. I want things in life to be as easy and simple as possible.
Anonymous No.106497391
>>106497385
okay .. well you're kinda retarded so you're not helping the rest of us with our case
Anonymous No.106497392 >>106497403 >>106497420 >>106497454
>>106497385
Comfy is like browsing the internet in a browser with html format only. Imagine a wall of code, a functional wall of code. It's not really the spaghetti that bothers me, it's the jumbled bunch of blocks I am supposed to make work. The constant scrolling in and out is annoying but the breaking of comfy from all the nodes (missing nodes) was what killed it for me. Everyone has a custom workflow. I'm tired of reading dependencies over and over and over again.

I swear to Odin I tried my best. I couldn't do it. I just want to point and click and boom image. I don't care for hanyoon, huwanwei, whatever it's called. I don't care for video and all these other tools, I really don't. I just want an outstanding checkpoint and an amazing inpainter.

Am I stupid? yeah sure call me that if you want. I don't care. I open forge. I make image. I improve image. I leave. That's how involved I am in the AI space. DESU, 90% of the new things, cool things, new posts in this sub is irrelevant to me.

You can't pay me enough to use comfy. If it works for you great, more power to you and I'm glad it's working out for you. Comfy was made for people like you. GUI was made for people who couldn't be bothered with microscoptic details. I applaud you for using Comfy. It's not a bad tool, just absolutely not for people like me. It's the only and the most power ui out there. It's a shame that I couldn't vibe with it.
Anonymous No.106497403
>>106497392
I use comfy for video, forge for the images. I do a fair amount of nodes in unreal and blender at work, so it's not too scary, but I won't add another node system I need to master unless I have to!
Anonymous No.106497405
>chatgpt write schizobabble from the perspective of someone who hates comfyui
Anonymous No.106497411
>>106497385
>. I tried, I learned, I made mistakes, I studied, I failed, I learned again.
Engaging but not empowering. sigh
Anonymous No.106497420
>>106497392
I completely get where you are coming from. I have tried ComfyUI too, and while it is insanely powerful, it always feels like I am stuck fixing broken LEGO instructions made by 10 different people. Every time I load someone’s workflow, half the nodes are either deprecated, custom, or renamed, and I end up spending more time debugging or hunting missing certain type of files than generating.
Anonymous No.106497428 >>106497436 >>106497438 >>106498545
Anonymous No.106497436 >>106497477
>>106497428
You ruined that skindentation with that ugly face.
Anonymous No.106497438 >>106497447
>>106497428
dat phase shift
Anonymous No.106497440
>>106497385
That is why I stick with Stable Diffusion Forge for images. It just works. PNG Info gives me everything I need at a glance, even if the image was not made in Forge. Prompt, model, LoRA,
Anonymous No.106497447
>>106497438
that was on purpose.. it took so goddamn many tries to get it to do it too
Anonymous No.106497451 >>106497504
>>106497334
Can't you just use the tensor library in python and kinda just do that?
Implementing/porting all functionality of custom nodes might be a pain, though.
That being said, I don't mind ComfyUI. I kinda like the previews/compare nodes. If you'd want that in pure code it'll require a UI again, anyway.

>>106497375
This is a blue board, this is all friendly fun.
Anonymous No.106497454
>>106497392
ComfyUI has its fans and massive support and I respect that, but it is clearly not built for everyone. If you like clean workflows and hate excessive tinkering, Forge hits the sweet spot. So you are not alone!
Anonymous No.106497467 >>106498333
blue board, blue balls
Anonymous No.106497473
did I tell you before? I don't like comfyui
I know there are other webuis, but I don't care about them, I don't even actually gen, I just want to say I don't like comfyui
Anonymous No.106497477
>>106497436
that was actually on purpose and it took many attempts to pull it off
Anonymous No.106497484 >>106502045
Anonymous No.106497486 >>106497501
>>106497316
You'd be surprised at the number of nodes people include in their workflows that are completely unnecessary. I've even seen requests to install a custom node package to set an integer or just to hide 2 spaghettis.
Anonymous No.106497494
It's all well and good to want to write high-level code and click run but it does have some problems. Main one being that importing PyTorch and then loading your models takes forfuckingever so you're going to need to have some sort of server that holds onto that stuff for you and somehow knows what models to cache when, otherwise every time you fiddle with your gen you have to wait the best part of a minute just for it to get going. Maybe you could do it with some combination of Jupyter notebooks idk, but I've never used Jupyter and you'd also need to implement some way to easily embed those notebooks into your outputs and then convince everybody to adopt that standard.
Anonymous No.106497498
with all this fresh pasta i dont need the noodles XD
Anonymous No.106497499
To AntiComfySchizo:
You don't hate comfy, you hate python. And rightly so, it's a fully trash peer dependency environment.
Anonymous No.106497501
>>106497486
It's very annoying when workflow use random exotic nodes while perfectly fine versions exist in core or even well known packs.
I just convert them usually.
Anonymous No.106497502 >>106497518 >>106497955
Anonymous No.106497503 >>106497535
I HATE PYTHON
Anonymous No.106497504 >>106497672
>>106497451
>This is a blue board, this is all friendly fun.
it makes me horny in a friendly way
Anonymous No.106497515
I LOVE SNAKES
Anonymous No.106497518 >>106497528
>>106497502
FUCK OF NETA LUMINA SCAMMER
SHARE WORKFLOW OR GTFO
Anonymous No.106497521
Anonymous No.106497525 >>106497725 >>106497751 >>106502499
ah, friday
Anonymous No.106497528
>>106497518
Anonymous No.106497534 >>106497545 >>106497551 >>106497557 >>106497566
give me prompt suggestions pls
Anonymous No.106497535
>>106497503
>I HATE PYTHON
I like python. I hate python package management and dependency hell.
Anonymous No.106497545 >>106497581
>>106497534
A cute 1girl sleeping under the sun.
Anonymous No.106497551 >>106497581
>>106497534
"An old man punching a horse"
Anonymous No.106497556 >>106497582
>>106495675
anyone? though reading past the last thread I’m gonna assume that even if it is possible, it’s not as easy as I think it is.
i’m basically modifying an image of a hand pulling something out of a box, and I want to use the reference image as the item being pulled out.
Still using an unmodified workflow from the guide here.
Anonymous No.106497557 >>106497581
>>106497534
1gil, looking at viewer, waving
Anonymous No.106497566 >>106497581
>>106497534
an indian uncle harassing a hot girl for sexi sex
Anonymous No.106497581
>>106497545
>>106497551
>>106497557
>>106497566
i could inpaint all of these into a single awesome gen but i use comfy so you know im not fucking around with that shit kek
Anonymous No.106497582
>>106497556
use vace
Anonymous No.106497600 >>106497612
I've started to notice some similarity on the wan 2.2 faces, is there anything I can do about it without prompting face traits or ethnicity?
Anonymous No.106497612
>>106497600
use loras, or use i2v instead
Anonymous No.106497672 >>106497697 >>106498601
>>106497504
I'm not going to give you a friendly handshake, that's for sure.
Anonymous No.106497697
>>106497672
that's ok anon
Anonymous No.106497725
>>106497525
You asuka sucks you know?
Anonymous No.106497747
tfw ani was right
Anonymous No.106497751
>>106497525
pretty good Asuka
Anonymous No.106497762
k enough of the discord invasion
Anonymous No.106497819
Anonymous No.106497872 >>106498936 >>106498961 >>106502095
Anonymous No.106497882 >>106497895 >>106497959
Am i really the only cumfartnigger who sees this in vram usage during every vram allocation until it stabilizes and works normally? Started happening in the last couple of days
Anonymous No.106497892 >>106497907 >>106499681 >>106502095
Anonymous No.106497895
>>106497882
Maybe comfyui should be dragged on the streets and ACKed
Anonymous No.106497901 >>106497908
man we really need better interpolation software
Anonymous No.106497907
>>106497892
pretty damn good
Anonymous No.106497908
>>106497901
film vfi is good enough for 16 to 32, and we just need some cracked topaz node connection in comfyui for slightly higher quality to 32 and for anything more
Anonymous No.106497955
>>106497502
Anonymous No.106497959
>>106497882
>mfw ram usage expands
Anonymous No.106497991
i want these goddamn snakes out of my goddamn computer
Anonymous No.106498024
Anonymous No.106498098 >>106502095
Anonymous No.106498141 >>106498172
Anonymous No.106498151
>>106497325
>I'm will
you will be sorely missed, frenchy.
Anonymous No.106498169
Anonymous No.106498172 >>106498378
>>106498141
why is that man drooling like a retard
Anonymous No.106498188
Anonymous No.106498196
Anonymous No.106498206 >>106498231 >>106499318
Anonymous No.106498231 >>106498237
>>106498206
kek'd
Anonymous No.106498237
>>106498231
hors
Anonymous No.106498256
Anonymous No.106498295 >>106498310
Don't get all the shit Comfy gets, when the entire community has had to rely on Civit for far too long. We're talking actual tangible, long term damage from such a garbage platform thriving.
Anonymous No.106498309 >>106499172
Anonymous No.106498310
>>106498295
Yet people here rarely talk about real problems in need of an urgent solution.
Anonymous No.106498325 >>106498344
>>106497275

whoever genned this, please PLEASE catbox
Anonymous No.106498333
>>106497467
Not if you visit /adt/
I mean, it's crazy what they get away with posting there.
Anonymous No.106498344
>>106498325
PLEASE get better taste
Anonymous No.106498370
Anonymous No.106498378 >>106498437
>>106498172
c'mere anon
Anonymous No.106498437
>>106498378
bisgustin
Anonymous No.106498439 >>106498530
Anonymous No.106498530
>>106498439
Typical DiT manlet (because manlet syndrome isn't unique to kontext or QIE, it is some kind of transformer-wide intrinsic property, noticable in Dalles, too. Young meatbag artists also share it with transformers when they run out of bottom margins on their real world paper but still need to draw feet.)
Anonymous No.106498539
Anonymous No.106498545
>>106497428
2000 years post wall
Anonymous No.106498571
Anonymous No.106498601
>>106497672
catbox?
Anonymous No.106498708 >>106498718
Can any of you anons make me a believable black-and-white photo of Sigmund Freud mixed with George Floyd? I want to print it and
Anonymous No.106498718
>>106498708
...and put it on my best friend's wall.
Anonymous No.106498746 >>106498773 >>106498840 >>106498922 >>106503085
Is GenJam never coming back?
Anonymous No.106498773
>>106498746
it will return
Anonymous No.106498840
>>106498746
Just say it with, move your lips when reading it so you can feel how good it feels saying.

"GOON JAM"
Anonymous No.106498878 >>106498923
Anonymous No.106498895
wan 2.2 vace WHEN
Anonymous No.106498922 >>106502499
>>106498746
>didn't participate
>somehow miss it
feels weird
Anonymous No.106498923 >>106498929
>>106498878
I'm curious of the prompt
Anonymous No.106498929
>>106498923
A scenic view of an old bicycle in a field. The camera pushes out to reveal scraps of torn and ripped clothing strewn about and broken beer bottles on the ground. There are smears from bloody handprints on the tree. There is a puddle of of blood in the grass and on the ground near the torn clothes. It looks like a murder scene.
Anonymous No.106498936 >>106498996
>>106497872
Model?
Anonymous No.106498958 >>106499026 >>106499032
How much RAM you guys have? I plan to upgrade from 64 to 128GB, but not sure if it's worth the upgrade
Anonymous No.106498961 >>106498996 >>106499232
>>106497872
kek, fucked up the heels
Anonymous No.106498996 >>106499232
>>106498936
noob and a lora trained on 80 imgs curated from https://x.com/schauermannx2
i think it can be much better tho
>>106498961
perspective wise i think the video makes more sense heh
Anonymous No.106499026 >>106499215
>>106498958
96 GB
Anonymous No.106499032 >>106499047 >>106500745
>>106498958

Downloaded RAM from 64 to 128 and it reduced my Wan2.2 gens by 80 seconds. Worth it.
Anonymous No.106499047 >>106501005
>>106499032
where can u download ram from???
Anonymous No.106499051
Anonymous No.106499064
Are there wan2.2 vace or alternatives yet?
Anonymous No.106499100
Anonymous No.106499147
Anonymous No.106499172
>>106498309
Badass
Anonymous No.106499186
Anonymous No.106499215 >>106499237
>>106499026
no screenshot? comeon hommie, flex that dick
Anonymous No.106499218 >>106499244
>>106496741
>Now I kind of want to make a Suno.ai song with this phrase as the chorus:
>>he made his own bed
>>by forcing the model to run faster starting at v30, >it all went downhill from there
got ya senpai
https://vocaroo.com/1loPzeLJD8qK
Anonymous No.106499232
>>106498961
>>106498996
true, i think i got it right this time though
Anonymous No.106499237 >>106499247
>>106499215
here you go
Anonymous No.106499244
>>106499218
kek
Anonymous No.106499247 >>106499250
>>106499237
dawg I got a 5090, thought you had that blackwell pro, tryin to jerk off over here
Anonymous No.106499250
>>106499247
no 96 gig ram not vram
Anonymous No.106499255
Anonymous No.106499264
odd how onetrainer wont let you define sampler/scheduler/etc for training samples. i wonder how training with comfy compares in general
Anonymous No.106499300 >>106499316
>>106497264 (OP)
Middle right is amazing
Anonymous No.106499316
>>106499300
Ikr, I didn't find any flaw on that one, I kinda expect Wan 3.0 to have this kind of quality consistently
Anonymous No.106499318
>>106498206
LMAOOO, this is a gem
Anonymous No.106499331
>>106499113
now I get why Microsoft wants to shut this model down kek
Anonymous No.106499405 >>106499420 >>106499463
>>106499113
https://github.com/paperwave/VibeVoice
doesn't look taken down to me?
Anonymous No.106499420 >>106500838
>>106499405
the corrected 7b model is "on the way", it'll be a lobotomized version of the one we already have lol
Anonymous No.106499455 >>106499468 >>106499505 >>106499578 >>106501068
I just realized WAN's generation are tuned for 16 fps. Increasing fps makes the motion fast most of the time.
Anonymous No.106499463
>>106499405
because that obviously is not the original microsoft repo
Anonymous No.106499468
>>106499455
yep, I hope their next version will be at 24 fps, that's the threshold where it doesn't look chopped as fuck
Anonymous No.106499502 >>106499513 >>106499544 >>106499556 >>106499585
https://www.theverge.com/anthropic/773087/anthropic-to-pay-1-5-billion-to-authors-in-landmark-ai-settlement
Anthropic to pay $1.5 billion to authors in landmark AI settlement
holy fuck dude, this is bad, like really really bad
Anonymous No.106499505 >>106499509
>>106499455
>I just realized WAN's generation are tuned for 16 fps.
I have no idea where people got the idea that it isn't. It's been a really hard myth to dispell.
Anonymous No.106499509
>>106499505
>I have no idea where people got the idea that it isn't.
that's because the 5b version is actually working at 24fps, so I also thought the 14b version would be too
Anonymous No.106499513
>>106499502
That's like 80% of what I've spent gooning to opus 4.1
Anonymous No.106499544
>>106499502
>Let's fuck with the development of this groundbreaking technology because some fat bitch wants money for her chad thundercock schlock novel
Grim. I hate the antichrist.
Anonymous No.106499556 >>106499585 >>106499694 >>106499722
>>106499502
It's actually good in the sense that they're only paying for illegally downloading the books, not for using them in training. 1.5b is nothing to them but (like everyone's been saying) this case further widens the gap between the big guys and the little guys (which is what the authors proclaim they are against kek).
Anonymous No.106499578
>>106499455
The frame interpolators are decent for solving this issue. I use GIMM-VFI, which you can search for with comfyui custom nodes manager and install it.

I believe these are quite a bit better than the old interpolated frames you'd get with TVs. Though they're obviously not perfect, as they're only spending like 60 seconds to generate the interpolated frames for your entire video.

The way it works is your frame count gets increased (like if every 2nd frame is interpolated then it goes from 81 --> 160 or whatever), so now your video is in slow motion, but then you fix this by increasing your FPS to make it faster. And then it'll look right.
Anonymous No.106499580 >>106499604
Testing Chroma Radiance
I think the output is pretty interesting. It has sort of a weird mottling pattern that seems unique. It's probably something that should go away with further training progress, but I actually like it.
Anonymous No.106499585 >>106499596 >>106499641
>>106499502
>>106499556
The judge explicitly ruled that training on protected works is fair use which is the biggest win for AI bros
Anonymous No.106499592 >>106499638
https://vocaroo.com/1eCSSHLSRPJ0

You know, for the first time using vibe voice it's actually pretty good. I was expecting another so-so model but it's actually pretty good.
Anonymous No.106499596 >>106499665
>>106499585
yeah we won but it'll be reported as a loss.
as usual, we pretty much have to just wait about 12 fucking years until the whiners die out and the AI zoomers take over, then they'll start making a bunch of youtube videos finally correcting the record for all the misinformation being spread.
Anonymous No.106499604 >>106499613 >>106499648
>>106499580
Poor anatomy and other Chroma problems seem about the same as the regular models
Anonymous No.106499613 >>106499624 >>106499648
>>106499604
Problem is radiance is slow. And raising the resolution increases memory requirements like crazy.
Anonymous No.106499624 >>106499655
>>106499613
Whoa. It's almost like the Vae exists for a reason.
Anonymous No.106499634
>havent updated in months because comfy runs fine
>start it up, suddenly it crashes when loading a specific controlnet
>switch to different controlnet and it works fine
Wha
Anonymous No.106499638
>>106499592
Yeah the 7B model is crazy good.
Anonymous No.106499641 >>106499665
>>106499585
>The judge explicitly ruled that training on protected works is fair use which is the biggest win for AI bros
then why do they have to pay to use it? that's the fucking problem
Anonymous No.106499645
https://voca.ro/13lAHtQGa9KR
Anonymous No.106499648 >>106499751 >>106499762 >>106501231
>>106499604
Unfortunately it looks like the biggest thing hoped to improve with Radiance isn't any better. Small, high frequency details are still get melted and deformed. Not using VAE isn't helping. I think it was already doomed when they decided to train with 512x.

>>106499613
I am overly GPU-rich so I didn't really notice, but I thought skipping the VAE could have lead to speed improvements eventually?
Anonymous No.106499655 >>106499762
>>106499624
>Whoa. It's almost like the Vae exists for a reason.
for edit models, vae is a disaster, you want the model to only modify certain parts of the image but with a vae you have a compression loss on all the pixels, vae-less is the way to go for edit models
Anonymous No.106499665 >>106499678
>>106499596
I cannot think of a bigger win for training other than forcing those included in datasets to pay the trainers. It's that big.
>yeah we won but it'll be reported as a loss.
True but in time, it won't.
>>106499641
>then why do they have to pay to use it?
Not to use it, anon. To obtain it. I agree it's still gay and retarded but until the entire copyright apparatus is taken down that's the way it'll be.
Anonymous No.106499678 >>106499694
>>106499665
>Not to use it, anon. To obtain it. I agree it's still gay and retarded but
I don't think you realize how fucked up this is, every uncoming company will need billions of dollars to get the data needed to train their models, it'll kill everything, only giant companies will afford to do that, the US eldorado is over
Anonymous No.106499681
>>106497892
same seed and prompt, different epochs of a new version
need to figure out why noob lineart cnet crashes comfy tho
Anonymous No.106499694
>>106499678
>I don't think you realize how fucked up this is,
See >>106499556
>this case further widens the gap between the big guys and the little guys (which is what the authors proclaim they are against kek).
It sucks but I'm a half glass full kind of person. It does prove that the artists and authors suing are either 1. hypocrites or 2. being fooled by large copyright holders but we've all suspected as such already.
Anonymous No.106499714 >>106499756 >>106500281
https://vocaroo.com/1EeqoY7Nm8wo
Anonymous No.106499722 >>106499740
>>106499556
>It's actually good in the sense that they're only paying for illegally downloading the books, not for using them in training.
3000 dollars for a single book? really? they're not paying to buy a book, they're paying the extra to use it for training, how is that "fair use"? the judge is fucking RETARDED
Anonymous No.106499723 >>106499734
Anonymous No.106499734 >>106500904
>>106499723
What I've been seeing going around is the first frame being the image and the second frame being the box display, so like the character walks up on to the desk.
Anonymous No.106499740 >>106499748 >>106499755
>>106499722
Put yourself in Anthropic's shoes. Spending 1.5b to save 183b is a steal.
>the judge is fucking RETARDED
Pretty sure that number was reached between the two parties. Actually I think it was Anthropic that came out and said "that's fine we'll pay it.
Anonymous No.106499748
>>106499740
for anthropic it's fine, but this sets a precent, now every company that wants to replicate their success know they will have to first need billions of dollars to make their first model, this will be impossible for almost everyone, the US is dead, China has won
Anonymous No.106499751
>>106499648
>VAE could have lead to speed improvements eventually
Theory is supposed to learn better so technically but in practice I doubt it. We only have one pixel model in the wild and it's really meh cause it was undercooked, which I am assuming is from the high training resource this technique requires.
Anonymous No.106499755
>>106499740
$183 billion, but they are still operating at a loss, they are still not profitable.
Anonymous No.106499756 >>106499794
>>106499714
It kinda starting speeding up like crazy near the end but speech was pretty natural until then. Don't know who it is though.
Anonymous No.106499762 >>106499960
>>106499648
>Not using VAE isn't helping.
it is >>106499655
Anonymous No.106499790
I bet if you're able to obtain your dataset via the clear web and not torrents or other illegal means it'd be fine.
Anonymous No.106499794 >>106500848
>>106499756
It's Dagoth Ur from morrowind. It sounds pretty much exactly like him but then again I think he's one of the easiest voices to replicate.
Anonymous No.106499836 >>106499856
https://files.catbox.moe/no4k4q.flac
Anonymous No.106499856 >>106499879
>>106499836
kek, who's voice is it?
Anonymous No.106499879 >>106499897
>>106499856
https://www.youtube.com/shorts/idvtat3TbTE
Anonymous No.106499897 >>106499941
>>106499879
so it managed to replicate the voice with only 7 seconds of examples? that's insane
Anonymous No.106499941 >>106499951
>>106499897
anudda victory for the OGs
Anonymous No.106499951
>>106499941
https://files.catbox.moe/81zqj5.flac

A short clip of megumin in Japanese from youtube.
Anonymous No.106499960 >>106499962
>>106499762
Good thing Chroma is an edit model oh wait
Anonymous No.106499962 >>106499981 >>106499982 >>106500183 >>106500466
>>106499960
>Good thing Chroma is an edit model oh wait
that's funny because he wants chroma to be an edit model
https://xcancel.com/LodestoneE621/status/1963467050501992811#m
Anonymous No.106499981
>>106499962
>Base model still mangles anything and everything a lot of the time
>let's waste some more compute/money to become a bad editing model
Can he see one thing trough for once and make it good before his ADHD kicks in?
Anonymous No.106499982 >>106500029 >>106500466
>>106499962
This feels so ill conceived at this point. Chroma should be marked as done and he should use those resources on more promising models.
Anonymous No.106499987 >>106499993
remember when he said he was going to do a wan tune :(
Anonymous No.106499993
>>106499987
No, because I don't hang on to his every word like some people here seem to.
Anonymous No.106500006
https://files.catbox.moe/wpf4ai.flac
Anonymous No.106500014
Anonymous No.106500029 >>106502328
>>106499982
this, at this point he should just finetune Qwen Image Edit so that it doesn't zooms in randomly, can do porn and has nice skin texture, it won't be too expensive and he will really be considered a legend, trying to undistill schnell was a giant mistake, you can't save distilled models, now it's obvious
Anonymous No.106500032 >>106500054
Chroma Radiance creates this mosaic artifact if you try to generate at very high resolution. Obviously it's not expected to actually produce good output in this case, but the artifact is interesting. It's almost like it's trying to extrapolate the scale of the pixels themselves instead of the number of pixels.
Anonymous No.106500050
What's the best way to make pixelated low resolution gens? Like anime style pixel art with flat colors and simple shadows.
Anonymous No.106500054
>>106500032
that's because this pixel method uses square patches, so it's literally some 16x16 mosaics, I wonder how they can fix that
Anonymous No.106500060 >>106500071
https://files.catbox.moe/6i1zok.flac

Indian scammer
Anonymous No.106500071 >>106500076
>>106500060
kek, but for real though, indians will use this techology to reproduce a real american voice and their scam will be harder to notice
Anonymous No.106500076 >>106500083
>>106500071
I mean... have you seen the leadership at microsoft lately?
Anonymous No.106500083 >>106500093
>>106500076
I do, and I'm glad it's a jeet at the top, look at their fuckup, now we have a good model on the wild because if their incompetence (I still believe they did this shit in purpose to help the local ecosystem forward, like the """leak""" of llama1)
Anonymous No.106500088
Can you convert flux loras to Qwen?
Anonymous No.106500093
>>106500083
It's honestly really good. I can't believe I almost ignored this model.
Anonymous No.106500096 >>106500141 >>106500165
Since VibeVoice large is like 17gigs I'm currently trying to implement block swapping for the custom nodes I found so you can offload parts of the model to DRAM and run it on lower VRAM systems.
The normal nodes didn't offload anything for me and OOM during initialization of the model.
Can some Anon with 16gb VRAM tell me what kinda nodes they're using for the large model before I waste any more time?
Anonymous No.106500105
Anonymous No.106500137
Anonymous No.106500141 >>106500146
>>106500096
What is that node anyway? The node I'm using doesn't look like that
Anonymous No.106500146 >>106500159
>>106500141
That's from
https://github.com/Enemyx-net/VibeVoice-ComfyUI

That's why I'm asking, what nodes are you using?
Anonymous No.106500149
Anonymous No.106500153 >>106500188 >>106500423
https://files.catbox.moe/hdi8ls.flac
Asmon comes out as trans.
Anonymous No.106500159 >>106500176 >>106500781
>>106500146
https://github.com/wildminder/ComfyUI-VibeVoice/commits/main/

I can't help you though, I'm on 24gb and it just works. The node has a quantization option for lower vram users though. Good luck with your block swapping.
Anonymous No.106500165 >>106500176
>>106500096
I'm on 12gb and I use quants.
Anonymous No.106500176
>>106500159
Yeah, block swapping works already with the code I wrote, so it loads on 16GB VRAM, but I'm still doing the compute on DRAM right now because I think I'm using wrong logic to get the source of the tensor.
But your node seems to have offloading already... so I just did that for nothing.
Joke's on me for not searching for another node and relying on fucking plebbit of all places.
Thanks man.

>>106500165
I'd rather run full models and offload most of the time.
Anonymous No.106500177
Anonymous No.106500183
>>106499962
>10 more epochs
he has zero patience and will introduce random experiments and distillations that will fuck it up halfway
Anonymous No.106500188 >>106500195
>>106500153
that doesn't sound like him at all
>t. looks at his videos everyday to get news of the new woke slop drama
Anonymous No.106500195
>>106500188
I think it's the cadence that ruins it.
Anonymous No.106500215
Anonymous No.106500261 >>106500279 >>106500344 >>106500561 >>106502357 >>106502416
Some super lewd WAN gens

files.catbox.moe/ buvot5.mp4

files.catbox.moe/ opnfsd.mp4

files.catbox.moe/ ulztfl.mp4

came out pretty good I'd say
Anonymous No.106500279 >>106500283
>>106500261
>Put a space in the url
>Censored the images on catbox anway

You are a real piece of shit, you know that?
Anonymous No.106500281 >>106500289
>>106499714
vibevoice or chatterbox?
Anonymous No.106500283 >>106500485
>>106500279
I don't want a forced vacation ok
tryin to play it safe
Anonymous No.106500289
>>106500281
Everything in this thread is vibe voice.
Anonymous No.106500344 >>106500380
>>106500261
Anonymous No.106500380
>>106500344
ogopogo! ogopogo!
Anonymous No.106500413
Damn. VibeVoice works kinda alright for non-English languages.
Neat model.
Anonymous No.106500423
>>106500153
>https://files.catbox.moe/hdi8ls.flac
>Asmon comes out as trans.
Do a Total Biscuit WTF podcast, returning from the grave to review trash games.
Anonymous No.106500458
>taggui can't do batches
Literally what is the point then
Anonymous No.106500466
>>106499982
>>106499962
One more training bro
Anonymous No.106500485
>>106500283
anon, catbox is fine
Anonymous No.106500492
>having fun generating generic npcs
>the final boss appears
pretty rad
Anonymous No.106500561
>>106500261
why would you censor them
Anonymous No.106500745 >>106500780
>>106499032
Is this sarcasm or real? im thinking of increasing ram too
Anonymous No.106500757 >>106500781
the large vibevoice doesn't run with 16gb even with quants, meh
Anonymous No.106500780
>>106500745
I also increased ram from 64 to 128 but I have two gpu to feed, I don't think you need more than 64 for 1 gpu.
Anonymous No.106500781 >>106500789
>>106500757
I'm running it on 12 with this node
>>106500159
Anonymous No.106500784 >>106500796
What's so good about the voice model leaking? Can it do nsfw?
Anonymous No.106500789
>>106500781
I just get OOM. Oh my fucking god it's Comfy again isnt it
Anonymous No.106500796 >>106500808 >>106500821 >>106500889 >>106500941
>>106500784
>leaking?

It didn't leak. It got the wizard LM 2 treatment. Nobody bothered to look at it until Microsoft pulled it for being too good. And yeah, it does NSFW.

It's just all around solid.
Anonymous No.106500807 >>106500816
>prompt WAN i2v
>action looked good
>increase length by 12 frames
>action changes completely
we will never get exactly what we want
Anonymous No.106500808 >>106500818
>>106500796
>And yeah, it does NSFW
can someone show a catbox of that
unless nsfw is just "I can make someone say tits ass"
Anonymous No.106500816
>>106500807
you mean you stitched two videos or you actually added 12 frames to the 81?
Anonymous No.106500818 >>106500838
>>106500808
>>106499113
Anonymous No.106500821
>>106500796
>Microsoft pulled it for being too good
Oh I see, so basically a happy accident.
Anonymous No.106500823 >>106501056
>>106499415
I can easily fap to this
Anonymous No.106500838 >>106500854 >>106500856
>>106500818
That's actually better than anything else I've heard locally. Can you control emotions? Or just write text and it figures it out?

>>106499420
>the corrected 7b model is "on the way", it'll be a lobotomized version of the one we already have lol
This will be funny to watch, and see the differences, so we'll know if it's the nsfw abilities or the "too good for local" that triggered its deletion.
Anonymous No.106500848
>>106499794
the old 11labs ones are better imo
Anonymous No.106500854
>>106500838
Basically you have to have those kinda sound in the reference and it kinda figures how to implement them through context.
Anonymous No.106500856
>>106500838
>Can you control emotions?
no but if the base model is good people will figure out ways to create ""loras"" out of them
would be happy to hear a jav one for sure
Anonymous No.106500885
comfy fork when
Anonymous No.106500889 >>106500898
>>106500796
>Nobody bothered to look at it until Microsoft pulled it for being too good. And yeah, it does NSFW.
congrats Microsoftrannies, the streisland is at full effect kek
Anonymous No.106500898
>>106500889
I downloaded it just because someone said they deleted it. I'm not even interested in voice but it had to be done.
Anonymous No.106500904
>>106499734
example?
Anonymous No.106500941 >>106500961 >>106501017
>>106500796
> it does NSFW.
what kind of nsfw? just moaning
Anonymous No.106500961 >>106500981 >>106501017
>>106500941
>what kind of nsfw? just moaning
I feel stupid for needing to ask this, but what kind of NSFW stuff do you expect beyond moaning for a voice model?
Anonymous No.106500981 >>106501017
>>106500961
nta but shlops and plaps
Anonymous No.106501005
>>106499047
newegg
Anonymous No.106501017
>>106500941
>>106500961
>>106500981
-> >>106499113
Anonymous No.106501028 >>106501106 >>106501124 >>106501126
https://files.catbox.moe/7fwj5k.flac
Anonymous No.106501056 >>106501099
>>106500823
very nice style mix
Anonymous No.106501068 >>106501091
>>106499455
>I just realized WAN's generation are tuned for 16 fps
nigger this was explicitly known since wan 2.1
Anonymous No.106501091
>>106501068
the wan 2.2 5b model is 24fps though
Anonymous No.106501099 >>106501240 >>106501361
>>106501056
thanks
Anonymous No.106501106
>>106501028
Wow, this model is nuts
Anonymous No.106501109 >>106501124
https://files.catbox.moe/q5alyj.flac
Anonymous No.106501116 >>106501134 >>106501194 >>106501269
https://xcancel.com/bdsqlsz/status/1964279441305030725#m
seems like one of the 2 edit chink models that will be released (one of them will be local) is Seedream 4.0
Anonymous No.106501124 >>106501129
>>106501028
>>106501109
do you think the normies will accept the fact the Simpsons will use the Homer/Marge original AI voices once the actors will die of old age?
Anonymous No.106501126 >>106501319
>>106497264 (OP)
Why haven't you trained that LoRA yet?

>>106501028
Decent Homer
Anonymous No.106501129
>>106501124
I give marge weeks before she is dead. Have you heard her lately?
Anonymous No.106501134
>>106501116
good on them for showing failure cases, not many model makers do that
Anonymous No.106501137 >>106501155
https://files.catbox.moe/s0jkzc.flac

Gollum
Anonymous No.106501155
>>106501137
>Gollum
desu this should be the benchmark for audio models, he has the perfect voice to test out a model's limit
Anonymous No.106501194
>>106501116
the left one is impressive
Anonymous No.106501197 >>106501203 >>106501261
Anonymous No.106501203
>>106501197
Oops, I changed "dream" to "gen" and it hit the wrong way. This is what I meant to post
Anonymous No.106501231 >>106501241 >>106501252 >>106501272 >>106501335
>>106499648
>Not using VAE isn't helping
the entire chroma finetuning project amounts to lodestone going "guys, I may have conceived an idea most ingenious!" only to find out there's a reason no other model does it.
Anonymous No.106501240
>>106501099
This belongs to Anime Diffusion Thread.
Anonymous No.106501241 >>106501252
>>106501231
Donating to lodestones right now is basically throwing your money in a shredder.
Satisfying his curiosity on other's dime.
Anonymous No.106501252 >>106501291 >>106501795
>>106501231
>>106501241
to be fair, I want a future without VAEs, so lodestrone trying that PixNerd paper and see if it actually works seems like a good experiment
Anonymous No.106501260
I am going to fucking kill myself
Anonymous No.106501261 >>106501272
>>106501197
Anonymous No.106501269
>>106501116
LOL! seedream 3 (mogao) is one of the top models right now. there is no way they release v4 openly. and even that still looks behind nano banana.
Anonymous No.106501272
>>106501231
idk it's cool that he explores in the open. It's not like lode's a know-it-all- he's just curious.

>>106501261
ty much better
Anonymous No.106501291 >>106501312
>>106501252
I haven't used a VAE in like a year
Anonymous No.106501312 >>106501336
>>106501291
how?
Anonymous No.106501319 >>106501411
>>106501126
>Why haven't you trained that LoRA yet?
Been testing different settings. Adamw8bit + cosine seems to be the way to go. I don't know if OneTrainer knows Huber loss, would like to try with that. Chroma noise offset settings are also one mystery.
Anonymous No.106501335
>>106501231
I'm ashamed to say that but if I was on his shoes I would do the exact same shit, with a shit ton of money all I would do would be trying all the obscure papers and see if something sticks, there's probably gold in there
Anonymous No.106501336 >>106501345
>>106501312
idk just bein myself I guess
Anonymous No.106501338
Anonymous No.106501345 >>106501354 >>106501355
>>106501336
based
Anonymous No.106501354
>>106501345
vaesed
Anonymous No.106501355
>>106501345
I just tried it and it didn't do anything. I stopped when models started saying that they had the vae baked in.
Anonymous No.106501361 >>106501376 >>106501593
>>106501099
pow
Anonymous No.106501376 >>106501562
>>106501361
kek I was going to post that on twitter in a bit but this one is better, thanks
Anonymous No.106501411 >>106501462
>>106501319
>huber loss
I tried huber loss on diffusion pipe (quick code edit) a while back and it was worse. I only tested a few delta values, so ymmv
Anonymous No.106501462 >>106501508 >>106501788 >>106502741
>>106501411
Have you tried multiple concepts per lora? I get pretty terrible character bleed
Anonymous No.106501508 >>106502741
>>106501462
>multiple concepts per lora
No, but it sounds fun! I'll try that today. Maybe the bleed can make new people with desired attributes
Anonymous No.106501562 >>106501574 >>106501593
>>106501376
Anonymous No.106501574 >>106501603
>>106501562
can you get a jab from the left so we can see more of her face in pain?
Anonymous No.106501593
>>106501562
>>106501361
motion really highlights the downs syndrome features
Anonymous No.106501603
>>106501574
gotta go wagie
Anonymous No.106501788 >>106501834
>>106501462
Have you tried training separate loras and then merging them using some fancy concept-retaining (allegedly) merger like k-lora? Or even simple svd?
Anonymous No.106501795 >>106502552 >>106502611
>>106501252
>a future without VAEs
What did the vaes ever did to you?
Anonymous No.106501834
>>106501788
>Have you tried training separate loras and then merging them using some fancy concept-retaining (allegedly) merger like k-lora? Or even simple svd?
first time hearing about such thing
Anonymous No.106502045 >>106502821
>>106497484
Very nice, abtract geometric art is underrated
Anonymous No.106502095
>>106497872
>>106497892
>>106498098
These are really good, very cool style as well
Anonymous No.106502143 >>106502168 >>106502264 >>106502295 >>106502489
Friendship ended with chatterbox.
VIibeVoice is my new best friend.

https://files.catbox.moe/rhxshe.wav
Anonymous No.106502168
>>106502143
>https://files.catbox.moe/rhxshe.wav
Microsoft, what have you done?
Anonymous No.106502264
>>106502143
Can it do Microsoft Ashley voice?
Anonymous No.106502295 >>106502326
>>106502143
Can the moans work on any voice or does it require moans in the sample vocie?
Anonymous No.106502326 >>106502379
>>106502295
The sample voice I used didn't have any moans. Took a few gens though and tweaking the script.
Anonymous No.106502328 >>106502339
>>106500029
>it won't be too expensive
are you stupid?
chroma is a 8.9b model and cost near 150k+
qwen is 20b, the training would then also take a magnitude of time to train
Anonymous No.106502339 >>106502429
>>106502328
he doesn't need millions of images to save qwen image edit though, the model is solid, it just need a few examples to learn porn and how to make normal skin
Anonymous No.106502357 >>106502416 >>106502604
>>106500261
> space in url
> censored
never ever post here again
Anonymous No.106502379
>>106502326
Sick, time to gen some degen.
Anonymous No.106502397
i adore the fact that they tried to remove the model because it can degen so easily. if they hadn't, i would have never learnt about it.

thank you streisand effect
Anonymous No.106502416
>>106502357
>>106500261
>never ever post here again
this
Anonymous No.106502429 >>106502491
>>106502339
Yes, you need millions of real world images to remove the synthetic slop look of Qwen, it's arguably even more overtrained that Flux, with the exception of the 'flux chin'
Anonymous No.106502430 >>106502667 >>106504711
About to try vibe, which one is better so far? Both seem to have retarded issues.

https://github.com/wildminder/ComfyUI-VibeVoice
https://github.com/Enemyx-net/VibeVoice-ComfyUI
Anonymous No.106502489 >>106502771
>>106502143
sick, how did you prompt the moans?
Anonymous No.106502491 >>106502548
>>106502429
>you need millions of real world images to remove the synthetic slop look of Qwen
I don't think you need that many images though
https://civitai.com/models/1927710?modelVersionId=2181911
Anonymous No.106502499
>>106497525
>>106498922

workflow?
Anonymous No.106502548
>>106502491
But this is just a small scope lora for a specific look, with very limited variation and applied on a concept Qwen already knows well (non-nude human beings), it nowhere near a full finetune.

If you are happy with this then why are you clamoring for said full finetune of qwen to begin with ? Just download / train loras.
Anonymous No.106502552
>>106501795
eh compresses my image and doesn't afraid of anything
Anonymous No.106502588 >>106503383
uninstalled
Anonymous No.106502604 >>106502646
>>106502357
its takes 2 seconds to fix the url
Anonymous No.106502611
>>106501795
Lose detail, particularly in complex anatomy like hands
Anonymous No.106502646 >>106502656
>>106502604
and yet the videos are still censored
Anonymous No.106502656 >>106502725
>>106502646
Well I thought I would get offd for 3 days if they werent!
Anonymous No.106502667 >>106502683
>>106502430
VibeVoice-ComfyUI seems to be more updated but I have no idea if it's better.
I need to try it, the samples posted here are pretty titillating.
Anonymous No.106502683 >>106502728 >>106503397
>>106502667
the only thing i find interesting is the moans and being able to direct it more through prompts. quality wise it's pretty bland compared to rvc+xtts using alltalk
Anonymous No.106502725
>>106502656
fair. the jannies here are unhinged and ban based on mood.

anyone have a prompt guide for vibevoice? i can't find shit.
Anonymous No.106502728 >>106502744
>>106502683
so alltalk is better in everything except moans and lewd voices?
Anonymous No.106502741 >>106503275
>>106501462
>character bleed
Confirmed bleeding all over the place

>>106501508
>new people with desired attributes
It kinda happened. Excuse the cataracts from 512px training. It was a quick 1 1/2hr run just to see what happens
Anonymous No.106502744 >>106502792
>>106502728
imo yes, from the samples posted here.
actually no, rvc and xtts can't generate as long but eh.
Anonymous No.106502771
>>106502489
Just ohhh, mmm, hmm that kind of stuff. I think contextually the words in the script helped it figure out what I wanted.
Anonymous No.106502785
Baker?
Anonymous No.106502792 >>106502824
>>106502744
RVC requires a sample to convert and xtts tends to be extremely monotone. I find the emotive aspects of VibeVoice way better than xtts, not just the moaning and stuff, breathing, inflection of voice, that kind of stuff is better.
Anonymous No.106502799 >>106502855
Hmm...
Anonymous No.106502821
>>106502045
>Very nice, abtract geometric
thanks
Anonymous No.106502824
>>106502792
just finished downloading vibevoice.
i take everything i said back. vibevoice is gonna make me lose my balls.

it can flawlessly copy asmr voices without breaking.
it's so joever
Anonymous No.106502855 >>106502880 >>106502898
>>106502799
>black Miku with jewish nose
...
Anonymous No.106502880
>>106502855
she's mixed race kek
Anonymous No.106502898 >>106502921
>>106502855
It's not a Jewish nose. The prompt was:
>an illustration of a dark-skinned Hatsune Miku sitting at a table and reading a book. She has black African features, including a big flat nose. The book's title is just "DAS RITE KAPITAL". Behind her on the wall is the logo of the Black Panther Party.
And I meant a very different nose. Chroma just decided that that means the nose of a proboscis monkey.
Anonymous No.106502921 >>106502974
>>106502898
Anyway, I shouldn't have blown my VERY FUNNY "Das rite Kapital" joke. which apparently no one else has ever made, on this gen.
Anonymous No.106502974
>>106502921
Surely one to go down in the anals of history
Anonymous No.106503063
I miss the times when the collage had no videos. Thinking of becoming a new reviled personality on /ldg/.
Anonymous No.106503077 >>106503249
no bake?
Anonymous No.106503085
>>106498746
as one of the two participants of genjam 3, i would like to say that yes, i am down for another round
Anonymous No.106503249
>>106503077
be the change you want to see, faggot.
Anonymous No.106503275 >>106503425
>>106502741
>Excuse the cataracts from 512px training
Have you tried 768? 1024 kills my machine
Anonymous No.106503353 >>106503381
I thought there were schizo shills patrolling here 24/7?? Make a new thread, I have stuff to say.
Anonymous No.106503381
>>106503353
2min
Anonymous No.106503383
>>106502588
666% tranny
Anonymous No.106503397 >>106503466
>>106502683
I don't know what you are doing, but your posted example is much worse than what vibevoice 7b gives me
Anonymous No.106503408
>>106503402
>>106503402
Anonymous No.106503425
>>106503275
>Have you tried 768? 1024 kills my machine
Yes. 640 is decent too. I can go up to 1280 without block swapping on batch size 1 with float8 (24GB VRAM). Tested float8 with validation and it's within a 0.0001-0.0005 difference. The loss is so little that I just leave it on for flexibility
Anonymous No.106503466
>>106503397
wrong anon. i didn't post shit.
Anonymous No.106504711
>>106502430
where do you get the models from? looks like they're taken down from huggingface