← Home ← Back to /g/

Thread 106848716

317 posts 208 images /g/
Anonymous No.106848716
/ldg/ - Local Diffusion General
Copyright Safe Edition

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106844207

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Anonymous No.106848731 >>106848743 >>106848747 >>106848748
How does Illustrious keep winning without doing anything?
Anonymous No.106848736
Anonymous No.106848743
>>106848731
no weights no care
beyond that, basically all open weight anime models are useless beyond noob and variants right
not sure that's an illustrious win so much as an everyone else loss
Anonymous No.106848747
>>106848731
they didn't get the genius idea of fucking with the training halfway through and actually completed training unlike the competition
Anonymous No.106848748 >>106848756
>>106848731
lumina will save us
Anonymous No.106848754 >>106850187
Hello, **/ldg/** downloaded chroma1HDGGUFFP8_fp8ScaledHybridRev2, running on swarm, what do you think? I want a clean OC characters anime/2d gens.
What samplers, steps, cfg, loras or chroma versions work best?
Please, share your workflow if you got one!

In /sdg/ gave me some help >>106848657 but looking for more suggestions!
Anonymous No.106848756
>>106848748
how can it when everyone is fine-tuning an unfinished model? pre-training is the most important stage
Anonymous No.106848761
noob is fucking trash, lumina sucks and pony just committed suicide

illustrious reigns
Anonymous No.106848767
>Qwen image / edit
>Wan 2.2
>Chroma
>Neta / Lumina
>Illustrious / Noob / SDXL
>Pony

Localsisters ...
Anonymous No.106848783 >>106848867 >>106848987 >>106849211
>he hasnt tried yume v3.5
Anonymous No.106848798
Anonymous No.106848848
Sorry for the repost, I ended up at the end of the last thread…

Has anyone who’s tried Grok Imagine found even a moderately comparable comfy based workflow (using any GPU available on runpod)? New to wan and I can’t believe how good the grok results were for nsfw photo animation, but public results and moderation flakiness kills the concept… however for me it has set the bar and I wonder what’s even realistic to achieve diy. New to diffusion and been learning lora training for qwen and chroma and having good results with t2i but now that I’ve seen what grok can do with a single image I wonder whether just how close anyone can hope to come with available models. Default comfy wan 2.2 i2v on an L40S gave me some interesting results but not even in the ballpark of grok but that’s my current starting point for learning. Any tips appreciated
Anonymous No.106848859 >>106848931
why would i use neta lumina or noob when illustrious has everything already, i dont get it.
Anonymous No.106848864
>finally get the motion I'm after
>it fries the parts into plastic

REEEE
Anonymous No.106848867
>>106848783
It's better than base Illustrious 2.0 and certainly better than 0.1 / the other versions. The author finally cracked it, it seems.
Anonymous No.106848926
is nunchaku a meme for QIE?
Anonymous No.106848927 >>106849712
Anonymous No.106848931
>>106848859
illustrious doesnt have a chad vae
https://civitai.com/models/1790792/netayume-lumina-neta-luminalumina-image-20
Anonymous No.106848945
Anonymous No.106848987 >>106849000
>>106848783
just tried 3.5, the anatomy is still dogshit. this shit is worthless
Anonymous No.106848997
Anonymous No.106849000 >>106849028
>>106848987
post your prompt and settings and if youre lucky ill tell you what youre doing wrong
Anonymous No.106849007 >>106849071
I'm literally on drugs rn
Anonymous No.106849020 >>106849151
Anonymous No.106849028 >>106849039
>>106849000
>and if youre lucky ill tell you
nah, im not going to play gacha on whether you feel like helping or not, faggot.
Anonymous No.106849039 >>106849085
>>106849028
no worries ill accept your concession
keep at it tho i believe in you anon
Anonymous No.106849061
Anonymous No.106849071
>>106849007
model?
Anonymous No.106849085 >>106849117
>>106849039
you never planned to even remotely help even if i complied, bitchboy. i already deleted it too, dead model. i havent seen a single image made with lumina that looks good and you wont post one either since you know they sucks kek
Anonymous No.106849100
Anonymous No.106849106
Anonymous No.106849117 >>106849129 >>106849156 >>106849179 >>106849188 >>106849312
>>106849085
show me illustrious attempting to do 5girls all individually prompted as well as this
https://civitai.com/images/105196050
Anonymous No.106849129 >>106849133
>>106849117
yikes
Anonymous No.106849133 >>106849169 >>106849179
>>106849129
we all know illustrious couldnt get close to that without regional prompting even with the fuck ups kek
Anonymous No.106849151
>>106849020
this is what i want
Anonymous No.106849156 >>106849160 >>106849169
>>106849117
you cant GENUINELY fucking tell me you think this looks good
Anonymous No.106849160
>>106849156
lol i was thinking the same thing
Anonymous No.106849166 >>106849949
Anonymous No.106849169 >>106849272
>>106849156
>>106849133

you could post an illustrious version of 5girls but you know it wont even be close
Anonymous No.106849175
Anonymous No.106849179 >>106849194 >>106849263 >>106849268
>>106849117
>>106849133

SDXL will always be best because it is the most logical and directive model for tagging. These newer models with enormous tag plus prose prompts are tedious for editing. They are not logical. With SDXL I feel like I am using software with checkbox and sliders where "1girl, crouching, happy, yellow hair" gives me exactly that. With Neta or Chroma I have to prose slop it. I will never know the better way to prose it, whether verbose or direct or whatever.
Anonymous No.106849188
>>106849117
Okay now make yuri porn out of this. If you can't do this, it's shit.
Anonymous No.106849194 >>106849216
>>106849179
you dont need to use NLP with it. tags only works fine
>SDXL is the end all be all
kek
Anonymous No.106849211 >>106849256
>>106848783
wait so the faggot went back on the v4 designation? lol im dling to see if it's different than the v4 test I was playing around with
Anonymous No.106849216
>>106849194
>kek
Retard.
Anonymous No.106849232 >>106849256
By the time Neta gets as friendly and useful as SDXL, we will already have multiple models made open weight like NovelAI 4.5 with its style and character transfer tools and all this Neta stuff will be pointless
Anonymous No.106849236 >>106849278
Anonymous No.106849256 >>106849267 >>106849375
>>106849211
yeah its IMO the best version. substantially less overt "default style" as well
>This version is a pre-trained model (I’m not sure what to call it, but it’s basically a continuation of the previous work by the Neta team, using the Neta Lumina v1.0 model). To clarify further, versions 2.0 Plus and 3.0 were fine-tuned from this pre-trained model. My workflow involves using the best checkpoint from this pre-trained model at that time and fine-tuning it.
>>106849232
it already is, for me. youll have to wait for a shitmix though it seems
Anonymous No.106849263 >>106849550
>>106849179
>Chroma I have to prose slop it.
Just sentence or two with natural language, rest can be tags.
>traditional illustration of a sitting demon girl next to a tree, morrigan aensland from vampire \(game\). 1girl, demon girl, green eyes, long hair, green hair, large breasts, cleavage, leotard, pantyhose, bare shoulders, head wings, bat wings, low wings, purple wings, bat wings, bat print, bridal gauntlets, detailed skin texture, bursting breasts, river, brown oak tree, leaf, Expertly drawn and painted pin-up style artwork of a demon girl with wonderful details. Pastel colors in the background.
Anonymous No.106849267
>>106849256
fullres of pic here whoops https://civitai.com/images/105200664
Anonymous No.106849268
>>106849179
>With Neta or Chroma I have to prose slop it.
That's purely a flux problem, and old school t5 hybrid models. But mostly a flux problem. No recent models with good prompt comprehension has it, which includes HiDream (which gen extremely bad quality images, to be fair) and Qwen Image.

With recent models, natural words behave like your keywords/tags. If you write "a gorgeous angel with angel wings three meters right to that point with huge golden halo doing a middle finger", you will, actually, get an angel, with angel wings, three meters right to that point, with a golden halo, doing a middle finger. Crazy I know.

Flux is so bad in prompt understanding we used models to be able to purple prose Flux in just the right way in order to make it understand, but it's not a general problem. Just a Flux one.
Anonymous No.106849272 >>106849283 >>106849292 >>106851587
>>106849169
hmmm
Anonymous No.106849278 >>106849810 >>106850198
>>106849236
Anonymous No.106849283 >>106849290
>>106849272
post metadata and prove that was purely prompt
Anonymous No.106849290 >>106849325
>>106849283
https://files.catbox.moe/f5ki67.png
Anonymous No.106849292 >>106849298
>>106849272
That 4ch VAE looking real rough desu
Anonymous No.106849298
>>106849292
ah well I always use oekaki as a style tag which is the main offender I think
Anonymous No.106849312
>>106849117
it's an ambitious prompt
Anonymous No.106849325 >>106849343 >>106849348 >>106849377
>>106849290
i wont point out how you used a shitmix but for some reason i can only see the negative prompt in my text editor
Anonymous No.106849329 >>106849662 >>106850187
Anonymous No.106849335
i think i'm done with chroma, going back to flux
Anonymous No.106849343 >>106849354
>>106849325
Anonymous No.106849348
>>106849325
though this only worked since all these characters are frequently drawn together. I doubt i could get 5 characters from different franchises and especially not 5 OCs
Anonymous No.106849354
>>106849343
ishygdds
Anonymous No.106849375 >>106849419
>>106849256
wait 3.5 is completely different... lets see how many pissing korbos differ!!!!
Anonymous No.106849377
>>106849325
>i wont point out how you used a shitmix
When will anon learn that comparing slopmerges to actual tunes is retarded
Anonymous No.106849381 >>106850187
Anonymous No.106849419
>>106849375
it has an updated booru dataset as well. up to september
Anonymous No.106849463 >>106849472
I tested the latest (definitive?) Chroma 1 HD version of Chroma using the provided workflow, and the results where lackluster. But it's nothing like the last checkpoint I used, so I must be doing something wrong.
Anonymous No.106849464 >>106849573 >>106849607 >>106849767
actually. lodestone might have cooked here
Anonymous No.106849465 >>106849623 >>106850596
Is it possible to control lora influence in output in a gradual manner? Like, "Only start applying lora after x amount of steps" and things like that.
I tried searching around and couldn't find any info or nodes that would manage that, so I imagine it's just a thing that doesn't really work like that?
Anonymous No.106849472 >>106849483 >>106849503
>>106849463
Use Base or 2k
Anonymous No.106849479
Anonymous No.106849483 >>106849569
>>106849472
Yeah, downloading base now. I don't know how I got to that HD thing.
Anonymous No.106849503 >>106849517 >>106849651 >>106849919 >>106850187
>>106849472
I've been using Chroma-DC-2K-T2-SL4-Q8_0
Anonymous No.106849517 >>106849525 >>106849651
>>106849503
link? google and HF search is not helpful. I thought HD was the recommended version?
Anonymous No.106849525 >>106849651 >>106849919
>>106849517
damn I wonder if this is the version that was deleted
Anonymous No.106849533 >>106849597
always converge
Anonymous No.106849550 >>106849561
Is there something like an llm node that will cocnvert a flux style prompt to a tag based prompt?

>>106849263
Oh, that's good to know tyyy
Anonymous No.106849556
I like the new neta yume
Anonymous No.106849561
>>106849550
Any decent instruct llm will do that for you.
Anonymous No.106849569
>>106849483
the documentation says to use it, that's how i got there.
Anonymous No.106849573
>>106849464
Resembles Chris-chan quite a bit. If Chris-chan was a milf.
Anonymous No.106849576 >>106849585 >>106849588 >>106849957
lmao, Sora Pro killed everything >>>/wsg/5994477
Anonymous No.106849585
>>106849576
Just pay $2 per gen
Anonymous No.106849588
>>106849576
It's impressive for what AI can do right now, but if that was a real anime, I wouldn't get past that shit opening.
Anonymous No.106849597
>>106849533
I still believe
Anonymous No.106849604
Anonymous No.106849607 >>106849671
>>106849464
this looks like the most stereotypical midwestern woman ever
Anonymous No.106849623 >>106849666
>>106849465
if you do a two step sample this is trivial to do
Anonymous No.106849651 >>106849674
>>106849525
>>106849517
>>106849503
https://huggingface.co/silveroxides/Chroma-Misc-Models/tree/main
The memeversions are in a separate repo
Anonymous No.106849660 >>106849698
Just forget about Chroma since he's moving on to Qwen now
Anonymous No.106849662
>>106847834
awesome miku!

>>106849329
the looks work here too
Anonymous No.106849666 >>106849680 >>106849976
>>106849623
Doesn't doing 2 step + 2step samplers work completely differently than 4steps though? As in, it doesn't actually process the image the same and would give a botched result
Anonymous No.106849671 >>106849834
>chroma 60%, flux 40%, flux facedetailer
i can live with this, except the weird haloing

>>106849607
raised on tater tot casserole
Anonymous No.106849674
>>106849651
>The memeversions are in a separate repo
It's the best version so far, meme or not
Anonymous No.106849680 >>106849834 >>106849855
>>106849666
nah i do it all the time, literally what i'm using with these images. just have to make sure if the VAE changes you reencode the image to the correct one
Anonymous No.106849687 >>106849761
Anonymous No.106849698
>>106849660
the situation is funny though
>Pony fag: "Hey lodestone, want to join us and work on V8? we won't use Chroma btw, your finetune is shit, Qwen is the perfect candidate"
>Lodestone: "AWESOME, CONSIDER ME IN"
Anonymous No.106849712 >>106849825
>>106848927
I'll never understand Americans linking watermelons to black people
Anonymous No.106849761
>>106849687
cool
Anonymous No.106849767 >>106849822 >>106849834 >>106849855
>>106849464
korn fans used to be younger, damn
Anonymous No.106849790 >>106849810
Anonymous No.106849810
>>106849790
>>106849278
Anonymous No.106849822
>>106849767
everyone used to be younger, anon
Anonymous No.106849825 >>106849897 >>106850496 >>106851846
>>106849712
Because people have eyes? Black people unironically disproportionately like grape juice, fried chicken and watermelon. Anti-racists are such fags, you can't see anything with your eyes you're so mind broken.
Anonymous No.106849834 >>106849845
>>106849671
>>106849680
>>106849767

Their grandfathers lost the Winter war
Anonymous No.106849842 >>106850636
https://github.com/dvlab-research/DreamOmni2?tab=readme-ov-file
the lora has been released btw
https://huggingface.co/xiabs/DreamOmni2/tree/main
Anonymous No.106849845
>>106849834
>Winter war
Liberate Petsamo !
Anonymous No.106849851
Harry x gta6
Anonymous No.106849855 >>106849976
>>106849767
jonathon davis is 54 anon

>>106849680
example workflow
https://files.catbox.moe/55ag6j.png
Anonymous No.106849858
Anonymous No.106849875 >>106849890 >>106849940
>3 Oct, 2025: We've combined DualParal with the Wan2.2-T2V-A14B model.
https://github.com/DualParal-Project/DualParal
Anonymous No.106849890 >>106849967
>>106849875
https://dualparal-project.github.io/dualparal.github.io/
this is so ass, change the moment of the video several time and the color changes drastically everytime
Anonymous No.106849897 >>106849907
>>106849825
>disproportionately like grape juice, fried chicken and watermelon
That's just southerners in general you mouthbreather. The whole working south run on fried chicken and Grapico, and if you're not spitting out watermelon seeds at least a couple of times a year, you're not living.
>you can't see anything with your eyes you're so mind broken
You can't see anything with your eyes because they're too close together and misaligned because you were born retarded,
Anonymous No.106849907 >>106849916 >>106849920
>>106849897
yeah you're a mind broken leftists that can't see reality any more because you're terrified of being called a racist, no need to reply, you're a fag and you'll die from your brain rot
Anonymous No.106849916 >>106849922
>>106849907
chill
Anonymous No.106849917 >>106849939 >>106849966 >>106849967 >>106850096
Will Alibaba save us again?
Anonymous No.106849919 >>106850002
>>106849525
QRD?

>>106849503
Is this the 2k version?
Anonymous No.106849920
>>106849907
>everyone whos not retarded is a leftist
Anonymous No.106849922
>>106849916
no, these people keep shooting people over believing everyone is a racist nazi, I'll chill when they're removed from all social media
Anonymous No.106849925
Anonymous No.106849935 >>106849978
@106849922
d*bo
Anonymous No.106849939 >>106849957
>>106849917
only if they realise that their SaaS stuff is only relevant if it can beat or match western SaaS
that shit they pulled with Wan 2.5 left me assblasted
especially when Sora, Grok Aurora/Imagine, etc. dropped that totally mogged it
Anonymous No.106849940 >>106849967
>>106849875
i am not even sure this actually works?
Anonymous No.106849949
>>106849166
Very cool
Anonymous No.106849957
>>106849939
to be fair I can see them giving up making video models, their only goal was to be SaaS Competitive, now that Sora exists, might as well give up, the mountain is way too high >>106849576
Anonymous No.106849966
>>106849917
yus long live china
Anonymous No.106849967 >>106850006 >>106850015 >>106850049
>>106849890
>>106849940

kek, yeah it just looks like last frame method. there's more wan goodies but nothing usable has yet been released for comfy :(

https://github.com/TencentARC/RollingForcing
https://github.com/NVlabs/LongLive
https://github.com/dc-ai-projects/DC-VideoGen
https://github.com/mit-han-lab/radial-attention
https://github.com/dvlab-research/Jenga

>>106849917
>Wan2.5 5b model
Anonymous No.106849976
>>106849666
nice trips, but also this link is for you
>>106849855
Anonymous No.106849978
>>106849935
/d.bo/is
/@\d/is
Anonymous No.106849983 >>106850059
Anonymous No.106850002 >>106850083
>>106849919
>Is this the 2k version?
yeah, only version I use
Anonymous No.106850006 >>106850081
>>106849967
>nothing usable has yet been released for comfy :(
can't comfy implement that by himself?
Anonymous No.106850014 >>106850023 >>106850078 >>106850092
Anonymous No.106850015
>>106849967
DC-VideoGen seems quite interesting if it is gennerally working, but yes I can imagine there's really not enough time or motivtion to implement all the stuff people come up with.
Anonymous No.106850022
Gemini or JoyCaption for wan captions?
Anonymous No.106850023
>>106850014
Shieet
Anonymous No.106850049
>>106849967
radial attention too, that seems very useful if it works as indicated

people come up with some cool stuff. thanks for the linkage.
Anonymous No.106850052 >>106850064
https://www.reddit.com/r/SoraAi/comments/1o2z011/sora_2_overhyped_and_underdelivers_while_wan/
the fuck is this cope? even the ledditors don't fall to that bullshit on the comments lmao
Anonymous No.106850059 >>106850198
>>106849983
cute
Anonymous No.106850064 >>106850073
>>106850052
Sora won't ever show you a blow job and $2/meme seems like a steep price for anyone that isn't a Youtube grifter and even they don't make anything themselves, they just steal other people's posts.
Anonymous No.106850073 >>106850233
>>106850064
>they just steal other people's posts.
ironic, don't forget Alibaba stole billions of videos to make Wan, I won't die on that hill, that would make me hypocritical
Anonymous No.106850076 >>106850089 >>106850368
is it just me or is comfyui's subgraph feature completely broken, inconsistent, and unusable?
Anonymous No.106850078
>>106850014
Melancholy.
Anonymous No.106850081
>>106850006
not anymore. comfy just pushes PRs nowadays and increments the version. the project is in bloat freefall constantly and the runtime fps keeps getting hit constantly. the new frontend being "faster" is a complete lie since they didn't even fix the fps counter. how the fuck can you claim it's faster if you have no receipts? saying it's an OS is the most delusional thing as well. nobody knows what the fuck to do with the repo anymore and it just keeps getting worse
Anonymous No.106850083 >>106850103
>>106850002
this is a cool one, anon
Anonymous No.106850089 >>106850151
>>106850076
seems to work for me.
Anonymous No.106850092
>>106850014
Fear of a Snack Planet.
Anonymous No.106850096 >>106850127
>>106849917
Please be a music gen model. Those who tried using Qwen-Omni and uploaded real songs for it to describe know what I am talking about.
Anonymous No.106850103
>>106850083
Syurptitous.
Anonymous No.106850127 >>106850189
>>106850096
for those of us who didn't, what do you mean?
Anonymous No.106850151 >>106850206 >>106850223
>>106850089
maybe its an issue related to the CR upscale image node then, but nothing seems to go wrong when i use it outside a subgraph so its weird
basically the links between inputs and the nodes inside seem to get jumbled because when i made this subgraph at first it worked just fine, but when i drag in the workflow again and try to edit the upscale factor it gives me the upscale mode options instead, even though those inputs are correctly linked inside the subgraph
Anonymous No.106850187 >>106850260
>>106849503
>>106849381
>>106849329
Can you share your metadata? Getting into Chroma >>106848754 and learning different prompt and setup methods. Your gens are clean and sharp, you seem experienced. Would appreciate if you share it!
Anonymous No.106850188
Anonymous No.106850189 >>106850246
>>106850127

It can describe songs with insane accuracy, it knows its instruments, genre/style, bpm, it can timestamp parts of the song (chorus, bridge, verses etc), can transcribe song lyrics accurately even knowing where the chorus etc are, knows which parts of the song certain instruments kick in etc

Try it yourself any audio under 3 min on:
https://chat.qwen.ai/

They can easily train a Suno tier model with the data they likely have that they used to train Qwen-Omni.
Anonymous No.106850198
>>106850059
>>106849278
Anonymous No.106850206 >>106850223
>>106850151
also half the time i cant even leave the subgraph view because escape stops working
Anonymous No.106850223 >>106850301
>>106850151
nothing jumbled here

>>106850206
also I can leave via escape and clicking the parent graph. is the frontend package up to date and the browser nothing too crazy? i don't really know what's going on on your end tho
Anonymous No.106850227 >>106850236 >>106850254 >>106850317 >>106850326
Guess which model
Anonymous No.106850233 >>106850244
>>106850073
>you watched a Disney movie once so you can't ever make animated movies
argument
If you don't understand the difference between copy and pasting a movie and learning to draw your own movie you can't be helped
Anonymous No.106850236 >>106850243 >>106850247
>>106850227
pony v7 (some people waited 2 years for this btw)
Anonymous No.106850243
>>106850236
must be what nigbo and Nick are using
Anonymous No.106850244 >>106850253
>>106850233
sora can literally recreate the cowboy bebop music Opening but go on king >>>/wsg/5993868
Anonymous No.106850246
>>106850189
Every AI company tips their hand based on what they release as tools, even Sora 2 was obvious after ChatGPT released Whisper showing they were working hard on accurate transcriptions. Even Dalle-3 was preceded by a decent VLM (closed) model.
Anonymous No.106850247
>>106850236
Yeah! ^^
Anonymous No.106850253
>>106850244
And it can also reimagine Cowboy Bepop's opening theme as an opera bad faith shill. :)
Anonymous No.106850254 >>106850272 >>106850288 >>106850318 >>106850324
>>106850227
Part2 guess which model^^
Anonymous No.106850260 >>106850276 >>106850296 >>106850302
>>106850187
too lazy to clean workflow but this is what makes Chroma gens clean and sharp. ignore tile size setting
Anonymous No.106850272
>>106850254
reading those comments is so satifying desu, it's like seeing the bad guy lose on a movie, that never gets old
Anonymous No.106850276 >>106850292
>>106850260
Thanks, prompts same as your earlier posts? short sentence + tags format?
Anonymous No.106850288 >>106850303 >>106850308 >>106850329
>>106850254
>noooo you don't get it, it took him 2 years to make that model but it's unfinished, just 2 more weeks bro ;-;
Anonymous No.106850292
>>106850276
yeah, check civitai for prompts
Anonymous No.106850296
>>106850260
Model? Any loras ? any tags to give 2d that quality?
Anonymous No.106850301
>>106850223
pulled after the epsilon scaling feature came out and that made my group node malfunction so i changed it to a subgraph
ill try updating all the packages and see if that does anything
Anonymous No.106850302 >>106850354 >>106850354
>>106850260
nta, how long does it take you to gen? it takes me 70 seconds with upscaling on a 5090 for chroma.
Anonymous No.106850303
>>106850288
>he doesn't know that all the hate is just the petra spammer
Anonymous No.106850308 >>106850312 >>106850338
>>106850288
>ts
I'm seeing this everywhere now. Are people too fucking lazy to write "this" or does it mean something else?
Anonymous No.106850312
>>106850308
it means tiny sneed
Anonymous No.106850317 >>106850327
>>106850227
100% chroma. I'll gen in 6 batches at a time and 2 of 6 of them are mangled just like this or even worse, kek
Anonymous No.106850318 >>106850341
>>106850254
Can you do the same with a Chroma model?
Anonymous No.106850324
>>106850254
This is when you include Iodestone in your work team
Anonymous No.106850326
>>106850227
it's unfortunate that the AuraFlow training didn't work out, they maybe kept trying a bit too long instead of trying other models.
Anonymous No.106850327 >>106850507
>>106850317
no it's pony v7
https://civitai.com/images/104983866
Anonymous No.106850329
>>106850288
It's wild when you see lots of new models coming out training on a tenth the resources and trained in several months. In the time he's stalled he could've trained a full model from scratch, even Auraflow was trained relatively successfully in a fourth the time.
Anonymous No.106850337 >>106850714
This is a troll upload, right? He HAS to know the lora is fucked, right?
Anonymous No.106850338
>>106850308
as I understand it the original meaning was negroidspeak for "this shit" but even that was beyond certain zoomers, who use it to simply mean "this"
Anonymous No.106850341
>>106850318
the civitai feedback for chroma is good, same for neta-yume lumina IIRC
Anonymous No.106850344
Anonymous No.106850354 >>106850360 >>106850367
>>106850302
this took 147 sec with that i2i workflow. 4070ti super (16gb), no optimizations. With sdxl resolutions it takes way less time

>>106850302
using gta6 lora I trained
Anonymous No.106850360 >>106850370
>>106850354
Whih model are you using? Chroma 2k?
Anonymous No.106850367 >>106850374
>>106850354
1girl, full body , smiling, looking at viewer, beach
Anonymous No.106850368
>>106850076
It is buggy for sure, but right this second it is more useful than not. I have encountered a couple of persistent bugs with placing certain custom nodes in a subgraph, how it interacts with bypasses etc. But for all I know, it is a bug with the custom nodes and not the subgraphs themselves, because you can get things like Impact Switches just suddenly deciding to not take inputs if too many switches already exist etc.
Anonymous No.106850369 >>106850393
are all the flash chroma checkpoints supposed to look like shit? i'm using the recommended 8 steps with heun and they come out horribly.
Anonymous No.106850370 >>106850375
>>106850360
https://huggingface.co/silveroxides/Chroma-Misc-Models/blob/main/Chroma-DC-2K-T2-SL4/Chroma-DC-2K-T2-SL4-Q8_0.gguf
Anonymous No.106850372
I bought a 6000 blackwell for 1girl SDXL slop. I'm built different.
Anonymous No.106850373 >>106850385 >>106850542
Anonymous No.106850374
>>106850367
1faggot, will never be a real woman, typing a comment
Anonymous No.106850375
>>106850370
Thanks!
Anonymous No.106850377 >>106850398
If you used pony at any point (even the """good""" version) you are worth less to me than cloudkeks
Anonymous No.106850385
>>106850373
Kino
Anonymous No.106850390 >>106850414
all this slop will be forgotten tomorrow, like tears in the rain
Anonymous No.106850393 >>106850452
>>106850369
i think one anon managed to pick some subject matter and settings where it's ok but most of the (not that many) chroma users including me don't use them

probably just use a non-flash checkpoint if it doesn't work for your prompt
Anonymous No.106850398
>>106850377
quietly devastated right now
may not recover
tell my 1girl I love her
Anonymous No.106850414
>>106850390
Honestly, that itself is kind of hot. Thousands upon thousands of women needed to take their clothes off on the internet for me to produce my own endless stream of personalized private porn catered specifically to my tastes. It's unholy, genuinely some kind of social catastrophe, and it is so hot.
Anonymous No.106850452
>>106850393
yeah, im just using the non-flash ones, i was just curious if i was missing something but it seems to be the general experience.
Anonymous No.106850470 >>106850477 >>106850509
Why these generals got all branched, sdg, ldg, animedg, etc
Anonymous No.106850477
>>106850470
for shits and giggles
Anonymous No.106850484 >>106850535
Anonymous No.106850490 >>106850506
anything new in the local scene aside from qwen edit
Anonymous No.106850496
>>106849825
I'm not American, the watermelon thing is very American.
Anonymous No.106850498
Anonymous No.106850506
>>106850490
nope
Anonymous No.106850507
>>106850327
Cant view it, I live in united englanistani but I believe you.
Anonymous No.106850509 >>106850551
>>106850470
sdg was cancer so ldg was made
cancer from sdg tried to split the general multiple times to kill ldg. at one point there was landscape ldg, realism ldg, etc. now there's just the anime one which is barely alive and only used by like 3 anons.
Anonymous No.106850535 >>106850549
>>106850484
that's a cool effect, what did you use? or catbox if its easier and you dont mind
Anonymous No.106850542
>>106850373
When did hasan piker get a new pet?
Anonymous No.106850549 >>106850590
>>106850535
'dark theme' and oekaki
Anonymous No.106850551
>>106850509
Anime general was likely made by anons from russian 2ch's /ai/, it even had their info in the first post initially.
Anonymous No.106850590 >>106850613
>>106850549
Yours looks a lot more painterly, are you using a specific lora? Mine came out more scary with those words.
Anonymous No.106850596
>>106849465
Should be possible. Something like keyframes,
Anonymous No.106850613 >>106850707 >>106851118
>>106850590
prompt was:
1girl, skinny, solo, bikini, wide-eyed, wavy mouth, full body, looking at viewer, beach, oekaki, impressionism, painterly, dark theme, jaggy lines, oekaki
with negative: shiny skin, simple background
5.5 cfg. using my own unreleased mix. 28 steps, euler beta
the style is wildcarded so there was two oekaki
Anonymous No.106850636 >>106850659
>>106849842
anyone tried this lora? does it make kontext better than qie?
Anonymous No.106850659 >>106850677
>>106850636
I'm not sure it's that simple, it's also using the text encoder Qwen VL (wheras Kontext classic uses T5) so...
Anonymous No.106850677 >>106850796
>>106850659
I was checking the pipeline, it's still using the dual clip (t5+clip l), it's using the VLM to encode the input OR videos but yes, this needs a dedicated node in comfy most likely. I downloaded the weights at least
Anonymous No.106850707 >>106850719
>>106850613
I keep getting a fucking spotlight on the character kek. Thanks for the prompt, i'll mess around with it.
Anonymous No.106850714
>>106850337
he likes 'em skinny what can he say
Anonymous No.106850719 >>106850760
>>106850707
tomoko?
Anonymous No.106850737
Anonymous No.106850752 >>106850780
What's the state of NovelAI compared to local gen UIs like (re)Forge and Comfy? Many an AI Discord server seems fond of aggressively shilling the former, (NovelAI). I haven't used it since early v3.
Anonymous No.106850760 >>106850769
>>106850719
>tomoko?
ye, wanted to see if i could get it like yours where she was barely visible. i'm just going to force it with a lora.
Anonymous No.106850769 >>106850778
>>106850760
I'll join in the dark ladies at the beach genning once I finish this batch of 20+ images of girls pissing in a toilet
Anonymous No.106850778 >>106850798
>>106850769
I got it to work. Your pissgirl's right leg is FUCKED bro.
Anonymous No.106850780 >>106851152
>>106850752
Kill yourself.
Anonymous No.106850796 >>106850805
>>106850677
lool, now you have to load t5 + qwen vl? this is ridiculous, that's why it was a dumb idea to go for kontext, QIE has qwen vl naturally
Anonymous No.106850798 >>106850828
>>106850778
man im not sure I like this new neta model desu, I have way more fucky anatomy compared to testv4 or v3, generally less stuff MAKES sense, like I'm genning a lot of ladies with panties on the side but still wearing panties, or fused fingers way more than before. or even the pic I posted you see the fucking background? like it compeltely lost cohesion. I'll experiment a bit more, but I might jump back to testv4
Anonymous No.106850805
>>106850796
doesnt really matter that much desu, you'll be parking the models to ram anyway... you have at least 96gb of ram right?
Anonymous No.106850809
Anonymous No.106850826
Anonymous No.106850828
>>106850798
i usually just drop the model right away if there's too many issues with anatomy. im just sticking to wainsfwv14 for the time being since 15 gives me too many issues with hands. good luck with your piss adventures.
Anonymous No.106850870 >>106850917
Howdy faggots.
Has anything new dropped in the last two weeks?
Anonymous No.106850881 >>106850891 >>106850943
Can someone explain GGUF/quantization to me as if I'm a drooling retard? I get that it makes it easier to run for lower end hardware, is there any benefit to using it if you have 24gb of vram? I've never had to use it so I've never had to look into it, but I'm curious if I've been missing out on cozy genning speeds
Anonymous No.106850891 >>106850979 >>106851019
>>106850881
If you can load the full or fp8 version of the model there's no need to use quants
Anonymous No.106850894
Anonymous No.106850917 >>106850935 >>106850944
>>106850870
>Has anything new dropped in the last two weeks?
Check your pants.
Anonymous No.106850935
>>106850917
>Check your pants.
whoa, what was lumina doing there?
Anonymous No.106850943 >>106851019
>>106850881
Quantization: if you round the numbers that make up the model, then you sacrifice some precision (the decimal places you rounded away) in exchange for using less space and therefore a speed increase.
GGUF: run on CPU instead of GPU

If you have 24 gb VRAM, you don't really need either, except maybe quants for increasing batch size.
Anonymous No.106850944 >>106850962 >>106850990 >>106851016
>>106850917
I'm like 12, so they still haven't dropped yet...
But I'll have you know I've been posting here since I was 1 years old.
>that's how I know all the jokes.
Anyways, what's new?
Has anything surpassed illustrious yet?
Anonymous No.106850954
nothing will ever surpass illustrious
Anonymous No.106850962
>>106850944
>Has anything surpassed illustrious yet?
for some anon, yes. others will have to wait for a jeetmix unfortunately.
Anonymous No.106850969 >>106850978
nothing will ever surpass NTR mix
Anonymous No.106850970
Anonymous No.106850975
Anonymous No.106850978
>>106850969
that thing sure breasts boobily
Anonymous No.106850979 >>106851122
>>106850891
q8 is better than running fp8.
Anonymous No.106850990
>>106850944
>Has anything surpassed illustrious yet?
not overall

but for specific uses sure - wan, chroma [radiance], qwen [image edit], neta yume lumina and if you can run it on your H100/H200 hunyuanimage3.0 are better at various things
Anonymous No.106851000
Anonymous No.106851006 >>106851015 >>106851160
nobody will read this so ill just say it, i havent even tried to use noob because i dont know what the heck epsilon and v-pred means
Anonymous No.106851015
>>106851006
Hecking wholesome nigger chungus
Anonymous No.106851016
>>106850944
>Has anything surpassed illustrious yet?
No. Newer models may have better prompt adherence but they're not fine tuned for anime and therefore lack the vast knowledge XL finetunes have, not to mention the near infinite loras XL has for it.
Anonymous No.106851019
>>106850943
>>106850891
Thanks doodz. So no speed gains, just strictly space occupied in vram
Anonymous No.106851021 >>106851035
You should be asking has anything surpassed Noob, not illustrious.
t. day 0 XL user and day 0 illustrious 0.1 user
Anonymous No.106851035
>>106851021
Googah!
t: caveman who made the paintings
Anonymous No.106851071 >>106851226
what noob should i be using?
Anonymous No.106851084 >>106851100
Anonymous No.106851100
>>106851084
shuma nigerath
Anonymous No.106851118
>>106850613
Anonymous No.106851122 >>106851149 >>106851161
>>106850979
youre meaning the degradation of q8 is less than fp8, correct?
Anonymous No.106851124 >>106851139 >>106851213 >>106851246
>local thread free and open source
>gatekeeps his workflow
>gatekeep his lora
Anonymous No.106851131 >>106851139
lilbro cannot stand koff posting in the blessed thread of frenship
Anonymous No.106851139
>>106851124
nicholas is a reject from /sdg/ as debo kept posting one random image in the OP and not nick fe's works
>>106851131
>MrCatJak
Anonymous No.106851149 >>106851161
>>106851122
yes, q8's quality is closer to fp16 than fp8.
Anonymous No.106851152
>>106850780
I just want to weigh my options.
Anonymous No.106851160
>>106851006
v-pred version changed stuff under the hood of SDXL and you need to pick settings that work

the images are often like they have a wider color gamut
Anonymous No.106851161
>>106851149
>>106851122
The AI has spoken. Q8 is better.
Anonymous No.106851178
Anonymous No.106851213
>>106851124
he just tried the alibaba style kek
Anonymous No.106851226
>>106851071
chads use vpred 1.0 while those who require a bit more handholding use... i dunno cyberfix or something
Anonymous No.106851239 >>106851413
There are so many samplers these days. I am completely at a loss as to what is best so I just keep using euler. People make comparisons but it all seems so fickle and unreliable. I tried res 2s / bong_tangent as people were saying that was best with newer models and was surprised at how consistent the image was from seed to seed but I don't know if that's a good thing actually.
Anonymous No.106851240
Anonymous No.106851246 >>106851354
>>106851124
I haven't seen a non-template workflow posted here in months. i dont think anyone shares unique workflows anymore.
Anonymous No.106851308 >>106851318
How is NoobAI better than Illustrious?
Anonymous No.106851313 >>106851624
You know what to do.
Anonymous No.106851318 >>106851336 >>106851340
>>106851308
the king is NAI
Anonymous No.106851336 >>106851357
>>106851318
Yes, NoobAI
Anonymous No.106851340 >>106851357
>>106851318
>NAI
uhh, lilbro, that's what unc said
Anonymous No.106851354
>>106851246
Whenever I do anon asks for the dozen or so nodes I wrote myself and even then he can hardly make heads or tails of my spaghetti kino
Anonymous No.106851357 >>106851382
>>106851340
>>106851336
I meant NovelAI
Anonymous No.106851367 >>106851393
https://files.catbox.moe/v1pnmx.png
Anonymous No.106851382 >>106851407
>>106851357
Why is the king begging in the local models thread? Are you too much of a pussy to go against OpenAI or Google?
Anonymous No.106851393
>>106851367
Anonymous No.106851405 >>106851427
NAI is the king of sending your cunny promptos to the alphabet boys.
Anonymous No.106851407
>>106851382
who rattled your cage?
Anonymous No.106851413
>>106851239
feel free to run huge test series to get your statistical evaluation, but it'll probably kind-of vary by model anyhow

simply use some of the samplers you think are good?
Anonymous No.106851418 >>106851445 >>106851458 >>106851460
AI will never replace 3D artist :)
https://x.com/gleb_alexandrov/status/1976382622688543215
Anonymous No.106851427
>>106851405
A, B, C
Easy as 1, 2, 3
Or simple as Do-Re-Mi
A, B, C, 1, 2, 3, baby, you and me, girl!
Anonymous No.106851434
what's the recommended version of python for a stable experience genning wan videos of comfy?
Anonymous No.106851445 >>106851458
>>106851418
I genuinely hope the 2d to 3d ai pipeline gets perfected in the coming years because i have a few vidya ideas i really want to try but i'm stuck at base prototyping because making custom assets is costly.
Anonymous No.106851448 >>106851459
There used to be something called GauGAN on Nvidia's playground. I loved how ugly it was. Nvidia's Canvas is too "good" -- but every time I try to get one of these numerous GauGAN repos working, I fail miserably. Any advice? Anyone do this successfully?

Altneratively, in the absence of having an RTX card, is it possible to install Canvas on a cloud server?
Anonymous No.106851458 >>106851493
>>106851418 >>106851445
for the sake of discussion: qwen image edit, kontext, wan and others *already* usually generate the other perspectives you can't see

the techniques to turn it into a textured 3d model are not high resolution enough, but that probably won't be that way for long. maybe it already isn't with a H100/H200.

that said xitter is retarded
Anonymous No.106851459
>>106851448
Understand that I don't have much to offer you boys but hopefully karma will pay off
Anonymous No.106851460
>>106851418
i find it amusing that, in order to prove no ai use, 3d artists will post very highres images with multiple angles which in turn makes it easier to train on their style. or theyll post the unbaked versions which also helps in training.
desu no one sane is saying itll replace them, just as digital photography never replaced analog.
Anonymous No.106851475
When ready

>>106851472
>>106851472
>>106851472
>>106851472
Anonymous No.106851493
>>106851458
there are still problems with consistency and while it is impressive from a hobbyist perspective it looks like shit compared to having objects in blender. I feel like endgame AI image/video gen will be a frontend agent for a 3d engine that will build a scene with AI generated objects/textures/shaders etc and just render it. 100% consistency, infinite length, much lower compute/memory requirements.
Anonymous No.106851587
>>106849272
>Koakuma
Not enough love for her
Anonymous No.106851624
>>106851313
I want to become hentai creator. Give me some good free A.I. to convert pictures to video. No programing, just simple creator for monkey.
Anonymous No.106851740 >>106851808 >>106852671
Anonymous No.106851808
>>106851740
what a goofy resolution
Anonymous No.106851831
Anonymous No.106851846
>>106849825
That is jus negro american thing, here blacks like cocos and fish
Anonymous No.106851847
The difference between NetaYume 3.0 and 3.5 is a bit harder to define than 2.0 Plus vs 3.0, but I do think 3.5 is another modest improvement overall. Main thing I've noticed is eye proportions for both male and female characters make a bit more sense in 3.5, and it adds some nice relevant details in appropriate contexts where 3.0 didn't, like the sword here. Prompt (sans boilerplate / neg) was just `masterpiece, best quality, very aesthetic, a 2d digital anime illustration of a samurai warrior in traditional armor, standing in a cherry blossom garden.`
Anonymous No.106852486
Anonymous No.106852671 >>106852777
>>106851740
>yap yap yap yap yap
Anonymous No.106852777
>>106852671
always.. i hate that so much
Anonymous No.106852787
Anonymous No.106853511
y so ded