Discussion of Free and Open Source Text-to-Image/Video Models
Prev: >>105992236
https://rentry.org/ldg-lazy-getting-started-guide
>UI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows/home
>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe
>WanX (video)
Guide: https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1
>Chroma
Training: https://rentry.org/mvu52t46
>Illustrious
1girl and beyond: https://rentry.org/comfyui_guide_1girl
Tag explorer: https://tagexplorer.github.io/
>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate
>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg >>>/b/degen >>>/b/realistic+parody >>>/gif/vdg >>>/d/ddg >>>/e/edg >>>/h/hdg >>>/trash/slop >>>/vt/vtai >>>/u/udg
>Local Text
>>>/g/lmg
>Maintain Thread Quality
https://rentry.org/debo
posting in the real thread
>>105998704Please keep your trip on.
>>105998854Can it not do pantyshots?
keep this thread going for 5 days like /dmp/
Blessed thread of frenship
now that's an okay collage, here's a bump, even if I'm probably only going to lurk
i wanna see all the gens submitted to the gen jam thus far desu
>>105998854i cant find the i2v gguf of this model
>>105999197aniwan doesn't have gguf. Use the load diffusion model node
>>105999197>>105999210They're here:
https://www(DOT)modelscope.cn/profile/DesignerAZ
>>105999197There isn't one as far as I know, only a t2v q4ks. Kinda strange.
>>105999237ai is dominated by china tho
>>105999247And they still use HF and github anyway
>>105999221what will the CCCP do with this info now?
>>105999221
>install our program with a command to download a file
Well let me know when someone rehosts
Wan 2.2 waiting room. Reminder they said on Discord it would be released by the end of the month
Using the tutorial workflow, my gens get really strange somewhere between 8 and 16 gens in when I leave it running overnight.
Either unloading the model or clearing the model and node cache fixes it. Am I supposed to automate this?
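One way the periodic cleanup could be automated is sketched below. It assumes a local ComfyUI server on the default address 127.0.0.1:8188 that exposes a POST /free endpoint accepting "unload_models" and "free_memory" flags; check your own install before relying on it. The interval of 8 gens and the helper names are hypothetical, picked only to match the range mentioned above.

```python
import json
import urllib.request

# Assumption: a local ComfyUI instance on the default address.
COMFY_URL = "http://127.0.0.1:8188"


def should_free(gen_index: int, every: int = 8) -> bool:
    """True after every `every`-th completed generation (1-based count)."""
    return gen_index > 0 and gen_index % every == 0


def build_free_request(base_url: str = COMFY_URL) -> urllib.request.Request:
    """Build (but do not send) a POST /free request asking the server
    to unload models and free cached memory."""
    payload = json.dumps({"unload_models": True, "free_memory": True}).encode()
    return urllib.request.Request(
        f"{base_url}/free",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def free_memory(base_url: str = COMFY_URL) -> int:
    """Send the request and return the HTTP status code."""
    with urllib.request.urlopen(build_free_request(base_url), timeout=10) as resp:
        return resp.status
```

In an overnight queue script you would call free_memory() whenever should_free(n) is true for the n-th finished gen. The next gen then pays the model reload cost once.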
>>105999267This let me DL directly
>>105999221This is just a Q5? Is it the same as the aniwan from HF?
>>105999307based detective anon
>>105999268Reminder Wan T2I mogs base Flux too
I would use it over Chroma if it wasn't so slopped (Wan also has a smooth skin problem and is pretty bad at anime/2D too)
If Wan ever got the Chroma treatment Flux received (a large fine-tune to unslop the model and add NSFW), we would have a model better than even the API/cloudfags
>>105999268I'm still waiting for Illustrious 3.5 vpred.
>>105999303That's an issue with torch compile supposedly. I don't use it.
>>105999374Wan is slopped but the only one that's really offensive is bottom right. All the other examples look really good actually.
I'm just going to use this thread because the other one seems... kinda unfit.
I'm still continuing my Chroma experiments, the 'photographic medium' prompts that some anon posted here a while ago got me testing some more.
So, I did some plots. 10 sets of prefix prompts, 3 different suffix prompts, same seed, sampler and scheduler.
This might be a bit 'spammy', but I think posting it here might be good for archival's sake.
These are the first prefixes that the kind Anon shared:
- An image taken on a 1990s analog disposable camera of
- An amateur candid photograph taken on an iphone of
- An amateur candid photograph taken on Sony Cybershot from Flickr of
- aesthetic 5, a still from a movie scene of
- profressional 35mm film photography of
- 1960 technicolor film still of
- classic cinema film still of
As for suffix prompts, I used the red dress woman he used, the other girl from my tests with a more elaborate and 'modern setting' prompt, and another generic 1girl:
- an attractive young woman in a red dress sitting in a wooden chair at an outdoors garden
- a young asian woman with pale skin and long brown hair, wearing a black and white maid outfit with a purple bow, and black cat ears. Her clothes are slightly stained. She is sitting on a wooden floor in a dimly lit room with scattered trash and bottles. Her face is slightly flushed, and her eyes look directly at the camera. The camera angle is a medium close-up, taken from above. The image has a pinkish hue. Composition follows the rule of thirds. The photograph has a slightly cluttered background with leading lines from the floor and trash directing attention to the subject. The woman is in her late teens to early twenties.
- a young woman with pale skin, green eyes, and red hair standing on a road. Her right hand is raised, showing a peace sign.
4.0 CFG, DPMPP_2M/Simple on Chroma V46 detail calibrated.
Catbox: https://files.catbox.moe/nfbn6h.png
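The plot setup described above (every prefix combined with every suffix, with seed, sampler and scheduler held fixed) can be sketched as a plain cross product. This is only an illustration of how the prompt grid is built, not the poster's actual workflow. The seed value and the build_grid helper are hypothetical, and only a few of the prefixes and suffixes are included.

```python
from itertools import product

# A few of the prefixes and suffixes quoted in the post.
prefixes = [
    "An image taken on a 1990s analog disposable camera of",
    "An amateur candid photograph taken on an iphone of",
    "1960 technicolor film still of",
]
suffixes = [
    "an attractive young woman in a red dress sitting in a wooden chair at an outdoors garden",
    "a young woman with pale skin, green eyes, and red hair standing on a road. "
    "Her right hand is raised, showing a peace sign.",
]


def build_grid(prefixes, suffixes, seed=0):
    """Return one job per (prefix, suffix) cell, all sharing the same seed."""
    jobs = []
    for (row, prefix), (col, suffix) in product(enumerate(prefixes), enumerate(suffixes)):
        jobs.append({"row": row, "col": col, "seed": seed, "prompt": f"{prefix} {suffix}"})
    return jobs


grid = build_grid(prefixes, suffixes, seed=0)  # seed is a placeholder value
```

Keeping the seed constant across cells means differences between images in a row come from the prefix rather than from the noise.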
>>105999374If the chroma furry was smart, he would have switched over to training Wan the moment it released. Chroma was only on like epoch 10 at the time. I think it wouldn't have taken long at all for the Wan tune to completely surpass where chroma was at.
>>105999460Switching to JPEG, otherwise it's kinda silly. Catboxes contain the full PNG plots.
- Polaroid instant photograph from the 1970s of
- Kodak Brownie box camera photograph from 1950 of
- Hasselblad medium format photograph of
- Leica M6 rangefinder photograph of
- Canon AE-1 35mm photograph of
- Nikon FM2 photograph from the 1980s of
- Pentax K1000 student photography of
- Minolta X-700 photograph of
- Olympus Trip 35 vacation photo of
- Fujifilm X100 street photography of
- Lomo LC-A photograph of
- Diana F+ toy camera photograph of
- Holga 120N photograph of
- Pinhole camera long exposure of
This set turned out quite disappointing, except for the first two, but I think the year does the heavy lifting, of course.
Why we're starting to get titties is anyone's guess.
Catbox: https://files.catbox.moe/cgdbkq.png
>>105999180
>toonbabes, sweetcreamcake, incaselycoXL, visualnovel
if its a real request im not gonna be a fucking dick & ignore them ;p
>>105999374Wan not being nsfw is the only thing stopping it from killing flux and chroma outright lol.
>>105999544
2 more weeks ;3
>>105999521
Last one was Camera Types & Brands, this is Film Stocks & Formats. Nothing too interesting except for the expired/light leak image, perhaps.
- Kodak Portra 400 photograph of
- Fujifilm Velvia 50 slide film of
- Ilford HP5 black and white photograph of
- Kodak Tri-X 400 pushed to 1600 of
- Cinestill 800T night photograph of
- Kodak Gold 200 photograph of
- Fujifilm Superia 400 photograph of
- Kodak Ektachrome E100 photograph of
- Expired film photograph with light leaks of
- Cross-processed slide film photograph of
- 120mm medium format photograph of
- Large format 4x5 photograph of
- 110 cartridge film photograph of
Box: https://files.catbox.moe/n5pq4w.png
>>105999307
>click
>nothing happens
>click
>nothing happens
>---
>suddenly 4 DLs
>max speed 1.4MB
kek fucking china
>>105999574
Era-Specific Styles:
- 1920s silver gelatin print of
- 1930s depression-era documentary photograph of
- 1940s wartime press photograph of
- 1950s Life magazine photograph of
- 1960s fashion photography of
- 1970s National Geographic photograph of
- 1980s glamour shot of
- 1990s grunge photography of
- 2000s digital camera photograph of
- Early 2010s Instagram filter photograph of
Probably the most interesting out of all the sets.
Box: https://files.catbox.moe/2576c8.png
>>105999460Based comparison chad
i'm fairly sure flux was not trained on camera brands at all, e.g. "Canon 1D shot of blabla"
>>105999606maid token is toooooo strong
>>105999606
Film & Cinema Styles:
- 16mm indie film still of
- Super 8 home movie still of
- 70mm IMAX film still of
- Anamorphic widescreen film still of
- French New Wave film still of
- Italian Neorealist film still of
- German Expressionist film still of
- Film Noir cinematography of
- Dogme 95 handheld footage of
- Wes Anderson symmetrical shot of
- Terrence Malick natural light shot of
- Wong Kar-wai neon-lit scene of
- Studio Ghibli inspired live action still of
- A24 atmospheric film still of
Some gems, I think. The intention of the ghibli style was not to get an anime image, the red dress version kinda worked.
Box: https://files.catbox.moe/f91tos.png
>>105999639Cheers.
>>105999660
Yeah. Entirely useless, but at that point I was already genning like a freak... If I happen to revisit this with other sampler/schedulers, I will certainly replace the prompt.
OK since the other thread was an abortion I'll ask again here:
Does anyone know of a 360 rotate lora which works with fun camera wan 14b i2v? I'd like to have rotation and zoom.
Please don't ruin /ldg. /lmg is already ruined. If this place goes there's nothing left.
>>105999677
Documentary & Journalistic - 3 more to go after this and then you'll be free from my spam.
- National Geographic wildlife photograph of
- Magnum Photos street photograph of
- War correspondent photograph of
- Sports Illustrated action shot of
- Time Magazine cover photograph of
- Associated Press news photograph of
- Documentary photography in the style of Dorothea Lange of
- Photojournalistic capture of
- Embedded journalist photograph of
Box: https://files.catbox.moe/bi9823.png
>>105999654Well. Test results seem to kinda confirm that. I do see it in a lot of prompts, though. "Shot on fartenbox poopmaker". The results are incidental, at best.
I'll compare shutter speed/aperture prompts in another plot... It's probably been done before but I like doing plots. Heh.
>>105999725
Vintage & Antique Processes:
- Daguerreotype from 1850 of
- Cyanotype blueprint photograph of
- Sepia-toned albumen print of
- Tintype photograph of
- Wet plate collodion photograph of
- Autochrome color photograph from 1910 of
- Hand-colored photograph from 1890 of
- Glass plate negative photograph of
- Stereoscopic 3D photograph of
Some cool ones again, I think.
Box: https://files.catbox.moe/1auxma.png
>>105999709dw this thread has seen worse
>>105999606it took decades for dress necklines to recover from the great depression. This is proof that the 08 financial crisis was nothing and boomers just wanted to brag about being in an economic crisis.
is there a stopwatch node so I can see how long a gen took?
>>105999880just look in the console
>>105999760please consider trying
- DSC
- DSC JPG
- DSC_0123.JPG
etc
>>105999307Is there a torrent or DDL somewhere else? The chink site is slow as shit.
>>105999913>>105999928when I gaze into the console it stares right back
>>105999760
Specific Photography Genres:
- Fashion editorial photograph by Vogue of
- High-key studio portrait of
- Low-key dramatic portrait of
- Tilt-shift miniature effect photograph of
- Long exposure light trails of
- High-speed photography capture of
- Macro photography extreme close-up of
- Aerial drone photography of
- Underwater photography of
- Infrared photography of
- Multiple exposure photograph of
- HDR photograph of
Box: https://files.catbox.moe/ukg1c2.png
I'm very sure it's possible to enhance the effect of those that show some change by prompting for it more elaborately, Chroma needs its bedtime story after all. But I think it should give an indication of what would work decently well.
>>105999942Good idea, will do.
>>106000015
Second to last one:
TV & Video Aesthetics - some of these need stronger prompting for sure.
- 1980s VHS camcorder footage of
- 1990s Hi8 video camera footage of
- Early 2000s MiniDV footage of
- Security camera CCTV footage of
- Dashcam footage of
- GoPro action camera footage of
- Webcam screenshot from 2003 of
- Public access television still from 1985 of
- MTV music video still from 1999 of
Box: https://files.catbox.moe/dn1rzy.png
>>105999980ok nietzsche
>>106000061late 90's early 2000's mtv probably deserves a lora
>>106000061
Last set: Artistic & Experimental
Bit of a letdown this one.
- Lomography experimental photograph of
- Double exposure artistic photograph of
- Light leak experimental photograph of
- Photogram without a camera of
- Scanner photography of
- Disposable underwater camera photograph of
- Redscale film photograph of
- Solarized photograph of
- Chemigram abstract photograph of
And with that, you're free from my spam. Hope it gave some of you Anons a few ideas on what to prompt for. Looking forward to seeing your gens. Cheers, /g/entlemen.
Box: https://files.catbox.moe/8s67a0.png
>>106000096lot of interesting stuff, thanks for posting all these
>>106000096>pixart sigma is still the only open source model that can do a proper double exposure we need to go back
How much time does torch compile save? If I have to clear vram and reload the models after X amount of gens is it still worth it? Can I even know the exact number of gens before it starts to fuck them up?
wait, why is this tensor art site full of models and loras that are either for sale or locked behind online gens
what is this cancer
what have you faggots done
lightx2v has spoiled my penis, 5 minutes is too slow. I want it to instantly generate a scrollable video collage
>>105998704kys. almost had me fooled.
>>105999654wrong. you need the file extension trick
https://youtu.be/HxKeyQKhTOg
anyone was able to get kontext nudify working, I always get fucked up nipples, loras dont help
>>106000380have you tried turning the nipples off and back on again?
>>106000380use two loras together
nudify + better breasts
Can anyone go into further detail on how the Apply ControlNet node works? Specifically for the end_percent parameter.
In the guide it says:
>Lower end_percent removes the ControlNet at an earlier denoising step. This is especially useful when using a source image in a very different style from your desired output (e.g. live action), as applying a ControlNet the entire way through will prevent your desired style from coming out properly.
Every time I play around with this parameter, even when translating from 3D to 2D, it only makes a very slight difference and it's hard to say which looks better. Should this generally be kept to a low value when using a 3D input image and generating an anime output image?
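A rough way to picture what end_percent does is to list which sampling steps still get ControlNet guidance. The sketch below is a simplification that treats the percentages as fractions of the step count. As far as I know ComfyUI actually maps them onto the sampler's noise schedule rather than onto step indices, so the exact cutoff can shift with the scheduler. The active_steps helper is hypothetical.

```python
def active_steps(total_steps, start_percent=0.0, end_percent=1.0):
    """Return the step indices during which the ControlNet is applied,
    treating start/end percent as fractions of the total step count."""
    active = []
    for step in range(total_steps):
        progress = step / total_steps  # 0.0 at the first step, approaches 1.0
        if start_percent <= progress < end_percent:
            active.append(step)
    return active


# 20 steps with end_percent=0.5: guidance on steps 0..9, free-running on 10..19.
half = active_steps(20, 0.0, 0.5)
```

Composition is mostly settled in the early, high-noise steps, so lowering end_percent from 1.0 to something like 0.8 only frees the last few low-noise steps and may change little. A value low enough to release the model while it is still deciding on style is more likely to make a visible difference.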
>>105998854using the latest lightx2v?
>>105999002No, it definitely can.
>>105999460Good job anon
OG comparison poster here
However, I must point out you shouldn't use the detail calibrated version, from my experience, that version tends to not follow the styles as authentically as the vanilla Chroma, and it produces "slopped" results more often
I am running the grid here with my own settings, I can already spot some differences with the "disposable camera" outputs in particular, I will post it when it's done
>>106000449>It was like she was looking at walking garbage
>>105999967No clue, this is the only site I've seen these on.
>>105999460>>106000619I also noticed you added a bunch of negative prompts related to quality... That tends to steer the model away from the desired styles, don't use that for all of them
>>106000619>>106000733After detail calibrated came out I just stuck to it since it more often than not gave me more interesting results. Good to know that it ruins some styles.
As for the negative prompts, yeah, I kinda just set those and keep re-using them. Since I'm lazy about those this kinda comparison is mostly helpful for me, should have removed those before. Sorry.
Did you submit to the /ldg/ technology genjam yet?
https://forms.gle/ZQMNMTaxGxAZZTAD8
>>106000807How many have submitted already?
>>106000820
3, with some people saying they will submit on the weekend.
Would be better if we had something in the OP like with /dmp/'s album collaborations, but it's ok I guess.
>>106000836you should invite /sdg/ and /de3/ to participate too
>>106000836Heatwave probably isnt helping either
>>106000859How about no
Looks like the rotate lora I'm using with wan can't handle spinning two things at once.
>>106000836>submit on the weekendoh nice i thought we only had 48 hours
>Would be better if we had something in the OPi agree thoughever i think some are put off by the jewgle form
Why didn't you guys tell me there are models that can generate figurine-style images pretty well?
>>106000898>thoughever i think some are put off by the jewgle formKnow any alternatives? I made it so you don't have to login.
>>106000920>I made it so you don't have to logino nice. nvrmind then idk why anon would care. yeah should be put in OP
>>106000915There's an onnx based one which is really fast, under a minute on a 3090. Thing is, you're getting a solid with painted on clothes, there's not much you can do with it other than look at it. At least last I tried. Maybe now you can get a properly rigged model?
>>106000762Here are my results
I used cfg 4.5, 30 steps, the og chroma loader (not comfy core), Euler sampler with beta scheduler, 1024 x 1024 res
I may have to rethink the "movie scene" prompt later, it still seems inconsistent. It's a tricky one to get right, sometimes it turns out ok, others you get slop
The disposable camera one sometimes you get a good "middle ground" output, others you get extreme low quality or a quality too high for what you'd expect
The interesting thing about the "Flickr Sony Cybershot photo" prompt is that it makes women look more like regular people than instagram models/thots, lel
>>106000943By style, I did just mean style. 1girl, but 3D-looking in a way that resembles figs.
>>106000955
>The disposable camera one sometimes you get a good "middle ground" output, others you get extreme low quality or a quality too high for what you'd expect
I may have to combo that with the "aesthetic {num}" prefix later to make it more predictable
>>106000998Ah I see. Sounds like something flux would actually do.
>>106000509
it's a case by case value. if the cnet is too strong, you decrease the settings, if it's too weak, you increase them. it's hard to say when to tweak strength, when to tweak end step and when to tweak both, just try them until you build up your intuition for these settings
my nsfw prompts sometimes reach the level of bad erotica and it makes me cringe so much
Since starting to use teacache and sageattention2, I notice I occasionally have comfy just die on me with a "killed", no other error or message. I'm not short on system RAM or VRAM while the gen is running. What other reason could it be? Please, please, I don't want to run this in strace...
I haven't checked in two years is there finally a fast model that can run on a mid range gpu and generate images in <1s?
>>106001136SDXL with DMD2
or the equivalent to it for SD1.5 (considering you are saying "mid range"), but it will look like shit though
>>106001170Afaik Sana is reportedly fast enough for even CPUs, but I haven't looked into it
>>106001136researchers would rather benchmaxx bigger models nobody can run without quanting so you are SoL like the rest of us
Let me know if this is right, and if it is it should probably be in the guide
I was setting the virtual vram high to prevent my vram from overflowing around the time when it's loading the text encoder / wan, whichever it is that spikes your vram usage. That seemed to just do more of it in ram though which slowed it down. Changing the prompt would add a lot of time to my generations. And when actual generation would start, I would only be using 60%ish of my vram. I lowered virtual vram and both things got faster and it rides at over 90% usage when generating, with some headroom because the usage goes up when it hits the VAE phase.
>>106001213Try to set it to 0
>>106001218I was OOMing on 720p with a 3090. Is that normal?
hidream 1.1 text 2 edit is shit? Any special parameter to tune here to fix it? It seems worse than kontext, it changes the image too much no matter the variables here or the prompt
thanks based comparanon. this was supposed to be a grateful expression
after ages i managed to learn and train my first own model from scratch. and it sucks.
was fun
>>106000807not gonna use jewgle form
>>106001316Ok, so what's stopping you from submitting in this thread?
>>106001315Are you the anon that was posting 512p outputs from a mid-training model in other threads?
e2-f5-tts
https://github.com/SWivid/F5-TTS
have a sample with bane quotes as the source audio + some LLM generated speech from LM Studio for a fast script.
https://voca.ro/12mJzETUsIRp
>>106001593can you please stop posting CSAM?
>>106001679If you believe that is csam, then report it for "This post violates United States law".
>>106001534sounds worse than chatterbox or zonos
>>106000955I switched to euler beta and removed all of my negatives.
I'm both terrified and deeply amused.
Does the lightx2 lora rank matter for T2I with Wan?
>>105999460>>105999460>>105999521>>105999606>>105999677>>105999725>>105999760>>106000015amazing finds anon, its like changing the wording of your prompt and using different loras will give you a different result, wow, great insight, thank you so much
>>106001737use 64. 128 and 256 don't really improve the outputs that much but they're double and quadruple the size respectively
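The "double and quadruple the size" part follows from how LoRA weights are shaped: each adapted linear layer stores two thin matrices, so the parameter count grows linearly with rank. A minimal sketch, assuming a standard LoRA layout (A is rank x d_in, B is d_out x rank). The layer width 5120 is only an illustrative number, not a claim about any specific model.

```python
def lora_params(d_in, d_out, rank):
    """Parameters added by one LoRA-adapted linear layer:
    A has rank * d_in entries, B has d_out * rank entries."""
    return rank * (d_in + d_out)


r64 = lora_params(5120, 5120, 64)
r128 = lora_params(5120, 5120, 128)
r256 = lora_params(5120, 5120, 256)
```

Since every adapted layer scales the same way, the whole file scales with rank too: rank 128 is about twice the size of rank 64 and rank 256 about four times.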
>>106001807
aren't they just patched into the main model instead of being placed on top of it during comfyui inference?
>>106001730Either the detail model is slopping up the movie prompt or the different resolution is making them look bad.
>>106001775
We are trying to find out which Chroma prompts make unique and interesting aesthetic differences, so we can make the most out of the model (some styles may not even need loras).
This is something significantly more interesting and relevant than spamming tranime, "1girl, big boobs" and pedo shit like the other anons, probably you too, are doing.
But I know, a retard like you wouldn't understand.
>>106001876who the fuck is that?
I want to undervolt my GPU right?
How do I find the frequency/voltage I want to start flattening the voltage curve at?
>>106001876cute
Illustrious?
>>106001920thats wai v14, trying the new hassaku 3.0 now to compare
>>106001916if you are asking these types of questions don't undervolt your gpu. It is much better to start with MSI Afterburner and just lower the power limit. Undervolting is kind of a meme anyway, you'll run into a lot of stability issues and even if you do find something stable it is only marginally better than adjusting the power limit.
>>106001947https://civitai.com/models/827184?modelVersionId=1761560
this one, my go to anime model then I pick diff checkpoints or use loras for diff styles (or just prompt with tags)
>>106001807With any kind of floating point weights (including fp8) they are patched in and don't incur additional cost, but this isn't true with GGUF which most people are probably using. For GGUF, the lora weights are kept separate, so it will use more compute and memory the larger the lora is.
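The difference between "patched in" and "kept separate" can be shown with tiny matrices. This is a toy sketch in plain Python, not how any loader actually implements it. Merging computes W' = W + scale * B@A once, while the separate path adds scale * B(Ax) on every forward pass. All values and helper names here are made up for illustration.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]


def add_scaled(a, b, scale):
    """Elementwise a + scale * b."""
    return [[x + scale * y for x, y in zip(row_a, row_b)] for row_a, row_b in zip(a, b)]


W = [[1, 2], [3, 4]]   # base weight (d_out x d_in)
A = [[1, 0]]           # LoRA down projection (rank x d_in), rank 1
B = [[2], [1]]         # LoRA up projection (d_out x rank)
x = [[1], [1]]         # input as a column vector
scale = 0.5

# Patched: fold the LoRA into the weight once, then a single matmul per call.
W_merged = add_scaled(W, matmul(B, A), scale)
y_merged = matmul(W_merged, x)

# Separate: base matmul plus two extra small matmuls on every call.
y_separate = add_scaled(matmul(W, x), matmul(B, matmul(A, x)), scale)
```

Both paths give the same output. The separate path just keeps A and B in memory and repeats the extra matmuls, which is why a larger rank costs more when the base weights are quantized and cannot be patched directly.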
In wan, what should you put in the negative prompt to prevent characters from turning to face the camera?
>>106001916just adjust the power limit if you want to in msi afterburner, it's safe
like on my 4080 I set it to 70%, the framerate in games is like 95% the same and it draws less power (so fans can basically run at like 30%, or silent).
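For anyone not using MSI Afterburner, the same percentage can be turned into a wattage for nvidia-smi's -pl (power limit) option. A small sketch: the 320 W default is an assumption for a reference RTX 4080, so check your own card's default first, and note that setting the limit needs admin/root rights. The helper names are hypothetical, and the command is only built here, not executed.

```python
def power_limit_watts(default_watts, percent):
    """Convert a percentage power limit into whole watts."""
    return round(default_watts * percent / 100)


def nvidia_smi_command(watts, gpu_index=0):
    """Argument list for setting the power limit on one GPU via nvidia-smi."""
    return ["nvidia-smi", "-i", str(gpu_index), "-pl", str(watts)]


# Assumed 320 W default limit, capped at 70%.
watts = power_limit_watts(320, 70)
cmd = nvidia_smi_command(watts)
```

The list can be passed to subprocess.run(cmd, check=True) from an elevated shell if you actually want to apply it.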
>>106001962a test gen with hassaku 3.0, colors are nice
I find that if you add "jpeg artifacts" to the negative prompts it tends to make slop more often.
With no negatives, you sometimes get outputs that are well aligned to the prompt at the cost of receiving lowres artifacted garbage like picrel.
I wish there was a way to simultaneously not get slop and not get artifacted garbage consistently.
>>106002011and wai 14 again:
>>106002227>>106002011should look more mature
that's why i stopped using wai
>>106002264thats just with a basic ayanami rei, swimsuit, pool prompt. you can do whatever you want with tags.
speaking of which this extension is invaluable for tags:
https://github.com/DominikDoom/a1111-sd-webui-tagcomplete
if you type "fate" you get all the booru tags (exact tags) related to fate, characters etc, so you can prompt easier for booru based models.
I've noticed in some porn pics the geniral area and asshole are a darker skin color than the rest of the body and I like that
what prompt should I use to get that?
>>106002314The face and hands look great. Do you use ADetailer for that?
>>106002356don't bother, he's a gatekeeping fag
>>106001346we can just post them ITT can't we? why not just put a post at the top of a new thread saying "reply to this with your submissions"
>>106001443that's what I'm wondering. no need to make this shit complicated
>>106002374
>that's what I'm wondering. no need to make this shit complicated
No the google form makes more sense because all the submissions are arranged nicely in an excel sheet. The problem with posting submissions in this thread is that submissions will be scattered across several different threads at different points, and it's way too much trouble going through the archives to search for each one.
>>106002394oh well then, not doing your gay little contest if you insist on using jewgle
>>106002407Trust me, nobody cares about what a worthless moron like you does. I'm not the one organizing this either.
>>106002356yes it's necessary.
https://files.catbox.moe/qth3jn.png
>>106002407
>anon goes to the trouble of organizing a jam, a fun little community activity
>gets nothing in return for doing this
>autismo acts like his participation is some kind of privilege
>insults anon for not accepting submissions the way he likes
>>106002284and one more with wai14:
Do you guys know any code to bypass pixverse filter. Someone from /b/ said he is using it .thread got deleted before i got reply.
>>106002446lmao you talk like a bioware fan
>>106002502>lmao you talk like a bioware fan
>>106002475No. I'll stay and wait.I believe there are helpful people here among shitheads like you .
>>106002519Nobody is going to help a braindamaged /b/tard dumbfuck like you. Piss off idiot.
This is the thread for local generation and you come here asking about cloud garbage, and you're actually stupid enough to think anyone here will help you. LOL.
>>106000193I've only used it when training, since the torch.compile node is flaky beta in Comfy and I don't use any third party nodes. For training it cuts ~15-20% of the training time.
>>106002519unfortunately like that anon said, this is for local gens only. no one here uses cloud services.
>>106000224That has always been the case with tensor art, which is why nobody gives a shit about it. As of now it is Civitai or bust, maybe something else will come along.
even if chroma manages to make all 5 fingers, the fingers are all the same size which makes it look creepy.
>>106002533Its ok . I'll wait anyway. You keep being asshole.
>>106002550Thanks . you are helpful . I'll look for other threads.
>>105999460this is exactly the right place, thanks a ton! and you are right, there are some real gems in there. the docu style ginger for example.
>>106001730Negatives REALLY do help with Chroma, I noticed this when I tried it on Forge, first I was like 'wow, the Forge implementation of Chroma really sucks', then I noticed I had forgotten to copy over the negative prompts, and voila, looked just as good as on Comfy.
Makes you wonder though, lodestone talked about locking the Chroma CFG at ~4 for a special 'fast' version, won't that prevent negative prompting ?
>>106002472>>106002573try asking >
>>29183721 they sometimes post nsfw vids from ai cloud services
>>106002740>>106002472>>106002573whoops, I meant the /vdg/ - AI video general in /gif/
>>106002766OK . I'll keep an eye on it .
>>106002314Why does mine come out fried?
>>106002916ouch. something's wrong there anon. incompatible scheduler?
>>106002916not sure, i used pornmasterpro_noobv4vae model, the settings are in meta data. i'm using reforge gradio ui.
lora: RealisticSkin_PornMaster-Pro_v1
setting: Steps: 40, Sampler: DPM++ 2M SDE, Schedule type: SGM Uniform, CFG scale: 5, Hires CFG Scale: 5, Hires upscale: 1.5, Hires steps: 15, Hires upscaler: 4x_NMKD-Siax_200k,
adetailer settings: ADetailer model: Anzhc20seg20v2%20y8n.pt, ADetailer negative prompt: "bad eyes, eye defects, eye glitch, deformed iris, ugly face, bad face, masculine, male, boy, ", ADetailer confidence: 0.6, ADetailer method to decide top k masks: Confidence, ADetailer mask only top k: 6, ADetailer dilate erode: 4, ADetailer mask blur: 4, ADetailer denoising strength: 0.4, ADetailer inpaint only masked: True, ADetailer inpaint padding: 32, ADetailer use inpaint width height: True, ADetailer inpaint width: 1024, ADetailer inpaint height: 1024, ADetailer model 2nd: hand_yolov8n.pt, ADetailer negative prompt 2nd: "band hands, ugly hands, extra fingers, bad fingers, bad nails, hand deformities, bad finger nails, extra digits, ", ADetailer confidence 2nd: 0.3, ADetailer method to decide top k masks 2nd: Confidence, ADetailer mask only top k 2nd: 6, ADetailer mask min ratio 2nd: 1.0, ADetailer dilate erode 2nd: 4, ADetailer mask merge invert 2nd: Merge, ADetailer mask blur 2nd: 4, ADetailer denoising strength 2nd: 0.4, ADetailer inpaint only masked 2nd: True, ADetailer inpaint padding 2nd: 32, ADetailer use inpaint width height 2nd: True, ADetailer inpaint width 2nd: 1024, ADetailer inpaint height 2nd: 1024,
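Settings lines like the ones above are comma-separated "Key: value" pairs, with quoted values that may contain commas themselves. If you want to diff two gens' settings, a small parser helps. This is a sketch with a hand-written regex, not the parser any particular UI uses, and parse_settings is a hypothetical helper.

```python
import re

# Key, colon, then either a double-quoted value (commas allowed inside)
# or a bare value running up to the next comma.
_PARAM = re.compile(r'\s*([\w ]+):\s*("(?:\\.|[^\\"])+"|[^,]*)(?:,|$)')


def parse_settings(line):
    """Parse 'Key: value, Key: "quoted, value", ...' into a dict of strings."""
    params = {}
    for key, value in _PARAM.findall(line):
        value = value.strip()
        if len(value) >= 2 and value[0] == '"' and value[-1] == '"':
            value = value[1:-1]  # drop the surrounding quotes
        params[key.strip()] = value
    return params


sample = ('Steps: 40, Sampler: DPM++ 2M SDE, Schedule type: SGM Uniform, CFG scale: 5, '
          'ADetailer negative prompt: "bad eyes, ugly face, ", ADetailer confidence: 0.6,')
settings = parse_settings(sample)
```

Comparing two such dicts key by key makes it easy to spot a mismatch such as a different schedule type.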
>Neta Lumina UPDATE
- Fine-tuned, high-quality anime-style image-generation model (Diffusion Transformer) built on Lumina-Image-2.0
- Excels at illustration, posters, storyboards, character design, etc.
- Leverages Gemma text encoder for strong prompt understanding and multilingual support (EN/JP/ZH)
• Key Features
- Optimized for diverse styles: furry, Guofeng, pets, and more
- Understands both natural language and Danbooru-style tags
- Supports complex, multilingual prompts (best in ZH/EN/JP)
• Model Variants
- Neta-lumina-v1.0 (official release, best overall)
- Neta-lumina-beta-0624 (α-test; 13 M images, 46k A100 hrs)
- Private alpha versions (apply on HF page)
• System Requirements & Runtime
- ComfyUI only (latest version)
- ≥ 8 GB VRAM
• Installation Options
Component release (three files)
• UNet: neta-lumina-v1.0.safetensors -> ComfyUI/models/unet/
• Text Encoder: gemma_2_2b_fp16.safetensors -> ComfyUI/models/text_encoders/
• VAE (16-ch FLUX): ae.safetensors -> ComfyUI/models/vae/
All-in-one checkpoint
• neta-lumina-v1.0-all-in-one.safetensors (md5: dca5 …)
• Basic Workflow Nodes in ComfyUI
UNETLoader, VAELoader, CLIPLoader, Text Encoder, Sampler
• Recommended Generation Settings
- Sampler: res_multistep | Scheduler: linear_quadratic
- Steps: ~30 | CFG: 4-5.5
- Resolutions: 1024×1024, 768×1532, 968×1322, or ≥ 1024
• Prompt Resources
- Prompt guide: https://civitai.com/articles/16274/neta-lumina-drawing-model-prompt-guide
• Roadmap Highlights
- Continual base-model training (reasoning, anatomy, background richness)
- Enhanced tagging tools & LoRA tutorials
- Advanced control/style-consistency features (e.g., Omni Control)
• Extra Resources
- TeaCache repo: https://github.com/spawner1145/CUI-Lumina2-TeaCache
- Sampler & TeaCache guide (Chinese): linked QQ doc
URL: https://civitai.com/models/1612109?modelVersionId=2036419
>Wan 2.1 Lightspeed
Generates Wan 2.1 videos in a fraction of time.
Recommended settings:
Sampler: LCM
Steps: 4-8
CFG: 1
Sigma-Shift 5
Original Model from Lightx2v converted to FP8 quantisation.
Recommended specs:
8 GB VRAM, 32 GB RAM
Sample times: <2 minutes for 81 frames, 4 steps on RTX 4070 Ti Super.
Compatible with 14B LoRAs.
If needed:
Clip: https://huggingface.co/lightx2v/Wan2.1-I2V-14B-480P-StepDistill-CfgDistill-Lightx2v/blob/main/fp8/clip-fp8.pth
Text Encoder: https://huggingface.co/lightx2v/Wan2.1-I2V-14B-480P-StepDistill-CfgDistill-Lightx2v/blob/main/fp8/models_t5_umt5-xxl-enc-fp8.pth
URL: https://civitai.com/models/1802623/wan-21-lightspeed
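A minimal sketch of the recommended Lightspeed settings as a Python dict, plus a helper that flags values that drift from them. The numbers come from the post above; the key names and the helper itself are hypothetical and do not correspond to any ComfyUI or Wan2GP API.

```python
# Values copied from the recommended settings in the post; key names are made up.
LIGHTSPEED_RECOMMENDED = {
    "sampler": "lcm",
    "steps": (4, 8),      # inclusive range
    "cfg": 1.0,
    "sigma_shift": 5.0,
}


def check_settings(sampler, steps, cfg, sigma_shift):
    """Return a list of human-readable deviations from the recommended values."""
    rec = LIGHTSPEED_RECOMMENDED
    problems = []
    if sampler.lower() != rec["sampler"]:
        problems.append(f"sampler is {sampler}, recommended is LCM")
    low, high = rec["steps"]
    if not low <= steps <= high:
        problems.append(f"steps is {steps}, recommended is {low}-{high}")
    if cfg != rec["cfg"]:
        problems.append(f"cfg is {cfg}, recommended is {rec['cfg']}")
    if sigma_shift != rec["sigma_shift"]:
        problems.append(f"sigma shift is {sigma_shift}, recommended is {rec['sigma_shift']}")
    return problems


if __name__ == "__main__":
    # Typical non-distilled settings, to show what gets flagged.
    for line in check_settings("euler", 30, 5.0, 5.0):
        print(line)
```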
>>106002975 40 steps, why anon. for a single subject > 25 = waste of electricity
>>106003011the insane high quality 34x78 image rounds it up nicely
>Nova Orange XL V11 NEW UPDATE
Nova Orange XL is an anime checkpoint with detailed skin and depth
Recommended Settings
Sampler: Euler a
Steps: 20~30
CFG Scale: 3-5
Clip Skip: 1-2
Denoising Strength: 0.4 - 0.6
Prompt: masterpiece, best quality, amazing quality, very aesthetic, high resolution, ultra-detailed, absurdres, newest, scenery, {Prompt}, BREAK, depth of field, volumetric lighting
Negative Prompts: modern, recent, old, oldest, cartoon, graphic, text, painting, crayon, graphite, abstract, glitch, deformed, mutated, ugly, disfigured, long body, lowres, bad anatomy, bad hands, missing fingers, extra digits, fewer digits, cropped, very displeasing, (worst quality, bad quality:1.2), bad anatomy, sketch, jpeg artifacts, signature, watermark, username, simple background, conjoined, bad ai-generated
>>106003029should have created q8 ggufs instead of trash fp8
>>106003043URL:https://civitai.com/models/967405/nova-orange-xl
NO MERGE - Universal CLIP (FLUX, SDXL, PONY & illustrious)
URL: https://civitai.com/models/1784001/no-merge-universal-clip-flux-sdxl-pony-and-illustrious
unironically buy an fucking ad
This is the CivitAI news for today that I found relevant.
Happy genning ^^
>>106003043Do people really like the defaultslop faces?
>>106002975
>sgm uniform
That's better, thank you. The workflow had it set to normal. Did you use some kind of forge to comfy workflow converter?
>>106003112Understood, here is an alternative model inspired by oleo!
>Chosen-mix_XL
The new merge uses the latest versions of noob and wai nsfw, plus a few other models. Like 1.0, the output is very stable: characters and artist strings are recognized directly, hands are fairly stable, and it works well with my various LoRAs. The main upgrade is that V1 was weak at thick-painting and Korean styles, and V3 improves both. You may ask why there is no V2 and it goes straight to V3: I was lazy and never released V2. Now that there is V3, there is no need for V2.
URL:https://civitai.com/models/1064295/chosen-mixxl
>>106003107fuck off rocketcunt.
>>106003036 35-40 steps is necessary for realism, especially hyper-real or anime-to-real models. I usually set cfg 4-6 so it doesn't look cooked. 2d anime/cartoon gens are alright at 20-25 steps.
>>106003145plz have non slopface chicken grease gens to show off a model not this trash. anime models only matter if the artist tags are accurate
>>106003134no, didn't use any converter. Here's a link to the huggingface page with more adetailer models.
also there are other checkpoints to experiment with, like waiXREAL_V10, uncaniSFWNSFW_v10 and creativitij_v10.
>>106003112some honestly do
>>106003168Genuine question, why do you guys like this shitty style? It has the most soulless and generic AI slop look, looking like a straight output from a typical SD1.5 merge (back when people merged NovelAI with realism models).
All these outputs would look so much better in either vanilla 2D style or actual photorealism
>>106003319
>why do you guys like this shitty style?
I dunno, I just do. I'd ask the guy posting censored porn what the fuck is the appeal of generic ass anime girl wiggling around.
>>106003319Instead of complaining, post something you think is better which could inspire, you won't, because you're a cunt who doesn't contribute anything
>>106003422
>you have to gen or your criticism is invalid.
This makes genners more insufferable than actual artists.
>>106003422Sure, suggest me one of the slops you posted and I'll do it. Preferably with the prompt
>>106003029omg :D
>>106001679canonically? lusamine is an adult
>>106001265cute!
>>106000151didnt expect an 'apology' but yeeesh
>>106003319i do gen 2d and share my stuff on /trash/, /e/ and /h/ but not often because it's boring, i lost interest in anime and it doesn't please me. I'm a maladaptive day dreamer, i prefer my waifu looking 3d, semi real or anime real over 2d. While 2D can be very versatile compared to 3D/realism, i frankly find 2d boring especially when it comes to sfw gens.
new sota tts, with voice cloning
https://www.boson.ai/technologies/voice
https://github.com/boson-ai/higgs-audio
>>106003435Your 'criticism' is worthless, it boils down to 'I don't think this is good' which is purely subjective, there's nothing remotely constructive, it's just 'change the style to something I like'
>>106003453Just do it, what are you waiting for ? You don't need permission.
And stop samefagging, it's pathetic
>>106003029workflow please? i have not genned video before
>>106003487Nigga actually defending slop.
>>106003487
>Just do it, what are you waiting for ? You don't need permission.
I want to work on -your- gens specifically, where would the fun be at? :^)
Go on, identify yourself, point me one of your gens
>>106003487just ignore him
he does it every thread haha
even the laughing-poster is better
bc he atleast has comedic timing (sometimes)
[although still mean, & don't support such behavior, poor ani...]
>>106003496we all know its still u anonkun
i suggest you learn how to larp better
>>106002310this is cool. i'm guessing "blacklight", but what makes it so vivid?
>>106003499I haven't made a single gen in this thread, the last gen I posted was probably this which was many threads ago:
Which was a quick Chroma lora test
>>106003560>eyeliner wingsi fall for it everytime
now that all the dust
has finally settled down
should i download chroma?
>>106003560I was specifically talking about the CGI-like (shitmix-like) gens, which was posted by the guys you were defending
my professional medical advice
you know who you are ;3
smell ya later <3
kontext is a good meme generator
the cartoon frog is sitting on a chair at the beach, wearing a blue tshirt and red shorts. keep his expression the same.
>>106003478im unable to listen right now, is it good?
>be bong
>just deleted civit
>use VP-
I got everything I need
>>106003601the cartoon frog is at a 1950s style diner eating a cheeseburger and french fries. A menu nearby says "fren menu"
pretty good since the source is just a pepe face with no body.
>>106003550
>Iridescent (acid colors)
is in the prompt
>>106003578It's easily the best local model for photo realistic images, and it hasn't even started the last two epochs where they bump the resolution from 512 to 1024.
For non-realistic it's harder to say, Flux has a shit ton of loras for every artstyle you can imagine, and the Flux 'plastic skin' doesn't really matter here. I've only done a few non-realistic tests, and on pretty poor datasets to see how well Chroma does, for example here is one for Pete Hawley (mid-century illustrator), it captures the style well, although this test was overtrained:
When we see Chroma artstyle loras come out after the full release, we will see what it can do.
>>106003633im sorry about how things are going over there across the pond; the collapse of civilization begins with the individual... best of luck <3
>>106003672With lots of testosterone, oh, and Wan 2.1 with breast expansion lora
>>106003672looks like: https://tensor.art/models/843552021249407909
+
>>106003431
>>106003644
>non-realistic
im happy w\ my software i currently have
i dont mind pony being 'depreciated' <3
i'll look into it over the weekend...
>>106000895What an "ass backwards", "two-faced" post. Eeee heuhheuh!
>>106001689We have a CASM Expert here, thank god, where did you learn about it? Self-study, right?
>>106003710anon its just common sense
knowing the meaning of the words helps a lot ;3
>>106000895it will happen with a single subject also,
if you turn your weights\cfg too high
try: 0.8(9) weight, and cfg below 8(7)
just edit it and salvage what frames you can and blip it into your own loop
expecting wan to be perfect 100% of the time is just not realistic
>>106003029I dunno, if it's like flux schnell I'll pass, it's noticeably worse.
>>106003710
>We have a CASM Expert here, thank god, where did you learn about it? Self-study, right?
I wonder what mental illness caused you to reply like this? Tell us about yourself anon.
>>106003745he wont.
i tried in my own bake for 70+ times
he can only parrot the same thing repeatedly
retard here, what is sage attention and should i put it in my command line args
>nooo stop posting what i DONT like!!
>>>>/leddit/
PROMPT:
"he closes his eyes, and smiles sweetly, he is holding a painting of a tiktok dancing cartoon girl, the horses in the background turn into dancing cats"
I'm starting to get the impression this is not a hobby pursued by mentally healthy and stable people.
>>106003822You would be correct. I, however, am mentally stable which is why you never see me get into these petty squabbles.
>>106003802>>106003816damn so you're also the attention-starved tranny that can't stop shitting /a/ with its spam
you really oughta slit your worthless subhuman throat
>>106003786it helps videos gen faster. If you have it installed (you don't) the comfy command is --use-sage-attention
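A rough sketch of how to check whether sage attention is importable before adding the flag mentioned above. It assumes the package is importable under the name `sageattention` and that ComfyUI is started through `main.py`; both are assumptions, so adjust them to your setup. The helper names are made up.

```python
import importlib.util


def sage_attention_installed():
    """True if a top-level module named 'sageattention' can be found (assumed name)."""
    return importlib.util.find_spec("sageattention") is not None


def comfy_launch_args(has_sage, base_args=("main.py",)):
    """Build the argument list, adding the flag only when sage attention is present."""
    args = list(base_args)
    if has_sage:
        args.append("--use-sage-attention")
    return args


if __name__ == "__main__":
    # Print the command instead of running it, so nothing is launched by accident.
    print("python", " ".join(comfy_launch_args(sage_attention_installed())))
```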
so we have until the weekend to submit to the Gen Jam?
All Trannies Need Is Attention
i only frequent \wg\ \vr\ \g\ and \vp\napt
not everyone posting things you dont like is me
i am not "removing my tripcode and messing with you"
you are insane
pls act right
<3
>>106003835i sadly, will not be participating <3
>>106003849nobody cares you boring retard
>>106003854
>report bombing again
its all so tiresome
>>106003827if the fake bakes he linked really werent him then i can see why he retaliated. this general is such a drama-shithole now
>>106003822well its compounded by the fact that 4chan is occupied by literal retards myself included. there ARE """normal""" people who happen to be into image generation. also important to take into account how loud the retards are which drowns out and overshadows the perfectly normal posters
>>106003924the drunk guy attacking people every thread is lowkey problematic nigga, im just trying to find lora
>>106003828>>106003839AI threads on /g/ are created by, monitored and spammed 24/7 by exposed AGP janitors, they post porn on a blue board, avatarfag and spam all the time but ban anyone who calls it out within seconds.
4chan is basically dead for anyone who isn't mentally ill.
>>106003941
>its another whos the janny episode
>>106003924There is no such thing as 'normal' on 4chan, and that's why I come here.
>>106003918I have little reason to ponder such trivialities. I come here to see cool images and learn new ways to gen which has yet to be impeded desu. I don't share your view but that's not important. What is important is not getting filtered by the noise methinks.
>>106003933As is the way it goes with 4chan. I think anon often forgets that an entire site exists outside this general and it is often in a far worse shape than what can be viewed here, in my opinion.
>>106003941
>not being mentally ill
>>106003956cool style
ReBoot art deco batman the animated series kmfdm 90s cgi?
>backed up nearly 600 gigs of wan loras from civitai
>>106003941
>having a meltdown over a janitor conspiracy theory 'they're out to get me!'
>pretends to not be mentally ill
Anon...
>>106003978Upload on huggingface and then link on https://civitaiarchive.com/
>>106003918im with postcard
>>106003952 this is true however actively destroying things weakens the overall discussion..
>>106003941BEAHAGAhHahahah
>>106003816The only thing that could make him smile like that is TES VI surpassing TES V in success/sales. But that is impossible because it would require immense creative freedom (by today's standards), to the point of giving the players the choice to side with factions that say things like: "Skyrim belongs to the Nords!!"
anyone else here never gen nsfw?
it doesn't turn me on because my brain "knows" it's not real, i'm more attracted to irl situations and what could actually happen stuff
>>106003999will you PLEASE refrain from that??
why do you do it? ;c
>>106004000kek\
>>106003978doin the lords work
>>106003978There's 600gb worth of Wan loras, I thought it would be way less. They're like 300-400mb each ?
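Quick back-of-envelope check on those numbers, as a Python sketch. It assumes decimal units (1 GB = 1000 MB) and that every file falls in the 300-400 MB range quoted above; the function name is made up.

```python
def lora_count_range(total_gb, min_mb, max_mb):
    """Return (fewest, most) files that fit in total_gb at the given per-file sizes."""
    total_mb = total_gb * 1000  # decimal units: 1 GB = 1000 MB
    return total_mb // max_mb, total_mb // min_mb


low, high = lora_count_range(600, 300, 400)
print(low, high)  # 1500 2000
```

So 600 GB works out to roughly 1500-2000 files at that size.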
>>106003985>stop attacking AGPs and 4chan internet janitors y-you conspiracy theorist!!!uh oh, not like this, sis
>>106004003I don't gen NSFW in the same way I don't AI roleplay NSFW. I'm way too vanilla, so it's boring after 2 minutes.
>>106004003its not that i have 'nsfw' or whatever
& i think the 'not real' thing comes from
you, yourself, being the creator-
the painter sees the flaws :3
most of my gens i cant post bc they are celebs or people that i dont want to get legal trouble for posting like
>>106003598the laws change quickly anon
>>106004022What is this about, can you translate it for non-trannies ?
>>106003989its not about sides or which general or whatever
& i made the bake to ask a specific question which was never answered
>>106003988The majority are already on huggingface (which is how I got it in the first place). See https://huggingface.co/ApacheOne/WAN_loRAs
>>106004018Yeah, it includes older versions as sometimes they work better (from feedback from users).
>>106004039
>>106004010
>why do you do it
i think its time for a vacation
im starting to recognize specific posters\habits
>>106004003
>because my brain "knows" it's not real
About as real as any porn, it's just pixels on the screen
>i'm more attracted to irl
Yeah, that would be real sex, that's what 99% prefer if given the choice
>>106003941Don't go on twitter/x. Holy shit that place makes here look normal.
>>106004003yeah i cant even jerk off to my own gens so theres little point in me doing NSFW
kills it for me to basically know what the image will look like
>>106004081i definitely don't judge people for nsfw, i was just wondering if anyone else had this failure to get aroused by ai generated stuff
and yes regarding pixels and porn, i obviously prefer real sex, or even just my imagination, it's way more malleable and vivid than some porn video with random people having sex whom i don't know
>>106004096And then there's BlueSky which makes Twitter look normal.
>>106004114>or even just my imaginationWhat are you, some kind of crazy person ?
>>106004096
>twitter
its all fake\gay anon
>less than 60% are actual people
>less than 2% of those active users account for all traffic
its worse than an echo-chamber
the irony is elon championed 'muh free speech'
but if you made an account today no one would see your posts due to new accounts being neutered in the live-feed
>>106004100if you're gonna kill me
please do it on monday before my work-week starts ;3
>>106004131have you never had a wank just imagining sex with someone? can be way better than watching porn
>>106004131some of us have v good imaginations ;3
i hate to overshare but my memory-loops are quite vivid if you know what i mean
having gorgeous ex-girlfriends helps too i guess....
>>106004078THATS A BIG BIRB!
>for you ;3
>>106003822i just creamed my pants, goddamn
file
md5: 97d4653be32de4816d9589a1e4e8ba79
>>106004140
>if you know what i mean
for those of us on the 1 scale it's basically like ultra hd 3D interactive video in your mind that you can control, so much better than watching a 2d laptop screen recording
Is the cpu option on the clip loader something like offload or is it just a copium choice?
>>106004170It's offloading, you're opting to run it on CPU as opposed to on your GPU. Usually, you should not do that unless you are really pushing your memory limits but I guess if you're thick in the weeds, you are doing that quite often.
>>106004165im a musician so, my problem is the audio
but overall i think im quite eccentric\crankish
i think your endocrine system changes+pheromones are warped by fapping
every time i hit 60+ day no fap i seem to have a girl on my arm...
could be coincidence though
>>106004151Ok... but what did you think about the picture ?
>>106004185plot-twist: she had a fat benis
>>106004183didnt ask dont care
>>106004183
>60+ day no fap
could never imagine lasting this long. after one week i'm usually ready to sleep with the nearest 4/10 i find at a random bar
>>106003941unironically give me a source tho
>>106004222pls do NOT do that
>date: october 2021
>>106004053At least 24h vacation from breathing will make your parents much happier at least
3good
md5: 62ff9ff8b50535ef34a1e0fd58256ac4
>>106004209think of it like a fun game
break your own records until you become the wizard
>MUH RESOLUTIONmy pc gpu was a fucking toaster at the time please no bully ;c
>>106004218Lurk for any amount of time
https://desuarchive.org/g/thread/104414999/#q104418525
https://desuarchive.org/g/thread/104414999/#q104418574
>>106004022
>>106003941serious question why are you still here if you may or may not have identified an unfixable problem
just to warn others? or is there another reason
>>106004301
>another reason
he is the baker
this is his splinter general
>>106004309why would the baker call himself these things
>>106003941 and attempt to drive posters away?
>>106004301I'm not much at all anymore, for those very reasons, I enter once in a while to see the news, but even that is worthless now since localllama posts everything before /ldg/ and /lmg/ nowadays anyway
>>106004309cool headcanon
>if i samefag enough it comes true!
tds sdgds wow
deranged
>>106004328
>localllama
desu doesnt look like they post anything about imggen
>spiteniggers spiting lmg and ldg
a sad day indeed
Anons, quick question:
I want to start the first /local anime diffusion general/ thread here in /g/
I will promote it in gacha game communities like Blue Archive, Arknights, etc. and on /a/
can i just do it, or do i have to ask a janitor first because i could get banned?
file
md5: 0e0c630a6967e3f65a10a50dade1bc17
how do you avoid these artifacts in img2img in comfyui?
in a1111, I didn't really have to think about it too much