Discussion of Free and Open Source Text-to-Image/Video Models
Prev: >>105722843
https://rentry.org/ldg-lazy-getting-started-guide
>UI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Models, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info
>Cook
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe
>WanX (video)
Guide: https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1
>Chroma
Training: https://rentry.org/mvu52t46
>Illustrious
1girl and beyond: https://rentry.org/comfyui_guide_1girl
Tag explorer: https://tagexplorer.github.io/
>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate
>Neighbors
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg >>>/b/degen >>>/b/celeb+ai >>>/gif/vdg >>>/d/ddg >>>/e/edg >>>/h/hdg >>>/trash/slop >>>/vt/vtai >>>/u/udg
>Local Text
>>>/g/lmg
>Maintain Thread Quality
https://rentry.org/debo
Blessed thread of frenship.
change the text at the top to "LDG TOWN". Replace the house in the center of the image with some grass.
What's your favorite realistic model?
And for the lowlifes, what's your favorite model that generates realistic nudity?
>>105727917:c
>>105727902 (OP)
>>>/vp/napt is your neighbor, consider (again) updating your neighbors list
>>105727939
chroma is trying so hard to be my favorite but i just hope lode's 1024 training for the last two epochs fixes faces and hands
or if he doesn't i hope someone else does
>>105727939
BigLust1.6 was good for NSFW realism, but I'm kinda over the SDXL look. Chroma has actually been looking great, can't wait to see a further finetune/loras for it when it is finished.
>>105727946>>105727901>only posts cute ladies w\ rad boobs>doesn't have sexprojection anon ;3
>>105727957>>105727939>realistic nudity>horrible floppy outtie vaginas that look like roastbeef im good
>>105727965im not the one making assumptions about the sex-lifes americans in california kek
i DO hope you learn to read soon though :c
>>105727975>>horrible floppy outtie vaginas that look like roastbeefhad a gf for a few years and i broke up with her because she didn't have an innie but i couldn't tell her the real reason
>>105727950
fp16 t5 and 30+ steps don't have this problem
>>105728007nigga look at these hands at 50 steps and fp16 t5 and tell me they are not mutated hellspawn
>>105728007
mfer text encoder makes no difference to hands. it's a model issue.
but even kontext keeps giving me hands with way too many fingers.
ai simply can't hands.
>>105727906How'd you know
>>105728001shoulda fattened her up ;3
>>105727975my point was, training lora models on yucky subjects is precisely that, YUCKY.
>>105728025>hellspawnisn't that what they all look like anyway? ;3
>>105728044>shoulda fattened her up ;3man speaks the truth
give the cartoon frog a wizard hat and beard.
>>105728034he is stalking the catalog spamming F5 duh
>>105727921>grassi can see the stitching :c
>>1057279752D is pretty much solved so idc about that
>>105728069Anyways, I created some custom voice files from a female friend. I can now jack off in peace
>> 2025>> highly sophisticated technology>> uses it to jack off
>>105727950>>105727957you guys think
>>105728025 looks great even ignoring the hands?
real talk how do we make chroma faster? that's the Achilles' heel.
give the cartoon frog a black sunglasses and have them point at the camera with both hands.
put the cartoon frog in a movie theatre where they are eating popcorn.
now this is a good fren
Why the fuck are all models so allergic to drifting. Not even powerslide prompt does anything.
>>105728109kinda neat
know of anything to isolate guitar\vocals?
i have some recordings of a female friend i need to remaster\remix
but the isolation is quite poor
sdg recommended some programs but they cost
>>105728153it gets confused by countersteering
had that problem even on dalle for ages
>>105728167there is a place in pokemon xy that was like this :c
i miss when they actually tried anon
things were so much better then
still no stable nag or mag workflows for chroma? I'm getting disastrous results with nag slop
yfw GBP is at an all time high and you can buy $10 of runpod credits for £7.50
thanks mr. trump
can you use TeaCache with kontext?
I tried 30 steps and quality is shit
>>105728153SEND YOUR CUTEST DELIVARY B0Y!!! ;3
>>105728206>>105728167unironically better than going to yucky LA and seeing "art installations" like this in real-life
I said front brake discs glow, you stupid fucking machine. Why does it feel like the model is becoming more retarded?
what prompt do i use to get one human girl and one catgirl? it adds ears to both but i only want one to have it
>>105728178i don't care about speed or vram usage just give me quality
fuck vramlets from pakistan who can't afford 24g
>>105728245>becoming more retardedthis is intentional
the entire earths populace will continue this trend
until nothing is left, until it is all destroyed
until only the mongoloid slave-caste is all that remains
surely you can see the spiral downward
>>105728259i dont get all that extra shit
you know how long it takes to undo all those little black clips?
as a guitarist, people posing with the instrument lowkey bugs me kek
>>105728260underage faggot
We are creating folk art.
How do the chroma versions improve over each other? I have 39 but I see there is already 40 out.
put the cartoon frog in a red sports car, that is on a race track during the day.
>>105728267
instruments are a great benchmark because models will get the wrong number of strings and wrong proportions all the time
just like the dots on her fretboard are in the completely wrong place
>>105728259i'm not pajeet, i use a laptop. it's not my fault that jews only sell vamlet settings. and i can't go to work with a desktop, lol. besides, i prefer laptops in general
>>105728304prompt to remove all fret dots
my fav instruments lacked them irl<3
put the cartoon frog in a kitchen, where they are cooking a meal on a stove. give them a white chef hat.
very fun model for edits
>>105728160
I think Descript has a free version for personal use to clean audio, it works pretty well as far as I am concerned.
>>105728160
UVR (Ultimate Vocal Remover) Audio Isolation
>>105728346checkin it out thnx
>>105728360think their github is a safe place to download from?
have you tried it?
Takes 5 minutes to generate an img2img prompt on kontext dev GGUF on my 2060S. Is a 500 dollar GPU going to make a big difference or should I save up longer for a better one? I am not that impatient but 5 minutes is a bit too long.
>>105728346>>105728376it will take some time it seems,,,,
>>105728382
GitHub is theoretically the safest place to download anything.
I used it a lot to feed vocals into RVC to make Squidward sing Taylor Swift. I was impressed by the ease and quality honestly
>>105728402
Don't bother with upgrading unless you can get something with 16+ gigs of vram
flux kontext is so fucking good for fast shitpost
>>105728402
any 24gb vram GPU now costs $1000+.
you can probably get some used shit for $500.
16gb vram works, but you have to spend time to adjust workflow.
>>105728519
Works straight out of the box on my 4080, but it's 40s per img2img
>>105728417
nice gen, used it for a kontext test:
change the background to a bakery with delicious vanilla cakes. above the girl is a sign saying "CAKE SHOP".
The mythical 3 door Impreza.
>>105728501
are you changing the size in kontext from 1024x1024? does it work at other resolutions?
I was just getting started with ComfyUI and tried some custom workflows, which had me download some custom nodes. I did not realize how little vetting went into custom nodes, so now I made myself paranoid. Are any of these nodes potentially risky? Some of the workflows were pretty old.
ComfyUI-JakeUpgrade
Endless-Nodes
efficiency-nodes-comfyui
ComfyUI-Easy-Use
ComfyUI-AnimateDiff-Evolved
rgthree-comfy
tinyterraNodes
ComfyUI Image Saver
WAS Node Suite (Revised)
wtq-rembg-comfyui-node
I think these were bundled into some of the packs above:
Text Dictionary Convert
Image Size to Number
UltraLyticsDetectorProvider
TexT Find and Replace by Dictionary
Image Saver
Text Multiline
DPCombinatorial Generator
>>105728542It's all slop, anon.
>>105728548
Yes. It doesn't seem to care what the output resolution is but if you keep a similar aspect ratio it works better
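For anyone who wants to keep the aspect ratio without doing the math by hand, here is a rough sketch. The ~1 megapixel budget and the multiple-of-16 snapping are my assumptions, not something confirmed for Kontext, and `match_aspect` is just a made-up helper name.

```python
import math


def match_aspect(src_w, src_h, target_pixels=1024 * 1024, multiple=16):
    """Return (w, h) with roughly the source aspect ratio and about
    target_pixels total, each side snapped to a multiple of `multiple`.

    target_pixels and multiple are assumed defaults, not official values.
    """
    aspect = src_w / src_h
    # Solve w * h = target_pixels with w / h = aspect.
    h = math.sqrt(target_pixels / aspect)
    w = h * aspect

    def snap(value):
        # Round to the nearest multiple, never below one multiple.
        return max(multiple, int(round(value / multiple)) * multiple)

    return snap(w), snap(h)


# Example: a 16:9 source and the 382x397 screenshot size mentioned in the thread.
print(match_aspect(1920, 1080))
print(match_aspect(382, 397))
```

Snapping changes the ratio slightly, so treat the result as "close enough" rather than exact.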
>>105728554if you start rgal postin you can become nonsloppa ;3
>>105728508reminds me of brandish so much haha
change the girl's clothes to a black business suit.
her original outfit is best but again, just testing.
We should start a new thread for non-anime gens, it should filter out the trash
>>105728566>if you start rgal postin you can become nonsloppa ;3It is all slop and it always will be slop. You are not an artist.
>>105728575
Unfortunately kontext just nukes the art style with any large scale transformations. Smaller scale ones like changing hand gestures or hairstyle do work.
>>105728554Nuh-uh, there's quality slop and then there's hyperrealistic human atrocity slop like this
>>105728501
>>105728579>>105728580tell me what to say
tell me what to think
tell me how to make my art
wont you please? statists.
many such cases, SAD!
change the girl's clothes to a white dress.
not bad
>>105728599does it know\respect danbooru tags?
>>105728575
>>105728606no idea im still new and learning. prob not, but it knows miku like flux did.
>>105728589
it's actually pretty impressive, look how it copied the style of the text from the indiana jones atlantis game. just need more time to learn what it can do.
>>105728617do you have to feed it a 150x150px potato to start with?
do you not see the distortion on your potato mobile phone???//
>>105728566I don't know what rgal is but I can turn her back into anime. I'm just playing with this shit, seeing what it can do.
>>105728617change the man in the hat on the left into anime Miku Hatsune wearing a brown fedora.
>>105728613>>105728606so then probably:
https://danbooru.donmai.us/wiki_pages/list_of_families_of_pokemon_main_characters
kinda neat
>>105728250please respond
>>105728628
the original image is low quality. I could get a new one, but it is an older PC game.
>382x397
>>105728634change the man and woman on the right of the image into anime style.
>>105728638
>>105728250
remove unwanted ears manually
generate multiple images at a time (randomize)
prompt properly so only ONE woman has cat-ears
with multiple subjects things can get fucky depending on the lora\checkpoint
we aren't mind readers and cant see your screen while you click on things
>>105728610hes right you know
>>105728657change the lucasarts logo at the bottom right of the image into the text "LDG"
>>105728550
Yeah, make sure to check the code of each custom node you install
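If reading every file by hand is too much, a first pass can be automated. This is only a heuristic sketch: the pattern list is my own guess at what is worth a second look, `scan_nodes` is a made-up helper, and a clean result does not mean a node pack is safe. It just tells you where to start reading.

```python
import re
from pathlib import Path

# Assumed, non-exhaustive patterns worth a manual look. Expect false positives.
SUSPICIOUS = {
    "shell/exec": re.compile(r"(?<![\w.])(os\.system|subprocess\.|eval\(|exec\()"),
    "network": re.compile(r"\b(urllib\.request|requests\.(get|post)|socket\.)"),
    "pip install": re.compile(r"pip\W+install"),
}


def scan_nodes(custom_nodes_dir):
    """Return (path, line_number, label, line) for every matching line
    in any .py file under custom_nodes_dir."""
    hits = []
    for path in sorted(Path(custom_nodes_dir).rglob("*.py")):
        text = path.read_text(encoding="utf-8", errors="replace")
        for lineno, line in enumerate(text.splitlines(), start=1):
            for label, pattern in SUSPICIOUS.items():
                if pattern.search(line):
                    hits.append((str(path), lineno, label, line.strip()))
    return hits
```

Call it with the path to your own `custom_nodes` folder and print the hits. Plenty of legit nodes download models or shell out to install dependencies, so a hit means "go read this bit", nothing more.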
>>105728657>>105728672>change the BY HAL BARWOOD to >BY TARDPOST SCHIZOIDS
>>105728672last one:
change the style of the image into manga style, in black and white.
plus
>>105728682
>>105728532
mine takes all 23gb with kontext Q8.gguf for whatever reason.
1024x1024 flux takes 30 seconds to gen on a 3090 though
>>105728712
text is better in this one.
>>105728724
I am using FP8 for model and text encoder, maybe that's why
we're in a new era of AI edits now. I like how text swaps keep the original font style/typeface.
It's annoying how hard it is to get anything resembling consistency with 2-subject images even with regional prompting and comfy couple.
lmao
have the black woman holding a sign saying "We wuz TONY STARK n sheet". The woman is smoking a white joint.
>>105728809
use controlnet canny or depth, or openpose: it will always try to follow the original lineart/depth map/pose.
>>105728817>>105728712>>105728726kekked
i am starting to love you guys
how censored is flux kontext? i'm asking for memes and nsfw both
>>105728842the only onahole salesman i can actually trust
>>105728847Doesn't do tits if you ask but will do tits if the original image has them.
>>105728847If you use an input image of a nude woman, it works fine especially if it's full body nudity
But it does not nudify people
For "memes", it doesn't refuse prompts like cloud models since there are no guardrails
change the location to a grocery store. have the black woman walking into a grocery store. she is pushing a grocery cart. Above the store is a rectangular sign that says "EBT cards only".
there is INSANE potential for this model, infinite meme possibilities and even actual practical professional uses (fast edits of content in different situations)
>>105728860kek that's clever
but what about things like guns, explosions, violence etc?
I just made myself slightly muscular for my tinder profile
Wish me luck
>>105728877anon, just go >>>/g/lmg and be happy
buy a vr headset, buy an ona hole
be happy dont worry be
>>105728872thanks for the info
>>105728877have you tried working out instead kek
but honestly, i don't see how its any different
filters, edits, makeup, etc
your main issue will be the reality-check irl
u lil catfish ;3
>>105728876Seems fine.
>>105728877I bet you are Indian.
>>105728896>I bet you are Indian.If indians knew about this general we'd be in deep shit. Look what they did to /r/stablediffusion
>>105728915>look at redditno.
>>105728929i wouldnt mind
>>105728929What a weird necklace, it's just buttons
>>105728915Ummm, they're already here saar.
>>105728942>a choker necklace with circular decorative pieces
>>105728929
Man, Chroma has anatomy flaws and all, but after trying Flux again thanks to Kontext, seeing this really makes you realize how slopped base Flux is
make the image like a 1930 war propaganda poster. At the top of the image the text "Haruhi wants YOU" is visible. At the bottom of the image the text "to go to WAR!" is visible.
https://docs.bfl.ai/guides/prompting_guide_kontext_i2i
reference for how to do stuff.
>>105728919she looks like my old roommate
>>105728965cmon maaan
Messed around for like an hour makin this with the VACE extension workflow. Had to drop down the rez to get it under 4 megs.
>>105728976>no masterpiece booru tags mentioneddisregarded
>>105729002i had one w\ an ass that fat irl ;3
>>105728994kek wtf
make the anime girl a glass sculpture.
neat
Using style transfers in Kontext is interesting when you feed it multiple art styles as the source
>>105729045i can still see her though
transform the image into pixar style.
this is what they do at calarts btw,
>>105729077with the original bocchi image:
using this style, make a photorealistic japanese woman with pink hair at the beach, wearing a pink two piece bikini.
>>105729095>photorealisticcmon maaan
>>105729002can you share this workflow? this is literally the basic workflow i want
>>105729095alternative:
make a photorealistic japanese woman with pink hair at the beach, wearing a pink two piece bikini, based on this anime image.
>>105729099I omitted that and got this:
Been gone a while bros. Did laura kinney bro ever get his new PC?
>>105728838thanks
we hate you rocketgurlp***
>>105726893
>update: flux kontext output images can be used commercially
don't be fooled, it was intended from the start
https://en.wikipedia.org/wiki/Decoy_effect#/media/File:Decoy_pricing.svg
give the anime girl spiky yellow hair. her eyes are now the color green. change her outfit from a pink tracksuit to an orange karate uniform.
give the anime girl black color hair. change her pink tracksuit into a black tshirt that says "LDG" in playful text. she is pointing at the camera.
transform the image into ghibli style.
>>105729230
>give the anime girl black color hair.
you can do that with inpainting though?
>change her pink tracksuit into a black tshirt that says "LDG" in playful text.
I agree that the text editing is cool, you can't do something like that with inpainting
>>105729245
the amount of stuff you can do fast is amazing, also I couldnt do the text at the top like in
>>105728617 with img2img inpainting.
it's a very neat tool, cause it can copy styles or transform stuff in ways inpainting cant do easily, even with controlnets.
>>105729230
>3 edits at once
y'know, what this anon did is smart, because if you make too many iterations you'll notice the jpg artifacts, so you might as well ask the model to do most edits in one or two iterations
>>105729266would anally destroy this girl
vidgen will never reach these heights
hillary declares war on pepe the frog
but revised:
>>105729299
I feel like this is easier to do with inspect element
>>105729293im not that faggot, i had to unhide your post to check...
>>105729305
stuff like the indiana jones example copies the style exactly, also for images, kontext has the ability to replicate the font exactly. i'd need the grim fandango font for this, for example. also i'd have to do the gradient the same way.
it can do all that without the actual font, it's neat.
>>105729315
the issue is the style, it always outputs the same generic flux anime style, look at your migu, it doesn't fit at all with the other characters from Grim Fandango (btw if you never played that game what are you doing? DO IT)
>>105729324
ive beaten it twice, but what's neat is kontext can replicate a font style, or copy styles, or put characters/objects in new poses and so on. it's not perfect but it can do really neat things.
>>105729319would NOT
did she become black after exiting the home or did the home become black because she's black?
>>105729299the woman on the left is sitting in a chair reading a book that says "TRUMP WON" in bold black text. She is crying. keep the same structure in the image.
not bad!
>>105729319
sadly chroma has a big composition issue, all images look like they are a cutout plastered on top of the background, they rarely have any depth
>>105729352
>rarely have any depth
y-you mean like depth of field or something else
>>105729352
that's what happens when you try to transform 2d images into photorealism, realistic pony models suffer from the same issue
>>105729364
I mean the way the subjects are positioned in the images, it's like you copied and pasted some transparent png of some character into another background and matched the light, but it's like they are not interacting, it looks flat
>>105729386
>it looks flat
true, looks like Chroma has SDXL's vae even though it still has the good Flux's vae
>>105729315
>it did it better than 4o
based
source image: jc denton
the man is pointing an ak-47 to the right.
neat
>>105729315
>>105729403
I wanted to test it out on kontext max but Migu is VERBOTEN
>>105729324
>the issue is the style, it always output the same generic flux anime style
will a finetune fix this
>>105729417
Well, it's a gun anyway. It looks like some kind of ar-15 rifle, sort of.
We need better prompting for lots of stuff. Especially faces. Highly detailed facial descriptions should be possible, enough to bear uncanny resemblance (like a police sketch).
>>105729459>mfw I'm browsing gif and I see a cute girl>girl
I tried the great photoshop killer released today. it's shit
>>105729479
why are you using 1.4 negative prompts?
>>105729487
Defaults, and it still fucks up
remove the gun in the picture. the man in the image is holding a sign that says "GOT AWAY WITH IT".
he can't...
source image was shitty but it still works kek
>>105729459
>All hail, Macbeth! Hail to thee, Thane of Glamis!
>All hail, Macbeth! Hail to thee, Thane of Cawdor!
>All hail, Macbeth, that shalt be king hereafter!
>>105729352
run some images you like through joy caption, use maximum output length and check all the composition boxes.
use that as a background description
chroma can be directed very well composition wise
>>105729516Don't plasticize my gens you faggot
>>105729459>mfw I post the wrong WAN_ITVV_000xx.mp4 video on /ldg/ doxxing myself and my neighbor
>>105729493the man in the picture is in an orange prison outfit. the location is a prison. the man is behind bars.
>>105729537
>the location is a prison.
kek, they sleep in the woods, poor them
always use official resolution for cosmos 2b (1280x704)
any other value will result in inconsistency and body horror
anyone who wants to finetune on it must know about this caveat and only train on 1280x704
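If you are prepping a dataset for that, every image has to end up at exactly 1280x704. A minimal sketch of the crop math, assuming a plain centered crop is acceptable for your images (`crop_box_for_target` is a made-up helper, not part of any Cosmos tooling):

```python
TARGET_W, TARGET_H = 1280, 704  # resolution quoted in the post above


def crop_box_for_target(src_w, src_h, target_w=TARGET_W, target_h=TARGET_H):
    """Largest centered crop of the source that has the target aspect ratio.

    Returns (left, top, right, bottom) in source pixels.
    """
    # Compare aspect ratios by cross-multiplying to stay in integers.
    if src_w * target_h > src_h * target_w:
        # Source is wider than the target: keep full height, trim the sides.
        crop_h = src_h
        crop_w = src_h * target_w // target_h
    else:
        # Source is taller (or equal): keep full width, trim top and bottom.
        crop_w = src_w
        crop_h = src_w * target_h // target_w
    left = (src_w - crop_w) // 2
    top = (src_h - crop_h) // 2
    return left, top, left + crop_w, top + crop_h


# Example: a 1080p frame loses 12 rows at the top and bottom.
print(crop_box_for_target(1920, 1080))
```

The box uses the same (left, top, right, bottom) order as Pillow's `Image.crop`, so you can crop with it and then resize the result to 1280x704.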
Hey I just want to make an apology.
I am one of the "Anti chroma" people. I'm making this apology not because I now think Chroma is good. It definitely has its flaws. But I argued them in a disingenuous way. It's a fine model with its ups and downs like all models.
I got into an argument with someone about something I felt passionate about and I realized how bad it made me feel to have my feelings clowned on like that. After a little reflecting, I realized I was so obsessed with "winning" the argument that I became a monster who took a small shortcoming of a model and used it as a weapon to attack people who were passionate about something.
So yeah. I'm sorry if I made anyone feel bad. I don't want anyone to feel bad like that. I forgot about the person behind the screen.
the man in the image is pointing a black pistol at the camera. at the top of the image is the text "buy Skyrim OR ELSE!" in bold, playful text.
>>105729554
>always use official resolution for cosmos 2b (1280x704)
>any other value with result in inconsistency and body horror
wow, what a great model!
>>105729108
It's from this reddit post.
https://www.reddit.com/r/StableDiffusion/comments/1llx9uq/how_to_make_a_60_second_video_with_vace/
You generate a series of overlapping clips (the overlapping parts differ slightly in each video, the overlap helps with maintaining motion and coherence between the clips). Then you crossfade the clips together where they overlap to help with the variance. I had chatgpt write a python script to handle the crossfade using ffmpeg.
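Not the script from that post, but the crossfade step could look roughly like this. It only builds the ffmpeg command using the `xfade` filter. It assumes all clips share resolution, frame rate and pixel format, ignores audio, and `build_crossfade_cmd` plus the file names are made up for illustration.

```python
def build_crossfade_cmd(clips, durations, overlap, output="joined.mp4"):
    """Build an ffmpeg command that chains xfade across overlapping clips.

    clips:     list of input file paths, in order
    durations: length of each clip in seconds
    overlap:   seconds shared between neighbouring clips (the fade length)
    """
    if len(clips) != len(durations) or len(clips) < 2:
        raise ValueError("need at least two clips and one duration per clip")

    cmd = ["ffmpeg", "-y"]
    for clip in clips:
        cmd += ["-i", clip]

    filters = []
    prev_label = "[0:v]"
    running = durations[0]  # length of the video joined so far
    for i in range(1, len(clips)):
        # The fade starts `overlap` seconds before the joined video ends.
        offset = running - overlap
        out_label = f"[v{i}]"
        filters.append(
            f"{prev_label}[{i}:v]xfade=transition=fade:"
            f"duration={overlap}:offset={offset}{out_label}"
        )
        # After xfade the result ends where the new clip ends.
        running = offset + durations[i]
        prev_label = out_label

    cmd += ["-filter_complex", ";".join(filters), "-map", prev_label, output]
    return cmd


# Example: three 5 second clips that overlap by 1 second each.
print(" ".join(build_crossfade_cmd(["a.mp4", "b.mp4", "c.mp4"], [5, 5, 5], 1)))
```

Pass the returned list to `subprocess.run` to actually render. Each join shortens the total by the overlap, so three 5 second clips with a 1 second overlap come out at 13 seconds.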
>>105729554
>always use official resolution for cosmos 2b (1280x704)
I want to call bullshit but that is unironically the most coherent cosmos gen I've yet to see
>>105729558the man in the image is sitting on a throne in a castle. he is wearing a crown. Above the throne is a sign saying "king of LIES", made out of marble.
>>105729555
>It definitely has its flaws
*paws
https://www.reddit.com/r/StableDiffusion/comments/1lm0xec/style_transfer_with_kontext/
>Pro and max can transfer a style from a source image, why dev can't?
the goyim knows, SHUT IT DOWN
>>105729571
i posted this a few threads back, very large chance of nonsense background and body horror on 1024x1024
on the other hand all 1280x704 with the same prompt look nice and consistent
try it yourself and you will be surprised how nice it is on 1280x704
>>105729591I too want to know why. but maybe it can, working on it... like maybe. Probably not...
change the grass in the image to lava.
now it's a hellscape.
She's way too big, but I have success.
Step 1: add a woman to backrooms
Step 2: in Gimp, I added and scaled the source image into place. Intentionally, I did very minimal work. This is the before pic here.
Step 3: prompt: Replace the area surrounding the woman's head to make it match the rest of the room.
>>105729591
to me that's the most disappointing part of kontext dev, it can only do one image input (no, stitching 2 images together is a cope, I don't count that), it's more fun when you put 2 images and you mix them together (mixing with characters or styles, or dresses...) like on 4o or Omnigen 2
>>105729618I'm aware the scale is bad, I just wanted to prove that it could be done. It can be done w/o huge skills.
Admittedly, I'm fairly good with shooping, so my positionin
>>105729596im curious what that prompt looks like at the official resolution kek
>>105729275>buttstuffgaaay
I know how to inpaint and use controlnet stuff but this is a really fun model and a very useful tool to use along WITH controlnets, you could use openpose to gen a 1girl then use kontext to make clothing/hair changes, or change the setting/actions.
>>105729617conversely...
change the grass in the image to delicious cakes. remove the skeleton from the picture. delicious cakes are scattered around the image. change the text "THE RIDE NEVER ENDS" to "Fun ride time only!"
>>105729625My suspicion is we have promptlet issues here. I'm working on it:
>>105729618>>105729630
>>105729629
what is taking him so long? is writing a C architecture that time-consuming?
>>105729617
Maybe say detailed red and yellow lava?
>>105729310>>105728838newfag here, why do hate this again?
>>105729692
Imagine seeing the same style of gens every day for months or years.
>>105729698Neat. friendlier lava
>>105729692we don't he's just an unhinged schizophrenic
>>105729698btw I hated that lava level in Mario 64
kontext is also a fantastic pepe generator.
the cartoon frog is sitting on a couch with the same expression. on a TV nearby the text "LDG" is visible on the screen.
real comfy hours.
>>105729735Honestly I would fuck this dead body
>>105729741the cartoon frog is sitting on a couch with the same expression and clothing. on a TV nearby the text "LDG" is visible on the screen.
there, details matter
>>105729741looks like it has been badly pasted with photoshop kek
>>105729762
>has hidden the hands and feet perfectly because Chroma can't do that
that's art
>>105729762
see that's what im talking about, the girl is too big compared to the house, it doesnt make any sense perspective-wise, sorry chroma is DOA to me
>>10572978>hid handsworks for the rocketposter
>>105729788
>chroma is DOA to me
for me it's DOA but for another reason, the future of image models is now something like Kontext, because these kinds of models can do t2i by themselves and with an image input, that alone makes everything we had before obsolete
the cartoon frog is in a condo and looks outside, to reveal a hellish landscape with lava and fire, with the same expression and clothing. he is smoking a cigar.
world ending? don't care.
>>105729575>change man to bernie sanders
>>105729807the cartoon frog is in a condo and windows behind him reveal a hellish landscape with lava and fire. he has the same expression and clothing. he is sitting in a recliner and watching a TV that says "news: world ending" as a headline.
>>105729801
Until kontext gets a finetune like chroma it's gonna be inferior for getting an initial gen. Yeah sure kontext lets you take an existing picture as a reference but it's kinda shit in terms of censorship.
the cartoon frog is in a condo and windows behind him reveal a tropical resort with palm trees and an ocean. he has the same expression and clothing. he is drinking a tropical drink, and a bowl of fruit is on a coffee table nearby.
so kontext is for i2i?
sorry I'm slow
>>105729824of course, that's why I said "something like kontext" because kontext dev aint it, and it'll never will, the licence is way too bad to have a healthy ecosystem on top of that, at this point I'm just waiting for the chinks to steal the technology and make it uncucked and have the nice apache 2.0 licence on top of that
>>105729833default pepe this time:
the cartoon frog is in a condo and windows behind him reveal a tropical resort with palm trees and an ocean. he has the same expression and is wearing a white t-shirt and blue shorts. he is drinking a tropical drink, and a bowl of fruit is on a coffee table nearby.
it just works
>>105729847>it just worksdo you care that it looks like SHIT?!??
>>105729854>do you care that it looks like SHIT?!??nta, but I also don't like it because it can't keep the original style, it always transform it onto some digital art slop, you can tell those retarded trained their model on """copyright free""" shit
>>105729847the cartoon frog is on a sunny beach walking around, with palm trees and an ocean. he has the same expression and is wearing a white t-shirt and blue shorts. he is drinking a tropical drink.
>>105729858damn, even pepe has the manlet effect, god I hate the funky pop style that kontext dev is doing
>>105729857>>105729854Look, we know you're only here to steal stuff and farm it.
>>105729858same prompt except feelsgoodman source
it's a fun model.
>>105729870
Works better to do one thing at a time.
>>105729879>Look, we know you're only here...
kek
the cartoon frog is in a condo, windows behind him show a sunny beach, with palm trees and an ocean. he has the same expression and is wearing a white t-shirt and blue shorts. he is drinking a tropical drink. A TV nearby shows the headline "Ironheart massive FAILURE!", with a black woman above the headline. the cartoon frog gives the thumbs up.
this is a fantastic model EVEN JUST for the editing capability and text stuff. then there is style edits.
>>105729858>>105729881I still can't unsee the artifacting. oh well, at least you are having fun
>>105729782the struggle continues
>>105728436what lora did you use for this one?
>>105729903
>I still can't unsee the artifacting.
why is this the case though? is it because the VAE isn't perfect?
>>105729917
It's because it's in the original, and kontext doesn't seem to do any image enhancement.
>>105729917
it's using the sota 16 channel vae. this is a kontext issue. you can really see it the more editing you do on the same image
>>105729923>kontext doesn't seem to do any image enhancement.wrong, everytime you open your mouth debo you say something completly retarded
>>105729261
kek it actually copied the SIPS style chips from the source image properly
the cartoon frog is wearing a white t-shirt and blue shorts. He is standing outside at the beach, with palm trees and the ocean. he has the same expression. the cartoon frog is holding a magazine with a cartoon frog with the headline "FRENS".
pretty good
>>105729591
isn't one of their examples style transfer?
though I tried on oil painting and it doesn't work
>>105729927
to be fair it's the first time we got a model like this, I think I need more models of this type to make up my mind, for example 4o doesn't make jpg artifacts even if you go for a lot of iterations, but they use another architecture that's not using a VAE so...
>>105730010no but the image degrades in a different way. an anon posted a video of the asian girl who kept feeding the output of herself repeating for 100 edits. it just turns her into a black man. also piss filter
>>105730027yeah but that's not because of the architecture, more because of the system prompt that asks the model to render diverse heckin fat black woman, OpenAI is a really woke company, and for the piss filter it's made so that you can ID their image as an AI image
has anyone figured out how to clothes swap yet?
When I heard about kontext I thought you could prompt "use clothes in image ref 1 and make person in image ref 2 wear it" etc... but it isn't the case
Is there negative prompt on Kontext? If you do anime, they all come out loli looking. Flux Dev has the same shitty issue.
til that the cia doesn't want me to use Kontext.
wow.
Joke's on them, I've been nofapping for 48 hours, so my loush is positively charged.
>>105730086nooooooooooo
that's islamophobic!
give the man on the right very long blonde hair.
>ghibli
>pixar
sure is 2022 here
>>105728842
>>105728929
gib workflow
my attempts with the example workflow always end up looking slopped
I can't find Kontext model on civitai
>>105730092
Have you succeeded at changing people's height?