Discussion of Free and Open Source Text-to-Image/Video Models
Prev: >>105754990
https://rentry.org/ldg-lazy-getting-started-guide
>UI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Models, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info
>Cook
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe
>WanX (video)
Guide: https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1
>Chroma
Training: https://rentry.org/mvu52t46
>Illustrious
1girl and beyond: https://rentry.org/comfyui_guide_1girl
Tag explorer: https://tagexplorer.github.io/
>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate
>Neighbors
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/celeb+ai
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg
>Local Text
>>>/g/lmg
>Maintain Thread Quality
https://rentry.org/debo
suspicious collage of unclad maidens
gen quality has gone downhill last few threads
recap:
>>105758177>TARDanon believes people can't tell if an image is AI 9/10 times. possibly due to ego, drugs, mental illness, or a combination of the three.
Blessed thread of frenship
so Flux Kontext is a slightly smarter local model, better than Google Flash 2.0?
>'Convert the image to realistic photo'
>52 iterations
M A N L E T O V E R D R I V E
>>105758284
this is entirely the fault of Kontext.
>>105758288wow you're absolutely seething and you don't even understand why you're stupid
so anyways anon, why don't you start posting images and let's guess which ones are AI
>Since v29 it has no sovl anymore and can't do vintage, just trust me bro
Meanwhile, effortlessly
Is this AI? Anon has told me if I train on this it will make all my outputs for "red color" bad.
>>105758343sd 1.5 tier slop
>>105758343
>Since v29 it has no sovl anymore
this is a fact yes
what's that little mask that sluts wear in amateur porn called, you know what I'm talking about
thought it was an opera mask but that's something else
>>105758288what else to add other than an equally bizarre 90s video
https://www.youtube.com/watch?v=xo9vf9YmrqA&list=RDxo9vf9YmrqA
ok now he goes full bananas. figures. maybe it's the heat?
>>105758345
I need a definitive answer if this is AI or not.
I'm not interested in this discussion, but proprietary imagegen is getting really good, way better than the "sdxl / flux / chroma" slop.
Pic related, it's Reve.
>>105758362masquerade mask
>>105758380go back to /adg/, this is not the place to shill API models
>>105758380Why would I want to create an image like this instead of
>>105758163 ?
Can your shitty model do that Patchouli? I reckon not
>>105758381
>masquerade
https://www.youtube.com/watch?v=_-LpDo9vVN8
>>105758360
You cherry-picked an image to prove your point and it doesn't prove anything...
Are you even trying?
>>105758403
>Can your shitty model do that Patchouli?
Kontext Dev can do whatever you want lol
>>105758422
>nooo, when you do it it's cherry pick, when I do it it's legit
you can't be your own judge anon
>>105758440it's only got prompt adherence going for it, but aesthetically we're evolving backwards
>>105758452you're learning they're bad faith contrarians you can't possibly win any sort of "is this image good or not" contest, they'll say everything is shit
>>105758330Okay I, who believes you can always tell the difference, will host a retarded game of your choosing to accomplish what? You just wanna lie and pretend you don't know which ones are AI? Has anything in any of these threads since the beginning of AI made you think an image wasn't AI generated? No. Because you can always tell and it's not even close
>>105758488Okay, so I train a model on images that you can't tell are AI which you admit you can't always do, will the outputs automatically be bad? Or are you going to concede the outputs ultimately are determined by the inputs.
>>105758360Neither of your examples look vintage to me. It is a prompting issue.
>>105758502What are you talking about? That's like 80s glamor and Earth to anon, the 80s were almost 50 years ago. Now you're just moving goalposts to what you think "vintage" means.
>>105758500>>105758288>a combination of the three.
>>105758502you're arguing in bad faith, it's not about v29 being great, it's about v29 being better than the most recent versions, how much do they pay you to shill like that?
>>105758360at first I was sad that that horse fucker decided to kill that model with his distillation bullshit, but now that Kontext Dev exists, I simply want a model like that, I'll disregard any image model that can't do reference shit like Kontext, this is the new standard, this is the future of imagegen
>>105758380
>shilling api model
>don't even give proompt for comparison purposes
here's attempt #1 with chroma, having no idea what the prompt was. I already got similar image and lighting quality, but the composition isn't good.
>>105757723I'm back to report that putting slow motion and slow-mo in the negative prompt is halping, I'd love to see your actual workflow though
>>105758488diff anon here. there are gens that just look fucking real but those are rare. we've trained our perception to detect the specific ai errors ever since this tech came out too. imagine showing your past self (from around the time this tech got released) good chroma gens
>>105758452
I would have expected than when you cherry-picked an image to show this quality degradation you speak of, it would actually show quality degradation.
>>105758565
>here's attempt #1 with chroma
that looks like ass, look at the green part of the flowers, it's so noisy, looks like a bad SD1.5 gen, and SD1.5 had the excuse of a bad VAE, what's Chroma's excuse? it has the same vae as fucking HiDream and Flux dev
>>105758542> when gravity stops working
>>105758586still waiting for that prompt, APIcuck. do you think I know everything about your faggy flower collection hobby?
>>105758582
>I would have expected than when
ESL spotted.
>>105758597
>nooo, it must be a really specific prompt to get a quality image
anon, the image should be of quality with every single prompt you put in it, wtf is that weird cope?
>>105758608API isn't even comparable because they use LLMs to pollute your prompt with tokens.
>>105758360
>Convert this photo to a vintage 1980s magazine photograph style. Maintain the general context and composition of the photo.
It's... something? It added some grain
>>105758633For example, you would get a much better result with Chroma if you finetuned an LLM on their training caption vomit and "enhanced" your prompts to look like their vomit.
>>105758608
>the image should be of quality with every single prompt you put in it
the image should be of the quality I ask for.
I get it now, you just ask your API model "give me picture of guy with flowers" and it gives you a bunch of random shit you didn't actually ask for. this is why you won't give a prompt for comparison lol.
>>105758662>average HiDream render
>>105758542Wai? Also, did you use face detailer for the eyes?
>>105758683
>the image should be of the quality I ask for.
and you asked for shit? because that's what this oversaturated, grainy image is
>>105758565
API models are shit
Chroma is shit
someone should break into BFL office and steal kontext max and then decensor it and then we will have local models solved
>>105758666nah. you're better off running a low pass resample and/or post process in anusshop/krita
>>105758696look at that floating hand, course its detailed lol
>>105758709
>someone should break into BFL office and steal kontext max and then decensor it and then we will have local models solved
unironically this, an uncucked kontext max would be fucking game over, absolute peak
>>105758732I just want krea to gen floating porsches until I die =/ KREA KONTEXT.
>>105758672
>For example, you would get a much better result with Chroma if you finetuned an LLM on their training caption vomit and "enhanced" your prompts to look like their vomit.
it's unfortunate how much CogVLM style prose is required, but it responds surprisingly well to early dalle-3 style prompting where you reinforce main prompt with short summary in the end
>>105758515
Yes, but skin looks glossy in all your example images though. Where is the texture, where is the graininess? Even a single bad token can cause sloppiness.
>>105758666I don't know what it did but it looks authentic
>>105758539I don't agree.
Should a trained model give good results without having to use adetailer or high-res fix?
I'm training Illustrious XL and the generated images look low quality. I have to use adetailer and high-res fix, but some stuff in the images still looks like crap.
What am I missing? The dataset is good
>>105758809I don't agree with your disagreement
>>105758845white socks barefeet>
>>105758809why did you make this one backwards
Guys, it don't matter how photo-realistical your model is if you can't use that model to render photo-realistical teats with it.
tl;dr post your most photo-realistical teats. any teats renders you have. toons styles welcome too.
>>105758709>someone should break into BFL office and steal kontext max and then decensor it and then we will have local models solvedI bet these companies don't actually have great security. Someone could probably hack BFL (or NovelAI or MJ) and steal their models like a modern day Robin Hood. Does anyone in a 3rd world shithole with no law enforcement want to be a hero?
>>105758781
>it responds surprisingly well to early dalle-3 style prompting where you reinforce main prompt with short summary in the end
do you happen to have an example of this, or comparisons?
>>105758917
>Someone could probably hack BFL (or NovelAI or MJ) and steal their models like a modern day Robin Hood.
it already happened with llama 1 and NovelAI, but it's been long ago though
>>105758928
>but it's been long ago though
dude it was literally yesteryear
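Not the anon's actual prompt, just a minimal sketch of the shape being described: a long descriptive prompt with a one-sentence recap of the subject tacked on at the end. The helper name `reinforce_prompt` and the example text are made up for illustration, and whether this helps with Chroma specifically is the anon's claim, not something this snippet shows.

```python
def reinforce_prompt(main_prompt: str, summary: str) -> str:
    """Append a short recap of the subject after the long descriptive prompt."""
    main = main_prompt.strip()
    # Close the main prompt with a period so the recap reads as its own sentence.
    if main and main[-1] not in ".!?":
        main += "."
    return f"{main} {summary.strip()}"


# Hypothetical example text, loosely based on the drying-room prompt in this thread.
prompt = reinforce_prompt(
    "A rustic indoor drying room with bundles of herbs hanging from wooden "
    "ceiling beams, soft natural light from a window, a person in a brown "
    "shirt inspecting the plants",
    "In short: a person inspecting hanging herbs in a rustic drying room.",
)
print(prompt)
```

The recap goes last on purpose, so the subject is restated after all the style and lighting prose.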
>>105758917
Given how rare leaks are, the fact is that unless you're completely incompetent and passing around model files with USB sticks or Huggingface downloads, most people who use cloud services for training can effectively lock down all access. They make it literally impossible to directly access or download the model files by putting them somewhere that is difficult to reach without also being a stakeholder.
>>105758343
>>105758360
>>105758809
>>105758845
chroma is a bad model and people cope by prompting for blurry candid vintage analog crap. the model is still incredibly noisy thanks to the initial de-distillation and lacks the texture of better Flux models like Pixelwave and Krea. it will only continue to get worse because flux schnell is an adversarial architecture
>>105758949
>blurry candid vintage analog crap
post a base flux img of blurry candid vintage analog crap tho
>>105758396>>105758565suck my dick and balls, the discussion was about either you could use synthetic data for ai training, I'd say if its good enough for the human brain, its good enough for AI
calling me a shill won't shelter you from seeing how far behind open source models are currently
Here is the prompt
>A rustic indoor scene of a drying room with bundles of herbs and flowers hanging from wooden ceiling beams. The herbs are various shades of green, with some clusters of white flowers among them. A person wearing a brown shirt, carrying a small vintage camera on a shoulder strap is seen inspecting the plants near a window, with their back partially turned to the viewer. The space has wooden elements, soft natural lighting, and a slightly cluttered, functional atmosphere, evoking a warm, organic, and rural aesthetic. Shot with a vintage film camera with the flash on, drenching the herbs in flash. Vintage film photography with harsh flash illumination and high contrast between light and shadow.
>>105759017>I'd say if its good enough for the human brain, its good enough for AIretarded take, AI don't reason the same as us, you're completly retarded dude
>>105758817Can anyone post illustrious training settings please? Nothing I'm doing seems to work. Generated images are low quality
>>105758969flux dev + amateur photo lora looks better than chroma and isn't covered in layers of noise
https://civitai.com/models/652699/amateur-photography-flux-dev
https://civitai.com/models/970862/amateur-snapshot-photo-style-lora-flux?modelVersionId=1944723
chroma fucked up with the schnell de-distillation, and detail-calibrated makes it clear that all the 'heckin authentic amateur phone quality!' is just the result of training a 1024x model on 512x images and not actually a style the model learned
>>105758945Now that's some good teats.
I like her enthusiasm too.
>>105759057
>needs a lora
>doesnt even post his own outputs
>examples look more slopped than chroma
>getting more and more upset
On behalf of anon I will graciously accept your concession.
how do I reference the second image in the 2 image workflow? green cartoon frog doesnt work, I get a different frog and not pepe.
>>105759122
>>examples look more slopped than chroma
at the rate chroma is getting more and more slopped through the epochs, it's fair to say it'll end up at the flux level of slop by epoch 50 kek
i feel bad. i have never touched regional prompting because i dont care about multiple girls or how the male looks. what other uses in NSFW material can i use it for if im content with 1boy(faceless male) +1girl?
>>105759142disregard, im being dumb: I disabled the vae encode to make a single pepe on the beach at first.
>>105759142"anthropomorphic frog" works well on my end
have i finally figured this out?
Does someone have the AI video of the 2 shupogaki cosplay inside of like some old train?
>>105759160kek, it happened to me again, easy mistake to make
>>105759160that is, if you disable the vae encode at the top you can use the bottom image to generate a single image based on the 1 source not both. it worked, then I enabled it.
the man is holding the green cartoon frog, they are on a sunny beach.
is there a rentry page for kontext yet? this workflow is better than the default, way more options.
>Pic related its Reve
Can that censored/slopped model depict a female doing contortions?
Can it look like a real amateur photograph? Note the room can't be blurred (which is a symptom of slop). Can it even do feet? If not, then it's substantially worse than Chroma (unsurprisingly).
>>105759017
>the discussion was about either you could use synthetic data for ai training
did you forget to reply to said discussion or something??? you know we can see your original comment, where you mentioned no such thing. lmao
>calling me a shill won't shelter you from seeing how far behind open source models are currently
I just wanted to do a comparison. There's no point bringing up Reve and saying it's better if you don't give us a basis for a comparison.
For the record, some API models are clearly ahead of local in key areas such as text and comprehension. 4o and novelAI stand out here. but it's not a clear knockout, as they have their own limitations. Ultimately, I want local to get more powerful, but I am not concerned or trying to deny when API is ahead. I'd have enough entertainment for life if we stopped with SDXL.
here's an output with your prompt, Reve is clearly far above chroma at handling this scene. in particular, chroma has trouble with making more than one row of flowers, and with the camera and camera strap. I highly doubt that chroma will be able to handle this prompt very well when it's done either, but I'll save it as a benchmark.
that said, Reve is also failing at some parts here:
>flowers hanging from wooden ceiling beams
it's hanging from thin rods
>with the flash on, drenching the herbs in flash
>harsh flash illumination and high contrast between light and shadow
>>105759122
>doesnt even post his own outputs
>getting more and more upset
On behalf of anon I will graciously accept your concession.
>>105759207
>>105759198
Note that it must also be able to do the precise pose that the prompt describes, or it's shit. E.g. this one was
>Amateur photograph, a Japanese idol woman, performing an advanced contortion pose indoors, likely in a studio setting. She is sitting on a surface with her legs bent backward and extended over her shoulders, so that her feet are positioned and touching over her head, displaying an impressive level of flexibility.
>A white towel is draped over her front for modesty. She has straight black hair with bangs, and she wears a black wristband or watch on one wrist
>>105759017>if its good enough for the human brainit's not. fuck off
>>105759244Epic background for a retro fighting game. Share the workflow?
>>105759209>did you forget to reply to said discussion or something??? you know we can see your original comment, where you mentioned no such thing. lmaoI don't care about your feminine feud you and that other anon are having, but you've just outed yourself as a newfag. Lurk moar.
the girl is holding the green cartoon frog, they are on a sunny beach. keep the frog's expression the same.
>>105759254I don't usually save workflows in images but give me a bit and I'll clean up the json file and upload it somewhere.
>only in chaos can I find peace
took a random old palworld screenshot:
the green cartoon frog is sitting in the hot tub and is smiling. keep their expression the same.
>>105759254https://files.catbox.moe/jh68bz.json
also lora
https://files.catbox.moe/4w1102.safetensors
the asian girl is holding the green cartoon frog. keep the frog's expression the same.
Internet will be filled to the brim with AI slop in 10 years. It will be so ubiquitous you would be hard pressed to stumble upon real art
is this the future you want?
>>105759546standing by to see some examples of "real art". can't wait to see!
>>105759546By filled to the brim you mean completely decluttered because retards get automatically get filtered.
>>105759546
>muhhh beloved internet
it went to shit 15 years ago, and now you start to worry about its quality? give me a break
>>105759370>i will now pleasure myself with this fish
the green cartoon frog is on the stage to the right of the asian girl, holding a mic. the frog is wearing a white tshirt that says "singer" in scribbled text. the frog's face is unchanged.
there we go, got diff frog variations, last line fixed his expression:
>>105759578true internet peaked around 2005 when non whites didnt have access to it. internet with ai would be fine if we had a firewall that blocked out indians
>>105759546we're in an era like the first days of the internet now, where normies knew nothing about netscape/ftps/file sharing. now we have that with AI tools like noob/flux/wan/kontext.
>>105759546>you would be hard pressed to stumble upon real arti wonder how much damage this will do to the new generation of artists. who in their right mind would spend countless hours on art as a hobby knowing there is no future for it as a career? everything you do will be undermined by someone using ai tools.
>>105759629>why would someone do a hobbyYou do realize people do many hobbies for the fun of it right? Why do we do anything? Also career artists are why things are so shit right now.
a tiny version of the green cartoon frog is standing on the dirt path. the frog is wearing a red shirt and blue shorts. keep the frog's face and expression the same.
>>105759485
That is an impressive workflow. Have you been doing it for a while? Is that bf16 pixelwave flux? Doesn't it require over 24GB of vram?
>>105759629artists can harness AI the most, they can do all kinds of cool shit with img2img, look what controlnet scribble can do, even the worst line sketch can become a nice anime girl or whatever.
>>105759652
>You do realize people do many hobbies for the fun of it right
a lot of artists start drawing as a hobby and then turn it into a career once they feel their skill level is good enough, but that threshold will never be reached. the threat of ai will always be looming over them, and they wont feel as compelled to take it seriously because of it. it will remain just something they do on the side for fun, and someone doing something for free definitely will not output the quality of content as an artist that does it for a living.
>>105758788
the guitar actually looks pretty close, neat
>>105758945the boobs were literally gravity-defying magic haha
>>105758284most of eldeegee is not good hehe
>>>/g/sdg if you want inspiration\quality
>>105758237 (OP)
GREETiNGS FR0M >>>/vp/napt
we are your relevant neighbor board ;D
update your neighbors list
>>105759661
I use the fp8 version of pixelwave, which is 11 GB, though I have a 4090. I have never tried the bf16 one so I don't know if it would work. That workflow is way simplified. I removed all the stuff I didn't use for that image to get rid of the custom nodes, pic related is what that workflow looks like normally.
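Rough arithmetic for the bf16 question, as a back-of-the-envelope sketch. The only input from the thread is the 11 GB fp8 file. The assumptions are that fp8 stores one byte per weight, bf16 stores two, and the file is almost entirely weights. Text encoders, VAE, and activations are not counted, so the real footprint would be higher.

```python
# Bytes used to store one weight at each precision.
BYTES_PER_WEIGHT = {"fp8": 1, "bf16": 2}


def estimate_weights_gb(known_size_gb: float, known_dtype: str, target_dtype: str) -> float:
    """Scale a checkpoint size from one dtype to another (weights only)."""
    ratio = BYTES_PER_WEIGHT[target_dtype] / BYTES_PER_WEIGHT[known_dtype]
    return known_size_gb * ratio


# 11 GB is the fp8 file size quoted above.
bf16_gb = estimate_weights_gb(11, "fp8", "bf16")
print(f"bf16 weights: ~{bf16_gb:.0f} GB")  # prints "bf16 weights: ~22 GB"
```

So the bf16 weights alone would come close to filling a 24 GB card before anything else is loaded, which suggests offloading would probably be needed, though that part is a guess rather than something tested here.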
>>105759629>>105759665I think that real artists could do well if they adopted AI in their workflow. They have knowledge of art fundamentals which means that technically, they're still ahead of the rest but AI could increase their productivity.
I can understand why they are hesitant though. AI seems sacrilegious when you've spent so much time to acquire the holy grail.
>>105759694why are you still generating uncanny ugly 1.5 shit?
>>105759692I'm going to give a truth bomb, artists are dumb fucking people who work for pennies, who sold you out by allowing for some of the worst media and games to ever be created, who happily work on $50 horse armor DLCs without narry a whine, and the people you cry about would be better off slaving in the fields or mines rather than polluting our culture with bullshit and propaganda. No, I don't give a shit about them, and no, the people who do it for a living aren't actually good and I can point to every modern TV show, movie and video game made by a large studio employing these cockroaches.
-art degree haver
>>105759697as they should, artists learned to use pressure tablets and photoshop/illustrator/other apps, and layers, so why not harness these tools to make cool stuff?
>>105759546the feds\deepstate have been using Ai for decades probably
so you best start believing in Ai-psyops\psySLOPs stories\conspiracies
YOURE IN ONE
anyways art is ART regardless of your opinion on it
>>105759697There's also the inherent fact that AI means less control over their canvas. For a veteran, they will be constantly battling the AI gacha to get the results they want to the point it'd probably be faster if they just did it themselves.
>>105759727There's also the inherent fact that at a certain point you have to let go and become a senior artist and learn how to delegate which means things aren't always what you want.
>>105759713there he is!
our little friend who seethes\cries every thread
the one who hates beauty
the one who wants to dictate other peoples arts
the one who wants to censor other peoples speech
the one who wants to control the bakes
the one who wants to fight every thread
the one who drinks too much (prob alone)
the one who probably carries such a profound sadness that this general is all he has
the one who NEVER posts a GEN <3
the red hair anime girl is holding the green cartoon frog with her hands. the frog is wearing a red shirt and blue shorts. keep their expressions the same.
>>105759697>>105759727it really doesn't help all the ai tooling right now is advertised or kitted to "automating" everything. ai should work with existing tools and it would probably have a lot more adoption
>>105759727
Yes, but artists also have more control because they can do rough concepts first and only then feed it to the AI for polish, meaning they can get much closer to what they want than your regular guy.
>>105759767it's neat how wan understands physics despite having no actual physics model. it's all learned from data.
>>105759774>it's neat how wan understands physics despite having no actual physics model.pattern learning is a really strong tool, they learn patterns way better than us
>>105759771You say this, but no real artist with a huge following has adopted AI yet, despite image gen being pretty damn good now. Anyone caught using AI will be roasted alive by their artist peers. I don't think that stance is going to change for a long time.
the asian woman is holding the green cartoon frog with her hands, sitting in her lap. the frog is wearing a red shirt and blue shorts. keep their expressions the same.
Blade Runner Westwood lora in progress
>>105759794also what's neat about this is it made the image sfw as the right was bordering on nsfw.
>>105759751you are right. i think the next important step in ai development is identifying and developing proper and useful utility. Of course there are a lot of grifters who want to get in on the tech early and make a quick buck along the way. I see there are already books out on how to use stable diffusion to make art.
>>105759794so endearing... a truly beautiful moment caught in time.
the green cartoon frog is sitting to the right of the asian girl, on the bench. the frog is wearing a red shirt and blue shorts. the face of the frog is unchanged.
whats the best way to make an image reference static? dont change the expression?
>>105759032I'm out of credits on reve.
But I'm certain it can do "amateur photograph", and as for "feet" and "contortionists" here are some old gens:
https://files.catbox.moe/zvncdr.png
https://files.catbox.moe/pz1hk1.webp
>Note the room can't be blurred (which is a symptom of slop)
???? as opposed to "amateur photography" being used to mask the model, as pointed out by
>>105758949
>>105759209
Sorry, it was being discussed in the last thread.
I know the image is not yet perfect in terms of prompt adherence, but it compensates by being aesthetically pleasing.
>>105759250>>105759032lmao. Every single SOTA model is using some form of synthetic data, its a whole field of ai research, its quite ridiculous this is even being discussed here. You guys still are in 2020.
https://arxiv.org/pdf/2408.16333
>>105759813This is great. Bladerunner is a fantastic game, even today.
Are feet censored in flux?
>>105759870hmm just "green frog" seems to work.
the green frog is sitting on the bench to the right of the asian girl, on the bench. the frog is wearing a red shirt and blue shorts.
im getting kontext shill fatigue
>>105759790Sadly yes. Unfortunately it will open the door to less skilled artists who are willing to use these tools to get a foothold in the industry.
>>105759813looks cool, classic aesthetic
>>105759925
look friend. kontext is hot shit right now. it opens up many doors in image creation. I for one enjoy seeing it in action.
denying that it is a game changer is foolish on your part.
>>105758565
Please, this is garbage in comparison
Anyone trained loras on wai140? I'm gonna see if it plays well with Wicked City (retro noir anime with tentacles) dataset
>>105759813Such a good game. Syndicate was another with dystopian feel.
kek, used a persona 3 screenshot
the green frog is standing in front of the tv in the back of the room.
I dunno, I kinda like the Chroma output apart from her arm fuckery. Good enough for me right now to not spend credits on some service.
>>105759942
it's just better inpainting. it's not this revolutionary model, especially with how cucked the model is.
>>105759923>>105759870i like her eyes\outfit\face-shape
>>105759774>>105759785wan can also be disappointing though
i have had several renders that dont have proper 'bounce'
>>105759972imagine the smell
>>105759524catbox / prompt?
>>105759988im sure it would be great actually
i wish i could afford a property with a growshed like that
the green frog is standing at the desk. change the text to "you talk to a fren". keep the frog's expression the same.
>>105759904
>lmao. Every single SOTA model is using some form of synthetic data,
yeah we can immediately tell, that doesn't prove your point. Flux skin is a direct consequence of this. Everything that makes newer models worse is a direct consequence of this.
>>105759979no, it's not just better inpainting. It actually preserves elements of the original image, whereas inpainting will always look to add or change detail.
So better inpainting, yeah i guess. But way better.
>>105758309Man, it's really overtrained with that "style", huh? I wonder if it's on purpose or they fucked up somewhere.
>>105760021kek
https://www.youtube.com/watch?v=egKF1UvMcZA
>>105759524
>please stop fighting
https://www.youtube.com/watch?v=fF3b3kNVaBI
>>105759972this. but it's a weed farm/lab.
>>105758309AI: Is this not the woman you asked for? I do not understand the problem.
what is the default nag scale for the 2 image workflow? 5?
>>105759972box? your version is way better than what I got.
>>105760150
>ask Ai to generate a beautiful\artful\soulful woman
>it generates your ex-roommate as a demon possessed wide-eyed spook
oof.
Can you guys recommend a good video loop workflow please? I've got a 5090 since I'm addicted to AI stuff in all forms. The stuff I've tried either lacks good options or it gives me errors. I'm (currently) too ignorant of how to make one myself.
I like this one, https://civitai.com/models/1681541?modelVersionId=1903407 , because it's fast and I can go up to like 10 seconds, but it does this weird flashing over-exposure thing at the end which you can see in the attached clip.
This one, https://civitai.com/models/1720535/wan-21-image-to-video-loop-or-workflow?modelVersionId=1948904 , works quite well in regards to the flash thing being less obvious since it has an option to cut out frames at beginning or end, but that seems like a less than ideal solution. Is the flashing thing maybe just something people deal with with looping?
>>105760211
let me guess, this is using the lightx2v lora. slow motion, check. barely any movement, check.
>>105760239have you tried adding flash to the negative prompt? Maybe it's trying to emulate flash photography happening in the background.
Idk, haven't really tried making perfect loops myself.
>>105759961Works like any other Illustrious model so far
>>105760207>896x1152found your issue
>>105760239install linux rich boy
there we go, "keep expression" seems to work, got some pepe/other frog variations.
the green cartoon frog is standing beside the man on the right. change the text "NSF terrorist leader" to "Pepe". keep the expression the same for the green cartoon frog.
>>105760285Can you convert him to the old unreal engine visual too?
>implying chroma cant do anime
>>105760281
>>896x1152
>found your issue
are you genning at 2159x1233? vramchad i kneel
>>105760285>>105760300would be cool if peppo would fit to the style of the image too
>>105760306might work with a pixel art lora or a pixel prompt
>>105760308bwo u should gen at like 1200/800 not 800/1200
the green cartoon frog is standing near the man on the left. change the text "NSF terrorist leader" to "Pepe". keep the expression the same for the green cartoon frog. the green cartoon frog is in pixel art form.
copied the reference well (pepe I generated based on an image, walking on a beach)
>>105760265 i told her to 'hold pose' which causes fuckups sometimes sadly; &
i feel like if you put "slow motion" in your negative prompt it will randomly do things that you put in the neg field if cfg is too low kek
>the lora
no i started with a screengrab from the anon that was pregnant posting rocket girls a few threads ago hahaha
>>105760324 IE it's just the aspect ratio? For this comparison though, that API model was good at handling the tall aspect ratio.
>>105760354 You can put animated gifs into Kontext, or did you overlay that separately?
>>105760351 chroma refused to render english text and preferred chinese letters if I gave the style prompt too much eastern influence lmao
the green cartoon frog is standing near the girl on the right, holding a magic staff.
>>105760351 it is just the aspect ratio
>>105760366 use flux kontext to 'ix it
>>105760354 the pepe was overlaid on the initial video. The original image was of jet set radio gum which was then converted from img to vid ai.
>>105760272 Interesting idea. Sadly it didn't work. It seems more like a weird denoising thing done abruptly instead of slowly.
>>105760281 But I like video games, I don't want to install linux.
We must stop the illuminati from close sourcing ai tools
>>105760387 if you're not a zoomer that plays multiplayer games with invasive kernel anticheat then all games you wanna play run on linux
>>105760387 ignore all arch-posters
the only place that shit belongs is in a VM ;3
>>105760300 >>105760285 taxation is theft
nafta was a disaster
fiscal irresponsibility @ the 'federal' reserve
is the root of financial disparity\ evaporation of the USD
>if you print indefinitely w\o consequence the currency is not sound and gradually collapses audit\end the fed.
>>105760405 AI is so fun, you can make basically anything with a gen + wan + kontext for edits
what do you generate aside from 1girl slop and porn?
i really enjoy gen'ing food stuff. mostly because it looks pleasing and because sometimes it makes really funny stuff.
yum burger juice.
>>105760427
>ai is fun
that is the entire point dont let the schizoids get you down bb <3333
food, no humans
do it with your fav model. you are allowed to specify one food group/type.
post results
the green cartoon frog is on a large poster on the wall behind the man sitting at his desk. keep the expression and pose of the frog the same.
kek it outpainted a desk too cause the img was a diff size
>>105760440this anon is so fucking annoying i swear i wish i could punch you in the face
burger, night
>>105760440 i'm watching you
close your eyes really hard and you can see the noise seed of reality
the green cartoon frog is walking beside the tank in the middle. keep the expression and pose of the frog the same.
okay, now we have a good pepe:
the green cartoon frog is walking beside the tank in the middle. he is very small. keep the expression and pose of the frog the same.
the green cartoon frog is sitting on the tank in the middle. he is very small. keep the expression the same.
the green cartoon frog is standing beside on the tank in the middle. he is very small, and wearing a military outfit and helmet. keep the frog's expression the same.
>samethumbnail slop
first i2v and now kontext. thread quality is absolutely crippled
Did BFL strip out every style from the Dev model? The PRO and MAX versions of Kontext are completely different and have far better character recognition, positioning, detailed backgrounds and subject matter.
Anyone else noticed this?
Chroma is weird. You can change one character in a prompt and suddenly the same seed results in something that looks straight out of flux. But you can still tell both images are from the same seed by the composition. The flux slop is still alive underneath.
>>105760637 well duh, they want us to just get a slight taste of the real deal, once we get bored of kontext dev we'll go for the bigger model max/pro, that's their goal, to lure us to their API lol
Has anyone tried RouWei?? I had no idea it could do text like this
https://civitai.com/images/82884637
>>105760478 >>105760435 green is my pepper
So...
How many you guy made child porn?
sometimes Kontext produces noise?
>>105760644 It's pretty strange. Perhaps it's the case that the dataset has ai slop tagged as photos, I don't know.
>>105760655 sex with mita..
>>105760658 learn english
what the fuck? i swear i put text here=> kash patel ? <= i put text here
>>105760651
>we'll go for the bigger model max/pro
More like we'll wait for based chinese company to make an equivalent uncensored version that has apache 2.0 license for finetuning
Not sure if this is the right thread for this, but is there any model to fix/improve grainy or lower quality artbook scans?
https://openart.ai/workflows/amadeusxr/change-any-image-to-anything/5tUBzmIH69TT0oqzY751
another workflow to try, lots of options, 1 or multi image:
the frog is holding a picture of the pink anime dog.
>>105760778 You could just use Photoshop or any freeware image manipulation software and apply degrain/sharpen/blur. Wtf?
the frog is holding the anime dog with his hands.
seems to work well, fairly intuitive workflow
>>105760797 but, the NAG workflow has more utility, so use whatever suits your use case
We finna deport you weeb haters
>>105760778 I've used Topaz AI for that
>>105760839 hail president Miku
>>105760637 am I missing something? kontext dev looks the best here
>>105760870 It looks kinda decent because the subject matter is already an anime character. However, when you apply this style to a vast range of characters from realistic, cartoony, anthropomorphic and various others it completely folds. Dev is complete drek in comparison to the two others.
Don't believe me? Try using your prompt here
https://fal.ai/flux-kontext/chat and compare the results with local version
Ok, I guess 7TV isn't in the dataset lol
seeing as bfl is cuntpunting anyone trying to boobalora kontext, is there going to be some alt repo for good kontext loras
>>105760622 you do understand that gen beta will think pepe was in the square during that event right? kek
>>105760501
>tits are too small
>tits are too fat
then there is no pleasing him ;3
Here is another for comparison
>>105760870 >>105760637
Oh yeah, almost like the og.
>>105761061 localkeks in shambles
>>105761061 >>105760637 Can you apply weighting to text in these prompts? The reason I ask is because I don't think these AIs have ever been good at applying strong art styles without either weighting them or applying loras.
add the frog to the image of the anime girl standing to her right and change the style to pixar.
calarts, how scary.
>>105761061 I really don't get this marketing strategy, as if the target audience for Flux Max are people with 5090s but if you make Max local you get word of mouth from the 5090s.
>>105761106 from what ive seen people just make loras or try to use NAG to try to make it work. In both cases, its a hassle and will consume a lot of time doing trial and error.
>>105761102 I bet BFL has a division specifically dedicated to cuck the model before release as a ploy to get anons to switch to api.
>>105761061 Test mimicking a style from an image on the pro/max version. E.G. Draw character on left using style from right. Local sucks at this.
>>105761145 That looks like the Toy Story aliens lol.
>>105761165
>switch to the API
If you have a $1000 graphics card you're not using the API, period. It's one of the most retarded ideas. APIs only work on people who DO NOT have a graphics card which is the vast majority of people. This isn't rocket science either, you have a model gain lots of buzz either by a) having a free API anyone can use quite a lot or b) have people with graphics cards spam.
>>105760974 d-did the model communicate to you this way that it doesn't know what you want?
add the frog to the image of the anime girl standing to her right.
>>105761186 I prompted for 7TV emoticon :"WHAT"
>>105761201 mmm nyo.. i paid 600$ for my 300$ card (3060) and i can run kontext just fine
haha.. atleast i have 12gb vram.. if cards were cheaper i probably would've gotten a 3070 or 3080 with 8/10 gb instead :'(
>>105761201 You're still not the majority of people. "Just fine", yeah, 5 minutes per gen.
>>105761153 i mean people managed to downscale wan2.1 from like 30 to 40 gb to under 10 gb. If it was a truly open source model people will always find a way to use it and finetune it. I would be surprised if someone manages to use kontext max on a calculator in under a year if it was released now
>>105761207 no i can do a 20step 1024x1024 gen in under a minute..
The anime girl is holding a bag of popcorn.
>>105761218 You completely missed my point, people who can run Wan aren't the fat paypigs paying for video gen AIs. If you're someone running AI locally you're one of the last people to use an API service because why would you pay when you can do it for free. The people who pay for APIs are the people who don't know how or can't bother with a GPU. It's like fucking Linux users when your real target demographic is Apple users.
>>105761231 that is a basket of popcorn. wtf how could bfl do this?!?!
>>105761257 without a style prompt, just holding popcorn:
default fortnite Miku w/ popcorn:
cute! original image didnt show the hair clip on the right so that explains why it's still hidden.
>>105761275 ah. now it's there. the model knows Miku.
The anime girl is holding a sign that says "love Miku or else" in scribbled text.
>mikuposting 24\7, schizoid doesn't bat an eye
>ONE LETTER 'R' & HE L0SES HiS FUCKiN MIND
I set up the Gemini API in comfy, but the generated text disappears after I tab out to another workflow. Is it a security feature or is it just broken?
>>105761240i mean its the same with alot of things really. The target demographic is basically brainlets. The only issue here i see is that flux is primarily getting attention because of local fags whereas nai, midjourney and others were always closed sourced companies. Its comparable to Stability ai when it comes to their approach on getting customers. The only downside is the licensing which would make it practical impossible for anyone to use Dev model.
hmm, i think the real reason everyone keeps talking about capitalism whenever AI comes up is because GPUs are too expensive for most people. they keep talking about how you have to give money to corpos to use AI. They're so unrelatable, it's going to take 5 years before they're on our level.
>Miku eating something
>pepe riding skateboard
>Miku and pepe hold hands
>pepe sitting in a chair
>e-shill and pepe
>Miku but in a different outfit
>3d pepe
>manga Miku
>Miku wearing a funny hat
>pepe in video game
>3d pepe in video game
>pepe image, no kontext
*anon points out these suck*
>WAHHHH JUST LIKE HAVE FUUUUN CHILL OUT MAN
>pepe eating a cookie
>...
>>105761287complain about mikuposting in >>>/g/lmg/ , you're going to find a likeminded tranny over there
Training an Alien 9 model now, halfway through
>>105761308 dont forget the booba posting (why im here) kek
>>105761319 thats because nvidia basically has a monopoly on the entire industry. AMD is practically nowhere to be seen in this ai market. We are basically being chokehold because america is forcing the chinese to bow to the knee to use their hardware. That's why they're trying to find ways to implement 48gb for local gpu and no one here seems to care since they're using paid api services.
>>105761295 My point is Flux gains nothing gating these models because the people who they are gating aren't paying customers so the smart brain take is considering those people like beta testers and paid word of mouth advertising much like OpenAI gives basically everyone free compute for gens. It's really like those retarded game publishers that think video game pirates are actually lost sales.
the anime girl with teal hair is in a bedroom playing videogames on a CRT television and holding a gamepad. On the TV is the second image.
>but keyboard
only true boomers play with a keyboard.
>>105761321 tell me anon how that approach worked out for stability ai. i will wait here.
>>105761330 >>105761308 >>105761280 u really wanna rile him up? miku cosplays team rocket uniform+animate it
>>105761334 Stability AI made shitty models. What are you even talking about? Even free SD 3.5 Medium and Large are shit.
>>105761319
>We are basically being chokehold because america is forcing the chinese to bow to the knee to use their hardware.
That's an interesting angle I haven't heard before, that the nvidia monopoly could be intentional in order to beat china in the race.
>>105761336 I dont care about any resident schizos. im here to have fun with a new AI toy.
>>105759925 This. Kontext dev is not that good, and outside of memes what fun is modifying an image anyways? The original image is always better than the edited version. You will not use slop to modify the original and make it better or more aesthetic.
>>105761347 Because it's a glimpse of the future which is being able to fix images without resorting to hacks like inpainting.
>>105761346
>i dont care
oh YES you do boy
>>105761287 Maybe stop seething at everyone all the time and anons would like you more?
>>105761347 you can not duplicate a font without the exact .ttf like kontext can, or even copy styles or gradients.
>>105761354 And that's not even talking about "thinking" versions like LLMs have which means getting an output, having a vlm look for problems, and prompting with fixes.
>>105761354 so instead of choosing exactly how and where an edit goes use another model gacha and do the exact same thing with more typing?
>>105761361 the rocket poster is the least of our problems\concerns
>>105761364 if you tried to do this with inpainting, high denoise would fuck up the elements below the text. it doesnt do that with kontext. for example:
see the gradient + font? see the camels unaffected? inpainting cant do that (yet). this model treats it as a separate layer. and it can dupe elements or copy the style.
>>105761372 showing them wanvideo was probably a mistake
>>105761370 Anon I hope you're not the micromanaging retard that puts his greasy finger on the screen when talking to an artist to fix something.
>>105761342tell me anon, why every country is now having pride parade and pride month all in the same time frame. Why practically everything is going globohomo do you honestly think america doesnt have chokehold on what they want to accomplish. Why is it that midjourney gets sued fast and mickey mouse veo and openai are protected. Its because they're in on the globohomo agenda. Deepseek got shit right away and other countries we saying to stay away from china programs like tiktok and the rest for spying.
i grow tired of migu and frong
>>105761387 I hope you aren't that retard that wants to type everything out. I want the software to be fun, not an exercise in writing novels to feed to an llm that feeds into another llm and it gets cucked somewhere down the pipe so you have to fix it constantly by writing more novels
>>105761377 another example: remove all the characters.
the elements below the characters remain. high denoise would generally fuck up the textures or layers. this is different.
>>105761342 You think its coincidence
>>105761425 Like you, everyone has a finger ready to point.
>>105761339 Flux will meet the same fate if they dont cater to us localchads. Simple as. Dont defend their greedy hands: after us local kings helped promote them they give us leftover scraps and they fully kek'd to cuck us more
>>105761432 i will wait until you sip that last remaining kool aid in your drink before i tell you, dunce.
>>105761354
>fix images without resorting to hacks like inpainting
Hm, I wouldn't be so sure about that. To fix images, the model base has to be better than the model it is fixing. That's not the case at all. The model is no different from a Flux dev attempting to "fix" images. It wouldn't be able to identify all the errors in an AI generated image even if you point them out.
>>105761459
>hacks
you mean almost the literal entirety of \ldg\??????
>>105761364 This is neat, and there's a use case like for story-driven AI generation, or perhaps genning game assets (similarly to what was advertised for 4o but was nonetheless cucked with a piss filter). Also a breakthru in advertising, marketing etc... But I'm not referring to those aspects in particular. The model is still too cucked for my liking. It is far behind its API offering, and editing humans always yields weird proportions and plastic. It is also limited in function. The Chink model (omni gen) shows more promise in raw style copy, because they bothered to cuck that too.
>>105760700 anyone? Like way too much noise left in the image.
Anyway, I loaded the basic wf and fixed it back up, it's working fine...
....I must have changed something, any idea what in the Kontext wf would do that?
>>105761425 chinks will end up making their own hardware which will over time mog nvidia's hardware.
in the long run they'll regret creating a new competitor by being too greedy.
Does Kontext do anything else better than Flux apart from the image modification?
any way to speed up chroma? It takes like 3 minutes for 30 steps on my 3060