41
submitted 11 months ago by TehBamski@lemmy.world to c/asklemmy@lemmy.world
you are viewing a single comment's thread
view the rest of the comments
[-] FaceDeer@kbin.social 9 points 11 months ago

I've been using Stable Diffusion (via Automatic1111) for a long time, I've become fairly adept at it. Recently Bing's Dalle-3 has surpassed it in terms of composition and instruction-following, but I still find it really important for doing "finishing" work on Dalle-3's outputs so I don't expect to stop using it any time soon.

Lately I've been experimenting with Koboldcpp and locally-run large language models. I've been coming up with little ideas for scripts and programs that use its API to do stuff.

[-] surewhynotlem@lemmy.world 4 points 11 months ago

You can use stable diffusion to alter existing images? I somehow never realized that. What ui do you use?

[-] randomsnark@lemmy.ml 4 points 11 months ago

He mentioned he uses automatic1111

The stable diffusion mode for working with existing images is called img2img

[-] FaceDeer@kbin.social 3 points 11 months ago* (last edited 11 months ago)

Yup. It has a couple of different ways of doing img2img work. The most basic img2img just uses an existing image as a "starting point" and creates whole new images based on it. You can also do targeted "inpainting", which lets you paint a mask onto the image and then it only regenerates that bit, trying to keep it blended seamlessly into the unchanged parts of the image around it. And then there's ControlNet, which is an additional layer of processing that takes an input image and analyzes it, trying to create outputs that match what it "understands" to be there rather than just what the visual appearance of the source image is. So for example you could take a photo of someone in a particular pose and then generate new images of completely different characters who are also in that same pose.

Automatic1111 takes some technical fiddling to get set up, and you'll need to download models for it that match your needs (Civitai is a good source), but it's really neat how I can play around with stuff. A few days back I made this image of a naga for a D&D campaign by crudely splicing together photos of two different snakes, a woman's face, and some sheep horns in Gimp and then doing repeated passes through inpainting to clean everything up and get each bit exactly right. Took hours but this is the best example I've done yet of picturing something in my mind and then generating an image that matches it almost exactly. I'm rather proud of it.

[-] surewhynotlem@lemmy.world 1 points 11 months ago

Ahhh, thanks! I somehow missed that.

this post was submitted on 18 Nov 2023
41 points (87.3% liked)

Ask Lemmy

26700 readers
2078 users here now

A Fediverse community for open-ended, thought provoking questions

Please don't post about US Politics.


Rules: (interactive)


1) Be nice and; have funDoxxing, trolling, sealioning, racism, and toxicity are not welcomed in AskLemmy. Remember what your mother said: if you can't say something nice, don't say anything at all. In addition, the site-wide Lemmy.world terms of service also apply here. Please familiarize yourself with them


2) All posts must end with a '?'This is sort of like Jeopardy. Please phrase all post titles in the form of a proper question ending with ?


3) No spamPlease do not flood the community with nonsense. Actual suspected spammers will be banned on site. No astroturfing.


4) NSFW is okay, within reasonJust remember to tag posts with either a content warning or a [NSFW] tag. Overtly sexual posts are not allowed, please direct them to either !asklemmyafterdark@lemmy.world or !asklemmynsfw@lemmynsfw.com. NSFW comments should be restricted to posts tagged [NSFW].


5) This is not a support community.
It is not a place for 'how do I?', type questions. If you have any questions regarding the site itself or would like to report a community, please direct them to Lemmy.world Support or email info@lemmy.world. For other questions check our partnered communities list, or use the search function.


Reminder: The terms of service apply here too.

Partnered Communities:

Tech Support

No Stupid Questions

You Should Know

Reddit

Jokes

Ask Ouija


Logo design credit goes to: tubbadu


founded 1 year ago
MODERATORS