[-] Even_Adder@lemmy.dbzer0.com 1 points 3 days ago

Art isn't work, it's speech. It's part of the human condition. Art is useless, said Wilde. Art is for art’s sake—that is, for beauty’s sake.

[-] Even_Adder@lemmy.dbzer0.com 1 points 4 days ago

I do not make art, I just post it here on lemmy. I'd be OK with that. People freely create, copy, and iterate on memes, and they are the greatest cultural touchstones we have. First and foremost, people create because they have something to say.

[-] Even_Adder@lemmy.dbzer0.com 0 points 4 days ago

People already make memes and mods for free. Humans are a social species and will continue to create and share things until the end of time. Making money off of creation is a privilege for only a tiny few.

[-] Even_Adder@lemmy.dbzer0.com 9 points 4 days ago

That whole page is full of wild shit.

[-] Even_Adder@lemmy.dbzer0.com 4 points 4 days ago

Have you tried Flux?

ComfyUI v0.2.0 Release (blog.comfy.org)
[-] Even_Adder@lemmy.dbzer0.com 7 points 5 days ago

I can't tell if this is a joke or not.

[-] Even_Adder@lemmy.dbzer0.com 7 points 5 days ago

Wow, he sent Ramiel.

[-] Even_Adder@lemmy.dbzer0.com 36 points 6 days ago

It keeps making me look for a hidden message.

[-] Even_Adder@lemmy.dbzer0.com 14 points 1 week ago

The humor is just from someone shitposting Lorem ipsum at absurd length.

[-] Even_Adder@lemmy.dbzer0.com 19 points 1 week ago

It's Lorem Ipsum.


Changelog

Highlights for 2024-08-31

Summer break is over and we are back with a massive update!

Support for all of the new models (details below)

What else? Just a bit... ;)

New fast-install mode, new Optimum Quanto and BitsAndBytes based quantization modes, new balanced offload mode that dynamically offloads GPU<->CPU as needed, and more...
And from previous service-pack: new ControlNet-Union all-in-one model, support for DoRA networks, additional VLM models, new AuraSR upscaler

Breaking Changes...

Due to internal changes, you'll need to reset your attention and offload settings!
But for a good reason: the new balanced offload is magic when it comes to memory utilization while sacrificing minimal performance!

Details for 2024-08-31

New Models...

To use any of the new models, simply select the model from Networks -> Reference and it will be auto-downloaded on first use

  • Black Forest Labs FLUX.1
    FLUX.1 models are based on a hybrid architecture of multimodal and parallel diffusion transformer blocks, scaled to 12B parameters and building on flow matching
    This is a very large model at ~32GB in size; it's recommended to use a) offloading, b) quantization
    For more information on variations, requirements, options, and how to download and use FLUX.1, see Wiki
    SD.Next supports:
  • AuraFlow
    AuraFlow v0.3 is the largest fully open-sourced flow-based text-to-image generation model
    This is a very large model at 6.8B params and nearly 31GB in size, smaller variants are expected in the future
    Use scheduler: Default or Euler FlowMatch or Heun FlowMatch
  • AlphaVLLM Lumina-Next-SFT
    Lumina-Next-SFT is a Next-DiT model containing 2B parameters, enhanced through high-quality supervised fine-tuning (SFT)
    This model uses T5 XXL variation of text encoder (previous version of Lumina used Gemma 2B as text encoder)
    Use scheduler: Default or Euler FlowMatch or Heun FlowMatch
  • Kwai Kolors
    Kolors is a large-scale text-to-image generation model based on latent diffusion
    This is an SDXL-style model that replaces the standard CLIP-L and CLIP-G text encoders with a massive chatglm3-6b encoder supporting both English and Chinese prompting
  • HunyuanDiT 1.2
    Hunyuan-DiT is a powerful multi-resolution diffusion transformer (DiT) with fine-grained Chinese understanding
  • AnimateDiff
    support for additional models: SD 1.5 v3 (Sparse), SD Lightning (4-step), SDXL Beta
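Several of the models above come with large memory footprints (FLUX.1 at 12B parameters and ~32GB, AuraFlow at 6.8B), which is why the notes recommend offloading or quantization. A back-of-the-envelope estimate of weight memory by precision shows why; this is an illustrative sketch only, and real usage adds activations, text encoders, and the VAE on top:

```python
# Rough VRAM needed for model weights alone, by precision.
# Illustrative estimate; not SD.Next's actual accounting.

BYTES_PER_PARAM = {
    "fp32": 4.0,
    "fp16/bf16": 2.0,
    "int8": 1.0,   # 8-bit quantization (e.g. Quanto / BitsAndBytes)
    "int4": 0.5,   # 4-bit quantization
}

def weight_gb(params_billions: float, precision: str) -> float:
    """Gigabytes of memory occupied by the weights alone."""
    return params_billions * 1e9 * BYTES_PER_PARAM[precision] / 1024**3

# FLUX.1's 12B transformer is too big for most consumer GPUs in fp16,
# hence the recommendation to offload or quantize.
for prec in BYTES_PER_PARAM:
    print(f"12B @ {prec}: {weight_gb(12, prec):.1f} GB")
```

At fp16 the 12B transformer alone needs roughly 22 GB, so an 8-bit or 4-bit quantized variant is the practical option on a typical consumer card.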

New Features...

  • support for Balanced Offload, thanks @Disty0!
    balanced offload will dynamically split and offload models from the GPU based on the max configured GPU and CPU memory size
    model parts that don't fit in the GPU will be dynamically sliced and offloaded to the CPU
    see Settings -> Diffusers Settings -> Max GPU memory and Max CPU memory
    note: recommended value for max GPU memory is ~80% of your total GPU memory
    note: balanced offload will force loading LoRA with Diffusers method
    note: balanced offload is not compatible with Optimum Quanto
  • support for Optimum Quanto with 8 bit and 4 bit quantization options, thanks @Disty0 and @Trojaner!
    to use, go to Settings -> Compute Settings and enable "Quantize Model weights with Optimum Quanto" option
    note: Optimum Quanto requires PyTorch 2.4
  • new prompt attention mode: xhinker, which brings prompt attention support to new models such as FLUX.1 and SD3
    to use, enable in Settings -> Execution -> Prompt attention
  • use PEFT for LoRA handling on all models other than SD15/SD21/SDXL
    this improves LoRA compatibility for SC, SD3, AuraFlow, Flux, etc.
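The balanced offload idea described above can be sketched as a simple placement plan: put model parts on the GPU until the configured budget is used up, and leave the rest on the CPU. The names and structure here are hypothetical; SD.Next's real implementation slices and moves modules dynamically at runtime rather than planning once up front:

```python
# Toy sketch of balanced offload as a greedy placement plan.
# Hypothetical names; not SD.Next's actual code.

def plan_placement(part_sizes_gb, max_gpu_gb):
    """Greedily assign each model part to 'gpu' until the memory
    budget is exhausted, then to 'cpu'."""
    placement, used = [], 0.0
    for size in part_sizes_gb:
        if used + size <= max_gpu_gb:
            placement.append("gpu")
            used += size
        else:
            placement.append("cpu")
    return placement

# Example: a ~32 GB model split into four parts, with the budget set
# to ~80% of a 24 GB card, as the notes recommend.
parts = [8.0, 8.0, 8.0, 8.0]
print(plan_placement(parts, max_gpu_gb=24 * 0.8))
```

With a 19.2 GB budget, the first two parts land on the GPU and the remaining two are offloaded to the CPU, which matches the "minimal performance sacrifice" claim: most of the model still runs on-device.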

Changes & Fixes...

  • default resolution bumped from 512x512 to 1024x1024, time to move on ;)
  • convert Dynamic Attention SDP into a global SDP option, thanks @Disty0!
    note: requires reset of selected attention option
  • update default CUDA version from 12.1 to 12.4
  • update requirements
  • samplers now prefer the model defaults over the diffusers defaults, thanks @Disty0!
  • improve xyz grid LoRA handling and add a LoRA strength option
  • don't enable Dynamic Attention by default on platforms that support Flash Attention, thanks @Disty0!
  • convert offload options into a single choice list, thanks @Disty0!
    note: requires reset of selected offload option
  • control module allows resizing of individual process override images to match the input image
    for example: set size->before->method:nearest, mode:fixed or mode:fill
  • control tab includes a superset of txt2img and img2img scripts
  • automatically offload disabled controlnet units
  • prioritize specified backend if --use-* option is used, thanks @lshqqytiger
  • ipadapter option to auto-crop input images to faces to improve efficiency of face-transfer ipadapters
  • update IPEX to 2.1.40+xpu on Linux, thanks @Disty0!
  • general ROCm fixes, thanks @lshqqytiger!
  • support for HIP SDK 6.1 on ZLUDA backend, thanks @lshqqytiger!
  • fix full vae previews, thanks @Disty0!
  • fix default scheduler not being applied, thanks @Disty0!
  • fix Stable Cascade with custom schedulers, thanks @Disty0!
  • fix LoRA apply with force-diffusers
  • fix LoRA scales with force-diffusers
  • fix control API
  • fix VAE load referencing incorrect configuration
  • fix NVML gpu monitoring
[-] Even_Adder@lemmy.dbzer0.com 3 points 1 week ago

The way you described is already how Civitai works. Maybe it's to keep the moderation of the two sites cleanly separated. This way the team on green can do whatever they want, on green.

[-] Even_Adder@lemmy.dbzer0.com 4 points 1 week ago

A computer like that is useful outside of work. I'd pay for it out of pocket if I had to.


Abstract

We present GameNGen, the first game engine powered entirely by a neural model that enables real-time interaction with a complex environment over long trajectories at high quality. GameNGen can interactively simulate the classic game DOOM at over 20 frames per second on a single TPU. Next frame prediction achieves a PSNR of 29.4, comparable to lossy JPEG compression. Human raters are only slightly better than random chance at distinguishing short clips of the game from clips of the simulation. GameNGen is trained in two phases: (1) an RL-agent learns to play the game and the training sessions are recorded, and (2) a diffusion model is trained to produce the next frame, conditioned on the sequence of past frames and actions. Conditioning augmentations enable stable auto-regressive generation over long trajectories.

Paper: https://arxiv.org/abs/2408.14837

Project Page: https://gamengen.github.io/


Even_Adder

joined 1 year ago