r/StableDiffusion

Alibaba-DAMO-Academy - LumosX

# LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation

(https://github.com/alibaba-damo-academy/Lumos-Custom/tree/main/LumosX#lumosx-relate-any-identities-with-their-attributes-for-personalized-video-generation)"Recent advances in diffusion models have significantly improved text-to-video generation, enabling personalized content creation with fine-grained control over both foreground and background elements. However, precise face-attribute alignment across subjects remains challenging, as existing methods lack explicit mechanisms to ensure intra-group consistency. We propose LumosX, a framework that advances both data and model design to achieve state-of-the-art performance in fine-grained, identity-consistent, and semantically aligned personalized multi-subject video generation."

This one is based on Wan2.1 and, from what I understand, seems focused on improving feature retention and consistency. Interesting yet another group under the Alibaba umbrella.

And there you were, thinking the flood of open-source models was over. It's never a goodbye. :)

https://github.com/alibaba-damo-academy/Lumos-Custom/tree/main/LumosX

https://huggingface.co/Alibaba-DAMO-Academy/LumosX

https://redd.it/1rzoqhw
@stablediffusion_r

GitHub

Lumos-Custom/LumosX at main · alibaba-damo-academy/Lumos-Custom

[ICLR-26, NeurIPS-25] Lumos-Custom Project: research for customized video generation in the Lumos Project. - alibaba-damo-academy/Lumos-Custom

1 view14:04

r/StableDiffusion

Flux2klein 9B Lora loader and updated Z-image turbo Lora loader with Auto Strength node!!

https://redd.it/1rztbjm
@stablediffusion_r

From the StableDiffusion community on Reddit: Flux2klein 9B Lora loader and updated Z-image turbo Lora loader with Auto Strength…

Explore this post and more from the StableDiffusion community

1 view15:31

r/StableDiffusion

1 view15:31

r/StableDiffusion

Flux 2 Klein 9b — 4 steps, ~3 seconds per style transfer.

https://redd.it/1rzs1vw
@stablediffusion_r

From the StableDiffusion community on Reddit: Flux 2 Klein 9b — 4 steps, ~3 seconds per style transfer.

Explore this post and more from the StableDiffusion community

1 view17:28

r/StableDiffusion

1 view17:28

r/StableDiffusion

0:07

This media is not supported in your browser

VIEW IN TELEGRAM

WAN2.2 FFLF 2 Video

https://redd.it/1rzy41y
@stablediffusion_r

1 view20:04

r/StableDiffusion

Qwen 2512 is very powerful. And with the nunchaku version, it's possible to generate an image in 20 to 50 seconds (5070 ti)

https://redd.it/1s00mbg
@stablediffusion_r

From the StableDiffusion community on Reddit: Qwen 2512 is very powerful. And with the nunchaku version, it's possible to generate…

Explore this post and more from the StableDiffusion community

1 view23:58

r/StableDiffusion

1 view23:58

About

Blog

Apps

Platform