r/StableDiffusion
15 subscribers
29.4K photos
2.44K videos
1 file
13.9K links
Download Telegram
Alibaba-DAMO-Academy - LumosX

# LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation

(https://github.com/alibaba-damo-academy/Lumos-Custom/tree/main/LumosX#lumosx-relate-any-identities-with-their-attributes-for-personalized-video-generation)"Recent advances in diffusion models have significantly improved text-to-video generation, enabling personalized content creation with fine-grained control over both foreground and background elements. However, precise face-attribute alignment across subjects remains challenging, as existing methods lack explicit mechanisms to ensure intra-group consistency. We propose LumosX, a framework that advances both data and model design to achieve state-of-the-art performance in fine-grained, identity-consistent, and semantically aligned personalized multi-subject video generation."

This one is based on Wan2.1 and, from what I understand, seems focused on improving feature retention and consistency. Interesting yet another group under the Alibaba umbrella.

And there you were, thinking the flood of open-source models was over. It's never a goodbye. :)

https://github.com/alibaba-damo-academy/Lumos-Custom/tree/main/LumosX

https://huggingface.co/Alibaba-DAMO-Academy/LumosX

https://redd.it/1rzoqhw
@stablediffusion_r