Alibaba-DAMO-Academy - LumosX
# LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation
(https://github.com/alibaba-damo-academy/Lumos-Custom/tree/main/LumosX#lumosx-relate-any-identities-with-their-attributes-for-personalized-video-generation)"Recent advances in diffusion models have significantly improved text-to-video generation, enabling personalized content creation with fine-grained control over both foreground and background elements. However, precise face-attribute alignment across subjects remains challenging, as existing methods lack explicit mechanisms to ensure intra-group consistency. We propose LumosX, a framework that advances both data and model design to achieve state-of-the-art performance in fine-grained, identity-consistent, and semantically aligned personalized multi-subject video generation."
This one is based on Wan2.1 and, from what I understand, seems focused on improving feature retention and consistency. Interesting yet another group under the Alibaba umbrella.
And there you were, thinking the flood of open-source models was over. It's never a goodbye. :)
https://github.com/alibaba-damo-academy/Lumos-Custom/tree/main/LumosX
https://huggingface.co/Alibaba-DAMO-Academy/LumosX
https://redd.it/1rzoqhw
@stablediffusion_r
# LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation
(https://github.com/alibaba-damo-academy/Lumos-Custom/tree/main/LumosX#lumosx-relate-any-identities-with-their-attributes-for-personalized-video-generation)"Recent advances in diffusion models have significantly improved text-to-video generation, enabling personalized content creation with fine-grained control over both foreground and background elements. However, precise face-attribute alignment across subjects remains challenging, as existing methods lack explicit mechanisms to ensure intra-group consistency. We propose LumosX, a framework that advances both data and model design to achieve state-of-the-art performance in fine-grained, identity-consistent, and semantically aligned personalized multi-subject video generation."
This one is based on Wan2.1 and, from what I understand, seems focused on improving feature retention and consistency. Interesting yet another group under the Alibaba umbrella.
And there you were, thinking the flood of open-source models was over. It's never a goodbye. :)
https://github.com/alibaba-damo-academy/Lumos-Custom/tree/main/LumosX
https://huggingface.co/Alibaba-DAMO-Academy/LumosX
https://redd.it/1rzoqhw
@stablediffusion_r
GitHub
Lumos-Custom/LumosX at main · alibaba-damo-academy/Lumos-Custom
[ICLR-26, NeurIPS-25] Lumos-Custom Project: research for customized video generation in the Lumos Project. - alibaba-damo-academy/Lumos-Custom
Flux2klein 9B Lora loader and updated Z-image turbo Lora loader with Auto Strength node!!
https://redd.it/1rztbjm
@stablediffusion_r
https://redd.it/1rztbjm
@stablediffusion_r
Reddit
From the StableDiffusion community on Reddit: Flux2klein 9B Lora loader and updated Z-image turbo Lora loader with Auto Strength…
Explore this post and more from the StableDiffusion community
Qwen 2512 is very powerful. And with the nunchaku version, it's possible to generate an image in 20 to 50 seconds (5070 ti)
https://redd.it/1s00mbg
@stablediffusion_r
https://redd.it/1s00mbg
@stablediffusion_r
Reddit
From the StableDiffusion community on Reddit: Qwen 2512 is very powerful. And with the nunchaku version, it's possible to generate…
Explore this post and more from the StableDiffusion community