🐙Human-Centric Video Generation🐙
👉Tsinghua & #ByteDance unveil HuMo: a unified, human-centric video generation framework designed to produce high-quality, fine-grained, and controllable human videos from multimodal inputs, with text prompt following, consistent subject preservation, and synchronized audio-driven motion. Repo released under Apache 2.0💙
👉Review https://t.ly/3S8Yb
👉Paper https://arxiv.org/pdf/2509.08519
👉Project https://phantom-video.github.io/HuMo/
👉Repo https://github.com/Phantom-video/HuMo