Character Mixing Generation
MBZUAI unveils the first video-generation system able to preserve character identity, behavior, and original style while generating plausible interactions between characters that have never coexisted, from cartoons (We Bare Bears, Tom & Jerry) to realistic humans (Mr. Bean, Young Sheldon).
Review https://t.ly/tN84a
Paper https://lnkd.in/dhKMwukv
Project https://lnkd.in/dBkJs48h
Repo https://lnkd.in/dw_uzgAk
Generative Point Tracking with Flow Matching
Generative Point Tracker (GenPT) is a novel generative framework for modelling multi-modal point trajectories. Repo released under the MIT license.
Review https://t.ly/MMFrt
Paper https://arxiv.org/pdf/2510.20951
Project mtesfaldet.net/genpt_projpage/
Repo https://github.com/tesfaldet/genpt
Unified Region-Level MLLM
PixelRefer is a unified multimodal LLM framework that supports precise, region-specific understanding in both static images and dynamic videos, overcoming the holistic, scene-level bias of prior MLLMs. It reports state-of-the-art results. Demo, repo, and dataset available.
Review https://t.ly/WH4dQ
Paper arxiv.org/pdf/2510.23603
Project circleradon.github.io/PixelRefer
Repo https://github.com/alibaba-damo-academy/PixelRefer
PlanarTrack: Large Planar Tracking
PlanarTrack is a large-scale, high-quality, and challenging benchmark for planar tracking: 1,150 sequences with 733K+ frames, comprising 1,000 short-term and 150 long-term videos. Repo and dataset available.
Review https://t.ly/mYNi7
Paper arxiv.org/pdf/2510.23368
Repo https://lnkd.in/edb3GMyT
Project https://lnkd.in/eC-hVB-U
Data https://lnkd.in/eew2j4tM
Generative View Stitching
GVS is a novel approach that enables collision-free, camera-guided video generation for predefined trajectories; it is a non-autoregressive alternative to video length extrapolation. Full repo released under the MIT license.
Review https://t.ly/TiN_5
Paper https://arxiv.org/pdf/2510.24718
Project https://andrewsonga.github.io/gvs/
Repo github.com/andrewsonga/generative_view_stitching
Tracking Object Transformations
"Track Any State" tracks objects through transformations while detecting and describing their state changes. Repo and dataset available under the MIT license.
Review https://t.ly/NPyW4
Paper https://lnkd.in/d4pA3bXJ
Project https://lnkd.in/dgbNfCuj
Repo https://lnkd.in/dtVWq2z7