AI with Papers - Artificial Intelligence & Deep Learning

🦗Character Mixing Generation🦗

👉MBZUAI unveils the first-ever video-gen system able to preserve character identity, behavior & original style while generating plausible interactions between characters that have never coexisted, from cartoons (We Bare Bears, Tom & Jerry) to realistic humans (Mr. Bean, Young Sheldon)

👉Review https://t.ly/tN84a
👉Paper https://lnkd.in/dhKMwukv
👉Project https://lnkd.in/dBkJs48h
👉Repo https://lnkd.in/dw_uzgAk

🤩4❤1👍1👏1

3.81K views 07:24

🧷Generative Point Tracking w/ FM🧷

👉Generative Point Tracker (GenPT) is a novel generative framework for modelling multi-modal point trajectories. Repo under MIT💙

👉Review https://t.ly/MMFrt
👉Paper https://arxiv.org/pdf/2510.20951
👉Project mtesfaldet.net/genpt_projpage/
👉Repo https://github.com/tesfaldet/genpt

🔥7❤2👍1

2.77K views 08:37

🦄Unified Region-Level MLLM🦄

👉PixelRefer is a unified multimodal LLM framework that supports precise, region-specific understanding in both static images and dynamic videos, overcoming the holistic, scene-level bias of prior MLLMs. SOTA results. Demo, Repo & Dataset available💙

👉Review https://t.ly/WH4dQ
👉Paper arxiv.org/pdf/2510.23603
👉Project circleradon.github.io/PixelRefer
👉Repo https://github.com/alibaba-damo-academy/PixelRefer

🔥3❤2🤯2👏1

3.16K views 15:03

🌱PlanarTrack: Large Planar Tracking🌱

👉PlanarTrack is a large-scale, high-quality and challenging benchmark for planar tracking: 1,150 sequences with 733K+ frames, including 1,000 short-term & 150 long-term videos. Repo & Dataset available💙

👉Review https://t.ly/mYNi7
👉Paper arxiv.org/pdf/2510.23368
👉Repo https://lnkd.in/edb3GMyT
👉Project https://lnkd.in/eC-hVB-U
👉Data https://lnkd.in/eew2j4tM

🔥9❤5👏2👍1

3.21K views 07:31

👢Generative View Stitching 👢

👉GVS is a novel approach that enables collision-free, camera-guided video generation for predefined trajectories. It is a non-autoregressive alternative to video length extrapolation. Full repo under MIT💙

👉Review https://t.ly/TiN_5
👉Paper https://arxiv.org/pdf/2510.24718
👉Project https://andrewsonga.github.io/gvs/
👉Repo github.com/andrewsonga/generative_view_stitching

🔥9❤2👍1

3.47K views 07:39

Greetings from the SMART CITY WORLD CONGRESS in Barcelona. If you are around, ping me ;)

🤣28🤩2❤1👍1

2.11K views 14:19

🔪Tracking Object Transformations🔪

👉"Track Any State": tracking objects through transformations while detecting/describing state changes. Repo & Dataset available under MIT💙

👉Review https://t.ly/NPyW4
👉Paper https://lnkd.in/d4pA3bXJ
👉Project https://lnkd.in/dgbNfCuj
👉Repo https://lnkd.in/dtVWq2z7

❤1

134 views 10:52
