AI with Papers - Artificial Intelligence & Deep Learning
15.4K subscribers
143 photos
254 videos
14 files
1.33K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
🧷Generative Point Tracking w/ FM🧷

πŸ‘‰Generative Point Tracker (GenPT) is a novel generative framework for modelling multi-modal trajectories. Able to capture the multi-modality in point trajectories. Repo under MITπŸ’™

πŸ‘‰Review https://t.ly/MMFrt
πŸ‘‰Paper https://arxiv.org/pdf/2510.20951
πŸ‘‰Project mtesfaldet.net/genpt_projpage/
πŸ‘‰Repo https://github.com/tesfaldet/genpt
πŸ”₯7❀1πŸ‘1πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ¦„Unified Region-Level MLLMπŸ¦„

πŸ‘‰PixeRefers is an unified multimodal LLM framework that supports precise, region-specific understanding in both static images and dynamic videos, overcoming the holistic, scene-level bias of prior MLLMs. SOTA results. Demo, Repo & Dataset availableπŸ’™

πŸ‘‰Review https://t.ly/WH4dQ
πŸ‘‰Paper arxiv.org/pdf/2510.23603
πŸ‘‰Project circleradon.github.io/PixelRefer
πŸ‘‰Repo https://github.com/alibaba-damo-academy/PixelRefer
πŸ”₯3❀2🀯2πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🌱PlanarTrack: Large Planar Tracking🌱

πŸ‘‰PlanarTrack is a large-scale HQ and challenging benchmark for planar tracking: 1,150 sequences with 733K+ frames, including 1,000 short-term & 150 long-term videos. Repo & Dataset availableπŸ’™

πŸ‘‰Review https://t.ly/mYNi7
πŸ‘‰Paper arxiv.org/pdf/2510.23368
πŸ‘‰Repo https://lnkd.in/edb3GMyT
πŸ‘‰Project https://lnkd.in/eC-hVB-U
πŸ‘‰Data https://lnkd.in/eew2j4tM
πŸ”₯10❀5πŸ‘2πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‘’Generative View Stitching πŸ‘’

πŸ‘‰GVS is a novel approach that enables collision-free camera-guided video generation for predefined trajectories, it's a non-autoregressive alternative to video length extrapolation. Full repo under MITπŸ’™

πŸ‘‰Review https://t.ly/TiN_5
πŸ‘‰Paper https://arxiv.org/pdf/2510.24718
πŸ‘‰Project https://andrewsonga.github.io/gvs/
πŸ‘‰Repo github.com/andrewsonga/generative_view_stitching
πŸ”₯9❀3πŸ‘1
Greetings from the SMART CITY WORLD CONGRESS in Barcellona. If you are around, ping me ;)
🀣39❀3πŸ‘3🀩1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”ͺTracking Object TransformationsπŸ”ͺ

πŸ‘‰"Track Any State": tracking objects through transformations while detecting/describing state changes. Repo & Dataset available under MITπŸ’™

πŸ‘‰Review https://t.ly/NPyW4
πŸ‘‰Paper https://lnkd.in/d4pA3bXJ
πŸ‘‰Project https://lnkd.in/dgbNfCuj
πŸ‘‰Repo https://lnkd.in/dtVWq2z7
πŸ”₯18❀7🀯3πŸ‘2πŸ‘1
πŸ”₯πŸ”₯ Sunday mood πŸ”₯πŸ”₯
🀣31❀2
🎸Another BRIXEL in the Wall 🎸

πŸ‘‰BRIXEL allows the user to produce high-resolution feature maps using the DINOv3 backbone without requiring large amounts of compute. Repo releasedπŸ’™

πŸ‘‰Review https://t.ly/fZPwC
πŸ‘‰Paper arxiv.org/pdf/2511.05168
πŸ‘‰Repo github.com/alexanderlappe/BRIXEL
🀩7🀯3πŸ”₯2❀1πŸ‘1😍1
This media is not supported in your browser
VIEW IN TELEGRAM
🐼Pixel-Dense Embedding🐼

πŸ‘‰FlowFeat is a novel high-resolution and multi-task feature representation that embeds a distribution of plausible apparent motions, or motion profiles. Repo available under πŸ’™

πŸ‘‰Review https://t.ly/aUx_U
πŸ‘‰Paper arxiv.org/pdf/2511.07696
πŸ‘‰Project tum-vision.github.io/flowfeat
πŸ‘‰Repo github.com/tum-vision/flowfeat
πŸ”₯4πŸ‘3❀2
🍿🍿🍿
🀯17πŸ”₯8πŸ‘2❀1πŸ‘1
🚨 Announcement 🚨

I’ve received numerous reports of people blatantly copying my content on LinkedIn just to get a few likes.

Let me be very clear: I put a great deal of time and effort into reviewing papers and creating original, meaningful content. It’s disappointing to see professionals (some of whom are even members of this group or my connections) resorting to plagiarism instead of contributing their own ideas.

πŸ‘‰ Starting today, I’ll be removing these connections from LinkedIn and banning such individuals from this group.

πŸ“’ I also encourage everyone to report these cases whenever you come across them. Every single report helps stop this bad habit and keeps our community fair, respectful, and authentic.
❀48πŸ‘18πŸ‘14😒1
This media is not supported in your browser
VIEW IN TELEGRAM
🟩 Foundational Humanoid 🟩

πŸ‘‰#NVIDIA unveils SONIC a novel foundational model for high-precision teleoperation & interactive control capabilities (running, jumping, crawling) with natural human-like movements. Code announcedπŸ’™

πŸ‘‰Review https://t.ly/_3wnt
πŸ‘‰Paper https://lnkd.in/dctfShu8
πŸ‘‰Project https://lnkd.in/d_inmA2p
🀯7❀3πŸ”₯1