AI with Papers - Artificial Intelligence & Deep Learning
15.5K subscribers
145 photos
255 videos
14 files
1.34K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅDepth Anything 3 is out๐Ÿ”ฅ

๐Ÿ‘‰ByteDance unveils Depth Anything 3 (DA3), a model that predicts spatially consistent geometry from arbitrary visual inputs, with or without known camera poses. Repo under Apache 2.0๐Ÿ’™

๐Ÿ‘‰Review https://t.ly/AOPu7
๐Ÿ‘‰Paper arxiv.org/pdf/2511.10647
๐Ÿ‘‰Project https://lnkd.in/dnByyn2z
๐Ÿ‘‰Repo https://lnkd.in/daCVz_4a
๐Ÿ‘‰Demo https://lnkd.in/dKUZiJt
๐Ÿ”ฅ18โค9๐Ÿ‘1๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸŒฉ๏ธ It's "Time-to-Move" ๐ŸŒฉ๏ธ

๐Ÿ‘‰Technion + Nvidia Time-to-Move (TTM) is a training-free, plug-and-play framework for motion- and appearance-controlled video generation with I2V diffusion models (Wan 2.2, CogVideoX, & Stable VD). Impressive results!

๐Ÿ‘‰Review https://t.ly/0pwXm
๐Ÿ‘‰Paper https://lnkd.in/dxD3uHYb
๐Ÿ‘‰Project https://lnkd.in/dcE5juyM
๐Ÿ‘‰Repo https://lnkd.in/dMMUjybJ
1๐Ÿ‘2๐Ÿ”ฅ2โค1
This media is not supported in your browser
VIEW IN TELEGRAM
โŒš Multi-Shot Video Segmentation โŒš

๐Ÿ‘‰Fudan focuses on an underexplored task of multi-shot video object segmentation (MVOS). Benchmark and repo available (the extension part of SAM) under Apache 2.0๐Ÿ’™

๐Ÿ‘‰Review https://t.ly/WBW00
๐Ÿ‘‰Paper https://arxiv.org/pdf/2511.13715
๐Ÿ‘‰Project https://henghuiding.com/SAAS/
๐Ÿ‘‰Repo https://github.com/FudanCVL/SAAS
1๐Ÿ”ฅ6โค2
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅ SAM 3/3D are OUT!! ๐Ÿ”ฅ

๐Ÿ‘‰#META released SAM 3, a unified model for detection, segmentation, tracking of objects in images & video using text, exemplar & visual prompts. Repo/Models under proprietary license๐Ÿ’™

๐Ÿ‘‰Review https://t.ly/lnRZN
๐Ÿ‘‰Paper https://t.ly/5tq9N
๐Ÿ‘‰Project https://ai.meta.com/sam3/
๐Ÿ‘‰Demo: https://segment-anything.com
๐Ÿ‘‰Repo https://github.com/facebookresearch/sam3
๐Ÿ”ฅ22โค4๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸฏUnwrapping of 3D Meshes๐Ÿฏ

๐Ÿ‘‰PartUV is a novel part-based UV unwrapping method for 3D meshes; it combines learned part priors with geometric cues to generate a compact set of part-aligned charts. Repo released๐Ÿ’™

๐Ÿ‘‰Review https://t.ly/8dNIY
๐Ÿ‘‰Paper arxiv.org/pdf/2511.16659
๐Ÿ‘‰Project www.zhaoningwang.com/PartUV/
๐Ÿ‘‰Repo github.com/EricWang12/PartUV
โค14๐Ÿ‘2๐Ÿ”ฅ1
๐Ÿ• Upsample Anything ๐Ÿ•

๐Ÿ‘‰Upsample Anything, a novel universal, training-free up-sampler via lightweight test-time optimization. No code but it's a relevant paper๐Ÿ’™

๐Ÿ‘‰Review https://t.ly/7LE6G
๐Ÿ‘‰Paper https://lnkd.in/dsUfdtih
๐Ÿ”ฅ7โค3๐Ÿ‘2๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸฆžSingle Synthetic Image per Class๐Ÿฆž

๐Ÿ‘‰MIT unveils Linear Gradient Matching (H/T Torralba), a novel method of distillation to use a single synthetic image per class for linear classifiers training (and more). Repo available๐Ÿ’™

๐Ÿ‘‰Review https://t.ly/dD3un
๐Ÿ‘‰Paper arxiv.org/pdf/2511.16674
๐Ÿ‘‰Project linear-gradient-matching.github.io/
๐Ÿ‘‰Repo github.com/GeorgeCazenavette/linear-gradient-matching
1โค6๐Ÿ”ฅ2๐Ÿ‘1๐Ÿ˜1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿงช EfficientSAM3 is out ๐Ÿงช

๐Ÿ‘‰Bristol announces EfficientSAM3, a family of efficient models built on Progressive Hierarchical Distillation that transfers capability from SAM3 to lightweight students. Code coming (in sync with SAM3 release)๐Ÿ’™

๐Ÿ‘‰Review https://t.ly/bfXP2
๐Ÿ‘‰Paper arxiv.org/pdf/2511.15833
๐Ÿ‘‰Project simonzeng7108.github.io/efficientsam3/
๐Ÿ‘‰Repo github.com/SimonZeng7108/efficientsam3
โค3๐Ÿ‘2๐Ÿ”ฅ1๐Ÿ‘1