AI with Papers - Artificial Intelligence & Deep Learning
15.4K subscribers
139 photos
253 videos
14 files
1.33K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯¬ "Perception Test" by #DeepMind πŸ₯¬

πŸ‘‰Huge dataset with obj & point tracks, temporal sounds, multiple & grounded vQA

😎Review https://bit.ly/3Vqh96Q
😎Dataset github.com/deepmind/perception_test
😎Project www.deepmind.com/blog/measuring-perception-in-ai-models
πŸ‘15πŸ”₯4😱3
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‡ Bootstrapping TAP πŸ‡

πŸ‘‰#Deepmind shows how large-scale, unlabeled, uncurated real-world data can improve TAP with minimal architectural changes, via a self-supervised student-teacher setup. Source Code released πŸ’™

πŸ‘‰Review https://t.ly/-S_ZL
πŸ‘‰Paper arxiv.org/pdf/2402.00847.pdf
πŸ‘‰Code https://github.com/google-deepmind/tapnet
πŸ”₯5πŸ‘3πŸ₯°1🀩1
This media is not supported in your browser
VIEW IN TELEGRAM
🍾TAPVid-3D: benchmark for TAP-3D🍾

πŸ‘‰#Deepmind (+College London & Oxford) introduces TAPVid-3D, a new benchmark for evaluating long-range Tracking Any Point in 3D: 4,000+ real-world videos, composed of three different data sources spanning a variety of object types, motion patterns, and indoor/outdoor environments. Data & Code available, Apache 2.0πŸ’™

πŸ‘‰Review https://t.ly/SsptD
πŸ‘‰Paper arxiv.org/pdf/2407.05921
πŸ‘‰Project tapvid3d.github.io/
πŸ‘‰Code github.com/google-deepmind/tapnet/tree/main/tapnet/tapvid3d
πŸ”₯3πŸ‘1🀯1