AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
135 photos
247 videos
14 files
1.31K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‡ Graph Neural Network in TF πŸ‡

πŸ‘‰#Google TensorFlow-GNN: novel library to build Graph Neural Networks on TensorFlow. Source Code released under Apache 2.0 license πŸ’™

πŸ‘‰Review https://t.ly/TQfg-
πŸ‘‰Code github.com/tensorflow/gnn
πŸ‘‰Blog blog.research.google/2024/02/graph-neural-networks-in-tensorflow.html
❀17πŸ‘4πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
β˜€οΈ One2Avatar: Pic -> 3D Avatar β˜€οΈ

πŸ‘‰#Google presents a new approach to generate animatable photo-realistic avatars from only a few/one image. Impressive results.

πŸ‘‰Review https://t.ly/AS1oc
πŸ‘‰Paper arxiv.org/pdf/2402.11909.pdf
πŸ‘‰Project zhixuany.github.io/one2avatar_webpage/
πŸ‘12❀3🀩3πŸ”₯2
This media is not supported in your browser
VIEW IN TELEGRAM
πŸͺŸ BOG: Fine Geometric Views πŸͺŸ

πŸ‘‰ #Google (+TΓΌbingen) unveils Binary Opacity Grids, a novel method to reconstruct triangle meshes from multi-view images able to capture fine geometric detail such as leaves, branches & grass. New SOTA, real-time on Google Pixel 8 Pro (and similar).

πŸ‘‰Review https://t.ly/E6T0W
πŸ‘‰Paper https://lnkd.in/dQEq3zy6
πŸ‘‰Project https://lnkd.in/dYYCadx9
πŸ‘‰Demo https://lnkd.in/d92R6QME
πŸ”₯8🀯4πŸ‘3πŸ₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ’¦ ObjectDrop: automagical objects removal πŸ’¦

πŸ‘‰#Google unveils ObjectDrop, the new SOTA in photorealistic object removal and insertion. Focus on shadows and reflections, impressive!

πŸ‘‰Review https://t.ly/ZJ6NN
πŸ‘‰Paper https://arxiv.org/pdf/2403.18818.pdf
πŸ‘‰Project https://objectdrop.github.io/
πŸ‘14🀯8❀4πŸ”₯3🍾2
πŸ¦‘ Hyper-Detailed Image Descriptions πŸ¦‘

πŸ‘‰#Google unveils ImageInWords (IIW), a carefully designed HIL annotation framework for curating hyper-detailed image descriptions and a new dataset resulting from this process

πŸ‘‰Review https://t.ly/engkl
πŸ‘‰Paper arxiv.org/pdf/2405.02793
πŸ‘‰Repo github.com/google/imageinwords
πŸ‘‰Project google.github.io/imageinwords
πŸ‘‰Data huggingface.co/datasets/google/imageinwords
❀11πŸ”₯3πŸ‘2🀯2🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ€OmniGlue: Foundation MatcherπŸ€

πŸ‘‰#Google OmniGlue from #CVPR24: the first learnable image matcher powered by foundation models. Impressive OOD results!

πŸ‘‰Review https://t.ly/ezaIc
πŸ‘‰Paper https://arxiv.org/pdf/2405.12979
πŸ‘‰Project hwjiang1510.github.io/OmniGlue/
πŸ‘‰Code https://github.com/google-research/omniglue/
🀯10❀6πŸ‘2πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‘— SOTA Multi-Garment VTOn Editing πŸ‘—

πŸ‘‰#Google (+UWA) unveils M&M VTO, novel mix 'n' match virtual try-on that takes as input multiple garment images, text description for garment layout and an image of a person. It's the new SOTA both qualitatively and quantitatively. Impressive results!

πŸ‘‰Review https://t.ly/66mLN
πŸ‘‰Paper arxiv.org/pdf/2406.04542
πŸ‘‰Project https://mmvto.github.io
πŸ‘4❀3πŸ₯°3πŸ”₯1🀯1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯₯ OmniNOCS: largest 3D NOCS πŸ₯₯

πŸ‘‰OmniNOCS by #Google (+Georgia) is a unified NOCS (Normalized Object Coordinate Space) dataset that contains data across different domains with 90+ object classes. The largest NOCS dataset to date. Data & Code available under Apache 2.0πŸ’™

πŸ‘‰Review https://t.ly/xPgBn
πŸ‘‰Paper arxiv.org/pdf/2407.08711
πŸ‘‰Project https://omninocs.github.io/
πŸ‘‰Data github.com/google-deepmind/omninocs
πŸ”₯4❀3πŸ‘2πŸ‘1πŸ₯°1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸͺ„ Diffusion Models for Transparency πŸͺ„

πŸ‘‰MIT (+ #Google) unveils Alchemist, a novel method to control material attributes of objects like roughness, metallic, albedo & transparency in real images. Amazing work but code not announcedπŸ₯Ί

πŸ‘‰Review https://t.ly/U98_G
πŸ‘‰Paper arxiv.org/pdf/2312.02970
πŸ‘‰Project www.prafullsharma.net/alchemist/
πŸ”₯17πŸ‘4⚑1❀1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🐺 Diffusion Game Engine 🐺

πŸ‘‰#Google unveils GameNGen: the first game engine powered entirely by a neural #AI that enables real-time interaction with a complex environment over long trajectories at HQ. No code announced but I love it πŸ’™

πŸ‘‰Review https://t.ly/_WR5z
πŸ‘‰Paper https://lnkd.in/dZqgiqb9
πŸ‘‰Project https://lnkd.in/dJUd2Fr6
πŸ”₯10πŸ‘5❀2πŸ‘1