AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
135 photos
247 videos
13 files
1.31K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
🥬 Diffusive Sketch-Guided Text-to-Image 🥬

👉#Google unveils a universal approach for guiding pre-trained T2I diffusion models: free-hand, saliency-guided, etc.

😎Review https://bit.ly/3XFVMj2
😎Project sketch-guided-diffusion.github.io/
😎Paper sketch-guided-diffusion.github.io/files/sketch-guided-preprint.pdf
🥫 Plug 'n' play self-checkout 🥫

👉#Google's new shelf-checking #AI: recognizing billions of products, even when purchased or moved

😎Review https://bit.ly/3J58hQe
😎News https://cloud.google.com/blog/transform/nrf-2023-google-cloud-big-show-big-moment-hybrid-retail
πŸ“DREAMIX:General Diffusive Video EditorπŸ“

πŸ‘‰#Google unveils the first diffusion-based method able to perform text-based motion/appearance editing of general videos

😎Review https://bit.ly/3I3Hq6B
😎Paper arxiv.org/pdf/2302.01329.pdf
😎Project dreamix-video-editing.github.io/
🥦 ReBotNet: Neural Enhancement 🥦

👉#Google unveils ReBotNet, a novel real-time video enhancement method for live video calls & streams

😎Review https://bit.ly/3z8oqhG
😎Paper arxiv.org/pdf/2303.13504.pdf
😎Project jeya-maria-jose.github.io/rebotnet-web
🥦 Zip-NeRF: the Anti-Aliasing NeRF 🥦

👉#Google unveils a novel version of NeRF that fixes the aliasing problem while training 22x faster than SOTA.

😎Review https://bit.ly/3L1hZ6M
😎Paper arxiv.org/pdf/2304.06706.pdf
😎Project https://jonbarron.info/zipnerf
🌈 Track Everything Everywhere 🌈

👉#Google unveils OmniMotion: full-length motion tracking for every pixel in every frame of a video.

😎Review https://t.ly/Krvw
😎Paper arxiv.org/pdf/2306.05422.pdf
😎Project omnimotion.github.io/
😎Demo omnimotion.github.io/#interactive_demo
😎Code github.com/qianqianwang68/omnimotion
📸 Computational Burst Photography in App 📸

👉#Google unveils a novel computational burst system to democratize professional photography on smartphones

😎Review https://t.ly/5ibJX
😎Paper arxiv.org/pdf/2308.01379.pdf
😎Project https://motion-mode.github.io
🔥Lumiere: SOTA video-gen🔥

👉#Google unveils Lumiere: a Space-Time Diffusion Model for Realistic Video Generation. It's the new SOTA across Text-to-Video, Video Stylization, Cinemagraphs & Video Inpainting.

👉Review https://t.ly/nalJR
👉Paper https://lnkd.in/d-PvrGjT
👉Project https://t.ly/gK8hz
🧠350+ Free #AI Courses by #Google🧠

πŸ‘‰350+ free courses from #Google to become professional in #AI & #Cloud. The full catalog (900+) includes a variety of activity: videos, documents, labs, coding, and quizzes. 15+ supported languages. No excuse.

βœ…π†πžπ§πžπ«πšπ­π’π―πž π€πˆ
βœ…πˆπ§π­π«π¨ 𝐭𝐨 π‹π‹πŒπ¬
βœ…π‚π• 𝐰𝐒𝐭𝐑 𝐓𝐅
βœ…πƒπšπ­πš, πŒπ‹, π€πˆ
βœ…π‘πžπ¬π©π¨π§π¬π’π›π₯𝐞 π€πˆ

πŸ‘‰Review: https://t.ly/517Dr
πŸ‘‰Full list: https://www.cloudskillsboost.google/catalog?page=1
❀13πŸ‘3πŸ‘2🍾2πŸ”₯1
This media is not supported in your browser
VIEW IN TELEGRAM
Graph Neural Network in TF

👉#Google TensorFlow-GNN: a novel library to build Graph Neural Networks on TensorFlow. Source code released under the Apache 2.0 license 💙

👉Review https://t.ly/TQfg-
👉Code github.com/tensorflow/gnn
👉Blog blog.research.google/2024/02/graph-neural-networks-in-tensorflow.html
❀17πŸ‘4πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
☀️ One2Avatar: Pic -> 3D Avatar ☀️

👉#Google presents a new approach to generate animatable photo-realistic avatars from just one or a few images. Impressive results.

👉Review https://t.ly/AS1oc
👉Paper arxiv.org/pdf/2402.11909.pdf
👉Project zhixuany.github.io/one2avatar_webpage/
🪟 BOG: Fine Geometric Views 🪟

👉 #Google (+Tübingen) unveils Binary Opacity Grids, a novel method to reconstruct triangle meshes from multi-view images, able to capture fine geometric detail such as leaves, branches & grass. New SOTA, real-time on Google Pixel 8 Pro (and similar).

👉Review https://t.ly/E6T0W
👉Paper https://lnkd.in/dQEq3zy6
👉Project https://lnkd.in/dYYCadx9
👉Demo https://lnkd.in/d92R6QME
💦 ObjectDrop: automagical object removal 💦

👉#Google unveils ObjectDrop, the new SOTA in photorealistic object removal and insertion. Focus on shadows and reflections, impressive!

👉Review https://t.ly/ZJ6NN
👉Paper https://arxiv.org/pdf/2403.18818.pdf
👉Project https://objectdrop.github.io/
🦑 Hyper-Detailed Image Descriptions 🦑

👉#Google unveils ImageInWords (IIW), a carefully designed human-in-the-loop (HIL) annotation framework for curating hyper-detailed image descriptions, and a new dataset resulting from this process

👉Review https://t.ly/engkl
👉Paper arxiv.org/pdf/2405.02793
👉Repo github.com/google/imageinwords
👉Project google.github.io/imageinwords
👉Data huggingface.co/datasets/google/imageinwords
❀11πŸ”₯3πŸ‘2🀯2🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
OmniGlue: Foundation Matcher

👉#Google OmniGlue from #CVPR24: the first learnable image matcher powered by foundation models. Impressive OOD results!

👉Review https://t.ly/ezaIc
👉Paper https://arxiv.org/pdf/2405.12979
👉Project hwjiang1510.github.io/OmniGlue/
👉Code https://github.com/google-research/omniglue/
👗 SOTA Multi-Garment VTOn Editing 👗

👉#Google (+UWA) unveils M&M VTO, a novel mix 'n' match virtual try-on method that takes as input multiple garment images, a text description of the garment layout, and an image of a person. It's the new SOTA both qualitatively and quantitatively. Impressive results!

👉Review https://t.ly/66mLN
👉Paper arxiv.org/pdf/2406.04542
👉Project https://mmvto.github.io
🥥 OmniNOCS: largest 3D NOCS 🥥

👉OmniNOCS by #Google (+Georgia) is a unified NOCS (Normalized Object Coordinate Space) dataset that contains data across different domains with 90+ object classes. The largest NOCS dataset to date. Data & Code available under Apache 2.0 💙

👉Review https://t.ly/xPgBn
👉Paper arxiv.org/pdf/2407.08711
👉Project https://omninocs.github.io/
👉Data github.com/google-deepmind/omninocs
🪄 Diffusion Models for Transparency 🪄

👉MIT (+ #Google) unveils Alchemist, a novel method to control material attributes of objects, such as roughness, metallic, albedo & transparency, in real images. Amazing work, but no code announced 🥺

👉Review https://t.ly/U98_G
👉Paper arxiv.org/pdf/2312.02970
👉Project www.prafullsharma.net/alchemist/
🐺 Diffusion Game Engine 🐺

πŸ‘‰#Google unveils GameNGen: the first game engine powered entirely by a neural #AI that enables real-time interaction with a complex environment over long trajectories at HQ. No code announced but I love it πŸ’™

πŸ‘‰Review https://t.ly/_WR5z
πŸ‘‰Paper https://lnkd.in/dZqgiqb9
πŸ‘‰Project https://lnkd.in/dJUd2Fr6