AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
135 photos
247 videos
14 files
1.31K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ›Έ Virtual Occlusions in #AR πŸ›Έ

πŸ‘‰Niantic (#pokemongo) on a novel approach for virtual assets to appear β€˜sitting among’ the real world objects

😎Review https://bit.ly/3o04wn6
😎Paper arxiv.org/pdf/2305.07014.pdf
😎Project nianticlabs.github.io/implicit-depth
😎Code github.com/nianticlabs/implicit-depth
πŸ”₯11🀯5πŸ‘3⚑1🀩1
This media is not supported in your browser
VIEW IN TELEGRAM
🍿 De-Aging Harrison Ford via SD 🍿

πŸ‘‰Stable Diffusion for Hollywood: preview of the next autotune of entertainment industry. A discussionπŸ‘‡

😎 More: https://bit.ly/41EzaQK
🀯19πŸ”₯9πŸ‘6πŸ’©3⚑1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸͺ° #3D Auto-Reconstruction πŸͺ°

πŸ‘‰AutoRecon: automated discovery & reconstruction of objects from multi-view pics.

😎Review https://bit.ly/3MxI0f4
😎Paper arxiv.org/pdf/2305.08810.pdf
😎Project zju3dv.github.io/autorecon/
😎Code github.com/zju3dv/AutoRecon
πŸ”₯11❀4🀯3πŸ₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‘š Multi-Layered 3D Garments Animation πŸ‘š

πŸ‘‰S-Lab unveils LayersNet: animating multi-layered garments driven by various external forces, such as human bodies & wind

😎Review https://bit.ly/435b42F
😎Paper arxiv.org/pdf/2305.10418.pdf
😎Project mmlab-ntu.github.io/project/layersnet
πŸ”₯6😱2❀1πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🎫 100% Mask-Free VIS 🎫

πŸ‘‰ETH Z unveils MaskFreeVIS: novel high-performing VIS without any mask annotations.

😎Review https://bit.ly/3Wg7CQB
😎Paper arxiv.org/pdf/2303.15904.pdf
😎Project www.vis.xyz/pub/maskfreevis/
😎Code github.com/SysCV/maskfreevis
πŸ”₯6πŸ‘4🀯2❀1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ€„ Drag-GAN: user-friendly image-manipulation πŸ€„

πŸ‘‰ Manual deforming of (real and generated) images over pose, shape, expression and layout.

😎Review https://bit.ly/3BFyXlR
😎Paper arxiv.org/pdf/2305.10973.pdf
😎Project vcai.mpi-inf.mpg.de/projects/DragGAN
😎Code github.com/XingangPan/DragGAN
πŸ”₯34🀯18❀6πŸ‘4😱1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ—ΊοΈ AI-generated stereotypical men πŸ—ΊοΈ

πŸ‘‰A thread about generating stereotypical person from 15 countries all around the world. And yes, Italian love Pizza.

😎 More https://bit.ly/3oo0t4c
🀣6❀3🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🍢 AVOS Multiscale Encoder-Decoder ViT 🍢

πŸ‘‰ MED-VT, world's first Multiscale Encoder Decoder Video Transformer for AVOS

😎Review https://bit.ly/3MohFi1
😎Paper arxiv.org/pdf/2304.05930.pdf
😎Project rkyuca.github.io/medvt
😎Code github.com/rkyuca/medvt
πŸ‘13πŸ₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
🌊 Neural Dynamic Image-Based Rendering 🌊

πŸ‘‰ DynIBaR: synthesizing novel views from monocular video depicting a complex dynamic scene.

😎Review https://t.ly/90Kw
😎Paper arxiv.org/pdf/2211.11082.pdf
😎Project https://dynibar.github.io/
😎Code github.com/google/dynibar
❀9πŸ‘3πŸ₯°1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🦁 Open Semantic Segmentation 🦁

πŸ‘‰SSSegmentation: open source supervised semantic segmentation toolbox based on #PyTorch

😎Review https://t.ly/ZE9q
😎Paper arxiv.org/pdf/2305.17091.pdf
😎Code github.com/SegmentationBLWX/sssegmentation
πŸ”₯10❀4⚑1πŸ‘1🀯1🀩1🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸŽ—οΈ 4D Humans with Transformers πŸŽ—οΈ

πŸ‘‰Novel approach to reconstruct and track humans (even in unusual poses)

😎Review https://t.ly/XGv_
😎Paper arxiv.org/pdf/2305.20091.pdf
😎Project shubham-goel.github.io/4dhumans/#
😎Code github.com/shubham-goel/4D-Humans
🀯10πŸ‘7πŸ”₯5❀2⚑1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ—½ Neuralangelo Digital Twins. INSANEπŸ—½

πŸ‘‰ A novel framework from #Nvidia for Hi-Fi 3D Digital twins.

😎Review https://t.ly/rxoF4
😎Project research.nvidia.com/labs/dir/neuralangelo
😎Paper research.nvidia.com/labs/dir/neuralangelo/paper.pdf
πŸ”₯15πŸ‘4🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🦜 ColorDiffuser: Text-to-Video Colorization 🦜

πŸ‘‰HK University unveils ColorDiffuser: adapting pre-trained text-to-image latent diffusion model for video colorization

😎Review https://t.ly/XGv_
😎Paper arxiv.org/pdf/2306.01732.pdf
😎Project colordiffuser.github.io/
😎Code github.com/ColorDiffuser/ColorDiffuser
🀯8❀2🀩1
This media is not supported in your browser
VIEW IN TELEGRAM
🌻 Extending Mona Lisa with AI 🌻

πŸ‘‰ A guy on Reddit extends Mona Lisa Painting with #Photoshop AI. The result is surprising.

😎More https://t.ly/j_2r
🀯20πŸ‘5🀩4πŸ”₯3😱2🀣2⚑1
This media is not supported in your browser
VIEW IN TELEGRAM
🏸 Segment Anything in HQ 🏸

πŸ‘‰HQ-SAM: SAM with the ability to accurately segment objects, maintaining promptable design, efficiency, zero-shot generalizability

😎Review https://t.ly/GxX5B
😎Paper arxiv.org/pdf/2306.01567.pdf
😎Models github.com/SysCV/SAM-HQ
πŸ”₯18πŸ‘4🀯1😱1😍1
This media is not supported in your browser
VIEW IN TELEGRAM
🌈 Track Everything Everywhere 🌈

πŸ‘‰#Google unveils OmniMotion: full-length motion tracking for every pixel in every frame of video.

😎Review https://t.ly/Krvw
😎Paper arxiv.org/pdf/2306.05422.pdf
😎Project omnimotion.github.io/
😎Demo omnimotion.github.io/#interactive_demo
😎Code github.com/qianqianwang68/omnimotion
πŸ”₯23❀5🀯3🀩1πŸ’©1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‘οΈ Scene Five: Through Her Eyes πŸ‘οΈ

πŸ‘‰ #3D scene reconstruction of what a person is observing using only the reflections of their eyes

😎Review https://t.ly/uBO6
😎Paper arxiv.org/pdf/2306.09348.pdf
😎Project https://world-from-eyes.github.io/
🀯28πŸ”₯12πŸ’©2🀩1
This media is not supported in your browser
VIEW IN TELEGRAM
🧿 NeRF-Supervised Deep Stereo 🧿

πŸ‘‰A novel pioneering pipeline for training deep stereo networks WITH NO ground-truth

😎Review https://t.ly/c7j-
😎Project nerfstereo.github.io/
😎Dataset https://amsacta.unibo.it/id/eprint/7218/
😎Code github.com/fabiotosi92/NeRF-Supervised-Deep-Stereo
😎Paper https://openaccess.thecvf.com/content/CVPR2023/papers/Tosi_NeRF-Supervised_Deep_Stereo_CVPR_2023_paper.pdf
πŸ₯°8🀩3❀1πŸ‘1πŸ’©1😍1
This media is not supported in your browser
VIEW IN TELEGRAM
🫣 Text-Guided Adversarial Makeup 🫣

πŸ‘‰Novel facial privacy protection via adversarial latent codes. Makeup vs Face Recognition.

😎Review https://t.ly/pBCP
😎Paper arxiv.org/pdf/2306.10008.pdf
😎Code github.com/fahadshamshad/Clip2Protect
❀6πŸ‘1πŸ”₯1πŸ₯°1πŸ’©1
Media is too big
VIEW IN TELEGRAM
🦷 Few-Shot Geometry-Aware Keypoints 🦷

πŸ‘‰UBC (+Flawless AI) unveils the new SOTA in semantic keypoints localization. Suitable for faces, animals, cars, mouth, teeth & more

😎Review https://t.ly/-0qN
😎Paper arxiv.org/pdf/2303.17216.pdf
😎Project xingzhehe.github.io/FewShot3DKP/
🀯10πŸ‘4❀2⚑2πŸ‘2🀩2πŸ”₯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸš” Fooling Neural Forensic Classifiers πŸš”

πŸ‘‰Adversarial faces able to fool the forensic classifiers, while remaining undetectable by humans

😎Review https://t.ly/33Cc
😎Paper arxiv.org/pdf/2306.13091.pdf
😎Project koushiksrivats.github.io/face_attribute_attack
😎Code github.com/koushiksrivats/face_attribute_attack
😒6❀4πŸ‘2😱2🍾2πŸ‘1🀯1😍1