AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
135 photos
247 videos
14 files
1.31K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
🧊EPro-PnP: Persp-n-Points Detection🧊

πŸ‘‰EPro-PnP: probabilistic PnP layer for general e2e pose estimation

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Probabilistic PnP for general e2e pose
βœ…Top-tier in 6DoF by inserting into CDPN
βœ…Deformable accurate detection
βœ…2D-3D corresp. learned from scratch

More: https://bit.ly/3BNPXYr
πŸ‘11
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯‡#NVIDIA wins SIGGRAPH's Best PaperπŸ₯‡

πŸ‘‰Instant #NeRF awarded as a best paper at SIGGRAPH 2022!

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Speed-up of several orders of magnitude
βœ…HQ neural primitives in a matter of secs
βœ…Render in tens of milliseconds at 1080p
βœ…Source code and resources available!

More: https://bit.ly/3Qt8c9D
πŸ‘16πŸ”₯6❀3πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸͺ° EasyMocap: Open Neural Mocap πŸͺ°

πŸ‘‰EasyMocap: open-source marker-less mocap with novel view synthesis from RGB

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬 (of last paper added):
βœ…Editable free-viewpoint video
βœ…Layered neural representation of humans
βœ…Multi-pax -> instances, weakly-supervised
βœ…HQ neural representation of the humans
βœ…Addressing camera error by human poses

More: https://bit.ly/3p6lUDO
🀯6πŸ‘3πŸ‘3❀2
This media is not supported in your browser
VIEW IN TELEGRAM
🎰 Texturify: Neural Textures Generator 🎰

πŸ‘‰A step towards automated content creation. HQ textures directly on surface of 3D object

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…TUM + Max Planck + Apple 🍏
βœ…Realistic, HQ textures from 2D pics
βœ…3D shape geometry, no 3D supervision
βœ…3D-aware surface-based generation net

More: https://bit.ly/3BW7UUU
πŸ‘8
This media is not supported in your browser
VIEW IN TELEGRAM
🍨 Scaling Neural Indoor Scene 🍨

πŸ‘‰Neural scene rendering for indoor: scalable in both training/rendering

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Neural scene rendering for indoor
βœ…#3D into tiles with MLPs to scale up
βœ…Parallel training of tile-based MLPs
βœ…View-indep. components (via surf-MLP)

More: https://bit.ly/3bH94IX
πŸ”₯2πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯Stable Diffusion on clips. INSANEπŸ”₯

πŸ‘‰The most advanced latent text-to-image DM. #RunwayML just announced is going to apply it on clips

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Latent DM on 512p from LAION-5B
βœ…Frozen CLIP ViT-L/14 text encoder
βœ…Lightweight, runs on a 10GB-GPU
βœ…Checkpoints only for research

More: https://bit.ly/3QfkRx3
🀯13😱12πŸ‘2πŸ”₯1
This media is not supported in your browser
VIEW IN TELEGRAM
🐍 Implicitron: "democratizing" NeRF🐍

πŸ‘‰#META opens a novel framework for NeRF-world in #PyTorch3D #pytorch

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Implicit representations (NeRF) / Render
βœ…RaySampler/PointSampler & more
βœ…NeRF’s MLP, IDR’s FF, SRN, etc.
βœ…Renderers: MEAR, LSTMRenderer, etc.

More: https://bit.ly/3bPyJPJ
πŸ”₯4🀯2
This media is not supported in your browser
VIEW IN TELEGRAM
🧰 FGT: flow-guided inpainting 🧰

πŸ‘‰#Microsoft (+USTC) unveils FGT: flow-guided ViT for video inpainting 🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…OF into transformer for attention++
βœ…Flow completion net w/ local feats.
βœ…Dual perspective spatial MHSA
βœ…Local attention with global content

More: https://bit.ly/3pk5J5S
❀11πŸ‘5
This media is not supported in your browser
VIEW IN TELEGRAM
🍏NeuMan: Human NeRF in the wild🍏

πŸ‘‰#Apple opens a novel human pose/view from just a single in-the-wild video

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…No extra devices/annotations
βœ…Both Human (novel poses) + Scene
βœ…E2E SMPL optimization + error-corr.
βœ…Applications such as "telegathering"

More: https://bit.ly/3K4iTO6
πŸ‘15
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯‘ CLIP-based Neural Style Transfer πŸ₯‘

πŸ‘‰From #Nvidia a novel method for transferring the style to a #3D object

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Texture style for 3D by CLIP-ResNet50
βœ…Nearest-neighbor feature matching loss
βœ…CLIP-based loss extraction of textures
βœ…NNFM for multiple style pics / control
βœ…No source code or models available πŸ˜’

More: https://bit.ly/3c32dK5
🀯12πŸ”₯5❀4πŸ‘2😱2😁1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯ KeypointNeRF: code is out! πŸ”₯

πŸ‘‰KeypointNeRF by #Meta: "NeRF"-avatars

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Generalizable NeRF for virtual avatar
βœ…Sparse 3D keypoints for SOTA avatar
βœ…Novel unseen subjects from 2/3 views
βœ…"iPhone" captures for #metaverse

More: https://bit.ly/3pyl17e
πŸ”₯8πŸ‘3πŸ‘Ž1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯­Massive GTA-V human datasetπŸ₯­

πŸ‘‰GTA-Human: outperforming SOTA with a purely synthetic training.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…600+ gender, age, ethnicity & clothing
βœ…20,000+ clips, variety of human activities
βœ…6 categories of location, different BGs
βœ…Occlusions, lighting, and weather system

More: https://bit.ly/3wpZyRD
πŸ”₯14❀2πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🍈DeepBillboards: old-school trick for #VR🍈

πŸ‘‰DeepBillboards models a 3D object implicitly using neural net on the user’s viewing direction

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…#Google Brain +Tsukuba + Tokyo
βœ…Rendering at higher res., improving #VR
βœ…NeRF into interactive VR with accuracy++
βœ…NeRF (or any others) directly in #Unity

More: https://bit.ly/3CsTQ5y
πŸ‘6πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🌐RelPose: Probabilistic Relative Pose🌐

πŸ‘‰A novel method for core component in #SLAM / NeRF-powered apps.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Core component of SfM/SLAM
βœ…Pre-processing for neural (NeRF)
βœ…Energy-based over rotations
βœ…SOTA on both seen/unseen objects

More: https://bit.ly/3T60TXw
πŸ”₯12πŸ‘2πŸ‘2❀1
This media is not supported in your browser
VIEW IN TELEGRAM
🍈 #StableDiffusion archive is out🍈

πŸ‘‰Lexica art is a Stable Diffusion prompt search engine. Real-time, countless #stablediffusion results for everyone. I had fun with the GOAT, #Maradona.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Maradona scoring against a capybara...
βœ…A poster of space jam with Maradona...
βœ…Painting of Maradona very detailed...
βœ…Painting of Maradona in heaven...

More: https://bit.ly/3PTXHLH
❀9πŸ‘5
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ¦‰PANDORA: Polarized Neural DecompositionπŸ¦‰

πŸ‘‰CIL lab unveils PANDORA: polarimetric inverse rendering approach via INR

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Geometry, reflectance & illumination
βœ…normal, signed distance field, mesh
βœ…Diffuse-specular separation
βœ…Hi-fI incident illumination

More https://bit.ly/3CzGp3F
πŸ‘3πŸ”₯3
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯IDOL (#CVPR2022 winner): code is out!πŸ”₯

πŸ‘‰IDOL for VIS: outperforming all online/offline methods, the new SOTA!

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Online usually inferior by >10AP
βœ…Online based on contrast-learning
βœ…Discriminative++ instance embeddings
βœ…Full exploiting history for stability

More https://bit.ly/3dXCDXw
🀯16πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯ #AIwithPapers: we are 4,000+! πŸ”₯

πŸ’™πŸ’›Lot of people joined, and we talked about #StableDiffusion only twice! Can't believe it.πŸ’™πŸ’›

😈 Invite your friends -> https://t.iss.one/AI_DeepLearning
πŸ”₯10