AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
136 photos
250 videos
14 files
1.31K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯‘ CLIP-based Neural Style Transfer πŸ₯‘

πŸ‘‰From #Nvidia a novel method for transferring the style to a #3D object

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Texture style for 3D by CLIP-ResNet50
βœ…Nearest-neighbor feature matching loss
βœ…CLIP-based loss extraction of textures
βœ…NNFM for multiple style pics / control
βœ…No source code or models available πŸ˜’

More: https://bit.ly/3c32dK5
🀯12πŸ”₯5❀4πŸ‘2😱2😁1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯ KeypointNeRF: code is out! πŸ”₯

πŸ‘‰KeypointNeRF by #Meta: "NeRF"-avatars

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Generalizable NeRF for virtual avatar
βœ…Sparse 3D keypoints for SOTA avatar
βœ…Novel unseen subjects from 2/3 views
βœ…"iPhone" captures for #metaverse

More: https://bit.ly/3pyl17e
πŸ”₯8πŸ‘3πŸ‘Ž1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯­Massive GTA-V human datasetπŸ₯­

πŸ‘‰GTA-Human: outperforming SOTA with a purely synthetic training.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…600+ gender, age, ethnicity & clothing
βœ…20,000+ clips, variety of human activities
βœ…6 categories of location, different BGs
βœ…Occlusions, lighting, and weather system

More: https://bit.ly/3wpZyRD
πŸ”₯14❀2πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🍈DeepBillboards: old-school trick for #VR🍈

πŸ‘‰DeepBillboards models a 3D object implicitly using neural net on the user’s viewing direction

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…#Google Brain +Tsukuba + Tokyo
βœ…Rendering at higher res., improving #VR
βœ…NeRF into interactive VR with accuracy++
βœ…NeRF (or any others) directly in #Unity

More: https://bit.ly/3CsTQ5y
πŸ‘6πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🌐RelPose: Probabilistic Relative Pose🌐

πŸ‘‰A novel method for core component in #SLAM / NeRF-powered apps.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Core component of SfM/SLAM
βœ…Pre-processing for neural (NeRF)
βœ…Energy-based over rotations
βœ…SOTA on both seen/unseen objects

More: https://bit.ly/3T60TXw
πŸ”₯12πŸ‘2πŸ‘2❀1
This media is not supported in your browser
VIEW IN TELEGRAM
🍈 #StableDiffusion archive is out🍈

πŸ‘‰Lexica art is a Stable Diffusion prompt search engine. Real-time, countless #stablediffusion results for everyone. I had fun with the GOAT, #Maradona.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Maradona scoring against a capybara...
βœ…A poster of space jam with Maradona...
βœ…Painting of Maradona very detailed...
βœ…Painting of Maradona in heaven...

More: https://bit.ly/3PTXHLH
❀9πŸ‘5
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ¦‰PANDORA: Polarized Neural DecompositionπŸ¦‰

πŸ‘‰CIL lab unveils PANDORA: polarimetric inverse rendering approach via INR

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Geometry, reflectance & illumination
βœ…normal, signed distance field, mesh
βœ…Diffuse-specular separation
βœ…Hi-fI incident illumination

More https://bit.ly/3CzGp3F
πŸ‘3πŸ”₯3
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯IDOL (#CVPR2022 winner): code is out!πŸ”₯

πŸ‘‰IDOL for VIS: outperforming all online/offline methods, the new SOTA!

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Online usually inferior by >10AP
βœ…Online based on contrast-learning
βœ…Discriminative++ instance embeddings
βœ…Full exploiting history for stability

More https://bit.ly/3dXCDXw
🀯16πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯ #AIwithPapers: we are 4,000+! πŸ”₯

πŸ’™πŸ’›Lot of people joined, and we talked about #StableDiffusion only twice! Can't believe it.πŸ’™πŸ’›

😈 Invite your friends -> https://t.iss.one/AI_DeepLearning
πŸ”₯10
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”΅ Deep Saliency: driving the attention πŸ”΅

πŸ‘‰Google unveils a family of operators to "drive" human saliency

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Editing image to drive Saliency
βœ…Transforms to hide distractors
βœ…Warping operator for distractor
βœ…GAN-op for less-saliency altern.

More: https://bit.ly/3KoQQc2
πŸ‘9🀩4
This media is not supported in your browser
VIEW IN TELEGRAM
🎍#3D scene manipulation from 2D🎍

πŸ‘‰Reconstruct, decompose, manipulate & render 3D scenes in a single pipeline

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Unique 3D, non-occupied space from 2D
βœ…Inverse query algorithm for shapes
βœ…First synthetic dataset for 3D editing

More: https://bit.ly/3RlYhTY
πŸ”₯11❀1
This media is not supported in your browser
VIEW IN TELEGRAM
🍊StableFace: Talking Face Generation🍊

πŸ‘‰Analysis on motion jittering in 3D face generation (audio-in -> video-out)

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Motion jittering analysis for stability
βœ…Gaussian-based adaptive smoothing
βœ…Augmented erosions of neural renderer
βœ…Audio-fused generator for dependency

More: https://bit.ly/3Kt95gI
πŸ‘5😱3❀1
This media is not supported in your browser
VIEW IN TELEGRAM
🧑 Avatarization in 90's. So Romantic 🧑

πŸ‘‰Making of the first #MortalKombat in early 90's

More: https://bit.ly/3wTSpJB
❀13
This media is not supported in your browser
VIEW IN TELEGRAM
πŸš— Massive Dataset in Virtual Cities πŸš—

πŸ‘‰Synthehicle: 7 hours of labeled material, 340 cams, 64 days, rain, dawn, & night scenes.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Multi-target multi-cam tracking
βœ…2D, 3D, segm. & depth annotations
βœ…Instance, semantic & panoptic segm.
βœ…340 clips, 64 scenes, 17 hrs, 4M BBs

More: https://bit.ly/3TArHiV
❀10πŸ‘6
This media is not supported in your browser
VIEW IN TELEGRAM
πŸͺ¨Controllable #3D Adversarial FaceπŸͺ¨

πŸ‘‰#Meta (+CMU) on decoupling identity/expression + granular control over expressions

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Supervised auto-enc. + GAN
βœ…UV texture maps + 3D faces
βœ…Control expression, saving ID
βœ…Code under X11 License

More: https://bit.ly/3AVE80q
πŸ‘6
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯‘ DALLΒ·E: Outpainting via #NLP πŸ₯‘

πŸ‘‰Extending any original image, creating large-scale images in any aspect ratio

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Extending an image beyond its borders
βœ…Visual elements in same style of the input
βœ…Driving the image "story" in new directions
βœ…Shadows, reflections & textures w/ context

More: https://bit.ly/3eoH8uD
πŸ”₯20🀯7❀1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸŒͺ️ TimeLapse++: Video Temporal PyramidπŸŒͺ️

πŸ‘‰Multi-scale lens to view the passage of time: far beyond a "classic" timelapse

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Inspired by "old-school" spatial pyramids
βœ…Video Spectrogram to go through pyramid
βœ…Months/years of data in a few seconds!
βœ…Multi-temporal freq., no aliasing

More: https://bit.ly/3TKnYPS
🀯6πŸ‘2❀1
This media is not supported in your browser
VIEW IN TELEGRAM
🫐 Stable Diffusion Video is out! 🫐

πŸ‘‰A free notebook to generate videos by interpolating the latent space of SD.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Blueberry to strawberry spaghetti
βœ…Dream items from same prompt
βœ…Morph different prompts (seeds)
βœ…Built on a script by A. Karpathy

More: https://bit.ly/3ey8632
🀯15πŸ‘1