AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
136 photos
250 videos
14 files
1.31K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
🦊 3D-Aware "StyleGANv2" version 🦊

πŸ‘‰Upgrading StyleGANv2 into a novel 3D-aware GAN with just a minimal set of changes🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…MPI-like 3D-aware GAN w/ single-view
βœ…GMPI: generative multiplane image
βœ…2D GAN 3D-aware with a minimal changes
βœ…Encoding 3D-aware inductive biases

More: https://bit.ly/3OJ5gnS
🀯6πŸ‘4❀1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ“Ί NeRF-ing "The Big Bang Theory" πŸ“Ί

πŸ‘‰Berkeley unveils an approach for accurate estimation of actor’s 3D pose & location

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Input: images across the whole season
βœ…3D context (i.e. cams, structure, body)
βœ…Integrating context in 3D estimation
βœ…Re-ID, gaze, cinematography, pic editing
βœ…Knock, Knock, Penny!

More: https://bit.ly/3OLuaUb
πŸ”₯7🀯5πŸ₯°2❀1
This media is not supported in your browser
VIEW IN TELEGRAM
🎩ShAPO: SOTA in object understanding🎩

πŸ‘‰Joint multi-object detection, #3D texture, 6D object pose & size estimation.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Disentangled shape & appearance
βœ…Efficient octree-based differentiable
βœ…Object-centric understanding pipeline
βœ…Detection, reconstruction , 6D & size
βœ…SOTA in reconstruction & pose est.

More: https://bit.ly/3oHN5EQ
πŸ‘7🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ™οΈ CityNeRF: Neural Rendering of City Scenes πŸ™οΈ

πŸ‘‰Progressive NeRF model and training set on city-scenes

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…BungeeNeRF: novel progressive NeRF
βœ…Details on drastically varied scales
βœ…Growing with residual block structure
βœ…Inclusive multi-level data supervision

More: https://bit.ly/3cS9vk7
πŸ₯°7πŸ‘3🀯3😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🍦🍦 Rewriting Geometry of GAN 🍦🍦

πŸ‘‰Drive GAN synthesizing many unseen objects with the desired shape

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…User-friendly "warping" with geometry
βœ…Low-rank update to layer for editing
βœ…Latent augmentation based on style-mix
βœ…Endless objects with defined changes
βœ…Latent space interpolation, image editing

More: https://bit.ly/3zIfOj8
πŸ‘8😱7😁3πŸ‘Ž2❀1πŸ”₯1
This media is not supported in your browser
VIEW IN TELEGRAM
🍏🍏 GAUDI: the Neural Architect 🍏🍏

πŸ‘‰Novel generative model for immersive 3D scenes from a moving camera

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Hundreds of thousands pics/scenes
βœ…Novel denoising optimization objective
βœ…New SOTA across multiple datasets
βœ…Un/conditional on images/text

More: https://bit.ly/3Bt65ye
πŸ”₯6
This media is not supported in your browser
VIEW IN TELEGRAM
🚜NeDDF: the NeRF evolution!🚜

πŸ‘‰Novel 3D representation that reciprocally constrains distance & density fields

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…NeRF provides no distance
βœ…Extending for arbitrary density
βœ…Density via dist-field & gradient
βœ…Alleviating the instability

More: https://bit.ly/3Bte8LC
πŸ‘7
Media is too big
VIEW IN TELEGRAM
πŸ”₯AND/OR: Composable Diffusion ModelsπŸ”₯

πŸ‘‰Novel neural compositional generation via Composable Diffusion Models

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…DM as energy-based models
βœ…Connecting diffusion models
βœ…Conjunction & negation, on top of DM
βœ…Zero-shot combinatorial generalization

More: https://bit.ly/3PYv1Cs
🀯5πŸ‘3❀2
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯ MobileNeRF is out -> Pure Fire! πŸ”₯

πŸ‘‰MobileNeRF is out: the mobile evolution of NeRF via textured polygons.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Same quality, 10x faster than SNeRG
βœ…Memory-- by storing surface textures
βœ…Integrated GPUs: less memory/power
βœ…Suitable for browser & viewer is HTML

More: https://bit.ly/3PUKPWy
πŸ”₯25πŸ‘5
This media is not supported in your browser
VIEW IN TELEGRAM
🧣NeRF for Outdoor Scene Relighting🧣

πŸ‘‰NeRF-OSR: the first neural radiance fields approach for outdoor scene relighting

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…NeRF-method for outdoor relighting
βœ…Simultaneous illumination/viewpoint
βœ…Control over shading, shadow, albedo
βœ…Self-Supervised training from outdoor
βœ…Dataset: 3240 viewpoints, 110+ times

More: https://bit.ly/3vBiH2G
πŸ”₯5πŸ‘3❀1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‘©β€πŸ¦° Real-Time Neural Hair πŸ‘©β€πŸ¦°

πŸ‘‰Accurate hair geometry & appearance from multi-pics

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Bonn, CMU and Reality Labs
βœ…Photorealistic Real-Time render
βœ…HQ strand geometry/appearance
βœ…Novel scalp texture description
βœ…Intuitive manipulation of 3D hair

More: https://bit.ly/3vBiH2G
❀8πŸ‘6
This media is not supported in your browser
VIEW IN TELEGRAM
πŸš€ #VR by NASA - 1985 πŸš€

πŸ‘‰Q: is #VR the technology that developed least in the last 40 years? πŸ€”

Let's talk: https://bit.ly/3JxDZ7i
🀯7🀩2πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯ MinVIS, a new SOTA is out πŸ”₯

πŸ‘‰#Nvidia miniVIS: no video-based architectures nor training procedures🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Video architecture/train not required
βœ…MinVIS outperforms the previous SOTA
βœ…Occluded VIS (OVIS): >10% improvement
βœ…1% of labeled frames >> fully-supervised

More: https://bit.ly/3pcYzk1
πŸ”₯12
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯πŸ”₯MultiNeRF: three NeRFs are out!πŸ”₯πŸ”₯

πŸ‘‰Google opens the code of three #cvpr2022 papers: Mip-NeRF 360, Ref-NeRF, RawNeRF

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Paper_1: Mip-NeRF 360
βœ…Paper_2: Ref-NeRF
βœ…Paper_3: NeRF in the Dark

More: https://bit.ly/3QjpRRc
πŸ‘13❀4🀯4
This media is not supported in your browser
VIEW IN TELEGRAM
β˜€οΈLocoProp: Neural Layers Compositionβ˜€οΈ

πŸ‘‰Google AI unveils LocoProp: novel neural paradigm for modular composition of layers.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Backprop++ via Local Loss Optimization
βœ…Layer-based w-reg, target output, loss
βœ…Multiple local update via first-order opt.
βœ…Superior performance and efficiency

More: https://bit.ly/3Q40YJn
πŸ”₯13
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯PCVOS: clip-wise mask VOSπŸ”₯

πŸ‘‰PCVOS: new semi-supervised video object segmentation method

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Reformulating semi-supervised VOS
βœ…Novel per-clip inference perspective
βœ…Clip-wise operation on intra-clip
βœ…PCVOS: model for per-clip inference
βœ…New SOTA on multiple benchmarks

More: https://bit.ly/3vJtmbz
πŸ‘10😁2❀1🀩1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‘ World-Object Detection via ViT πŸ‘

πŸ‘‰Google unveils OWL-ViT: open-vocabulary detector based on ViTs 🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…ViTs for Open-World Localization
βœ…Img-level to open-vocabulary detection
βœ…SOTA one-shot (img.cond.) detection

More: https://bit.ly/3Sy3jOj
🀯12πŸ‘3
This media is not supported in your browser
VIEW IN TELEGRAM
🎹🎹 Learning Piano in #AR 🎹🎹

πŸ‘‰PianoVision (on #META #Quest2) accelerates the piano learning via Passthrough #AR & hand tracking

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Sheet Insight to learn sight-read
βœ…MIDI keyboard connectivity
βœ…Air piano for no physical pianos
βœ…Multiplayer Music Instruction
βœ…PianoVision Music Hall in #VR

More: https://bit.ly/3zYvwGX
❀15🀯6πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🧊EPro-PnP: Persp-n-Points Detection🧊

πŸ‘‰EPro-PnP: probabilistic PnP layer for general e2e pose estimation

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Probabilistic PnP for general e2e pose
βœ…Top-tier in 6DoF by inserting into CDPN
βœ…Deformable accurate detection
βœ…2D-3D corresp. learned from scratch

More: https://bit.ly/3BNPXYr
πŸ‘11