AI with Papers - Artificial Intelligence & Deep Learning

🪰NUWA-Infinity is out!🪰

👉∞ generation by #Microsoft: arbitrarily-sized HD images and long videos 🤯

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Unconditional Image Gen.
✅Text-to-Image/Text-to-Clip
✅Animation / Out-painting
✅Hi-res, arbitrarily long clips
✅NCP for patch caching

More: https://bit.ly/3zmBf9f

🔥7👍2❤1👏1🤯1

3.28K views, edited 12:55

AI with Papers - Artificial Intelligence & Deep Learning

🔥 #AIwithPapers: we are 3,500+! 🔥

💙💛 Ready for YOLO 10, 11, π, ∞, Ψ, and more? The more we are, the faster we catch'em all 💙💛

😈 Invite your friends -> https://t.iss.one/AI_DeepLearning

👍12❤10😁5🔥3

3.36K views, 13:37

AI with Papers - Artificial Intelligence & Deep Learning

🎷🎷OMNI3D: #3D Objects in the Wild🎷🎷

👉#3D detection: 234k images, 3M+ instances & 97 categories

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅OMNI3D built from publicly released datasets
✅234k pics, 3M+ instances annotated with 3D boxes
✅97 categories such as sofas, tables, cars
✅Fast (450x) and exact algorithm for IoU
✅Cube R-CNN: novel 3D object detector

More: https://bit.ly/3cznjzG

👍11

3.08K views, 07:47

AI with Papers - Artificial Intelligence & Deep Learning

👹Multiface Neural Rendering 👹

👉A new multi-view, hi-res dataset collected at #META Reality Labs for neural face rendering

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Mugsy, large scale multi-cam apparatus
✅High-Res sync facial performance
✅Closing the gap in accessing HQ data
✅Suitable for #VR & #mixedreality

More: https://bit.ly/3b6XfeL

🤯8👍3

2.94K views, edited 06:43

AI with Papers - Artificial Intelligence & Deep Learning

💄DEVIANT: SOTA in mono-3D detection💄

👉A novel Depth EquiVarIAnt NeTwork for 3D monocular detection in the wild

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Michigan + #Meta + Ford 🤯
✅Depth-equiv. + scale-equiv. steerable blocks
✅New SOTA on KITTI & Waymo
✅Good cross-dataset generalization

More: https://bit.ly/3OEFtgK

🔥16👍2❤1

2.92K views, 08:02

AI with Papers - Artificial Intelligence & Deep Learning

🧱 Assembling #LEGO with #AI 🧱

👉Translating step-by-step assembly manuals created by humans into machine-interpretable instructions

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Stanford + MIT + #Google 🤯
✅MEPNet: Manual-to-Executable-Plan Net
✅Manual to machine-executable plan
✅2D manual -> 3D geometric shape
✅Reasoning on 3D alignments of LEGO bricks

More: https://bit.ly/3PCwn5C

🔥9❤3

2.81K views, edited 07:34

AI with Papers - Artificial Intelligence & Deep Learning

🔥One Millisecond Backbone. Fire!🔥 👉MobileOne by #Apple: efficient mobile backbone with inference <1 ms on #iPhone12! 𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬: ✅75.9% top-1 accuracy on ImageNet ✅38× faster than MobileFormer net ✅Classification, detection & segmentation ✅Source code &…

🔥🔥 UPDATE 🔥🔥

Code Released: https://github.com/apple/ml-mobileone

❤3🥰1

2.71K views, 07:42

AI with Papers - Artificial Intelligence & Deep Learning

🎃New SOTA in UDA Semantic Seg.🎃

👉HRDA: multi-res Unsupervised Domain Adaptive Semantic Seg. -> SOTA

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ETH + MPG + KU Leuven 🤯
✅HRDA: multi-res approach for UDA
✅Manageable GPU memory footprint
✅Small objects & fine segmentation detail
✅New SOTA on GTA and Synthia datasets

More: https://bit.ly/3cKtDEp

🤯8👍1

2.77K views, 10:14

AI with Papers - Artificial Intelligence & Deep Learning

⚗️ SemAbs: 3D Scene Understanding ⚗️

👉Framework that equips 2D Vision-Language Models (VLMs) with new 3D spatial capabilities

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅2D VLMs with 3D reasoning skills
✅Efficient multi-scale relevancy extraction for ViTs
✅Novel Open-World understanding tasks
✅Completing partially observed objects
✅Finding hidden objects from language

More: https://bit.ly/3PYYk7d

🔥7❤1👍1

2.77K views, 06:48

AI with Papers - Artificial Intelligence & Deep Learning

🦚 TinyCD: Neural Change Detection 🦚

👉TinyCD: new SOTA in change detection with up to 150x fewer parameters.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅SOTA with up to 150X fewer params
✅Mixing blocks for spatio-temporal cross-correlation
✅PW-MLP for pixel-wise classification
✅MAMB: novel block for skip connections

More: https://bit.ly/3zFEngk

❤16👍2👏1

2.86K views, edited 12:45

AI with Papers - Artificial Intelligence & Deep Learning

🦊 3D-Aware "StyleGANv2" version 🦊

👉Upgrading StyleGANv2 into a novel 3D-aware GAN with just a minimal set of changes🤯

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅MPI-like 3D-aware GAN w/ single-view
✅GMPI: generative multiplane image
✅Making a 2D GAN 3D-aware with minimal changes
✅Encoding 3D-aware inductive biases

More: https://bit.ly/3OJ5gnS

🤯6👍4❤1

2.74K views, 06:41

AI with Papers - Artificial Intelligence & Deep Learning

📺 NeRF-ing "The Big Bang Theory" 📺

👉Berkeley unveils an approach for accurate estimation of actors' 3D pose & location

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Input: images across the whole season
✅3D context (i.e. cams, structure, body)
✅Integrating context in 3D estimation
✅Re-ID, gaze, cinematography, pic editing
✅Knock, Knock, Penny!

More: https://bit.ly/3OLuaUb

🔥7🤯5🥰2❤1

2.83K views, edited 17:39

AI with Papers - Artificial Intelligence & Deep Learning

🎩ShAPO: SOTA in object understanding🎩

👉Joint multi-object detection, #3D texture, 6D object pose & size estimation.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Disentangled shape & appearance
✅Efficient octree-based differentiable optimization
✅Object-centric understanding pipeline
✅Detection, reconstruction, 6D pose & size
✅SOTA in reconstruction & pose est.

More: https://bit.ly/3oHN5EQ

👍7🤯1

2.72K views, edited 07:53

AI with Papers - Artificial Intelligence & Deep Learning

🏙️ CityNeRF: Neural Rendering of City Scenes 🏙️

👉Progressive NeRF model and training set for city scenes

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅BungeeNeRF: novel progressive NeRF
✅Details on drastically varied scales
✅Growing with residual block structure
✅Inclusive multi-level data supervision

More: https://bit.ly/3cS9vk7

🥰7👍3🤯3😱1

2.73K views, 11:47

AI with Papers - Artificial Intelligence & Deep Learning

🍦🍦 Rewriting Geometry of GAN 🍦🍦

👉Driving a GAN to synthesize many unseen objects with the desired shape

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅User-friendly "warping" with geometry
✅Low-rank update to layer for editing
✅Latent augmentation based on style-mix
✅Endless objects with defined changes
✅Latent space interpolation, image editing

More: https://bit.ly/3zIfOj8

👍8😱7😁3👎2❤1🔥1

2.7K views, edited 08:29

AI with Papers - Artificial Intelligence & Deep Learning

🍏🍏 GAUDI: the Neural Architect 🍏🍏

👉Novel generative model for immersive 3D scenes from a moving camera

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Hundreds of thousands of pics/scenes
✅Novel denoising optimization objective
✅New SOTA across multiple datasets
✅Un/conditional on images/text

More: https://bit.ly/3Bt65ye

🔥6

2.74K views, edited 17:05

AI with Papers - Artificial Intelligence & Deep Learning

🚜NeDDF: the NeRF evolution!🚜

👉Novel 3D representation that reciprocally constrains distance & density fields

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅NeRF provides no distance
✅Extending for arbitrary density
✅Density via dist-field & gradient
✅Alleviating the instability

More: https://bit.ly/3Bte8LC

👍7

2.71K views, edited 10:32

AI with Papers - Artificial Intelligence & Deep Learning

🔥AND/OR: Composable Diffusion Models🔥

👉Novel neural compositional generation via Composable Diffusion Models

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅DM as energy-based models
✅Connecting diffusion models
✅Conjunction & negation, on top of DM
✅Zero-shot combinatorial generalization

More: https://bit.ly/3PYv1Cs

🤯5👍3❤2

2.71K views, edited 20:27

AI with Papers - Artificial Intelligence & Deep Learning

🔥 MobileNeRF is out -> Pure Fire! 🔥

👉MobileNeRF is out: the mobile evolution of NeRF via textured polygons.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Same quality, 10x faster than SNeRG
✅Less memory by storing surface textures
✅Integrated GPUs: less memory/power
✅Runs in the browser; the viewer is an HTML page

More: https://bit.ly/3PUKPWy

🔥25👍5

2.88K views, 05:57
