AI with Papers - Artificial Intelligence & Deep Learning
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
SmeLU: Smooth Activation Function

👉Google unveils a new smooth activation function: easy to implement, cheap & less error-prone

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Smooth to mitigate irreproducibility
✅Cheap function, better than GELU/Swish
✅0-1 slope through quadratic middle region
✅SmeLU as convolution of ReLU with box
✅Best reproducibility-accuracy tradeoff

More: https://bit.ly/3xcskXm
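
The quadratic middle region and the ReLU-with-box convolution from the highlights can be sketched in a few lines. This is a minimal sketch, assuming the piecewise form usually given for SmeLU with a half-width parameter `beta`; the function names and the numerical check are illustrative and not taken from the paper's code.

```python
def relu(x: float) -> float:
    """Plain ReLU, used only for the convolution check below."""
    return max(0.0, x)


def smelu(x: float, beta: float = 1.0) -> float:
    """Assumed SmeLU form: 0 on the left, identity on the right,
    and a quadratic bridge on [-beta, beta]."""
    if beta <= 0:
        raise ValueError("beta must be positive")
    if x <= -beta:
        return 0.0
    if x >= beta:
        return x
    return (x + beta) ** 2 / (4.0 * beta)


def smelu_slope(x: float, beta: float = 1.0) -> float:
    """Derivative of the form above: rises linearly from 0 to 1."""
    if x <= -beta:
        return 0.0
    if x >= beta:
        return 1.0
    return (x + beta) / (2.0 * beta)


def relu_box_convolution(x: float, beta: float = 1.0, steps: int = 10_000) -> float:
    """Midpoint-rule average of ReLU over the window [x - beta, x + beta],
    i.e. ReLU convolved with a unit-area box of width 2 * beta."""
    width = 2.0 * beta / steps
    total = 0.0
    for i in range(steps):
        u = x - beta + (i + 0.5) * width
        total += relu(u)
    return total * width / (2.0 * beta)
```

Under these assumptions, `relu_box_convolution(x)` and `smelu(x)` should agree up to numerical integration error, which is one way to read the "convolution of ReLU with box" highlight.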
Hyper-Dense Landmarks at 150FPS

👉#Microsoft unveils the SOTA in dense landmarking + #3D reconstruction. MAGIC.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Accurate, with 10× as many landmarks as usual
✅Synthetic data, perfect annotations
✅NO appearance, light, diff-rendering
✅#3D @150+FPS with a single CPU thread
✅SOTA in monocular 3D reconstruction

More: https://bit.ly/37pQS40
☀️SunStage: Selfie with the Sun☀️

👉Accurate/tailored reconstruction of facial geometry/reflectance

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Novel personalized scanning
✅Disentanglement of scene params
✅Geometry, materials, lighting, poses
✅Photorealistic with a single selfie video

More: https://bit.ly/36W1Oqx
📫 Generative Neural Avatars 📫

👉3D shapes of people in a variety of garments with corresponding skinning weights

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ETH + Uni-Tübingen + Max Planck
✅Animatable #3D human in garment
✅Directly from raw posed 3D scans
✅NO canonical, registration, manual w.
✅Geometric detail in clothing deformation


More: https://bit.ly/3M7mCdB
🗨️Conversational program synthesis🗨️

👉Conversational synthesis to translate English into executable code

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Conversational program synthesis
✅New multi-turn programming benchmark
✅Open custom library: JAXFORMER
✅Source code under BSD-3 license

More: https://bit.ly/3jjWWhk
🧯Long Video Diffusion Models🧯

πŸ‘‰#Google unveils a novel diffusion model for video generation

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Straightforward extension of 2D UNet
βœ…Longer by new conditional generation
βœ…SOTA in unconditional generation

More: https://bit.ly/35Y2rzg
🚙 AutoRF: #3D objects in-the-wild 🚙

👉From #Meta: #3D object from just a single, in-the-wild image

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Novel view synthesis from in-the-wild
✅Normalized, object-centric representation
✅Disentangling shape, appearance & pose
✅Exploiting BBS & panoptic segmentation
✅Shape/appearance properties for objects


More: https://bit.ly/3O4ONeQ
🌠GAN-based Darkest Dataset🌠

πŸ‘‰Berkeley + #Intel announce first photorealistic dataset under starlight (no moon, <0.001 lx)

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…"Darkest" dataset ever seen
βœ…Moonless, no external illumination
βœ…GAN-tuned physics-based model
βœ…Clips with dancing, volleyball, flags...

More: https://bit.ly/3LXxMkN
🤖Populating with digital humans🤖

👉ETHZ unveils GAMMA to populate the #3D scene with digital humans

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅GenerAtive Motion primitive MArkers
✅Realistic, controllable, infinite motions
✅Tree-based search to preserve quality
✅SOTA in realistic/controllable motion

More: https://bit.ly/3OgY4AG
🔥#AIwithPapers: we are ~2,000!🔥

💙💛 Simply amazing. Thank you all 💙💛

😈 Invite your friends -> https://t.iss.one/AI_DeepLearning
😼GARF: Gaussian Activated NeRF😼

πŸ‘‰GARF: Gaussian Activated R.F. for Hi-Fi reconstruction/pose

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…NeRF from imperfect camera poses
βœ…NO hyper-parameter tuning/initialization
βœ…Theoretical insight on Gaussian activation
βœ…Unlocking NeRF for real-world application?

More: https://bit.ly/36bvdfU
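
To make "Gaussian activation" concrete, here is a minimal sketch of a dense layer that uses a Gaussian bump in place of ReLU. The exact functional form, the `sigma` parameter, and the helper names are assumptions for illustration; the post does not spell them out, and this is not the GARF implementation.

```python
import math
from typing import List, Sequence


def gaussian_activation(x: float, sigma: float = 1.0) -> float:
    """Assumed form exp(-x^2 / (2 * sigma^2)): smooth, symmetric, in (0, 1]."""
    if sigma <= 0:
        raise ValueError("sigma must be positive")
    return math.exp(-(x * x) / (2.0 * sigma * sigma))


def dense_gaussian_layer(
    inputs: Sequence[float],
    weights: Sequence[Sequence[float]],
    biases: Sequence[float],
    sigma: float = 1.0,
) -> List[float]:
    """One fully connected layer followed by the Gaussian activation.
    `weights` holds one row of input weights per output unit."""
    outputs = []
    for row, bias in zip(weights, biases):
        pre_activation = sum(w * v for w, v in zip(row, inputs)) + bias
        outputs.append(gaussian_activation(pre_activation, sigma))
    return outputs
```

Unlike ReLU, this activation peaks at zero pre-activation and decays smoothly on both sides, so `sigma` controls how quickly a unit's response falls off.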
🎭Novel pre-training strategy for #AI🎭

πŸ‘‰EPFL unveils the Multi-modal Multi-task Masked Autoencoders (MultiMAE)

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Multimodal: additional modal. over RGB
βœ…Multi-task: multiple outputs over RGB
βœ…General: MultiMAE by pseudo-labeling
βœ…Classification, segmentation, depth
βœ…Code under NonCommercial 4.0 Int.

More: https://bit.ly/3jRhNsN
🧪 A new SOTA in Dataset Distillation 🧪

👉A new approach by Matching Training Trajectories is out!

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Distilling data "to match" a bigger dataset
✅Distilled data to guide a network
✅Trajectories of experts from real data
✅SOTA + distilling higher-res visual data

More: https://bit.ly/3JwYOxW
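
The idea of matching expert training trajectories can be illustrated with a toy loss. This is a sketch under stated assumptions: parameters are flat lists of floats, and the normalized squared-distance form below is one common way to write such a loss, not something the post itself specifies. All names are hypothetical.

```python
from typing import Sequence


def squared_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two flat parameter vectors."""
    if len(a) != len(b):
        raise ValueError("parameter vectors must have the same length")
    return sum((x - y) ** 2 for x, y in zip(a, b))


def trajectory_matching_loss(
    student_end: Sequence[float],
    expert_start: Sequence[float],
    expert_target: Sequence[float],
) -> float:
    """Distance from the student's parameters (after training on distilled
    data, starting at expert_start) to a later expert checkpoint, divided
    by how far the expert itself moved over that stretch."""
    expert_progress = squared_distance(expert_start, expert_target)
    if expert_progress == 0.0:
        raise ValueError("expert checkpoints are identical")
    return squared_distance(student_end, expert_target) / expert_progress
```

With this normalization, a value of 0 means the student landed exactly on the expert checkpoint, and a value of 1 means it made no net progress from the shared starting point.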
🧀 Two-Hand tracking via GCN 🧀

πŸ‘‰The first-ever GCN for two interacting hands in single RGB image

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Reconstruction by GCN mesh regression
βœ…PIFA: pyramid attention for local occlusion
βœ…CHA: cross hand attention for interaction
βœ…SOTA + generalization in-the-wild scenario
βœ…Source code available under GNU 🀯

More: https://bit.ly/3KH5FWO
🕹️Video K-Net, SOTA in Segmentation🕹️

👉Simple, strong, and unified framework for fully end-to-end video panoptic segmentation

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Learnable kernels from K-Net
✅K-Net learns to segment & track
✅Appearance / cross-T kernel interaction
✅New SOTA without bells and whistles 🤷‍♂️

More: https://bit.ly/3uEEZQR
🐭DeepLabCut: tracking animals in the wild🐭

πŸ‘‰A toolbox for markerless pose estimation of animals performing various tasks

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Multi-animal pose estimation
βœ…Datasets for multi-animal pose
βœ…Key-points, limbs, animal identity
βœ…Optimal key-points without input

More: https://bit.ly/37L1mLE
🍑Neural Articulated Human Body🍑

πŸ‘‰Novel neural implicit representation for articulated body

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…COmpositional Articulated People
βœ…Large variety of shapes & poses
βœ…Novel encoder-decoder architecture

More: https://bit.ly/3xvn7dl
🦚 2K Resolution Generative #AI 🦚

πŸ‘‰Novel continuous-scale training with variable output resolutions

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Mixed-resolution data
βœ…Arbitrary scales during training
βœ…Generations beyond 1024Γ—1024
βœ…Variant of FID metric for scales
βœ…Source code under MIT license

More: https://bit.ly/3uNfVY6
🐍DS Unsupervised Video Decomposition🐍

πŸ‘‰Novel method to extract persistent elements of a scene

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Scene element as Deformable Sprite (DS)
βœ…Deformable Sprites by video auto-encoder
βœ…Canonical texture image for appearance
βœ…Non-rigid geom. transformation

More: https://bit.ly/37WV9w1