AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
136 photos
250 videos
14 files
1.31K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ“« Generative Neural Avatars πŸ“«

πŸ‘‰3D shapes of people in a variety of garments with corresponding skinning weight

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…ETH + Uni-TΓΌbingen + Max Planck
βœ…Animatable #3D human in garment
βœ…Directly from raw posed 3D scans
βœ…NO canonical, registration, manual w.
βœ…Geometric detail in clothing deformation


More: https://bit.ly/3M7mCdB
πŸ‘3πŸ”₯2πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ—¨οΈConversational program synthesisπŸ—¨οΈ

πŸ‘‰Conversational synthesis to translate English into executable code

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Conversational program synthesis
βœ…New multi-turn progr.benchmark
βœ…Open Custom library: JAXFORMER
βœ…Source code under BSD-3 license

More: https://bit.ly/3jjWWhk
🀯4πŸ₯°2πŸ”₯1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🧯Long Video Diffusion Models🧯

πŸ‘‰#Google unveils a novel diffusion model for video generation

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Straightforward extension of 2D UNet
βœ…Longer by new conditional generation
βœ…SOTA in unconditional generation

More: https://bit.ly/35Y2rzg
πŸ”₯4πŸŽ‰2🀩1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸš™ AutoRF: #3D objects in-the-wild πŸš™

πŸ‘‰From #Meta: #3D object from just a single, in-the wild, image

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Novel view synthesis from in-the-wild
βœ…Normalized, object-centric representation
βœ…Disentangling shape, appearance & pose
βœ…Exploiting BBS & panoptic segmentation
βœ…Shape/appearance properties for objects


More: https://bit.ly/3O4ONeQ
🀯7😱2πŸ”₯1
This media is not supported in your browser
VIEW IN TELEGRAM
🌠GAN-based Darkest Dataset🌠

πŸ‘‰Berkeley + #Intel announce first photorealistic dataset under starlight (no moon, <0.001 lx)

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…"Darkest" dataset ever seen
βœ…Moonless, no external illumination
βœ…GAN-tuned physics-based model
βœ…Clips with dancing, volleyball, flags...

More: https://bit.ly/3LXxMkN
πŸ‘3🀯2πŸ”₯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ€–Populating with digital humansπŸ€–

πŸ‘‰ETHZ unveils GAMMA to populate the #3D scene with digital humans

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…GenerAtive Motion primitive MArkers
βœ…Realistic, controllable, infinite motions
βœ…Tree-based search to preserve quality
βœ…SOTA in realistic/controllable motion

More: https://bit.ly/3OgY4AG
😱5πŸ‘4πŸ”₯2πŸ‘1🀯1🀩1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯#AIwithPapers: we are ~2,000!πŸ”₯

πŸ’™πŸ’› Simply amazing. Thank you all πŸ’™πŸ’›

😈 Invite your friends -> https://t.iss.one/AI_DeepLearning
❀18πŸ”₯8πŸ₯°4πŸ‘3
This media is not supported in your browser
VIEW IN TELEGRAM
😼GARF: Gaussian Activated NeRF😼

πŸ‘‰GARF: Gaussian Activated R.F. for Hi-Fi reconstruction/pose

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…NeRF from imperfect camera poses
βœ…NO hyper-parameter tuning/initialization
βœ…Theoretical insight on Gaussian activation
βœ…Unlocking NeRF for real-world application?

More: https://bit.ly/36bvdfU
πŸ‘4🀩2❀1πŸ‘1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🎭Novel pre-training strategy for #AI🎭

πŸ‘‰EPFL unveils the Multi-modal Multi-task Masked Autoencoders (MultiMAE)

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Multimodal: additional modal. over RGB
βœ…Multi-task: multiple outputs over RGB
βœ…General: MultiMAE by pseudo-labeling
βœ…Classification, segmentation, depth
βœ…Code under NonCommercial 4.0 Int.

More: https://bit.ly/3jRhNsN
πŸ”₯7🀯2πŸ‘1πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ§ͺ A new SOTA in Dataset Distillation πŸ§ͺ

πŸ‘‰A new approach by Matching Training Trajectories is out!

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Distilling data "to match" bigger one
βœ…Distilled data to guide a network
βœ…Trajectories of experts from real data
βœ…SOTA + distilling higher-res visual data

More: https://bit.ly/3JwYOxW
πŸ‘5πŸ”₯1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🧀 Two-Hand tracking via GCN 🧀

πŸ‘‰The first-ever GCN for two interacting hands in single RGB image

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Reconstruction by GCN mesh regression
βœ…PIFA: pyramid attention for local occlusion
βœ…CHA: cross hand attention for interaction
βœ…SOTA + generalization in-the-wild scenario
βœ…Source code available under GNU 🀯

More: https://bit.ly/3KH5FWO
πŸ‘10πŸ‘4🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ•ΉοΈVideo K-Net, SOTA in SegmentationπŸ•ΉοΈ

πŸ‘‰Simple, strong, and unified framework for fully end-to-end video panoptic segmentation

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Learnable kernels from K-Net
βœ…K-Net learns to segment & track
βœ…Appearance / cross-T kernel interaction
βœ…New SOTA without bells and whistles πŸ€·β€β™‚οΈ

More: https://bit.ly/3uEEZQR
πŸ‘6πŸ”₯1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🐭DeepLabCut: tracking animals in the wild🐭

πŸ‘‰A toolbox for markerless pose estimation of animals performing various tasks

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Multi-animal pose estimation
βœ…Datasets for multi-animal pose
βœ…Key-points, limbs, animal identity
βœ…Optimal key-points without input

More: https://bit.ly/37L1mLE
πŸ”₯6πŸ€”4πŸ‘2🀯2❀1πŸ‘1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🍑Neural Articulated Human Body🍑

πŸ‘‰Novel neural implicit representation for articulated body

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…COmpositional Articulated People
βœ…Large variety of shapes & poses
βœ…Novel encoder-decoder architecture

More: https://bit.ly/3xvn7dl
πŸ‘4πŸ₯°2πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🦚 2K Resolution Generative #AI 🦚

πŸ‘‰Novel continuous-scale training with variable output resolutions

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Mixed-resolution data
βœ…Arbitrary scales during training
βœ…Generations beyond 1024Γ—1024
βœ…Variant of FID metric for scales
βœ…Source code under MIT license

More: https://bit.ly/3uNfVY6
🀯11πŸ‘2πŸ”₯2😱1🀩1
This media is not supported in your browser
VIEW IN TELEGRAM
🐍DS Unsupervised Video Decomposition🐍

πŸ‘‰Novel method to extract persistent elements of a scene

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Scene element as Deformable Sprite (DS)
βœ…Deformable Sprites by video auto-encoder
βœ…Canonical texture image for appearance
βœ…Non-rigid geom. transformation

More: https://bit.ly/37WV9w1
πŸ‘4🀯3πŸ”₯1πŸ₯°1πŸ‘1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯“ L-SVPE for Deep Deblurring πŸ₯“

πŸ‘‰L-SVPE to deblur scenes while recovering high-freq details

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Learned Spatially Varying Pixel Exposures
βœ…Next-gen focal-plane sensor + DL
βœ…Deep conv decoder for motion deblurring
βœ…Superior results over non-optimized exp.

More: https://bit.ly/3uRYQMT
🀩7πŸ‘2πŸ€”2πŸŽ‰1
This media is not supported in your browser
VIEW IN TELEGRAM
🧧Hyper-Fast Instance Segmentation🧧

πŸ‘‰Novel Temporally Efficient Vision Transformer (TeViT) for VIS

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Video instance segmentation transformer
βœ…Contextual-info at frame/instance level
βœ…Nearly convolution-free framework πŸ€·β€β™‚οΈ
βœ…The new SOTA for VIS, ~70 FPS!
βœ…Code & models under MIT license

More: https://bit.ly/3rCMXIn
πŸ”₯10πŸ‘3πŸ‘1🀯1
πŸ“—Unified Scene Text/Layout DetectionπŸ“—

πŸ‘‰World's first hierarchical scene text dataset + novel detection method

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Unified detection & geometric layout
βœ…Hierarchical annotations in natural scenes
βœ…Word, line, & paragraph level annotations
βœ…Source under CC Attribution Share Alike 4.0

More: https://bit.ly/3jRpezV
πŸ”₯3🀯2❀1πŸ‘1