AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
136 photos
248 videos
14 files
1.31K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”‹V2X-sim for #selfdriving is out!πŸ”‹

πŸ‘‰V2X: collaboration between a vehicle and any surrounding entity

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Suitable for #selfdrivingcars
βœ…Rec. from road & vehicles
βœ…Multi-streams/perception
βœ…Detection, tracking, & segmentation
βœ…RGB, depth, semantic, BEV & LiDAR

More: https://bit.ly/3H6veOI
πŸ”₯6🀩1
This media is not supported in your browser
VIEW IN TELEGRAM
🍏Infinite Synthetic dataset for Fitness🍏

πŸ‘‰Opensource synthetic images for fitness, single/multi-person, and realistic variation in lighting, camera angles, and occlusions

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…60k images, 1-5 avatars
βœ…15 categories, 21 variations
βœ…Blender and ray-tracing
βœ…SMPL-X + facial expression
βœ…Cloth/skin tone sampled
βœ…147 4K HDRI panoramas
βœ…Creative Commons 4.0

More: https://bit.ly/33B1R9q
🀩5❀1πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
β™Š DITTO: Digital Twins from Interaction β™Š

πŸ‘‰Digitizing objects for #metaverse through interactive perception

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…DIgital Twin of arTiculated Objects
βœ…Geometry & kinematic articulation
βœ…Articulation & 3D via perception
βœ…Source code under MIT License

More:https://bit.ly/3LMazCV
πŸ”₯5❀2πŸ‘1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ€– Robotic Telekinesis from Youtube πŸ€–

πŸ‘‰CMU unveils a Robot that observes humans and imitates their actions in real-time

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Enabling robo-hand teleoperation
βœ…Suitable for untrained operator
βœ…Single uncalibrated RGB camera
βœ…Leveraging unlabeled #youtube
βœ…No active fine-tuning or setup
βœ…No collision via Adv-Training

More: https://bit.ly/3H7zUnh
πŸ”₯3🀯2πŸ‘1πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ’„DIGAN: #AI for video generationπŸ’„

πŸ‘‰A novel INR-based generative adversarial network for video generation

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Dynamics-aware generator
βœ…INR-based clip generator
βœ…Manipulating space/time
βœ…Identifying unnatural motion

More: https://bit.ly/3H6sHE4
πŸ”₯4🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ¦„FILM Neural Frame InterpolationπŸ¦„

πŸ‘‰Frame interpolation that synthesizes multiple intermediate frames from two input images with large in-between motion

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Single unified network
βœ…High quality output
βœ…SOTA on the Xiph
βœ…Apache License 2.0

More: https://bit.ly/3pl4ZxH
πŸ”₯5πŸ‘2πŸ₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”ˆNeural Maintenance via listeningπŸ”ˆ

πŸ‘‰Novel neural-method to detect whether a machine is "healthy" or requires maintenance

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Defects at an early stage
βœ…FDWT, fast discrete wavelet
βœ…Learnable wavelet/denoising
βœ…Unsupervised learnable FDWT
βœ…The new SOTA in PM

More: https://bit.ly/3hiKWeX
🀯6πŸ€”1
This media is not supported in your browser
VIEW IN TELEGRAM
🟦🟨 StyleGAN on Internet pics 🟦🟨

πŸ‘‰StyleGAN on raw uncurated images collected from Internet

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Outliers & multi-modal
βœ…Self-distillation approach
βœ…Self-filtering of outliers
βœ…Perceptual clustering

More: https://bit.ly/33Z1d5H
❀2πŸ‘1πŸ”₯1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🦜The new SOTA for Unsupervised 🦜

πŸ‘‰Self-supervised transformer to discover objects in images

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Visual tokens as nodes in graph
βœ…Edges as connectivity score
βœ…The second smallest eV = fg
βœ…Suitable for unsupervised saliency
βœ…Weakly supervised obj. detection
βœ…Code under MIT License


More: https://bit.ly/3sqbFg3
πŸ‘4πŸ”₯3🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯¦ GAN-generated CryptoPunks πŸ₯¦

πŸ‘‰A simple (and funny) SN-GAN to generate cryptopunks

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Spectral normalization (2018)
βœ…Easy to incorporate into training
βœ…A project by Teddy Koker 🎩

More: https://bit.ly/35C1rQI
❀3😁3πŸ‘1πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ€ͺSEER: self-AI from BILLIONS picπŸ€ͺ

πŸ‘‰META + INRIA trained models on billions of random images without any pre-processing or assumptions

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Self-supervised on pics from web
βœ…Discovering properties in datasets
βœ…More fair, less biased & less harmful
βœ…Better OOD generalization
βœ…Source code available!

More: https://bit.ly/3vy69dd
πŸ”₯4πŸ‘3🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🐲A novel AI-controllable synthesis🐲

πŸ‘‰Modeling local semantic parts separately and synthesizing images in a compositional way

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Structure & texture locally controlled
βœ…Disentanglement between areas
βœ…Fine-grained editing of images
βœ…Extendible via transfer learning
βœ…Just accepted to #CVPR2022

More: https://bit.ly/3IBgkBy
😱3🀯2❀1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯£ #AI-Generation with Dream Fields πŸ₯£

πŸ‘‰Neural rendering with multi-modal image and text representations

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Aligned image & text models
βœ…3D from natural language
βœ…No additional data
βœ…D.F. neural-scene

More: https://bit.ly/3Mhwm5D
πŸ‘10πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸŸͺ Mip-NeRF 360 for unbounded scenes πŸŸͺ

πŸ‘‰An extension of NeRF to overcome the challenges presented by unbounded scenes

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Realistic synthesized views
βœ…Intricate/unbounded scenes
βœ…Detailed depth maps
βœ…Mean-squared error -54%
βœ…No code provided πŸ˜₯

More: https://bit.ly/36ZxsD4
🀯4❀1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ“ PINA: personal Neural Avatar πŸ“

πŸ‘‰A novel method to acquire neural avatars from RGB-D videos

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…A virtual copy of themselves
βœ…Realistic clothing deformations
βœ…Shape & non-rigid deformation
βœ…Avatars from RGB-D sequences
βœ…Creative Commons Zero v1.0

More: https://bit.ly/3HAtRIh
πŸ‘4❀1πŸ‘1😁1
This media is not supported in your browser
VIEW IN TELEGRAM
🐦 EfficientVIS: new SOTA for VIS 🐦

πŸ‘‰Simultaneous classification, segmentation, and tracking multiple object instances in videos

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Efficient and fully end-to-end
βœ…Iterative query-video interaction
βœ…First RoI-wise clip-level RT-VIS
βœ…Requires 15Γ— fewer epochs

More: https://bit.ly/3KfqurN
πŸ‘10πŸ”₯3πŸ‘Ž1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🐠#AI-clips from single frame🐠

πŸ‘‰Moving objects in #3D while generating a video by a sequence of desired actions

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…A playable environments
βœ…A single starting image🀯
βœ…Controllable camera
βœ…Unsupervised learning

More: https://bit.ly/35VDrYO
❀3πŸ‘1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🧊Kubric: AI dataset generator🧊

πŸ‘‰Open-source #Python framework for photo-realistic scenes: full control, rich annotations, TBs of fresh data 🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Synthetic datasets with GT
βœ…From NeRF to optical flow
βœ…Full control over data
βœ…Ok privacy & licensing
βœ…Apache License 2.0

More: https://bit.ly/3hQCaFs
πŸ”₯6πŸ‘1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸͺ‚Β΅Transfer for enormous NNs πŸͺ‚

πŸ‘‰Microsoft unveils how to tune enormous neural networks

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…New HP tuning: Β΅Transfer
βœ…Zero-shot transfer to full-model
βœ…Outperforming BERT-large
βœ…Outperforming 6.7B GPT-3
βœ…Code under MIT license

More: https://bit.ly/3qc37Ij
πŸ”₯2🀯2❀1
This media is not supported in your browser
VIEW IN TELEGRAM
🐧Semantic via only text supervision🐧

πŸ‘‰GroupViT with a text encoder on a large-scale image-text dataset: semantic with any pixel-level annotations in training!

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Hierarc. Grouping Vision Transf.
βœ…Additional text encoder
βœ…NO pixel-level annotations
βœ…Semantic-seg task via zero-shot
βœ…Source code available soon

More:https://bit.ly/3hPGeWr
πŸ‘6πŸ₯°1🀯1