AI with Papers - Artificial Intelligence & Deep Learning

♊ DITTO: Digital Twins from Interaction ♊

👉Digitizing objects for #metaverse through interactive perception

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅DIgital Twin of arTiculated Objects
✅Geometry & kinematic articulation
✅Articulation & 3D via perception
✅Source code under MIT License

More:https://bit.ly/3LMazCV

🔥5❤2👍1🤯1

1.73K views16:00

AI with Papers - Artificial Intelligence & Deep Learning

0:13

This media is not supported in your browser

VIEW IN TELEGRAM

🤖 Robotic Telekinesis from Youtube 🤖

👉CMU unveils a Robot that observes humans and imitates their actions in real-time

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Enabling robo-hand teleoperation
✅Suitable for untrained operator
✅Single uncalibrated RGB camera
✅Leveraging unlabeled #youtube
✅No active fine-tuning or setup
✅No collision via Adv-Training

More: https://bit.ly/3H7zUnh

🔥3🤯2👍1👏1

1.75K viewsedited 10:06

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

💄DIGAN: #AI for video generation💄

👉A novel INR-based generative adversarial network for video generation

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Dynamics-aware generator
✅INR-based clip generator
✅Manipulating space/time
✅Identifying unnatural motion

More: https://bit.ly/3H6sHE4

🔥4🤯1

1.69K views09:18

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🦄FILM Neural Frame Interpolation🦄

👉Frame interpolation that synthesizes multiple intermediate frames from two input images with large in-between motion

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Single unified network
✅High quality output
✅SOTA on the Xiph
✅Apache License 2.0

More: https://bit.ly/3pl4ZxH

🔥5👍2🥰1

1.55K viewsedited 08:56

AI with Papers - Artificial Intelligence & Deep Learning

0:37

This media is not supported in your browser

VIEW IN TELEGRAM

🔈Neural Maintenance via listening🔈

👉Novel neural-method to detect whether a machine is "healthy" or requires maintenance

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Defects at an early stage
✅FDWT, fast discrete wavelet
✅Learnable wavelet/denoising
✅Unsupervised learnable FDWT
✅The new SOTA in PM

More: https://bit.ly/3hiKWeX

🤯6🤔1

1.53K views13:55

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🟦🟨 StyleGAN on Internet pics 🟦🟨

👉StyleGAN on raw uncurated images collected from Internet

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Outliers & multi-modal
✅Self-distillation approach
✅Self-filtering of outliers
✅Perceptual clustering

More: https://bit.ly/33Z1d5H

❤2👍1🔥1🤯1

1.5K viewsedited 08:06

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🦜The new SOTA for Unsupervised 🦜

👉Self-supervised transformer to discover objects in images

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Visual tokens as nodes in graph
✅Edges as connectivity score
✅The second smallest eV = fg
✅Suitable for unsupervised saliency
✅Weakly supervised obj. detection
✅Code under MIT License

More: https://bit.ly/3sqbFg3

👍4🔥3🤯1

1.54K viewsedited 13:17

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🥦 GAN-generated CryptoPunks 🥦

👉A simple (and funny) SN-GAN to generate cryptopunks

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Spectral normalization (2018)
✅Easy to incorporate into training
✅A project by Teddy Koker 🎩

More: https://bit.ly/35C1rQI

❤3😁3👍1👏1

1.52K views13:00

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🤪SEER: self-AI from BILLIONS pic🤪

👉META + INRIA trained models on billions of random images without any pre-processing or assumptions

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Self-supervised on pics from web
✅Discovering properties in datasets
✅More fair, less biased & less harmful
✅Better OOD generalization
✅Source code available!

More: https://bit.ly/3vy69dd

🔥4👍3🤯1

1.53K viewsedited 13:05

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🐲A novel AI-controllable synthesis🐲

👉Modeling local semantic parts separately and synthesizing images in a compositional way

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Structure & texture locally controlled
✅Disentanglement between areas
✅Fine-grained editing of images
✅Extendible via transfer learning
✅Just accepted to #CVPR2022

More: https://bit.ly/3IBgkBy

😱3🤯2❤1

1.57K viewsedited 14:57

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🥣 #AI-Generation with Dream Fields 🥣

👉Neural rendering with multi-modal image and text representations

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Aligned image & text models
✅3D from natural language
✅No additional data
✅D.F. neural-scene

More: https://bit.ly/3Mhwm5D

👍10👏1

1.65K views11:13

AI with Papers - Artificial Intelligence & Deep Learning

0:16

This media is not supported in your browser

VIEW IN TELEGRAM

🟪 Mip-NeRF 360 for unbounded scenes 🟪

👉An extension of NeRF to overcome the challenges presented by unbounded scenes

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Realistic synthesized views
✅Intricate/unbounded scenes
✅Detailed depth maps
✅Mean-squared error -54%
✅No code provided 😥

More: https://bit.ly/36ZxsD4

🤯4❤1

1.74K views15:28

AI with Papers - Artificial Intelligence & Deep Learning

0:10

This media is not supported in your browser

VIEW IN TELEGRAM

🐓 PINA: personal Neural Avatar 🐓

👉A novel method to acquire neural avatars from RGB-D videos

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅A virtual copy of themselves
✅Realistic clothing deformations
✅Shape & non-rigid deformation
✅Avatars from RGB-D sequences
✅Creative Commons Zero v1.0

More: https://bit.ly/3HAtRIh

👍4❤1👏1😁1

1.82K viewsedited 08:46

AI with Papers - Artificial Intelligence & Deep Learning

0:02

This media is not supported in your browser

VIEW IN TELEGRAM

🐦 EfficientVIS: new SOTA for VIS 🐦

👉Simultaneous classification, segmentation, and tracking multiple object instances in videos

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Efficient and fully end-to-end
✅Iterative query-video interaction
✅First RoI-wise clip-level RT-VIS
✅Requires 15× fewer epochs

More: https://bit.ly/3KfqurN

👍10🔥3👎1🤯1

1.87K viewsedited 13:03

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🐠#AI-clips from single frame🐠

👉Moving objects in #3D while generating a video by a sequence of desired actions

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅A playable environments
✅A single starting image🤯
✅Controllable camera
✅Unsupervised learning

More: https://bit.ly/35VDrYO

❤3👏1🤯1

1.68K viewsedited 13:00

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🧊Kubric: AI dataset generator🧊

👉Open-source #Python framework for photo-realistic scenes: full control, rich annotations, TBs of fresh data 🤯

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Synthetic datasets with GT
✅From NeRF to optical flow
✅Full control over data
✅Ok privacy & licensing
✅Apache License 2.0

More: https://bit.ly/3hQCaFs

🔥6👍1🤯1

1.73K viewsedited 10:35

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🪂µTransfer for enormous NNs 🪂

👉Microsoft unveils how to tune enormous neural networks

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅New HP tuning: µTransfer
✅Zero-shot transfer to full-model
✅Outperforming BERT-large
✅Outperforming 6.7B GPT-3
✅Code under MIT license

More: https://bit.ly/3qc37Ij

🔥2🤯2❤1

1.64K viewsedited 08:47

AI with Papers - Artificial Intelligence & Deep Learning

0:14

This media is not supported in your browser

VIEW IN TELEGRAM

🐧Semantic via only text supervision🐧

👉GroupViT with a text encoder on a large-scale image-text dataset: semantic with any pixel-level annotations in training!

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Hierarc. Grouping Vision Transf.
✅Additional text encoder
✅NO pixel-level annotations
✅Semantic-seg task via zero-shot
✅Source code available soon

More:https://bit.ly/3hPGeWr

👍6🥰1🤯1

1.82K viewsedited 12:54

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

⌚4D-Net: Lidar + RGB synchronization⌚

👉Google unveils 4D-Net to combine 3D LiDAR and onboard RGB camera

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Point clouds/images in time
✅Fusing multiple modalities in 4D
✅Novel sampling for 3D P.C. in time
✅New SOTA for 3D detection

More: https://bit.ly/3hZCFwN

👍12🔥2🤯1

1.79K views08:30

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🐌 New SOTA in video synthesis! 🐌

👉Snap unveils a novel multimodal video generation framework via text/images

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Multimodal video generation
✅Bidirectional transformer
✅Video token with self-learn.
✅Text augmentation for robustness
✅Longer sequence synthesis

More: https://bit.ly/3hZLXsG

🤯4👍1🔥1👏1

1.75K views09:46

About

Blog

Apps

Platform