AI with Papers - Artificial Intelligence & Deep Learning

🐦 EfficientVIS: new SOTA for VIS 🐦

👉Simultaneous classification, segmentation, and tracking of multiple object instances in videos

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Efficient and fully end-to-end
✅Iterative query-video interaction
✅First RoI-wise clip-level RT-VIS
✅Requires 15× fewer epochs

More: https://bit.ly/3KfqurN

👍10🔥3👎1🤯1

1.87K views, edited 13:03

🐠#AI-clips from single frame🐠

👉Moving objects in #3D while generating a video by a sequence of desired actions

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Playable environments
✅A single starting image🤯
✅Controllable camera
✅Unsupervised learning

More: https://bit.ly/35VDrYO

❤3👏1🤯1

1.69K views, edited 13:00

🧊Kubric: AI dataset generator🧊

👉Open-source #Python framework for photo-realistic scenes: full control, rich annotations, TBs of fresh data 🤯

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Synthetic datasets with GT
✅From NeRF to optical flow
✅Full control over data
✅Privacy- and licensing-friendly
✅Apache License 2.0

More: https://bit.ly/3hQCaFs

🔥6👍1🤯1

1.73K views, edited 10:35

🪂µTransfer for enormous NNs 🪂

👉Microsoft unveils how to tune enormous neural networks

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅New HP tuning: µTransfer
✅Zero-shot transfer to full-model
✅Outperforming BERT-large
✅Outperforming 6.7B GPT-3
✅Code under MIT license

More: https://bit.ly/3qc37Ij

🔥2🤯2❤1

1.64K views, edited 08:47

🐧Semantic segmentation via text-only supervision🐧

👉GroupViT with a text encoder, trained on a large-scale image-text dataset: semantic segmentation without any pixel-level annotations in training!

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Hierarc. Grouping Vision Transf.
✅Additional text encoder
✅NO pixel-level annotations
✅Semantic-seg task via zero-shot
✅Source code available soon

More: https://bit.ly/3hPGeWr

👍6🥰1🤯1

1.83K views, edited 12:54

⌚4D-Net: LiDAR + RGB synchronization⌚

👉Google unveils 4D-Net to combine 3D LiDAR and onboard RGB camera

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Point clouds/images in time
✅Fusing multiple modalities in 4D
✅Novel sampling for 3D P.C. in time
✅New SOTA for 3D detection

More: https://bit.ly/3hZCFwN

👍12🔥2🤯1

1.8K views, 08:30

🐌 New SOTA in video synthesis! 🐌

👉Snap unveils a novel multimodal video generation framework via text/images

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Multimodal video generation
✅Bidirectional transformer
✅Video token with self-learn.
✅Text augmentation for robustness
✅Longer sequence synthesis

More: https://bit.ly/3hZLXsG

🤯4👍1🔥1👏1

1.75K views, 09:46

🎁 StyleNeRF source code is out 🎁

👉3D consistent photo-realistic image synthesis

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅NeRF + style generator
✅3D consistency for HD image
✅Novel regularization loss
✅Camera control on styles

More: https://bit.ly/3t5xC49

🔥4🥰1🤯1

1.62K views, 16:17

🦎CLD-based generative #AI by #Nvidia🦎

👉Nvidia unveils a novel critically-damped Langevin diffusion (CLD) for synthetic data

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅A novel diffusion process for SGMs
✅Novel score matching obj. for CLD
✅Hybrid denoising score matching
✅Efficient sampling from CLD model
✅Source code under a specific license

More: https://bit.ly/35MToBe

🔥2🤩2👍1🤯1

1.73K views, 19:47

🛸UFO: segmentation @140+ FPS🛸

👉Unified Transformer Framework for Co-Segmentation, Co-Saliency & Salient Object Detection. All in one!

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Unified framework for co-segmentation
✅Co-segmentation, co-saliency, saliency
✅Block for long-range dependencies
✅Reaches 140+ FPS at inference
✅The new SOTA on multiple datasets
✅Source code under MIT License

More: https://bit.ly/3KLd9b9

🔥6👍1🤯1

1.75K views, edited 13:44

👗 Multi-GANs fashion 👗

👉Global GAN blended with other GANs for faces, shoes, etc.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Multi-GAN framework
✅Several generators
✅Free of artifacts
✅Full-body generation
✅Humans, 1024x1024

More: https://bit.ly/37mfOte

🔥2👏2❤1🤯1

1.83K views, edited 13:11

🚧 FLAG: #3D Avatar Generation 🚧

👉A flow-based generative model of the 3D human body from sparse observations.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅FLow-based Avatar Generative model
✅Conditional distro of body pose
✅Exact pose likelihood process
✅Invertibility -> oracle latent code

More: https://bit.ly/3CQpk3p

👏2🔥1🤯1

1.67K views, edited 11:40

💃 Dancing in the wild with StyleGAN 💃

👉StyleGAN-based animations for AR/VR apps

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Video based motion retargeting
✅StyleGAN-based architecture
✅Novel explicit motion representation
✅SOTA qualitatively & quantitatively

More: https://bit.ly/3CZbL1W

👍6🤯3🥰2

1.71K views, 20:19

🪀TensoRF: the 4D evolution of NeRF 🪀

👉TensoRF models a radiance field as a 4D tensor: a 3D voxel grid with per-voxel multi-channel features.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅VM decomposition technique
✅Low-rank tensor factorization
✅Lower memory footprint (speed)
✅TensoRF is the new SOTA in R.F.
✅Code under the MIT License
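
To make the VM decomposition highlight above more concrete, here is a minimal NumPy sketch of a vector-matrix factorization for a single-channel 3D grid. It is an illustration only, not the authors' code: the function names, argument shapes, and the single-channel simplification are all assumptions made for this example.

```python
import numpy as np


def vm_reconstruct(vx, myz, vy, mxz, vz, mxy):
    """Rebuild a dense (I, J, K) grid from vector-matrix factors.

    Hypothetical shapes, with R components per axis:
      vx: (R, I), myz: (R, J, K)
      vy: (R, J), mxz: (R, I, K)
      vz: (R, K), mxy: (R, I, J)
    Each term is a sum over r of the outer product of a vector along
    one axis with a matrix over the other two axes.
    """
    return (
        np.einsum("ri,rjk->ijk", vx, myz)
        + np.einsum("rj,rik->ijk", vy, mxz)
        + np.einsum("rk,rij->ijk", vz, mxy)
    )


def vm_param_count(i, j, k, r):
    """Number of stored values in the factorized form."""
    return r * (i + j * k) + r * (j + i * k) + r * (k + i * j)


def dense_param_count(i, j, k):
    """Number of stored values in the dense grid."""
    return i * j * k


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    R, I, J, K = 2, 3, 4, 5
    factors = (
        rng.normal(size=(R, I)), rng.normal(size=(R, J, K)),
        rng.normal(size=(R, J)), rng.normal(size=(R, I, K)),
        rng.normal(size=(R, K)), rng.normal(size=(R, I, J)),
    )
    grid = vm_reconstruct(*factors)
    print(grid.shape)
    print(vm_param_count(128, 128, 128, 16), dense_param_count(128, 128, 128))
```

The factorized form stores a number of values that grows with the square of the grid resolution rather than its cube, which is the intuition behind the lower memory footprint.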

More: https://bit.ly/3qffZgI

👍2🔥1

1.74K views, 07:41

🔼 GAN-meshes without key-points 🔼

👉ETH unveils a GAN framework for generating textured triangle meshes without annotations

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Generation of textured meshes
✅3D generator for all categories
✅3D pose estimation framework
✅Code licensed under MIT License

More: https://bit.ly/3qfH9nJ

🤩3🤯2👍1🔥1

1.81K views, 10:58

🐯 S.S. Latent Image Animator 🐯

👉Self-supervised autoencoder to animate unseen images by linear navigation in latent space

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Latent Image Animator
✅Linear displacement in latent space
✅SOTA: VoxCeleb, Taichi, TED-talk
✅Source code available soon

More: https://bit.ly/36pgLAC

👍5🔥3🤯2💩1

1.86K views, edited 14:08

🪨 Google URF for neural-synthesis 🪨

👉Sequence of RGB + Lidar -> 3D surfaces and novel RGB images synthesized

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Extending Neural Radiance Fields
✅Leveraging asynch. lidar data
✅Addressing exposure variation
✅Leveraging segmentations for sky
✅SOTA #3D reconstruction/synthesis

More: https://bit.ly/3L2vTDb

🔥11👍4👏1🤯1

1.85K views, 08:51

🚛 AV2: next-gen. self driving 🚛

👉One of the biggest datasets ever for #autonomousdriving

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅1k seq. of multimodal data
✅3D annotations, 26 categories
✅20k lidar & map-aligned pose
✅250k challenging interactions
✅HD Map: 3D lane & crosswalk
✅CC BY-NC-SA 4.0 license

More: https://bit.ly/3trx3lw

🔥3👍1🤯1

1.71K views, edited 08:05

🤖CaTGrasp in Clutter from Simulation🤖

👉Task-relevant grasping: trained solely in simulation with synthetic data + self-supervised (S.S.) hand-object interaction

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Novel cat-level, relevant grasping
✅S.S. hand-object-contact
✅Tiny objects from dense clutter
✅Trained in simulation -> transfers to real
✅Source code under Apache 2.0

More: https://bit.ly/3L2YVCo

👍1🔥1

1.69K views, edited 15:33

🛼 Drive & Segment without Supervision 🛼

👉Learning pixel-wise semantic seg. from non-curated data collected by cars (cameras + LiDAR) driving around a city

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Cross-modal unsupervised
✅Synchronized LiDAR & RGB
✅Object proposal on LiDAR points
✅SOTA, significant improvements

More: https://bit.ly/3L0wWTW

👍3🔥1🤯1

1.75K views, edited 09:50
