AI with Papers - Artificial Intelligence & Deep Learning

🏔️MPS-Net: new SOTA for #3D human🏔️

👉MPS-Net: accurate & temporally coherent 3D human pose/shape from video

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅MoCA: visual cues from motion
✅HAFI to mix past/future feats
✅Stronger temporal correlation
✅SOTA on multiple datasets

More: https://bit.ly/3uAI5EB

🤯9🔥1🥰1😱1

1.76K views07:53

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🤿Transfiner: hyper-detailed segmentation🤿

👉Mask Transfiner: #AI for HQ & efficient instance segmentation

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Transfiner: HQ segmentation
✅HQ seg. via quadtree structure
✅SOTA & extreme details
✅Code under MIT License

More: https://bit.ly/3KVzseM

👍5🔥3🤯1

1.8K viewsedited 18:32

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🥙 DualStyleGAN: SOTA in style transfer🥙

👉Flexible control of dual styles of face domain and extended artistic portrait domain

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅High-resolution (1024*1024)
✅Intrinsic/extrinsic style path
✅Hierarchical style manipulation
✅Novel progressive fine-tuning
✅Source code under MIT License

More: https://bit.ly/3uS26Xp

👍11🤩4🔥1

1.81K viewsedited 06:33

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🍚 GTR: Global Tracking Transformers 🍚

👉UTexas + Apple: transformer for global multi-object tracking

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅GTR operates on any object
✅Few frames->global trajectories
✅SOTA on detectors for any object
✅Code under Apache License 2.0

More: https://bit.ly/3DiqkxF

🔥7👍2🤯2😱1

1.84K views07:59

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🧠E2E Perception for #selfdrivingcars🧠

👉HybridNets: multi-task net with several key optimizations

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅End-to-end perception network
✅Traffic, lane, object detection
✅Drivable segmentation area
✅Real-time on embedded systems
✅Source code under MIT License

More: https://bit.ly/3JMk8Az

👍8❤4👏2🤯1😱1

1.9K viewsedited 13:41

AI with Papers - Artificial Intelligence & Deep Learning

1:10

Media is too big

VIEW IN TELEGRAM

🛩️Smart Parking with UAVs🛩️

👉A novel methodology to monitor car parking areas in real-time via Drones/UAVs

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅YoloV3 + DeepSort tracker
✅Vehicle detection/tracking
✅Occupancy estimation via RT
✅Four blocks, unique pipeline

More: https://bit.ly/3iJD8nm

❤8👍5🥰1🤯1

1.92K viewsedited 16:03

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

👕 Detecting Events via #AI 👕

👉Localizing object states & corresponding state-modifying actions

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅SS-learning state-modifying
✅Noise adaptive weighting
✅ChangeIt: 2.6k+ hrs , 34k+ changes
✅Dataset, code, and model!

More: https://bit.ly/3uBwxkj

👍7🤯1

1.86K viewsedited 07:26

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🌈🌈 Interactive Neural Labelling 🌈🌈

👉Dense labelling of geometry, color & semantics via #3D neural field

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅No training data
✅Dense labeling
✅Classes on the fly
✅Labelling at a scale

More: https://bit.ly/36Y0faQ

🔥4👍1🤯1😱1

1.85K views12:53

AI with Papers - Artificial Intelligence & Deep Learning

0:08

This media is not supported in your browser

VIEW IN TELEGRAM

♟️Neural RGB-D Reconstruction♟️

👉Novel approach for #3D mixing implicit surface representations with NeRFs

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅RGB-D based reconstruction
✅Leveraging color & depth
✅Depth into the NeRF
✅Pose & camera refinement

More: https://bit.ly/3iN6e54

🔥5👍2🤯2🤩1

1.87K views14:52

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🦓 Hyper-Fast Refinement 🦓

👉SharpContour: novel contour-based refinement for semantic segmentation

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Instance-aware Point Classifier
✅Deforming by discrete updating
✅Estimating offsets independently
✅Source code soon available!

More: https://bit.ly/3qL04GY

👍5🔥4🤯1😱1

1.86K views08:44

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🥗 Neural Mesh via Text only 🥗

👉Zero-shot generation of 3D model using only a target text prompt

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ZS 3D model with text only
✅ZS text-guided generation
✅Meshes with texture/normal
✅Differentiable LLS implementation

More: https://bit.ly/3u0qnvb

🤯9👍1🔥1

1.83K views12:45

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🪆#3D, Materials, and Lighting from 2D🪆

👉Nvidia: topology, materials & map lighting jointly from 2D. INSANE 😮

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Topology, materials and lighting
✅Meshes with materials/lighting
✅Compact volumetric texturing
✅Differentiable all-frequency lighting
✅Code under #NVIDIA License

More: https://bit.ly/3IUoF2t

👏5👍1🤯1😱1

1.92K viewsedited 15:21

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🍜Ref-NeRF for extreme realism🍜

👉Ref-NeRF: reflected radiance & structures via collection of spatially-varying scene properties

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Realism and accuracy
✅Replacing NeRF’s params
✅Regularization of volume density
✅Integrated Directional Encoding

More: https://bit.ly/3tTlS5l

👍4🤯2🔥1

1.88K views10:11

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🦧OFA for all: Cross, Vision, Language🦧

👉Unified multimodal model for image generation, visual grounding, etc.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Sequence-to-sequence learning
✅Image Captioning / Generation
✅Visual Grounding / Classification
✅Text-to-Image Generation
✅Visual Question Answering

More: https://bit.ly/3wSTGlc

👍7🤯6👏1

1.97K views13:44

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🍿Old Films Back to Life with #AI🍿

👉Recurrent transformer network (RTN) to restore heavily degraded old films

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Transformer blocks for spatial
✅Knowledge from adjacent frames
✅Color from keyframes to whole clip
✅Source code available in days!

More: https://bit.ly/3wZbV8y

❤12👍2🤯1

1.82K views07:54

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🍊Neural Head #Avatars from RGB🍊

👉Novel neural representation for animatable head avatar

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Novel articulated human head
✅Full-geometry reconstruction
✅Differentiable optimization pipeline
✅Disentanglement of shape/color

More: https://bit.ly/3DxUGMI

🔥3🤯2😱1

1.85K viewsedited 09:26

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🌶️ MyStyle: personal generative #AI 🌶️

👉Personalized deep generation with a few shots of a person

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Small set of portraits(∼100)
✅Local, low-dim, personal manifold
✅Personal #AI for ill-posed tasks
✅SOTA vs. previous few-shots

More: https://bit.ly/3wWMwMu

🔥5👍4🤯1

1.92K views10:37

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🦆 GAN + Dense Map 🦆

👉CoordGAN: structure-texture disentangled GAN with dense correspondence map

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Novel coordinate space
✅Warping to learn coordinate
✅Encoder for structure representation
✅HQ structure/texture editable images

More: https://bit.ly/3DOlOaB

🤯4❤2🔥2👏1

1.87K views09:37

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

⚓Unified shape & non-rigid motion⚓

👉CaDeX: SOTA in both shape & non-rigid motion

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Canonical Deformation Coordinate Space
✅Shape + non rigid motion representation
✅Factorization of def-homeomorphisms
✅Cycle consistency, topology & volume
✅SOTA in modelling deformable objects

More: https://bit.ly/3NM5NX1

❤4🤯1😱1

1.81K viewsedited 12:01

AI with Papers - Artificial Intelligence & Deep Learning

📸 ~6 BILLION CLIP-filtered pairs 📸

👉A dataset 14x bigger than the previously biggest openly accessible image-text dataset in the world.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅2,3B English image-text pairs
✅2,2B from 100+ other languages
✅1,3B language not detected
✅KNN index for quick search

More: https://bit.ly/3LFhKvT

❤3🤯1

1.83K viewsedited 13:51

About

Blog

Apps

Platform