AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
135 photos
247 videos
14 files
1.31K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI

✋HaGRID: Half a Million Hands👋

👉Russia's Sberbank opens HaGRID, an enormous dataset for hand gesture recognition (HGR). A "Peace" label is present 🔵🟡

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅552,992 samples, 18 classes
✅HD resolution in RGB format
✅BBox, gesture, leading hand annotations
✅Dataset/models available

More: https://bit.ly/3n2cd8r

🔥 #AIwithPapers: we are 2,900+! 🔥

💙💛 Cheers from "Black Metal Lady Gaga", generated by DALL-E mini 💙💛

😈 Invite your friends -> https://t.iss.one/AI_DeepLearning

Segmentation with INSANE Occlusions

👉CMU unveils WALT: segmentation under severe occlusion. Outperforms human-supervised methods.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅WALT: Watch & Learn Time-lapse
✅4K/1080p street cameras recorded over a year
✅Outperforms human-supervised methods
✅Object-occluder-occluded neural layers
✅Source code under MIT license

More: https://bit.ly/3n7pvjO

🐠Largest Dataset for #autonomousdriving🐠

👉SHIFT: the largest synthetic dataset for #selfdrivingcars. Domain shifts in cloudiness, rain, fog, time of day, vehicle & pedestrian density🤯

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅4,800+ clips, multi-view sensor suite
✅Semantic/instance segmentation, mono/stereo depth
✅2D/3D object detection, MOT
✅Optical flow, point cloud registration
✅Visual odometry, trajectory & human pose

More: https://bit.ly/3HJBUUT

Big Egocentric Dataset by #Meta

👉Novel dataset to speed up research on egocentric MR/AI

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅159 sequences, multiple sensors
✅Scenarios: cooking, exercising, etc.
✅‘Desktop Activities’ via multi-view mocap
✅Dataset available upon request

More: https://bit.ly/3QDccVW

🦋Transformer-Codebook HD Face Restoration🦋

👉S-Lab unveils CodeFormer: hyper-detailed face restoration from degraded clips

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Face restoration as code prediction
✅Discrete codebook prior in a small proxy space
✅Controllable transformation for LQ->HQ
✅Robustness and global coherence
✅Code and models available soon

More: https://bit.ly/3QEa9B5

Fully Controllable "NeRF" Faces

👉Neural control of pose/expressions from a single portrait video

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅NeRF-control of the human head
✅Loss of rigidity by dynamic NeRF
✅3D full control/modelling of faces
✅No source code or models yet 😢

More: https://bit.ly/3OEjwi7

I M AVATAR: source code is out!

👉Neural implicit head avatars from monocular videos

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅#3D morphing-based implicit avatar
✅Detailed geometry/appearance
✅D-Rendering e2e learning from clips
✅Novel synthetic dataset for evaluation

More: https://bit.ly/3A2yzy9

🗺️Neural Translation Image -> Map🗺️

👉A novel method that treats instantaneous mapping as a translation problem

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Bird’s-eye-view (BEV) map from an image
✅A restricted data-efficient transformer
✅Monotonic attention from the language domain
✅SOTA across several datasets

More: https://bit.ly/39MQ76Z

🥶 E2V-SDE: biggest troll ever? 🥶

👉The E2V-SDE paper (accepted to #CVPR2022) consists of text copied from 10+ previously published papers 😂

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Latent ODEs for Irregularly-Sampled TS
✅Stochastic Adversarial Video Prediction
✅Continuous Latent Process Flows
✅More papers...

More: https://bit.ly/3bsL8Zw (AUDIO ON!)

🔥🔥YOLOv6 is out: PURE FIRE!🔥🔥

👉YOLOv6 is a single-stage object detection framework for industrial applications

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Efficient Decoupled Head with SIoU Loss
✅Hardware-friendly Backbone/Neck design
✅520+ FPS on T4 + TensorRT FP16
✅Released under GNU General Public License v3.0

More: https://bit.ly/3OLjncK
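
For a sense of scale, the FPS figure can be turned into an average time budget per frame. This is only a back-of-the-envelope sketch: it assumes the 520 FPS number is sustained throughput, and the helper names and the 30 FPS camera rate are illustrative assumptions, not part of the YOLOv6 repository.

```python
def ms_per_frame(fps: float) -> float:
    """Average time per frame in milliseconds at a given throughput."""
    if fps <= 0:
        raise ValueError("fps must be positive")
    return 1000.0 / fps


def max_streams(model_fps: float, camera_fps: float) -> int:
    """Camera streams that fit into the model's throughput (ideal case, no overhead)."""
    return int(model_fps // camera_fps)


print(round(ms_per_frame(520), 2))  # 1.92
print(max_streams(520, 30))         # 17
```

Real deployments add pre/post-processing and batching effects, so these numbers are an upper bound rather than a measurement.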

BlazePose: Real-Time Human Tracking

👉Novel real-time #3D human landmarks from #google. Suitable for mobile.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅MoCap from a single RGB camera on mobile
✅Avatar, Fitness, #Yoga & AR/VR
✅Full body pose from monocular input
✅Novel 3D ground truth acquisition
✅Additional hand landmarks
✅Fully integrated in #MediaPipe

More: https://bit.ly/3uvyiAv

🔥YOLOv7: YOLO for segmentation🔥

👉YOLOv7: adding a lot of newer skills to the YOLO architecture family.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅YOLOv7: not a successor in the YOLO family!
✅Framework for detection & segmentation
✅Applications based on #META detectron2
✅DETR & ViT detection out of the box
✅Easy pipeline support through #ONNX
✅YOLOv4 + instance segmentation in a single stage
✅The latest YOLOv6 training is supported!
✅Source code under GPL license.

More: https://bit.ly/3ysSJAp

🔥🔥 HD Dichotomous Segmentation 🔥🔥

👉A new task: highly accurate segmentation of objects in natural images.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅5,000+ HD images + accurate binary masks
✅IS-Net baseline in high-dim feature spaces
✅HCE: model vs. human interventions
✅Source code (should be) available soon

More: https://bit.ly/3ah2BDO

🔥🔥 Neural Segmentation on fire 🔥🔥

👉Novel methods for segmentation with mask calibration. Robustness++ in VOS.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Study: VOS robustness vs. perturbations
✅Adaptive object proxy (AOP) aggregation
✅Fewer errors due to unstable pixel-level matching
✅Code/models (should be) available soon

More: https://bit.ly/3yhIY6Q

😊😎 Seq-DeepFake via Transformers 😎😊

👉S-Lab opens Seq-DeepFake: Detecting Sequential DeepFake Manipulation

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Seq-DeepFake: sequences of facial edits
✅Dataset: 85k #deepfake manipulation
✅Powerful Seq-DeepFake Transformer
✅Code, dataset and models available!

More: https://bit.ly/3ACQXhi

Text2LIVE: Text-Driven Neural Editing

👉Weizmann Institute & #Nvidia unveil a novel #AI for text-driven editing of images and videos. Insane! 🤯

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Semantic edits of real-world videos
✅Edit layer (RGBA) representing the target edit
✅Edit layers synthesized from a single input
✅No masks or pre-trained generator needed

More: https://bit.ly/3NVP6aE
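
The RGBA edit layer mentioned above is blended over the original input. A minimal sketch of that blending step, using the standard "over" compositing operator on plain Python tuples; the function names and the tiny 1x2 test image are illustrative assumptions and are not taken from the Text2LIVE code:

```python
from typing import List, Tuple

RGB = Tuple[float, float, float]          # color channels in [0, 1]
RGBA = Tuple[float, float, float, float]  # color plus opacity in [0, 1]


def composite(edit: RGBA, source: RGB) -> RGB:
    """Blend one edit-layer pixel over one source pixel ("over" operator)."""
    r, g, b, alpha = edit
    return tuple(alpha * e + (1.0 - alpha) * s for e, s in zip((r, g, b), source))


def composite_image(edit_layer: List[List[RGBA]], image: List[List[RGB]]) -> List[List[RGB]]:
    """Apply the blend pixel by pixel to two equally sized 2D grids."""
    return [
        [composite(e, s) for e, s in zip(edit_row, img_row)]
        for edit_row, img_row in zip(edit_layer, image)
    ]


image = [[(0.2, 0.2, 0.2), (0.8, 0.8, 0.8)]]
edit_layer = [[(1.0, 0.0, 0.0, 0.0),   # fully transparent: source pixel is kept
               (1.0, 0.0, 0.0, 1.0)]]  # fully opaque: edit color replaces source
print(composite_image(edit_layer, image))
# [[(0.2, 0.2, 0.2), (1.0, 0.0, 0.0)]]
```

Because the opacity channel decides where the edit applies, no separate user-provided mask is needed in this formulation.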

📟📟AI-Designed Circuits with Deep RL📟📟

👉#Nvidia unveils an #AI to design circuits from scratch, smaller and faster than SOTA ones

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Parallel prefix circuits for high performance
✅RL framework to explore the circuit space
✅Smaller, faster, lower power, from scratch

More: https://bit.ly/3yY9dk7
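
For readers unfamiliar with parallel prefix circuits, the sketch below shows what such a structure computes. It uses the classic Kogge-Stone pattern purely as an illustration; it is not the RL-designed circuit from the post, and the function name is an assumption.

```python
from typing import Callable, List, TypeVar

T = TypeVar("T")


def parallel_prefix(values: List[T], op: Callable[[T, T], T]) -> List[T]:
    """Inclusive prefix scan in ceil(log2(n)) levels (Kogge-Stone pattern).

    At each level, every position i >= stride combines with position
    i - stride. All combinations within one level are independent, so
    hardware can evaluate them in parallel; here they run in a loop.
    """
    out = list(values)
    stride = 1
    while stride < len(out):
        prev = out  # results of the previous level
        out = [
            op(prev[i - stride], prev[i]) if i >= stride else prev[i]
            for i in range(len(prev))
        ]
        stride *= 2
    return out


print(parallel_prefix([3, 1, 4, 1, 5], lambda a, b: a + b))
# [3, 4, 8, 9, 14]
```

Different prefix layouts trade the number of operator nodes (area, power) against the number of levels (delay), which is the kind of design space a search method can explore.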

👽 Neural I2I with a few shots 👽

👉#Alibaba unveils a novel portrait stylization method. Limited samples (~100) -> HD outputs

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Calibration first, translation later
✅Balanced distribution to calibrate bias
✅Spatially semantic constraints via geometry
✅Source code and models available soon!

More: https://bit.ly/3IwOmHO