AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
135 photos
247 videos
14 files
1.31K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯YOLOv7: YOLO for segmentationπŸ”₯

πŸ‘‰YOLOv7: adding a lot of newer skills to the YOLO architecture family.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…YOLOv7, not a successor of YOLO family!
βœ…Framework for detection & segmentation
βœ…Applications based on #META detectron2
βœ…DETR & ViT detection out-of-box
βœ…Easy support for pipeline thought #ONNX
βœ…YOLOv4 + InstanceSegm. via single stage
βœ…The latest YOLOv6 training is supported!
βœ…Source code under GPL license.

More: https://bit.ly/3ysSJAp
πŸ”₯22🀯9πŸ‘5😁2
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯πŸ”₯ HD Dichotomous Segmentation πŸ”₯πŸ”₯

πŸ‘‰ A new task to segment highly accurate objects from natural images.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…5,000+ HD images + accurate binary mask
βœ…IS-Net baseline in high-dim feature spaces
βœ…HCE: model vs. human interventions
βœ…Source code (should be) available soon

More: https://bit.ly/3ah2BDO
πŸ”₯13
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯πŸ”₯ Neural Segmentation on fire πŸ”₯πŸ”₯

πŸ‘‰Novel methods for segmentation with mask calibration. Robustness++ in VOS.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Study: VOS robustness vs. perturbations
βœ…Adaptive object proxy (AOP) aggregation
βœ…Less errors due unstable pixel-level match
βœ…Code/models (should be) available soon

More: https://bit.ly/3yhIY6Q
πŸ‘15❀1πŸ”₯1
This media is not supported in your browser
VIEW IN TELEGRAM
😊😎 Seq-DeepFake via Transformers 😎😊

πŸ‘‰S-Lab opens Seq-DeepFake: Detecting Sequential DeepFake Manipulation

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Seq-DeepFake: sequences of facial edits
βœ…Dataset: 85k #deepfake manipulation
βœ…Powerful Seq-DeepFake Transformer
βœ…Code, dataset and models available!

More: https://bit.ly/3ACQXhi
πŸ‘15πŸ”₯2❀1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ¦’ Text2LIVE: Text-Driven Neural Editing πŸ¦’

πŸ‘‰#Amazon unveils a novel #AI for text-driven edit of videos. Insane! 🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Semantic edits of real-world videos
βœ…Edit layer–RGBA representing target
βœ…Edit layers synthesized on single input
βœ…No masks or a pre-trained generator

More: https://bit.ly/3NVP6aE
🀯18πŸ‘9πŸ”₯8❀1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ“ŸπŸ“ŸAI-Designed Circuits with Deep RLπŸ“ŸπŸ“Ÿ

πŸ‘‰#Nvidia unveils an #AI to design circuits from scratch, smaller and faster than SOTA ones

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Parallel prefix circuits for Hi-Perf
βœ…RL framework to explore the circuit space
βœ…Smaller, Faster, Power-- from the scratch

More: https://bit.ly/3yY9dk7
🀯13πŸ‘5πŸ”₯3
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‘½ Neural I2I with a few shoots πŸ‘½

πŸ‘‰#Alibaba unveils a novel portrait stylization. Limited samples (∼100) -> HD outputs

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Calibration first, translation later
βœ…Balanced distribution to calibrate bias
βœ…Spatially semantic constraints via geometry
βœ…Source code and models soon available!

More: https://bit.ly/3IwOmHO
❀10πŸ‘5😱1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ€Ήβ€β™‚οΈ K-Means Mask Transformer πŸ€Ήβ€β™‚οΈ

πŸ‘‰#Google AI unveils kMaX-DeepLab, novel E2E method for segmentation

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…kMaX-DeepLab: k-means Mask Xformer
βœ…Rethinking relationship pixels / object
βœ…Cross-attention -> k-means clustering
βœ…The new SOTA on several dataset

More: https://bit.ly/3O2QV5I
πŸ”₯11πŸ‘2πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
β˜€οΈ 4D Neural Relightable Humans β˜€οΈ

πŸ‘‰Relighting4D: free-viewpoints relighting of humans under unknown illuminations

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Relight dynamic, free viewpoints
βœ…Disentangled reflectance/geometry
βœ…SOTA on synthetic/real datasets
βœ…Code/models under MIT License

More: https://bit.ly/3RF3yH9
πŸ”₯9πŸ‘2
This media is not supported in your browser
VIEW IN TELEGRAM
🍰 Long-Term Object Segmentation 🍰

πŸ‘‰XMem: object segmentation for long clips with unified feature memory stores

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Inspired by Atkinson–Shiffrin model
βœ…Stores with different temporal scales
βœ…Memory consolidation algorithm
βœ…Compact/powerful long-term memory
βœ…Source code and models available

More: https://bit.ly/3PP0EOn
🀯16πŸ‘5πŸ‘3
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯Grand Unification of Object TrackingπŸ”₯

πŸ‘‰UNICORN: unified method for SOT, MOT, VOS, & MOTS with a single neural net. 🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Great unification for 4 tracking tasks
βœ…Bridging methods / pixel-wise corresp.
βœ…SOTA on 8 challenging benchmarks
βœ…Source code under MIT License

More: https://bit.ly/3o74h6g
πŸ‘13πŸ”₯3🀯1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯OmniBenchmark: CV beyond ImageNetπŸ”₯

πŸ‘‰ 21 realms, 7,000+ concepts and 1M+ images. Far beyond ImageNet!

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…vs. ImageNet: 2.5x realms, 9x concepts
βœ…Conciseness: no concept overlapping
βœ…ReCo: Relational Contrastive Learning
βœ…New supervised contrastive learning SOTA

More: https://bit.ly/3RJRKU0
πŸ”₯11🀩3
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ’£ HD Neural Avatar @130FPS πŸ’£

πŸ‘‰Samsung unveils MegaPortraits: novel one-shot creation of HD neural human avatar

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…One-shot neural avatars, SOTA up 512p
βœ…"Upgrading" to megapixel via more pics
βœ…First Neural Head Avatars in HD
βœ…Up to to 130 FPS via #GPU

More: https://bit.ly/3oboWWT
πŸ”₯22πŸ‘1πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🦚 TimeLens++: Event-based Interpolation 🦚

πŸ‘‰Novel event-based interpolation with non-linear flow & multi-scale fusion

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Novel motion spline estimator
βœ…Non-linear continuous event/frames flow
βœ…Multi-feature fusion, gated compression
βœ…Novel hybrid dataset with 100+ videos

More: https://bit.ly/3yJyY6g
πŸ”₯16πŸ‘4
This media is not supported in your browser
VIEW IN TELEGRAM
πŸͺ°NUWA-Infinity is out!πŸͺ°

πŸ‘‰βˆž generation by #Microsoft: arbitrarily-sized HD images and long videos 🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Unconditional Image Gen.
βœ…Text-to-Image/Text-to-Clip
βœ…Animation / Out-painting
βœ…Hi-res, arbitrary long clip
βœ…NCP for patches caching

More: https://bit.ly/3zmBf9f
πŸ”₯7πŸ‘2❀1πŸ‘1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯ #AIwithPapers: we are 3,500+! πŸ”₯

πŸ’™πŸ’› Ready for YOLO 10, 11, Ο€, ∞, Ξ¨, and more? The more we are, the faster we catch'em all πŸ’™πŸ’›

😈 Invite your friends -> https://t.iss.one/AI_DeepLearning
πŸ‘12❀10😁5πŸ”₯3
This media is not supported in your browser
VIEW IN TELEGRAM
🎷🎷OMNI3D: #3D Objects in the Wild🎷🎷

πŸ‘‰#3D detection: 234k images, 3M+ instances & 97 categories

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…OMNI3D from publicly released dataset
βœ…234k pics, 3M+ annotation with 3D box
βœ…97 categories such as sofa, table, cars
βœ…Fast (450x) and exact algorithm for IoU
βœ…Cube R-CNN: novel 3D object detector

More: https://bit.ly/3cznjzG
πŸ‘11
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‘ΉMultiface Neural Rendering πŸ‘Ή

πŸ‘‰A new multi-view, Hi-Res data collected at #META Reality Labs for neural face

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Mugsy, large scale multi-cam apparatus
βœ…High-Res sync facial performance
βœ…Closing the gap in accessing HQ data
βœ…Suitable for #VR & #mixedreality

More: https://bit.ly/3b6XfeL
🀯8πŸ‘3