AI with Papers - Artificial Intelligence & Deep Learning – Telegram

AI with Papers - Artificial Intelligence & Deep Learning

@AI_DeepLearning

15.5K subscribers

145 photos

255 videos

14 files

1.34K links

All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI

AI with Papers - Artificial Intelligence & Deep Learning

🔥YOLOv7: YOLO for segmentation🔥

👉YOLOv7 adds many new capabilities to the YOLO architecture family.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅YOLOv7 is not a successor in the YOLO family!
✅Framework for detection & segmentation
✅Applications based on #META detectron2
✅DETR & ViT detection out-of-box
✅Easy pipeline support through #ONNX
✅YOLOv4 + InstanceSegm. via single stage
✅The latest YOLOv6 training is supported!
✅Source code under GPL license.

More: https://bit.ly/3ysSJAp

🔥22🤯9👍5😁2

4.52K views · edited · 15:54

AI with Papers - Artificial Intelligence & Deep Learning

🔥🔥 HD Dichotomous Segmentation 🔥🔥

👉 A new task: highly accurate segmentation of objects in natural images.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅5,000+ HD images + accurate binary masks
✅IS-Net baseline in high-dim feature spaces
✅HCE: model vs. human interventions
✅Source code (should be) available soon

More: https://bit.ly/3ah2BDO

🔥13

3.62K views · 13:44

AI with Papers - Artificial Intelligence & Deep Learning

🔥🔥 Neural Segmentation on fire 🔥🔥

👉Novel methods for segmentation with mask calibration. Robustness++ in VOS.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Study: VOS robustness vs. perturbations
✅Adaptive object proxy (AOP) aggregation
✅Fewer errors due to unstable pixel-level matching
✅Code/models (should be) available soon

More: https://bit.ly/3yhIY6Q

👍15❤1🔥1

3.53K views · edited · 12:20

AI with Papers - Artificial Intelligence & Deep Learning

😊😎 Seq-DeepFake via Transformers 😎😊

👉S-Lab opens Seq-DeepFake: Detecting Sequential DeepFake Manipulation

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Seq-DeepFake: sequences of facial edits
✅Dataset: 85k #deepfake manipulation
✅Powerful Seq-DeepFake Transformer
✅Code, dataset and models available!

More: https://bit.ly/3ACQXhi

👍15🔥2❤1

3.22K views · 07:20

AI with Papers - Artificial Intelligence & Deep Learning

🦒 Text2LIVE: Text-Driven Neural Editing 🦒

👉Weizmann Institute & #Nvidia researchers unveil a novel #AI for text-driven editing of videos. Insane! 🤯

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Semantic edits of real-world videos
✅Edit layer: an RGBA layer representing the target edit
✅Edit layers synthesized from a single input
✅No masks or pre-trained generator needed

More: https://bit.ly/3NVP6aE

🤯18👍9🔥8❤1

3.14K views · 11:02

AI with Papers - Artificial Intelligence & Deep Learning

📟📟AI-Designed Circuits with Deep RL📟📟

👉#Nvidia unveils an #AI to design circuits from scratch, smaller and faster than SOTA ones

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Parallel prefix circuits for Hi-Perf
✅RL framework to explore the circuit space
✅Smaller, faster, lower power, from scratch

More: https://bit.ly/3yY9dk7

🤯13👍5🔥3

3.14K views · edited · 06:41

AI with Papers - Artificial Intelligence & Deep Learning

👽 Neural I2I with a few shots 👽

👉#Alibaba unveils a novel portrait stylization. Limited samples (∼100) -> HD outputs

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Calibration first, translation later
✅Balanced distribution to calibrate bias
✅Spatially semantic constraints via geometry
✅Source code and models available soon!

More: https://bit.ly/3IwOmHO

❤10👍5😱1

3.11K views · 12:05

AI with Papers - Artificial Intelligence & Deep Learning

🤹‍♂️ K-Means Mask Transformer 🤹‍♂️

👉#Google AI unveils kMaX-DeepLab, a novel E2E method for segmentation

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅kMaX-DeepLab: k-means Mask Xformer
✅Rethinking the relationship between pixels and objects
✅Cross-attention -> k-means clustering
✅The new SOTA on several datasets

More: https://bit.ly/3O2QV5I
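
To illustrate the "cross-attention -> k-means clustering" highlight, here is a minimal pure-Python sketch. It is not kMaX-DeepLab's actual code: it assumes dot-product affinity between cluster centers and pixel features, a hard per-pixel argmax over clusters in place of a softmax, and a plain mean update of the centers. The function name `kmeans_attention_step` and the toy data are hypothetical.

```python
def kmeans_attention_step(centers, pixels):
    """One k-means-style update: hard-assign each pixel to its best center.

    centers: list of K feature vectors (cluster centers / object queries)
    pixels:  list of N feature vectors (pixel features)
    """
    def dot(a, b):
        return sum(x * y for x, y in zip(a, b))

    # Cluster-wise argmax: each pixel picks the center with the highest affinity
    # (illustrative stand-in for the softmax used in standard cross-attention).
    assignments = [
        max(range(len(centers)), key=lambda k: dot(centers[k], p))
        for p in pixels
    ]

    # Update each center as the mean of the pixel features assigned to it.
    new_centers = []
    for k, center in enumerate(centers):
        members = [p for p, a in zip(pixels, assignments) if a == k]
        if not members:
            new_centers.append(list(center))  # keep empty clusters unchanged
            continue
        dim = len(center)
        new_centers.append(
            [sum(m[d] for m in members) / len(members) for d in range(dim)]
        )
    return assignments, new_centers


# Toy example: four 2-D "pixel" features and two cluster centers.
pixels = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
centers = [[1.0, 0.0], [0.0, 1.0]]
assignments, new_centers = kmeans_attention_step(centers, pixels)
print(assignments)
```

The assignments double as a per-pixel mask prediction in this toy setting: every pixel belongs to exactly one cluster.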

🔥11👍2👏1

3.15K views · edited · 14:58

AI with Papers - Artificial Intelligence & Deep Learning

☀️ 4D Neural Relightable Humans ☀️

👉Relighting4D: free-viewpoint relighting of humans under unknown illumination

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Relighting of dynamic humans from free viewpoints
✅Disentangled reflectance/geometry
✅SOTA on synthetic/real datasets
✅Code/models under MIT License

More: https://bit.ly/3RF3yH9

🔥9👍2

2.78K views · 16:20

AI with Papers - Artificial Intelligence & Deep Learning

🍰 Long-Term Object Segmentation 🍰

👉XMem: object segmentation for long clips with unified feature memory stores

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Inspired by Atkinson–Shiffrin model
✅Stores with different temporal scales
✅Memory consolidation algorithm
✅Compact/powerful long-term memory
✅Source code and models available

More: https://bit.ly/3PP0EOn

🤯16👍5👏3

2.85K views · 06:37

AI with Papers - Artificial Intelligence & Deep Learning

AI with Papers - Artificial Intelligence & Deep Learning

🦔 CogVideo: insane text-to-clip 🦔 👉CogVideo: 9B-parameters world's first large scale open-source text-to-video 😵 𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬: ✅Largest open-source T2C transformer ✅Finetuning of text-to-image model ✅Multi-frame-rate hierarchical training ✅From pretrained…

🔥🔥 Update 🔥🔥

👉Code https://github.com/THUDM/CogVideo

👉Demo https://wudao.aminer.cn/cogvideo/

More: https://bit.ly/3yP86BQ

🔥5❤4👍1

3.73K views · edited · 18:05

AI with Papers - Artificial Intelligence & Deep Learning

🔥Grand Unification of Object Tracking🔥

👉UNICORN: unified method for SOT, MOT, VOS, & MOTS with a single neural net. 🤯

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Great unification for 4 tracking tasks
✅Bridging methods / pixel-wise corresp.
✅SOTA on 8 challenging benchmarks
✅Source code under MIT License

More: https://bit.ly/3o74h6g

👍13🔥3🤯1😱1

2.84K views · 07:43

AI with Papers - Artificial Intelligence & Deep Learning

🔥OmniBenchmark: CV beyond ImageNet🔥

👉 21 realms, 7,000+ concepts and 1M+ images. Far beyond ImageNet!

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅vs. ImageNet: 2.5x realms, 9x concepts
✅Conciseness: no overlapping concepts
✅ReCo: Relational Contrastive Learning
✅New supervised contrastive learning SOTA

More: https://bit.ly/3RJRKU0

🔥11🤩3

2.77K views · edited · 12:30

AI with Papers - Artificial Intelligence & Deep Learning

💣 HD Neural Avatar @130FPS 💣

👉Samsung unveils MegaPortraits: novel one-shot creation of HD neural human avatar

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅One-shot neural avatars, SOTA up to 512p
✅"Upgrading" to megapixel via more pics
✅First Neural Head Avatars in HD
✅Up to 130 FPS via #GPU

More: https://bit.ly/3oboWWT

🔥22👍1👏1

3.02K views · 14:03

AI with Papers - Artificial Intelligence & Deep Learning

AI with Papers - Artificial Intelligence & Deep Learning

🧠 Bias in #AI, explained simple 🧠 👉Asking DallE-Mini to help me to show what the BIAS in #AI is 𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐞𝐝 𝐒𝐚𝐦𝐩𝐥𝐞𝐬: ✅Best eng.->men/Caucasians ✅Best doctors->men/Caucasians ✅Top CEOs->men/Caucasians ✅Chef, kitchen->men/Caucasians ✅Rich People->only Caucasians…

🔥Important update from #OpenAI🔥

👉 https://openai.com/blog/reducing-bias-and-improving-safety-in-dall-e-2/

Reducing bias and improving safety in DALL·E 2

Today, we are implementing a new technique so that DALL·E generates images of people that more accurately reflect the diversity of the world’s population.

👍10❤2

2.99K views · 10:29

AI with Papers - Artificial Intelligence & Deep Learning

🦚 TimeLens++: Event-based Interpolation 🦚

👉Novel event-based interpolation with non-linear flow & multi-scale fusion

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Novel motion spline estimator
✅Non-linear continuous event/frames flow
✅Multi-feature fusion, gated compression
✅Novel hybrid dataset with 100+ videos

More: https://bit.ly/3yJyY6g

🔥16👍4

3.08K views · edited · 12:51

AI with Papers - Artificial Intelligence & Deep Learning

🪰NUWA-Infinity is out!🪰

👉∞ generation by #Microsoft: arbitrarily-sized HD images and long videos 🤯

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Unconditional Image Gen.
✅Text-to-Image/Text-to-Clip
✅Animation / Out-painting
✅Hi-res, arbitrarily long clips
✅NCP for patch caching

More: https://bit.ly/3zmBf9f

🔥7👍2❤1👏1🤯1

3.28K views · edited · 12:55

AI with Papers - Artificial Intelligence & Deep Learning

🔥 #AIwithPapers: we are 3,500+! 🔥

💙💛 Ready for YOLO 10, 11, π, ∞, Ψ, and more? The more we are, the faster we catch'em all 💙💛

😈 Invite your friends -> https://t.iss.one/AI_DeepLearning

👍12❤10😁5🔥3

3.36K views · 13:37

AI with Papers - Artificial Intelligence & Deep Learning

🎷🎷OMNI3D: #3D Objects in the Wild🎷🎷

👉#3D detection: 234k images, 3M+ instances & 97 categories

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅OMNI3D built from publicly released datasets
✅234k pics, 3M+ annotations with 3D boxes
✅97 categories such as sofa, table, cars
✅Fast (450x) and exact algorithm for IoU
✅Cube R-CNN: novel 3D object detector

More: https://bit.ly/3cznjzG
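
For readers new to the metric behind the "exact algorithm for IoU" highlight, here is a minimal sketch of 3D intersection-over-union. It is a simplification and not the OMNI3D algorithm: it assumes axis-aligned boxes given as `(xmin, ymin, zmin, xmax, ymax, zmax)`, which makes the overlap a simple per-axis interval intersection. The function name `iou_3d_axis_aligned` and the box layout are hypothetical.

```python
def iou_3d_axis_aligned(box_a, box_b):
    """IoU of two axis-aligned 3D boxes (xmin, ymin, zmin, xmax, ymax, zmax)."""
    # Intersection volume: product of the overlap length along each axis.
    inter = 1.0
    for axis in range(3):
        lo = max(box_a[axis], box_b[axis])
        hi = min(box_a[axis + 3], box_b[axis + 3])
        if hi <= lo:
            return 0.0  # no overlap along this axis, so no overlap at all
        inter *= hi - lo

    def volume(b):
        return (b[3] - b[0]) * (b[4] - b[1]) * (b[5] - b[2])

    # Union = sum of volumes minus the doubly counted intersection.
    union = volume(box_a) + volume(box_b) - inter
    return inter / union


# Two 2x2x2 cubes offset by one unit along every axis.
print(iou_3d_axis_aligned((0, 0, 0, 2, 2, 2), (1, 1, 1, 3, 3, 3)))
```

Boxes rotated in 3D need a polyhedron intersection instead of the per-axis shortcut used here.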

👍11

3.08K views · 07:47

AI with Papers - Artificial Intelligence & Deep Learning

👹Multiface Neural Rendering 👹

👉A new multi-view, hi-res dataset collected at #META Reality Labs for neural face rendering

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Mugsy, large scale multi-cam apparatus
✅High-Res sync facial performance
✅Closing the gap in accessing HQ data
✅Suitable for #VR & #mixedreality

More: https://bit.ly/3b6XfeL

🤯8👍3

2.94K views · edited · 06:43