AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
135 photos
247 videos
14 files
1.31K links
All the AI, with papers. Fresh updates every day on #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
🛵 ALPR via CTS-Matching 🛵

👉UIT unveils a neural approach (#YOLOv5 + tracking + rotation) to improve license plate recognition accuracy

😎Review https://t.ly/VP4BP
😎Paper arxiv.org/pdf/2307.11336.pdf
😎Code github.com/chequanghuy/Character-Time-series-Matching
🥬 Generative AI’s Next Frontiers 🥬

👉Hair simulation, 2D->3D animation, and much more. ~20 papers from #NVIDIA accepted into #SIGGRAPH2023

😎 Review https://t.ly/wgGin
🦀 simPLE: learning to grasp only with CAD 🦀

👉simPLE learns to pick, regrasp & place objects precisely, given only the object CAD model and no prior experience

😎Review https://t.ly/ab5pA
😎Paper arxiv.org/pdf/2307.13133.pdf
😎Project mcube.mit.edu/research/simPLE.html
🐧 Track Anything in HQ 🐧

👉HQTrack combines a video multi-object segmenter (VMOS) with a mask refiner (MR) to track anything in high quality

😎Review https://t.ly/hAvF2
😎Paper arxiv.org/pdf/2307.13974.pdf
😎Code github.com/jiawen-zhu/HQTrack
🥬Consensus-Adaptive RANSAC🥬

👉A RANSAC variant that learns to explore the parameter space via a novel attention layer

😎Review https://t.ly/eSLmD
😎Paper arxiv.org/pdf/2307.14030.pdf
😎Code github.com/cavalli1234/CA-RANSAC
🍡 DWPose: 2-stage Pose Distillation 🍡

👉 Tsinghua (+IDEA) unveils a novel two-stage pose distillation for whole-body pose estimation

😎Review https://t.ly/BSi20
😎Paper arxiv.org/pdf/2307.15880.pdf
😎Code github.com/IDEA-Research/DWPose
👗 Multimodal Neural Designer 👗

👉 Multimodal #AI that can generate novel fashion images conditioned on text, keypoints, and sketches

😎Review https://t.ly/zVk70
😎Paper arxiv.org/pdf/2304.02051.pdf
😎Code github.com/aimagelab/multimodal-garment-designer
📸 Computational Burst Photography in App 📸

👉#Google unveils a novel computational burst system to democratize professional photography on smartphones

😎Review https://t.ly/5ibJX
😎Paper arxiv.org/pdf/2308.01379.pdf
😎Project https://motion-mode.github.io
🎠Neural Closed-Loop Simulator🎠

👉A neural sensor simulator that takes a single recorded log captured by a sensor-equipped vehicle and converts it into a realistic closed-loop multi-sensor simulation

😎Review https://t.ly/EcRLc
😎Paper arxiv.org/pdf/2308.01898.pdf
😎Project https://waabi.ai/unisim/
🙏 A quick poll to help me improve the quality of the #computervision content.

Please give me your feedback here: https://t.ly/qXb4C

Thanks :)
🪛 HANDAL: Real-World Manipulable Objects 🪛

👉 #Nvidia unveils the HANDAL dataset for category-level object pose and affordance prediction

😎Review https://t.ly/MXZDI
😎Paper arxiv.org/pdf/2308.01477.pdf
😎Dataset wenbowen123.github.io/handaldataset
🎨 Interactive Neural Painting 🎨

👉 Novel AI-powered tool to help artists complete their artworks

😎Review https://t.ly/ELUb0
😎Paper arxiv.org/pdf/2307.16441.pdf
😎Project helia95.github.io/inp-website
😎Supp helia95.github.io/inp-website/supp_mat.html
👩‍🚀 HD Avatar via Text & Pose 👩‍🚀

👉 Generating expressive #3D avatars from nothing but text descriptions & pose guidance

😎Review https://t.ly/wrSMH
😎Paper arxiv.org/pdf/2308.03610.pdf
😎Project avatarverse3d.github.io
🐘 Controllable Synthetic Data (extending ImageNet) 🐘

👉#META's PUG is a new generation of interactive environments for representation learning. Extending ImageNet!

😎Review https://t.ly/nCYs0
😎Paper arxiv.org/pdf/2308.03977.pdf
😎Project pug.metademolab.com
😎Code github.com/facebookresearch/PUG
🌈 Tracking by Persistent Dynamic View Synthesis 🌈

👉Jointly addresses dynamic-scene novel-view synthesis and 6-DOF tracking of all dense scene elements

😎Review https://t.ly/Bc535
😎Paper arxiv.org/pdf/2308.09713.pdf
😎Project dynamic3dgaussians.github.io
😎Code github.com/JonathonLuiten/Dynamic3DGaussians
🛒 Digital Twins for AutoRetail Checkout 🛒

👉From #Nvidia, a novel approach that uses 3D assets to train 2D detection and tracking models for AutoRetail Checkout

😎Review https://t.ly/Ea7kt
😎Paper arxiv.org/pdf/2308.09708.pdf
😎Code github.com/yorkeyao/Automated-Retail-Checkout
🥎SportsMOT + MixSort = Sport MOT🥎

👉Nanjing just released an MOT dataset for sports scenes + the SOTA code/model for tracking (MixSort)

😎Review https://t.ly/NHUxL
😎Paper arxiv.org/pdf/2304.05170.pdf
😎Code github.com/MCG-NJU/MixSort
😎Project deeperaction.github.io/datasets/sportsmot.html
⚡️Feature Matching at Light Speed⚡️

👉LightGlue is a lightweight feature matcher with high accuracy and blazing fast inference

😎Review https://t.ly/jkecX
😎Paper arxiv.org/pdf/2306.13643.pdf
😎Code github.com/cvg/LightGlue
🕹️ CoDeF: Video Content Deformation Fields 🕹️

👉CoDeF is a new type of video representation for video-editing tasks

😎Review https://t.ly/PIVl-
😎Paper arxiv.org/pdf/2308.07926.pdf
😎Project https://qiuyu96.github.io/CoDeF
😎Code https://github.com/qiuyu96/CoDeF