AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
135 photos
247 videos
13 files
1.31K links
All things AI, with papers. Fresh updates every day on #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
🧯Neural Focal Modulation VAR🧯

👉A novel architecture for video recognition that models both local and global context

😎Review https://t.ly/rF_fk
😎Paper arxiv.org/pdf/2307.06947.pdf
😎Project talalwasim.github.io/Video-FocalNets
😎Code github.com/TalalWasim/Video-FocalNets
🐈 Gen-AI as representation learner 🐈

👉DreamTeacher: a novel self-supervised feature representation learning framework that uses generative networks to pre-train downstream image backbones

😎Review https://t.ly/RL8iG
😎Paper arxiv.org/pdf/2307.07487.pdf
😎Project research.nvidia.com/labs/toronto-ai/DreamTeacher
#SelfDriving? It's all about weather!

👉A novel self-supervised monocular depth estimation (MDE) method that handles adverse weather in real-world autonomous driving

😎Review https://t.ly/tcLQW
😎Paper arxiv.org/pdf/2307.08357.pdf
😎Project kieran514.github.io/Robust-Depth-Project/
🦙 Llama-2: the Open-Source "ChatGPT" 🦙

👉GenAI at #Meta unveils Llama-2: a collection of LLMs ranging in scale from 7B to 70B parameters. A challenger to #chatgpt, but open.

😎Review https://t.ly/bLJgP
😎Paper https://t.ly/AOXru
😎Project https://ai.meta.com/llama
🍉 AltFreezing: new SOTA in deepfake detection 🍉

👉#Microsoft unveils AltFreezing: a single model that captures both spatial and temporal artifacts for more general face forgery detection

😎Review https://t.ly/mkIKX
😎Paper https://t.ly/z4KnJ
😎Code github.com/ZhendongWang6/AltFreezing
🪟META's Ultra-HD Data for #AR🪟

👉Aria Digital Twin: an egocentric dataset for detection/tracking, reconstruction/understanding, sim-to-real (S2R) learning, pose and more.

😎Review https://t.ly/MRPt1
😎Paper arxiv.org/pdf/2306.06362.pdf
😎Project www.projectaria.com/datasets/adt
😎Code github.com/facebookresearch/projectaria_tools
👩‍🦰 Ultra-Realistic Neural Hair 👩‍🦰

👉A novel method to reconstruct hair geometry at the strand level from monocular video or multi-view images

😎Review https://t.ly/6xZyp
😎Paper arxiv.org/pdf/2306.05872.pdf
😎Project samsunglabs.github.io/NeuralHaircut
😎Code github.com/SamsungLabs/NeuralHaircut
💪 Muscles in Action with #AI 💪

👉Muscles in Action (MIA): learning to incorporate muscle activity into human motion representations

😎Review https://t.ly/hUKub
😎Paper arxiv.org/pdf/2212.02978.pdf
😎Project musclesinaction.cs.columbia.edu
🪤 PAPR: Proximity Attention Point Rendering 🪤

👉PAPR: a fast point-based scene representation paired with a differentiable renderer

😎Review https://t.ly/yoI0g
😎Paper arxiv.org/pdf/2307.11086.pdf
😎Project https://zvict.github.io/papr
🪛 CAD-based Object Segmentation 🪛

👉 A novel three-stage approach to segment unseen objects in RGB images using their CAD models

😎Review https://t.ly/RtHLN
😎Paper arxiv.org/pdf/2307.11067.pdf
😎Code https://github.com/nv-nguyen/cnos
🛵 ALPR via CTS-Matching 🛵

👉UIT unveils a neural approach (#YOLO5 + tracking + rotation) to improve license plate recognition accuracy

😎Review https://t.ly/VP4BP
😎Paper arxiv.org/pdf/2307.11336.pdf
😎Code github.com/chequanghuy/Character-Time-series-Matching
🥬 Generative AI’s Next Frontiers 🥬

👉Hair simulation, 2D->3D animation, and much more. ~20 papers from #NVIDIA accepted into #SIGGRAPH2023

😎 Review https://t.ly/wgGin
🦀 simPLE: learning to grasp only with CAD 🦀

👉simPLE learns to pick, regrasp & place objects precisely, given only the object CAD model and no prior experience

😎Review https://t.ly/ab5pA
😎Paper arxiv.org/pdf/2307.13133.pdf
😎Project mcube.mit.edu/research/simPLE.html
🐧 Track Anything in HQ 🐧

👉HQTrack: a video multi-object segmenter (VMOS) plus a mask refiner (MR) to track anything

😎Review https://t.ly/hAvF2
😎Paper arxiv.org/pdf/2307.13974.pdf
😎Code github.com/jiawen-zhu/HQTrack
🥬Consensus-Adaptive RANSAC🥬

👉A RANSAC variant that learns to explore the parameter space via a novel attention layer

😎Review https://t.ly/eSLmD
😎Paper arxiv.org/pdf/2307.14030.pdf
😎Code github.com/cavalli1234/CA-RANSAC
🍡 DWPose: 2-stage Pose Distillation 🍡

👉 Tsinghua (+IDEA) unveils a novel two-stage pose distillation for whole-body pose estimation.

😎Review https://t.ly/BSi20
😎Paper arxiv.org/pdf/2307.15880.pdf
😎Code github.com/IDEA-Research/DWPose
👗 Multimodal Neural Designer 👗

👉 Multimodal #AI that can generate novel fashion images conditioned on text, keypoints, and sketches

😎Review https://t.ly/zVk70
😎Paper arxiv.org/pdf/2304.02051.pdf
😎Code github.com/aimagelab/multimodal-garment-designer
📸 Computational Burst Photography in App 📸

👉#Google unveils a novel computational burst photography system to democratize professional photography on smartphones

😎Review https://t.ly/5ibJX
😎Paper arxiv.org/pdf/2308.01379.pdf
😎Project https://motion-mode.github.io
🎠Neural Closed-Loop Simulator🎠

👉A neural sensor simulator that takes a single recorded log captured by a sensor-equipped vehicle and converts it into a realistic closed-loop multi-sensor simulation

😎Review https://t.ly/EcRLc
😎Paper arxiv.org/pdf/2308.01898.pdf
😎Project https://waabi.ai/unisim/
🙏 A quick poll to help me improve the quality of the #computervision content.

Please leave your feedback here: https://t.ly/qXb4C

Thanks :)