AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
135 photos
247 videos
14 files
1.31K links
All the AI with papers. Fresh updates every day on #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
🐀 MagicVideo-V2 announced! 🐀

👉#Bytedance announces a novel multi-stage pipeline capable of generating high-aesthetic videos from textual descriptions

👉Review https://t.ly/zIq4v
👉Project https://lnkd.in/dKUrJPJd
👉Paper https://lnkd.in/dixnN-kU
🔥 #6D Foundation Pose 🔥

👉#Nvidia unveils FoundationPose, a novel (and unified) foundation model for 6D object pose estimation and tracking.

👉Review https://t.ly/HGd4h
👉Project https://lnkd.in/dPcnBKWm
👉Paper https://lnkd.in/dixn_iHZ
👉Code coming 🩷
ReplaceAnything: demo is out!

👉ReplaceAnything: ultra-high quality content replacement. The ultimate #AI solution for human, clothing & background replacement to change the e-commerce experience for vendors.

👉Review https://t.ly/FMyvf
👉Project https://lnkd.in/dcyZvP2b
👉ModelScope https://lnkd.in/dU4x4nE6
👉Hugging Face https://lnkd.in/dn3uXWgd
👉Empty report https://lnkd.in/dcuGXd6c
👉Paper coming?
🥛 Transparent Object Tracking 🥛

👉Trans2k: a transparent object tracking dataset of 2,000+ sequences with 100,000+ images, annotated with bounding boxes & segmentation masks.

👉Review https://t.ly/mEI6O
👉Paper https://lnkd.in/dsudY3DB
👉Project https://lnkd.in/d48SSJJ3
👉TOB https://lnkd.in/dykBUNfC
💊💊 AGNOSTIC Object Counting 💊💊

👉PseCo combines SAM, which segments all possible objects as mask proposals, with CLIP, which classifies the proposals, to obtain accurate object counts. The new SOTA in both few-shot/zero-shot object counting/detection.

👉Review https://t.ly/e4iza
👉Paper https://lnkd.in/dbzMXKWG
👉Repo https://lnkd.in/db9Q9Pse
💥 Announcing #Py4AI Conference 💥

👉 Super proud to unveil #Py4AI, the newest conference dedicated to exploring the depths of Python & AI. Py4AI is a 1-day free event for Python and Artificial Intelligence developers.

The first batch of speakers:
🚀Merve Noyan | #HuggingFace 🤗
🚀Gabriele Lombardi | ARGO Vision
🚀Amanda Cercas Curry | Uni. Bocconi
🚀Piero Savastano | Cheshire Cat AI
🚀Francesco Zuppichini | Zurich Insurance
🚀Andrea Palladino, PhD | Sr. Data Scientist

👉 More: https://www.linkedin.com/posts/visionarynet_py4ai-py4ai-python-activity-7152928716988243968-pOUn?utm_source=share&utm_medium=member_desktop
💃 Timeline Text-Driven Humans 💃

👉Novel challenge: timeline control for text-driven motion synthesis of 3D Humans.

👉Review https://t.ly/HLm-N
👉Paper https://lnkd.in/esaR_M_9
👉Project https://lnkd.in/epCZDvFW
👉Repo coming
🫒 AlphaGeometry: Olympiad-level AI 🫒

👉 Theorem prover for Euclidean plane geometry that sidesteps the need for human demonstrations by synthesizing millions of theorems and proofs across different levels of complexity 🤯

👉Review https://t.ly/2-Z7C
👉Paper https://lnkd.in/g3QkqwCE
👉Blog https://lnkd.in/ge-mpM7q
👉Repo https://lnkd.in/gHjwks_9
🦠 XINC: Pixels to Neurons 🦠

👉eXplaining the Implicit Neural Canvas (XINC), from the University of Maryland, is a unified framework for explaining properties of implicit neural representations (INRs) by examining the strength of each neuron's contribution to each output pixel.

👉Review https://t.ly/wwAmz
👉Paper arxiv.org/pdf/2401.10217.pdf
👉Project namithap10.github.io/xinc
👉Repo github.com/namithap10/xinc
👽 One Model <-> All Segmentations 👽

👉 10+ different segmentation tasks in one framework, including image-level, video-level, interactive segmentation, & open-vocabulary segmentation. All in one!

👉Review https://t.ly/fywVz
👉Paper https://lnkd.in/dw3S4B74
👉Project https://lnkd.in/dzHT9v45
👉Repo https://lnkd.in/d6fDCnSp
😻 GARField: Group Anything 😻

👉 GARField is a novel approach for decomposing #3D scenes into a hierarchy of semantically meaningful groups from posed image inputs.

👉Review https://t.ly/6Hkeq
👉Paper https://lnkd.in/d28mfRcZ
👉Project https://lnkd.in/dzYdRNKy
👉Repo (coming) https://lnkd.in/d2VeRJCS
🔥 Depth Anything: new SOTA 🔥

👉Depth Anything: the new SOTA in monocular depth estimation (MDE), trained jointly on 1.5M labeled images and 62M+ unlabeled images.

👉Review https://t.ly/tCBwO
👉Paper https://lnkd.in/djx-9k2J
👉Project https://lnkd.in/dYetqZFa
👉Repo https://lnkd.in/d87CrUGv
👉Demo 🤗 https://lnkd.in/dJhvKBep
🎭 ULTRA-Realistic Avatar 🎭

👉Novel 3D avatar with enhanced fidelity of geometry, and superior quality of physically based rendering (PBR) textures without unwanted lighting.

👉Review https://t.ly/B3BEu
👉Project https://lnkd.in/dkUQHFEV
👉Paper https://lnkd.in/dtEQxrBu
👉Code coming 🩷
🔥 Lumiere: SOTA video-gen 🔥

👉#Google unveils Lumiere: Space-Time Diffusion Model for Realistic Video Generation. It's the new SOTA. Tasks: Text-to-Video, Video Stylization, Cinemagraphs & Video Inpainting.

👉Review https://t.ly/nalJR
👉Paper https://lnkd.in/d-PvrGjT
👉Project https://t.ly/gK8hz
🧪 SUPIR: SOTA restoration 🧪

👉SUPIR is the new SOTA in image restoration: it can restore blurry objects, define the material texture of objects, and adjust restoration based on high-level semantics.

👉Review https://t.ly/wgObH
👉Project https://supir.xpixel.group/
👉Paper https://lnkd.in/dZPYcUuq
👉Demo coming 🩷 but no code announced :(
🫧 SAM + Open Models 🫧

👉Grounded SAM: Grounding DINO as an open-set detector, combined with SAM. It can seamlessly integrate with other Open-World models to accomplish more intricate visual tasks.

👉Review https://t.ly/FwasQ
👉Paper arxiv.org/pdf/2401.14159.pdf
👉Code github.com/IDEA-Research/Grounded-Segment-Anything
👒 "Virtual Try-All" by #Amazon 👒

👉#Amazon announces "Diffuse to Choose": diffusion-based image-conditioned inpainting for virtual try-on (VTON). Virtually place any e-commerce item in any setting.

👉Review https://t.ly/at07Y
👉Paper https://lnkd.in/dxR7nGtd
👉Project diffuse2choose.github.io/
🦩 WildRGB-D: Objects in the Wild 🦩

👉#NVIDIA unveils a novel RGB-D object dataset captured in the wild: ~8,500 recorded objects, ~20,000 RGB-D videos, 46 categories with corresponding masks and 3D point clouds.

👉Review https://t.ly/WCqVz
👉Data github.com/wildrgbd/wildrgbd
👉Paper arxiv.org/pdf/2401.12592.pdf
👉Project wildrgbd.github.io/
🌋 EasyVolcap: Accelerating Neural Volumetric 🌋

👉Novel #PyTorch library for accelerating neural volumetric video capturing, reconstruction & rendering

👉Review https://t.ly/8BISl
👉Paper arxiv.org/pdf/2312.06575.pdf
👉Code github.com/zju3dv/EasyVolcap
Rock-Track announced!

👉Rock-Track: the evolution of Poly-MOT, the previous SOTA tracking-by-detection framework for 3D MOT.

👉Review https://t.ly/hC0ak
👉Repo (coming) https://lnkd.in/dtDkPwCC
👉Paper coming