AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
136 photos
250 videos
14 files
1.31K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸชผPatchFusion: SOTA Mono-Depth๐Ÿชผ

๐Ÿ‘‰PatchFusion: novel end-to-end tile-based framework for hi-res monocular metric depth estimation. It's the new SOTA in metric depth estimation from mono. Code & Demo on Hugging Face able ๐Ÿ”ฅ

๐Ÿ‘‰Review https://t.ly/hv3yT
๐Ÿ‘‰Paper https://lnkd.in/d9dXP7iP
๐Ÿ‘‰Project https://lnkd.in/dQcvVJSx
๐Ÿ‘‰Repo https://lnkd.in/dW2GdVR5
๐Ÿ‘‰Demo https://lnkd.in/dFW-gAiY
๐Ÿ”ฅ10โค5๐Ÿ‘1๐Ÿคฏ1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ’ƒOutfit Anyone: Ultra-HQ VTO๐Ÿ’ƒ

๐Ÿ‘‰Alibaba unveils Outfit Anyone: a two-stream conditional diffusion able to adeptly handle garment deformation for more lifelike results in VOT. Extra: Outfit Anyone + Animate Anyone for outfit + motion generation of any character. NO CODE / NO PAPER / DEMO AVAILABLE :)

๐Ÿ‘‰Review https://t.ly/o6UR9
๐Ÿ‘‰Demo https://lnkd.in/dpQYdXhc
๐Ÿ‘‰Repo (empty) https://lnkd.in/dBsNST6r
๐Ÿคฏ10๐Ÿ‘4โค3๐Ÿ”ฅ2
๐Ÿ”ฅ #AIwithPapers: we are 8k+ ๐Ÿ”ฅ

๐Ÿ‘‰ After flirting with #ChatGpt for months, you back in love with this channel. I felt bad, but I forgive you ๐Ÿงก

๐Ÿ˜ˆ Hey Telegram Premium Subscribers, what about boosting us? Click: https://t.iss.one/AI_DeepLearning?boost

๐Ÿ˜ˆ Invite -> https://t.iss.one/AI_DeepLearning
โค16๐Ÿคฃ7๐Ÿ”ฅ1๐Ÿฅฐ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸงŠ Depth Conditioning ๐ŸงŠ

๐Ÿ‘‰LooseControl to control the generative image modeling process. Layout by boundaries and #3D box control via object locations (approximate bounding boxes)

๐Ÿ‘‰Review https://t.ly/9y72m
๐Ÿ‘‰Paper https://arxiv.org/pdf/2312.03079.pdf
๐Ÿ‘‰Project https://shariqfarooq123.github.io/loose-control/
๐Ÿ‘‰Repo https://github.com/shariqfarooq123/LooseControl
๐Ÿ”ฅ14โค6๐Ÿคฏ4๐Ÿ‘1๐Ÿฅฐ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ–ฒ๏ธ Amodal Tracking Any Object ๐Ÿ–ฒ๏ธ

๐Ÿ‘‰Amodal tracking": inferring complete object boundaries, even when certain portions are occluded. New benchmark & approach, 2x better than SOTA in people tracking ๐Ÿ”ฅ

๐Ÿ‘‰Review https://t.ly/Rc6Ku
๐Ÿ‘‰Paper https://lnkd.in/d39rFYT4
๐Ÿ‘‰Project https://lnkd.in/d7bkEcni
๐Ÿ‘‰(empty) Repo https://lnkd.in/dTsNKdfz
โค16๐Ÿคฏ8๐Ÿ”ฅ3๐Ÿ‘2๐Ÿ‘1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿšฟ Event-Cam (1000 fps) Hands ๐Ÿšฟ

๐Ÿ‘‰Ev2Hands, the first method for the 3D reconstruction of two interacting hands from a single event camera. Code available.

๐Ÿ‘‰Review https://t.ly/YpQpX
๐Ÿ‘‰Paper arxiv.org/pdf/2312.14157.pdf
๐Ÿ‘‰Project 4dqv.mpi-inf.mpg.de/Ev2Hands
๐Ÿ‘‰Repo github.com/Chris10M/Ev2Hands
๐Ÿ”ฅ3โค2๐Ÿ‘2๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸŽ„UniSDF: Unifying Neural Representations๐ŸŽ„

๐Ÿ‘‰UniSDF: novel general purpose 3D reconstruction for large complex scenes with reflections. SOTA on DTU, Shiny Blender, Mip-NeRF 360 and Ref-NeRF dataset.

๐Ÿ‘‰Review https://t.ly/2QEul
๐Ÿ‘‰Paper https://arxiv.org/pdf/2312.13285.pdf
๐Ÿ‘‰Project https://fangjinhuawang.github.io/UniSDF/
๐Ÿ‘‰Repo: No code :(
๐Ÿ”ฅ7๐Ÿ‘2โค1๐Ÿฅฐ1๐Ÿคฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸชฎHAAR: Text-Driven Generative Hairstyles๐Ÿชฎ

๐Ÿ‘‰ HAAR: new strand-based generative model for #3D human hairstyles driven by textual input.

๐Ÿ‘‰Review https://t.ly/L38iD
๐Ÿ‘‰Project https://haar.is.tue.mpg.de/
๐Ÿ‘‰Paper https://arxiv.org/pdf/2312.11666.pdf
๐Ÿ‘‰Repo coming
๐Ÿคฏ4๐Ÿพ3๐Ÿ‘2๐Ÿ”ฅ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸชฒUniRef++: Segment Every Reference๐Ÿชฒ

๐Ÿ‘‰ UniRef++ is a unified model for RIS, FSS, RVOS & VOS. Code available!

๐Ÿ‘‰Review https://t.ly/OxtOx
๐Ÿ‘‰Paper https://lnkd.in/eTrmDTK3
๐Ÿ‘‰Repo https://lnkd.in/etfTm4Wq
๐Ÿ‘11โค3๐Ÿคฏ3โšก1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿˆš Seeing Through Occlusions ๐Ÿˆš

๐Ÿ‘‰Novel NSF to see through occlusions, reflection suppression & shadow removal.

๐Ÿ‘‰Review https://t.ly/5jcIG
๐Ÿ‘‰Project https://light.princeton.edu/publication/nsf
๐Ÿ‘‰Paper https://arxiv.org/pdf/2312.14235.pdf
๐Ÿ‘‰Repo https://github.com/princeton-computational-imaging/NSF
โค10๐Ÿคฏ7๐Ÿ”ฅ3๐Ÿพ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ‘ป Avatar Behind Occlusions ๐Ÿ‘ป

๐Ÿ‘‰Neural rendering for occluded in-the-wild mono-videos. Decoupling scenes in occlusion, human, and background.

๐Ÿ‘‰Review https://t.ly/8q__B
๐Ÿ‘‰Paper https://arxiv.org/pdf/2401.00431.pdf
๐Ÿ‘‰Project https://cs.stanford.edu/~xtiange/projects/wild2avatar
๐Ÿ”ฅ11โค3๐Ÿ‘1๐Ÿคฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ• En3D: Generative 3D Humans ๐Ÿ•

๐Ÿ‘‰#Alibaba unveils En3D: generative scheme for sculpting HQ 3D human avatars. Zero-shot 3D generative scheme capable of producing visually realistic, geometrically accurate and content-wise diverse 3D humans without relying on pre-existing 3D or 2D asset.

๐Ÿ‘‰Review https://t.ly/nGmDK
๐Ÿ‘‰Project menyifang.github.io/projects/En3D/index.html
๐Ÿ‘‰Paper https://arxiv.org/pdf/2401.01173.pdf
๐Ÿ‘‰Repo (soon?) https://github.com/menyifang/En3D
๐Ÿคฏ5โค3๐Ÿ”ฅ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿค MagicVideo-V2 announced! ๐Ÿค

๐Ÿ‘‰#Bytedance announces a novel multi-stage pipeline capable of generating high-aesthetic videos from textual description

๐Ÿ‘‰Review https://t.ly/zIq4v
๐Ÿ‘‰Project https://lnkd.in/dKUrJPJd
๐Ÿ‘‰Paper https://lnkd.in/dixnN-kU
๐Ÿ”ฅ7โค1๐Ÿ‘1๐Ÿฅฐ1๐Ÿ’ฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅ #6D Foundation Pose ๐Ÿ”ฅ

๐Ÿ‘‰#Nvidia unveils FoundationPose, a novel (and unified) foundation model for 6D object pose estimation and tracking.

๐Ÿ‘‰Review https://t.ly/HGd4h
๐Ÿ‘‰Project https://lnkd.in/dPcnBKWm
๐Ÿ‘‰Paper https://lnkd.in/dixn_iHZ
๐Ÿ‘‰Code coming ๐Ÿฉท
๐Ÿ”ฅ12โค5๐Ÿ‘1๐Ÿคฏ1
๐ŸƒReplaceAnything: demo is out!๐Ÿƒ

๐Ÿ‘‰ReplaceAnything: ultra-high quality content replacement. The ultimate #AI solution for human, clothing & background replacement to change the e-commerce experience for vendors.

๐Ÿ‘‰Review https://t.ly/FMyvf
๐Ÿ‘‰Project https://lnkd.in/dcyZvP2b
๐Ÿ‘‰ModelScope https://lnkd.in/dU4x4nE6
๐Ÿ‘‰Hugging Face https://lnkd.in/dn3uXWgd
๐Ÿ‘‰Empty report https://lnkd.in/dcuGXd6c
๐Ÿ‘‰Paper coming?
โค11๐Ÿ‘3๐Ÿ‘2๐Ÿ˜1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿฅ› Transparent Object Tracking ๐Ÿฅ›

๐Ÿ‘‰Trans2k: transparent object tracking dataset of 2,000+ sequences with 100,000+ images, annotated by bounding boxes & segmentation mask.

๐Ÿ‘‰Review https://t.ly/mEI6O
๐Ÿ‘‰Paper https://lnkd.in/dsudY3DB
๐Ÿ‘‰Project https://lnkd.in/d48SSJJ3
๐Ÿ‘‰TOB https://lnkd.in/dykBUNfC
๐Ÿ”ฅ18๐Ÿคฏ7โค3๐Ÿ‘2๐Ÿ˜ฑ2๐Ÿ‘1
๐Ÿ’Š๐Ÿ’Š AGNOSTIC Object Counting ๐Ÿ’Š๐Ÿ’Š

๐Ÿ‘‰PseCo: combining SAM to segment all possible objects as mask proposals & CLIP to classify proposals to obtain accurate object counts. The new SOTA in both few-shot/zero-shot object counting/detection.

๐Ÿ‘‰Review https://t.ly/e4iza
๐Ÿ‘‰Paper https://lnkd.in/dbzMXKWG
๐Ÿ‘‰Repo https://lnkd.in/db9Q9Pse
๐Ÿ”ฅ17๐Ÿ‘5๐Ÿฅฐ1๐Ÿ‘1
๐Ÿ’ฅ Announcing #Py4Ai Conference๐Ÿ’ฅ

๐Ÿ‘‰ Super proud to unveil #Py4AI, the newest conference dedicated to exploring the depths of Python & AI. Py4AI is a 1-day free event for Python and Artificial Intelligence developers.

๐“๐ก๐ž ๐Ÿ๐ข๐ซ๐ฌ๐ญ ๐›๐š๐ญ๐œ๐ก ๐จ๐Ÿ ๐ฌ๐ฉ๐ž๐š๐ค๐ž๐ซ๐ฌ:
๐Ÿš€Merve Noyan | #HuggingFace ๐Ÿค—
๐Ÿš€Gabriele Lombardi | ARGO Vision
๐Ÿš€Amanda Cercas Curry | Uni. Bocconi
๐Ÿš€Piero Savastano | Cheshire Cat AI
๐Ÿš€Francesco Zuppichini | Zurich Insurance
๐Ÿš€Andrea Palladino, PhD | Sr. Data Scientist

๐Ÿ‘‰ More: https://www.linkedin.com/posts/visionarynet_py4ai-py4ai-python-activity-7152928716988243968-pOUn?utm_source=share&utm_medium=member_desktop
๐Ÿ‘10๐Ÿ‘2โค1๐Ÿฅฐ1๐Ÿคฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ’ƒTimeline Text-Driven Humans๐Ÿ’ƒ

๐Ÿ‘‰Novel challenge: timeline control for text-driven motion synthesis of 3D Humans.

๐Ÿ‘‰Review https://t.ly/HLm-N
๐Ÿ‘‰Paper https://lnkd.in/esaR_M_9
๐Ÿ‘‰Project https://lnkd.in/epCZDvFW
๐Ÿ‘‰Repo coming
๐Ÿ”ฅ13โค6๐Ÿ‘4๐Ÿ‘3๐Ÿคฉ1