AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
135 photos
247 videos
14 files
1.31K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿก Animate Anyone: new SOTA! ๐Ÿก

๐Ÿ‘‰Alibaba unveils Animate Anyone: novel #AI for transforming character images into animated videos controlled by desired pose sequences. Animating any character image into a video, unconstrained by specific domains ๐Ÿš€

๐Ÿ‘‰Review https://t.ly/qCahZ
๐Ÿ‘‰Paper https://lnkd.in/d-zi8EZ6
๐Ÿ‘‰Project https://lnkd.in/djwjQRvq
๐Ÿ‘‰Repo https://lnkd.in/dDMkjnKz
๐Ÿคฏ22๐Ÿ‘8๐Ÿ”ฅ4โšก1โค1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”Ž Generative Powers of Ten ๐Ÿ”

๐Ÿ‘‰A text-to-image model to generate consistent content across multiple image scales, enabling extreme semantic zooms into a scene. From universe to a human cell ๐Ÿคฏ

๐Ÿ‘‰Review https://t.ly/2DG44
๐Ÿ‘‰Paper https://lnkd.in/eDcSpU59
๐Ÿ‘‰Project https://lnkd.in/e6NKu8n9
๐Ÿคฏ21โค4๐Ÿ”ฅ3๐Ÿ‘2๐Ÿ˜ฑ1
Hello everybody,
a lot of you asked me to re-open the sharing of the contents to involve more people. I want to follow your suggestion, hope you will enjoy this new mood!

๐Ÿ‘ FREE TO FORWARD TO OTHER TELEGRAM CHANNELS

๐Ÿ”ฅ NO COPY OF THE POSTS
๐Ÿ”ฅ NO COMMERCIAL USAGE
๐Ÿ”ฅ NO UNRESPECTFUL USAGE

โš ๏ธ UNDO THE FORWARDING OPTION AT THE FIRST VIOLATION โš ๏ธ
โค19๐Ÿ‘10๐Ÿ‘3๐Ÿฅฐ1๐Ÿพ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿฉฐ Magic Animating Human ๐Ÿฉฐ

๐Ÿ‘‰MagicAnimate: the new SOTA in human animation. Code available: let's dance!

๐Ÿ‘‰Review https://t.ly/Oq7Za
๐Ÿ‘‰Paper https://lnkd.in/dSUbGgCs
๐Ÿ‘‰Project https://lnkd.in/dkVFf-SV
๐Ÿ‘‰Code https://lnkd.in/dj2dbzdg
๐Ÿ‘‰Demo https://lnkd.in/dHEKPE9q
๐Ÿคฏ6โค2๐Ÿ‘1๐Ÿ”ฅ1๐Ÿฅฐ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅ EfficientSAM: 20x faster Segment Anything ๐Ÿ”ฅ

๐Ÿ‘‰Meta AI Research unveils a novel family of SAM-like models, light-weight SAM models with SOTA quality-efficiency trade-offs. Up to 20x faster!

๐Ÿ‘‰Review https://t.ly/966QS
๐Ÿ‘‰Paper https://lnkd.in/duijp_Rh
๐Ÿ‘‰Project https://lnkd.in/dW-p2CuH
๐Ÿ‘‰Code https://lnkd.in/dAbZaB2t
๐Ÿ‘‰Demo https://lnkd.in/d-tjKiUd
๐Ÿ”ฅ15โค4๐Ÿ‘4๐Ÿคฏ2
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿซถ3D Hands with Transformers๐Ÿซถ

๐Ÿ‘‰ HaMeR is a robust and accurate Hand Mesh Recovery from images and video frames, based on Transformer architecture. It's the new SOTA.

๐Ÿ‘‰Review https://t.ly/YtAW8
๐Ÿ‘‰Paper https://arxiv.org/pdf/2312.05251.pdf
๐Ÿ‘‰Project https://geopavlakos.github.io/hamer
๐Ÿ‘‰Demo huggingface.co/spaces/geopavlakos/HaMeR
๐Ÿ‘‰Colab colab.research.google.com/drive/1rQbQzegFWGVOm1n1d-S6koOWDo7F2ucu
๐Ÿ‘10โค1๐Ÿ‘1๐Ÿคฏ1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿชฉ DreaMoving: Human Dancer ๐Ÿชฉ

๐Ÿ‘‰Alibaba strikes again with DreaMoving: a diffusion-based controllable video generation framework to produce HQ customized human videos.

๐Ÿ‘‰Review https://t.ly/BD_Yf
๐Ÿ‘‰Paper https://lnkd.in/gepP6Rjw
๐Ÿ‘‰Project https://lnkd.in/gwm72cfS
๐Ÿ‘‰Repo (empty) https://lnkd.in/gsc2Qt-F
๐Ÿ‘7๐Ÿ’ฉ6โค2๐Ÿฅฐ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ“ฒ EdgeSAM: Mobile 40x SAM ๐Ÿ“ฒ

๐Ÿ‘‰A novel hyper-optimized version of SAM for mobile devices such as #Iphone. Pure CNNs backbone (better suitable for ANE), up to 40x faster. Code available ๐Ÿ˜‰

๐Ÿ‘‰Review https://t.ly/m_vLH
๐Ÿ‘‰Paper https://lnkd.in/gHZVZN2x
๐Ÿ‘‰Project https://lnkd.in/gK8qEK8p
๐Ÿ‘‰Repo https://lnkd.in/gj6YAGNv
๐Ÿ‘‰Hugging Face https://lnkd.in/gUUHJvxz
๐Ÿ”ฅ20โšก2โค2๐Ÿคฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸชผPatchFusion: SOTA Mono-Depth๐Ÿชผ

๐Ÿ‘‰PatchFusion: novel end-to-end tile-based framework for hi-res monocular metric depth estimation. It's the new SOTA in metric depth estimation from mono. Code & Demo on Hugging Face able ๐Ÿ”ฅ

๐Ÿ‘‰Review https://t.ly/hv3yT
๐Ÿ‘‰Paper https://lnkd.in/d9dXP7iP
๐Ÿ‘‰Project https://lnkd.in/dQcvVJSx
๐Ÿ‘‰Repo https://lnkd.in/dW2GdVR5
๐Ÿ‘‰Demo https://lnkd.in/dFW-gAiY
๐Ÿ”ฅ10โค5๐Ÿ‘1๐Ÿคฏ1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ’ƒOutfit Anyone: Ultra-HQ VTO๐Ÿ’ƒ

๐Ÿ‘‰Alibaba unveils Outfit Anyone: a two-stream conditional diffusion able to adeptly handle garment deformation for more lifelike results in VOT. Extra: Outfit Anyone + Animate Anyone for outfit + motion generation of any character. NO CODE / NO PAPER / DEMO AVAILABLE :)

๐Ÿ‘‰Review https://t.ly/o6UR9
๐Ÿ‘‰Demo https://lnkd.in/dpQYdXhc
๐Ÿ‘‰Repo (empty) https://lnkd.in/dBsNST6r
๐Ÿคฏ10๐Ÿ‘4โค3๐Ÿ”ฅ2
๐Ÿ”ฅ #AIwithPapers: we are 8k+ ๐Ÿ”ฅ

๐Ÿ‘‰ After flirting with #ChatGpt for months, you back in love with this channel. I felt bad, but I forgive you ๐Ÿงก

๐Ÿ˜ˆ Hey Telegram Premium Subscribers, what about boosting us? Click: https://t.iss.one/AI_DeepLearning?boost

๐Ÿ˜ˆ Invite -> https://t.iss.one/AI_DeepLearning
โค16๐Ÿคฃ7๐Ÿ”ฅ1๐Ÿฅฐ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸงŠ Depth Conditioning ๐ŸงŠ

๐Ÿ‘‰LooseControl to control the generative image modeling process. Layout by boundaries and #3D box control via object locations (approximate bounding boxes)

๐Ÿ‘‰Review https://t.ly/9y72m
๐Ÿ‘‰Paper https://arxiv.org/pdf/2312.03079.pdf
๐Ÿ‘‰Project https://shariqfarooq123.github.io/loose-control/
๐Ÿ‘‰Repo https://github.com/shariqfarooq123/LooseControl
๐Ÿ”ฅ14โค6๐Ÿคฏ4๐Ÿ‘1๐Ÿฅฐ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ–ฒ๏ธ Amodal Tracking Any Object ๐Ÿ–ฒ๏ธ

๐Ÿ‘‰Amodal tracking": inferring complete object boundaries, even when certain portions are occluded. New benchmark & approach, 2x better than SOTA in people tracking ๐Ÿ”ฅ

๐Ÿ‘‰Review https://t.ly/Rc6Ku
๐Ÿ‘‰Paper https://lnkd.in/d39rFYT4
๐Ÿ‘‰Project https://lnkd.in/d7bkEcni
๐Ÿ‘‰(empty) Repo https://lnkd.in/dTsNKdfz
โค16๐Ÿคฏ8๐Ÿ”ฅ3๐Ÿ‘2๐Ÿ‘1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿšฟ Event-Cam (1000 fps) Hands ๐Ÿšฟ

๐Ÿ‘‰Ev2Hands, the first method for the 3D reconstruction of two interacting hands from a single event camera. Code available.

๐Ÿ‘‰Review https://t.ly/YpQpX
๐Ÿ‘‰Paper arxiv.org/pdf/2312.14157.pdf
๐Ÿ‘‰Project 4dqv.mpi-inf.mpg.de/Ev2Hands
๐Ÿ‘‰Repo github.com/Chris10M/Ev2Hands
๐Ÿ”ฅ3โค2๐Ÿ‘2๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸŽ„UniSDF: Unifying Neural Representations๐ŸŽ„

๐Ÿ‘‰UniSDF: novel general purpose 3D reconstruction for large complex scenes with reflections. SOTA on DTU, Shiny Blender, Mip-NeRF 360 and Ref-NeRF dataset.

๐Ÿ‘‰Review https://t.ly/2QEul
๐Ÿ‘‰Paper https://arxiv.org/pdf/2312.13285.pdf
๐Ÿ‘‰Project https://fangjinhuawang.github.io/UniSDF/
๐Ÿ‘‰Repo: No code :(
๐Ÿ”ฅ7๐Ÿ‘2โค1๐Ÿฅฐ1๐Ÿคฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸชฎHAAR: Text-Driven Generative Hairstyles๐Ÿชฎ

๐Ÿ‘‰ HAAR: new strand-based generative model for #3D human hairstyles driven by textual input.

๐Ÿ‘‰Review https://t.ly/L38iD
๐Ÿ‘‰Project https://haar.is.tue.mpg.de/
๐Ÿ‘‰Paper https://arxiv.org/pdf/2312.11666.pdf
๐Ÿ‘‰Repo coming
๐Ÿคฏ4๐Ÿพ3๐Ÿ‘2๐Ÿ”ฅ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸชฒUniRef++: Segment Every Reference๐Ÿชฒ

๐Ÿ‘‰ UniRef++ is a unified model for RIS, FSS, RVOS & VOS. Code available!

๐Ÿ‘‰Review https://t.ly/OxtOx
๐Ÿ‘‰Paper https://lnkd.in/eTrmDTK3
๐Ÿ‘‰Repo https://lnkd.in/etfTm4Wq
๐Ÿ‘11โค3๐Ÿคฏ3โšก1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿˆš Seeing Through Occlusions ๐Ÿˆš

๐Ÿ‘‰Novel NSF to see through occlusions, reflection suppression & shadow removal.

๐Ÿ‘‰Review https://t.ly/5jcIG
๐Ÿ‘‰Project https://light.princeton.edu/publication/nsf
๐Ÿ‘‰Paper https://arxiv.org/pdf/2312.14235.pdf
๐Ÿ‘‰Repo https://github.com/princeton-computational-imaging/NSF
โค10๐Ÿคฏ7๐Ÿ”ฅ3๐Ÿพ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ‘ป Avatar Behind Occlusions ๐Ÿ‘ป

๐Ÿ‘‰Neural rendering for occluded in-the-wild mono-videos. Decoupling scenes in occlusion, human, and background.

๐Ÿ‘‰Review https://t.ly/8q__B
๐Ÿ‘‰Paper https://arxiv.org/pdf/2401.00431.pdf
๐Ÿ‘‰Project https://cs.stanford.edu/~xtiange/projects/wild2avatar
๐Ÿ”ฅ11โค3๐Ÿ‘1๐Ÿคฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ• En3D: Generative 3D Humans ๐Ÿ•

๐Ÿ‘‰#Alibaba unveils En3D: generative scheme for sculpting HQ 3D human avatars. Zero-shot 3D generative scheme capable of producing visually realistic, geometrically accurate and content-wise diverse 3D humans without relying on pre-existing 3D or 2D asset.

๐Ÿ‘‰Review https://t.ly/nGmDK
๐Ÿ‘‰Project menyifang.github.io/projects/En3D/index.html
๐Ÿ‘‰Paper https://arxiv.org/pdf/2401.01173.pdf
๐Ÿ‘‰Repo (soon?) https://github.com/menyifang/En3D
๐Ÿคฏ5โค3๐Ÿ”ฅ1