AI with Papers - Artificial Intelligence & Deep Learning – Telegram

@AI_DeepLearning

15.4K subscribers

All of AI, with papers. Fresh daily updates on #DeepLearning, #MachineLearning, LLMs, and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI

🪼 All You Need is SAM (+Flow) 🪼

👉Oxford unveils the new SOTA for moving object segmentation via SAM + Optical Flow. Two novel models & Source Code announced 💙

👉Review https://t.ly/ZRYtp
👉Paper https://lnkd.in/d4XqkEGF
👉Project https://lnkd.in/dHpmx3FF
👉Repo coming: https://github.com/Jyxarthur/

🛞 6Img-to-3D driving scenarios 🛞

👉EPFL (+ Continental) unveils 6Img-to-3D, a novel transformer-based encoder-renderer method to create unbounded 3D outdoor driving scenarios from only six images

👉Review https://shorturl.at/dZ018
👉Paper arxiv.org/pdf/2404.12378.pdf
👉Project 6img-to-3d.github.io/
👉Code github.com/continental/6Img-to-3D

🌹 Physics-Based 3D Video-Gen 🌹

👉PhysDreamer, a physics-based approach that leverages the object dynamics priors learned by video generation models. It enables realistic 3D interaction with objects

👉Review https://t.ly/zxXf9
👉Paper arxiv.org/pdf/2404.13026.pdf
👉Project physdreamer.github.io/
👉Code github.com/a1600012888/PhysDreamer

🎡 NER-Net: Seeing at Night-Time 🎡

👉Huazhong (+Beijing) unveils a novel event-based nighttime imaging solution under non-uniform illumination, plus a paired multi-illumination level real-world dataset. Repo online, code coming 💙

👉Review https://t.ly/Z9JMJ
👉Paper arxiv.org/pdf/2404.11884.pdf
👉Repo github.com/Liu-haoyue/NER-Net
👉Clip https://www.youtube.com/watch?v=zpfTLCF1Kw4

🌊 FlowMap: dense depth video 🌊

👉MIT (+CSAIL) unveils FlowMap, a novel E2E differentiable method that solves for precise camera poses, camera intrinsics, and per-frame dense depth of a video sequence. Source Code released 💙

👉Review https://t.ly/CBH48
👉Paper arxiv.org/pdf/2404.15259.pdf
👉Project cameronosmith.github.io/flowmap
👉Code github.com/dcharatan/flowmap

👗TELA: Text to 3D Clothed Human👗

👉 TELA is a novel approach for the new task of clothing-disentangled 3D human model generation from text. It unlocks many downstream applications (e.g., virtual try-on).

👉Review https://t.ly/6N7JV
👉Paper https://arxiv.org/pdf/2404.16748
👉Project https://jtdong.com/tela_layer/
👉Code https://github.com/DongJT1996/TELA

🪷 Tunnel Try-on: SOTA VTON 🪷

👉"Tunnel Try-on", the first diffusion-based video virtual try-on model that demonstrates SOTA performance in complex scenarios. No code announced :(

👉Review https://t.ly/joMtJ
👉Paper arxiv.org/pdf/2404.17571
👉Project mengtingchen.github.io/tunnel-try-on-page/

🏝️1000x Scalable Neural 3D Fields🏝️

👉Highly scalable neural 3D fields: a 1000x reduction in memory while maintaining speed and quality, 10 MB vs. 10 GB! Code released 💙

👉Review https://t.ly/sLTK5
👉Paper https://lnkd.in/dEYM8-t2
👉Project https://lnkd.in/djptdujx
👉Code https://lnkd.in/dcCnFZ2n

🌐3D Scenes w/ Depth Inpainting🌐

👉Oxford announced two novel contributions to the field of 3D scene generation: a new benchmark and a novel depth completion model. 🤗-Demo and Source Code released💙

👉Review https://t.ly/BKiny
👉Paper arxiv.org/pdf/2404.19758
👉Project research.paulengstler.com/invisible-stitch/
👉Code github.com/paulengstler/invisible-stitch
👉Demo huggingface.co/spaces/paulengstler/invisible-stitch

🌊 Diffusive 3D Human Recovery 🌊

👉Rutgers University unveils ScoreHMR at #CVPR24, a novel approach for 3D human pose and shape reconstruction. Impressive results.

👉Review https://t.ly/G0k2D
👉Paper https://arxiv.org/pdf/2403.09623
👉Code https://github.com/statho/ScoreHMR
👉Project https://statho.github.io/ScoreHMR/

🏷️DiffMOT (#CVPR24): diffusion-MOT🏷️

👉DiffMOT is a novel real-time diffusion-based MOT approach that tackles complex nonlinear motion. Impressive results & Source Code released 💙

👉Review https://t.ly/ztlHi
👉Paper https://lnkd.in/d4K3c-nt
👉Project https://diffmot.github.io/
👉Code github.com/Kroery/DiffMOT

🍏 XFeat: Neural Features Matching 🍏

👉XFeat (Accelerated Features) is a lightweight, accurate architecture for efficient visual correspondence. It revisits fundamental design choices in CNNs for detecting, extracting & matching local features

👉Review https://t.ly/ppb38
👉Paper arxiv.org/pdf/2404.19174
👉Code https://lnkd.in/dFzTpzN8
👉Project https://lnkd.in/d8JnV-iu

🦑 Hyper-Detailed Image Descriptions 🦑

👉#Google unveils ImageInWords (IIW), a carefully designed human-in-the-loop (HIL) annotation framework for curating hyper-detailed image descriptions, and a new dataset resulting from this process

👉Review https://t.ly/engkl
👉Paper arxiv.org/pdf/2405.02793
👉Repo github.com/google/imageinwords
👉Project google.github.io/imageinwords
👉Data huggingface.co/datasets/google/imageinwords

🔫 Free-Moving Reconstruction 🔫

👉EPFL (+#MagicLeap) unveils a novel approach for reconstructing a free-moving object from a monocular RGB clip. It allows free interaction with objects in front of a moving camera without relying on any prior, and optimizes the sequence globally without any segments. Great, but no code announced 🥺

👉Review https://t.ly/2xhtj
👉Paper arxiv.org/pdf/2405.05858
👉Project haixinshi.github.io/fmov/

💥FeatUp: Any Model at Any Resolution💥

👉FeatUp is a task- and model-agnostic framework to restore lost spatial information in deep features. It outperforms other methods in class activation map generation, transfer learning for segmentation & depth, and end-to-end training for semantic segmentation. Source Code released 💙

👉Review https://t.ly/Evq_g
👉Paper https://lnkd.in/gweaN4s6
👉Project https://lnkd.in/gWcGXdxt
👉Code https://lnkd.in/gweq5NY4

🐏AniTalker: Universal Talking Humans🐏

👉SJTU (+AISpeech) unveils AniTalker, a framework that transforms a single static portrait and input audio into animated talking videos with naturally flowing movements.

👉Review https://t.ly/MD4yX
👉Paper https://arxiv.org/pdf/2405.03121
👉Project https://x-lance.github.io/AniTalker/
👉Repo https://github.com/X-LANCE/AniTalker

👻 3D Human Motion from Text 👻

👉Zhejiang (+ANT) unveils a novel method to generate human motions containing accurate human-object interactions in 3D scenes based on textual descriptions. Code announced, coming 💙

👉Review https://t.ly/eOZnU
👉Paper https://arxiv.org/pdf/2405.07784
👉Project https://zju3dv.github.io/text_scene_motion/

🪬UHM: Authentic Hand by Phone🪬

👉 Meta unveils UHM, a novel high-fidelity 3D avatarization of your (yes, your own) hand. An adaptation pipeline fits the pre-trained UHM to a phone scan. Source Code released 💙

👉Review https://t.ly/fU5rA
👉Paper https://lnkd.in/dyGaiAnq
👉Code https://lnkd.in/d9B_XFAA

🔥EfficientTrain++: Efficient Foundation Visual Backbone Training🔥

👉Tsinghua unveils EfficientTrain++, a simple, general, surprisingly effective, off-the-shelf approach to reduce the training time of various popular models (e.g., ResNet, ConvNeXt, DeiT, PVT, Swin, CSWin, and CAFormer). Up to 3.0× faster on ImageNet-1K/22K without sacrificing accuracy. Source Code released 💙

👉Review https://t.ly/D8ttv
👉Paper https://arxiv.org/pdf/2405.08768
👉Code https://github.com/LeapLabTHU/EfficientTrain

🫀 EchoTracker: Tracking Echocardiography🫀

👉EchoTracker is a two-fold coarse-to-fine model that facilitates the tracking of queried points on a tissue surface across ultrasound image sequences. Source Code released 💙

👉Review https://t.ly/NyBe0
👉Paper https://arxiv.org/pdf/2405.08587
👉Code https://github.com/riponazad/echotracker/

🦕 Grounding DINO 1.5 Pro/Edge 🦕

👉Grounding DINO 1.5 is a suite of advanced open-set object detection models built to advance the "Edge" of open-set object detection. Source Code released under Apache 2.0 💙

👉Review https://t.ly/kS-og
👉Paper https://lnkd.in/dNakMge2
👉Code https://lnkd.in/djhnQmrm
