AI with Papers - Artificial Intelligence & Deep Learning

🌵POCO: 3D HPS + Confidence🌵

👉 Novel framework for human pose and shape (HPS) estimation: #3D human body + confidence in a single feed-forward pass

😎Review https://t.ly/cDePe
😎Paper arxiv.org/pdf/2308.12965.pdf
😎Project https://poco.is.tue.mpg.de

🔥5👍3❤2🤯1😱1

5.26K views, edited 11:28

🌆 NeO360: NeRF for Sparse Outdoor 🌆

👉#Toyota (+ Georgia Tech) unveils NeO360: view synthesis of 360° outdoor scenes from a single or a few posed RGB images

😎Review https://t.ly/JDJZg
😎Paper arxiv.org/pdf/2308.12967.pdf
😎Project zubair-irshad.github.io/projects/neo360.html

❤13👍3🔥2🥰1🤯1

5.32K views, 15:04

🥕 Scenimefy: I-2-I for anime 🥕

👉S-Lab unveils a novel semi-supervised image-to-image (I-2-I) translation framework + an HD dataset for anime scenes

😎Review https://t.ly/IsdEG
😎Paper arxiv.org/pdf/2308.12968.pdf
😎Code https://github.com/Yuxinn-J/Scenimefy
😎Project https://yuxinn-j.github.io/projects/Scenimefy.html

🥰13❤2🔥1🍾1

5.32K views, edited 09:36

🐨 Watch Your Steps: Editing by Text 🐨

👉A new SOTA in text-guided image & scene editing via denoising diffusion models

😎Review https://t.ly/fv9wn
😎Paper arxiv.org/pdf/2308.08947.pdf
😎Project ashmrz.github.io/WatchYourSteps

❤4👍3🤯3🔥1

5.36K views, 12:26

💡 Relighting NeRF 💡

👉Neural implicit radiance representation for free viewpoint relighting of an object lit by a moving point light

😎Review https://t.ly/J-3_L
😎Project nrhints.github.io
😎Code github.com/iamNCJ/NRHints
😎Paper nrhints.github.io/pdfs/nrhints-sig23.pdf

🤯3👍2❤1⚡1🔥1

5.35K views, 12:16

🪶 ReST: Multi-Camera MOT 🪶

👉Novel reconfigurable two-step graph model for multi-camera multi-object video tracking (MC-MOT)

😎Review https://t.ly/3C5tb
😎Paper arxiv.org/pdf/2308.13229.pdf
😎Code github.com/chengche6230/ReST

🔥7❤3🤩2

5.45K views, 14:03

🌲MagicEdit: Magic Video Edit🌲

👉MagicEdit: explicitly disentangles content, structure & motion for high-fidelity, temporally coherent video editing

😎Report https://t.ly/tREX4
😎Paper arxiv.org/pdf/2308.14749.pdf
😎Project magic-edit.github.io
😎Code github.com/magic-research/magic-edit

🥰8❤4👍3🔥1😱1🤩1

6.37K views, edited 07:15

✂️ VideoCutLER: Simple UVIS ✂️

👉VideoCutLER is a simple unsupervised video instance segmentation (UVIS) method that does not rely on optical flow

😎Review https://t.ly/PBBjG
😎Paper arxiv.org/pdf/2308.14710.pdf
😎Project people.eecs.berkeley.edu/~xdwang/projects/CutLER
😎Code github.com/facebookresearch/CutLER/tree/main/videocutler

🔥8👍3❤2🤯1

6.47K views, edited 12:32

🐦 3D Pigeons Pose & Tracking 🐦

👉 3D-MuPPET: estimate and track 3D poses of pigeons from multiple views

😎Review https://t.ly/jfAJJ
😎Paper arxiv.org/pdf/2308.15316.pdf
😎Code github.com/alexhang212/3D-MuPPET/

🤣17🤯14👍4🥰2❤1🤩1

6.71K views, edited 13:04

🎍RoboTAP: Dense Tracking for Few-Shot Imitation🎍

👉RoboTAP: novel dense-tracking representation for few-shot imitation with robotic arms

😎Review https://t.ly/MCO_V
😎Paper arxiv.org/pdf/2308.15975.pdf
😎Project https://robotap.github.io/
😎Code github.com/deepmind/tapnet

🔥8👍2🤯2🤩1

7.05K views, edited 06:58

⛺FACET: Fairness in Computer Vision⛺

👉#META AI releases a large, publicly available dataset for classification, detection & segmentation, built to surface potential performance disparities & challenges across sensitive demographic attributes

😎Review https://t.ly/mKn-t
😎Paper arxiv.org/pdf/2309.00035.pdf
😎Dataset https://facet.iss.onetademolab.com/

🔥10❤6👍4👏1

6.43K views, 13:57

♊️ Doppelgangers in Structures ♊️

👉A novel learning-based approach for visual disambiguation: distinguishing illusory matches to produce correct, disambiguated #3D reconstructions

😎Review https://t.ly/9yLot
😎Paper arxiv.org/pdf/2309.02420.pdf
😎Code github.com/RuojinCai/Doppelgangers
😎Project doppelgangers-3d.github.io/

🔥8👍3🤯2👏1

6.57K views, edited 06:43

🍃 Tracking Anything with Decoupled VOS 🍃

👉A novel VOS approach that extends SAM for open-world video segmentation with no user input required

😎Review https://t.ly/xeobR
😎Paper arxiv.org/pdf/2309.03903.pdf
😎Project hkchengrex.com/Tracking-Anything-with-DEVA
😎Code github.com/hkchengrex/Tracking-Anything-with-DEVA
😎Colab https://colab.research.google.com/drive/1OsyNVoV_7ETD1zIE8UWxL3NXxu12m_YZ

🔥13👍6🤯4❤2😢1🤩1

6.75K views, edited 06:57

🪷 Diffusive Consistent Video Editing 🪷

👉 Weizmann Institute of Science unveils TokenFlow, a novel framework that harnesses a text-to-image diffusion model for text-driven video editing

😎Review https://t.ly/ru8km
😎Paper arxiv.org/pdf/2307.10373.pdf
😎Project diffusion-tokenflow.github.io
😎Code github.com/omerbt/TokenFlow

❤9👍6🔥2🤯1😱1😢1

5.9K views, 12:16

🔥🔥 #META's DINOv2 is now commercial! 🔥🔥

👉Universal features for image classification, instance retrieval, video understanding, depth estimation & semantic segmentation. Now suitable for commercial use.

😎Review https://t.ly/LNrGy
😎Paper arxiv.org/pdf/2304.07193.pdf
😎Code github.com/facebookresearch/dinov2
😎Demo dinov2.metademolab.com/

🔥15👍3❤1🤯1😱1

5.9K views, edited 07:09

🧄FreeMan: towards #3D Humans 🧄

👉FreeMan: the first large-scale, real-world, multi-view dataset for #3D human pose estimation. 11M frames!

😎Review https://t.ly/ICxpA
😎Paper arxiv.org/pdf/2309.05073.pdf
😎Project wangjiongw.github.io/freeman

👏6🤯4🥰1

6.3K views, 13:09

🦊 MagiCapture: HD Multi-Concept Portrait 🦊

👉KAIST unveils MagiCapture: integrating subject and style concepts to generate high-resolution portrait images using just a few subject and style references

😎Review https://t.ly/c9rOo
😎Paper https://arxiv.org/pdf/2309.06895.pdf

❤5🥰1

6.45K views, 06:49

⚽ Dynamic NeRFs for Soccer ⚽

👉SoccerNeRF: first attempt at applying "cheap" dynamic NeRFs to football, reconstructing soccer replays in space and time.

😎Review https://t.ly/Ywcvk
😎Paper arxiv.org/pdf/2309.06802.pdf
😎Project https://soccernerfs.isach.be/
😎Code github.com/iSach/SoccerNeRFs

🔥8❤4👍3🤩2🥰1

6.82K views, 13:31

☢️ GlueStick: Graph Neural Matching ☢️

👉GlueStick is a joint deep matcher for points and lines that leverages the connectivity information between nodes to better glue them together

😎Review https://t.ly/Atxqo
😎Paper arxiv.org/pdf/2304.02008.pdf
😎Code https://github.com/cvg/GlueStick

🔥11👍4❤1🤯1🤩1

6.12K views, 06:49

🫀CPR-Coach: Neural Cardiopulmonary Resuscitation🫀

👉CPR-Coach: fine-grained action recognition in cardiopulmonary resuscitation

😎Review https://t.ly/Qbg4K
😎Paper arxiv.org/pdf/2309.11718.pdf
😎Code github.com/Shunli-Wang/CPR-Coach
😎Project shunli-wang.github.io/CPR-Coach

❤7🔥3👏1

6.08K views, 13:34
