AI with Papers - Artificial Intelligence & Deep Learning

🌱Pose-Format: All-in-One Pose🌱

👉 Pose-format: a comprehensive toolkit designed for human pose: unified, flexible, and easy-to-use

😎Review https://t.ly/rFrhq
😎Paper arxiv.org/pdf/2310.09066.pdf
😎Code github.com/sign-language-processing/pose

🔥9🤯4👍3😱2⚡1💩1

5.6K viewsedited 11:53

AI with Papers - Artificial Intelligence & Deep Learning

😻 CatFLW: Cat Neural Landmarks 😻

👉Landmark convolution neural network-based model for cat faces

😎Review https://t.ly/Y3mQ8
😎Paper arxiv.org/pdf/2305.04232.pdf
😎Dataset www.tech4animals.org/catflw

🥰17❤5👍3😱1🤩1😍1

5.69K views07:31

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🍡4K4D: Real-Time 4D at 4K🍡

👉THE new SOTA in view synthesis of dynamic 3D scenes at 4K. 30x faster, up to 400 FPS. Nuts!

😎Review https://t.ly/6ddQh
😎Paper arxiv.org/pdf/2310.11448.pdf
😎Project zju3dv.github.io/4k4d/
😎Code github.com/zju3dv/4K4D

🔥8👍5🤯5❤1😱1🤩1

6.15K viewsedited 07:11

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🛣️ Holistic Parking Detection (YOLO) 🛣️

👉 One-step Holistic Parking Slot Network: a tailor-made adaptation of YOLOv4 algorithm for all-shaped parking slot detection

😎Review https://t.ly/2l4ZG
😎Paper arxiv.org/pdf/2310.11629.pdf

🔥8🤯6❤4🤩3👍1🍾1

6.5K views06:36

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🍈 Cutie: VOS with heavy occlusions🍈

👉Cutie: novel VOS for challenging scenarios with heavy occlusions & distractors

😎Review https://t.ly/W3FR-
😎Paper arxiv.org/pdf/2310.12982.pdf
😎Project https://hkchengrex.com/Cutie
😎Code https://github.com/hkchengrex/Cutie

👍13🤣3❤1🤯1

6.97K views09:06

AI with Papers - Artificial Intelligence & Deep Learning

0:08

This media is not supported in your browser

VIEW IN TELEGRAM

🧡 Rotoscoping Prince Of Persia (1985) 🧡

👉 A rare footage for the animation of Prince of Persia (1989). Damn Romantic.

😎 More https://t.ly/xJife

❤17👍2👏2🥰1

5.75K views06:51

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🪛PACE: new SOTA Motion🪛

👉#Nvidia unveils the novel SOTA to estimate the human motion in a global scene from moving cams. Stunning results.

😎Review https://t.ly/20you
😎Project https://nvlabs.github.io/PACE
😎Paper https://arxiv.org/pdf/2310.13768.pdf

🤣5❤4🔥1🤯1

5.98K viewsedited 17:06

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🥤NanoSAM: SAM on low-cost boards🥤

👉NanoSAM is a Segment Anything variant capable of running in real-time on #NVIDIA Jetson Orin with TensorRT

😎Review https://t.ly/UErq_
😎Tutorial https://github.com/NVIDIA-AI-IOT/nanosam

🔥11👍1👏1🤯1

6.96K views06:48

AI with Papers - Artificial Intelligence & Deep Learning

0:05

This media is not supported in your browser

VIEW IN TELEGRAM

🧂 SOTA RGB-D Video Salient Object 🧂

👉 DCTNet+ (model) and RDVS(dataset) for a new SOTA in Video Saliency Object Detection

😎Review https://t.ly/DapLV
😎Code github.com/kerenfu/RDVS
😎Paper arxiv.org/pdf/2310.15482.pdf

🔥4👍1🤯1

7.12K views08:03

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

✌️ Relighted 3D Hands 🤞

👉#META unveils Re:InterHand: a large dataset of relighted 3D interacting hands

😎Review https://t.ly/I1dQk
😎Paper arxiv.org/pdf/2310.17768.pdf
😎Project mks0601.github.io/ReInterHand
😎Data github.com/mks0601/ReInterHand

🤯8❤1😱1

6.41K viewsedited 08:26

AI with Papers - Artificial Intelligence & Deep Learning

0:08

This media is not supported in your browser

VIEW IN TELEGRAM

🍄 Video Understanding with GPT-4V(ision) 🍄

👉 #Microsoft unveils MM-Vid, the most advanced video understanding framework (w/ #chatgpt4). Impressive results on long-form videos & intricate tasks such as audio description & multimodal high-level comprehension

😎Review https://t.ly/RISMm
😎Paper arxiv.org/pdf/2310.19773.pdf
😎Project https://multimodal-vid.github.io

🤯22👍9🔥2👏1😱1

6.5K viewsedited 10:28

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

👣 Foot via Synthetic Data 👣

👉 50,000 synthetic/photorealistic foot images + a novel SOTA library for foot

😎Review https://t.ly/TVanP
😎Paper https://arxiv.org/pdf/2310.18279.pdf
😎Project https://ollieboyne.github.io/FOUND
😎Code https://github.com/OllieBoyne/FOUND

🤣8👍4❤2🥰2🤩2

6.58K views07:57

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🚛 OYSTER: unsupervised detection w/ LIDAR 🚛

👉Waabi unveils OYSTER: a novel unsupervised object detection from LiDAR point clouds.

😎Review https://t.ly/EMi58
😎Project https://waabi.ai/oyster/
😎Paper arxiv.org/pdf/2311.02007.pdf

❤16👏3🔥2👍1

6.29K views08:37

AI with Papers - Artificial Intelligence & Deep Learning

🔥GPT-4 Pass the Turing Test?🔥

👉No. I mean...not yet. Read this Paper from UC San Diego👇

😎Review https://t.ly/o8HgM
😎Paper https://arxiv.org/pdf/2310.20216.pdf

❤4🔥3👍1🤩1

6K viewsedited 11:02

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🥻SF: Towards Virtual Cloth🥻

👉SEA AI Lab unveils a novel #AI to recovery the garment sewing patterns from daily photos for #AR / #VR worlds

😎Review https://t.ly/MwpAV
😎Project https://sewformer.github.io/
😎Paper https://arxiv.org/pdf/2311.04218.pdf
😎Code https://github.com/sail-sg/sewformer

👍4🔥2🥰2👏2🤯1🤩1

5.47K viewsedited 11:02

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🛋️ 3DiffTection: new SOTA 3D detection 🛋️

👉#Nvidia unveils 3DiffTection, the new SOTA for 3D object detection from single images. A powerful 3D detector powered by diffusion model

😎Review https://t.ly/PciXY
😎Paper https://arxiv.org/pdf/2311.04391.pdf
😎Code https://github.com/nv-tlabs/3DiffTection
😎Project research.nvidia.com/labs/toronto-ai/3difftection

🔥8❤6👍3😱3👏1

5.89K viewsedited 08:45

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🐪 30x Faster Neural Scenes 🐪

👉 NeuRas: realistic real-time novel-view synthesis of VERY large scenes (>10000 m2 ). 30× faster rendering than previous SOTA w/ comparable or better realism

😎Review https://t.ly/ELJSE
😎Paper https://arxiv.org/pdf/2311.05607.pdf
😎Project https://waabi.ai/NeuRas/

🔥9❤1👍1🤯1🤩1

5.88K viewsedited 07:44

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🔥 Hu.ma.ne #AI Pin is out! 🔥

👉Hu.ma.ne just launched #AI Pin: the new standalone AI-powered screenless device. Running on the GPT-4 LLMs, suitable for real-time translation. #AI-powered camera and laser projector

😎 More https://t.ly/IvoN7

❤6🔥4💩2👍1😱1

5.93K viewsedited 14:48

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🫀 Segmentation of Human 🫀

👉TotalSegmentator_v2: segmenting 104 anatomical structures (27 organs, 59 bones, 10 muscles, 8 vessels) in CT. Now suitable in 3D Slicer, open source platform for image visualization.

😎Review https://t.ly/yHMm1
😎Code https://lnkd.in/dvgrbsCE
😎Paper https://lnkd.in/dkwHuuzU

🔥14👍7🤯6😱2❤1🤩1

6.26K viewsedited 08:24

AI with Papers - Artificial Intelligence & Deep Learning

🪐 Spacecraft Pose Estimation 🪐

👉SnT (Luxembourg) unveils the most advanced event-based dataset for Spacecrafts: Unreal Engine + data from ICNS simulator + Real images + Real event data acquired in lab

😎Review https://t.ly/m8JPB
😎Paper https://lnkd.in/d_edvc3n
😎Project https://lnkd.in/dPp375aY

❤7🤯2👍1😱1

6.1K views07:56

AI with Papers - Artificial Intelligence & Deep Learning

0:03

This media is not supported in your browser

VIEW IN TELEGRAM

🔥Florence-2: unified Computer Vision🔥

👉#Microsoft announces Florence-2: novel foundation model with unified, prompt-based, representation for a large variety of #computervision & vision-language task. One backbone -> multiple tasks!

👉Review https://t.ly/pOins
👉Paper arxiv.org/pdf/2311.06242.pdf
👉Project www.microsoft.com/en-us/research/project/projectflorence/

😱9❤5🔥3👍1👏1🍾1

6.86K views12:37

About

Blog

Apps

Platform