This media is not supported in your browser
    VIEW IN TELEGRAM
  ๐ฑPose-Format: All-in-One Pose๐ฑ 
 
๐ Pose-format: a comprehensive toolkit designed for human pose: unified, flexible, and easy-to-use
 
๐Review https://t.ly/rFrhq
๐Paper arxiv.org/pdf/2310.09066.pdf
๐Code github.com/sign-language-processing/pose
๐ Pose-format: a comprehensive toolkit designed for human pose: unified, flexible, and easy-to-use
๐Review https://t.ly/rFrhq
๐Paper arxiv.org/pdf/2310.09066.pdf
๐Code github.com/sign-language-processing/pose
๐ฅ9๐คฏ4๐3๐ฑ2โก1๐ฉ1
  ๐ป CatFLW: Cat Neural Landmarks ๐ป 
 
๐Landmark convolution neural network-based model for cat faces
 
๐Review https://t.ly/Y3mQ8
๐Paper arxiv.org/pdf/2305.04232.pdf
๐Dataset www.tech4animals.org/catflw
๐Landmark convolution neural network-based model for cat faces
๐Review https://t.ly/Y3mQ8
๐Paper arxiv.org/pdf/2305.04232.pdf
๐Dataset www.tech4animals.org/catflw
๐ฅฐ17โค4๐3๐ฑ1๐คฉ1๐1
  This media is not supported in your browser
    VIEW IN TELEGRAM
  ๐ก4K4D: Real-Time 4D at 4K๐ก 
 
๐THE new SOTA in view synthesis of dynamic 3D scenes at 4K. 30x faster, up to 400 FPS. Nuts!
 
๐Review https://t.ly/6ddQh
๐Paper arxiv.org/pdf/2310.11448.pdf
๐Project zju3dv.github.io/4k4d/
๐Code github.com/zju3dv/4K4D
๐THE new SOTA in view synthesis of dynamic 3D scenes at 4K. 30x faster, up to 400 FPS. Nuts!
๐Review https://t.ly/6ddQh
๐Paper arxiv.org/pdf/2310.11448.pdf
๐Project zju3dv.github.io/4k4d/
๐Code github.com/zju3dv/4K4D
๐ฅ8๐5๐คฏ5โค1๐ฑ1๐คฉ1
  This media is not supported in your browser
    VIEW IN TELEGRAM
  ๐ฃ๏ธ Holistic Parking Detection (YOLO) ๐ฃ๏ธ 
 
๐ One-step Holistic Parking Slot Network: a tailor-made adaptation of YOLOv4 algorithm for all-shaped parking slot detection
 
๐Review https://t.ly/2l4ZG
๐Paper arxiv.org/pdf/2310.11629.pdf
๐ One-step Holistic Parking Slot Network: a tailor-made adaptation of YOLOv4 algorithm for all-shaped parking slot detection
๐Review https://t.ly/2l4ZG
๐Paper arxiv.org/pdf/2310.11629.pdf
๐ฅ8๐คฏ6โค4๐คฉ3๐1๐พ1
  This media is not supported in your browser
    VIEW IN TELEGRAM
  ๐ Cutie: VOS with heavy occlusions๐ 
 
๐Cutie: novel VOS for challenging scenarios with heavy occlusions & distractors
 
๐Review https://t.ly/W3FR-
๐Paper arxiv.org/pdf/2310.12982.pdf
๐Project https://hkchengrex.com/Cutie
๐Code https://github.com/hkchengrex/Cutie
๐Cutie: novel VOS for challenging scenarios with heavy occlusions & distractors
๐Review https://t.ly/W3FR-
๐Paper arxiv.org/pdf/2310.12982.pdf
๐Project https://hkchengrex.com/Cutie
๐Code https://github.com/hkchengrex/Cutie
๐13๐คฃ3โค1๐คฏ1
  This media is not supported in your browser
    VIEW IN TELEGRAM
  ๐งก Rotoscoping Prince Of Persia (1985) ๐งก 
 
๐ A rare footage for the animation of Prince of Persia (1989). Damn Romantic.
๐ More https://t.ly/xJife
๐ A rare footage for the animation of Prince of Persia (1989). Damn Romantic.
๐ More https://t.ly/xJife
โค17๐2๐2๐ฅฐ1
  This media is not supported in your browser
    VIEW IN TELEGRAM
  ๐ชPACE: new SOTA Motion๐ช  
  
๐#Nvidia unveils the novel SOTA to estimate the human motion in a global scene from moving cams. Stunning results.
 
๐Review https://t.ly/20you
๐Project https://nvlabs.github.io/PACE
๐Paper https://arxiv.org/pdf/2310.13768.pdf
๐#Nvidia unveils the novel SOTA to estimate the human motion in a global scene from moving cams. Stunning results.
๐Review https://t.ly/20you
๐Project https://nvlabs.github.io/PACE
๐Paper https://arxiv.org/pdf/2310.13768.pdf
๐คฃ5โค4๐ฅ1๐คฏ1
  This media is not supported in your browser
    VIEW IN TELEGRAM
  ๐ฅคNanoSAM:  SAM on low-cost boards๐ฅค 
 
๐NanoSAM is a Segment Anything variant capable of running in real-time on #NVIDIA Jetson Orin with TensorRT
๐Review https://t.ly/UErq_
๐Tutorial https://github.com/NVIDIA-AI-IOT/nanosam
๐NanoSAM is a Segment Anything variant capable of running in real-time on #NVIDIA Jetson Orin with TensorRT
๐Review https://t.ly/UErq_
๐Tutorial https://github.com/NVIDIA-AI-IOT/nanosam
๐ฅ11๐1๐1๐คฏ1
  This media is not supported in your browser
    VIEW IN TELEGRAM
  ๐ง SOTA RGB-D Video Salient Object ๐ง 
 
๐ DCTNet+ (model) and RDVS(dataset) for a new SOTA in Video Saliency Object Detection
๐Review https://t.ly/DapLV
๐Code github.com/kerenfu/RDVS
๐Paper arxiv.org/pdf/2310.15482.pdf
๐ DCTNet+ (model) and RDVS(dataset) for a new SOTA in Video Saliency Object Detection
๐Review https://t.ly/DapLV
๐Code github.com/kerenfu/RDVS
๐Paper arxiv.org/pdf/2310.15482.pdf
๐ฅ4๐1๐คฏ1
  This media is not supported in your browser
    VIEW IN TELEGRAM
  โ๏ธ Relighted 3D Hands ๐ค 
 
๐#META unveils Re:InterHand: a large dataset of relighted 3D interacting hands
 
๐Review https://t.ly/I1dQk
๐Paper arxiv.org/pdf/2310.17768.pdf
๐Project mks0601.github.io/ReInterHand
๐Data github.com/mks0601/ReInterHand
๐#META unveils Re:InterHand: a large dataset of relighted 3D interacting hands
๐Review https://t.ly/I1dQk
๐Paper arxiv.org/pdf/2310.17768.pdf
๐Project mks0601.github.io/ReInterHand
๐Data github.com/mks0601/ReInterHand
๐คฏ8โค1๐ฑ1
  This media is not supported in your browser
    VIEW IN TELEGRAM
  ๐ Video Understanding with GPT-4V(ision) ๐ 
 
๐ #Microsoft unveils MM-Vid, the most advanced video understanding framework (w/ #chatgpt4). Impressive results on long-form videos & intricate tasks such as audio description & multimodal high-level comprehension
 
๐Review https://t.ly/RISMm
๐Paper arxiv.org/pdf/2310.19773.pdf
๐Project https://multimodal-vid.github.io
๐ #Microsoft unveils MM-Vid, the most advanced video understanding framework (w/ #chatgpt4). Impressive results on long-form videos & intricate tasks such as audio description & multimodal high-level comprehension
๐Review https://t.ly/RISMm
๐Paper arxiv.org/pdf/2310.19773.pdf
๐Project https://multimodal-vid.github.io
๐คฏ22๐9๐ฅ2๐1๐ฑ1
  This media is not supported in your browser
    VIEW IN TELEGRAM
  ๐ฃ  Foot via Synthetic Data ๐ฃ 
 
๐ 50,000 synthetic/photorealistic foot images + a novel SOTA library for foot
 
๐Review https://t.ly/TVanP
๐Paper https://arxiv.org/pdf/2310.18279.pdf
๐Project https://ollieboyne.github.io/FOUND
๐Code https://github.com/OllieBoyne/FOUND
๐ 50,000 synthetic/photorealistic foot images + a novel SOTA library for foot
๐Review https://t.ly/TVanP
๐Paper https://arxiv.org/pdf/2310.18279.pdf
๐Project https://ollieboyne.github.io/FOUND
๐Code https://github.com/OllieBoyne/FOUND
๐คฃ8๐4โค2๐ฅฐ2๐คฉ2
  This media is not supported in your browser
    VIEW IN TELEGRAM
  ๐ OYSTER: unsupervised detection w/ LIDAR ๐ 
 
๐Waabi unveils OYSTER: a novel unsupervised object detection from LiDAR point clouds.
 
๐Review https://t.ly/EMi58
๐Project https://waabi.ai/oyster/
๐Paper arxiv.org/pdf/2311.02007.pdf
๐Waabi unveils OYSTER: a novel unsupervised object detection from LiDAR point clouds.
๐Review https://t.ly/EMi58
๐Project https://waabi.ai/oyster/
๐Paper arxiv.org/pdf/2311.02007.pdf
โค15๐3๐ฅ2๐1
  ๐ฅGPT-4 Pass the Turing Test?๐ฅ 
 
๐No. I mean...not yet. Read this Paper from UC San Diego๐
 
๐Review https://t.ly/o8HgM
๐Paper https://arxiv.org/pdf/2310.20216.pdf
๐No. I mean...not yet. Read this Paper from UC San Diego๐
๐Review https://t.ly/o8HgM
๐Paper https://arxiv.org/pdf/2310.20216.pdf
โค4๐ฅ3๐1๐คฉ1
  This media is not supported in your browser
    VIEW IN TELEGRAM
  ๐ฅปSF: Towards Virtual Cloth๐ฅป 
 
๐SEA AI Lab unveils a novel #AI to recovery the garment sewing patterns from daily photos for #AR / #VR worlds
 
๐Review https://t.ly/MwpAV
๐Project https://sewformer.github.io/
๐Paper https://arxiv.org/pdf/2311.04218.pdf
๐Code https://github.com/sail-sg/sewformer
๐SEA AI Lab unveils a novel #AI to recovery the garment sewing patterns from daily photos for #AR / #VR worlds
๐Review https://t.ly/MwpAV
๐Project https://sewformer.github.io/
๐Paper https://arxiv.org/pdf/2311.04218.pdf
๐Code https://github.com/sail-sg/sewformer
๐4๐ฅ2๐ฅฐ2๐2๐คฏ1๐คฉ1
  This media is not supported in your browser
    VIEW IN TELEGRAM
  ๐๏ธ 3DiffTection: new SOTA 3D detection ๐๏ธ 
 
๐#Nvidia unveils 3DiffTection, the new SOTA for 3D object detection from single images. A powerful 3D detector powered by diffusion model
 
๐Review https://t.ly/PciXY
๐Paper https://arxiv.org/pdf/2311.04391.pdf
๐Code https://github.com/nv-tlabs/3DiffTection
๐Project research.nvidia.com/labs/toronto-ai/3difftection
๐#Nvidia unveils 3DiffTection, the new SOTA for 3D object detection from single images. A powerful 3D detector powered by diffusion model
๐Review https://t.ly/PciXY
๐Paper https://arxiv.org/pdf/2311.04391.pdf
๐Code https://github.com/nv-tlabs/3DiffTection
๐Project research.nvidia.com/labs/toronto-ai/3difftection
๐ฅ8โค6๐3๐ฑ3๐1
  This media is not supported in your browser
    VIEW IN TELEGRAM
  ๐ช 30x Faster Neural Scenes ๐ช 
 
๐ NeuRas: realistic real-time novel-view synthesis of VERY large scenes (>10000 m2 ). 30ร faster rendering than previous SOTA w/ comparable or better realism
 
๐Review https://t.ly/ELJSE
๐Paper https://arxiv.org/pdf/2311.05607.pdf
๐Project https://waabi.ai/NeuRas/
๐ NeuRas: realistic real-time novel-view synthesis of VERY large scenes (>10000 m2 ). 30ร faster rendering than previous SOTA w/ comparable or better realism
๐Review https://t.ly/ELJSE
๐Paper https://arxiv.org/pdf/2311.05607.pdf
๐Project https://waabi.ai/NeuRas/
๐ฅ9โค1๐1๐คฏ1๐คฉ1
  This media is not supported in your browser
    VIEW IN TELEGRAM
  ๐ฅ Hu.ma.ne #AI Pin is out! ๐ฅ   
   
๐Hu.ma.ne just launched #AI Pin: the new standalone AI-powered screenless device. Running on the GPT-4 LLMs, suitable for real-time translation. #AI-powered camera and laser projector
  
๐ More https://t.ly/IvoN7
๐Hu.ma.ne just launched #AI Pin: the new standalone AI-powered screenless device. Running on the GPT-4 LLMs, suitable for real-time translation. #AI-powered camera and laser projector
๐ More https://t.ly/IvoN7
โค6๐ฅ4๐ฉ2๐1๐ฑ1
  This media is not supported in your browser
    VIEW IN TELEGRAM
  ๐ซ Segmentation of Human ๐ซ 
 
๐TotalSegmentator_v2: segmenting 104 anatomical structures (27 organs, 59 bones, 10 muscles, 8 vessels) in CT. Now suitable in 3D Slicer, open source platform for image visualization.
๐Review https://t.ly/yHMm1
๐Code https://lnkd.in/dvgrbsCE
๐Paper https://lnkd.in/dkwHuuzU
๐TotalSegmentator_v2: segmenting 104 anatomical structures (27 organs, 59 bones, 10 muscles, 8 vessels) in CT. Now suitable in 3D Slicer, open source platform for image visualization.
๐Review https://t.ly/yHMm1
๐Code https://lnkd.in/dvgrbsCE
๐Paper https://lnkd.in/dkwHuuzU
๐ฅ14๐7๐คฏ6๐ฑ2โค1๐คฉ1
  ๐ช Spacecraft Pose Estimation ๐ช 
 
๐SnT (Luxembourg) unveils the most advanced event-based dataset for Spacecrafts: Unreal Engine + data from ICNS simulator + Real images + Real event data acquired in lab
 
๐Review https://t.ly/m8JPB
๐Paper https://lnkd.in/d_edvc3n
๐Project https://lnkd.in/dPp375aY
๐SnT (Luxembourg) unveils the most advanced event-based dataset for Spacecrafts: Unreal Engine + data from ICNS simulator + Real images + Real event data acquired in lab
๐Review https://t.ly/m8JPB
๐Paper https://lnkd.in/d_edvc3n
๐Project https://lnkd.in/dPp375aY
โค7๐คฏ2๐1๐ฑ1
  This media is not supported in your browser
    VIEW IN TELEGRAM
  ๐ฅFlorence-2: unified Computer Vision๐ฅ
๐#Microsoft announces Florence-2: novel foundation model with unified, prompt-based, representation for a large variety of #computervision & vision-language task. One backbone -> multiple tasks!
๐Review https://t.ly/pOins
๐Paper arxiv.org/pdf/2311.06242.pdf
๐Project www.microsoft.com/en-us/research/project/projectflorence/
๐#Microsoft announces Florence-2: novel foundation model with unified, prompt-based, representation for a large variety of #computervision & vision-language task. One backbone -> multiple tasks!
๐Review https://t.ly/pOins
๐Paper arxiv.org/pdf/2311.06242.pdf
๐Project www.microsoft.com/en-us/research/project/projectflorence/
๐ฑ9โค5๐ฅ3๐1๐1๐พ1