✨ EdgeTAM: On-Device Track Anything Model
📝 Summary:
EdgeTAM optimizes SAM 2 for mobile devices by addressing memory attention bottlenecks with a novel 2D Spatial Perceiver. This lightweight Transformer encodes frame-level memories to reduce computational cost. A distillation pipeline improves performance, enabling high-quality video segmentation a...
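💡 To illustrate the idea of compressing frame-level memories, here is a minimal sketch of generic Perceiver-style cross-attention: a small set of latent queries attends to a dense memory and returns far fewer tokens for later attention layers to read. This is not EdgeTAM's actual module. The function name, the sizes, and the random weights are all assumptions made for the example, and the sketch does not model the 2D spatial structure that the paper's design preserves.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def compress_memory(memory, latents, w_q, w_k, w_v):
    """Cross-attend K latent queries to N dense memory tokens.

    memory:  (N, D) flattened frame-level memory tokens
    latents: (K, D) latent queries (learned in a real model), with K << N
    returns: (K, D) compressed memory
    """
    q = latents @ w_q                         # (K, D)
    k = memory @ w_k                          # (N, D)
    v = memory @ w_v                          # (N, D)
    scores = q @ k.T / np.sqrt(q.shape[-1])   # (K, N)
    attn = softmax(scores, axis=-1)           # each latent weights all memory tokens
    return attn @ v                           # (K, D)

# Hypothetical sizes: a 32x32 memory feature map squeezed into 16 latents.
rng = np.random.default_rng(0)
D, H, W, K = 64, 32, 32, 16
memory = rng.standard_normal((H * W, D))
latents = rng.standard_normal((K, D))
w_q, w_k, w_v = (rng.standard_normal((D, D)) / np.sqrt(D) for _ in range(3))

compressed = compress_memory(memory, latents, w_q, w_k, w_v)
print(memory.shape, "->", compressed.shape)   # (1024, 64) -> (16, 64)
```

In this sketch, anything that later attends to the memory sees 16 tokens per frame instead of 1024, which is where the cost reduction comes from.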
🔹 Publication Date: Jan 13, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2501.07256
• PDF: https://arxiv.org/pdf/2501.07256
• GitHub: https://github.com/facebookresearch/edgetam
🔹 Models citing this paper:
• https://huggingface.co/yonigozlan/EdgeTAM-hf
• https://huggingface.co/facebook/EdgeTAM
✨ Spaces citing this paper:
• https://huggingface.co/spaces/merve/EdgeTAM
• https://huggingface.co/spaces/yonigozlan/edgetam
• https://huggingface.co/spaces/facebook/EdgeTAM
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#EdgeAI #VideoSegmentation #ComputerVision #MobileAI #DeepLearning