ML Research Hub
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho
WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning

📝 Summary:
WorldMM is a novel multimodal memory agent for long video reasoning. It uses episodic, semantic, and visual memories with adaptive retrieval across multiple temporal scales, significantly outperforming prior methods on long video question-answering benchmarks.
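To make the memory layout concrete, here is a toy sketch of the idea of querying several memory types at different temporal scales. It is not the authors' implementation: the `MemoryItem` class, the `retrieve` function, the word-overlap score, and the sample entries are all hypothetical stand-ins, and the simple ranking here replaces whatever adaptive retrieval the paper actually uses.

```python
from dataclasses import dataclass


@dataclass
class MemoryItem:
    store: str    # hypothetical label: "episodic", "semantic", or "visual"
    scale_s: int  # temporal span covered by the item, in seconds (0 = not time-bound)
    text: str     # text stand-in for the stored content


def overlap(query: str, text: str) -> int:
    """Count lowercase words shared by the query and a memory item."""
    return len(set(query.lower().split()) & set(text.lower().split()))


def retrieve(query: str, memory: list[MemoryItem], k: int = 2) -> list[MemoryItem]:
    """Return up to k best-matching items across all stores and scales."""
    scored = [(overlap(query, m.text), m) for m in memory]
    matches = [pair for pair in scored if pair[0] > 0]
    # Stable sort: ties keep their original order in the memory list.
    matches.sort(key=lambda pair: pair[0], reverse=True)
    return [m for _, m in matches[:k]]


# Made-up entries, for illustration only.
memory = [
    MemoryItem("episodic", 30, "person opens the fridge and takes milk"),
    MemoryItem("episodic", 600, "person cooks breakfast in the kitchen"),
    MemoryItem("semantic", 0, "the person usually drinks milk in the morning"),
    MemoryItem("visual", 1, "red mug on the kitchen counter"),
]

hits = retrieve("where is the red mug", memory)
for item in hits:
    print(item.store, item.scale_s, item.text)
```

A real system would replace the word-overlap score with learned embeddings and let the agent decide which memory and time scale to query next; see the paper and repository linked below for the actual method.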

🔹 Publication Date: Dec 2, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.02425
• PDF: https://arxiv.org/pdf/2512.02425
• Project Page: https://worldmm.github.io
• Github: https://github.com/wgcyeo/WorldMM

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#MultimodalAI #VideoReasoning #MemoryNetworks #DeepLearning #AI