ML Research Hub – Telegram

ML Research Hub

32.8K subscribers

4.4K photos

272 videos

23 files

4.76K links

Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho

Download Telegram

About

Blog

Apps

Platform

ML Research Hub

32.8K subscribers

ML Research Hub

✨PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC

📝 Summary:
PC-Agent is a hierarchical multi-agent framework improving MLLM-based GUI agents for complex PC tasks. It uses an Active Perception Module and a hierarchical decision-making architecture with Manager, Progress, and Decision agents. A Reflection agent provides feedback. It achieved a 32% task succ...

🔹 Publication Date: Published on Feb 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2502.14282
• PDF: https://arxiv.org/pdf/2502.14282
• Github: https://github.com/X-PLUG/MobileAgent/tree/main/PC-Agent

✨ Spaces citing this paper:
• https://huggingface.co/spaces/junyangwang0410/PC-Agent

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#MultiAgentSystems #AIAgents #MLLMs #PCAutomation #DeepLearning

259 views12:08

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Fara-7B: An Efficient Agentic Model for Computer Use

📝 Summary:
FaraGen creates synthetic datasets for computer use agents, solving a data scarcity problem. This data trains Fara-7B, a small on-device model that perceives computers via screenshots and outperforms larger models on diverse web tasks.

🔹 Publication Date: Published on Nov 24

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.19663
• PDF: https://arxiv.org/pdf/2511.19663

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AIAgents #OnDeviceAI #SyntheticData #MachineLearning #ComputerVision

305 views04:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning

📝 Summary:
Agent0-VL is a self-evolving vision-language agent that integrates tool usage into both reasoning and self-evaluation. It uses a Solver and Verifier in a self-evolving cycle for continuous improvement without human annotation or external rewards, achieving a 12.5% performance gain.

🔹 Publication Date: Published on Nov 25

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.19900
• PDF: https://arxiv.org/pdf/2511.19900

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AIAgents #VisionLanguage #SelfEvolvingAI #ToolAugmentedAI #AIResearch

271 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Latent Collaboration in Multi-Agent Systems

📝 Summary:
LatentMAS enables LLM agents to collaborate directly in latent space, surpassing text-based communication. This boosts reasoning quality, accuracy, and efficiency speed, tokens without extra training.

🔹 Publication Date: Published on Nov 25

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.20639
• PDF: https://arxiv.org/pdf/2511.20639
• Github: https://github.com/Gen-Verse/LatentMAS

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#LLM #MultiAgentSystems #LatentSpace #AIAgents #ArtificialIntelligence

339 views04:02

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨DualVLA: Building a Generalizable Embodied Agent via Partial Decoupling of Reasoning and Action

📝 Summary:
DualVLA tackles action degeneration in VLAs by boosting action performance while retaining reasoning. It uses dual-layer data pruning and dual-teacher adaptive distillation. This balances precise action and multimodal understanding, leading to high success rates.

🔹 Publication Date: Published on Nov 27

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.22134
• PDF: https://arxiv.org/pdf/2511.22134
• Project Page: https://costaliya.github.io/DualVLA/
• Github: https://costaliya.github.io/DualVLA/

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#EmbodiedAI #VLAs #AIagents #DeepLearning #AIResearch

211 views04:01

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Agentic Policy Optimization via Instruction-Policy Co-Evolution

📝 Summary:
INSPO introduces a novel framework dynamically optimizing instructions within the reinforcement learning loop for autonomous agents. It substantially outperforms static instruction methods in multi-turn reasoning by discovering innovative, strategic reasoning paths.

🔹 Publication Date: Published on Dec 1

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.01945
• PDF: https://arxiv.org/pdf/2512.01945
• Github: https://github.com/cambridgeltl/inspo

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#ReinforcementLearning #AIAgents #PolicyOptimization #MachineLearning #AI

260 views12:11

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨SIMA 2: A Generalist Embodied Agent for Virtual Worlds

📝 Summary:
SIMA 2 is a Gemini-based embodied agent for 3D virtual worlds. It reasons about goals, handles complex instructions, and autonomously learns new skills. This closes the gap with human performance and validates continuous learning agents.

🔹 Publication Date: Published on Dec 4

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.04797
• PDF: https://arxiv.org/pdf/2512.04797

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#EmbodiedAI #AI #VirtualWorlds #ReinforcementLearning #AIagents

151 views03:00

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Thinking with Images via Self-Calling Agent

📝 Summary:
sCoT is a novel visual reasoning paradigm that reformulates interleaved multimodal CoT as a language-only CoT with self-calling subagents. It improves reasoning performance and efficiency by avoiding explicit multimodal interleaving and using group-relative policy optimization.

🔹 Publication Date: Published on Dec 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.08511
• PDF: https://arxiv.org/pdf/2512.08511
• Github: https://github.com/YWenxi/think-with-images-through-self-calling

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#VisualReasoning #MultimodalAI #LLMs #AIagents #AIResearch

354 views10:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems

📝 Summary:
This survey reviews self-evolving AI agents that adapt to dynamic environments via automatic enhancement from interaction data. It proposes a unified framework and systematically reviews current techniques, addressing evaluation, safety, and ethics.

🔹 Publication Date: Published on Aug 10

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.07407
• PDF: https://arxiv.org/pdf/2508.07407
• Project Page: https://huggingface.co/spaces/X-iZhang/Awesome-Self-Evolving-Agents
• Github: https://github.com/EvoAgentX/Awesome-Self-Evolving-Agents

✨ Spaces citing this paper:
• https://huggingface.co/spaces/X-iZhang/Awesome-Self-Evolving-Agents

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#SelfEvolvingAI #AIAgents #FoundationModels #LifelongLearning #ArtificialIntelligence

❤1

544 views14:01

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

📝 Summary:
The Agentic Learning Ecosystem ALE is a new infrastructure to streamline LLM agent development for real-world tasks. ALE comprises ROLL for optimization, ROCK for sandboxing, and iFlow CLI for context. Their agent ROME, built with ALE, shows strong benchmark performance.

🔹 Publication Date: Published on Dec 31, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.24873
• PDF: https://arxiv.org/pdf/2512.24873

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AIAgents #LLMDevelopment #AgenticLearning #AIArchitecture #MachineLearning

254 views04:02

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization

📝 Summary:
Youtu-Agent scales LLM agent productivity, automating generation and enabling continuous evolution. Its hybrid optimization, using in-context learning and scalable reinforcement learning, yields top performance and boosted capabilities.

🔹 Publication Date: Published on Dec 31, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.24615
• PDF: https://arxiv.org/pdf/2512.24615
• Project Page: https://tencentcloudadp.github.io/youtu-agent/
• Github: https://github.com/TencentCloudADP/youtu-tip

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#LLM #AIAgents #ReinforcementLearning #MachineLearning #AI

334 views03:00

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨MAGMA: A Multi-Graph based Agentic Memory Architecture for AI Agents

📝 Summary:
MAGMA is a multi-graph memory architecture that improves AI agent long-context reasoning. It decouples memory representation from retrieval logic across semantic, temporal, causal, and entity graphs for query-adaptive selection, outperforming existing agentic memory systems.

🔹 Publication Date: Published on Jan 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03236
• PDF: https://arxiv.org/pdf/2601.03236

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AIAgents #MemoryArchitecture #LongContextReasoning #GraphAI #ArtificialIntelligence

227 views08:31

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨SimpleMem: Efficient Lifelong Memory for LLM Agents

📝 Summary:
SimpleMem is an efficient memory framework for LLM agents that uses semantic lossless compression. It employs a three-stage pipeline to distill, consolidate, and retrieve historical experiences efficiently. SimpleMem significantly improves accuracy and reduces token consumption by up to 30-fold c...

🔹 Publication Date: Published on Jan 5

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02553
• PDF: https://arxiv.org/pdf/2601.02553
• Project Page: https://aiming-lab.github.io/SimpleMem-Page/
• Github: https://aiming-lab.github.io/SimpleMem-Page/

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#LLM #AIAgents #LifelongLearning #AI #DeepLearning

👍1

257 views08:31

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Why LLMs Aren't Scientists Yet: Lessons from Four Autonomous Research Attempts

📝 Summary:
A case study of four LLM agent attempts to autonomously generate ML research papers reveals six recurring failure modes. Most attempts failed, though one was accepted to a special AI-first author venue, leading to proposed design principles for future AI-scientist systems.

🔹 Publication Date: Published on Jan 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03315
• PDF: https://arxiv.org/pdf/2601.03315

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#LLMs #AIResearch #MachineLearning #AIAgents #AutonomousSystems

👍1

298 views12:32

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Distilling Feedback into Memory-as-a-Tool

📝 Summary:
This framework converts transient critiques into retrievable guidelines using a file-based memory system and agent tools. It enables LLMs to achieve test-time refinement performance with significantly reduced inference costs.

🔹 Publication Date: Published on Jan 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.05960
• PDF: https://arxiv.org/pdf/2601.05960
• Github: https://github.com/vicgalle/feedback-memory-as-a-tool

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#LLMs #AIAgents #MemorySystems #AIResearch #MachineLearning

169 views09:04

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨MemoBrain: Executive Memory as an Agentic Brain for Reasoning

📝 Summary:
Long-horizon tasks strain tool-augmented agents due to accumulating context. MemoBrain is an executive memory model that organizes and prunes reasoning steps, maintaining a compact, high-salience backbone within a fixed context. This improves coherent, goal-directed reasoning.

🔹 Publication Date: Published on Jan 12

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.08079
• PDF: https://arxiv.org/pdf/2601.08079
• Github: https://github.com/qhjqhj00/MemoBrain

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AIagents #ExecutiveMemory #Reasoning #LLM #CognitiveAI

156 views12:04

✨ Explore Data Science 📝 Write your paper