ML Research Hub
32.5K subscribers
3.58K photos
155 videos
23 files
3.82K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho
Download Telegram
DeepAgent: A General Reasoning Agent with Scalable Toolsets

📝 Summary:
DeepAgent is an end-to-end deep reasoning agent that autonomously thinks, discovers tools, and executes actions. It uses memory folding for long interactions and ToolPO reinforcement learning for general tool use. DeepAgent consistently outperforms baselines on eight diverse benchmarks.

🔹 Publication Date: Published on Oct 24

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.21618
• PDF: https://arxiv.org/pdf/2510.21618
• Github: https://github.com/RUC-NLPIR/DeepAgent

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #ReasoningAgents #ReinforcementLearning #ToolLearning #DeepLearning
CodeV: Code with Images for Faithful Visual Reasoning via Tool-Aware Policy Optimization

📝 Summary:
CodeV improves faithful visual reasoning by training an agent with Tool-Aware Policy Optimization TAPO. TAPO uses dense rewards directly on visual tool inputs and outputs, encouraging evidence-consistent tool use. This approach significantly boosts faithful tool use and achieves competitive accur...

🔹 Publication Date: Published on Nov 24

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.19661
• PDF: https://arxiv.org/pdf/2511.19661

🔹 Models citing this paper:
https://huggingface.co/RenlyH/CodeV-RL
https://huggingface.co/RenlyH/CodeV-SFT

Datasets citing this paper:
https://huggingface.co/datasets/RenlyH/CodeV-RL-Data

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#VisualReasoning #ReinforcementLearning #ComputerVision #AI #ToolLearning
3