ML Research Hub
32.8K subscribers
4.39K photos
272 videos
23 files
4.75K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
ArenaRL: Scaling RL for Open-Ended Agents via Tournament-based Relative Ranking

📝 Summary:
Reinforcement learning for large language model agents suffers from discrimination collapse in open-ended tasks due to pointwise scalar scoring, which ArenaRL addresses through relative ranking and pa...

🔹 Publication Date: Published on Jan 10

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.06487
• PDF: https://arxiv.org/pdf/2601.06487
• Github: https://github.com/Alibaba-NLP/qqr

Datasets citing this paper:
https://huggingface.co/datasets/Alibaba-NLP/Open-Travel
https://huggingface.co/datasets/Alibaba-NLP/Open-DeepResearch

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Motion Attribution for Video Generation

📝 Summary:
Motive is a gradient-based data attribution framework that identifies influential video clips for motion improvement in text-to-video models through motion-weighted loss masking. AI-generated summary ...

🔹 Publication Date: Published on Jan 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.08828
• PDF: https://arxiv.org/pdf/2601.08828
• Project Page: https://research.nvidia.com/labs/sil/projects/MOTIVE/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
SnapGen++: Unleashing Diffusion Transformers for Efficient High-Fidelity Image Generation on Edge Devices

📝 Summary:
An efficient diffusion transformer framework for mobile and edge devices that maintains high-generation quality while reducing computational costs through compact architecture, elastic training, and k...

🔹 Publication Date: Published on Jan 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.08303
• PDF: https://arxiv.org/pdf/2601.08303

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Aligning Text, Code, and Vision: A Multi-Objective Reinforcement Learning Framework for Text-to-Visualization

📝 Summary:
A reinforcement learning framework for text-to-visualization generation that improves chart quality and code execution by optimizing multiple objectives using post-execution feedback. AI-generated sum...

🔹 Publication Date: Published on Jan 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.04582
• PDF: https://arxiv.org/pdf/2601.04582

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Media is too big
VIEW IN TELEGRAM
VLingNav: Embodied Navigation with Adaptive Reasoning and Visual-Assisted Linguistic Memory

📝 Summary:
VLingNav enhances embodied navigation through linguistic-driven cognition with adaptive reasoning and visual-assisted memory, achieving state-of-the-art performance and zero-shot transfer to real robo...

🔹 Publication Date: Published on Jan 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.08665
• PDF: https://arxiv.org/pdf/2601.08665
• Project Page: https://wsakobe.github.io/VLingNav-web/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
MemGovern: Enhancing Code Agents through Learning from Governed Human Experiences

📝 Summary:
MemGovern framework transforms unstructured GitHub data into structured experiential memory for autonomous software engineering agents, improving bug resolution rates through enhanced experience retri...

🔹 Publication Date: Published on Jan 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.06789
• PDF: https://arxiv.org/pdf/2601.06789
• Github: https://github.com/QuantaAlpha/MemGovern

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
User-Oriented Multi-Turn Dialogue Generation with Tool Use at scale

📝 Summary:
Large reasoning models enable scalable multi-turn dialogue generation through automated task-oriented simulation and user-oriented behavioral modeling for enhanced human-agent interaction datasets. AI...

🔹 Publication Date: Published on Jan 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.08225
• PDF: https://arxiv.org/pdf/2601.08225

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Solar Open Technical Report

📝 Summary:
Solar Open presents a 102B-parameter bilingual Mixture-of-Experts language model that addresses data scarcity in underserved languages through synthetic data generation, progressive curriculum coordin...

🔹 Publication Date: Published on Jan 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.07022
• PDF: https://arxiv.org/pdf/2601.07022

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
This media is not supported in your browser
VIEW IN TELEGRAM
ShowUI-π: Flow-based Generative Models as GUI Dexterous Hands

📝 Summary:
ShowUI-π is the first flow-based generative model for GUI agents, unifying discrete clicks and continuous drag actions. It achieves smooth, stable trajectories and significantly outperforms prior agents on ScreenDrag, a new benchmark for GUI drag capabilities.

🔹 Publication Date: Published on Dec 31, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.24965
• PDF: https://arxiv.org/pdf/2512.24965
• Project Page: https://showlab.github.io/showui-pi
• Github: https://github.com/showlab/showui-pi

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
EpiCaR: Knowing What You Don't Know Matters for Better Reasoning in LLMs

📝 Summary:
LLM self-training improves reasoning but causes overconfidence. EpiCaR solves this by jointly optimizing reasoning performance and calibration through epistemic learning and self-evaluation. It achieves better accuracy and calibration, reduces inference compute by 3X, and generalizes well to new ...

🔹 Publication Date: Published on Jan 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.06786
• PDF: https://arxiv.org/pdf/2601.06786

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#LLMs #AI #MachineLearning #Reasoning #Calibration