ML Research Hub
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Xmodel-2.5: 1.3B Data-Efficient Reasoning SLM

📝 Summary:
Xmodel-2.5 is a 1.3B-parameter small language model (SLM) designed for efficient edge deployment. It uses maximal-update parameterization (μP) and a training curriculum that switches the optimizer from AdamW to Muon partway through training. The authors report a 4.58% improvement in reasoning performance while maintaining efficiency.
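
To make the optimizer-switch idea concrete, here is a minimal sketch of a two-phase schedule. It is not the authors' code: the function name `optimizer_for_step` and the `switch_fraction` value of 0.8 are hypothetical, since this summary does not say at which point training switches from AdamW to Muon. See the paper and the GitHub repo for the actual setup.

```python
def optimizer_for_step(step: int, total_steps: int, switch_fraction: float = 0.8) -> str:
    """Return the optimizer name a two-phase curriculum would use at `step`.

    `switch_fraction` is an illustrative assumption, not a value from the paper.
    """
    if not 0.0 <= switch_fraction <= 1.0:
        raise ValueError("switch_fraction must be between 0 and 1")
    if not 0 <= step < total_steps:
        raise ValueError("step must be in [0, total_steps)")
    # First step that belongs to the second (Muon) phase.
    switch_step = int(total_steps * switch_fraction)
    return "adamw" if step < switch_step else "muon"


# Usage: inspect which optimizer is active at a few points of a 1000-step run.
for s in (0, 799, 800, 999):
    print(s, optimizer_for_step(s, total_steps=1000))
```

In a real training loop, the returned name would decide which optimizer object performs the update at that step. Everything else (data, model, learning-rate schedule) is left out of this sketch.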

🔹 Publication Date: Nov 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.19496
• PDF: https://arxiv.org/pdf/2511.19496
• Github: https://github.com/XiaoduoAILab/Xmodel-2.5

🔹 Models citing this paper:
https://huggingface.co/XiaoduoAILab/Xmodel-2.5

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#SLM #EdgeAI #LanguageModels #DeepLearning #ReasoningAI