✨Xmodel-2.5: 1.3B Data-Efficient Reasoning SLM
📝 Summary:
Xmodel-2.5 is a 1.3B language model designed for efficient edge deployments. It uses maximal-update parameterization and a novel training curriculum that switches from AdamW to Muon, improving reasoning skills by 4.58% while maintaining efficiency.
🔹 Publication Date: Published on Nov 23
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.19496
• PDF: https://arxiv.org/pdf/2511.19496
• Github: https://github.com/XiaoduoAILab/Xmodel-2.5
🔹 Models citing this paper:
• https://huggingface.co/XiaoduoAILab/Xmodel-2.5
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#SLM #EdgeAI #LanguageModels #DeepLearning #ReasoningAI
📝 Summary:
Xmodel-2.5 is a 1.3B language model designed for efficient edge deployments. It uses maximal-update parameterization and a novel training curriculum that switches from AdamW to Muon, improving reasoning skills by 4.58% while maintaining efficiency.
🔹 Publication Date: Published on Nov 23
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.19496
• PDF: https://arxiv.org/pdf/2511.19496
• Github: https://github.com/XiaoduoAILab/Xmodel-2.5
🔹 Models citing this paper:
• https://huggingface.co/XiaoduoAILab/Xmodel-2.5
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#SLM #EdgeAI #LanguageModels #DeepLearning #ReasoningAI
❤1