ML Research Hub

This channel is for programmers, coders, and software engineers.

0️⃣ Python
1️⃣ Data Science
2️⃣ Machine Learning
3️⃣ Data Visualization
4️⃣ Artificial Intelligence
5️⃣ Data Analysis
6️⃣ Statistics
7️⃣ Deep Learning
8️⃣ Programming Languages

✅

https://t.iss.one/addlist/8_rRW2scgfRhOTc0

✅

https://t.iss.one/Codeprogrammer


✨Arch-Router: Aligning LLM Routing with Human Preferences

📝 Summary:
Arch-Router is a 1.5B model that aligns LLM routing with human preferences by matching queries to user-defined domains and action types. It outperforms proprietary models in subjective evaluations and supports flexible addition of new models.
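
A minimal sketch of the routing idea in the summary: a query is matched to a user-defined policy (a domain or action type), and a separate table maps each policy to a model. Everything named here is hypothetical (the policy names, the model names, and `pick_policy`, which uses crude keyword overlap purely as a stand-in for the 1.5B router model). It is not the Arch-Router or archgw API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RoutePolicy:
    name: str         # hypothetical policy identifier
    description: str  # plain-language description written by the user


# Hypothetical user-defined policies (domain / action pairs).
POLICIES = [
    RoutePolicy("code_generation", "write new functions or scripts"),
    RoutePolicy("legal_summarize", "summarize contracts or legal clauses"),
]

# Policy -> model table, kept separate from the matching step so a model
# can be added or swapped without touching the router. Names are made up.
POLICY_TO_MODEL = {
    "code_generation": "model-a",
    "legal_summarize": "model-b",
}
DEFAULT_MODEL = "model-default"


def pick_policy(query, policies):
    """Stand-in for the router model: choose the policy whose description
    shares the most words with the query, or None if nothing overlaps."""
    words = set(query.lower().split())
    best, best_score = None, 0
    for policy in policies:
        score = len(words & set(policy.description.lower().split()))
        if score > best_score:
            best, best_score = policy, score
    return best.name if best else None


def route(query):
    """Return the model assigned to the matched policy, else the default."""
    policy = pick_policy(query, POLICIES)
    return POLICY_TO_MODEL.get(policy, DEFAULT_MODEL)


print(route("please summarize these contracts"))
print(route("write new scripts for me"))
print(route("hello there"))
```

Because the policy-to-model table is separate from the matching step, pointing `legal_summarize` at a different model is a one-line change to `POLICY_TO_MODEL`, which is one way to read the "flexible addition of new models" claim.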

🔹 Publication Date: Published on Jun 19, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2506.16655
• PDF: https://arxiv.org/pdf/2506.16655
• Project Page: https://huggingface.co/katanemo/Arch-Router-1.5B
• Github: https://github.com/katanemo/archgw/

🔹 Models citing this paper:
• https://huggingface.co/katanemo/Arch-Router-1.5B
• https://huggingface.co/katanemo/Arch-Router-1.5B.gguf
• https://huggingface.co/Mungert/Arch-Router-1.5B-GGUF

✨ Spaces citing this paper:
• https://huggingface.co/spaces/jaimegalanmartinez/f1_faq_engine
• https://huggingface.co/spaces/tejasashinde/archRouter_simulator
• https://huggingface.co/spaces/IsaiahJ04/katanemo-Arch-Router-1.5B

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#LLM #AI #MachineLearning #NLP #LLMRouting


✨Transformer Explainer: Interactive Learning of Text-Generative Models

📝 Summary:
Transformer Explainer is an interactive web tool enabling non-experts to understand GPT-2's internal workings. It visualizes how the model generates text in real-time based on user input. This improves public access to learning about modern generative AI.

🔹 Publication Date: Published on Aug 8, 2024

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2408.04619
• PDF: https://arxiv.org/pdf/2408.04619
• Project Page: https://poloclub.github.io/transformer-explainer/
• Github: https://github.com/helblazer811/ManimML

#AI #ExplainableAI #LLM #DataVisualization #GenerativeAI

✨InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning

📝 Summary:
To address the high cost and length limits of chain-of-thought reasoning, InftyThink+ uses reinforcement learning to optimize iterative reasoning. It learns when to summarize and when to resume, boosting accuracy by 21% on AIME24, reducing latency, and improving generalization.

🔹 Publication Date: Published on Feb 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.06960
• PDF: https://arxiv.org/pdf/2602.06960

#ReinforcementLearning #AIReasoning #ChainOfThought #ArtificialIntelligence #MachineLearning

✨Revisiting the Shape Convention of Transformer Language Models

📝 Summary:
This paper challenges the traditional narrow-wide-narrow FFN in Transformers, proposing deeper hourglass-shaped FFNs. This new design improves model efficiency and performance by better utilizing parameters, especially when expanding other model components.
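
A rough parameter-count comparison may help picture the trade-off. This sketch reads "hourglass" as wide-narrow-wide sub-blocks stacked in place of one narrow-wide-narrow FFN. The 4x expansion ratio, the bottleneck width, and the block count are assumptions chosen for illustration, not values from the paper, and biases are ignored.

```python
def ffn_params(d_model, d_hidden):
    """Weight count of a two-matrix FFN: d_model -> d_hidden -> d_model."""
    return 2 * d_model * d_hidden


def stacked_ffn_params(d_model, d_hidden, n_blocks):
    """Weight count of n_blocks such FFNs applied one after another."""
    return n_blocks * ffn_params(d_model, d_hidden)


d_model = 1024  # assumed model width

# Conventional shape: one block that expands 4x, then projects back.
conventional = ffn_params(d_model, 4 * d_model)

# Assumed hourglass shape: 16 blocks that each squeeze to d_model / 4.
hourglass = stacked_ffn_params(d_model, d_model // 4, 16)

print(conventional)  # 8388608
print(hourglass)     # 8388608
```

Under these assumed numbers both layouts spend the same weight budget, so the comparison is about how the parameters are arranged (one wide layer versus many narrow ones), not about how many there are.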

🔹 Publication Date: Published on Feb 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.06471
• PDF: https://arxiv.org/pdf/2602.06471

#Transformers #LLM #DeepLearning #NeuralNetworks #AIResearch

✨MemGUI-Bench: Benchmarking Memory of Mobile GUI Agents in Dynamic Environments

📝 Summary:
MemGUI-Bench is a new, comprehensive benchmark designed to evaluate the memory capabilities of mobile GUI agents. It addresses current benchmarks' failure to assess memory by offering a taxonomy, 128 tasks, and an automated evaluation pipeline. Experiments with state-of-the-art agents reveal sign...

🔹 Publication Date: Published on Feb 3

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.06075
• PDF: https://arxiv.org/pdf/2602.06075
• Project Page: https://lgy0404.github.io/MemGUI-Bench/
• Github: https://github.com/lgy0404/MemGUI-Bench

#MobileAI #GUIagents #AIBenchmarking #MemoryAI #AIResearch

✨OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale

📝 Summary:
OmniMoE presents a system-algorithm co-designed framework that achieves fine-grained expert specialization in Mixture-of-Experts architectures through vector-level atomic experts and optimized routing...

🔹 Publication Date: Published on Feb 5

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.05711
• PDF: https://arxiv.org/pdf/2602.05711
• Github: https://github.com/flash-algo/omni-moe

#AI #DataScience #MachineLearning #HuggingFace #Research

✨On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

📝 Summary:
This paper theoretically analyzes entropy dynamics in reinforcement fine-tuning of large language models. It derives expressions for entropy change and proposes novel entropy control methods based on discriminant analysis, aiming to optimize the exploration-exploitation balance during LLM fine-tu...
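
For readers new to the quantity being tracked, here is a small sketch of policy entropy over a toy next-token distribution. It is not the paper's derivation or its control method. It only shows, on made-up logits, that pushing probability toward the already-likely token lowers entropy, while raising an unlikely token's logit increases it.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy in nats; zero-probability entries contribute 0."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


logits = [2.0, 1.0, 0.0]  # toy logits for a 3-token vocabulary
before = entropy(softmax(logits))

# Reinforce the token that is already most likely (more exploitation).
sharpened = [logits[0] + 1.0] + logits[1:]
after_sharpen = entropy(softmax(sharpened))

# Reinforce the least likely token instead (more exploration).
flattened = logits[:-1] + [logits[-1] + 1.0]
after_flatten = entropy(softmax(flattened))

print(after_sharpen < before < after_flatten)  # True
```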

🔹 Publication Date: Published on Feb 3

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.03392
• PDF: https://arxiv.org/pdf/2602.03392
• Github: https://github.com/agentscope-ai/Trinity-RFT

#LLM #ReinforcementLearning #Entropy #AIResearch #MachineLearning

✨Self-Improving Multilingual Long Reasoning via Translation-Reasoning Integrated Training

📝 Summary:
TRIT framework improves multilingual long reasoning by jointly training translation and reasoning. This self-improving method enhances non-English question understanding and response generation without extra data. It boosts accuracy and language consistency, also improving cross-lingual question ...

🔹 Publication Date: Published on Feb 5

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.05940
• PDF: https://arxiv.org/pdf/2602.05940

#MultilingualAI #LongReasoning #LLM #NLP #AIResearch

✨PlanViz: Evaluating Planning-Oriented Image Generation and Editing for Computer-Use Tasks

📝 Summary:
PlanViz is a new benchmark evaluating unified multimodal models for image generation and editing in computer-use planning tasks. It features route planning, work diagramming, and web&UI displaying sub-tasks, using a task-adaptive PlanScore to assess correctness, visual quality, and efficiency.

🔹 Publication Date: Published on Feb 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.06663
• PDF: https://arxiv.org/pdf/2602.06663
• Project Page: https://github.com/lijunxian111/PlanViz
• Github: https://github.com/lijunxian111/PlanViz/releases/tag/v1

#MultimodalAI #ImageGeneration #ImageEditing #ComputerVision #Benchmarking

✨POINTS-GUI-G: GUI-Grounding Journey

📝 Summary:
GUI agents for automated digital tasks rely on vision-language models with enhanced grounding capabilities, achieved through refined data engineering, improved training strategies, and reinforcement l...

🔹 Publication Date: Published on Feb 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.06391
• PDF: https://arxiv.org/pdf/2602.06391
• Github: https://github.com/Tencent/POINTS-GUI

#AI #DataScience #MachineLearning #HuggingFace #Research

✨EgoAVU: Egocentric Audio-Visual Understanding

📝 Summary:
MLLMs struggle to jointly understand the audio and visual content of egocentric video. EgoAVU, a new data engine, generates diverse audio-visual narrations to create the EgoAVU-Instruct dataset. Fine-tuning MLLMs on this dataset yields up to a 113% performance improvement in joint audio-visual comprehension.

🔹 Publication Date: Published on Feb 5

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.06139
• PDF: https://arxiv.org/pdf/2602.06139

#EgocentricAI #MultimodalAI #AudioVisualAI #DeepLearning #Datasets

✨DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos

📝 Summary:
DreamDojo is a foundation world model trained on 44k hours of egocentric human videos that enables efficient simulation of dexterous robotic tasks through continuous latent actions and real-time disti...

🔹 Publication Date: Published on Feb 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.06949
• PDF: https://arxiv.org/pdf/2602.06949
• Project Page: https://dreamdojo-world.github.io/

#AI #DataScience #MachineLearning #HuggingFace #Research

✨Back to Basics: Revisiting Exploration in Reinforcement Learning for LLM Reasoning via Generative Probabilities

📝 Summary:
ProGRPO is a novel RL method for LLM reasoning that tackles entropy collapse. It dynamically re-weights rewards to equilibrate confidence across correct responses, enhancing generative diversity and exploration. ProGRPO significantly outperforms standard methods on reasoning benchmarks.

🔹 Publication Date: Published on Feb 5

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.05281
• PDF: https://arxiv.org/pdf/2602.05281

#ReinforcementLearning #LLM #AI #GenerativeAI #MachineLearning

✨Exploring Knowledge Purification in Multi-Teacher Knowledge Distillation for LLMs

📝 Summary:
This paper introduces Knowledge Purification, which consolidates rationales from multiple teacher LLMs to reduce conflicts and improve distillation efficiency. The proposed purification methods improve model performance, and the router-based ones generalize robustly.

🔹 Publication Date: Published on Feb 1

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.01064
• PDF: https://arxiv.org/pdf/2602.01064

#LLM #KnowledgeDistillation #KnowledgePurification #AI #DeepLearning

✨Judging What We Cannot Solve: A Consequence-Based Approach for Oracle-Free Evaluation of Research-Level Math

📝 Summary:
Consequence-Based Utility evaluates math solutions by testing their value as in-context exemplars for related problems. This oracle-free approach outperforms reward models and LLM judges, improving ranking quality and correct-wrong separation of AI-generated solutions.

🔹 Publication Date: Published on Feb 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.06291
• PDF: https://arxiv.org/pdf/2602.06291

#AIEvaluation #LLMEvaluation #MathAI #ArtificialIntelligence #MachineLearning

✨MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration

📝 Summary:
LLM training instability is linked to weight matrix stable rank decline and Jacobian alignment, causing gradient explosions. MSign is a new optimizer that restores stable rank via matrix sign operations, effectively preventing training failures with low computational overhead.
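
The diagnostic named in the summary, stable rank, has a standard definition: the squared Frobenius norm divided by the squared spectral norm of the weight matrix. The sketch below computes it in plain Python with power iteration. It does not implement MSign or the matrix sign step, and the example matrices are made up for illustration.

```python
import math


def frobenius_sq(W):
    """Squared Frobenius norm: the sum of all squared entries."""
    return sum(x * x for row in W for x in row)


def spectral_sq(W, iters=200):
    """Squared spectral norm, estimated by power iteration on W^T W.
    Sketch only: the estimate is poor if the start vector happens to be
    orthogonal to the top singular direction."""
    n = len(W[0])
    v = [float(j + 1) for j in range(n)]
    norm = math.sqrt(sum(x * x for x in v))
    v = [x / norm for x in v]
    for _ in range(iters):
        u = [sum(w * x for w, x in zip(row, v)) for row in W]                # W v
        z = [sum(W[i][j] * u[i] for i in range(len(W))) for j in range(n)]  # W^T (W v)
        norm = math.sqrt(sum(x * x for x in z))
        if norm == 0.0:
            return 0.0
        v = [x / norm for x in z]
    u = [sum(w * x for w, x in zip(row, v)) for row in W]
    return sum(x * x for x in u)  # Rayleigh quotient, equal to ||W v||^2


def stable_rank(W):
    """||W||_F^2 / ||W||_2^2, a value between 1 and rank(W)."""
    s = spectral_sq(W)
    return frobenius_sq(W) / s if s > 0.0 else 0.0


identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
spiky = [[10.0, 0.0, 0.0], [0.0, 0.1, 0.0], [0.0, 0.0, 0.1]]

print(stable_rank(identity))  # close to 3: singular values are balanced
print(stable_rank(spiky))     # close to 1: one singular value dominates
```

A matrix whose stable rank drifts toward 1 has most of its energy in a single direction, which is the kind of decline the summary associates with instability.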

🔹 Publication Date: Published on Feb 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.01734
• PDF: https://arxiv.org/pdf/2602.01734

#AI #DataScience #MachineLearning #HuggingFace #Research

✨Baichuan-M3: Modeling Clinical Inquiry for Reliable Medical Decision-Making

📝 Summary:
Baichuan-M3 is a medical LLM for clinical decision support. It combines proactive information gathering, long-horizon reasoning, and hallucination suppression. It outperforms GPT-5.2 on medical benchmarks in clinical inquiry and safety.

🔹 Publication Date: Published on Feb 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.06570
• PDF: https://arxiv.org/pdf/2602.06570
• Github: https://github.com/baichuan-inc/Baichuan-M3-235B

🔹 Models citing this paper:
• https://huggingface.co/baichuan-inc/Baichuan-M3-235B
• https://huggingface.co/baichuan-inc/Baichuan-M3-235B-GPTQ-INT4
• https://huggingface.co/baichuan-inc/Baichuan-M3-235B-FP8

✨ Spaces citing this paper:
• https://huggingface.co/spaces/baichuan-inc/Baichuan-M3-Inquiry

#AI #DataScience #MachineLearning #HuggingFace #Research

✨SEMA: Simple yet Effective Learning for Multi-Turn Jailbreak Attacks

📝 Summary:
A novel framework called SEMA is introduced that effectively trains multi-turn attackers for large language models without relying on existing strategies or external data, achieving state-of-the-art a...

🔹 Publication Date: Published on Feb 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.06854
• PDF: https://arxiv.org/pdf/2602.06854

#AI #DataScience #MachineLearning #HuggingFace #Research

✨RaBiT: Residual-Aware Binarization Training for Accurate and Efficient LLMs

📝 Summary:
Residual binarization framework RaBiT addresses feature co-adaptation in quantized LLMs through hierarchical path derivation and robust initialization, achieving superior accuracy-efficiency trade-off...

🔹 Publication Date: Published on Feb 5

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.05367
• PDF: https://arxiv.org/pdf/2602.05367

#AI #DataScience #MachineLearning #HuggingFace #Research