ML Research Hub – Telegram

ML Research Hub

32.9K subscribers

4.72K photos

292 videos

24 files

5.1K links

Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho

ML Research Hub

✨OmegaUse: Building a General-Purpose GUI Agent for Autonomous Task Execution

📝 Summary:
OmegaUse is a general-purpose GUI agent model that achieves state-of-the-art performance on mobile and desktop platforms through a combination of high-quality data construction, decoupled training met...

🔹 Publication Date: Published on Jan 28

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20380
• PDF: https://arxiv.org/pdf/2601.20380

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

244 views, 04:02

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨SE-DiCoW: Self-Enrolled Diarization-Conditioned Whisper

📝 Summary:
SE-DiCoW improves speaker-attributed ASR by using diarization output to identify an enrollment segment for each speaker. This segment provides fixed conditioning in cross-attention layers, resolving ambiguities and significantly reducing transcription error rates compared to DiCoW.
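
The enrollment step in this summary can be sketched roughly as follows. This assumes the diarization output is a list of (speaker, start, end) turns and that the enrollment segment is simply each speaker's longest turn. The summary does not state the paper's actual selection rule, and `pick_enrollment_segments` is a hypothetical helper, not part of any released code.

```python
from typing import Dict, List, Tuple

Segment = Tuple[str, float, float]  # (speaker label, start seconds, end seconds)


def pick_enrollment_segments(diarization: List[Segment]) -> Dict[str, Tuple[float, float]]:
    """Return one (start, end) span per speaker: that speaker's longest turn.

    Picking the longest turn is an illustrative assumption, not the paper's rule.
    """
    best: Dict[str, Tuple[float, float]] = {}
    for speaker, start, end in diarization:
        if end <= start:
            continue  # skip empty or malformed turns
        current = best.get(speaker)
        if current is None or (end - start) > (current[1] - current[0]):
            best[speaker] = (start, end)
    return best


# Made-up diarization output for a two-speaker recording.
diarization = [
    ("spk0", 0.0, 2.5),
    ("spk1", 2.5, 4.0),
    ("spk0", 4.0, 9.0),
    ("spk1", 9.0, 9.8),
]
enrollment = pick_enrollment_segments(diarization)
print(enrollment)
```

In a real system, the audio for each selected span would then be encoded once and reused as the fixed conditioning input for that speaker.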

🔹 Publication Date: Published on Jan 27

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.19194
• PDF: https://arxiv.org/pdf/2601.19194

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

318 views, 05:02

ML Research Hub

✨UPLiFT: Efficient Pixel-Dense Feature Upsampling with Local Attenders

📝 Summary:
UPLiFT is an efficient iterative upsampling architecture with a Local Attender operator that creates dense features from visual backbones. It achieves state-of-the-art performance with lower inference costs than cross-attention methods, overcoming prior limitations.

🔹 Publication Date: Published on Jan 25

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.17950
• PDF: https://arxiv.org/pdf/2601.17950
• Project Page: https://www.cs.umd.edu/~mwalmer/uplift/
• Github: https://github.com/mwalmer-umd/UPLiFT/

🔹 Models citing this paper:
• https://huggingface.co/UPLiFT-upsampler/uplift_dinov2-s14
• https://huggingface.co/UPLiFT-upsampler/uplift_dinov3-splus16
• https://huggingface.co/UPLiFT-upsampler/uplift_sd1.5vae

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#ComputerVision #DeepLearning #FeatureUpsampling #AttentionMechanisms #EfficientAI

❤1

334 views, 10:03

ML Research Hub

✨Shallow-π: Knowledge Distillation for Flow-based VLAs

📝 Summary:
Shallow-π is a knowledge distillation framework that reduces transformer depth in vision-language-action models. It achieves more than twice the inference speed with less than one percent performance drop, enabling efficient real-world robotic deployment.

🔹 Publication Date: Published on Jan 28

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20262
• PDF: https://arxiv.org/pdf/2601.20262
• Project Page: https://icsl-jeon.github.io/shallow-pi/

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#KnowledgeDistillation #Robotics #VLAModels #EfficientAI #DeepLearning

❤1

319 views, 12:04

ML Research Hub

✨Reinforcement Learning via Self-Distillation

📝 Summary:
Self-Distillation Policy Optimization (SDPO) leverages rich textual feedback to address the credit-assignment bottleneck in reinforcement learning. SDPO treats the model as a self-teacher, distilling feedback-informed predictions to improve sample efficiency and accuracy. It significantly enhances ...

🔹 Publication Date: Published on Jan 28

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20802
• PDF: https://arxiv.org/pdf/2601.20802

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#ReinforcementLearning #SelfDistillation #MachineLearning #AI #PolicyOptimization

❤1

329 views, 13:04

ML Research Hub

✨Training Reasoning Models on Saturated Problems via Failure-Prefix Conditioning

📝 Summary:
Reinforcement learning training stalls on saturated problems as informative failures are hard to find. Failure-prefix conditioning addresses this by training on prefixes from rare incorrect reasoning paths, exposing models to failures. This boosts performance, maintains efficiency, and improves r...
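
A rough sketch of how prefixes of failed reasoning paths could be turned into training prompts is given below. It assumes rollouts arrive as (trace, is_correct) pairs, uses whitespace splitting as a stand-in for real tokenization, and cuts each failed trace at a fixed fraction. The helper name `failure_prefix_prompts` and the `keep_fraction` parameter are hypothetical; the paper's own prefix selection may differ.

```python
from typing import List, Tuple


def failure_prefix_prompts(
    question: str,
    rollouts: List[Tuple[str, bool]],
    keep_fraction: float = 0.5,
) -> List[str]:
    """Build prompts that continue from the start of incorrect reasoning traces."""
    prompts = []
    for trace, is_correct in rollouts:
        if is_correct:
            continue  # only failures are used as conditioning prefixes
        tokens = trace.split()  # assumption: whitespace tokens, not model tokens
        cut = max(1, int(len(tokens) * keep_fraction))
        prompts.append(question + "\n" + " ".join(tokens[:cut]))
    return prompts


# Made-up rollouts for a problem the model almost always solves.
rollouts = [
    ("2 + 2 = 4 so the answer is 4", True),
    ("2 + 2 = 5 so the answer is 5", False),
]
prompts = failure_prefix_prompts("What is 2 + 2?", rollouts)
print(prompts)
```

The model would then be sampled from these prefixed prompts, so that training sees states that follow an early mistake instead of only clean successes.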

🔹 Publication Date: Published on Jan 28

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20829
• PDF: https://arxiv.org/pdf/2601.20829
• Github: https://github.com/minwukim/training-on-saturated-problems

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#ReinforcementLearning #MachineLearning #ArtificialIntelligence #DeepLearning #AIResearch

❤1

292 views, 16:04

ML Research Hub

✨MM-Agent: LLM as Agents for Real-world Mathematical Modeling Problem

📝 Summary:
MM-Agent is an expert-inspired framework that enables LLMs to excel in real-world mathematical modeling by decomposing the task into four stages. It significantly outperforms human experts and baseline agents on a new benchmark, proving its practical effectiveness as a modeling copilot.

🔹 Publication Date: Published on May 20, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2505.14148
• PDF: https://arxiv.org/pdf/2505.14148
• Github: https://github.com/usail-hkust/llm-mm-agent

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#LLM #MathematicalModeling #AIAgents #ArtificialIntelligence #DataScience

❤1

290 views, 17:05

ML Research Hub

✨VERGE: Formal Refinement and Guidance Engine for Verifiable LLM Reasoning

📝 Summary:
VERGE is a neurosymbolic framework that combines LLMs with SMT solvers for verification-guided iterative refinement of reasoning. It enhances logical correctness through formal semantic checking, semantic routing, and precise error localization, achieving an 18.7% performance uplift on reasoning ...

🔹 Publication Date: Published on Jan 27

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20055
• PDF: https://arxiv.org/pdf/2601.20055

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#LLM #NeurosymbolicAI #FormalVerification #AIReasoning #SMTSolvers

❤2🔥1

245 views, 20:05

ML Research Hub

✨Group Distributionally Robust Optimization-Driven Reinforcement Learning for LLM Reasoning

📝 Summary:
This paper introduces Multi-Adversary GDRO to improve LLM reasoning. It dynamically adapts training distributions by classifying prompt difficulty and reallocating resources. This boosts accuracy by over 10% compared to GRPO, focusing compute on hard problems.
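
To illustrate the reallocation idea, here is a minimal sketch of the classic single-adversary Group DRO weight update (an exponentiated-gradient step), not the paper's Multi-Adversary variant. The difficulty buckets, the per-bucket losses, and the `update_group_weights` helper are all made up for illustration.

```python
import math
from typing import Dict


def update_group_weights(
    weights: Dict[str, float],
    losses: Dict[str, float],
    step_size: float = 1.0,
) -> Dict[str, float]:
    """One exponentiated-gradient step: groups with higher loss gain sampling weight."""
    scaled = {g: weights[g] * math.exp(step_size * losses[g]) for g in weights}
    total = sum(scaled.values())
    return {g: v / total for g, v in scaled.items()}  # renormalize to a distribution


# Hypothetical difficulty buckets with uniform starting weights.
weights = {"easy": 1 / 3, "medium": 1 / 3, "hard": 1 / 3}
losses = {"easy": 0.1, "medium": 0.5, "hard": 1.2}  # made-up per-bucket losses
new_weights = update_group_weights(weights, losses)
print(new_weights)
```

After the step, the "hard" bucket carries the largest weight, so a sampler driven by these weights would spend more prompts and rollouts on hard problems.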

🔹 Publication Date: Published on Jan 27

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.19280
• PDF: https://arxiv.org/pdf/2601.19280

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#LLMReasoning #ReinforcementLearning #Optimization #MachineLearning #AI

❤1

233 views, 21:06

ML Research Hub

✨Persona Prompting as a Lens on LLM Social Reasoning

📝 Summary:
Persona prompting improves LLM classification on subjective tasks like hate speech but degrades explanation quality. It fails to mitigate demographic biases and align with real-world personas, as models remain resistant to significant steering and over-flag content as harmful. This reveals a crit...

🔹 Publication Date: Published on Jan 28

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20757
• PDF: https://arxiv.org/pdf/2601.20757
• Github: https://github.com/jingyng/PP-social-reasoning

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#LLM #PersonaPrompting #BiasInAI #AIethics #NLP

🔥1

202 views, 00:00

ML Research Hub

✨How AI Impacts Skill Formation

📝 Summary:
AI assistance impairs skill acquisition for novice workers, hindering conceptual understanding and debugging. Heavy AI reliance is not a shortcut to competence. Careful AI adoption is crucial to preserve skill formation.

🔹 Publication Date: Published on Jan 28

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20245
• PDF: https://arxiv.org/pdf/2601.20245
• Project Page: https://www.anthropic.com/research/AI-assistance-coding-skills

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #SkillFormation #WorkforceDevelopment #LearningScience #HumanAICollaboration

149 views, 02:00

ML Research Hub

✨FP8-RL: A Practical and Stable Low-Precision Stack for LLM Reinforcement Learning

📝 Summary:
FP8-RL presents a practical FP8 rollout stack for LLM reinforcement learning, addressing computational and memory bottlenecks. It employs blockwise FP8, KV-cache recalibration, and importance sampling to mitigate train-inference mismatch. This achieves up to 44% throughput gains while preserving ...
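
The importance-sampling correction for train-inference mismatch can be sketched as a per-token ratio between the trainer's and the rollout engine's probabilities. The version below clips the ratio at a cap (truncated importance sampling). The cap value, the per-token granularity, and the `truncated_is_weights` helper are assumptions for illustration; the summary does not say which form the paper uses.

```python
import math
from typing import List


def truncated_is_weights(
    train_logprobs: List[float],
    rollout_logprobs: List[float],
    cap: float = 2.0,
) -> List[float]:
    """Per-token weights min(p_train / p_rollout, cap), computed from log-probs."""
    if len(train_logprobs) != len(rollout_logprobs):
        raise ValueError("log-prob lists must have the same length")
    return [
        min(math.exp(t - r), cap)
        for t, r in zip(train_logprobs, rollout_logprobs)
    ]


# Made-up log-probs: the low-precision rollout engine disagrees on the last two tokens.
train_lp = [-1.0, -0.5, -2.0]
rollout_lp = [-1.0, -0.7, -3.5]
is_weights = truncated_is_weights(train_lp, rollout_lp)
print(is_weights)
```

Each token's policy-gradient term would be multiplied by its weight, so tokens where the FP8 rollout and the trainer agree are left unchanged and large disagreements are bounded by the cap.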

🔹 Publication Date: Published on Jan 26

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18150
• PDF: https://arxiv.org/pdf/2601.18150

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#LLM #ReinforcementLearning #FP8 #MachineLearning #AIResearch

144 views, 02:00

ML Research Hub

✨Language-based Trial and Error Falls Behind in the Era of Experience

📝 Summary:
LLMs struggle in nonlinguistic tasks due to costly exploration. SCOUT uses lightweight scouts for efficient exploration, then fine-tunes LLMs via SFT and RL. This boosts performance and saves GPU hours, outperforming proprietary models.

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21754
• PDF: https://arxiv.org/pdf/2601.21754
• Project Page: https://scout-cs.github.io/
• Github: https://github.com/Harry-mic/SCOUT

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

136 views, 03:01

ML Research Hub

✨Llama-3.1-FoundationAI-SecurityLLM-Reasoning-8B Technical Report

📝 Summary:
A two-stage trained cybersecurity reasoning model achieves competitive performance on specialized tasks while maintaining general capabilities through supervised fine-tuning and reinforcement learning...

🔹 Publication Date: Published on Jan 28

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21051
• PDF: https://arxiv.org/pdf/2601.21051
• Project Page: https://huggingface.co/fdtn-ai/Foundation-Sec-8B-Reasoning

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

136 views, 03:01

ML Research Hub

✨VTC-R1: Vision-Text Compression for Efficient Long-Context Reasoning

📝 Summary:
VTC-R1 enables efficient long-context reasoning by compressing textual traces into compact images and iteratively feeding them back into vision-language models as optical memory, achieving significant...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.22069
• PDF: https://arxiv.org/pdf/2601.22069
• Github: https://github.com/w-yibo/VTC-R1

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

153 views, 04:01

ML Research Hub

✨Typhoon-S: Minimal Open Post-Training for Sovereign Large Language Models

📝 Summary:
A minimal post-training approach using supervised fine-tuning, on-policy distillation, and small-scale reinforcement fine-tuning enables the development of high-quality sovereign language models with ...

🔹 Publication Date: Published on Jan 26

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18129
• PDF: https://arxiv.org/pdf/2601.18129
• Project Page: https://opentyphoon.ai/model/typhoon-s
• Github: https://github.com/scb-10x/typhoon-s

🔹 Models citing this paper:
• https://huggingface.co/typhoon-ai/typhoon-s-thaillm-8b-instruct-research-preview
• https://huggingface.co/typhoon-ai/typhoon-s-4b-nitibench-ccl-legal-agent-research-preview

🔹 Datasets citing this paper:
• https://huggingface.co/datasets/typhoon-ai/typhoon-s-instruct-post-training
• https://huggingface.co/datasets/typhoon-ai/typhoon-s-sovereign-capability-dataset

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

92 views, 04:01

ML Research Hub

✨Exploring Reasoning Reward Model for Agents

📝 Summary:
Agent-RRM, a multi-faceted reward model, provides structured feedback for agentic trajectories through reasoning traces, critiques, and performance scores, with unified feedback integration showing su...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.22154
• PDF: https://arxiv.org/pdf/2601.22154
• Github: https://github.com/kxfan2002/Reagent

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

89 views, 04:01

ML Research Hub

✨Beyond Imitation: Reinforcement Learning for Active Latent Planning

📝 Summary:
An active latent planning method improves reasoning accuracy and efficiency by modeling latent token supervision as a conditional VAE and using reinforcement learning with coherence rewards.

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21598
• PDF: https://arxiv.org/pdf/2601.21598

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

90 views, 04:01

ML Research Hub

✨Generation Enhances Understanding in Unified Multimodal Models via Multi-Representation Generation

📝 Summary:
UniMRG enhances unified multimodal models by training them to generate multiple visual representations, improving both understanding and generation capabilities through complementary information captu...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21406
• PDF: https://arxiv.org/pdf/2601.21406
• Github: https://github.com/Sugewud/UniMRG

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

157 views, 04:01

ML Research Hub

✨WebArbiter: A Principle-Guided Reasoning Process Reward Model for Web Agents

📝 Summary:
WebArbiter introduces a reasoning-first WebPRM that formulates reward modeling as text generation to improve web navigation through structured justifications and preference verdicts, outperforming exi...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21872
• PDF: https://arxiv.org/pdf/2601.21872

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

134 views, 04:02
