ML Research Hub – Telegram

ML Research Hub

32.9K subscribers

4.72K photos

292 videos

24 files

5.1K links

Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho

Download Telegram

About

Blog

Apps

Platform

ML Research Hub

32.9K subscribers

ML Research Hub

✨Language-based Trial and Error Falls Behind in the Era of Experience

📝 Summary:
LLMs struggle in nonlinguistic tasks due to costly exploration. SCOUT uses lightweight scouts for efficient exploration, then fine-tunes LLMs via SFT and RL. This boosts performance and saves GPU hours, outperforming proprietary models.

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21754
• PDF: https://arxiv.org/pdf/2601.21754
• Project Page: https://scout-cs.github.io/
• Github: https://github.com/Harry-mic/SCOUT

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

138 views03:01

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Llama-3.1-FoundationAI-SecurityLLM-Reasoning-8B Technical Report

📝 Summary:
A two-stage trained cybersecurity reasoning model achieves competitive performance on specialized tasks while maintaining general capabilities through supervised fine-tuning and reinforcement learning...

🔹 Publication Date: Published on Jan 28

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21051
• PDF: https://arxiv.org/pdf/2601.21051
• Project Page: https://huggingface.co/fdtn-ai/Foundation-Sec-8B-Reasoning

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

138 views03:01

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨VTC-R1: Vision-Text Compression for Efficient Long-Context Reasoning

📝 Summary:
VTC-R1 enables efficient long-context reasoning by compressing textual traces into compact images and iteratively feeding them back into vision-language models as optical memory, achieving significant...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.22069
• PDF: https://arxiv.org/pdf/2601.22069
• Github: https://github.com/w-yibo/VTC-R1

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

157 views04:01

✨ Explore Data Science 📝 Write your paper

ML Research Hub

111 views04:01

ML Research Hub

✨Typhoon-S: Minimal Open Post-Training for Sovereign Large Language Models

📝 Summary:
A minimal post-training approach using supervised fine-tuning, on-policy distillation, and small-scale reinforcement fine-tuning enables the development of high-quality sovereign language models with ...

🔹 Publication Date: Published on Jan 26

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18129
• PDF: https://arxiv.org/pdf/2601.18129
• Project Page: https://opentyphoon.ai/model/typhoon-s
• Github: https://github.com/scb-10x/typhoon-s

🔹 Models citing this paper:
• https://huggingface.co/typhoon-ai/typhoon-s-thaillm-8b-instruct-research-preview
• https://huggingface.co/typhoon-ai/typhoon-s-4b-nitibench-ccl-legal-agent-research-preview

✨ Datasets citing this paper:
• https://huggingface.co/datasets/typhoon-ai/typhoon-s-instruct-post-training
• https://huggingface.co/datasets/typhoon-ai/typhoon-s-sovereign-capability-dataset

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

Typhoon-S: Minimal Open Post-Training for Sovereign Large Language Models

Large language models (LLMs) have progressed rapidly; however, most state-of-the-art models are trained and evaluated primarily in high-resource languages such as English and Chinese, and are...

95 views04:01

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Exploring Reasoning Reward Model for Agents

📝 Summary:
Agent-RRM, a multi-faceted reward model, provides structured feedback for agentic trajectories through reasoning traces, critiques, and performance scores, with unified feedback integration showing su...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/pdf/2601.22154
• PDF: https://arxiv.org/pdf/2601.22154
• Github: https://github.com/kxfan2002/Reagent

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

92 views04:01

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Beyond Imitation: Reinforcement Learning for Active Latent Planning

📝 Summary:
Active latent planning method improves reasoning accuracy and efficiency by modeling latent token supervision as conditional VAE and using reinforcement learning with coherence rewards. AI-generated s...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21598
• PDF: https://arxiv.org/pdf/2601.21598

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

94 views04:01

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Generation Enhances Understanding in Unified Multimodal Models via Multi-Representation Generation

📝 Summary:
UniMRG enhances unified multimodal models by training them to generate multiple visual representations, improving both understanding and generation capabilities through complementary information captu...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21406
• PDF: https://arxiv.org/pdf/2601.21406
• Github: https://github.com/Sugewud/UniMRG

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

159 views04:01

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨WebArbiter: A Principle-Guided Reasoning Process Reward Model for Web Agents

📝 Summary:
WebArbiter introduces a reasoning-first WebPRM that formulates reward modeling as text generation to improve web navigation through structured justifications and preference verdicts, outperforming exi...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21872
• PDF: https://arxiv.org/pdf/2601.21872

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

136 views04:02

✨ Explore Data Science 📝 Write your paper

ML Research Hub

Media is too big

VIEW IN TELEGRAM

✨DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation

📝 Summary:
DynamicVLA addresses dynamic object manipulation challenges through a compact vision-language-action model with temporal reasoning and closed-loop adaptation, supported by a new benchmark for dynamic ...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.22153
• PDF: https://arxiv.org/pdf/2601.22153
• Project Page: https://haozhexie.com/project/dynamic-vla
• Github: https://github.com/hzxie/DynamicVLA

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

125 views05:02

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric Methods

📝 Summary:
A large-scale multimodal reasoning dataset called MMFineReason is introduced to improve vision language models' performance through high-quality reasoning annotations and demonstrates superior paramet...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21821
• PDF: https://arxiv.org/pdf/2601.21821
• Project Page: https://mmfinereason.github.io/

🔹 Models citing this paper:
• https://huggingface.co/OpenDataArena/MMFineReason-8B
• https://huggingface.co/OpenDataArena/MMFineReason-4B
• https://huggingface.co/OpenDataArena/MMFineReason-2B

✨ Datasets citing this paper:
• https://huggingface.co/datasets/OpenDataArena/MMFineReason-1.8M
• https://huggingface.co/datasets/OpenDataArena/MMFineReason-SFT-123K

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

117 views05:02

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨MAD: Modality-Adaptive Decoding for Mitigating Cross-Modal Hallucinations in Multimodal Large Language Models

📝 Summary:
Multimodal Large Language Models suffer from cross-modal hallucinations where one modality incorrectly influences generation from another, leading to fabricated outputs; this exposes a fundamental def...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21181
• PDF: https://arxiv.org/pdf/2601.21181

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

103 views05:02

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Qwen3-ASR Technical Report

📝 Summary:
The Qwen3-ASR family introduces speech recognition models with language identification capabilities and a non-autoregressive forced alignment model, achieving state-of-the-art performance and efficien...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21337
• PDF: https://arxiv.org/pdf/2601.21337

🔹 Models citing this paper:
• https://huggingface.co/Qwen/Qwen3-ASR-1.7B
• https://huggingface.co/Qwen/Qwen3-ASR-0.6B
• https://huggingface.co/Qwen/Qwen3-ForcedAligner-0.6B

✨ Spaces citing this paper:
• https://huggingface.co/spaces/Qwen/Qwen3-ASR
• https://huggingface.co/spaces/prithivMLmods/Qwen3-TTS-Daggr-UI
• https://huggingface.co/spaces/sxjeru/Qwen3-ASR-1.7B

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

80 views05:02

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation

📝 Summary:
ConceptMoE dynamically allocates computation by merging similar tokens into concept representations, improving both performance and efficiency in large language models through adaptive processing and ...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21420
• PDF: https://arxiv.org/pdf/2601.21420
• Github: https://github.com/ZihaoHuang-notabot/ConceptMoE

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

❤1

101 views05:02

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Self-Improving Pretraining: using post-trained models to pretrain better models

📝 Summary:
A reinforcement learning-based pretraining method improves language model safety, factuality, and quality by evaluating generations through a combination of model rollouts, original suffixes, and rewr...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21343
• PDF: https://arxiv.org/pdf/2601.21343

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

91 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Scaling Embeddings Outperforms Scaling Experts in Language Models

📝 Summary:
Embedding scaling offers superior sparsity scaling compared to expert scaling in large language models, enabling efficient inference through system optimizations and speculative decoding. AI-generated...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21204
• PDF: https://arxiv.org/pdf/2601.21204

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

89 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨DeepSearchQA: Bridging the Comprehensiveness Gap for Deep Research Agents

📝 Summary:
DeepSearchQA presents a 900-prompt benchmark evaluating agents on complex multi-step information-seeking tasks requiring systematic information collation, deduplication, and reasoning about stopping c...

🔹 Publication Date: Published on Jan 28

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20975
• PDF: https://arxiv.org/pdf/2601.20975
• Project Page: https://www.kaggle.com/benchmarks/google/dsqa/leaderboard

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

141 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Segment Length Matters: A Study of Segment Lengths on Audio Fingerprinting Performance

📝 Summary:
Neural audio fingerprinting performance varies with segment length, with short segments (0.5-second) generally providing better retrieval accuracy, and large language models showing promise in recomme...

🔹 Publication Date: Published on Jan 25

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.17690
• PDF: https://arxiv.org/pdf/2601.17690

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

88 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨PRISM: Learning Design Knowledge from Data for Stylistic Design Improvement

📝 Summary:
PRISM leverages design data to create a knowledge base for improving graphic designs based on natural language instructions, achieving superior style alignment compared to existing methods. AI-generat...

🔹 Publication Date: Published on Jan 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.11747
• PDF: https://arxiv.org/pdf/2601.11747

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

243 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨WorldBench: Disambiguating Physics for Diagnostic Evaluation of World Models

📝 Summary:
WorldBench is introduced as a video-based benchmark for disentangled evaluation of physical reasoning in generative models, revealing specific failure patterns in current state-of-the-art video world ...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21282
• PDF: https://arxiv.org/pdf/2601.21282
• Project Page: https://world-bench.github.io/

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

👍1

157 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Idea2Story: An Automated Pipeline for Transforming Research Concepts into Complete Scientific Narratives

📝 Summary:
Offline knowledge construction through structured methodological graphs enables more reliable and scalable autonomous scientific discovery by reducing reliance on real-time literature processing. AI-g...

🔹 Publication Date: Published on Jan 28

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20833
• PDF: https://arxiv.org/pdf/2601.20833

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

167 views06:03

✨ Explore Data Science 📝 Write your paper