ML Research Hub – Telegram

ML Research Hub

32.9K subscribers

4.72K photos

292 videos

24 files

5.1K links

Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho

Download Telegram

About

Blog

Apps

Platform

ML Research Hub

32.9K subscribers

ML Research Hub

✨MAD: Modality-Adaptive Decoding for Mitigating Cross-Modal Hallucinations in Multimodal Large Language Models

📝 Summary:
Multimodal Large Language Models suffer from cross-modal hallucinations where one modality incorrectly influences generation from another, leading to fabricated outputs; this exposes a fundamental def...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21181
• PDF: https://arxiv.org/pdf/2601.21181

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

100 views05:02

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Qwen3-ASR Technical Report

📝 Summary:
The Qwen3-ASR family introduces speech recognition models with language identification capabilities and a non-autoregressive forced alignment model, achieving state-of-the-art performance and efficien...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21337
• PDF: https://arxiv.org/pdf/2601.21337

🔹 Models citing this paper:
• https://huggingface.co/Qwen/Qwen3-ASR-1.7B
• https://huggingface.co/Qwen/Qwen3-ASR-0.6B
• https://huggingface.co/Qwen/Qwen3-ForcedAligner-0.6B

✨ Spaces citing this paper:
• https://huggingface.co/spaces/Qwen/Qwen3-ASR
• https://huggingface.co/spaces/prithivMLmods/Qwen3-TTS-Daggr-UI
• https://huggingface.co/spaces/sxjeru/Qwen3-ASR-1.7B

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

77 views05:02

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation

📝 Summary:
ConceptMoE dynamically allocates computation by merging similar tokens into concept representations, improving both performance and efficiency in large language models through adaptive processing and ...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21420
• PDF: https://arxiv.org/pdf/2601.21420
• Github: https://github.com/ZihaoHuang-notabot/ConceptMoE

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

❤1

98 views05:02

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Self-Improving Pretraining: using post-trained models to pretrain better models

📝 Summary:
A reinforcement learning-based pretraining method improves language model safety, factuality, and quality by evaluating generations through a combination of model rollouts, original suffixes, and rewr...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21343
• PDF: https://arxiv.org/pdf/2601.21343

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

89 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Scaling Embeddings Outperforms Scaling Experts in Language Models

📝 Summary:
Embedding scaling offers superior sparsity scaling compared to expert scaling in large language models, enabling efficient inference through system optimizations and speculative decoding. AI-generated...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21204
• PDF: https://arxiv.org/pdf/2601.21204

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

87 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨DeepSearchQA: Bridging the Comprehensiveness Gap for Deep Research Agents

📝 Summary:
DeepSearchQA presents a 900-prompt benchmark evaluating agents on complex multi-step information-seeking tasks requiring systematic information collation, deduplication, and reasoning about stopping c...

🔹 Publication Date: Published on Jan 28

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20975
• PDF: https://arxiv.org/pdf/2601.20975
• Project Page: https://www.kaggle.com/benchmarks/google/dsqa/leaderboard

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

140 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Segment Length Matters: A Study of Segment Lengths on Audio Fingerprinting Performance

📝 Summary:
Neural audio fingerprinting performance varies with segment length, with short segments (0.5-second) generally providing better retrieval accuracy, and large language models showing promise in recomme...

🔹 Publication Date: Published on Jan 25

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.17690
• PDF: https://arxiv.org/pdf/2601.17690

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

86 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨PRISM: Learning Design Knowledge from Data for Stylistic Design Improvement

📝 Summary:
PRISM leverages design data to create a knowledge base for improving graphic designs based on natural language instructions, achieving superior style alignment compared to existing methods. AI-generat...

🔹 Publication Date: Published on Jan 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.11747
• PDF: https://arxiv.org/pdf/2601.11747

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

156 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨WorldBench: Disambiguating Physics for Diagnostic Evaluation of World Models

📝 Summary:
WorldBench is introduced as a video-based benchmark for disentangled evaluation of physical reasoning in generative models, revealing specific failure patterns in current state-of-the-art video world ...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21282
• PDF: https://arxiv.org/pdf/2601.21282
• Project Page: https://world-bench.github.io/

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

👍1

151 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Idea2Story: An Automated Pipeline for Transforming Research Concepts into Complete Scientific Narratives

📝 Summary:
Offline knowledge construction through structured methodological graphs enables more reliable and scalable autonomous scientific discovery by reducing reliance on real-time literature processing. AI-g...

🔹 Publication Date: Published on Jan 28

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20833
• PDF: https://arxiv.org/pdf/2601.20833

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

158 views06:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models

📝 Summary:
OCRVerse unifies text and vision-centric OCR into a holistic end-to-end method for diverse visual documents. It uses comprehensive data and a two-stage SFT-RL training with domain-specific rewards to achieve competitive results.

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21639
• PDF: https://arxiv.org/pdf/2601.21639

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

147 views07:04

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models

📝 Summary:
Text-to-image models struggle with complex spatial reasoning due to sparse prompts. This paper introduces SpatialGenEval, a new benchmark with dense prompts, showing models struggle with higher-order spatial tasks. A new dataset, SpatialT2I, helps fine-tune models for significant performance gain...

🔹 Publication Date: Published on Jan 28

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20354
• PDF: https://arxiv.org/pdf/2601.20354
• Github: https://github.com/AMAP-ML/SpatialGenEval

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#TextToImage #SpatialReasoning #GenerativeAI #ComputerVision #AIResearch

109 views08:04

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨MetricAnything: Scaling Metric Depth Pretraining with Noisy Heterogeneous Sources

📝 Summary:
Metric Anything introduces a scalable pretraining framework for metric depth using Sparse Metric Prompts to handle diverse, noisy 3D data. It shows clear scaling trends and achieves state-of-the-art performance across various depth estimation and spatial intelligence tasks.

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.22054
• PDF: https://arxiv.org/pdf/2601.22054
• Project Page: https://metric-anything.github.io/metric-anything-io/
• Github: https://github.com/metric-anything/metric-anything

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#MetricDepth #ComputerVision #MachineLearning #DeepLearning #3DVision

84 views08:04

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨PLANING: A Loosely Coupled Triangle-Gaussian Framework for Streaming 3D Reconstruction

📝 Summary:
PLANING is an efficient streaming 3D reconstruction framework. It combines explicit geometric primitives and neural Gaussians with decoupled optimization, achieving both high-quality rendering and accurate geometry. It outperforms prior methods in quality and speed.

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.22046
• PDF: https://arxiv.org/pdf/2601.22046

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#3DReconstruction #ComputerVision #NeuralNetworks #StreamingTech #ComputerGraphics

79 views08:04

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨BMAM: Brain-inspired Multi-Agent Memory Framework

📝 Summary:
BMAM presents a brain-inspired multi-agent memory architecture that decomposes memory into specialized subsystems to address long-term reasoning challenges in language-model-based agents. AI-generated...

🔹 Publication Date: Published on Jan 28

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20465
• PDF: https://arxiv.org/pdf/2601.20465
• Github: https://github.com/innovation64/BMAM

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

82 views08:05

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Spotlighting Task-Relevant Features: Object-Centric Representations for Better Generalization in Robotic Manipulation

📝 Summary:
Slot-Based Object-Centric Representations outperform global and dense feature representations in robotic manipulation tasks by providing better generalization under visual distribution shifts. AI-gene...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21416
• PDF: https://arxiv.org/pdf/2601.21416

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

134 views08:05

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨STORM: Slot-based Task-aware Object-centric Representation for robotic Manipulation

📝 Summary:
STORM enhances robotic manipulation by adapting visual foundation models with semantic-aware slots through multi-phase training. This approach improves object discovery, generalization to distractors, and robotic control performance.

🔹 Publication Date: Published on Jan 28

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20381
• PDF: https://arxiv.org/pdf/2601.20381

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#Robotics #AI #ComputerVision #RoboticManipulation #DeepLearning

136 views08:05

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨AgentLongBench: A Controllable Long Benchmark For Long-Contexts Agents via Environment Rollouts

📝 Summary:
AgentLongBench evaluates LLM agents via dynamic environment rollouts. It finds agents struggle with high-density tool responses more than memory fragmentation in long conversations, driven by tokens needed to resolve queries.

🔹 Publication Date: Published on Jan 28

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20730
• PDF: https://arxiv.org/pdf/2601.20730
• Github: https://github.com/euReKa025/AgentLongBench

✨ Datasets citing this paper:
• https://huggingface.co/datasets/ign1s/AgentLongBench

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#LLMAgents #LongContext #AIResearch #NLP #Benchmarking

115 views09:05

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨One-step Latent-free Image Generation with Pixel Mean Flows

📝 Summary:
Pixel MeanFlow pMF proposes a one-step, latent-free image generation method. It separates network output space from loss space, targeting an image manifold for prediction and defining loss in velocity space. pMF achieves strong ImageNet results at 256x256 and 512x512 resolutions.

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.22158
• PDF: https://arxiv.org/pdf/2601.22158

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#ImageGeneration #DeepLearning #ComputerVision #GenerativeAI #AIResearch

102 views09:05

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts

📝 Summary:
HALO efficiently converts Transformer models to RNN-attention hybrids using minimal training data. This enables superior long-context performance and efficiency, showcased by the HypeNet architecture and its application to the Qwen3 series.

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.22156
• PDF: https://arxiv.org/pdf/2601.22156
• Github: https://www.github.com/THUNLP/hybrid-linear-attention

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#HybridAttention #LongContext #Transformers #LLMs #DeepLearning

115 views09:06

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨FROST: Filtering Reasoning Outliers with Attention for Efficient Reasoning

📝 Summary:
FROST is an attention-aware method that improves reasoning efficiency by pruning uncritical paths and removing reasoning outliers, leading to reduced token usage and improved accuracy. AI-generated su...

🔹 Publication Date: Published on Jan 26

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.19001
• PDF: https://arxiv.org/pdf/2601.19001

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

146 views09:06

✨ Explore Data Science 📝 Write your paper