ML Research Hub – Telegram

ML Research Hub

32.3K subscribers

6.46K photos

441 videos

24 files

7.02K links

Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho

ML Research Hub

✨Therefore I am. I Think

📝 Summary:
Large language models often make action choices before generating any reasoning text. Evidence shows early decision signals can be decoded and causally steered, with the subsequent 'thinking' rationalizing the pre-made choice. This suggests decisions precede explicit deliberation.

🔹 Publication Date: Published on Apr 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.01202
• PDF: https://arxiv.org/pdf/2604.01202

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

213 views · 17:09

ML Research Hub

✨MultiGen: Level-Design for Editable Multiplayer Worlds in Diffusion Game Engines

📝 Summary:
Video world models with explicit external memory enable user-controlled environment editing and real-time multiplayer interactions by decomposing generation into memory, observation, and dynamics modu...

🔹 Publication Date: Published on Mar 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.06679
• PDF: https://arxiv.org/pdf/2603.06679

#AI #DataScience #MachineLearning #HuggingFace #Research

219 views · 17:09

ML Research Hub

✨Forecasting Supply Chain Disruptions with Foresight Learning

📝 Summary:
Large language models can be trained to produce calibrated probabilistic forecasts for supply chain disruptions, outperforming existing baselines and enabling decision-ready predictions through domain...

🔹 Publication Date: Published on Apr 1

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.01298
• PDF: https://arxiv.org/pdf/2604.01298
• Project Page: https://lightningrod.ai

✨ Datasets citing this paper:
• https://huggingface.co/datasets/LightningRodLabs/supply-chain-predictions

#AI #DataScience #MachineLearning #HuggingFace #Research

174 views · 18:09

ML Research Hub

✨CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery

📝 Summary:
Autonomous multi-agent evolution framework enables open-ended discovery through persistent memory, asynchronous execution, and collaborative problem-solving, achieving superior performance on mathemat...

🔹 Publication Date: Published on Apr 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.01658
• PDF: https://arxiv.org/pdf/2604.01658
• Project Page: https://human-agent-society.github.io/CORAL
• Github: https://github.com/Human-Agent-Society/CORAL

#AI #DataScience #MachineLearning #HuggingFace #Research

173 views · 18:09

ML Research Hub

✨Video Models Reason Early: Exploiting Plan Commitment for Maze Solving

📝 Summary:
Video diffusion models demonstrate emergent reasoning abilities in maze solving through early plan commitment and path length prediction, with improved performance achieved via Chaining with Early Pla...

🔹 Publication Date: Published on Mar 31

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.30043
• PDF: https://arxiv.org/pdf/2603.30043
• Project Page: https://video-maze-reasoning.github.io/

#AI #DataScience #MachineLearning #HuggingFace #Research

❤1

245 views · 18:09

ML Research Hub

✨MedGemma Technical Report

📝 Summary:
MedGemma, a collection of medical vision-language foundation models, demonstrates advanced medical understanding and reasoning, outperforming similar-sized generative models and approaching task-speci...

🔹 Publication Date: Published on Jul 7, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2507.05201
• PDF: https://arxiv.org/pdf/2507.05201
• Project Page: https://goo.gle/medgemma
• Github: https://github.com/google-gemini/gemma-cookbook

🔹 Models citing this paper:
• https://huggingface.co/google/medgemma-4b-it
• https://huggingface.co/google/medgemma-1.5-4b-it
• https://huggingface.co/google/medgemma-27b-text-it

✨ Datasets citing this paper:
• https://huggingface.co/datasets/Mateenah/medgemma-4b-hematologic-oncology-blind-spots

✨ Spaces citing this paper:
• https://huggingface.co/spaces/yipengsun/diagnostic-devils-advocate
• https://huggingface.co/spaces/AIencoder/RadAssist-MedGemma
• https://huggingface.co/spaces/google/appoint-ready

#AI #DataScience #MachineLearning #HuggingFace #Research

244 views · 18:09

ML Research Hub

✨An Empirical Recipe for Universal Phone Recognition

📝 Summary:
PhoneticXEUS achieves leading performance for universal phone recognition in multilingual and accented speech. This results from large-scale training and an empirical analysis of key factors including SSL representations, data scale, and loss objectives.

🔹 Publication Date: Published on Mar 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.29042
• PDF: https://arxiv.org/pdf/2603.29042
• Github: https://github.com/changelinglab/PhoneticXeus

🔹 Models citing this paper:
• https://huggingface.co/changelinglab/PhoneticXeus

#AI #DataScience #MachineLearning #HuggingFace #Research

257 views · 21:10

ML Research Hub

✨Signals: Trajectory Sampling and Triage for Agentic Interactions

📝 Summary:
A signal framework efficiently triages agentic interaction trajectories. It computes low-cost signals from live interactions to identify informative samples for post-deployment optimization, achieving 82% informativeness and outperforming other methods.

🔹 Publication Date: Published on Apr 1

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.00356
• PDF: https://arxiv.org/pdf/2604.00356
• Project Page: https://planoai.dev/
• Github: https://github.com/katanemo/plano

#AI #DataScience #MachineLearning #HuggingFace #Research

265 views · 00:00

ML Research Hub

✨DeepScientist: Advancing Frontier-Pushing Scientific Findings Progressively

📝 Summary:
DeepScientist autonomously conducts scientific discovery through Bayesian Optimization, surpassing human state-of-the-art methods on multiple AI tasks. While previous AI Scientist...

🔹 Publication Date: Published on Sep 30, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.26603
• PDF: https://arxiv.org/pdf/2509.26603
• Project Page: https://ai-researcher.net
• Github: https://github.com/ResearAI/DeepScientist

#AI #DataScience #MachineLearning #HuggingFace #Research

316 views · 00:30

ML Research Hub

✨LOME: Learning Human-Object Manipulation with Action-Conditioned Egocentric World Model

📝 Summary:
LOME is an egocentric world model that generates realistic human-object interactions in videos by combining image, text, and action inputs with joint estimation of spatial human actions and environmen...

🔹 Publication Date: Published on Mar 28

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.27449
• PDF: https://arxiv.org/pdf/2603.27449
• Project Page: https://zerg-overmind.github.io/LOME.github.io/
• Github: https://github.com/Zerg-Overmind/LOME

#AI #DataScience #MachineLearning #HuggingFace #Research

❤1

423 views · 01:01

ML Research Hub

✨Hunyuan3D 2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material

📝 Summary:
This tutorial introduces Hunyuan3D 2.1, a system for generating high-fidelity, textured 3D assets to make AI content creation more accessible. It details the full workflow from data preparation to deployment, using Hunyuan3D-DiT for shape and Hunyuan3D-Paint for texture synthesis.

🔹 Publication Date: Published on Jun 18, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2506.15442
• PDF: https://arxiv.org/pdf/2506.15442
• Github: https://github.com/huggingface/huggingface.js

🔹 Models citing this paper:
• https://huggingface.co/tencent/Hunyuan3D-2.1
• https://huggingface.co/tencent/Hunyuan3D-Omni
• https://huggingface.co/tencent/HY3D-Bench

✨ Datasets citing this paper:
• https://huggingface.co/datasets/tencent/HY3D-Bench

✨ Spaces citing this paper:
• https://huggingface.co/spaces/duranponce/ai-default
• https://huggingface.co/spaces/AliothTalks/Hunyuan3D-2.1
• https://huggingface.co/spaces/joaojack/Hunyuan3D-2.1

#3DGeneration #AI #ComputerGraphics #ImageTo3D #PBRMaterials

❤1

467 views · 18:02

ML Research Hub

✨RF-DETR: Neural Architecture Search for Real-Time Detection Transformers

📝 Summary:
RF-DETR is a lightweight detection transformer that uses weight-sharing NAS to optimize the trade-off between accuracy and latency for real-time detection across diverse datasets. It significantly outperforms prior state-of-the-art methods on COCO and Roboflow100-VL, with its largest variant exceeding 60 AP on COCO.

🔹 Publication Date: Published on Nov 12, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.09554
• PDF: https://arxiv.org/pdf/2511.09554
• Project Page: https://rfdetr.roboflow.com/1.3.0/
• Github: https://github.com/roboflow/rf-detr

🔹 Models citing this paper:
• https://huggingface.co/mlx-community/rfdetr-base-fp32
• https://huggingface.co/mlx-community/rfdetr-seg-small-fp32
• https://huggingface.co/mlx-community/rfdetr-seg-large-fp32

#ObjectDetection #NeuralArchitectureSearch #DeepLearning #ComputerVision #DETR

❤1

462 views · 07:00

ML Research Hub

✨Agentic-MME: What Agentic Capability Really Brings to Multimodal Intelligence?

📝 Summary:
Agentic-MME introduces a process-verified benchmark for multimodal agentic capabilities. It evaluates tool usage and efficiency using real-world tasks and stepwise checkpoints, revealing that models struggle with complex multimodal problem-solving.

🔹 Publication Date: Published on Apr 3

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.03016
• PDF: https://arxiv.org/pdf/2604.03016
• Project Page: https://agenticmme.github.io/

#AgenticAI #MultimodalAI #AIEvaluation #AIResearch #Benchmarks

271 views · 02:00

ML Research Hub

✨AgentHazard: A Benchmark for Evaluating Harmful Behavior in Computer-Use Agents

📝 Summary:
Computer-use agents pose unique safety risks because harm can emerge from sequences of individually benign actions. AgentHazard is a benchmark of 2,653 instances for evaluating this risk. Experiments reveal that current systems are highly vulnerable, showing that model alignment alone doesn't ensure agent safety.

🔹 Publication Date: Published on Apr 3

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.02947
• PDF: https://arxiv.org/pdf/2604.02947
• Project Page: https://yunhao-feng.github.io/AgentHazard/

#AISafety #AgentAI #AIVulnerability #AIethics #AIbenchmark

176 views · 02:00

ML Research Hub

✨CoME-VL: Scaling Complementary Multi-Encoder Vision-Language Learning

📝 Summary:
CoME-VL fuses contrastive and self-supervised vision encoders to improve vision-language models. It uses entropy-guided aggregation and RoPE-enhanced attention for better visual understanding and grounding, outperforming single-encoder baselines.

🔹 Publication Date: Published on Apr 3

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.03231
• PDF: https://arxiv.org/pdf/2604.03231
• Project Page: https://mbzuai-oryx.github.io/CoME-VL/
• Github: https://github.com/mbzuai-oryx/CoME-VL

#VisionLanguage #MultimodalAI #ComputerVision #MachineLearning #DeepLearning

183 views · 02:01

ML Research Hub

✨InCoder-32B-Thinking: Industrial Code World Model for Thinking

📝 Summary:
Industrial software development lacks expert reasoning traces for hardware constraints, so a model was trained on error-driven reasoning chains and domain-specific execution traces to generate high-qu...

🔹 Publication Date: Published on Apr 3

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.03144
• PDF: https://arxiv.org/pdf/2604.03144

#AI #CodeGeneration #IndustrialAI #WorldModels #SoftwareDevelopment

179 views · 02:01
