ML Research Hub
32.3K subscribers
6.46K photos
441 videos
24 files
7.02K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
Therefore I am. I Think

📝 Summary:
Large language models often make action choices before generating any reasoning text. Evidence shows early decision signals can be decoded and causally steered, with the subsequent 'thinking' rationalizing the pre-made choice. This suggests decisions precede explicit deliberation.

🔹 Publication Date: Published on Apr 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.01202
• PDF: https://arxiv.org/pdf/2604.01202

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
MultiGen: Level-Design for Editable Multiplayer Worlds in Diffusion Game Engines

📝 Summary:
Video world models with explicit external memory enable user-controlled environment editing and real-time multiplayer interactions by decomposing generation into memory, observation, and dynamics modu...

🔹 Publication Date: Published on Mar 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.06679
• PDF: https://arxiv.org/pdf/2603.06679

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Forecasting Supply Chain Disruptions with Foresight Learning

📝 Summary:
Large language models can be trained to produce calibrated probabilistic forecasts for supply chain disruptions, outperforming existing baselines and enabling decision-ready predictions through domain...

🔹 Publication Date: Published on Apr 1

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.01298
• PDF: https://arxiv.org/pdf/2604.01298
• Project Page: https://lightningrod.ai

Datasets citing this paper:
https://huggingface.co/datasets/LightningRodLabs/supply-chain-predictions

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery

📝 Summary:
Autonomous multi-agent evolution framework enables open-ended discovery through persistent memory, asynchronous execution, and collaborative problem-solving, achieving superior performance on mathemat...

🔹 Publication Date: Published on Apr 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.01658
• PDF: https://arxiv.org/pdf/2604.01658
• Project Page: https://human-agent-society.github.io/CORAL
• Github: https://github.com/Human-Agent-Society/CORAL

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Video Models Reason Early: Exploiting Plan Commitment for Maze Solving

📝 Summary:
Video diffusion models demonstrate emergent reasoning abilities in maze solving through early plan commitment and path length prediction, with improved performance achieved via Chaining with Early Pla...

🔹 Publication Date: Published on Mar 31

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.30043
• PDF: https://arxiv.org/pdf/2603.30043
• Project Page: https://video-maze-reasoning.github.io/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1
An Empirical Recipe for Universal Phone Recognition

📝 Summary:
PhoneticXEUS achieves leading performance for universal phone recognition in multilingual and accented speech. This results from large-scale training and an empirical analysis of key factors including SSL representations, data scale, and loss objectives.

🔹 Publication Date: Published on Mar 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.29042
• PDF: https://arxiv.org/pdf/2603.29042
• Github: https://github.com/changelinglab/PhoneticXeus

🔹 Models citing this paper:
https://huggingface.co/changelinglab/PhoneticXeus

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Signals: Trajectory Sampling and Triage for Agentic Interactions

📝 Summary:
A signal framework efficiently triages agentic interaction trajectories. It computes low-cost signals from live interactions to identify informative samples for post-deployment optimization, achieving 82% informativeness and outperforming other methods.

🔹 Publication Date: Published on Apr 1

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.00356
• PDF: https://arxiv.org/pdf/2604.00356
• Project Page: https://planoai.dev/
• Github: https://github.com/katanemo/plano

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
This media is not supported in your browser
VIEW IN TELEGRAM
DeepScientist: Advancing Frontier-Pushing Scientific Findings Progressively

📝 Summary:
DeepScientist autonomously conducts scientific discovery through Bayesian Optimization, surpassing human state-of-the-art methods on multiple AI tasks. AI-generated summary While previous AI Scientist...

🔹 Publication Date: Published on Sep 30, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.26603
• PDF: https://arxiv.org/pdf/2509.26603
• Project Page: https://ai-researcher.net
• Github: https://github.com/ResearAI/DeepScientist

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
LOME: Learning Human-Object Manipulation with Action-Conditioned Egocentric World Model

📝 Summary:
LOME is an egocentric world model that generates realistic human-object interactions in videos by combining image, text, and action inputs with joint estimation of spatial human actions and environmen...

🔹 Publication Date: Published on Mar 28

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.27449
• PDF: https://arxiv.org/pdf/2603.27449
• Project Page: https://zerg-overmind.github.io/LOME.github.io/
• Github: https://github.com/Zerg-Overmind/LOME

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1
🔥2026 New IT Certification Prep Kit – Free!

SPOTO cover: #Python #AI #Cisco #PMI #Fortinet #AWS #Azure #Excel #CompTIA #ITIL #Cloud + more

Grab yours free kit now:
• Free Courses (Python, Excel, Cyber Security, Cisco, SQL, ITIL, PMP, AWS)
👉 https://bit.ly/3Ogtn3i
• IT Certs E-book
👉 https://bit.ly/41KZlru
• IT Exams Skill Test
👉 https://bit.ly/4ve6ZbC
• Free AI Materials & Support Tools
👉 https://bit.ly/4vagTuw
• Free Cloud Study Guide
👉 https://bit.ly/4c3BZCh

💬 Need exam help? Contact admin: wa.link/w6cems

Join our IT community: get free study materials, exam tips & peer support
https://chat.whatsapp.com/BiazIVo5RxfKENBv10F444
1
Hunyuan3D 2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material

📝 Summary:
This tutorial introduces Hunyuan3D 2.1, a system for generating high-fidelity, textured 3D assets to make AI content creation more accessible. It details the full workflow from data preparation to deployment, using Hunyuan3D-DiT for shape and Hunyuan3D-Paint for texture synthesis.

🔹 Publication Date: Published on Jun 18, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2506.15442
• PDF: https://arxiv.org/pdf/2506.15442
• Github: https://github.com/huggingface/huggingface.js

🔹 Models citing this paper:
https://huggingface.co/tencent/Hunyuan3D-2.1
https://huggingface.co/tencent/Hunyuan3D-Omni
https://huggingface.co/tencent/HY3D-Bench

Datasets citing this paper:
https://huggingface.co/datasets/tencent/HY3D-Bench

Spaces citing this paper:
https://huggingface.co/spaces/duranponce/ai-default
https://huggingface.co/spaces/AliothTalks/Hunyuan3D-2.1
https://huggingface.co/spaces/joaojack/Hunyuan3D-2.1

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#3DGeneration #AI #ComputerGraphics #ImageTo3D #PBRMaterials
1
Follow the Machine Learning with Python channel on WhatsApp: https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
1
RF-DETR: Neural Architecture Search for Real-Time Detection Transformers

📝 Summary:
RF-DETR is a light-weight detection transformer using weight-sharing NAS to optimize real-time accuracy and latency across diverse datasets. It significantly outperforms prior state-of-the-art methods on COCO and Roboflow100-VL, with its largest variant exceeding 60 AP on COCO.

🔹 Publication Date: Published on Nov 12, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.09554
• PDF: https://arxiv.org/pdf/2511.09554
• Project Page: https://rfdetr.roboflow.com/1.3.0/
• Github: https://github.com/roboflow/rf-detr

🔹 Models citing this paper:
https://huggingface.co/mlx-community/rfdetr-base-fp32
https://huggingface.co/mlx-community/rfdetr-seg-small-fp32
https://huggingface.co/mlx-community/rfdetr-seg-large-fp32

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#ObjectDetection #NeuralArchitectureSearch #DeepLearning #ComputerVision #DETR
1
Agentic-MME: What Agentic Capability Really Brings to Multimodal Intelligence?

📝 Summary:
Agentic-MME introduces a process-verified benchmark for multimodal agentic capabilities. It evaluates tool usage and efficiency using real-world tasks and stepwise checkpoints, revealing models struggle with complex multimodal problem-solving.

🔹 Publication Date: Published on Apr 3

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.03016
• PDF: https://arxiv.org/pdf/2604.03016
• Project Page: https://agenticmme.github.io/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AgenticAI #MultimodalAI #AIEvaluation #AIResearch #Benchmarks
AgentHazard: A Benchmark for Evaluating Harmful Behavior in Computer-Use Agents

📝 Summary:
Computer-use agents pose unique safety risks as harm can emerge from sequences of individually benign actions. AgentHazard is a benchmark with 2,653 instances to evaluate this. Experiments reveal current systems are highly vulnerable, showing model alignment alone doesnt ensure agent safety.

🔹 Publication Date: Published on Apr 3

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.02947
• PDF: https://arxiv.org/pdf/2604.02947
• Project Page: https://yunhao-feng.github.io/AgentHazard/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AISafety #AgentAI #AIVulnerability #AIethics #AIbenchmark
CoME-VL: Scaling Complementary Multi-Encoder Vision-Language Learning

📝 Summary:
CoME-VL fuses contrastive and self-supervised vision encoders to improve vision-language models. It uses entropy-guided aggregation and RoPE-enhanced attention for better visual understanding and grounding, outperforming single-encoder baselines.

🔹 Publication Date: Published on Apr 3

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.03231
• PDF: https://arxiv.org/pdf/2604.03231
• Project Page: https://mbzuai-oryx.github.io/CoME-VL/
• Github: https://github.com/mbzuai-oryx/CoME-VL

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#VisionLanguage #MultimodalAI #ComputerVision #MachineLearning #DeepLearning
InCoder-32B-Thinking: Industrial Code World Model for Thinking

📝 Summary:
Industrial software development lacks expert reasoning traces for hardware constraints, so a model was trained on error-driven reasoning chains and domain-specific execution traces to generate high-qu...

🔹 Publication Date: Published on Apr 3

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.03144
• PDF: https://arxiv.org/pdf/2604.03144

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #CodeGeneration #IndustrialAI #WorldModels #SoftwareDevelopment