ML Research Hub
32.3K subscribers
6.73K photos
472 videos
24 files
7.34K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
Elucidating the SNR-t Bias of Diffusion Probabilistic Models

📝 Summary:
Diffusion models suffer from an SNR-timestep bias during inference, impairing generation quality. A differential correction method is proposed that processes frequency components separately. This significantly improves generation quality across various models with minimal computational cost.

🔹 Publication Date: Published on Apr 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.16044
• PDF: https://arxiv.org/pdf/2604.16044
• Github: https://github.com/AMAP-ML/DCW

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Can Large Language Models Reinvent Foundational Algorithms?

📝 Summary:
Large language models can reinvent foundational computer science algorithms through an unlearning and reinvention process, with performance varying based on hint levels and reinforced learning techniq...

🔹 Publication Date: Published on Apr 7

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.05716
• PDF: https://arxiv.org/pdf/2604.05716
• Project Page: https://huggingface.co/spaces/jzhao1122/qwen3-thinking-dijkstra
• Github: https://github.com/Algo-Reinvention/algo-reinvention

🔹 Models citing this paper:
https://huggingface.co/algo-reinvention/Qwen3-4B-Thinking-2507-Dijkstra-Unlearn
https://huggingface.co/algo-reinvention/Qwen3-4B-Thinking-2507-Strassen-Unlearn

Spaces citing this paper:
https://huggingface.co/spaces/jzhao1122/qwen3-thinking-dijkstra

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
QuantCode-Bench: A Benchmark for Evaluating the Ability of Large Language Models to Generate Executable Algorithmic Trading Strategies

📝 Summary:
QuantCode-Bench evaluates large language models on generating executable trading strategies by testing their ability to translate natural language descriptions into functional code that operates corre...

🔹 Publication Date: Published on Apr 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.15151
• PDF: https://arxiv.org/pdf/2604.15151
• Project Page: https://limexailab.github.io/QuantCode-Bench/
• Github: https://github.com/LimexAILab/QuantCode-Bench

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off

📝 Summary:
A novel reinforcement learning approach for large language models that addresses the exploration-exploitation trade-off through perplexity-based sample partitioning and bidirectional reward allocation...

🔹 Publication Date: Published on Apr 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.13902
• PDF: https://arxiv.org/pdf/2604.13902

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Media is too big
VIEW IN TELEGRAM
Hierarchical Codec Diffusion for Video-to-Speech Generation

📝 Summary:
HiCoDiT generates speech from videos by leveraging the hierarchical structure of discrete speech tokens, achieving better audio-visual alignment through coarse-to-fine conditioning with dual-scale nor...

🔹 Publication Date: Published on Apr 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.15923
• PDF: https://arxiv.org/pdf/2604.15923

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#VideoToSpeech #DiffusionModels #GenerativeAI #SpeechSynthesis #DeepLearning
Where does output diversity collapse in post-training?

📝 Summary:
Output diversity collapse in post-trained language models is primarily driven by training data composition rather than generation format, with different post-training methods affecting diversity diffe...

🔹 Publication Date: Published on Apr 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.16027
• PDF: https://arxiv.org/pdf/2604.16027
• Github: https://github.com/ckarouzos/where-diversity-collapses

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
RoboLab: A High-Fidelity Simulation Benchmark for Analysis of Task Generalist Policies

📝 Summary:
RoboLab is a simulation benchmarking framework that addresses limitations in robot policy evaluation by enabling scalable, realistic task generation and systematic analysis of policy behavior under co...

🔹 Publication Date: Published on Apr 14

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.09860
• PDF: https://arxiv.org/pdf/2604.09860
• Project Page: https://research.nvidia.com/labs/srl/projects/robolab/
• Github: https://github.com/NVLabs/RoboLab

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
The Amazing Agent Race: Strong Tool Users, Weak Navigators

📝 Summary:
The Amazing Agent Race benchmark introduces DAG-based puzzles to evaluate LLM agents' navigation and tool-use capabilities beyond traditional linear benchmarks, revealing that navigation errors domina...

🔹 Publication Date: Published on Apr 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.10261
• PDF: https://arxiv.org/pdf/2604.10261
• Project Page: https://minnesotanlp.github.io/the-amazing-agent-race/
• Github: https://github.com/minnesotanlp/the-amazing-agent-race

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1
Universal statistical signatures of evolution in artificial intelligence architectures

📝 Summary:
The study finds that artificial intelligence architectural evolution follows the same statistical patterns as biological evolution, including similar fitness effect distributions and convergence dynam...

🔹 Publication Date: Published on Apr 12

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.10571
• PDF: https://arxiv.org/pdf/2604.10571
• Github: https://github.com/mool32/ai-evolution-universal-signatures

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Motif-Video 2B: Technical Report

📝 Summary:
Motif-Video 2B achieves high text-to-video quality with a specialized architecture and efficient training methods. It uses shared cross-attention and a three-part backbone to outperform larger models using significantly fewer parameters and less data.

🔹 Publication Date: Published on Apr 14

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.16503
• PDF: https://arxiv.org/pdf/2604.16503
• Project Page: https://motiftech.io/videoshowcase

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Stratagem: Learning Transferable Reasoning via Trajectory-Modulated Game Self-Play

📝 Summary:
STRATAGEM addresses limitations in reasoning transfer for language models by using a reasoning transferability coefficient and evolution reward to promote abstract, domain-agnostic patterns over game-...

🔹 Publication Date: Published on Apr 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.17696
• PDF: https://arxiv.org/pdf/2604.17696
• Github: https://github.com/ydyyyy/Stratagem

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
The Geometric Canary: Predicting Steerability and Detecting Drift via Representational Stability

📝 Summary:
Geometric stability measures predict language model controllability and detect structural degradation, with supervised variants excelling at steering prediction and unsupervised variants at drift dete...

🔹 Publication Date: Published on Apr 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.17698
• PDF: https://arxiv.org/pdf/2604.17698
• Github: https://github.com/prashantcraju/geometric-canary

🔹 Models citing this paper:
https://huggingface.co/pcr2120/shesha-geometry

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Geometric coherence of single-cell CRISPR perturbations reveals regulatory architecture and predicts cellular stress

📝 Summary:
G e n o m e e n g i n e e r i n g h a s a c h i e v e d r e m a r k a b l e s e q u e n c e - l e v e l p r e c i s i o n , y e t p r e d i c t i n g t h e t r a n s c r i p t o m i c s t a t e t h a ...

🔹 Publication Date: Published on Apr 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.16642
• PDF: https://arxiv.org/pdf/2604.16642
• Github: https://github.com/prashantcraju/geometric-stability-crispr

🔹 Models citing this paper:
https://huggingface.co/pcr2120/shesha-geometry

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1
Revisiting a Pain in the Neck: A Semantic Reasoning Benchmark for Language Models

📝 Summary:
SemanticQA is a new benchmark to evaluate language models on semantic phrase processing, covering various phrase types. It reveals significant performance differences, especially in semantic reasoning tasks, highlighting variations in models comprehension.

🔹 Publication Date: Published on Apr 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.16593
• PDF: https://arxiv.org/pdf/2604.16593
• Github: https://github.com/jacklanda/SemanticQA

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Crowded in B-Space: Calibrating Shared Directions for LoRA Merging

📝 Summary:
LoRA adapter merging performance can be improved by separately calibrating the output-side matrix B to reduce interference from shared directions while preserving task-specific information. AI-generat...

🔹 Publication Date: Published on Apr 18

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.16826
• PDF: https://arxiv.org/pdf/2604.16826

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Meta-learning In-Context Enables Training-Free Cross Subject Brain Decoding

📝 Summary:
A meta-optimized approach enables generalizable semantic visual decoding from fMRI by rapidly inferring unique neural encoding patterns from few image-brain examples without fine-tuning across subject...

🔹 Publication Date: Published on Apr 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.08537
• PDF: https://arxiv.org/pdf/2604.08537
• Github: https://github.com/ezacngm/brainCodec

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Multiplication in Multimodal LLMs: Computation with Text, Image, and Audio Inputs

📝 Summary:
Multimodal large language models demonstrate consistent computational limitations in exact multi-digit multiplication across different representations and modalities, with performance closely tied to ...

🔹 Publication Date: Published on Apr 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.18203
• PDF: https://arxiv.org/pdf/2604.18203
• Project Page: https://neuristemic.ai/multiplication-in-multimodal-llms/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation

📝 Summary:
OneVL is a unified vision-language-action framework that improves latent chain-of-thought reasoning for autonomous driving. It uses dual language and visual world model supervision to force latent tokens to internalize causal dynamics, achieving state-of-the-art accuracy at answer-only latency.

🔹 Publication Date: Published on Apr 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.18486
• PDF: https://arxiv.org/pdf/2604.18486
• Project Page: https://xiaomi-embodied-intelligence.github.io/OneVL/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

📝 Summary:
Agent-World introduces a self-evolving training framework that advances general agent intelligence through autonomous environment discovery and continuous learning across diverse real-world scenarios....

🔹 Publication Date: Published on Apr 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.18292
• PDF: https://arxiv.org/pdf/2604.18292
• Project Page: https://agent-tars-world.github.io/-/
• Github: https://agent-tars-world.github.io/-/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
MultiWorld: Scalable Multi-Agent Multi-View Video World Models

📝 Summary:
MultiWorld is a unified framework for multi-agent multi-view world modeling that achieves accurate multi-agent control while maintaining multi-view consistency through specialized modules for conditio...

🔹 Publication Date: Published on Apr 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.18564
• PDF: https://arxiv.org/pdf/2604.18564
• Project Page: https://multi-world.github.io/
• Github: https://github.com/CIntellifusion/MultiWorld

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research