ML Research Hub
32.5K subscribers
6.12K photos
403 videos
24 files
6.63K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
REVERE: Reflective Evolving Research Engineer for Scientific Workflows

📝 Summary:
REVERE enhances research coding agent performance via reflective optimization and cumulative knowledge consolidation across multiple tasks. It overcomes prior prompt-optimization limits, achieving significant gains on research coding benchmarks and demonstrating agent evolution.

🔹 Publication Date: Published on Mar 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.20667
• PDF: https://arxiv.org/pdf/2603.20667

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AIAgents #ResearchAutomation #CodingAI #PromptEngineering #AgentEvolution
The Universal Normal Embedding

📝 Summary:
Generative models and vision encoders share a common Gaussian latent space called the Universal Normal Embedding UNE. This shared UNE provides aligned semantic representations and enables controllable image editing through simple linear manipulations.

🔹 Publication Date: Published on Mar 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.21786
• PDF: https://arxiv.org/pdf/2603.21786

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#GenerativeAI #ComputerVision #LatentSpace #DeepLearning #MachineLearning
1
F4Splat: Feed-Forward Predictive Densification for Feed-Forward 3D Gaussian Splatting

📝 Summary:
F4Splat introduces predictive densification for 3D Gaussian splatting, adaptively allocating Gaussians based on spatial complexity and view overlap. This reduces redundant Gaussians, leading to compact, high-quality 3D representations with significantly fewer Gaussians than prior feed-forward met...

🔹 Publication Date: Published on Mar 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.21304
• PDF: https://arxiv.org/pdf/2603.21304
• Project Page: https://mlvlab.github.io/F4Splat/
• Github: https://github.com/mlvlab/F4Splat

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#3DGaussianSplatting #ComputerGraphics #3DReconstruction #MachineLearning #NeuralRendering
BubbleRAG: Evidence-Driven Retrieval-Augmented Generation for Black-Box Knowledge Graphs

📝 Summary:
BubbleRAG improves graph-based RAG recall and precision for black-box knowledge graphs. It uses semantic anchoring and bubble expansion to find relevant subgraphs, achieving state-of-the-art results on multi-hop QA.

🔹 Publication Date: Published on Mar 19

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.20309
• PDF: https://arxiv.org/pdf/2603.20309
• Github: https://github.com/limafang/BubbleRAG

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#RAG #KnowledgeGraphs #AI #NLP #MachineLearning
SEM: Sparse Embedding Modulation for Post-Hoc Debiasing of Vision-Language Models

📝 Summary:
Sparse Embedding Modulation SEM debiases vision-language models by operating in a sparse autoencoder latent space. SEM precisely modulates bias-relevant neurons while preserving semantic information, achieving substantial fairness gains in retrieval and classification tasks.

🔹 Publication Date: Published on Mar 19

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.19028
• PDF: https://arxiv.org/pdf/2603.19028
• Project Page: https://sparse-embedding-modulation.github.io/
• Github: https://github.com/mardgui/SEM

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#VisionLanguageModels #BiasCorrection #MachineLearning #AIResearch #DeepLearning
SNAP: Speaker Nulling for Artifact Projection in Speech Deepfake Detection

📝 Summary:
A speaker-nulling framework called SNAP is proposed to reduce speaker entanglement in speech encoders, enabling detectors to focus on artifact-related patterns for improved deepfake detection performa...

🔹 Publication Date: Published on Mar 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.20686
• PDF: https://arxiv.org/pdf/2603.20686
• Project Page: https://huggingface.co/papers?q=orthogonal%20projection

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Not All Layers Are Created Equal: Adaptive LoRA Ranks for Personalized Image Generation

📝 Summary:
LoRA² adapts layer-specific ranks during fine-tuning for personalized image generation, achieving better performance-memory trade-offs than fixed-rank approaches. AI-generated summary Low Rank Adaptat...

🔹 Publication Date: Published on Mar 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.21884
• PDF: https://arxiv.org/pdf/2603.21884
• Project Page: https://donaldssh.github.io/NotAllLayersAreCreatedEqual/
• Github: https://github.com/donaldssh/NotAllLayersAreCreatedEqual

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Repurposing Geometric Foundation Models for Multi-view Diffusion

📝 Summary:
Geometric Latent Diffusion (GLD) framework utilizes geometric foundation models' feature space as latent space for novel view synthesis, achieving superior 2D and 3D performance while reducing trainin...

🔹 Publication Date: Published on Mar 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.22275
• PDF: https://arxiv.org/pdf/2603.22275
• Project Page: https://cvlab-kaist.github.io/GLD/
• Github: https://github.com/cvlab-kaist/GLD

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

📝 Summary:
OpenResearcher presents a reproducible pipeline for training deep research agents using offline search environments and synthesized trajectories, achieving improved accuracy on benchmark tasks. AI-gen...

🔹 Publication Date: Published on Mar 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.20278
• PDF: https://arxiv.org/pdf/2603.20278
• Project Page: https://github.com/TIGER-AI-Lab/OpenResearcher
• Github: https://github.com/TIGER-AI-Lab/OpenResearcher

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DeepLearning #ResearchAutomation #Reproducibility #OpenScience
FluidWorld: Reaction-Diffusion Dynamics as a Predictive Substrate for World Models

📝 Summary:
FluidWorld demonstrates that partial differential equations can serve as an efficient alternative to attention mechanisms and convolutional recurrent networks in world modeling, achieving better spati...

🔹 Publication Date: Published on Mar 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.21315
• PDF: https://arxiv.org/pdf/2603.21315
• Project Page: https://infinition.github.io/FluidWorld
• Github: https://github.com/infinition/FluidWorld

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs

📝 Summary:
AwaRes is a spatial-on-demand framework for VLMs that resolves the accuracy-efficiency trade-off. It operates on a low-resolution global view and uses tool-calling to dynamically retrieve high-resolution segments as needed. Training involves multi-turn reinforcement learning with composite rewards.

🔹 Publication Date: Published on Mar 14

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.16932
• PDF: https://arxiv.org/pdf/2603.16932
• Project Page: https://nimrodshabtay.github.io/AwaRes/
• Github: https://github.com/NimrodShabtay/AwaRes

Datasets citing this paper:
https://huggingface.co/datasets/NimrodShabtay1986/AwaRes

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Safe Flow Q-Learning: Offline Safe Reinforcement Learning with Reachability-Based Flow Policies

📝 Summary:
SafeFlow Q-Learning extends FQL to safe offline reinforcement learning by combining a Hamilton-Jacobi reachability-inspired safety value function with an efficient one-step flow policy, achieving lowe...

🔹 Publication Date: Published on Mar 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.15136
• PDF: https://arxiv.org/pdf/2603.15136
• Project Page: https://tau-intelligence.com/safe-fql/
• Github: https://github.com/tau-intelligence/safe-fql

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1
AdditiveLLM2: A Multi-modal Large Language Model for Additive Manufacturing

📝 Summary:
AdditiveLLM2 is a multi-modal LLM built on Gemma 3, specialized for additive manufacturing via domain-adaptive pretraining and instruction tuning on a small dataset. It achieves over 90 percent accuracy in AM language and vision tasks, proving an accessible specialization method for domain-specif...

🔹 Publication Date: Published on Mar 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.22017
• PDF: https://arxiv.org/pdf/2603.22017

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Progressive Training for Explainable Citation-Grounded Dialogue: Reducing Hallucination to Zero in English-Hindi LLMs

📝 Summary:
XKD-Dial is a progressive training pipeline for explainable, bilingual English-Hindi knowledge-grounded dialogue. It achieves zero hallucination rates by using citation grounding and improves explainability through post-hoc analyses.

🔹 Publication Date: Published on Mar 19

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.18911
• PDF: https://arxiv.org/pdf/2603.18911

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#LLMs #ExplainableAI #NaturalLanguageProcessing #AIResearch #HallucinationReduction
Aperiodic Structures Never Collapse: Fibonacci Hierarchies for Lossless Compression

📝 Summary:
Fibonacci quasicrystal tilings provide superior lossless compression advantages over periodic alternatives through structural properties that maintain dictionary reuse across all scales and achieve lo...

🔹 Publication Date: Published on Mar 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.14999
• PDF: https://arxiv.org/pdf/2603.14999
• Github: https://github.com/robtacconelli/quasicryth

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1
Scalable Prompt Routing via Fine-Grained Latent Task Discovery

📝 Summary:
This paper introduces a two-stage prompt routing architecture for efficiently selecting optimal language models. It uses graph-based clustering to discover latent task types and a mixture-of-experts for quality estimation. This approach improves performance and reduces computational cost by dynam...

🔹 Publication Date: Published on Mar 19

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.19415
• PDF: https://arxiv.org/pdf/2603.19415

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1
LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels

📝 Summary:
LeWorldModel is a stable, end-to-end JEPA that trains efficiently from raw pixels with only two loss terms. It achieves competitive performance in control tasks, plans faster, and encodes meaningful physical structures, even detecting impossible events.

🔹 Publication Date: Published on Mar 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.19312
• PDF: https://arxiv.org/pdf/2603.19312
• Project Page: https://le-wm.github.io/
• Github: https://github.com/lucas-maes/le-wm

🔹 Models citing this paper:
https://huggingface.co/aguennoune17/atlas-v2-nwm-fp8-compressed

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
2
ThinkJEPA: Empowering Latent World Models with Large Vision-Language Reasoning Model

📝 Summary:
ThinkJEPA improves latent world models by combining dense JEPA dynamics with VLM semantic guidance through a dual-temporal pathway. This framework enhances long-horizon hand-manipulation trajectory prediction.

🔹 Publication Date: Published on Mar 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.22281
• PDF: https://arxiv.org/pdf/2603.22281

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#ThinkJEPA #LatentWorldModels #VLM #Robotics #AI
TrajLoom: Dense Future Trajectory Generation from Video

📝 Summary:
TrajLoom is a new framework for predicting dense future motion trajectories in videos. It uses grid-anchor encoding, a VAE for a compact latent space, and flow matching to generate realistic future motion. The method significantly extends prediction horizons and improves motion realism.

🔹 Publication Date: Published on Mar 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.22606
• PDF: https://arxiv.org/pdf/2603.22606
• Project Page: https://trajloom.github.io/
• Github: https://github.com/zewei-Zhang/TrajLoom

🔹 Models citing this paper:
https://huggingface.co/zeweizhang/TrajLoom

Datasets citing this paper:
https://huggingface.co/datasets/zeweizhang/TrajLoomDatasets

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
AgentSLR: Automating Systematic Literature Reviews in Epidemiology with Agentic AI

📝 Summary:
Large language models can automate systematic literature reviews with human-level performance while reducing review time from weeks to hours. AI-generated summary Systematic literature reviews are ess...

🔹 Publication Date: Published on Mar 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.22327
• PDF: https://arxiv.org/pdf/2603.22327
• Project Page: https://oxrml.com/agent-slr/
• Github: https://github.com/OxRML/AgentSLR

Datasets citing this paper:
https://huggingface.co/datasets/OxRML/AgentSLR

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents

📝 Summary:
LLM-based systems use executable workflows that interleave various computational components, with recent approaches organized by workflow structure determination timing and optimization dimensions. AI...

🔹 Publication Date: Published on Mar 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.22386
• PDF: https://arxiv.org/pdf/2603.22386
• Github: https://github.com/IBM/awesome-agentic-workflow-optimization

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research