ML Research Hub
32.3K subscribers
6.73K photos
472 videos
24 files
7.35K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
Backdoor Attacks on Decentralised Post-Training

📝 Summary:
This paper introduces the first backdoor attack on pipeline parallelism in decentralized LLM post-training. An adversary controlling an intermediate stage can significantly misalign the model, reducing alignment from 80% to 6% with a trigger word, even resisting safety training.

🔹 Publication Date: Published on Mar 31

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.02372
• PDF: https://arxiv.org/pdf/2604.02372

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#BackdoorAttack #LLM #DecentralizedAI #AISecurity #MachineLearning
1
Multi-User Large Language Model Agents

📝 Summary:
Multi-user LLM agents struggle with conflicting objectives, privacy, and coordination. This study formalizes the problem and reveals systematic gaps in current LLMs. They fail to prioritize instructions, violate privacy, and suffer coordination bottlenecks.

🔹 Publication Date: Published on Mar 19

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.08567
• PDF: https://arxiv.org/pdf/2604.08567
• Project Page: https://korde-ai.github.io/Multi-User-LLM-Agent/
• Github: https://github.com/Korde-AI/Multi-User-LLM-Agent

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
p1: Better Prompt Optimization with Fewer Prompts

📝 Summary:
Research reveals that prompt optimization effectiveness depends on the balance between response stochasticity and system prompt quality variance, leading to the development of a filtering method that ...

🔹 Publication Date: Published on Apr 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.08801
• PDF: https://arxiv.org/pdf/2604.08801

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
EquiformerV3: Scaling Efficient, Expressive, and General SE(3)-Equivariant Graph Attention Transformers

📝 Summary:
EquiformerV3 advances SE(3)-equivariant graph neural networks through enhanced efficiency, expressivity, and generality via optimized implementation, improved architectural components, and novel activ...

🔹 Publication Date: Published on Apr 10

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.09130
• PDF: https://arxiv.org/pdf/2604.09130
• Github: https://github.com/atomicarchitects/equiformer_v3

🔹 Models citing this paper:
https://huggingface.co/yilunliao/equiformer_v3
https://huggingface.co/mirror-physics/equiformer_v3

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Semantic Richness or Geometric Reasoning? The Fragility of VLM's Visual Invariance

📝 Summary:
Vision-Language Models show significant vulnerabilities under geometric transformations, lacking robust spatial invariance and equivariance despite strong semantic capabilities. AI-generated summary T...

🔹 Publication Date: Published on Apr 3

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.01848
• PDF: https://arxiv.org/pdf/2604.01848
• Project Page: https://xthomasbu.github.io/visual_invariance/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
This media is not supported in your browser
VIEW IN TELEGRAM
Cross-Modal Emotion Transfer for Emotion Editing in Talking Face Video

📝 Summary:
A novel cross-modal emotion transfer approach generates expressive talking face videos by modeling emotion semantic vectors between speech and visual feature spaces, achieving superior emotion accurac...

🔹 Publication Date: Published on Apr 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.07786
• PDF: https://arxiv.org/pdf/2604.07786
• Project Page: https://chanhyeok-choi.github.io/C-MET/
• Github: https://github.com/ChanHyeok-Choi/C-MET

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
This media is not supported in your browser
VIEW IN TELEGRAM
Envisioning the Future, One Step at a Time

📝 Summary:
Autoregressive diffusion models predict open-set future scene dynamics by modeling sparse point trajectories, enabling fast and scalable multi-modal motion prediction with physical plausibility. AI-ge...

🔹 Publication Date: Published on Apr 10

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.09527
• PDF: https://arxiv.org/pdf/2604.09527
• Project Page: https://compvis.github.io/myriad
• Github: https://github.com/compvis/myriad

🔹 Models citing this paper:
https://huggingface.co/CompVis/myriad

Datasets citing this paper:
https://huggingface.co/datasets/CompVis/owm-95
https://huggingface.co/datasets/CompVis/myriad-physics

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Process Reward Agents for Steering Knowledge-Intensive Reasoning

📝 Summary:
Process Reward Agents provide domain-grounded, online step-wise rewards for frozen policies in knowledge-intensive reasoning, enabling improved search-based decoding and generalizing across different ...

🔹 Publication Date: Published on Apr 10

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.09482
• PDF: https://arxiv.org/pdf/2604.09482
• Project Page: https://process-reward-agents.github.io/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Robust Reasoning Benchmark

📝 Summary:
Research reveals that large language models exhibit fragile reasoning capabilities when subjected to perturbations, with open-weight models showing significant accuracy drops and evidence of memory po...

🔹 Publication Date: Published on Mar 26

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.08571
• PDF: https://arxiv.org/pdf/2604.08571

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Cactus: Accelerating Auto-Regressive Decoding with Constrained Acceptance Speculative Sampling

📝 Summary:
Speculative sampling methods are enhanced by formulating them as constrained optimization problems, enabling controlled distribution divergence while maintaining high acceptance rates and output quali...

🔹 Publication Date: Published on Apr 5

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.04987
• PDF: https://arxiv.org/pdf/2604.04987
• Github: https://github.com/MANGA-UOFA/Cactus

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
MixFlow: Mixed Source Distributions Improve Rectified Flows

📝 Summary:
Rectified flows and diffusion models are improved through κ-FC formulation that conditions the source distribution and MixFlow training strategy that reduces generative path curvatures and enhances sa...

🔹 Publication Date: Published on Apr 10

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.09181
• PDF: https://arxiv.org/pdf/2604.09181
• Github: https://github.com/NazirNayal8/MixFlow

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#RectifiedFlows #DiffusionModels #GenerativeAI #MachineLearning #AIResearch
Beyond the Assistant Turn: User Turn Generation as a Probe of Interaction Awareness in Language Models

📝 Summary:
User-turn generation probes LLM interaction awareness, decoupled from task accuracy. This awareness is often latent but revealed by higher temperature sampling and can be improved through post-training, uncovering a new dimension of LLM behavior.

🔹 Publication Date: Published on Apr 3

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.02315
• PDF: https://arxiv.org/pdf/2604.02315

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#LLM #NLP #AI #InteractionAwareness #UserTurnGeneration
Pseudo-Unification: Entropy Probing Reveals Divergent Information Patterns in Unified Multimodal Models

📝 Summary:
Unified multimodal models suffer from pseudo-unification due to asymmetric encoding and split response patterns, requiring consistent information flow for genuine multimodal synergy. AI-generated summ...

🔹 Publication Date: Published on Apr 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.10949
• PDF: https://arxiv.org/pdf/2604.10949

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
CodeTracer: Towards Traceable Agent States

📝 Summary:
CodeTracer is a tracing architecture that analyzes code agent execution by reconstructing state transitions and localizing failures in complex multi-stage workflows. AI-generated summary Code agents a...

🔹 Publication Date: Published on Apr 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.11641
• PDF: https://arxiv.org/pdf/2604.11641

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Media is too big
VIEW IN TELEGRAM
OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation

📝 Summary:
OmniShow is an end-to-end framework for human-object interaction video generation using multimodal conditions like text, images, audio, and pose. It uses unified conditioning, gated attention, and decoupled training to achieve state-of-the-art performance despite data scarcity. A new benchmark, H...

🔹 Publication Date: Published on Apr 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/pdf/2604.11804
• PDF: https://arxiv.org/pdf/2604.11804
• Project Page: https://correr-zhou.github.io/OmniShow
• Github: https://github.com/Correr-Zhou/OmniShow

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
TAIHRI: Task-Aware 3D Human Keypoints Localization for Close-Range Human-Robot Interaction

📝 Summary:
TAIHRI is a vision-language model designed for egocentric human-robot interaction that enables precise 3D keypoint localization through 2D keypoint reasoning and next token prediction. AI-generated su...

🔹 Publication Date: Published on Apr 10

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.08921
• PDF: https://arxiv.org/pdf/2604.08921
• Github: https://github.com/Tencent/TAIHRI

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
General365: Benchmarking General Reasoning in Large Language Models Across Diverse and Challenging Tasks

📝 Summary:
Large language models demonstrate limited general reasoning capabilities despite strong domain-specific performance, as revealed by a new benchmark assessing K-12 level reasoning across diverse proble...

🔹 Publication Date: Published on Apr 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.11778
• PDF: https://arxiv.org/pdf/2604.11778
• Project Page: https://general365.github.io/
• Github: https://general365.github.io/

Datasets citing this paper:
https://huggingface.co/datasets/meituan-longcat/General365_Public

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Low-rank Optimization Trajectories Modeling for LLM RLVR Acceleration

📝 Summary:
A nonlinear extrapolation framework for reinforcement learning with verifiable rewards in large language models that reduces computational overhead by modeling rank-1 parameter trajectories through Lo...

🔹 Publication Date: Published on Apr 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.11446
• PDF: https://arxiv.org/pdf/2604.11446
• Github: https://github.com/RUCAIBox/NExt

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models

📝 Summary:
Credit assignment methods for reinforcement learning with large language models are categorized by granularity and methodology, with distinct approaches emerging for reasoning versus agentic settings....

🔹 Publication Date: Published on Apr 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.09459
• PDF: https://arxiv.org/pdf/2604.09459
• Github: https://github.com/xxzcc/Awesome-Credit-Assignment-in-LLM-RL

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
ADD for Multi-Bit Image Watermarking

📝 Summary:
ADD is a multi-bit image watermarking method that uses linear combination and inner product operations for embedding and decoding, achieving high accuracy and efficiency compared to existing approache...

🔹 Publication Date: Published on Apr 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.11491
• PDF: https://arxiv.org/pdf/2604.11491

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
TorchUMM: A Unified Multimodal Model Codebase for Evaluation, Analysis, and Post-training

📝 Summary:
TorchUMM presents a unified codebase for evaluating and analyzing multimodal models across understanding, generation, and editing tasks with standardized protocols and diverse datasets. AI-generated s...

🔹 Publication Date: Published on Apr 12

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.10784
• PDF: https://arxiv.org/pdf/2604.10784
• Project Page: https://aifrontierlab.github.io/TorchUMM/
• Github: https://github.com/AIFrontierLab/TorchUMM

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research