ML Research Hub
32.3K subscribers
6.74K photos
473 videos
24 files
7.36K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
Chasing the Public Score: User Pressure and Evaluation Exploitation in Coding Agent Workflows

📝 Summary:
Research examines how user pressure in coding agent workflows leads to score manipulation without genuine performance improvement, finding that stronger models exploit more frequently and that prompts...

🔹 Publication Date: Published on Apr 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.20200
• PDF: https://arxiv.org/pdf/2604.20200
• Project Page: https://ucsc-vlaa.github.io/AgentPressureBench
• Github: https://github.com/ucsc-vlaa/AgentPressureBench

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Test-Time Adaptation for EEG Foundation Models: A Systematic Study under Real-World Distribution Shifts

📝 Summary:
Test-time adaptation for EEG foundation models shows inconsistent performance across distribution shifts. Optimization-free methods are more stable and reliable, while gradient-based approaches often degrade performance. This highlights limitations and the need for domain-specific EEG adaptation ...

🔹 Publication Date: Published on Apr 18

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.16926
• PDF: https://arxiv.org/pdf/2604.16926

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
UniT: Toward a Unified Physical Language for Human-to-Humanoid Policy Learning and World Modeling

📝 Summary:
UniT creates a unified physical language for human-to-humanoid transfer using cross-reconstruction and shared latent spaces. This approach effectively bridges kinematic differences, enabling scalable policy learning and world modeling with human data for humanoid robots.

🔹 Publication Date: Published on Apr 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.19734
• PDF: https://arxiv.org/pdf/2604.19734
• Project Page: https://xpeng-robotics.github.io/unit/
• Github: https://github.com/xpeng-robotics/UniT

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
StyleID: A Perception-Aware Dataset and Metric for Stylization-Agnostic Facial Identity Recognition

📝 Summary:
StyleID presents a human perception-aware dataset and evaluation framework for facial identity preservation under stylization, featuring two datasets derived from psychometric experiments and calibrat...

🔹 Publication Date: Published on Apr 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.21689
• PDF: https://arxiv.org/pdf/2604.21689
• Project Page: https://kwanyun.github.io/StyleID_page/
• Github: https://github.com/kwanyun/StyleID

🔹 Models citing this paper:
https://huggingface.co/kwanY/styleid

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
This media is not supported in your browser
VIEW IN TELEGRAM
Seeing Fast and Slow: Learning the Flow of Time in Videos

📝 Summary:
Video speed manipulation and perception models are developed through self-supervised temporal reasoning, enabling speed detection, slow-motion video generation, and temporal super-resolution from in-t...

🔹 Publication Date: Published on Apr 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.21931
• PDF: https://arxiv.org/pdf/2604.21931
• Project Page: https://seeing-fast-and-slow.github.io/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
This media is not supported in your browser
VIEW IN TELEGRAM
WorldMark: A Unified Benchmark Suite for Interactive Video World Models

📝 Summary:
WorldMark establishes a standardized benchmark for evaluating interactive video generation models with unified controls, identical scenarios, and comprehensive evaluation metrics across multiple model...

🔹 Publication Date: Published on Apr 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.21686
• PDF: https://arxiv.org/pdf/2604.21686
• Project Page: https://alaya-studio.github.io/WorldMark/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Context Unrolling in Omni Models

📝 Summary:
Omni is a unified multimodal model trained on diverse data types that enables context unrolling for improved reasoning across heterogeneous modalities. AI-generated summary We present Omni, a unified ...

🔹 Publication Date: Published on Apr 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.21921
• PDF: https://arxiv.org/pdf/2604.21921
• Project Page: https://omni-model.com/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
UniGenDet: A Unified Generative-Discriminative Framework for Co-Evolutionary Image Generation and Generated Image Detection

📝 Summary:
A unified generative-discriminative framework is proposed that enables co-evolutionary image generation and detection through symbiotic attention mechanisms and unified fine-tuning algorithms. AI-gene...

🔹 Publication Date: Published on Apr 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.21904
• PDF: https://arxiv.org/pdf/2604.21904
• Project Page: https://ivg-yanranzhang.github.io/UniGenDet/
• Github: https://github.com/Zhangyr2022/UniGenDet

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI Automation

📝 Summary:
VLAA-GUI is a modular GUI agent framework that addresses early stopping and repetitive loop issues through integrated components for verification, loop breaking, and search capabilities. AI-generated ...

🔹 Publication Date: Published on Apr 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.21375
• PDF: https://arxiv.org/pdf/2604.21375
• Project Page: https://ucsc-vlaa.github.io/VLAA-GUI/
• Github: https://github.com/UCSC-VLAA/VLAA-GUI

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
TingIS: Real-time Risk Event Discovery from Noisy Customer Incidents at Enterprise Scale

📝 Summary:
TingIS is an enterprise-grade incident discovery system that uses multi-stage event linking with LLMs, cascaded routing, and noise reduction to efficiently identify critical issues from high-volume, n...

🔹 Publication Date: Published on Apr 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.21889
• PDF: https://arxiv.org/pdf/2604.21889

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Trust but Verify: Introducing DAVinCI -- A Framework for Dual Attribution and Verification in Claim Inference for Language Models

📝 Summary:
DAVinCI is a dual attribution and verification framework that enhances factual reliability and interpretability of large language models by attributing claims to internal components and external sourc...

🔹 Publication Date: Published on Apr 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.21193
• PDF: https://arxiv.org/pdf/2604.21193
• Github: https://github.com/vr25/davinci

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Explainable Disentangled Representation Learning for Generalizable Authorship Attribution in the Era of Generative AI

📝 Summary:
A novel variational autoencoder framework with supervised contrastive learning and discriminative disentanglement achieves superior performance in authorship attribution and AI-generated text detectio...

🔹 Publication Date: Published on Apr 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.21300
• PDF: https://arxiv.org/pdf/2604.21300
• Github: https://github.com/hieum98/avae

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks

📝 Summary:
A co-evolution framework enables large language models to discover, retain, and reuse structured skills across episodes in long-horizon interactive environments through a learnable skill bank and skil...

🔹 Publication Date: Published on Apr 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.20987
• PDF: https://arxiv.org/pdf/2604.20987
• Project Page: https://wuxiyang1996.github.io/COSPLAY_page/
• Github: https://wuxiyang1996.github.io/COSPLAY_page/

🔹 Models citing this paper:
https://huggingface.co/IntelligenceLab/COS-PLAY

Datasets citing this paper:
https://huggingface.co/datasets/IntelligenceLab/Cos-Play-Cold-Start

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Hybrid Policy Distillation for LLMs

📝 Summary:
Hybrid Policy Distillation combines forward and reverse KL divergence approaches to improve knowledge distillation stability and efficiency across different model sizes and tasks. AI-generated summary...

🔹 Publication Date: Published on Apr 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.20244
• PDF: https://arxiv.org/pdf/2604.20244
• Github: https://github.com/zwhong714/Hybrid-Policy-Distillation

🔹 Models citing this paper:
https://huggingface.co/wh-zhu/Qwen2.5-7B-PSFT-RL-DAPO-90
https://huggingface.co/wh-zhu/qwen2.5-1.5B-longcot-reasoning-HPD

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
WebGen-R1: Incentivizing Large Language Models to Generate Functional and Aesthetic Websites with Reinforcement Learning

📝 Summary:
WebGen-R1 is a reinforcement learning framework enabling small language models to generate functional and aesthetically pleasing multi-page websites. It uses structured generation and a novel cascaded multimodal reward for structural integrity, functional feedback, and aesthetic supervision. WebG...

🔹 Publication Date: Published on Apr 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.20398
• PDF: https://arxiv.org/pdf/2604.20398

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#ReinforcementLearning #LLMs #WebsiteGeneration #AI #WebDevelopment
Media is too big
VIEW IN TELEGRAM
EditCrafter: Tuning-free High-Resolution Image Editing via Pretrained Diffusion Model

📝 Summary:
EditCrafter enables high-resolution image editing using pretrained text-to-image diffusion models through tiled inversion and noise-damped manifold-constrained guidance without requiring model tuning....

🔹 Publication Date: Published on Apr 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.10268
• PDF: https://arxiv.org/pdf/2604.10268
• Project Page: https://editcrafter.github.io/
• Github: https://github.com/EditCrafter/EditCrafter

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1
PersonalAI: A Systematic Comparison of Knowledge Graph Storage and Retrieval Approaches for Personalized LLM agents

📝 Summary:
A knowledge graph-based external memory framework enhances language model personalization through dynamic semantic and temporal representations with diverse retrieval mechanisms. AI-generated summary ...

🔹 Publication Date: Published on Apr 12

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2506.17001
• PDF: https://arxiv.org/pdf/2506.17001

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Encoder-Free Human Motion Understanding via Structured Motion Descriptions

📝 Summary:
Structured Motion Description SMD converts human motion into natural language, enabling large language models LLMs to reason about it directly. This encoder-free method achieves state-of-the-art performance on motion question answering and captioning.

🔹 Publication Date: Published on Apr 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.21668
• PDF: https://arxiv.org/pdf/2604.21668
• Project Page: https://yaozhang182.github.io/motion-smd/
• Github: https://yaozhang182.github.io/motion-smd/

🔹 Models citing this paper:
https://huggingface.co/zyyy12138/motion-smd-lora

Datasets citing this paper:
https://huggingface.co/datasets/zyyy12138/motion-smd-data

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#HumanMotionUnderstanding #LLMs #NLP #AI #DeepLearning
1
This media is not supported in your browser
VIEW IN TELEGRAM
LLaTiSA: Towards Difficulty-Stratified Time Series Reasoning from Visual Perception to Semantics

📝 Summary:
A hierarchical time series reasoning dataset and model are introduced to improve LLM understanding of temporal data through visualized patterns and numerical tables. AI-generated summary Comprehensive...

🔹 Publication Date: Published on Apr 19

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.17295
• PDF: https://arxiv.org/pdf/2604.17295
• Github: https://github.com/RainingNovember/LLaTiSA

Datasets citing this paper:
https://huggingface.co/datasets/November-Rain/HiTSR

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1
Media is too big
VIEW IN TELEGRAM
Vista4D: Video Reshooting with 4D Point Clouds

📝 Summary:
Vista4D is a video reshooting framework that uses 4D point clouds to synthesize dynamic scenes from new camera viewpoints. It improves 4D consistency, camera control, and visual quality by overcoming depth estimation issues and preserving scene content.

🔹 Publication Date: Published on Apr 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.21915
• PDF: https://arxiv.org/pdf/2604.21915
• Project Page: https://eyeline-labs.github.io/Vista4D
• Github: https://github.com/Eyeline-Labs/Vista4D

🔹 Models citing this paper:
https://huggingface.co/Eyeline-Labs/Vista4D

Datasets citing this paper:
https://huggingface.co/datasets/Eyeline-Labs/Vista4D-Eval-Data

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1