ML Research Hub
32.3K subscribers
6.72K photos
466 videos
24 files
7.3K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
Mind's Eye: A Benchmark of Visual Abstraction, Transformation and Composition for Multimodal LLMs

📝 Summary:
Multimodal large language models demonstrate significant limitations in visuospatial reasoning tasks compared to human performance, revealing deficiencies in visual attention, perceptual manipulation,...

🔹 Publication Date: Published on Apr 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.16054
• PDF: https://arxiv.org/pdf/2604.16054

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
ShadowPEFT: Shadow Network for Parameter-Efficient Fine-Tuning

📝 Summary:
ShadowPEFT is a new parameter-efficient fine-tuning framework that uses a depth-shared shadow module for layer-level refinement. This shifts adaptation from distributed weight perturbations to a shared layer-space process, matching or outperforming LoRA with reduced overhead and increased flexibi...

🔹 Publication Date: Published on Apr 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.19254
• PDF: https://arxiv.org/pdf/2604.19254
• Project Page: https://github.com/ShadowLLM/shadow-peft
• Github: https://github.com/ShadowLLM/shadow-peft

🔹 Models citing this paper:
https://huggingface.co/shadow-llm/Qwen3-4B-GSM8k-Shadow
https://huggingface.co/shadow-llm/Qwen3-4B-SquadV2-Shadow
https://huggingface.co/shadow-llm/Qwen3-4B-MMLU-Shadow

Datasets citing this paper:
https://huggingface.co/datasets/shadow-llm/robot-dog-skills

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#PEFT #FineTuning #MachineLearning #AI #LLMs
HP-Edit: A Human-Preference Post-Training Framework for Image Editing

📝 Summary:
A post-training framework called HP-Edit is introduced to align image editing models with human preferences using a novel automatic evaluator and a real-world dataset, improving editing quality throug...

🔹 Publication Date: Published on Apr 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.19406
• PDF: https://arxiv.org/pdf/2604.19406

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Understanding and Enforcing Weight Disentanglement in Task Arithmetic

📝 Summary:
Task arithmetic lacks theoretical explanation for its success, but the proposed OrthoReg method addresses this by promoting weight disentanglement through enforced orthogonality in weight updates duri...

🔹 Publication Date: Published on Apr 18

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.17078
• PDF: https://arxiv.org/pdf/2604.17078
• Github: https://github.com/RL-MIND/OrthoReg

🔹 Models citing this paper:
https://huggingface.co/RL-MIND/OrthoReg_checkpoints
https://huggingface.co/RL-MIND/OrthoReg

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
AJ-Bench: Benchmarking Agent-as-a-Judge for Environment-Aware Evaluation

📝 Summary:
Agent-as-a-Judge benchmark evaluates automated verification capabilities across multiple domains with comprehensive task assessment. AI-generated summary As reinforcement learning continues to scale t...

🔹 Publication Date: Published on Apr 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.18240
• PDF: https://arxiv.org/pdf/2604.18240
• Project Page: https://aj-bench.github.io/
• Github: https://github.com/aj-bench/AJ-Bench

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Accurate and scalable exchange-correlation with deep learning

📝 Summary:
A deep learning approach to density functional theory achieves higher accuracy than traditional methods while maintaining computational efficiency by learning electronic structure representations dire...

🔹 Publication Date: Published on Apr 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2506.14665
• PDF: https://arxiv.org/pdf/2506.14665
• Project Page: https://aka.ms/dft
• Github: https://github.com/microsoft/skala

🔹 Models citing this paper:
https://huggingface.co/microsoft/skala-1.0
https://huggingface.co/microsoft/skala-1.1

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
What Makes an LLM a Good Optimizer? A Trajectory Analysis of LLM-Guided Evolutionary Search

📝 Summary:
LLM-guided evolutionary search shows that optimization success depends on search trajectory characteristics rather than initial problem-solving ability alone, with strong optimizers refining locally w...

🔹 Publication Date: Published on Apr 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.19440
• PDF: https://arxiv.org/pdf/2604.19440
• Project Page: https://xinhao-zhang.github.io/traj_evo_search/
• Github: https://github.com/XINHAO-ZHANG/LLMEvo_Eval

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#LLM #Optimization #EvolutionaryAlgorithms #AI #MachineLearning
MoVE: Translating Laughter and Tears via Mixture of Vocalization Experts in Speech-to-Speech Translation

📝 Summary:
MoVE, a Mixture-of-LoRA-Experts architecture with expressive-specialized adapters and a soft-weighting router, enables effective speech-to-speech translation with preserved non-verbal vocalizations wh...

🔹 Publication Date: Published on Apr 19

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.17435
• PDF: https://arxiv.org/pdf/2604.17435
• Github: https://github.com/47zzz/MoVE

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Micro Language Models Enable Instant Responses

📝 Summary:
Micro language models enable instant on-device response initiation with cloud-based continuation, achieving low-latency interactive AI through asymmetric collaboration between edge and cloud computing...

🔹 Publication Date: Published on Apr 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.19642
• PDF: https://arxiv.org/pdf/2604.19642
• Github: https://github.com/Sensente/micro_language_model_swen_project

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1
Media is too big
VIEW IN TELEGRAM
CityRAG: Stepping Into a City via Spatially-Grounded Video Generation

📝 Summary:
CityRAG generates long-term, physically grounded video sequences that maintain environmental consistency and support complex navigation through real-world geography using geo-registered data as contex...

🔹 Publication Date: Published on Apr 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.19741
• PDF: https://arxiv.org/pdf/2604.19741
• Project Page: https://cityrag.github.io/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#VideoGeneration #GenerativeAI #SpatialAI #ComputerVision #UrbanSimulation
RDP LoRA: Geometry-Driven Identification for Parameter-Efficient Adaptation in Large Language Models

📝 Summary:
Using geometric trajectory analysis with the Ramer-Douglas-Peucker algorithm to select optimal layers for parameter-efficient fine-tuning of large language models, achieving better performance than fu...

🔹 Publication Date: Published on Apr 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.19321
• PDF: https://arxiv.org/pdf/2604.19321

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1
Cortex 2.0: Grounding World Models in Real-World Industrial Deployment

📝 Summary:
Cortex 2.0 introduces a plan-and-act control system for reliable long-horizon robotic manipulation. It generates and evaluates future trajectories in visual latent space, outperforming reactive Vision-Language-Action models. This demonstrates world-model-based planning's reliability in complex in...

🔹 Publication Date: Published on Apr 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.20246
• PDF: https://arxiv.org/pdf/2604.20246

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
SWE-chat: Coding Agent Interactions From Real Users in the Wild

📝 Summary:
SWE-chat presents a large-scale dataset of real coding agent interactions that reveals significant inefficiencies and challenges in current AI-assisted development practices. AI-generated summary A I ...

🔹 Publication Date: Published on Apr 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.20779
• PDF: https://arxiv.org/pdf/2604.20779

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

📝 Summary:
LLaDA2.0-Uni is a unified discrete diffusion language model that integrates multimodal understanding and generation through a semantic discrete tokenizer, MoE-based backbone, and diffusion decoder, ac...

🔹 Publication Date: Published on Apr 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.20796
• PDF: https://arxiv.org/pdf/2604.20796
• Github: https://github.com/inclusionAI/LLaDA2.0-Uni

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
DR-Venus: Towards Frontier Edge-Scale Deep Research Agents with Only 10K Open Data

📝 Summary:
DR-Venus-4B is a 4-billion-parameter deep research agent trained entirely on open data using agentic supervised fine-tuning and reinforcement learning with turn-level rewards to achieve superior perfo...

🔹 Publication Date: Published on Apr 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.19859
• PDF: https://arxiv.org/pdf/2604.19859
• Project Page: https://huggingface.co/collections/inclusionAI/dr-venus
• Github: https://github.com/inclusionAI/DR-Venus/tree/master/Inference

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
WavAlign: Enhancing Intelligence and Expressiveness in Spoken Dialogue Models via Adaptive Hybrid Post-Training

📝 Summary:
Spoken dialogue models face challenges in expressiveness despite end-to-end approaches, but a modality-aware adaptive post-training method using constrained preference updates and explicit anchoring i...

🔹 Publication Date: Published on Apr 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.14932
• PDF: https://arxiv.org/pdf/2604.14932

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
MMCORE: MultiModal COnnection with Representation Aligned Latent Embeddings

📝 Summary:
MMCORE is a unified framework for multimodal image generation and editing that uses a pre-trained Vision-Language Model to predict semantic visual embeddings for diffusion model conditioning, enabling...

🔹 Publication Date: Published on Apr 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.19902
• PDF: https://arxiv.org/pdf/2604.19902

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
CreativeGame:Toward Mechanic-Aware Creative Game Generation

📝 Summary:
A multi-agent system for iterative HTML5 game generation that uses programmatic rewards, lineage memory, runtime validation, and mechanic-guided planning to enable interpretable version-to-version evo...

🔹 Publication Date: Published on Apr 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.19926
• PDF: https://arxiv.org/pdf/2604.19926

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Self-Evolving LLM Memory Extraction Across Heterogeneous Tasks

📝 Summary:
LLM-based assistants require heterogeneous memory extraction capabilities, which are evaluated through the BEHEMOTH benchmark, with CluE offering improved performance through cluster-based prompt opti...

🔹 Publication Date: Published on Apr 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.11610
• PDF: https://arxiv.org/pdf/2604.11610
• Github: https://github.com/ayyyq/heterogeneous-memory-extraction

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Exploring Spatial Intelligence from a Generative Perspective

📝 Summary:
Generative spatial intelligence benchmark evaluates and enhances 3D spatial constraint manipulation in image generation through real-world and synthetic datasets. AI-generated summary Spatial intellig...

🔹 Publication Date: Published on Apr 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.20570
• PDF: https://arxiv.org/pdf/2604.20570
• Project Page: https://aim-uofa.github.io/GSI-Bench/
• Github: https://github.com/aim-uofa/GSI-Bench

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research