ML Research Hub
32.9K subscribers
5.35K photos
332 videos
24 files
5.78K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
LM-Lexicon: Improving Definition Modeling via Harmonizing Semantic Experts

📝 Summary:
LM-Lexicon improves definition modeling using data clustering and a sparse mixture-of-experts architecture. It trains specialized semantic experts, achieving substantial improvements in definition quality and higher BLEU scores. This advances efficient language models for semantic applications.

🔹 Publication Date: Published on Feb 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14060
• PDF: https://arxiv.org/pdf/2602.14060
• Project Page: https://lm-lexicon.github.io
• Github: https://github.com/jacklanda/LMLexicon

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
DHPLT: large-scale multilingual diachronic corpora and word representations for semantic change modelling

📝 Summary:
In this resource paper, we present DHPLT, an open collection of diachronic corpora in 41 diverse languages. DHPLT is based on the web-crawled HPLT datasets; we use web crawl timestamps as the approxim...

🔹 Publication Date: Published on Feb 12

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.11968
• PDF: https://arxiv.org/pdf/2602.11968
• Project Page: https://data.hplt-project.org/three/diachronic/
• Github: https://github.com/ltgoslo/scdisc_hplt

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
SPILLage: Agentic Oversharing on the Web

📝 Summary:
Web agents inadvertently disclose user information through both content and behavioral traces, with behavioral oversharing being more prevalent than content oversharing, and this issue persists despit...

🔹 Publication Date: Published on Feb 13

🔹 Paper Links:
• arXiv Page: https://www.arxiv.org/abs/2602.13516
• PDF: https://arxiv.org/pdf/2602.13516
• Github: https://github.com/jrohsc/SPILLage

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
STATe-of-Thoughts: Structured Action Templates for Tree-of-Thoughts

📝 Summary:
STATe-of-Thoughts is an interpretable method that replaces stochastic sampling with discrete textual interventions to search over reasoning patterns. This produces more diverse, high-quality, and explainable text, offering better control and insight into the generation process.

🔹 Publication Date: Published on Feb 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14265
• PDF: https://arxiv.org/pdf/2602.14265
• Github: https://github.com/zbambergerNLP/state-of-thoughts

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
EditCtrl: Disentangled Local and Global Control for Real-Time Generative Video Editing

📝 Summary:
Efficient video inpainting framework that focuses computation on masked regions while maintaining global context consistency through a lightweight embedder. AI-generated summary High-fidelity generati...

🔹 Publication Date: Published on Feb 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.15031
• PDF: https://arxiv.org/pdf/2602.15031
• Project Page: https://yehonathanlitman.github.io/edit_ctrl/
• Github: https://github.com/yehonathanlitman/EditCtrl

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
AnchorWeave: World-Consistent Video Generation with Retrieved Local Spatial Memories

📝 Summary:
AnchorWeave addresses long-term video generation consistency by replacing global 3D scene reconstruction with multiple local geometric memories and a multi-anchor weaving controller to reconcile cross...

🔹 Publication Date: Published on Feb 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14941
• PDF: https://arxiv.org/pdf/2602.14941
• Project Page: https://zunwang1.github.io/AnchorWeave
• Github: https://github.com/wz0919/AnchorWeave

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Learning to Configure Agentic AI Systems

📝 Summary:
Current AI agent configurations are static and inefficient. This paper introduces ARC, an RL-based system that learns dynamic, per-query configurations. ARC significantly boosts accuracy while cutting costs, proving dynamic configuration is superior.

🔹 Publication Date: Published on Feb 12

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.11574
• PDF: https://arxiv.org/pdf/2602.11574
• Github: https://github.com/somsagar07/Context_Optimization

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1
CellMaster: Collaborative Cell Type Annotation in Single-Cell Analysis

📝 Summary:
CellMaster uses LLM-encoded knowledge for zero-shot cell-type annotation in single-cell RNA sequencing, improving accuracy over existing tools through interpretable rationales without pre-training. AI...

🔹 Publication Date: Published on Feb 12

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.13346
• PDF: https://arxiv.org/pdf/2602.13346

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1🔥1
VisPhyWorld: Probing Physical Reasoning via Code-Driven Video Reconstruction

📝 Summary:
VisPhyWorld evaluates MLLMs physical reasoning by requiring them to generate executable simulator code from visual observations. This makes their inferred world representations inspectable and falsifiable, unlike recognition-based methods. Experiments show MLLMs understand scenes but struggle wit...

🔹 Publication Date: Published on Feb 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.13294
• PDF: https://arxiv.org/pdf/2602.13294
• Project Page: https://tiger-ai-lab.github.io/VisPhyWorld/
• Github: https://github.com/TIGER-AI-Lab/VisPhyWorld

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Found-RL: foundation model-enhanced reinforcement learning for autonomous driving

📝 Summary:
Found-RL enhances reinforcement learning for autonomous driving using vision-language models, overcoming VLM latency with asynchronous inference. It distills VLM knowledge into a lightweight RL policy through supervision and reward shaping, achieving near VLM performance at real-time speeds.

🔹 Publication Date: Published on Feb 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.10458
• PDF: https://arxiv.org/pdf/2602.10458
• Project Page: https://ys-qu.github.io/found-rl-website/
• Github: https://github.com/ys-qu/found-rl

🔹 Models citing this paper:
https://huggingface.co/ys-qu/found-rl_vlms

Datasets citing this paper:
https://huggingface.co/datasets/ys-qu/found-rl_dataset

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
GLM-5: from Vibe Coding to Agentic Engineering

📝 Summary:
GLM-5 is a new foundation model designed for agentic engineering. It reduces costs with DSA and improves alignment and efficiency using asynchronous reinforcement learning. GLM-5 achieves state-of-the-art performance in real-world coding and end-to-end software engineering tasks.

🔹 Publication Date: Published on Feb 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.15763
• PDF: https://arxiv.org/pdf/2602.15763
• Github: https://github.com/zai-org/GLM-5

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
On Surprising Effectiveness of Masking Updates in Adaptive Optimizers

📝 Summary:
Random parameter update masking achieves superior optimization for large language models by inducing curvature-dependent regularization, with a momentum-aligned variant delivering significant performa...

🔹 Publication Date: Published on Feb 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.15322
• PDF: https://arxiv.org/pdf/2602.15322

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Does Socialization Emerge in AI Agent Society? A Case Study of Moltbook

📝 Summary:
Large language model agents in networked environments exhibit dynamic stability without true social convergence, maintaining individual diversity while lacking collective influence structures. AI-gene...

🔹 Publication Date: Published on Feb 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14299
• PDF: https://arxiv.org/pdf/2602.14299
• Project Page: https://github.com/MingLiiii/Moltbook_Socialization
• Github: https://github.com/MingLiiii/Moltbook_Socialization

Datasets citing this paper:
https://huggingface.co/datasets/AIcell/moltbook-data

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Visual Persuasion: What Influences Decisions of Vision-Language Models?

📝 Summary:
Visual-language models' decision-making preferences are studied through controlled image choice tasks with systematic input perturbations, revealing visual vulnerabilities and safety concerns. AI-gene...

🔹 Publication Date: Published on Feb 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.15278
• PDF: https://arxiv.org/pdf/2602.15278
• Project Page: https://visual-persuasion-website.vercel.app/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Geometry-Aware Rotary Position Embedding for Consistent Video World Model

📝 Summary:
ViewRope, a geometry-aware encoding method, enhances long-term consistency in predictive world models by injecting camera-ray directions into video transformer attention layers, addressing spatial per...

🔹 Publication Date: Published on Feb 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.07854
• PDF: https://arxiv.org/pdf/2602.07854
• Project Page: https://huggingface.co/papers?q=projective%20geometry

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
STAPO: Stabilizing Reinforcement Learning for LLMs by Silencing Rare Spurious Tokens

📝 Summary:
Research identifies spurious tokens as the cause of training instability in reinforcement learning fine-tuning of large language models and proposes a solution that selectively masks problematic gradi...

🔹 Publication Date: Published on Feb 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.15620
• PDF: https://arxiv.org/pdf/2602.15620

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
ResearchGym: Evaluating Language Model Agents on Real-World AI Research

📝 Summary:
ResearchGym presents a benchmark environment for evaluating AI agents on end-to-end research tasks, revealing significant capability-reliability gaps in current autonomous agents despite occasional st...

🔹 Publication Date: Published on Feb 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.15112
• PDF: https://arxiv.org/pdf/2602.15112
• Github: https://github.com/Anikethh/ResearchGym

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Understanding vs. Generation: Navigating Optimization Dilemma in Multimodal Models

📝 Summary:
The Reason-Reflect-Refine framework addresses the trade-off between generation and understanding in multimodal models by reformulating single-step generation into a multi-step process that explicitly ...

🔹 Publication Date: Published on Feb 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.15772
• PDF: https://arxiv.org/pdf/2602.15772
• Github: https://github.com/sen-ye/R3

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
ClinAlign: Scaling Healthcare Alignment from Clinician Preference

📝 Summary:
A two-stage framework addresses alignment of large language models with clinician preferences through physician-verified examples and distilled clinical principles for improved medical reasoning. AI-g...

🔹 Publication Date: Published on Feb 10

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.09653
• PDF: https://arxiv.org/pdf/2602.09653
• Project Page: https://github.com/AQ-MedAI/ClinAlign
• Github: https://github.com/AQ-MedAI/ClinAlign

🔹 Models citing this paper:
https://huggingface.co/AQ-MedAI/ClinAligh-4B
https://huggingface.co/AQ-MedAI/ClinAligh-30B-A3B

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1
This media is not supported in your browser
VIEW IN TELEGRAM
Revisiting the Platonic Representation Hypothesis: An Aristotelian View

📝 Summary:
A new calibration framework corrects inflated neural network similarity scores. It reveals global convergence vanishes, while local neighborhood similarity persists, supporting the Aristotelian Representation Hypothesis of shared local relationships.

🔹 Publication Date: Published on Feb 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14486
• PDF: https://arxiv.org/pdf/2602.14486
• Project Page: https://brbiclab.epfl.ch/projects/aristotelian/
• Github: https://github.com/mlbio-epfl/aristotelian

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Learning Native Continuation for Action Chunking Flow Policies

📝 Summary:
Legato improves action-chunked Vision Language Action models by using training-time continuation methods that ensure smooth trajectories and reduce multimodal switching during real-time execution. AI-...

🔹 Publication Date: Published on Feb 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.12978
• PDF: https://arxiv.org/pdf/2602.12978
• Project Page: https://lyfeng001.github.io/Legato/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research