ML Research Hub
32.9K subscribers
4.58K photos
281 videos
23 files
4.95K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
Uncertainty-Aware Gradient Signal-to-Noise Data Selection for Instruction Tuning

📝 Summary:
GRADFILTERING is an uncertainty-aware data selection framework for instruction tuning that uses gradient signal-to-noise ratio to improve LLM adaptation efficiency and performance. AI-generated summar...

🔹 Publication Date: Published on Jan 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.13697
• PDF: https://arxiv.org/pdf/2601.13697

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1
Forwarded from Data Analytics
This repository collects everything you need to use AI and LLM in your projects.

120+ libraries, organized by development stages:

→ Model training, fine-tuning, and evaluation
→ Deploying applications with LLM and RAG
→ Fast and scalable model launch
→ Data extraction, crawlers, and scrapers
→ Creating autonomous LLM agents
→ Prompt optimization and security

Repo: https://github.com/KalyanKS-NLP/llm-engineer-toolkit

🥺 https://t.iss.one/DataAnalyticsX
Please open Telegram to view this post
VIEW IN TELEGRAM
3
Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models

📝 Summary:
This survey presents 'Locate, Steer, and Improve' as an actionable framework for mechanistic interpretability in LLMs. It shifts MI from an observational science to a systematic methodology for optimizing LLMs, leading to tangible improvements in their alignment, capability, and efficiency.

🔹 Publication Date: Published on Jan 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.14004
• PDF: https://arxiv.org/pdf/2601.14004

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#LLM #MechanisticInterpretability #AI #AIAalignment #MachineLearning
InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning

📝 Summary:
Intervention Training improves large language model reasoning by enabling fine-grained credit assignment through targeted corrections that localize errors and enhance reinforcement learning performanc...

🔹 Publication Date: Published on Jan 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.14209
• PDF: https://arxiv.org/pdf/2601.14209

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
DSAEval: Evaluating Data Science Agents on a Wide Range of Real-World Data Science Problems

📝 Summary:
A comprehensive benchmark for evaluating LLM-based data agents across diverse data science tasks demonstrates superior performance for multimodal agents while highlighting persistent challenges in uns...

🔹 Publication Date: Published on Jan 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.13591
• PDF: https://arxiv.org/pdf/2601.13591

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
RemoteVAR: Autoregressive Visual Modeling for Remote Sensing Change Detection

📝 Summary:
RemoteVAR is a visual autoregressive framework for remote sensing change detection that improves upon existing methods through multi-resolution feature fusion and autoregressive training tailored for ...

🔹 Publication Date: Published on Jan 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.11898
• PDF: https://arxiv.org/pdf/2601.11898
• Github: https://github.com/yilmazkorkmaz1/RemoteVAR

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
DARC: Decoupled Asymmetric Reasoning Curriculum for LLM Evolution

📝 Summary:
DARC is a two-stage framework stabilizing LLM self-play by decoupling question generation and using asymmetric self-distillation. This mitigates instability and bootstrapping errors, significantly improving reasoning performance across benchmarks without human annotations.

🔹 Publication Date: Published on Jan 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.13761
• PDF: https://arxiv.org/pdf/2601.13761
• Github: https://github.com/RUCBM/DARC

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Towards Efficient and Robust Linguistic Emotion Diagnosis for Mental Health via Multi-Agent Instruction Refinement

📝 Summary:
APOLO framework uses automated prompt optimization through multi-agent collaboration to improve emotion diagnosis accuracy and robustness in mental healthcare applications. AI-generated summary Lingui...

🔹 Publication Date: Published on Jan 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.13481
• PDF: https://arxiv.org/pdf/2601.13481

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Agentic Reasoning for Large Language Models

📝 Summary:
Agentic reasoning redefines LLMs as autonomous agents that plan, act, and learn through continuous interaction in dynamic environments. This survey organizes agentic reasoning by environmental dynamics, from single-agent capabilities to multi-agent collaboration, bridging thought and action throu...

🔹 Publication Date: Published on Jan 18

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.12538
• PDF: https://arxiv.org/pdf/2601.12538
• Github: https://github.com/weitianxin/Awesome-Agentic-Reasoning

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning

📝 Summary:
Render-of-Thought framework converts textual reasoning steps into images using vision-language models to improve reasoning traceability and efficiency while maintaining competitive performance. AI-gen...

🔹 Publication Date: Published on Jan 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.14750
• PDF: https://arxiv.org/pdf/2601.14750

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Lost in the Prompt Order: Revealing the Limitations of Causal Attention in Language Models

📝 Summary:
Research reveals that causal attention in language models creates information bottlenecks when question-answer options follow context, leading to performance drops of over 14 percentage points compare...

🔹 Publication Date: Published on Jan 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.14152
• PDF: https://arxiv.org/pdf/2601.14152

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Paper2Rebuttal: A Multi-Agent Framework for Transparent Author Response Assistance

📝 Summary:
RebuttalAgent is a multi-agent framework that reframes rebuttal generation as an evidence-centric planning task, improving coverage, faithfulness, and strategic coherence in academic peer review. AI-g...

🔹 Publication Date: Published on Jan 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.14171
• PDF: https://arxiv.org/pdf/2601.14171
• Project Page: https://mqleet.github.io/Paper2Rebuttal_ProjectPage/
• Github: https://github.com/AutoLab-SAI-SJTU/Paper2Rebuttal

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
XR: Cross-Modal Agents for Composed Image Retrieval

📝 Summary:
A multi-agent framework for compositional image retrieval that uses specialized agents for generation, filtering, and verification to improve semantic and visual query matching. AI-generated summary R...

🔹 Publication Date: Published on Jan 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.14245
• PDF: https://arxiv.org/pdf/2601.14245
• Github: https://01yzzyu.github.io/xr.github.io/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Facilitating Proactive and Reactive Guidance for Decision Making on the Web: A Design Probe with WebSeek

📝 Summary:
WebSeek is a mixed-initiative browser extension that enables interactive web data extraction and analysis with AI-assisted guidance and automation. AI-generated summary Web AI agents such as ChatGPT A...

🔹 Publication Date: Published on Jan 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.15100
• PDF: https://arxiv.org/pdf/2601.15100

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Media is too big
VIEW IN TELEGRAM
Rethinking Video Generation Model for the Embodied World

📝 Summary:
A comprehensive robotics benchmark evaluates video generation models across multiple task domains and embodiments, revealing deficiencies in physical realism and introducing a large-scale dataset to a...

🔹 Publication Date: Published on Jan 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.15282
• PDF: https://arxiv.org/pdf/2601.15282
• Project Page: https://dagroup-pku.github.io/ReVidgen.github.io/
• Github: https://github.com/DAGroup-PKU/ReVidgen/

Datasets citing this paper:
https://huggingface.co/datasets/DAGroup-PKU/RBench
https://huggingface.co/datasets/DAGroup-PKU/RoVid-X

Spaces citing this paper:
https://huggingface.co/spaces/DAGroup-PKU/RBench-Leaderboard

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
FARE: Fast-Slow Agentic Robotic Exploration

📝 Summary:
FARE is a hierarchical exploration framework that combines large language model reasoning with reinforcement learning control to enable efficient autonomous robot navigation in complex environments. A...

🔹 Publication Date: Published on Jan 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.14681
• PDF: https://arxiv.org/pdf/2601.14681

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
RoboBrain 2.5: Depth in Sight, Time in Mind

📝 Summary:
RoboBrain 2.5 enhances embodied AI through improved 3D spatial reasoning and temporal value estimation for more precise manipulation tasks. AI-generated summary We introduce RoboBrain 2.5, a next-gene...

🔹 Publication Date: Published on Jan 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.14352
• PDF: https://arxiv.org/pdf/2601.14352
• Project Page: https://superrobobrain.github.io/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents

📝 Summary:
MMDeepResearch-Bench evaluates multimodal research agents on report generation with visual evidence, revealing trade-offs between prose quality, citation accuracy, and visual grounding. AI-generated s...

🔹 Publication Date: Published on Jan 18

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.12346
• PDF: https://arxiv.org/pdf/2601.12346
• Github: https://github.com/AIoT-MLSys-Lab/MMDeepResearch-Bench

Datasets citing this paper:
https://huggingface.co/datasets/MMDR-2025/MMdeepresearch

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
FinVault: Benchmarking Financial Agent Safety in Execution-Grounded Environments

📝 Summary:
FinVault presents the first execution-grounded security benchmark for financial agents, revealing significant vulnerabilities in current defense mechanisms when applied to real-world financial workflo...

🔹 Publication Date: Published on Jan 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.07853
• PDF: https://arxiv.org/pdf/2601.07853
• Github: https://github.com/aifinlab/FinVault

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
The Responsibility Vacuum: Organizational Failure in Scaled Agent Systems

📝 Summary:
Modern CI/CD pipelines integrating agent-generated code exhibit a structural failure in responsibility attribution. Decisions are executed through formally correct approval processes, yet no entity po...

🔹 Publication Date: Published on Jan 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.15059
• PDF: https://arxiv.org/pdf/2601.15059

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
AgentEHR: Advancing Autonomous Clinical Decision-Making via Retrospective Summarization

📝 Summary:
AgentEHR is a benchmark for autonomous EHR navigation involving complex clinical decision-making in raw data. The RetroSum framework addresses information loss and fractured reasoning through retrospective summarization and evolving experience strategies. RetroSum improves performance by up to 29...

🔹 Publication Date: Published on Jan 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.13918
• PDF: https://arxiv.org/pdf/2601.13918
• Github: https://github.com/BlueZeros/AgentEHR

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research