ML Research Hub
32.9K subscribers
5.22K photos
324 videos
24 files
5.64K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
🎯 Want to Upskill in IT? Try Our FREE 2026 Learning Kits!

SPOTO gives you free, instant access to high-quality, updated resources that help you study smarter and pass exams faster.
βœ… Latest Exam Materials:
Covering #Python, #Cisco, #PMI, #Fortinet, #AWS, #Azure, #AI, #Excel, #comptia, #ITIL, #cloud & more!
βœ… 100% Free, No Sign-up:
All materials are instantly downloadable

βœ… What’s Inside:
γƒ»πŸ“˜IT Certs E-book: https://bit.ly/3Mlu5ez
γƒ»πŸ“IT Exams Skill Test: https://bit.ly/3NVrgRU
γƒ»πŸŽ“Free IT courses: https://bit.ly/3M9h5su
γƒ»πŸ€–Free PMP Study Guide: https://bit.ly/4te3EIn
γƒ»β˜οΈFree Cloud Study Guide: https://bit.ly/4kgFVDs

πŸ‘‰ Become Part of Our IT Learning Circle! resources and support:
https://chat.whatsapp.com/FlG2rOYVySLEHLKXF3nKGB

πŸ’¬ Want exam help? Chat with an admin now!
wa.link/8fy3x4
✨Didactic to Constructive: Turning Expert Solutions into Learnable Reasoning

πŸ“ Summary:
DAIL improves LLM reasoning by converting didactic expert solutions into detailed, in-distribution traces via contrastive learning. This method achieves 10-25% performance gains and 2-4x reasoning efficiency using minimal expert data.

πŸ”Ή Publication Date: Published on Feb 2

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2602.02405
β€’ PDF: https://arxiv.org/pdf/2602.02405
β€’ Github: https://github.com/ethanm88/DAIL

✨ Datasets citing this paper:
β€’ https://huggingface.co/datasets/emendes3/e1-proof

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨Feedback by Design: Understanding and Overcoming User Feedback Barriers in Conversational Agents

πŸ“ Summary:
High-quality feedback is essential for effective human-AI interaction. It bridges knowledge gaps, corrects digressions, and shapes system behavior; both during interaction and throughout model develop...

πŸ”Ή Publication Date: Published on Feb 1

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2602.01405
β€’ PDF: https://arxiv.org/pdf/2602.01405

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨Scaling Small Agents Through Strategy Auctions

πŸ“ Summary:
Small language models fail on complex tasks. The paper proposes Strategy Auctions for Workload Efficiency SALE, a marketplace-inspired framework where agents bid strategic plans for task routing and self-improvement. SALE reduces costs by 35% and improves performance, enabling small agents to sca...

πŸ”Ή Publication Date: Published on Feb 2

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2602.02751
β€’ PDF: https://arxiv.org/pdf/2602.02751

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨MemoryLLM: Plug-n-Play Interpretable Feed-Forward Memory for Transformers

πŸ“ Summary:
MemoryLLM decouples feed-forward networks from self-attention in transformers, enabling context-free token-wise neural retrieval memory that improves inference efficiency through pre-computed lookups....

πŸ”Ή Publication Date: Published on Jan 30

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2602.00398
β€’ PDF: https://arxiv.org/pdf/2602.00398

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨Position: Agentic Evolution is the Path to Evolving LLMs

πŸ“ Summary:
Large language models struggle to adapt to changing real-world environments. Agentic evolution is proposed as a new approach where deployment-time improvement becomes a goal-directed optimization process. This allows for sustained, open-ended adaptation by scaling evolution.

πŸ”Ή Publication Date: Published on Jan 30

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2602.00359
β€’ PDF: https://arxiv.org/pdf/2602.00359
β€’ Github: https://github.com/ventr1c/agentic-evoluiton

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
❀1
✨MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

πŸ“ Summary:
MiniCPM-V 4.5 is an 8B multimodal LLM achieving high performance and efficiency. It uses a unified 3D-Resampler, unified learning, and hybrid reinforcement learning. It surpasses larger models like GPT-4o and Qwen2.5-VL with significantly less memory and faster inference.

πŸ”Ή Publication Date: Published on Sep 16, 2025

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2509.18154
β€’ PDF: https://arxiv.org/pdf/2509.18154
β€’ Github: https://github.com/OpenBMB/MiniCPM-V

πŸ”Ή Models citing this paper:
β€’ https://huggingface.co/openbmb/MiniCPM-V-4_5
β€’ https://huggingface.co/openbmb/MiniCPM-V-4_5-gguf
β€’ https://huggingface.co/openbmb/MiniCPM-V-4_5-AWQ

✨ Datasets citing this paper:
β€’ https://huggingface.co/datasets/openbmb/RLAIF-V-Dataset

✨ Spaces citing this paper:
β€’ https://huggingface.co/spaces/CGQN/MiniCPM-V-4_5-int4-CPU-0
β€’ https://huggingface.co/spaces/CGQN/MiniCPM-V-4_5-from_gpt5
β€’ https://huggingface.co/spaces/CGQN/MiniCPM-V-4_5

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨Privasis: Synthesizing the Largest "Public" Private Dataset from Scratch

πŸ“ Summary:
Privasis is a new million-scale synthetic dataset for AI privacy research. It addresses data scarcity, enabling compact sanitization models that outperform large language models like GPT-5. The diverse dataset and models will be released to the public.

πŸ”Ή Publication Date: Published on Feb 3

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2602.03183
β€’ PDF: https://arxiv.org/pdf/2602.03183
β€’ Project Page: https://privasis.github.io
β€’ Github: https://github.com/skywalker023/privasis

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨FIRE-Bench: Evaluating Agents on the Rediscovery of Scientific Insights

πŸ“ Summary:
FIRE-Bench evaluates AI agents on rediscovering scientific findings through full research cycles, from hypothesis to conclusions. Agents receive a high-level question and act autonomously. Current agents struggle, showing that reliable AI-driven scientific discovery remains challenging.

πŸ”Ή Publication Date: Published on Feb 2

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2602.02905
β€’ PDF: https://arxiv.org/pdf/2602.02905
β€’ Project Page: https://firebench.github.io/

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

πŸ“ Summary:
UI-TARS-2 is a native GUI agent model that tackles challenges in data scalability and multi-turn reinforcement learning. It significantly improves over its predecessor and strong baselines on GUI and game benchmarks, demonstrating robust generalization. This advances GUI agents for real-world int...

πŸ”Ή Publication Date: Published on Sep 2, 2025

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2509.02544
β€’ PDF: https://arxiv.org/pdf/2509.02544
β€’ Project Page: https://seed-tars.com/showcase/ui-tars-2/
β€’ Github: https://github.com/bytedance/ui-tars

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
This media is not supported in your browser
VIEW IN TELEGRAM
✨SoMA: A Real-to-Sim Neural Simulator for Robotic Soft-body Manipulation

πŸ“ Summary:
SoMA is a 3D Gaussian Splat simulator that enables stable, long-horizon manipulation of soft bodies by coupling deformable dynamics, environmental forces, and robot actions in a unified latent neural ...

πŸ”Ή Publication Date: Published on Feb 2

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2602.02402
β€’ PDF: https://arxiv.org/pdf/2602.02402
β€’ Project Page: https://huggingface.co/collections/SuemH/project-page
β€’ Github: https://city-super.github.io/SoMA/

πŸ”Ή Models citing this paper:
β€’ https://huggingface.co/SuemH/SoMA

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces

πŸ“ Summary:
Agentic RAG framework enables models to dynamically adapt retrieval decisions across multiple granularities, outperforming traditional approaches while scaling efficiently with model improvements. AI-...

πŸ”Ή Publication Date: Published on Feb 3

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2602.03442
β€’ PDF: https://arxiv.org/pdf/2602.03442

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨Self-Hinting Language Models Enhance Reinforcement Learning

πŸ“ Summary:
SAGE is an on-policy reinforcement learning framework that enhances GRPO by injecting self-hints during training to increase outcome diversity under sparse rewards, improving alignment of large langua...

πŸ”Ή Publication Date: Published on Feb 3

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2602.03143
β€’ PDF: https://arxiv.org/pdf/2602.03143
β€’ Github: https://github.com/BaohaoLiao/SAGE

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨Context Learning for Multi-Agent Discussion

πŸ“ Summary:
Multi-Agent Discussion methods suffer from inconsistency due to individual context misalignment, which is addressed through a context learning approach that dynamically generates context instructions ...

πŸ”Ή Publication Date: Published on Feb 2

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2602.02350
β€’ PDF: https://arxiv.org/pdf/2602.02350

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨A2Eval: Agentic and Automated Evaluation for Embodied Brain

πŸ“ Summary:
Agentic automatic evaluation framework automates embodied vision-language model assessment through collaborative agents that reduce evaluation costs and improve ranking accuracy. AI-generated summary ...

πŸ”Ή Publication Date: Published on Feb 2

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2602.01640
β€’ PDF: https://arxiv.org/pdf/2602.01640

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨UI-TARS: Pioneering Automated GUI Interaction with Native Agents

πŸ“ Summary:
UI-TARS, a native GUI agent model using screenshots as input, outperforms commercial models in various benchmarks through enhanced perception, unified action modeling, system-2 reasoning, and iterativ...

πŸ”Ή Publication Date: Published on Jan 21, 2025

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2501.12326
β€’ PDF: https://arxiv.org/pdf/2501.12326
β€’ Github: https://github.com/bytedance/UI-TARS

πŸ”Ή Models citing this paper:
β€’ https://huggingface.co/ByteDance-Seed/UI-TARS-1.5-7B
β€’ https://huggingface.co/ByteDance-Seed/UI-TARS-7B-DPO
β€’ https://huggingface.co/ByteDance-Seed/UI-TARS-7B-SFT

✨ Datasets citing this paper:
β€’ https://huggingface.co/datasets/Hcompany/WebClick

✨ Spaces citing this paper:
β€’ https://huggingface.co/spaces/omar0scarf/ui-tars-api
β€’ https://huggingface.co/spaces/bytedance-research/UI-TARS
β€’ https://huggingface.co/spaces/Aheader/gui_test_app

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization

πŸ“ Summary:
Quant VideoGen addresses KV cache memory limitations in autoregressive video diffusion models through semantic-aware smoothing and progressive residual quantization, achieving significant memory reduc...

πŸ”Ή Publication Date: Published on Feb 3

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2602.02958
β€’ PDF: https://arxiv.org/pdf/2602.02958

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨EgoActor: Grounding Task Planning into Spatial-aware Egocentric Actions for Humanoid Robots via Visual-Language Models

πŸ“ Summary:
EgoActor is a unified vision-language model that translates high-level instructions into precise humanoid robot actions through integrated perception and execution across simulated and real-world envi...

πŸ”Ή Publication Date: Published on Feb 4

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2602.04515
β€’ PDF: https://arxiv.org/pdf/2602.04515
β€’ Github: https://baai-agents.github.io/EgoActor/

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨PaperSearchQA: Learning to Search and Reason over Scientific Papers with RLVR

πŸ“ Summary:
Search agents trained on scientific paper corpora demonstrate advanced reasoning capabilities for technical question-answering tasks, outperforming traditional retrieval methods through reinforcement ...

πŸ”Ή Publication Date: Published on Jan 26

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2601.18207
β€’ PDF: https://arxiv.org/pdf/2601.18207
β€’ Project Page: https://jmhb0.github.io/PaperSearchQA/
β€’ Github: https://jmhb0.github.io/PaperSearchQA/

✨ Datasets citing this paper:
β€’ https://huggingface.co/datasets/jmhb/PaperSearchQA
β€’ https://huggingface.co/datasets/jmhb/pubmed_bioasq_2022
β€’ https://huggingface.co/datasets/jmhb/bioasq_factoid

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research