ML Research Hub
32.9K subscribers
5.48K photos
348 videos
24 files
5.93K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning

📝 Summary:
ARLArena framework analyzes training stability in agentic reinforcement learning and proposes SAMPO method for stable policy optimization across diverse tasks. AI-generated summary Agentic reinforceme...

🔹 Publication Date: Published on Feb 25

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.21534
• PDF: https://arxiv.org/pdf/2602.21534
• Github: https://github.com/WillDreamer/ARL-Arena

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
SkyReels-V4: Multi-modal Video-Audio Generation, Inpainting and Editing model

📝 Summary:
SkyReels V4 is a unified multimodal video foundation model that generates, edits, and inpaints video and audio simultaneously using a dual-stream architecture with shared text encoding and efficient h...

🔹 Publication Date: Published on Feb 25

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.21818
• PDF: https://arxiv.org/pdf/2602.21818

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
World Guidance: World Modeling in Condition Space for Action Generation

📝 Summary:
World Guidance framework enhances Vision-Language-Action models by mapping future observations into compact conditions for improved action generation and generalization. AI-generated summary Leveragin...

🔹 Publication Date: Published on Feb 25

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.22010
• PDF: https://arxiv.org/pdf/2602.22010
• Project Page: https://selen-suyue.github.io/WoGNet/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
JAEGER: Joint 3D Audio-Visual Grounding and Reasoning in Simulated Physical Environments

📝 Summary:
JAEGER extends audio-visual large language models to 3D space by integrating RGB-D observations and multi-channel audio to improve spatial reasoning and source localization. AI-generated summary Curre...

🔹 Publication Date: Published on Feb 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.18527
• PDF: https://arxiv.org/pdf/2602.18527

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Model Context Protocol (MCP) Tool Descriptions Are Smelly! Towards Improving AI Agent Efficiency with Augmented MCP Tool Descriptions

📝 Summary:
Foundation model agents rely on natural language tool descriptions for effective interaction with external systems, but poor description quality significantly impacts performance and efficiency. AI-ge...

🔹 Publication Date: Published on Feb 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14878
• PDF: https://arxiv.org/pdf/2602.14878

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
UniVBench: Towards Unified Evaluation for Video Foundation Models

📝 Summary:
UniVBench introduces a comprehensive benchmark for evaluating video foundation models across multiple capabilities including understanding, generation, editing, and reconstruction using high-quality, ...

🔹 Publication Date: Published on Feb 25

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.21835
• PDF: https://arxiv.org/pdf/2602.21835
• Github: https://github.com/JianhuiWei7/UniVBench

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
The Design Space of Tri-Modal Masked Diffusion Models

📝 Summary:
A large-scale study of tri-modal discrete diffusion models demonstrates improved performance across text, image, and speech generation tasks through systematic analysis of scaling laws and optimized i...

🔹 Publication Date: Published on Feb 25

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.21472
• PDF: https://arxiv.org/pdf/2602.21472

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
This media is not supported in your browser
VIEW IN TELEGRAM
Solaris: Building a Multiplayer Video World Model in Minecraft

📝 Summary:
Solaris is a multiplayer video world model that simulates consistent multi-view observations through a novel data collection system and staged training approach. AI-generated summary Existing action-c...

🔹 Publication Date: Published on Feb 25

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.22208
• PDF: https://arxiv.org/pdf/2602.22208
• Project Page: https://solaris-wm.github.io/
• Github: https://github.com/solaris-wm/solaris

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
This media is not supported in your browser
VIEW IN TELEGRAM
DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation

📝 Summary:
DreamID-Omni is a unified framework for controllable human-centric audio-video generation that uses a symmetric conditional diffusion transformer with dual-level disentanglement and multi-task progres...

🔹 Publication Date: Published on Feb 12

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.12160
• PDF: https://arxiv.org/pdf/2602.12160
• Project Page: https://guoxu1233.github.io/DreamID-Omni/
• Github: https://github.com/Guoxu1233/DreamID-Omni

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

📝 Summary:
GUI-Libra addresses limitations in open-source GUI agents through specialized training methods that improve reasoning-grounding alignment and reinforcement learning under partial verifiability, demons...

🔹 Publication Date: Published on Feb 25

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.22190
• PDF: https://arxiv.org/pdf/2602.22190
• Project Page: https://gui-libra.github.io
• Github: https://github.com/GUI-Libra/GUI-Libra

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
NoLan: Mitigating Object Hallucinations in Large Vision-Language Models via Dynamic Suppression of Language Priors

📝 Summary:
Object hallucinations in LVLMs are primarily caused by language decoder priors, leading to the development of a training-free framework that suppresses these priors to reduce hallucinations. AI-genera...

🔹 Publication Date: Published on Feb 25

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.22144
• PDF: https://arxiv.org/pdf/2602.22144
• Github: https://github.com/lingfengren/NoLan

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
MoBind: Motion Binding for Fine-Grained IMU-Video Pose Alignment

📝 Summary:
MoBind learns joint representations between IMU signals and 2D pose sequences through hierarchical contrastive learning to achieve cross-modal retrieval, temporal synchronization, and action recogniti...

🔹 Publication Date: Published on Feb 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.19004
• PDF: https://arxiv.org/pdf/2602.19004
• Github: https://github.com/bbvisual/MoBind

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
NanoKnow: How to Know What Your Language Model Knows

📝 Summary:
NanoKnow is a benchmark using open pre-training data to analyze how LLMs acquire knowledge. It shows accuracy relies on pre-training frequency, which external evidence can mitigate, and that parametric and external knowledge are complementary, but irrelevant data is harmful.

🔹 Publication Date: Published on Feb 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.20122
• PDF: https://arxiv.org/pdf/2602.20122
• Github: https://github.com/castorini/NanoKnow/tree/main

Datasets citing this paper:
https://huggingface.co/datasets/LingweiGu/NanoKnow-Fineweb-Edu-Index
https://huggingface.co/datasets/LingweiGu/NanoKnow_Benchmark

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Image Generation with a Sphere Encoder

📝 Summary:
The Sphere Encoder is an efficient generative model that maps images to a spherical latent space. It produces high-quality images in a single pass, matching diffusion models at a fraction of the inference cost.

🔹 Publication Date: Published on Feb 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.15030
• PDF: https://arxiv.org/pdf/2602.15030
• Project Page: https://sphere-encoder.github.io

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1
This media is not supported in your browser
VIEW IN TELEGRAM
SeaCache: Spectral-Evolution-Aware Cache for Accelerating Diffusion Models

📝 Summary:
Spectral-Evolution-Aware Cache (SeaCache) improves diffusion model inference speed by using spectrally aligned representations to optimize intermediate output reuse, achieving better latency-quality t...

🔹 Publication Date: Published on Feb 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.18993
• PDF: https://arxiv.org/pdf/2602.18993
• Project Page: https://jiwoogit.github.io/SeaCache/
• Github: https://github.com/jiwoogit/SeaCache

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
VecGlypher: Unified Vector Glyph Generation with Language Models

📝 Summary:
VecGlypher is a multimodal language model that generates high-fidelity vector glyphs directly from text or images by emitting SVG path tokens. This bypasses raster processes, creating editable outlines in one pass. It outperforms prior methods, simplifying font design.

🔹 Publication Date: Published on Feb 25

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.21461
• PDF: https://arxiv.org/pdf/2602.21461
• Project Page: https://xk-huang.github.io/VecGlypher/
• Github: https://github.com/xk-huang/VecGlypher

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#VectorGraphics #LLM #FontDesign #GenerativeAI #AI
Limited Time Offer: Premium Q1 & Q2 Publications at Just $300!
🎓 Exclusive February Sale - Ending Soon!
Are you looking to boost your academic profile with high-impact publications? We're offering an exceptional opportunity you don't want to miss!
What We Offer:
Q1 & Q2 Journal Articles - Top-tier, indexed publications
Unbeatable Price: Only $300 per article
Limited Time: Offer valid until the end of February 2026
Why Choose Our Service?

Fast publication process
Reputable Q1 & Q2 journals
Expert support throughout
Guaranteed acceptance

Contact: @Omidyzd62
1
Functional Continuous Decomposition

📝 Summary:
Functional Continuous Decomposition FCD is a new framework for parametric, continuous optimization of time-series data. It extracts M modes capturing local and global patterns, improving feature extraction. FCD features enhance machine learning models, leading to faster convergence and higher acc...

🔹 Publication Date: Published on Feb 24

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.20857
• PDF: https://arxiv.org/pdf/2602.20857
• Project Page: https://arxiv.org/abs/2602.20857
• Github: https://github.com/Tima-a/fcd

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#FCD #TimeSeries #Optimization #FeatureExtraction #MachineLearning
MolHIT: Advancing Molecular-Graph Generation with Hierarchical Discrete Diffusion Models

📝 Summary:
MolHIT presents a hierarchical discrete diffusion model for molecular graph generation. It achieves state-of-the-art performance with near-perfect chemical validity and strong property-guided synthesis, surpassing existing methods.

🔹 Publication Date: Published on Feb 19

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.17602
• PDF: https://arxiv.org/pdf/2602.17602

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#MolHIT #MolecularGraphs #DiffusionModels #DrugDiscovery #Cheminformatics
DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference

📝 Summary:
DualPath addresses KV-cache I/O bottlenecks in LLM inference with dual-path loading. It loads KV-cache into decode engines, transfers it to prefill engines, and dynamically balances load to boost throughput up to 1.96 times.

🔹 Publication Date: Published on Feb 25

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.21548
• PDF: https://arxiv.org/pdf/2602.21548

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#LLM #AI #MachineLearning #PerformanceOptimization #SystemDesign
Yor-Sarc: A gold-standard dataset for sarcasm detection in a low-resource African language

📝 Summary:
Yor-Sarc introduces the first gold-standard dataset for sarcasm detection in Yorùbá, a low-resource African language. It offers 436 expertly annotated instances with high inter-annotator agreement and soft labels, designed to advance NLP for African languages.

🔹 Publication Date: Published on Feb 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.18964
• PDF: https://arxiv.org/pdf/2602.18964
• Project Page: https://arxiv.org/abs/2602.18964
• Github: https://github.com/toheebadura/yor-sarc

Datasets citing this paper:
https://huggingface.co/datasets/toheebadura/yor-sarc

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#NLP #SarcasmDetection #Yoruba #LowResourceLanguages #AfricanLanguages
1