ML Research Hub
32.6K subscribers
5.73K photos
365 videos
24 files
6.19K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
PIRA-Bench: A Transition from Reactive GUI Agents to GUI-based Proactive Intent Recommendation Agents

📝 Summary:
PIRA-Bench presents a benchmark for evaluating multimodal large language models on proactive GUI agent tasks using continuous visual inputs, while PIRF offers a memory-aware framework for handling com...

🔹 Publication Date: Published on Mar 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.08013
• PDF: https://arxiv.org/pdf/2603.08013
• Project Page: https://www.pira-bench.top

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
PureCC: Pure Learning for Text-to-Image Concept Customization

📝 Summary:
PureCC presents a concept customization method that preserves original model behavior through decoupled learning and adaptive guidance scaling. Existing concept customization meth...

🔹 Publication Date: Published on Mar 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.07561
• PDF: https://arxiv.org/pdf/2603.07561
• Github: https://github.com/lzc-sg/PureCC

==================================

From Narrow to Panoramic Vision: Attention-Guided Cold-Start Reshapes Multimodal Reasoning

📝 Summary:
The study introduces a novel attention-based metric called Visual Attention Score to analyze cold-start initialization in multimodal large reasoning models, identifying a counter-intuitive phenomenon ...

🔹 Publication Date: Published on Mar 4

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.03825
• PDF: https://arxiv.org/pdf/2603.03825
• Github: https://github.com/lrlbbzl/Qwen-AVAR

==================================

Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence

📝 Summary:
Holi-Spatial presents the first fully automated, large-scale, spatially-aware multimodal dataset constructed from raw video inputs, supporting multi-level spatial supervision for 3D scene understandin...

🔹 Publication Date: Published on Mar 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.07660
• PDF: https://arxiv.org/pdf/2603.07660
• Project Page: https://visionary-laboratory.github.io/holi-spatial/
• Github: https://github.com/Visionary-Laboratory/holi-spatial

==================================

$OneMillion-Bench: How Far are Language Agents from Human Experts?

📝 Summary:
A new benchmark evaluates language models on complex, real-world professional tasks requiring multi-step reasoning, evidence resolution, and domain-specific decision-making across multiple industries....

🔹 Publication Date: Published on Mar 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.07980
• PDF: https://arxiv.org/pdf/2603.07980
• Github: https://github.com/humanlaya/OneMillion-Bench

==================================

Believe Your Model: Distribution-Guided Confidence Calibration

📝 Summary:
Large reasoning models enhance prediction accuracy through test-time scaling techniques that generate multiple candidate responses, with the proposed DistriVoting method utilizing distributional prior...

🔹 Publication Date: Published on Mar 4

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.03872
• PDF: https://arxiv.org/pdf/2603.03872
• Github: https://github.com/yxizhong/SSC

==================================

Scale Space Diffusion

📝 Summary:
Scale-space theory connects diffusion models' information hierarchy to low-pass filtering, leading to a framework that combines scale spaces with diffusion processes for efficient image processing.

🔹 Publication Date: Published on Mar 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.08709
• PDF: https://arxiv.org/pdf/2603.08709
• Project Page: https://prateksha.github.io/projects/scale-space-diffusion/
• Github: https://github.com/prateksha/ScaleSpaceDiffusion

==================================

FVG-PT: Adaptive Foreground View-Guided Prompt Tuning for Vision-Language Models

📝 Summary:
Foreground attention shifts during CLIP-based prompt tuning are addressed through an adaptive module that enhances foreground view quality and mitigates generalization degradation.

🔹 Publication Date: Published on Mar 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.08708
• PDF: https://arxiv.org/pdf/2603.08708
• Github: https://github.com/JREion/FVG-PT

🔹 Datasets citing this paper:
https://huggingface.co/datasets/JREion/Prompt_Tuning_Datasets_with_Foreground

==================================

Skip to the Good Part: Representation Structure & Inference-Time Layer Skipping in Diffusion vs. Autoregressive LLMs

📝 Summary:
Diffusion language models exhibit distinct representational structures compared to autoregressive models, with hierarchical abstractions and reduced bias, enabling efficient layer-skipping inference w...

🔹 Publication Date: Published on Mar 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.07475
• PDF: https://arxiv.org/pdf/2603.07475

==================================

Scaling Agentic Capabilities, Not Context: Efficient Reinforcement Finetuning for Large Toolspaces

📝 Summary:
ATLAS enables small language models to effectively operate in large-scale tool environments through reinforcement fine-tuning that learns context control and execution structure, achieving performance...

🔹 Publication Date: Published on Mar 5

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.06713
• PDF: https://arxiv.org/pdf/2603.06713

==================================

Agentic Critical Training

📝 Summary:
Agentic Critical Training (ACT) is a reinforcement learning approach that trains language model agents to autonomously reason about action quality by directly rewarding correct judgment between altern...

🔹 Publication Date: Published on Mar 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.08706
• PDF: https://arxiv.org/pdf/2603.08706
• Project Page: https://attention-is-all-i-need.github.io/ACT/

==================================

OfficeQA Pro: An Enterprise Benchmark for End-to-End Grounded Reasoning

📝 Summary:
OfficeQA Pro evaluates AI agents on multi-document reasoning across historical financial documents, revealing persistent challenges in grounded reasoning despite advanced model capabilities.

🔹 Publication Date: Published on Mar 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.08655
• PDF: https://arxiv.org/pdf/2603.08655
• Github: https://github.com/databricks/officeqa

==================================

HiAR: Efficient Autoregressive Long Video Generation via Hierarchical Denoising

📝 Summary:
HiAR, a hierarchical autoregressive diffusion framework, improves video generation by conditioning on context at the same noise level and employs forward-KL regularization to maintain temporal continu...

🔹 Publication Date: Published on Mar 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.08703
• PDF: https://arxiv.org/pdf/2603.08703
• Project Page: https://jacky-hate.github.io/HiAR/
• Github: https://jacky-hate.github.io/HiAR/

🔹 Models citing this paper:
https://huggingface.co/jackyhate/HiAR

==================================

NaviDriveVLM: Decoupling High-Level Reasoning and Motion Planning for Autonomous Driving

📝 Summary:
NaviDriveVLM presents a decoupled vision-language model framework for autonomous driving that separates high-level reasoning from motion planning, achieving superior performance in end-to-end driving ...

🔹 Publication Date: Published on Mar 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.07901
• PDF: https://arxiv.org/pdf/2603.07901
• Github: https://github.com/TAMU-CVRL/NaviDrive

==================================

CARE-Edit: Condition-Aware Routing of Experts for Contextual Image Editing

📝 Summary:
CARE-Edit introduces a condition-aware routing mechanism that dynamically allocates diffusion model computation to specialized experts for improved contextual image editing tasks.

🔹 Publication Date: Published on Mar 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.08589
• PDF: https://arxiv.org/pdf/2603.08589
• Project Page: https://care-edit.github.io/
• Github: https://care-edit.github.io/

==================================

Spatiotemporal Heterogeneity of AI-Driven Traffic Flow Patterns and Land Use Interaction: A GeoAI-Based Analysis of Multimodal Urban Mobility

📝 Summary:
A GeoAI Hybrid framework combining MGWR, RF, and ST-GCN models effectively captures complex traffic flow patterns and land use interactions across multiple mobility modes with superior predictive perf...

🔹 Publication Date: Published on Mar 5

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.05581
• PDF: https://arxiv.org/pdf/2603.05581

==================================

AutoResearch-RL: Perpetual Self-Evaluating Reinforcement Learning Agents for Autonomous Neural Architecture Discovery

📝 Summary:
An autonomous reinforcement learning framework conducts continuous neural architecture and hyperparameter research without human intervention, achieving performance comparable to hand-tuned baselines ...

🔹 Publication Date: Published on Mar 7

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.07300
• PDF: https://arxiv.org/pdf/2603.07300

==================================

Autophoresis of a Janus particle near a planar wall: a lubrication limit

📝 Summary:
We study the self-diffusiophoresis of a spherical chemically active particle near a planar, impermeable wall, with a focus on the influence of particle orientation on propulsion. We analyze a Janus pa...

🔹 Publication Date: Published on Feb 28

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.00791
• PDF: https://arxiv.org/pdf/2603.00791

==================================

LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory

📝 Summary:
LoGeR enables long-term 3D video reconstruction by combining bidirectional priors with a hybrid memory system that includes parametric Test-Time Training and non-parametric sliding window attention me...

🔹 Publication Date: Published on Mar 3

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.03269
• PDF: https://arxiv.org/pdf/2603.03269
• Project Page: https://loger-project.github.io/
• Github: https://github.com/Junyi42/LoGeR

🔹 Models citing this paper:
https://huggingface.co/Junyi42/LoGeR

==================================

Forwarded from Code With Python
This channel is for programmers, coders, and software engineers.

0️⃣ Python
1️⃣ Data Science
2️⃣ Machine Learning
3️⃣ Data Visualization
4️⃣ Artificial Intelligence
5️⃣ Data Analysis
6️⃣ Statistics
7️⃣ Deep Learning
8️⃣ Programming Languages

https://t.iss.one/addlist/8_rRW2scgfRhOTc0

https://t.iss.one/Codeprogrammer

==================================

How Far Can Unsupervised RLVR Scale LLM Training?

📝 Summary:
Intrinsic Unsupervised RL with Verifiable Rewards (URLVR) for LLMs faces fundamental scaling limits. It fails due to confidence-correction misalignment, which leads to collapse. External reward methods show promise for overcoming these barriers.

🔹 Publication Date: Published on Mar 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.08660
• PDF: https://arxiv.org/pdf/2603.08660
• Github: https://github.com/PRIME-RL/TTRL/tree/urlvr-dev

==================================
