ML Research Hub
32.3K subscribers
6.73K photos
472 videos
24 files
7.34K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
NTIRE 2026 Challenge on Video Saliency Prediction: Methods and Results

📝 Summary:
This paper overviews the NTIRE 2026 Challenge on Video Saliency Prediction. Participants developed automatic saliency map prediction for videos using a novel 2,000-video dataset with crowdsourced fixations. Over 20 teams submitted, and all challenge data is now publicly available.

🔹 Publication Date: Published on Apr 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.14816
• PDF: https://arxiv.org/pdf/2604.14816
• Project Page: https://www.codabench.org/competitions/12842/
• Github: https://github.com/msu-video-group/NTIRE26_Saliency_Prediction

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#VideoSaliency #ComputerVision #NTIRE #MachineLearning #SaliencyPrediction
TIPSv2: Advancing Vision-Language Pretraining with Enhanced Patch-Text Alignment

📝 Summary:
Enhanced vision-language models achieve superior dense patch-text alignment through improved pretraining techniques including patch-level distillation, modified masked image objectives, and optimized ...

🔹 Publication Date: Published on Apr 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.12012
• PDF: https://arxiv.org/pdf/2604.12012
• Project Page: https://gdm-tipsv2.github.io/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
(1D) Ordered Tokens Enable Efficient Test-Time Search

📝 Summary:
This paper demonstrates that 1D ordered, coarse-to-fine token structures enhance test-time search in autoregressive models. These tokens allow better verifier evaluation of intermediate states, improving scaling and enabling training-free text-to-image generation through pure test-time search. To...

🔹 Publication Date: Published on Apr 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.15453
• PDF: https://arxiv.org/pdf/2604.15453
• Project Page: https://soto.epfl.ch/
• Github: https://github.com/EPFL-VILAB/search-over-tokens

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1
TwinTrack: Post-hoc Multi-Rater Calibration for Medical Image Segmentation

📝 Summary:
TwinTrack framework addresses pancreatic cancer segmentation ambiguity through post-hoc calibration of ensemble probabilities to empirical mean human response, improving calibration metrics on multi-r...

🔹 Publication Date: Published on Apr 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.15950
• PDF: https://arxiv.org/pdf/2604.15950

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
EdgeDetect: Importance-Aware Gradient Compression with Homomorphic Aggregation for Federated Intrusion Detection

📝 Summary:
EdgeDetect enables efficient and secure federated intrusion detection for 6G-IoT environments through gradient binarization and homomorphic encryption, achieving high accuracy with reduced communicati...

🔹 Publication Date: Published on Apr 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.14663v1
• PDF: https://arxiv.org/pdf/2604.14663

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Elucidating the SNR-t Bias of Diffusion Probabilistic Models

📝 Summary:
Diffusion models suffer from an SNR-timestep bias during inference, impairing generation quality. A differential correction method is proposed that processes frequency components separately. This significantly improves generation quality across various models with minimal computational cost.

🔹 Publication Date: Published on Apr 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.16044
• PDF: https://arxiv.org/pdf/2604.16044
• Github: https://github.com/AMAP-ML/DCW

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Can Large Language Models Reinvent Foundational Algorithms?

📝 Summary:
Large language models can reinvent foundational computer science algorithms through an unlearning and reinvention process, with performance varying based on hint levels and reinforced learning techniq...

🔹 Publication Date: Published on Apr 7

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.05716
• PDF: https://arxiv.org/pdf/2604.05716
• Project Page: https://huggingface.co/spaces/jzhao1122/qwen3-thinking-dijkstra
• Github: https://github.com/Algo-Reinvention/algo-reinvention

🔹 Models citing this paper:
https://huggingface.co/algo-reinvention/Qwen3-4B-Thinking-2507-Dijkstra-Unlearn
https://huggingface.co/algo-reinvention/Qwen3-4B-Thinking-2507-Strassen-Unlearn

Spaces citing this paper:
https://huggingface.co/spaces/jzhao1122/qwen3-thinking-dijkstra

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
QuantCode-Bench: A Benchmark for Evaluating the Ability of Large Language Models to Generate Executable Algorithmic Trading Strategies

📝 Summary:
QuantCode-Bench evaluates large language models on generating executable trading strategies by testing their ability to translate natural language descriptions into functional code that operates corre...

🔹 Publication Date: Published on Apr 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.15151
• PDF: https://arxiv.org/pdf/2604.15151
• Project Page: https://limexailab.github.io/QuantCode-Bench/
• Github: https://github.com/LimexAILab/QuantCode-Bench

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off

📝 Summary:
A novel reinforcement learning approach for large language models that addresses the exploration-exploitation trade-off through perplexity-based sample partitioning and bidirectional reward allocation...

🔹 Publication Date: Published on Apr 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.13902
• PDF: https://arxiv.org/pdf/2604.13902

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Media is too big
VIEW IN TELEGRAM
Hierarchical Codec Diffusion for Video-to-Speech Generation

📝 Summary:
HiCoDiT generates speech from videos by leveraging the hierarchical structure of discrete speech tokens, achieving better audio-visual alignment through coarse-to-fine conditioning with dual-scale nor...

🔹 Publication Date: Published on Apr 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.15923
• PDF: https://arxiv.org/pdf/2604.15923

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#VideoToSpeech #DiffusionModels #GenerativeAI #SpeechSynthesis #DeepLearning
Where does output diversity collapse in post-training?

📝 Summary:
Output diversity collapse in post-trained language models is primarily driven by training data composition rather than generation format, with different post-training methods affecting diversity diffe...

🔹 Publication Date: Published on Apr 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.16027
• PDF: https://arxiv.org/pdf/2604.16027
• Github: https://github.com/ckarouzos/where-diversity-collapses

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
RoboLab: A High-Fidelity Simulation Benchmark for Analysis of Task Generalist Policies

📝 Summary:
RoboLab is a simulation benchmarking framework that addresses limitations in robot policy evaluation by enabling scalable, realistic task generation and systematic analysis of policy behavior under co...

🔹 Publication Date: Published on Apr 14

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.09860
• PDF: https://arxiv.org/pdf/2604.09860
• Project Page: https://research.nvidia.com/labs/srl/projects/robolab/
• Github: https://github.com/NVLabs/RoboLab

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
The Amazing Agent Race: Strong Tool Users, Weak Navigators

📝 Summary:
The Amazing Agent Race benchmark introduces DAG-based puzzles to evaluate LLM agents' navigation and tool-use capabilities beyond traditional linear benchmarks, revealing that navigation errors domina...

🔹 Publication Date: Published on Apr 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.10261
• PDF: https://arxiv.org/pdf/2604.10261
• Project Page: https://minnesotanlp.github.io/the-amazing-agent-race/
• Github: https://github.com/minnesotanlp/the-amazing-agent-race

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1
Universal statistical signatures of evolution in artificial intelligence architectures

📝 Summary:
The study finds that artificial intelligence architectural evolution follows the same statistical patterns as biological evolution, including similar fitness effect distributions and convergence dynam...

🔹 Publication Date: Published on Apr 12

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.10571
• PDF: https://arxiv.org/pdf/2604.10571
• Github: https://github.com/mool32/ai-evolution-universal-signatures

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Motif-Video 2B: Technical Report

📝 Summary:
Motif-Video 2B achieves high text-to-video quality with a specialized architecture and efficient training methods. It uses shared cross-attention and a three-part backbone to outperform larger models using significantly fewer parameters and less data.

🔹 Publication Date: Published on Apr 14

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.16503
• PDF: https://arxiv.org/pdf/2604.16503
• Project Page: https://motiftech.io/videoshowcase

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Stratagem: Learning Transferable Reasoning via Trajectory-Modulated Game Self-Play

📝 Summary:
STRATAGEM addresses limitations in reasoning transfer for language models by using a reasoning transferability coefficient and evolution reward to promote abstract, domain-agnostic patterns over game-...

🔹 Publication Date: Published on Apr 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.17696
• PDF: https://arxiv.org/pdf/2604.17696
• Github: https://github.com/ydyyyy/Stratagem

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
The Geometric Canary: Predicting Steerability and Detecting Drift via Representational Stability

📝 Summary:
Geometric stability measures predict language model controllability and detect structural degradation, with supervised variants excelling at steering prediction and unsupervised variants at drift dete...

🔹 Publication Date: Published on Apr 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.17698
• PDF: https://arxiv.org/pdf/2604.17698
• Github: https://github.com/prashantcraju/geometric-canary

🔹 Models citing this paper:
https://huggingface.co/pcr2120/shesha-geometry

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Geometric coherence of single-cell CRISPR perturbations reveals regulatory architecture and predicts cellular stress

📝 Summary:
G e n o m e e n g i n e e r i n g h a s a c h i e v e d r e m a r k a b l e s e q u e n c e - l e v e l p r e c i s i o n , y e t p r e d i c t i n g t h e t r a n s c r i p t o m i c s t a t e t h a ...

🔹 Publication Date: Published on Apr 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.16642
• PDF: https://arxiv.org/pdf/2604.16642
• Github: https://github.com/prashantcraju/geometric-stability-crispr

🔹 Models citing this paper:
https://huggingface.co/pcr2120/shesha-geometry

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1
Revisiting a Pain in the Neck: A Semantic Reasoning Benchmark for Language Models

📝 Summary:
SemanticQA is a new benchmark to evaluate language models on semantic phrase processing, covering various phrase types. It reveals significant performance differences, especially in semantic reasoning tasks, highlighting variations in models comprehension.

🔹 Publication Date: Published on Apr 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.16593
• PDF: https://arxiv.org/pdf/2604.16593
• Github: https://github.com/jacklanda/SemanticQA

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Crowded in B-Space: Calibrating Shared Directions for LoRA Merging

📝 Summary:
LoRA adapter merging performance can be improved by separately calibrating the output-side matrix B to reduce interference from shared directions while preserving task-specific information. AI-generat...

🔹 Publication Date: Published on Apr 18

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.16826
• PDF: https://arxiv.org/pdf/2604.16826

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research