ML Research Hub
32.6K subscribers
5.8K photos
371 videos
24 files
6.27K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
🎁 23 Years of SPOTO – Claim Your Free IT Certs Prep Kit!

πŸ”₯Whether you're preparing for #Python, #AI, #Cisco, #PMI, #Fortinet, #AWS, #Azure, #Excel, #comptia, #ITIL, #cloud or any other in-demand certification – SPOTO has got you covered!

βœ… Free Resources :
・Free Python, Excel, Cyber Security, Cisco, SQL, ITIL, PMP, AWS courses: https://bit.ly/4lk4m3c
・IT Certs E-book: https://bit.ly/4bdZOqt
・IT Exams Skill Test: https://bit.ly/4sDvi0b
・Free AI material and support tools: https://bit.ly/46TpsQ8
・Free Cloud Study Guide: https://bit.ly/4lk3dIS

🎁 Join SPOTO 23rd anniversary Lucky Draw:
πŸ“± iPhone 17
πŸ›’free order
πŸ›’ Amazon Gift Card $50/$100
πŸ“˜ AI/CCNA/PMP Course Training + Study Material + eBook
Enter the Draw πŸ‘‰: https://bit.ly/3NwkceD

πŸ‘‰ Become Part of Our IT Learning Circle! resources and support:
https://chat.whatsapp.com/Cnc5M5353oSBo3savBl397

πŸ’¬ Want exam help? Chat with an admin now!
wa.link/rozuuw

⏰Last Chance – Get It Before It’s Gone!
πŸ”₯2πŸ‘1πŸ‘1
✨Are Audio-Language Models Listening? Audio-Specialist Heads for Adaptive Audio Steering

πŸ“ Summary:
Large audio-language models can under-utilize audio. This work identifies audio-specialist attention heads that provide a listening signal. An inference-time intervention amplifies audio influence, improving LALM accuracy by up to 8% without parameter updates.

πŸ”Ή Publication Date: Published on Mar 6

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2603.06854
β€’ PDF: https://arxiv.org/pdf/2603.06854

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AudioLanguageModels #DeepLearning #AttentionMechanisms #AIResearch #MachineLearning
This media is not supported in your browser
VIEW IN TELEGRAM
✨Reward Prediction with Factorized World States

πŸ“ Summary:
StateFactory transforms observations into hierarchical object-attribute structures using language models. This enables superior zero-shot reward prediction across domains by measuring semantic similarity, significantly improving agent planning performance.

πŸ”Ή Publication Date: Published on Mar 10

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2603.09400
β€’ PDF: https://arxiv.org/pdf/2603.09400
β€’ Project Page: https://statefactory.github.io/
β€’ Github: https://github.com/yijunshens/StateFactory

✨ Datasets citing this paper:
β€’ https://huggingface.co/datasets/YijunShen/RewardPrediction

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#RewardPrediction #AI #LanguageModels #MachineLearning #AgentPlanning
✨Do What I Say: A Spoken Prompt Dataset for Instruction-Following

πŸ“ Summary:
DoWhatISay is a new multilingual dataset of human-recorded spoken and written prompts for evaluating Speech Large Language Models. It reveals text prompts consistently outperform spoken prompts, except in speech-output tasks. This highlights the need for speech-based SLLM evaluation.

πŸ”Ή Publication Date: Published on Mar 10

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2603.09881
β€’ PDF: https://arxiv.org/pdf/2603.09881
β€’ Project Page: https://huggingface.co/collections/meetween/meetweens-research-papers
β€’ Github: https://github.com/MaikeZuefle/DOWIS

✨ Datasets citing this paper:
β€’ https://huggingface.co/datasets/maikezu/dowis

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#SLLM #SpeechAI #LLM #PromptEngineering #Dataset
✨Compiler-First State Space Duality and Portable O(1) Autoregressive Caching for Inference

πŸ“ Summary:
Mamba-2's state space model is implemented using XLA-optimized primitives, eliminating custom kernels. This enables efficient cross-platform deployment on CPU, GPU, and TPU, realizing O1 autoregressive caching with high performance.

πŸ”Ή Publication Date: Published on Mar 10

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2603.09555
β€’ PDF: https://arxiv.org/pdf/2603.09555
β€’ Github: https://github.com/CosmoNaught/mamba2-jax

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#Mamba2 #StateSpaceModels #DeepLearning #MLInference #PerformanceOptimization
✨ReflexiCoder: Teaching Large Language Models to Self-Reflect on Generated Code and Self-Correct It via Reinforcement Learning

πŸ“ Summary:
ReflexiCoder uses reinforcement learning to teach large language models autonomous code reflection and self-correction. It internalizes the debugging process into the model, achieving state-of-the-art performance on coding benchmarks, rivaling proprietary models, and reducing inference compute by...

πŸ”Ή Publication Date: Published on Mar 6

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2603.05863
β€’ PDF: https://arxiv.org/pdf/2603.05863
β€’ Github: https://github.com/juyongjiang/ReflexiCoder

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#LLM #ReinforcementLearning #CodeGeneration #AI #DeepLearning
✨TALON: Test-time Adaptive Learning for On-the-Fly Category Discovery

πŸ“ Summary:
TALON is a test-time adaptation framework for on-the-fly category discovery. It dynamically updates prototypes and encoder parameters, while calibrating logits, to improve novel class recognition and prevent category explosion. This approach significantly outperforms existing methods.

πŸ”Ή Publication Date: Published on Mar 9

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2603.08075
β€’ PDF: https://arxiv.org/pdf/2603.08075
β€’ Github: https://github.com/ynanwu/TALON

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#MachineLearning #DeepLearning #CategoryDiscovery #TestTimeAdaptation #ComputerVision
✨Test-Driven AI Agent Definition (TDAD): Compiling Tool-Using Agents from Behavioral Specifications

πŸ“ Summary:
TDAD is a methodology that compiles AI agent prompts from behavioral specifications using automated testing. This iterative process refines prompts to ensure measurable compliance, preventing regressions and policy violations for reliable production deployment.

πŸ”Ή Publication Date: Published on Mar 9

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2603.08806
β€’ PDF: https://arxiv.org/pdf/2603.08806
β€’ Project Page: https://www.alphaxiv.org/abs/2603.08806
β€’ Github: https://github.com/f-labs-io/tdad-paper-code

✨ Datasets citing this paper:
β€’ https://huggingface.co/datasets/f-labs-io/SpecSuite-Core

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AIAgents #PromptEngineering #TestDrivenDevelopment #AISafety #AIResearch
✨Bolbosh: Script-Aware Flow Matching for Kashmiri Text-to-Speech

πŸ“ Summary:
Bolbosh is the first open-source neural TTS for Kashmiri, addressing diacritic and data challenges. It uses script-aware flow matching and acoustic enhancement. The system significantly outperforms multilingual baselines, setting a new benchmark for Kashmiri TTS.

πŸ”Ή Publication Date: Published on Mar 8

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2603.07513
β€’ PDF: https://arxiv.org/pdf/2603.07513
β€’ Project Page: https://gaash-lab.github.io/Bolbosh
β€’ Github: https://github.com/gaash-lab/Bolbosh

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨Beyond Test-Time Training: Learning to Reason via Hardware-Efficient Optimal Control

πŸ“ Summary:
The Test-Time Control TTC layer embeds optimal control LQR planning as an architectural component in LLMs. This enables planning before prediction for enhanced reasoning. TTC layers improve mathematical problem-solving performance by up to 27.8% on MATH-500 and 2-3x on other benchmarks, using a h...

πŸ”Ή Publication Date: Published on Mar 10

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2603.09221
β€’ PDF: https://arxiv.org/pdf/2603.09221
β€’ Project Page: https://vita-group.github.io/TTC-Net/
β€’ Github: https://github.com/VITA-Group/TTC-Net

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨BiCLIP: Domain Canonicalization via Structured Geometric Transformation

πŸ“ Summary:
BiCLIP adapts vision-language models to specialized domains using a simple bilinear transformation. It aligns multimodal features via geometric canonicalization, leveraging few-shot samples as anchors. This achieves state-of-the-art results on multiple benchmarks with extreme simplicity.

πŸ”Ή Publication Date: Published on Mar 9

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2603.08942
β€’ PDF: https://arxiv.org/pdf/2603.08942
β€’ Project Page: https://quantitativeimaginglaboratory.github.io/BilinearCLIP/
β€’ Github: https://github.com/QuantitativeImagingLaboratory/BilinearCLIP

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨Micro-Diffusion Compression -- Binary Tree Tweedie Denoising for Online Probability Estimation

πŸ“ Summary:
Midicoth enhances compression efficiency by applying a micro-diffusion denoising layer to refine probability estimates in adaptive statistical models, addressing limitations in sparse data scenarios t...

πŸ”Ή Publication Date: Published on Mar 9

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2603.08771
β€’ PDF: https://arxiv.org/pdf/2603.08771
β€’ Github: https://github.com/robtacconelli/midicoth

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨Multi-Head Low-Rank Attention

πŸ“ Summary:
Multi-Head Low-Rank Attention addresses long-context inference bottlenecks in large language models by enabling efficient 4-way tensor parallelism decoding through partitionable latent states. AI-gene...

πŸ”Ή Publication Date: Published on Mar 2

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/pdf/2603.02188
β€’ PDF: https://arxiv.org/pdf/2603.02188
β€’ Project Page: https://songtaoliu0823.github.io/mlra/
β€’ Github: https://github.com/SongtaoLiu0823/MLRA

πŸ”Ή Models citing this paper:
β€’ https://huggingface.co/Soughing/MLRA

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨OpenClaw-RL: Train Any Agent Simply by Talking

πŸ“ Summary:
OpenClaw-RL unifies policy learning from all live next-state signals across diverse interaction modalities. It asynchronously recovers evaluative and directive information, enabling agents to improve simply by being used.

πŸ”Ή Publication Date: Published on Mar 10

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2603.10165
β€’ PDF: https://arxiv.org/pdf/2603.10165
β€’ Github: https://github.com/Gen-Verse/OpenClaw-RL

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨RetroAgent: From Solving to Evolving via Retrospective Dual Intrinsic Feedback

πŸ“ Summary:
RetroAgent enhances LLM-based agents through online reinforcement learning with self-reflection mechanisms that provide both numerical and language-based intrinsic feedback for improved exploration an...

πŸ”Ή Publication Date: Published on Mar 9

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2603.08561
β€’ PDF: https://arxiv.org/pdf/2603.08561
β€’ Github: https://github.com/zhangxy-2019/RetroAgent

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨EmboAlign: Aligning Video Generation with Compositional Constraints for Zero-Shot Manipulation

πŸ“ Summary:
A data-free framework aligns video generative model outputs with vision-language model constraints for improved robotic manipulation, achieving higher success rates through constraint-guided selection...

πŸ”Ή Publication Date: Published on Mar 5

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2603.05757
β€’ PDF: https://arxiv.org/pdf/2603.05757

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨V_{0.5}: Generalist Value Model as a Prior for Sparse RL Rollouts

πŸ“ Summary:
Adaptive value estimation method combines pretrained prior with empirical rollouts using real-time statistical testing to reduce variance and improve reinforcement learning performance under sparse sa...

πŸ”Ή Publication Date: Published on Mar 11

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2603.10848
β€’ PDF: https://arxiv.org/pdf/2603.10848

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨Just-in-Time: Training-Free Spatial Acceleration for Diffusion Transformers

πŸ“ Summary:
Diffusion Transformers face high computational costs during iterative sampling, which this work addresses by introducing a spatial-domain acceleration framework that uses sparse anchor tokens and dete...

πŸ”Ή Publication Date: Published on Mar 11

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2603.10744
β€’ PDF: https://arxiv.org/pdf/2603.10744

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR

πŸ“ Summary:
Contrastive Learning mechanism integrated into Policy Optimization enhances LLM reasoning by regularizing correct reasoning paths and reducing hallucinations. AI-generated summary Reinforcement Learni...

πŸ”Ή Publication Date: Published on Mar 10

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2603.10101
β€’ PDF: https://arxiv.org/pdf/2603.10101
β€’ Github: https://github.com/Qwen-Applications/CLIPO

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨Code-Space Response Oracles: Generating Interpretable Multi-Agent Policies with Large Language Models

πŸ“ Summary:
Code-Space Response Oracles replace traditional neural network policies with human-readable code generated by large language models, enabling interpretable and explainable multi-agent reinforcement le...

πŸ”Ή Publication Date: Published on Mar 10

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2603.10098
β€’ PDF: https://arxiv.org/pdf/2603.10098

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research