ML Research Hub
32.5K subscribers
5.95K photos
383 videos
24 files
6.44K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
ECG-Reasoning-Benchmark: A Benchmark for Evaluating Clinical Reasoning Capabilities in ECG Interpretation

📝 Summary:
W h i l e M u l t i m o d a l L a r g e L a n g u a g e M o d e l s ( M L L M s ) s h o w p r o m i s i n g p e r f o r m a n c e i n a u t o m a t e d e l e c t r o c a r d i o g r a m i n t e r p r ...

🔹 Publication Date: Published on Mar 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.14326
• PDF: https://arxiv.org/pdf/2603.14326

Datasets citing this paper:
https://huggingface.co/datasets/Jwoo5/ECG-Reasoning-Benchmark

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Residual Stream Duality in Modern Transformer Architectures

📝 Summary:
The residual stream in Transformers can be viewed through a two-axis framework where sequence position and layer depth provide different pathways for information flow, with causal depth-wise residual ...

🔹 Publication Date: Published on Mar 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.16039
• PDF: https://arxiv.org/pdf/2603.16039
• Project Page: https://github.com/yifanzhang-pro/residual-stream-duality
• Github: https://github.com/yifanzhang-pro/residual-stream-duality

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
ARISE: Agent Reasoning with Intrinsic Skill Evolution in Hierarchical Reinforcement Learning

📝 Summary:
A hierarchical reinforcement learning framework named ARISE employs a skill management system to improve mathematical reasoning in language models through reusable strategies and structured skill libr...

🔹 Publication Date: Published on Mar 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.16060
• PDF: https://arxiv.org/pdf/2603.16060
• Github: https://github.com/Skylanding/ARISE

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
MDM-Prime-v2: Binary Encoding and Index Shuffling Enable Compute-optimal Scaling of Diffusion Language Models

📝 Summary:
MDM-Prime-v2 enhances masked diffusion language models with Binary Encoding and Index Shuffling. It is 21.8 times more compute-efficient than autoregressive models, achieving significantly better perplexity and zero-shot accuracy.

🔹 Publication Date: Published on Mar 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.16077
• PDF: https://arxiv.org/pdf/2603.16077
• Project Page: https://chen-hao-chao.github.io/mdm-prime-v2/
• Github: https://github.com/chen-hao-chao/mdm-prime-v2

🔹 Models citing this paper:
https://huggingface.co/chen-hao-chao/mdm-prime-v2-c4
https://huggingface.co/chen-hao-chao/mdm-prime-v2-slimpajama

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Mamba: Linear-Time Sequence Modeling with Selective State Spaces

📝 Summary:
Mamba is a novel SSM that outperforms Transformers by enabling content-based reasoning through selective state spaces. It offers 5x faster inference, linear scaling, and achieves state-of-the-art results across language, audio, and genomics, even matching larger Transformers.

🔹 Publication Date: Published on Dec 1, 2023

🔹 Paper Links:
• arXiv Page: https://arxivexplained.com/papers/mamba-linear-time-sequence-modeling-with-selective-state-spaces
• PDF: https://arxiv.org/pdf/2312.00752
• Github: https://github.com/state-spaces/mamba

🔹 Models citing this paper:
https://huggingface.co/tiiuae/falcon-mamba-7b
https://huggingface.co/state-spaces/mamba-2.8b-slimpj
https://huggingface.co/tiiuae/falcon-mamba-7b-instruct

Datasets citing this paper:
https://huggingface.co/datasets/huaXiaKyrie/up
https://huggingface.co/datasets/Sherirto/BD4UI

Spaces citing this paper:
https://huggingface.co/spaces/FallnAI/Quantize-HF-Models
https://huggingface.co/spaces/openfree/LLM_Quantization
https://huggingface.co/spaces/seawolf2357/LLM_Quantization

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Unified Spatio-Temporal Token Scoring for Efficient Video VLMs

📝 Summary:
STTS is a lightweight module for efficiently pruning vision tokens across vision transformer and language models in video VLMs. It achieves 62% efficiency gains with only a 0.7% performance drop by learning spatio-temporal token scoring without text conditioning.

🔹 Publication Date: Published on Mar 18

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.18004
• PDF: https://arxiv.org/pdf/2603.18004
• Github: https://github.com/allenai/STTS

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
This media is not supported in your browser
VIEW IN TELEGRAM
MosaicMem: Hybrid Spatial Memory for Controllable Video World Models

📝 Summary:
Video diffusion models use hybrid spatial memory to maintain consistency under camera motion and enable long-term scene editing and navigation. AI-generated summary Video diffusion models are moving b...

🔹 Publication Date: Published on Mar 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.17117
• PDF: https://arxiv.org/pdf/2603.17117
• Project Page: https://mosaicmem.github.io/mosaicmem/
• Github: https://mosaicmem.github.io/mosaicmem/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Stereo World Model: Camera-Guided Stereo Video Generation

📝 Summary:
StereoWorld is a camera-conditioned stereo world model that generates stereo videos end-to-end using RGB modality while maintaining geometric consistency and efficiency through novel attention mechani...

🔹 Publication Date: Published on Mar 18

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.17375
• PDF: https://arxiv.org/pdf/2603.17375
• Project Page: https://sunyangtian.github.io/StereoWorld-web/
• Github: https://github.com/SunYangtian/StereoWorld

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
When AI Navigates the Fog of War

📝 Summary:
Large language models demonstrate varying capabilities in reasoning about unfolding geopolitical conflicts, showing strategic realism in structured settings but inconsistent performance in complex pol...

🔹 Publication Date: Published on Mar 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.16642
• PDF: https://arxiv.org/pdf/2603.16642
• Project Page: https://www.war-forecast-arena.com/
• Github: https://github.com/xirui-li/war-test

Datasets citing this paper:
https://huggingface.co/datasets/AIcell/war-test-dataset

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
AdaMem: Adaptive User-Centric Memory for Long-Horizon Dialogue Agents

📝 Summary:
AdaMem is an adaptive memory framework for dialogue agents that organizes conversation history into multiple memory types and uses conditional retrieval to improve long-horizon reasoning and user mode...

🔹 Publication Date: Published on Mar 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.16496
• PDF: https://arxiv.org/pdf/2603.16496

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research