ML Research Hub
32.8K subscribers
4.17K photos
251 videos
23 files
4.5K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
DreamStyle: A Unified Framework for Video Stylization

📝 Summary:
DreamStyle is a unified video stylization framework that supports multiple style conditions while addressing style inconsistency and temporal flicker through a specialized data curation pipeline and L...

🔹 Publication Date: Published on Jan 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02785
• PDF: https://arxiv.org/pdf/2601.02785
• Project Page: https://lemonsky1995.github.io/dreamstyle/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
MiMo-V2-Flash Technical Report

📝 Summary:
MiMo-V2-Flash is a sparse Mixture-of-Experts model with hybrid attention architecture and efficient distillation technique that achieves strong performance with reduced parameters and improved inferen...

🔹 Publication Date: Published on Jan 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02780
• PDF: https://arxiv.org/pdf/2601.02780
• Project Page: https://mimo.xiaomi.com/blog/mimo-v2-flash
• Github: https://github.com/XiaomiMiMo/MiMo-V2-Flash

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks

📝 Summary:
WebGym presents a large-scale open-source environment for training visual web agents using reinforcement learning with high-throughput asynchronous sampling, achieving superior performance on unseen w...

🔹 Publication Date: Published on Jan 5

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02439
• PDF: https://arxiv.org/pdf/2601.02439

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1
UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision

📝 Summary:
UniCorn is a self-improvement framework enhancing multimodal model generation. It uses self-play and cognitive reconstruction, without external data or supervision. UniCorn achieves state-of-the-art text-to-image generation.

🔹 Publication Date: Published on Jan 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03193
• PDF: https://arxiv.org/pdf/2601.03193

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
The Sonar Moment: Benchmarking Audio-Language Models in Audio Geo-Localization

📝 Summary:
Audio geo-localization benchmark AGL1K is introduced to advance audio language models' geospatial reasoning capabilities through curated audio clips and evaluation across multiple models. AI-generated...

🔹 Publication Date: Published on Jan 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03227
• PDF: https://arxiv.org/pdf/2601.03227
• Github: https://github.com/Rising0321/AGL1K

Spaces citing this paper:
https://huggingface.co/spaces/RisingZhang/AudioGeoLoc

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Media is too big
VIEW IN TELEGRAM
SOP: A Scalable Online Post-Training System for Vision-Language-Action Models

📝 Summary:
SOP is a scalable online post-training system for VLA models that enables real-world robot policy adaptation. It uses a robot fleet to continuously learn from interaction, improving task proficiency while maintaining generality. SOP significantly boosts VLA model performance within hours.

🔹 Publication Date: Published on Jan 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03044
• PDF: https://arxiv.org/pdf/2601.03044

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
AceFF: A State-of-the-Art Machine Learning Potential for Small Molecules

📝 Summary:
AceFF is a new machine learning potential for small molecule drug discovery. It offers DFT-level accuracy with high speed, supporting essential elements and charged states. Validation shows it is state-of-the-art for organic molecules.

🔹 Publication Date: Published on Jan 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.00581
• PDF: https://arxiv.org/pdf/2601.00581
• Github: https://github.com/torchmd/torchmd-net

🔹 Models citing this paper:
https://huggingface.co/Acellera/AceFF-2.0

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#MachineLearning #DrugDiscovery #ComputationalChemistry #AIforScience #SmallMolecules
1
U-Net-Like Spiking Neural Networks for Single Image Dehazing

📝 Summary:
DehazeSNN introduces a U-Net-like Spiking Neural Network with an Orthogonal Leaky-Integrate-and-Fire Block for efficient image dehazing. It achieves competitive performance with reduced computational resources and a smaller model size.

🔹 Publication Date: Published on Dec 30, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.23950
• PDF: https://arxiv.org/pdf/2512.23950
• Github: https://github.com/HaoranLiu507/DehazeSNN

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Large Reasoning Models Are (Not Yet) Multilingual Latent Reasoners

📝 Summary:
Large reasoning models show multilingual latent reasoning, stronger in resource-rich languages but weaker in low-resource ones. Despite varying strength, their internal prediction evolution is consistent across languages, suggesting an English-centered latent reasoning pathway.

🔹 Publication Date: Published on Jan 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02996
• PDF: https://arxiv.org/pdf/2601.02996
• Github: https://github.com/cisnlp/multilingual-latent-reasoner

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1
UniVideo: Unified Understanding, Generation, and Editing for Videos

📝 Summary:
UniVideo, a dual-stream framework combining a Multimodal Large Language Model and a Multimodal DiT, extends unified modeling to video generation and editing, achieving state-of-the-art performance and...

🔹 Publication Date: Published on Oct 9, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.08377
• PDF: https://arxiv.org/pdf/2510.08377
• Project Page: https://congwei1230.github.io/UniVideo/
• Github: https://github.com/KwaiVGI/UniVideo

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research