This media is not supported in your browser
VIEW IN TELEGRAM
✨DreamStyle: A Unified Framework for Video Stylization
📝 Summary:
DreamStyle is a unified video stylization framework that supports multiple style conditions while addressing style inconsistency and temporal flicker through a specialized data curation pipeline and L...
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02785
• PDF: https://arxiv.org/pdf/2601.02785
• Project Page: https://lemonsky1995.github.io/dreamstyle/
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
DreamStyle is a unified video stylization framework that supports multiple style conditions while addressing style inconsistency and temporal flicker through a specialized data curation pipeline and L...
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02785
• PDF: https://arxiv.org/pdf/2601.02785
• Project Page: https://lemonsky1995.github.io/dreamstyle/
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨MiMo-V2-Flash Technical Report
📝 Summary:
MiMo-V2-Flash is a sparse Mixture-of-Experts model with hybrid attention architecture and efficient distillation technique that achieves strong performance with reduced parameters and improved inferen...
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02780
• PDF: https://arxiv.org/pdf/2601.02780
• Project Page: https://mimo.xiaomi.com/blog/mimo-v2-flash
• Github: https://github.com/XiaomiMiMo/MiMo-V2-Flash
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
MiMo-V2-Flash is a sparse Mixture-of-Experts model with hybrid attention architecture and efficient distillation technique that achieves strong performance with reduced parameters and improved inferen...
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02780
• PDF: https://arxiv.org/pdf/2601.02780
• Project Page: https://mimo.xiaomi.com/blog/mimo-v2-flash
• Github: https://github.com/XiaomiMiMo/MiMo-V2-Flash
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks
📝 Summary:
WebGym presents a large-scale open-source environment for training visual web agents using reinforcement learning with high-throughput asynchronous sampling, achieving superior performance on unseen w...
🔹 Publication Date: Published on Jan 5
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02439
• PDF: https://arxiv.org/pdf/2601.02439
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
WebGym presents a large-scale open-source environment for training visual web agents using reinforcement learning with high-throughput asynchronous sampling, achieving superior performance on unseen w...
🔹 Publication Date: Published on Jan 5
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02439
• PDF: https://arxiv.org/pdf/2601.02439
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
❤1
✨UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision
📝 Summary:
UniCorn is a self-improvement framework enhancing multimodal model generation. It uses self-play and cognitive reconstruction, without external data or supervision. UniCorn achieves state-of-the-art text-to-image generation.
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03193
• PDF: https://arxiv.org/pdf/2601.03193
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
UniCorn is a self-improvement framework enhancing multimodal model generation. It uses self-play and cognitive reconstruction, without external data or supervision. UniCorn achieves state-of-the-art text-to-image generation.
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03193
• PDF: https://arxiv.org/pdf/2601.03193
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨The Sonar Moment: Benchmarking Audio-Language Models in Audio Geo-Localization
📝 Summary:
Audio geo-localization benchmark AGL1K is introduced to advance audio language models' geospatial reasoning capabilities through curated audio clips and evaluation across multiple models. AI-generated...
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03227
• PDF: https://arxiv.org/pdf/2601.03227
• Github: https://github.com/Rising0321/AGL1K
✨ Spaces citing this paper:
• https://huggingface.co/spaces/RisingZhang/AudioGeoLoc
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
Audio geo-localization benchmark AGL1K is introduced to advance audio language models' geospatial reasoning capabilities through curated audio clips and evaluation across multiple models. AI-generated...
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03227
• PDF: https://arxiv.org/pdf/2601.03227
• Github: https://github.com/Rising0321/AGL1K
✨ Spaces citing this paper:
• https://huggingface.co/spaces/RisingZhang/AudioGeoLoc
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
Media is too big
VIEW IN TELEGRAM
✨SOP: A Scalable Online Post-Training System for Vision-Language-Action Models
📝 Summary:
SOP is a scalable online post-training system for VLA models that enables real-world robot policy adaptation. It uses a robot fleet to continuously learn from interaction, improving task proficiency while maintaining generality. SOP significantly boosts VLA model performance within hours.
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03044
• PDF: https://arxiv.org/pdf/2601.03044
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
SOP is a scalable online post-training system for VLA models that enables real-world robot policy adaptation. It uses a robot fleet to continuously learn from interaction, improving task proficiency while maintaining generality. SOP significantly boosts VLA model performance within hours.
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03044
• PDF: https://arxiv.org/pdf/2601.03044
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨AceFF: A State-of-the-Art Machine Learning Potential for Small Molecules
📝 Summary:
AceFF is a new machine learning potential for small molecule drug discovery. It offers DFT-level accuracy with high speed, supporting essential elements and charged states. Validation shows it is state-of-the-art for organic molecules.
🔹 Publication Date: Published on Jan 2
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.00581
• PDF: https://arxiv.org/pdf/2601.00581
• Github: https://github.com/torchmd/torchmd-net
🔹 Models citing this paper:
• https://huggingface.co/Acellera/AceFF-2.0
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#MachineLearning #DrugDiscovery #ComputationalChemistry #AIforScience #SmallMolecules
📝 Summary:
AceFF is a new machine learning potential for small molecule drug discovery. It offers DFT-level accuracy with high speed, supporting essential elements and charged states. Validation shows it is state-of-the-art for organic molecules.
🔹 Publication Date: Published on Jan 2
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.00581
• PDF: https://arxiv.org/pdf/2601.00581
• Github: https://github.com/torchmd/torchmd-net
🔹 Models citing this paper:
• https://huggingface.co/Acellera/AceFF-2.0
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#MachineLearning #DrugDiscovery #ComputationalChemistry #AIforScience #SmallMolecules
❤1
✨U-Net-Like Spiking Neural Networks for Single Image Dehazing
📝 Summary:
DehazeSNN introduces a U-Net-like Spiking Neural Network with an Orthogonal Leaky-Integrate-and-Fire Block for efficient image dehazing. It achieves competitive performance with reduced computational resources and a smaller model size.
🔹 Publication Date: Published on Dec 30, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.23950
• PDF: https://arxiv.org/pdf/2512.23950
• Github: https://github.com/HaoranLiu507/DehazeSNN
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
DehazeSNN introduces a U-Net-like Spiking Neural Network with an Orthogonal Leaky-Integrate-and-Fire Block for efficient image dehazing. It achieves competitive performance with reduced computational resources and a smaller model size.
🔹 Publication Date: Published on Dec 30, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.23950
• PDF: https://arxiv.org/pdf/2512.23950
• Github: https://github.com/HaoranLiu507/DehazeSNN
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨Large Reasoning Models Are (Not Yet) Multilingual Latent Reasoners
📝 Summary:
Large reasoning models show multilingual latent reasoning, stronger in resource-rich languages but weaker in low-resource ones. Despite varying strength, their internal prediction evolution is consistent across languages, suggesting an English-centered latent reasoning pathway.
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02996
• PDF: https://arxiv.org/pdf/2601.02996
• Github: https://github.com/cisnlp/multilingual-latent-reasoner
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
Large reasoning models show multilingual latent reasoning, stronger in resource-rich languages but weaker in low-resource ones. Despite varying strength, their internal prediction evolution is consistent across languages, suggesting an English-centered latent reasoning pathway.
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02996
• PDF: https://arxiv.org/pdf/2601.02996
• Github: https://github.com/cisnlp/multilingual-latent-reasoner
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
❤1
✨UniVideo: Unified Understanding, Generation, and Editing for Videos
📝 Summary:
UniVideo, a dual-stream framework combining a Multimodal Large Language Model and a Multimodal DiT, extends unified modeling to video generation and editing, achieving state-of-the-art performance and...
🔹 Publication Date: Published on Oct 9, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.08377
• PDF: https://arxiv.org/pdf/2510.08377
• Project Page: https://congwei1230.github.io/UniVideo/
• Github: https://github.com/KwaiVGI/UniVideo
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
UniVideo, a dual-stream framework combining a Multimodal Large Language Model and a Multimodal DiT, extends unified modeling to video generation and editing, achieving state-of-the-art performance and...
🔹 Publication Date: Published on Oct 9, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.08377
• PDF: https://arxiv.org/pdf/2510.08377
• Project Page: https://congwei1230.github.io/UniVideo/
• Github: https://github.com/KwaiVGI/UniVideo
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research