ML Research Hub

✨Routing the Lottery: Adaptive Subnetworks for Heterogeneous Data

📝 Summary:
Routing the Lottery framework discovers multiple specialized subnetworks tailored to different data conditions, outperforming traditional pruning methods while using fewer parameters and identifying s...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.22141
• PDF: https://arxiv.org/pdf/2601.22141

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

136 views04:02

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Latent Chain-of-Thought as Planning: Decoupling Reasoning from Verbalization

📝 Summary:
PLaT introduces a latent reasoning framework that decouples reasoning from verbalization, enabling dynamic termination and improved scalability over traditional approaches. AI-generated summary Chain-...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21358
• PDF: https://arxiv.org/pdf/2601.21358

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

150 views04:02

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨THINKSAFE: Self-Generated Safety Alignment for Reasoning Models

📝 Summary:
ThinkSafe is a self-aligned framework that enhances safety in large reasoning models. It uses lightweight refusal steering and fine-tuning on self-generated responses to preserve reasoning performance and reduce computational costs. ThinkSafe significantly improves safety without degrading native...

🔹 Publication Date: Published on Jan 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.23143
• PDF: https://arxiv.org/pdf/2601.23143

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AISafety #LLMs #AIAlignment #MachineLearning #DeepLearning

113 views05:02

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning

📝 Summary:
MemOCR is a multimodal memory agent for long-horizon reasoning that compresses interaction histories into visual layouts. It adaptively allocates memory space, visually prioritizing crucial evidence while compressing details, outperforming text-based baselines under tight budgets.

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21468
• PDF: https://arxiv.org/pdf/2601.21468

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #MultimodalAI #LongHorizonReasoning #MemoryNetworks #ComputerVision

109 views05:02

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Pushing the Boundaries of Natural Reasoning: Interleaved Bonus from Formal-Logic Verification

📝 Summary:
This paper presents a framework that interleaves formal logic verification with natural language generation to improve LLM reasoning. It actively detects and corrects errors during the reasoning process. This method significantly outperforms state-of-the-art models on various reasoning benchmarks.

🔹 Publication Date: Published on Jan 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.22642
• PDF: https://arxiv.org/pdf/2601.22642

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

101 views05:02

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨PaperBanana: Automating Academic Illustration for AI Scientists

📝 Summary:
_paperbanana is an agentic framework that automates the creation of publication-ready academic illustrations using advanced vision-language models and image generation techniques. AI-generated summary...

🔹 Publication Date: Published on Jan 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.23265
• PDF: https://arxiv.org/pdf/2601.23265
• Project Page: https://dwzhu-pku.github.io/PaperBanana/

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

87 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨ReGuLaR: Variational Latent Reasoning Guided by Rendered Chain-of-Thought

📝 Summary:
ReGuLaR introduces a variational auto-encoding framework that compresses reasoning processes into latent space while maintaining performance through image-rendered explicit reasoning chains for guidan...

🔹 Publication Date: Published on Jan 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.23184
• PDF: https://arxiv.org/pdf/2601.23184
• Github: https://github.com/FanmengWang/ReGuLaR

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

117 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨LMK > CLS: Landmark Pooling for Dense Embeddings

📝 Summary:
Landmark pooling improves long-context representation learning by partitioning sequences into chunks and using landmark tokens to preserve both global and local information more effectively than tradi...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21525
• PDF: https://arxiv.org/pdf/2601.21525

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

92 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨DINO-SAE: DINO Spherical Autoencoder for High-Fidelity Image Reconstruction and Generation

📝 Summary:
A novel vision autoencoder framework combines semantic representation with pixel-level reconstruction using spherical latent space and Riemannian flow matching for improved fidelity and efficiency. AI...

🔹 Publication Date: Published on Jan 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.22904
• PDF: https://arxiv.org/pdf/2601.22904

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

93 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨NativeTok: Native Visual Tokenization for Improved Image Generation

📝 Summary:
NativeTok introduces a novel visual tokenization approach that enforces causal dependencies during image encoding, using a Meta Image Transformer and Mixture of Causal Expert Transformer for efficient...

🔹 Publication Date: Published on Jan 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.22837
• PDF: https://arxiv.org/pdf/2601.22837
• Github: https://github.com/wangbei1/Nativetok

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

109 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨DIFFA-2: A Practical Diffusion Large Language Model for General Audio Understanding

📝 Summary:
DIFFA-2, a diffusion-based large audio language model, achieves competitive audio understanding performance with improved efficiency over autoregressive counterparts through enhanced encoding, dual ad...

🔹 Publication Date: Published on Jan 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.23161
• PDF: https://arxiv.org/pdf/2601.23161
• Github: https://github.com/NKU-HLT/DIFFA

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

133 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Deep Search with Hierarchical Meta-Cognitive Monitoring Inspired by Cognitive Neuroscience

📝 Summary:
Deep search agents with hierarchical metacognitive monitoring enhance reasoning and retrieval performance through fast consistency checks and experience-driven corrective interventions. AI-generated s...

🔹 Publication Date: Published on Jan 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.23188
• PDF: https://arxiv.org/pdf/2601.23188

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

168 views06:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Robust Tool Use via Fission-GRPO: Learning to Recover from Execution Errors

📝 Summary:
A framework called Fission-GRPO is introduced to improve multi-turn tool execution in large language models by converting execution errors into corrective supervision during reinforcement learning tra...

🔹 Publication Date: Published on Jan 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.15625
• PDF: https://arxiv.org/pdf/2601.15625

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

118 views06:23

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨FourierSampler: Unlocking Non-Autoregressive Potential in Diffusion Language Models via Frequency-Guided Generation

📝 Summary:
Diffusion language models face positional bias. FourierSampler uses frequency analysis to guide generation by separating global structure from local details. This sliding window approach significantly outperforms previous methods and autoregressive models.

🔹 Publication Date: Published on Jan 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.23182
• PDF: https://arxiv.org/pdf/2601.23182
• Github: https://github.com/ShirleYoung/FourierSampler

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

99 views07:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Statistical Estimation of Adversarial Risk in Large Language Models under Best-of-N Sampling

📝 Summary:
A scaling-aware risk estimation method called SABER is introduced for predicting large-scale adversarial vulnerability in language models through Best-of-N sampling, enabling accurate assessment with ...

🔹 Publication Date: Published on Jan 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.22636
• PDF: https://arxiv.org/pdf/2601.22636

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

108 views07:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨PaddleOCR-VL-1.5: Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing

📝 Summary:
A compact vision-language model achieves state-of-the-art accuracy on document understanding tasks while maintaining efficiency through specialized benchmarking and extended functionality. AI-generate...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21957
• PDF: https://arxiv.org/pdf/2601.21957

🔹 Models citing this paper:
• https://huggingface.co/PaddlePaddle/PaddleOCR-VL-1.5
• https://huggingface.co/PaddlePaddle/PP-DocLayoutV3

✨ Spaces citing this paper:
• https://huggingface.co/spaces/PaddlePaddle/PaddleOCR-VL-1.5_Online_Demo
• https://huggingface.co/spaces/AAAASSSASDASD3000/PaddleOCR-VL-1.5_Online_Demo

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

137 views07:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Revisiting Diffusion Model Predictions Through Dimensionality

📝 Summary:
Diffusion models using direct data prediction outperform traditional noise or velocity prediction in high-dimensional settings, with a proposed framework automatically learning optimal prediction para...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21419
• PDF: https://arxiv.org/pdf/2601.21419

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

167 views07:04

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Machine Learning for Energy-Performance-aware Scheduling

📝 Summary:
We propose a Bayesian Optimization framework using Gaussian Processes to automate scheduling configuration on multi-core systems. It approximates the energy-time Pareto Frontier and reveals dominant hardware parameters through sensitivity analysis.

🔹 Publication Date: Published on Jan 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.23134
• PDF: https://arxiv.org/pdf/2601.23134

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#MachineLearning #Optimization #EnergyEfficiency #ComputerArchitecture #DataScience

158 views08:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Continual GUI Agents

📝 Summary:
The Continual GUI Agents framework addresses performance degradation in dynamic UI environments. It introduces GUI-Anchoring in Flux GUI-AiF, a reinforcement fine-tuning method with novel anchoring rewards that stabilize learning across shifting UI domains and resolutions, outperforming existing ...

🔹 Publication Date: Published on Jan 28

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20732
• PDF: https://arxiv.org/pdf/2601.20732

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#ContinualLearning #ReinforcementLearning #AIAgents #HumanComputerInteraction #MachineLearning

164 views08:04

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨RM -RF: Reward Model for Run-Free Unit Test Evaluation

📝 Summary:
RM-RF is a lightweight reward model predicting unit test outcomes directly from source code, skipping compile and run. It forecasts test suite success, coverage, and mutation kill rate, offering faster, cheaper evaluation for AI generated tests. This enables scalable feedback for test generation.

🔹 Publication Date: Published on Jan 19

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.13097
• PDF: https://arxiv.org/pdf/2601.13097
• Github: https://github.com/trndcenter/RM-RF-unit-tests

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#RewardModels #UnitTesting #AIGeneratedTests #SoftwareEngineering #MachineLearning

153 views09:04

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨TAM-Eval: Evaluating LLMs for Automated Unit Test Maintenance

📝 Summary:
TAM-Eval is a new framework and benchmark for evaluating LLMs on comprehensive test suite maintenance tasks like creation, repair, and updating across Python, Java, and Go. It operates at the test file level with full repository context. Empirical results show current LLMs have limited capabiliti...

🔹 Publication Date: Published on Jan 26

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18241
• PDF: https://arxiv.org/pdf/2601.18241
• Github: https://github.com/trndcenter/TAM-Eval

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#LLM #SoftwareEngineering #TestAutomation #AI4Code #TAMEval

❤1

177 views09:04

✨ Explore Data Science 📝 Write your paper

About

Blog

Apps

Platform