ML Research Hub

✨Continuous Audio Language Models

📝 Summary:
Audio Language Models (ALM) have emerged as the dominant paradigm for speech and music generation by representing audio as sequences of discrete tokens. Yet, unlike text tokens, which are invertible, ...

🔹 Publication Date: Published on Sep 8, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.06926
• PDF: https://arxiv.org/pdf/2509.06926
• Project Page: https://huggingface.co/spaces/kyutai/calm-samples
• Github: https://github.com/kyutai-labs/pocket-tts

🔹 Models citing this paper:
• https://huggingface.co/kyutai/pocket-tts
• https://huggingface.co/kyutai/pocket-tts-without-voice-cloning
• https://huggingface.co/Verylicious/pocket-tts-ungated

✨ Spaces citing this paper:
• https://huggingface.co/spaces/D3vShoaib/pocket-tts
• https://huggingface.co/spaces/Xlnk/tts
• https://huggingface.co/spaces/kyutai/calm-samples

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

arXiv.org

Continuous Audio Language Models

Audio Language Models (ALM) have emerged as the dominant paradigm for speech and music generation by representing audio as sequences of discrete tokens. Yet, unlike text tokens, which are...

183 views06:01

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨InfiAgent: An Infinite-Horizon Framework for General-Purpose Autonomous Agents

📝 Summary:
InfiAgent is a framework that maintains bounded reasoning context for long-horizon tasks by externalizing persistent state into a file-centric abstraction, enabling stable performance without task-spe...

🔹 Publication Date: Published on Jan 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03204
• PDF: https://arxiv.org/pdf/2601.03204
• Github: https://github.com/ChenglinPoly/infiAgent

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

237 views06:01

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨YaPO: Learnable Sparse Activation Steering Vectors for Domain Adaptation

📝 Summary:
YaPO learns sparse steering vectors for LLMs using Sparse Autoencoders, enabling more effective and stable control than dense methods. This leads to disentangled, interpretable directions for fine-grained alignment across various behaviors, without degrading general knowledge. YaPO offers a gener...

🔹 Publication Date: Published on Jan 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.08441
• PDF: https://arxiv.org/pdf/2601.08441
• Project Page: https://mbzuai-paris.github.io/YaPO/
• Github: https://github.com/MBZUAI-Paris/YaPO

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#LLMs #DomainAdaptation #SparseLearning #MachineLearning #AI

❤1

203 views11:02

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨CLARE: Continual Learning for Vision-Language-Action Models via Autonomous Adapter Routing and Expansion

📝 Summary:
CLARE enables robots to continually learn new tasks without forgetting, using lightweight adapters. It autonomously expands these adapters and dynamically routes them, ensuring high performance without needing task labels or storing past data.

🔹 Publication Date: Published on Jan 14

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.09512
• PDF: https://arxiv.org/pdf/2601.09512
• Project Page: https://tum-lsy.github.io/clare/
• Github: https://github.com/utiasDSL/clare

✨ Datasets citing this paper:
• https://huggingface.co/datasets/continuallearning/libero_10_image_task_0

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#ContinualLearning #Robotics #AI #MachineLearning #VLAModels

❤2

308 views12:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨PubMed-OCR: PMC Open Access OCR Annotations

📝 Summary:
PubMed-OCR is a corpus of 209.5K scientific articles from PubMed Central with Google Cloud Vision OCR annotations. It provides word, line, and paragraph bounding boxes to support layout-aware modeling and OCR evaluation. This data is publicly released.

🔹 Publication Date: Published on Jan 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.11425
• PDF: https://arxiv.org/pdf/2601.11425

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#OCR #Dataset #ComputerVision #MachineLearning #DataScience

❤3

262 views16:04

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨PaddleOCR 3.0 Technical Report

📝 Summary:
PaddleOCR 3.0 is an open-source toolkit offering efficient OCR and document parsing solutions. Its models achieve competitive accuracy and efficiency with fewer than 100 million parameters, rivaling much larger vision-language models.

🔹 Publication Date: Published on Jul 8, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2507.05595
• PDF: https://huggingface.co/collections/PaddlePaddle/pp-structurev3

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

❤1

216 views22:05

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Multi-Agent Collaboration via Evolving Orchestration

📝 Summary:
A centralized orchestrator, trained with reinforcement learning, dynamically directs LLM agents for multi-agent collaboration. This puppeteer-style method achieves superior performance and reduced computational costs.

🔹 Publication Date: Published on May 26, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2505.19591
• PDF: https://arxiv.org/pdf/2505.19591
• Github: https://github.com/OpenBMB/ChatDev/tree/puppeteer

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

159 views01:00

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨MemoryRewardBench: Benchmarking Reward Models for Long-Term Memory Management in Large Language Models

📝 Summary:
MemoryRewardBench is a new benchmark evaluating reward models ability to assess long-term memory management in LLMs across various context lengths and patterns. Evaluations reveal newer RMs outperform predecessors, open-source models are closing the gap, and current RMs have limitations.

🔹 Publication Date: Published on Jan 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.11969
• PDF: https://arxiv.org/pdf/2601.11969
• Github: https://github.com/LCM-Lab/MemRewardBench

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

130 views03:00

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨UniX: Unifying Autoregression and Diffusion for Chest X-Ray Understanding and Generation

📝 Summary:
UniX presents a unified medical foundation model that decouples visual understanding and generation tasks using distinct autoregressive and diffusion branches with cross-modal attention for enhanced p...

🔹 Publication Date: Published on Jan 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.11522
• PDF: https://arxiv.org/pdf/2601.11522
• Github: https://github.com/ZrH42/UniX

🔹 Models citing this paper:
• https://huggingface.co/ZrH42/UniX

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

85 views04:00

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Aligning Agentic World Models via Knowledgeable Experience Learning

📝 Summary:
WorldMind addresses LLM physical hallucinations by autonomously building a symbolic world knowledge repository. It unifies process and goal experiences to enforce physical feasibility and task optimality, achieving superior performance and transferability.

🔹 Publication Date: Published on Jan 19

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.13247
• PDF: https://arxiv.org/pdf/2601.13247
• Project Page: https://zjunlp.github.io/project/WorldMind/
• Github: https://github.com/zjunlp/WorldMind

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

91 views04:01

✨ Explore Data Science 📝 Write your paper

About

Blog

Apps

Platform