ML Research Hub
32.9K subscribers
4.58K photos
281 videos
23 files
4.95K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
Towards Efficient and Robust Linguistic Emotion Diagnosis for Mental Health via Multi-Agent Instruction Refinement

📝 Summary:
APOLO framework uses automated prompt optimization through multi-agent collaboration to improve emotion diagnosis accuracy and robustness in mental healthcare applications. AI-generated summary Lingui...

🔹 Publication Date: Published on Jan 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.13481
• PDF: https://arxiv.org/pdf/2601.13481

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Agentic Reasoning for Large Language Models

📝 Summary:
Agentic reasoning redefines LLMs as autonomous agents that plan, act, and learn through continuous interaction in dynamic environments. This survey organizes agentic reasoning by environmental dynamics, from single-agent capabilities to multi-agent collaboration, bridging thought and action throu...

🔹 Publication Date: Published on Jan 18

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.12538
• PDF: https://arxiv.org/pdf/2601.12538
• Github: https://github.com/weitianxin/Awesome-Agentic-Reasoning

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning

📝 Summary:
Render-of-Thought framework converts textual reasoning steps into images using vision-language models to improve reasoning traceability and efficiency while maintaining competitive performance. AI-gen...

🔹 Publication Date: Published on Jan 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.14750
• PDF: https://arxiv.org/pdf/2601.14750

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Lost in the Prompt Order: Revealing the Limitations of Causal Attention in Language Models

📝 Summary:
Research reveals that causal attention in language models creates information bottlenecks when question-answer options follow context, leading to performance drops of over 14 percentage points compare...

🔹 Publication Date: Published on Jan 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.14152
• PDF: https://arxiv.org/pdf/2601.14152

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Paper2Rebuttal: A Multi-Agent Framework for Transparent Author Response Assistance

📝 Summary:
RebuttalAgent is a multi-agent framework that reframes rebuttal generation as an evidence-centric planning task, improving coverage, faithfulness, and strategic coherence in academic peer review. AI-g...

🔹 Publication Date: Published on Jan 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.14171
• PDF: https://arxiv.org/pdf/2601.14171
• Project Page: https://mqleet.github.io/Paper2Rebuttal_ProjectPage/
• Github: https://github.com/AutoLab-SAI-SJTU/Paper2Rebuttal

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
XR: Cross-Modal Agents for Composed Image Retrieval

📝 Summary:
A multi-agent framework for compositional image retrieval that uses specialized agents for generation, filtering, and verification to improve semantic and visual query matching. AI-generated summary R...

🔹 Publication Date: Published on Jan 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.14245
• PDF: https://arxiv.org/pdf/2601.14245
• Github: https://01yzzyu.github.io/xr.github.io/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Facilitating Proactive and Reactive Guidance for Decision Making on the Web: A Design Probe with WebSeek

📝 Summary:
WebSeek is a mixed-initiative browser extension that enables interactive web data extraction and analysis with AI-assisted guidance and automation. AI-generated summary Web AI agents such as ChatGPT A...

🔹 Publication Date: Published on Jan 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.15100
• PDF: https://arxiv.org/pdf/2601.15100

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Media is too big
VIEW IN TELEGRAM
Rethinking Video Generation Model for the Embodied World

📝 Summary:
A comprehensive robotics benchmark evaluates video generation models across multiple task domains and embodiments, revealing deficiencies in physical realism and introducing a large-scale dataset to a...

🔹 Publication Date: Published on Jan 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.15282
• PDF: https://arxiv.org/pdf/2601.15282
• Project Page: https://dagroup-pku.github.io/ReVidgen.github.io/
• Github: https://github.com/DAGroup-PKU/ReVidgen/

Datasets citing this paper:
https://huggingface.co/datasets/DAGroup-PKU/RBench
https://huggingface.co/datasets/DAGroup-PKU/RoVid-X

Spaces citing this paper:
https://huggingface.co/spaces/DAGroup-PKU/RBench-Leaderboard

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
FARE: Fast-Slow Agentic Robotic Exploration

📝 Summary:
FARE is a hierarchical exploration framework that combines large language model reasoning with reinforcement learning control to enable efficient autonomous robot navigation in complex environments. A...

🔹 Publication Date: Published on Jan 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.14681
• PDF: https://arxiv.org/pdf/2601.14681

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
RoboBrain 2.5: Depth in Sight, Time in Mind

📝 Summary:
RoboBrain 2.5 enhances embodied AI through improved 3D spatial reasoning and temporal value estimation for more precise manipulation tasks. AI-generated summary We introduce RoboBrain 2.5, a next-gene...

🔹 Publication Date: Published on Jan 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.14352
• PDF: https://arxiv.org/pdf/2601.14352
• Project Page: https://superrobobrain.github.io/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents

📝 Summary:
MMDeepResearch-Bench evaluates multimodal research agents on report generation with visual evidence, revealing trade-offs between prose quality, citation accuracy, and visual grounding. AI-generated s...

🔹 Publication Date: Published on Jan 18

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.12346
• PDF: https://arxiv.org/pdf/2601.12346
• Github: https://github.com/AIoT-MLSys-Lab/MMDeepResearch-Bench

Datasets citing this paper:
https://huggingface.co/datasets/MMDR-2025/MMdeepresearch

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
FinVault: Benchmarking Financial Agent Safety in Execution-Grounded Environments

📝 Summary:
FinVault presents the first execution-grounded security benchmark for financial agents, revealing significant vulnerabilities in current defense mechanisms when applied to real-world financial workflo...

🔹 Publication Date: Published on Jan 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.07853
• PDF: https://arxiv.org/pdf/2601.07853
• Github: https://github.com/aifinlab/FinVault

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
The Responsibility Vacuum: Organizational Failure in Scaled Agent Systems

📝 Summary:
Modern CI/CD pipelines integrating agent-generated code exhibit a structural failure in responsibility attribution. Decisions are executed through formally correct approval processes, yet no entity po...

🔹 Publication Date: Published on Jan 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.15059
• PDF: https://arxiv.org/pdf/2601.15059

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
AgentEHR: Advancing Autonomous Clinical Decision-Making via Retrospective Summarization

📝 Summary:
AgentEHR is a benchmark for autonomous EHR navigation involving complex clinical decision-making in raw data. The RetroSum framework addresses information loss and fractured reasoning through retrospective summarization and evolving experience strategies. RetroSum improves performance by up to 29...

🔹 Publication Date: Published on Jan 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.13918
• PDF: https://arxiv.org/pdf/2601.13918
• Github: https://github.com/BlueZeros/AgentEHR

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Numina-Lean-Agent: An Open and General Agentic Reasoning System for Formal Mathematics

📝 Summary:
A general coding agent paradigm enables flexible formal theorem proving by directly interfacing with proof assistants and retrieving relevant theorems without task-specific training. AI-generated summ...

🔹 Publication Date: Published on Jan 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.14027
• PDF: https://arxiv.org/pdf/2601.14027
• Project Page: https://demo.projectnumina.ai/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Typhoon OCR: Open Vision-Language Model For Thai Document Extraction

📝 Summary:
Typhoon OCR is an open vision-language model for Thai and English document extraction, tackling complex script and unstructured documents. It achieves high accuracy and layout reconstruction comparable to larger proprietary systems, yet is compact and computationally efficient.

🔹 Publication Date: Published on Jan 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.14722
• PDF: https://arxiv.org/pdf/2601.14722

🔹 Models citing this paper:
https://huggingface.co/typhoon-ai/typhoon-ocr-7b
https://huggingface.co/typhoon-ai/typhoon-ocr1.5-2b
https://huggingface.co/typhoon-ai/typhoon-ocr-3b

Spaces citing this paper:
https://huggingface.co/spaces/doeqoth/typhoon-ocr

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Quantifying Speaker Embedding Phonological Rule Interactions in Accented Speech Synthesis

📝 Summary:
Research investigates the relationship between speaker embeddings and phonological rules in accent control for text-to-speech systems, introducing a metric to measure rule preservation versus embeddin...

🔹 Publication Date: Published on Jan 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.14417
• PDF: https://arxiv.org/pdf/2601.14417
• Project Page: https://sav-eng.github.io/icassp_samples.html

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Typhoon ASR Real-time: FastConformer-Transducer for Thai Automatic Speech Recognition

📝 Summary:
A 115M-parameter FastConformer-Transducer model achieves low-latency Thai speech recognition with reduced computational cost through text normalization and curriculum learning, accompanied by a benchm...

🔹 Publication Date: Published on Jan 19

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.13044
• PDF: https://arxiv.org/pdf/2601.13044

🔹 Models citing this paper:
https://huggingface.co/typhoon-ai/typhoon-asr-realtime
https://huggingface.co/typhoon-ai/typhoon-isan-asr-realtime
https://huggingface.co/typhoon-ai/typhoon-whisper-turbo

Datasets citing this paper:
https://huggingface.co/datasets/typhoon-ai/gigaspeech2-typhoon
https://huggingface.co/datasets/typhoon-ai/TVSpeech

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
sangkuriang: A pseudo-spectral Python library for Korteweg-de Vries soliton simulation

📝 Summary:
The Korteweg-de Vries (KdV) equation serves as a foundational model in nonlinear wave physics, describing the balance between dispersive spreading and nonlinear steepening that gives rise to solitons....

🔹 Publication Date: Published on Jan 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.12029
• PDF: https://arxiv.org/pdf/2601.12029
• Project Page: https://pypi.org/project/sangkuriang-ideal-solver/
• Github: https://github.com/sandyherho/sangkuriang-ideal-solver

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
UltraRAG: A Modular and Automated Toolkit for Adaptive Retrieval-Augmented Generation

📝 Summary:
UltraRAG is a RAG toolkit automating knowledge adaptation across the entire workflow from data to evaluation. It provides a user-friendly WebUI, enabling non-coders to build and optimize RAG systems for diverse scenarios.

🔹 Publication Date: Published on Mar 31, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2504.08761
• PDF: https://arxiv.org/pdf/2504.08761
• Github: https://github.com/OpenBMB/UltraRAG

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#RAG #AI #LLMs #Automation #DataScience
3
Behavior Knowledge Merge in Reinforced Agentic Models

📝 Summary:
Reinforced Agent Merging RAM improves integrating RL agents by distinguishing shared and task-specific parameters. This preserves critical behaviors, outperforming baselines and unlocking synergistic performance beyond specialized agents.

🔹 Publication Date: Published on Jan 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.13572
• PDF: https://arxiv.org/pdf/2601.13572
• Project Page: https://xiangchi-yuan.github.io/ram-project/
• Github: https://github.com/xiangchi-yuan/mrl

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#ReinforcementLearning #MultiAgentSystems #ArtificialIntelligence #DeepLearning #AgenticModels