ML Research Hub

✨Thinking with Comics: Enhancing Multimodal Reasoning through Structured Visual Storytelling

📝 Summary:
Thinking with Comics emerges as an effective visual reasoning approach that bridges images and videos by leveraging comic structures for improved multimodal reasoning efficiency and performance. AI-ge...

🔹 Publication Date: Published on Feb 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.02453
• PDF: https://arxiv.org/pdf/2602.02453
• Project Page: https://thinking-with-comics.github.io/
• Github: https://github.com/andongBlue/Think-with-Comics

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #MultimodalAI #VisualReasoning #Comics #ComputerVision

140 views14:11

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨ParalESN: Enabling parallel information processing in Reservoir Computing

📝 Summary:
Parallel Echo State Network (ParalESN) addresses reservoir computing limitations by enabling parallel temporal processing through diagonal linear recurrence, maintaining theoretical guarantees while a...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.22296
• PDF: https://arxiv.org/pdf/2601.22296

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

134 views14:11

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Internal Flow Signatures for Self-Checking and Refinement in LLMs

📝 Summary:
Internal flow signatures analyze depthwise dynamics in large language models to enable self-checking and targeted refinement without modifying the base model. AI-generated summary Large language model...

🔹 Publication Date: Published on Feb 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.01897
• PDF: https://arxiv.org/pdf/2602.01897
• Github: https://github.com/EavnJeong/Internal-Flow-Signatures-for-Self-Checking-and-Refinement-in-LLMs

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

164 views14:11

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Diagnosing the Reliability of LLM-as-a-Judge via Item Response Theory

📝 Summary:
A two-phase diagnostic framework based on Item Response Theory and Graded Response Model is introduced to assess the reliability of LLM-as-a-Judge by examining intrinsic consistency and human alignmen...

🔹 Publication Date: Published on Jan 31

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.00521
• PDF: https://arxiv.org/pdf/2602.00521

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

198 views14:11

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Cross-Lingual Stability of LLM Judges Under Controlled Generation: Evidence from Finno-Ugric Languages

📝 Summary:
Controlled cross-lingual evaluation reveals instability in LLM assessment methods when targeting morphologically rich languages, indicating unreliable zero-shot judge transfer for discourse-level task...

🔹 Publication Date: Published on Feb 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.02287
• PDF: https://arxiv.org/pdf/2602.02287
• Github: https://github.com/isaac-chung/

✨ Datasets citing this paper:
• https://huggingface.co/datasets/isaacchung/controlled-generated-convos-gpt-4.1-mini

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

163 views15:11

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Fast Autoregressive Video Diffusion and World Models with Temporal Cache Compression and Sparse Attention

📝 Summary:
Autoregressive video diffusion models face efficiency challenges due to growing KV caches and redundant attention computations, which are addressed through TempCache, AnnCA, and AnnSA techniques that ...

🔹 Publication Date: Published on Feb 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.01801
• PDF: https://arxiv.org/pdf/2602.01801
• Project Page: https://dvirsamuel.github.io/fast-auto-regressive-video/
• Github: https://dvirsamuel.github.io/fast-auto-regressive-video/

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

❤1

173 views15:11

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Generative Visual Code Mobile World Models

📝 Summary:
Visual world models for mobile GUI agents are improved through renderable code generation using vision-language models, achieving better performance with reduced model size compared to existing approa...

🔹 Publication Date: Published on Feb 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.01576
• PDF: https://arxiv.org/pdf/2602.01576

🔹 Models citing this paper:
• https://huggingface.co/trillionlabs/gWorld-8B
• https://huggingface.co/trillionlabs/gWorld-32B

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

157 views16:11

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Sparse Reward Subsystem in Large Language Models

📝 Summary:
Research identifies a sparse reward subsystem in LLM hidden states containing value neurons that represent internal state expectations and dopamine-like neurons encoding reward prediction errors. AI-g...

🔹 Publication Date: Published on Feb 1

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.00986
• PDF: https://arxiv.org/pdf/2602.00986

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

142 views16:12

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Hunt Instead of Wait: Evaluating Deep Data Research on Large Language Models

📝 Summary:
Agentic large language models require investigatory intelligence for autonomous data analysis, demonstrated through the Deep Data Research benchmark that evaluates their ability to extract insights fr...

🔹 Publication Date: Published on Feb 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.02039
• PDF: https://arxiv.org/pdf/2602.02039
• Project Page: https://huggingface.co/spaces/thinkwee/DDR_Bench
• Github: https://github.com/thinkwee/DDR_Bench

✨ Datasets citing this paper:
• https://huggingface.co/datasets/thinkwee/DDRBench_10K

✨ Spaces citing this paper:
• https://huggingface.co/spaces/thinkwee/DDR_Bench

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

193 views16:12

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Rethinking Generative Recommender Tokenizer: Recsys-Native Encoding and Semantic Quantization Beyond LLMs

📝 Summary:
ReSID introduces a recommendation-native framework to improve sequential recommenders. It learns predictive item representations and optimizes quantization for better information preservation and sequential predictability without LLMs. ReSID significantly outperforms baselines by over 10% and red...

🔹 Publication Date: Published on Feb 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.02338
• PDF: https://arxiv.org/pdf/2602.02338

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

187 views17:12

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Clipping-Free Policy Optimization for Large Language Models

📝 Summary:
Clipping-Free Policy Optimization replaces heuristic clipping with convex quadratic penalty to stabilize reinforcement learning training for large language models without performance loss. AI-generate...

🔹 Publication Date: Published on Jan 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.22801
• PDF: https://arxiv.org/pdf/2601.22801

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

❤1

204 views17:12

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Gaming the Judge: Unfaithful Chain-of-Thought Can Undermine Agent Evaluation

📝 Summary:
Large language models used as judges for agent performance evaluation are vulnerable to manipulation of reasoning traces, with content-based fabrications being more effective than style-based alterati...

🔹 Publication Date: Published on Jan 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.14691
• PDF: https://arxiv.org/pdf/2601.14691

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

🙏1

208 views18:12

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨A Semantically Consistent Dataset for Data-Efficient Query-Based Universal Sound Separation

📝 Summary:
Automated pipeline for sound separation using high-purity single-event segments from in-the-wild datasets achieves competitive performance with significantly reduced data requirements. AI-generated su...

🔹 Publication Date: Published on Jan 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.22599
• PDF: https://arxiv.org/pdf/2601.22599
• Project Page: https://shandaai.github.io/Hive
• Github: https://github.com/ShandaAI/Hive

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

182 views19:12

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Where to Attend: A Principled Vision-Centric Position Encoding with Parabolas

📝 Summary:
Parabolic Position Encoding (PaPE) is a novel position encoding method for vision modalities that improves upon existing approaches by incorporating translation invariance, rotation invariance, distan...

🔹 Publication Date: Published on Feb 1

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.01418
• PDF: https://arxiv.org/pdf/2602.01418
• Project Page: https://chrisohrstrom.github.io/parabolic-position-encoding/
• Github: https://github.com/DTU-PAS/parabolic-position-encoding

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

160 views20:13

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨YOLOE-26: Integrating YOLO26 with YOLOE for Real-Time Open-Vocabulary Instance Segmentation

📝 Summary:
YOLOE-26 integrates YOLO26 architecture with open-vocabulary learning for real-time instance segmentation, utilizing convolutional backbones, end-to-end regression, and object embedding heads with tex...

🔹 Publication Date: Published on Jan 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.00168
• PDF: https://arxiv.org/pdf/2602.00168

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

156 views20:13

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability

📝 Summary:
A new reinforcement learning framework trains LLMs for divide-and-conquer reasoning. This method decomposes complex problems, significantly elevating test-time scalability and outperforming traditional chain-of-thought approaches on challenging benchmarks.

🔹 Publication Date: Published on Feb 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.02477
• PDF: https://arxiv.org/pdf/2602.02477
• Github: https://github.com/MasterVito/DAC-RL

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

137 views21:13

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨CUA-Skill: Develop Skills for Computer Using Agent

📝 Summary:
CUA-Skill introduces a large-scale library of engineered computer-use skills that enhance agent performance and efficiency on Windows-based tasks. AI-generated summary Computer-Using Agents (CUAs) aim...

🔹 Publication Date: Published on Jan 28

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.21123
• PDF: https://arxiv.org/pdf/2601.21123
• Project Page: https://microsoft.github.io/cua_skill/
• Github: https://github.com/microsoft/cua_skill

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

148 views21:13

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨VoxServe: Streaming-Centric Serving System for Speech Language Models

📝 Summary:
VoxServe is a unified serving system for SpeechLMs that optimizes streaming performance. It uses model-execution abstraction, streaming-aware scheduling, and asynchronous inference pipelines. This achieves 10-20x higher throughput at comparable latency for diverse SpeechLM architectures.

🔹 Publication Date: Published on Jan 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.00269
• PDF: https://arxiv.org/pdf/2602.00269
• Project Page: https://vox-serve.github.io/
• Github: https://github.com/vox-serve/vox-serve

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

164 views22:13

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Competing Visions of Ethical AI: A Case Study of OpenAI

📝 Summary:
This study analyzed OpenAI's public discourse on ethical AI. It found OpenAI primarily frames the discussion around safety and risk, largely avoiding academic ethics frameworks. This indicates a distinct approach to AI ethics in industry communications.

🔹 Publication Date: Published on Jan 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.16513
• PDF: https://arxiv.org/pdf/2601.16513

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

148 views02:00

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding

📝 Summary:
Multimodal LLMs can effectively understand source code represented as compressed images, achieving up to 8x token reduction. This method leverages visual cues and sometimes outperforms text inputs, indicating a path to more efficient code comprehension.

🔹 Publication Date: Published on Feb 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.01785
• PDF: https://arxiv.org/pdf/2602.01785

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

99 views03:00

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Research on World Models Is Not Merely Injecting World Knowledge into Specific Tasks

📝 Summary:
Current world models lack unified frameworks despite task-specific advances, necessitating a comprehensive approach integrating interaction, perception, symbolic reasoning, and spatial representation....

🔹 Publication Date: Published on Feb 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.01630
• PDF: https://arxiv.org/pdf/2602.01630
• Github: https://github.com/OpenDCAI/DataFlow-MM

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

75 views03:00

✨ Explore Data Science 📝 Write your paper

About

Blog

Apps

Platform