ML Research Hub
32.3K subscribers
6.75K photos
475 videos
24 files
7.36K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
Geometric Context Transformer for Streaming 3D Reconstruction

📝 Summary:
LingBot-Map is a feed-forward 3D foundation model that reconstructs scenes from video streams using a geometric context transformer architecture with specialized attention mechanisms for coordinate gr...

🔹 Publication Date: Published on Apr 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.14141
• PDF: https://arxiv.org/pdf/2604.14141
• Project Page: https://technology.robbyant.com/lingbot-map
• Github: https://github.com/robbyant/lingbot-map

🔹 Models citing this paper:
https://huggingface.co/robbyant/lingbot-map

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
This media is not supported in your browser
VIEW IN TELEGRAM
Narrative-Driven Paper-to-Slide Generation via ArcDeck

📝 Summary:
ArcDeck is a multi-agent framework for paper-to-slide generation that models a paper's logical flow through discourse trees. It uses an iterative refinement process to ensure narrative coherence and improve presentations over direct summarization methods.

🔹 Publication Date: Published on Apr 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.11969
• PDF: https://arxiv.org/pdf/2604.11969
• Project Page: https://arcdeck.org/
• Github: https://github.com/RehgLab/ArcDeck

Datasets citing this paper:
https://huggingface.co/datasets/ArcDeck/ArcBench

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AIPresentations #NLP #GenerativeAI #ResearchTools #AcademicPublishing
1
HDR Video Generation via Latent Alignment with Logarithmic Encoding

📝 Summary:
This work enables high dynamic range HDR video generation by leveraging pretrained generative models. It uses logarithmic encoding to align HDR imagery with model latent spaces and camera-mimicking degradation training, achieving strong results without architectural redesign or complex retraining.

🔹 Publication Date: Published on Apr 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.11788
• PDF: https://arxiv.org/pdf/2604.11788
• Project Page: https://hdr-lumivid.github.io/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
LangFlow: Continuous Diffusion Rivals Discrete in Language Modeling

📝 Summary:
LangFlow demonstrates that continuous diffusion models can match discrete counterparts in language modeling by leveraging embedding-space flow matching with novel training techniques and noise schedul...

🔹 Publication Date: Published on Apr 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.11748
• PDF: https://arxiv.org/pdf/2604.11748
• Project Page: https://caradryanl.github.io/blog/2026/langflow/
• Github: https://github.com/nealchen2003/LangFlow

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Self-Distillation Zero: Self-Revision Turns Binary Rewards into Dense Supervision

📝 Summary:
Self-Distillation Zero trains a model to transform binary rewards into dense token-level self-supervision through dual-role training and on-policy self-distillation, achieving superior performance in ...

🔹 Publication Date: Published on Apr 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.12002
• PDF: https://arxiv.org/pdf/2604.12002

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Anthropogenic Regional Adaptation in Multimodal Vision-Language Model

📝 Summary:
Vision-language models can be adapted for regional contexts through Anthropogenic Regional Adaptation and GG-EZ method while maintaining global performance and improving cultural relevance. AI-generat...

🔹 Publication Date: Published on Apr 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.11490
• PDF: https://arxiv.org/pdf/2604.11490
• Project Page: https://huggingface.co/collections/SEACrowd/sea-vl-phase-2-multimodal-vision-language-models-for-sea

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
What do Language Models Learn and When? The Implicit Curriculum Hypothesis

📝 Summary:
LLM pretraining follows an Implicit Curriculum Hypothesis, showing a compositional and predictable skill emergence. Capabilities emerge consistently across models, with composite tasks appearing after their components. This order is encoded in representations, allowing prediction of training traj...

🔹 Publication Date: Published on Apr 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.08510
• PDF: https://arxiv.org/pdf/2604.08510
• Github: https://github.com/KaiserWhoLearns/ElementalTask

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
ROSE: An Intent-Centered Evaluation Metric for NL2SQL

📝 Summary:
ROSE is a new NL2SQL metric addressing unreliable Execution Accuracy. It evaluates if predicted SQL answers user intent via a Prover-Refuter cascade, showing superior agreement with human experts.

🔹 Publication Date: Published on Apr 14

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.12988
• PDF: https://arxiv.org/pdf/2604.12988

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#NL2SQL #NLP #EvaluationMetrics #AIResearch #DataScience
Cross-Tokenizer LLM Distillation through a Byte-Level Interface

📝 Summary:
Byte-Level Distillation BLD is a new simple method for cross-tokenizer LLM knowledge transfer. It uses a shared byte-level interface, converting teacher outputs to byte probabilities for student distillation. BLD performs competitively with complex approaches, suggesting the byte level is a natur...

🔹 Publication Date: Published on Apr 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.07466
• PDF: https://arxiv.org/pdf/2604.07466

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation

📝 Summary:
MM-WebAgent is a hierarchical agentic framework that coordinates AIGC-based element generation for coherent and visually consistent webpage design through joint optimization of layout and multimodal c...

🔹 Publication Date: Published on Apr 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.15309
• PDF: https://arxiv.org/pdf/2604.15309
• Github: https://github.com/microsoft/MM-webagent

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
ASGuard: Activation-Scaling Guard to Mitigate Targeted Jailbreaking Attack

📝 Summary:
Activation-Scaling Guard (ASGuard) mitigates brittle refusal behaviors in large language models by identifying and recalibrating specific attention heads vulnerable to tense-based jailbreaking attacks...

🔹 Publication Date: Published on Apr 14

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.25843
• PDF: https://arxiv.org/pdf/2509.25843
• Github: https://github.com/dmis-lab/ASGuard

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
This media is not supported in your browser
VIEW IN TELEGRAM
HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds

📝 Summary:
HY-World 2.0 is a multi-modal world model framework that generates high-fidelity 3D Gaussian Splatting scenes from diverse inputs using specialized modules for panorama generation, trajectory planning...

🔹 Publication Date: Published on Apr 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.14268
• PDF: https://arxiv.org/pdf/2604.14268
• Project Page: https://3d-models.hunyuan.tencent.com/world/
• Github: https://github.com/Tencent-Hunyuan/HY-World-2.0

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
How to Fine-Tune a Reasoning Model? A Teacher-Student Cooperation Framework to Synthesize Student-Consistent SFT Data

📝 Summary:
Teacher-student cooperation data synthesis framework addresses stylistic divergence in synthetic data for improved model fine-tuning performance. AI-generated summary A widely adopted strategy for mod...

🔹 Publication Date: Published on Mar 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.14164
• PDF: https://arxiv.org/pdf/2604.14164
• Github: https://github.com/CoopReason/TESSY

Datasets citing this paper:
https://huggingface.co/datasets/CoopReason/TESSY-Code-80K

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
LongAct: Harnessing Intrinsic Activation Patterns for Long-Context Reinforcement Learning

📝 Summary:
LongAct improves long-context reasoning in LLMs by implementing saliency-guided sparse updates based on high-magnitude activation patterns in query and key vectors. AI-generated summary Reinforcement ...

🔹 Publication Date: Published on Apr 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.14922
• PDF: https://arxiv.org/pdf/2604.14922

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
UniDoc-RL: Coarse-to-Fine Visual RAG with Hierarchical Actions and Dense Rewards

📝 Summary:
UniDoc-RL introduces a reinforcement learning framework for LVLMs that jointly optimizes retrieval, reranking, visual perception, and reasoning through hierarchical decision-making and dense multi-rew...

🔹 Publication Date: Published on Apr 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.14967
• PDF: https://arxiv.org/pdf/2604.14967
• Github: https://github.com/deepglint/UniDoc-RL

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
C2: Scalable Rubric-Augmented Reward Modeling from Binary Preferences

📝 Summary:
Cooperative yet Critical reward modeling (C2) enhances reward model reliability by enabling critical collaboration between a reward model and a rubric generator trained exclusively from binary prefere...

🔹 Publication Date: Published on Apr 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.13618
• PDF: https://arxiv.org/pdf/2604.13618
• Github: https://github.com/asahi-research/C2

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
LeapAlign: Post-Training Flow Matching Models at Any Generation Step by Building Two-Step Trajectories

📝 Summary:
LeapAlign improves flow matching model fine-tuning by reducing computational costs and enabling stable gradient propagation through shortened trajectory steps while maintaining alignment with human pr...

🔹 Publication Date: Published on Apr 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.15311
• PDF: https://arxiv.org/pdf/2604.15311
• Project Page: https://rockeycoss.github.io/leapalign/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
DR^{3}-Eval: Towards Realistic and Reproducible Deep Research Evaluation

📝 Summary:
DR$^{3}$-Eval is a benchmark for evaluating deep research agents on multimodal, multi-file report generation, featuring a realistic simulation of web environments and a comprehensive evaluation framew...

🔹 Publication Date: Published on Apr 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.14683
• PDF: https://arxiv.org/pdf/2604.14683
• Github: https://github.com/NJU-LINK/DR3-Eval

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Dive into Claude Code: The Design Space of Today's and Future AI Agent Systems

📝 Summary:
The study analyzes Claude Code's architecture, identifying five motivating human values and tracing them through thirteen design principles to specific implementation choices, including a core while-l...

🔹 Publication Date: Published on Apr 14

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.14228
• PDF: https://arxiv.org/pdf/2604.14228
• Github: https://github.com/VILA-Lab/Dive-into-Claude-Code

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
KV Packet: Recomputation-Free Context-Independent KV Caching for LLMs

📝 Summary:
KV Packet is a cache reuse framework that eliminates recomputation overhead in large language models by treating cached documents as immutable packets with trainable soft-token adapters. AI-generated ...

🔹 Publication Date: Published on Apr 14

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.13226
• PDF: https://arxiv.org/pdf/2604.13226
• Github: https://github.com/ChuangtaoChen-TUM/KVPacket

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
This media is not supported in your browser
VIEW IN TELEGRAM
RAD-2: Scaling Reinforcement Learning in a Generator-Discriminator Framework

📝 Summary:
A unified generator-discriminator framework for autonomous driving motion planning that improves stability and performance through diffusion-based trajectory generation and reinforcement learning opti...

🔹 Publication Date: Published on Apr 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2604.15308
• PDF: https://arxiv.org/pdf/2604.15308
• Project Page: https://hgao-cv.github.io/RAD-2/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research