ML Research Hub – Telegram

ML Research Hub

32.3K subscribers

6.33K photos

424 videos

24 files

6.87K links

Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho

Download Telegram

About

Blog

Apps

Platform

ML Research Hub

32.3K subscribers

ML Research Hub

Media is too big

VIEW IN TELEGRAM

✨CutClaw: Agentic Hours-Long Video Editing via Music Synchronization

📝 Summary:
CutClaw is an autonomous multi-agent framework that uses multimodal language models to automatically edit long video footage into rhythmic, narratively consistent short videos with synchronized audio ...

🔹 Publication Date: Published on Mar 31

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.29664
• PDF: https://arxiv.org/pdf/2603.29664
• Project Page: https://github.com/GVCLab/CutClaw
• Github: https://github.com/GVCLab/CutClaw

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

99 views03:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨VectorGym: A Multitask Benchmark for SVG Code Generation, Sketching, and Editing

📝 Summary:
VectorGym presents a comprehensive benchmark suite for scalable vector graphics encompassing text-to-svg generation, sketch-to-svg conversion, complex svg editing, and visual understanding tasks with ...

🔹 Publication Date: Published on Feb 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.29852
• PDF: https://arxiv.org/pdf/2603.29852

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

108 views03:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Learn2Fold: Structured Origami Generation with World Model Planning

📝 Summary:
A neuro-symbolic framework called Learn2Fold generates physically valid origami folding sequences from text by combining language model semantic proposals with graph-structured world model verificatio...

🔹 Publication Date: Published on Feb 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.29585
• PDF: https://arxiv.org/pdf/2603.29585

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

129 views03:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

📝 Summary:
FIPO enhances reinforcement learning for language models by using discounted future-KL divergence to improve credit assignment and extend reasoning chains, achieving better mathematical problem-solvin...

🔹 Publication Date: Published on Mar 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.19835
• PDF: https://arxiv.org/pdf/2603.19835
• Project Page: https://qwen-pilot.notion.site/fipo
• Github: https://github.com/qwenpilot/FIPO

🔹 Models citing this paper:
• https://huggingface.co/QwenPilot/FIPO_32B

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

116 views04:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis

📝 Summary:
Unify-Agent integrates agent-based modeling with multimodal understanding to enhance image synthesis through reasoning, searching, and generation processes grounded in external knowledge. AI-generated...

🔹 Publication Date: Published on Mar 31

🔹 Paper Links:
• arXiv Page: https://arxiv.org/pdf/2603.29620
• PDF: https://arxiv.org/pdf/2603.29620
• Github: https://github.com/shawn0728/Unify-Agent

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

114 views04:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨GEMS: Agent-Native Multimodal Generation with Memory and Skills

📝 Summary:
GEMS is an agent-native multimodal generation framework that enhances model capabilities through structured multi-agent optimization, persistent memory, and domain-specific skills across general and d...

🔹 Publication Date: Published on Mar 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.28088
• PDF: https://arxiv.org/pdf/2603.28088
• Project Page: https://gems-gen.github.io/
• Github: https://github.com/lcqysl/GEMS

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

161 views04:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨FlowPIE: Test-Time Scientific Idea Evolution with Flow-Guided Literature Exploration

📝 Summary:
FlowPIE is a novel retrieval-generation framework for scientific idea generation. It uses flow-guided Monte Carlo Tree Search for literature exploration and an evolutionary process to produce diverse, high-quality, and novel ideas by integrating cross-domain knowledge.

🔹 Publication Date: Published on Mar 31

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.29557
• PDF: https://arxiv.org/pdf/2603.29557
• Project Page: https://flowpie.wangqiyao.me/
• Github: https://github.com/AIforIP/FlowPIE

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

135 views05:04

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Extend3D: Town-Scale 3D Generation

📝 Summary:
An object-centric 3D generative model is extended with adaptive latent space and iterative refinement to generate complete 3D scenes from single images, incorporating noise-aware completion and 3D-awa...

🔹 Publication Date: Published on Mar 31

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.29387
• PDF: https://arxiv.org/pdf/2603.29387
• Project Page: https://seungwoo-yoon.github.io/extend3d-page/
• Github: https://github.com/SNU-VGILab/Extend3D

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

155 views05:04

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨MMFace-DiT: A Dual-Stream Diffusion Transformer for High-Fidelity Multimodal Face Generation

📝 Summary:
A unified dual-stream diffusion transformer model enables synergistic multimodal face synthesis by jointly processing spatial and semantic tokens through shared attention mechanisms while maintaining ...

🔹 Publication Date: Published on Mar 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.29029
• PDF: https://arxiv.org/pdf/2603.29029
• Project Page: https://vcbsl.github.io/MMFace-DiT/
• Github: https://github.com/Bharath-K3/MMFace-DiT

🔹 Models citing this paper:
• https://huggingface.co/BharathK333/MMFace-DiT-Models

✨ Datasets citing this paper:
• https://huggingface.co/datasets/BharathK333/MMFace-DiT-Datasets

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

204 views05:04

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨How Auditory Knowledge in LLM Backbones Shapes Audio Language Models: A Holistic Evaluation

📝 Summary:
This paper explores how much auditory knowledge LLMs acquire from text-only pre-training and its effect on audio language models. They found that auditory knowledge varies substantially and text-only results strongly correlate with audio performance.

🔹 Publication Date: Published on Mar 19

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.19195
• PDF: https://arxiv.org/pdf/2603.19195
• Project Page: https://kehanlu.github.io/AKB
• Github: https://github.com/kehanlu/AKB

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#LLMs #AudioAI #NLP #DeepLearning #AIResearch

178 views08:21

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨daVinci-LLM:Towards the Science of Pretraining

📝 Summary:
daVinci-LLM explores pretraining with industrial resources and an open science approach. It demonstrates that data processing depth and adaptive curriculum strategies significantly impact model capabilities, releasing full processes for community advancement.

🔹 Publication Date: Published on Mar 28

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.27164
• PDF: https://arxiv.org/pdf/2603.27164
• Github: https://github.com/GAIR-NLP/daVinci-LLM

🔹 Models citing this paper:
• https://huggingface.co/SII-GAIR-NLP/davinci-llm-model

✨ Datasets citing this paper:
• https://huggingface.co/datasets/SII-GAIR-NLP/davinci-llm-data

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#LLM #Pretraining #OpenScience #AI #MachineLearning

125 views09:21

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨MonitorBench: A Comprehensive Benchmark for Chain-of-Thought Monitorability in Large Language Models

📝 Summary:
MonitorBench is a comprehensive benchmark for evaluating LLM chain of thought monitorability. It reveals monitorability decreases when structural reasoning is not required, and both open and closed source models exhibit reduced monitorability under stress testing.

🔹 Publication Date: Published on Mar 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.28590
• PDF: https://arxiv.org/pdf/2603.28590
• Github: https://github.com/ASTRAL-Group/MonitorBench

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#MonitorBench #LLM #ChainOfThought #Monitorability #AIResearch

120 views09:22

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨PoseDreamer: Scalable and Photorealistic Human Data Generation Pipeline with Diffusion Models

📝 Summary:
PoseDreamer uses diffusion models to generate large-scale, photorealistic synthetic 3D human mesh datasets with improved image quality. Models trained on this data achieve comparable or superior performance to those using real or traditional synthetic datasets, offering a scalable solution.

🔹 Publication Date: Published on Mar 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.28763
• PDF: https://arxiv.org/pdf/2603.28763
• Project Page: https://prosperolo.github.io/posedreamer

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#DiffusionModels #SyntheticData #3DGeneration #ComputerVision #AIResearch

❤1

117 views09:22

✨ Explore Data Science 📝 Write your paper

ML Research Hub

This media is not supported in your browser

VIEW IN TELEGRAM

✨ArtHOI: Taming Foundation Models for Monocular 4D Reconstruction of Hand-Articulated-Object Interactions

📝 Summary:
ArtHOI presents an optimization-based framework that integrates foundation model priors to reconstruct 4D human-articulated-object interactions from single monocular RGB videos using adaptive sampling...

🔹 Publication Date: Published on Mar 26

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.25791
• PDF: https://arxiv.org/pdf/2603.25791
• Project Page: https://arthoi-reconstruction.github.io/
• Github: https://github.com/hitcs-zikaiwang/ArtHOI-4D-Reconstruction

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#4DReconstruction #FoundationModels #ComputerVision #HumanObjectInteraction #AI

115 views09:22

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Distilling Human-Aligned Privacy Sensitivity Assessment from Large Language Models

📝 Summary:
This paper distills large language models into lightweight encoders for efficient privacy evaluation of textual data. These models maintain strong human agreement while significantly reducing computational costs, enabling practical large-scale privacy assessment.

🔹 Publication Date: Published on Mar 31

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.29497
• PDF: https://arxiv.org/pdf/2603.29497
• Github: https://github.com/gabrielloiseau/privacy-distillation

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#LLM #Privacy #MachineLearning #NLP #DataScience

124 views09:22

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Ghost-FWL: A Large-Scale Full-Waveform LiDAR Dataset for Ghost Detection and Removal

📝 Summary:
This paper introduces Ghost-FWL, the first large-scale full-waveform LiDAR dataset for ghost point detection and removal. It leverages FWL data and a self-supervised learning approach to significantly improve LiDAR-based SLAM and 3D object detection accuracy by effectively removing false reflecti...

🔹 Publication Date: Published on Mar 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.28224
• PDF: https://arxiv.org/pdf/2603.28224
• Project Page: https://keio-csg.github.io/Ghost-FWL/
• Github: https://github.com/Keio-CSG/Ghost-FWL

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#LiDAR #GhostDetection #SLAM #3DObjectDetection #SelfSupervisedLearning

186 views09:23

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR

📝 Summary:
LLM-based ASR improves with multimodal conversational context, especially for entities. Raw audio context is costly, so Abstract Compression replaces prior-turn audio with fixed latent tokens, retaining transcripts. This reduces computational cost while recovering some performance gains.

🔹 Publication Date: Published on Mar 27

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.26246
• PDF: https://arxiv.org/pdf/2603.26246

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#LLM #ASR #SpeechRecognition #NLP #AI

246 views09:23

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨It Takes Two: A Duet of Periodicity and Directionality for Burst Flicker Removal

📝 Summary:
Flicker artifacts in short-exposure photos are addressed by Flickerformer, a transformer-based architecture. It leverages flicker's intrinsic periodicity and directionality to effectively remove artifacts without introducing ghosting, outperforming existing methods.

🔹 Publication Date: Published on Mar 24

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.22794
• PDF: https://arxiv.org/pdf/2603.22794

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#ImageProcessing #DeepLearning #ComputerVision #Transformers #FlickerRemoval

188 views12:23

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development

📝 Summary:
Medical imaging datasets are fragmented and small, limiting foundation model development. This survey of 1000+ open-access datasets proposes a metadata-driven fusion paradigm to integrate them, creating larger resources. This scales medical imaging data for more capable foundation models.

🔹 Publication Date: Published on Mar 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.27460
• PDF: https://arxiv.org/pdf/2603.27460
• Project Page: https://huggingface.co/datasets/General-Medical-AI/Project-Imaging-X
• Github: https://github.com/uni-medical/Project-Imaging-X

✨ Datasets citing this paper:
• https://huggingface.co/datasets/General-Medical-AI/Project-Imaging-X

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#MedicalImaging #FoundationModels #AI #DataScience #OpenData

❤1

145 views13:23

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Falcon Perception

📝 Summary:
Falcon Perception introduces a unified early-fusion Transformer that processes images and text within a single architecture from the first layer. This simplifies perception systems and achieves improved mask prediction and OCR performance, outperforming traditional modular designs.

🔹 Publication Date: Published on Mar 28

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.27365
• PDF: https://arxiv.org/pdf/2603.27365

✨ Datasets citing this paper:
• https://huggingface.co/datasets/tiiuae/PBench

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

105 views13:24

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨CREval: An Automated Interpretable Evaluation for Creative Image Manipulation under Complex Instructions

📝 Summary:
A fully automated question-answer based evaluation pipeline and comprehensive benchmark are introduced for assessing creative image manipulation tasks under complex instructions, demonstrating strong ...

🔹 Publication Date: Published on Mar 27

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2603.26174
• PDF: https://arxiv.org/pdf/2603.26174
• Github: https://github.com/ChonghuinanWang/CREval

✨ Datasets citing this paper:
• https://huggingface.co/datasets/ChonghuinanWang/CREval

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

114 views13:24

✨ Explore Data Science 📝 Write your paper