Data Science | Machine Learning with Python for Researchers
31.4K subscribers
1.53K photos
102 videos
22 files
1.81K links
Admin: @HusseinSheikho

The Data Science and Python channel is for researchers and advanced programmers

Buy ads: https://telega.io/c/dataScienceT
Download Telegram
🔹 Title: Discrete Noise Inversion for Next-scale Autoregressive Text-based Image Editing

🔹 Publication Date: Published on Sep 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.01984
• PDF: https://arxiv.org/pdf/2509.01984

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: Improving Large Vision and Language Models by Learning from a Panel of Peers

🔹 Publication Date: Published on Sep 1

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.01610
• PDF: https://arxiv.org/pdf/2509.01610

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
1
🔹 Title: VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

🔹 Publication Date: Published on Sep 1

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.01055
• PDF: https://arxiv.org/pdf/2509.01055
• Github: https://github.com/TIGER-AI-Lab/verl-tool

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: C-DiffDet+: Fusing Global Scene Context with Generative Denoising for High-Fidelity Object Detection

🔹 Publication Date: Published on Aug 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.00578
• PDF: https://arxiv.org/pdf/2509.00578

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis

🔹 Publication Date: Published on Jul 31

🔹 Abstract: A novel framework uses a Direct 4DMesh-to-GS Variation Field VAE and Gaussian Variation Field diffusion model to generate high-quality dynamic 3D content from single video inputs, demonstrating superior quality and generalization. AI-generated summary In this paper, we present a novel framework for video-to-4D generation that creates high-quality dynamic 3D content from single video inputs. Direct 4D diffusion modeling is extremely challenging due to costly data construction and the high-dimensional nature of jointly representing 3D shape, appearance, and motion. We address these challenges by introducing a Direct 4DMesh-to-GS Variation Field VAE that directly encodes canonical Gaussian Splats (GS) and their temporal variations from 3D animation data without per-instance fitting, and compresses high-dimensional animations into a compact latent space. Building upon this efficient representation, we train a Gaussian Variation Field diffusion model with temporal-aware Diffusion Transformer conditioned on input videos and canonical GS. Trained on carefully-curated animatable 3D objects from the Objaverse dataset , our model demonstrates superior generation quality compared to existing methods. It also exhibits remarkable generalization to in-the-wild video inputs despite being trained exclusively on synthetic data, paving the way for generating high-quality animated 3D content. Project page: https://gvfdiffusion.github.io/.

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2507.23785

• PDF: https://arxiv.org/pdf/2507.23785

• Project Page: https://gvfdiffusion.github.io/

• Github: https://github.com/ForeverFancy/gvfdiffusion

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
1
🔹 Title: Baichuan-M2: Scaling Medical Capability with Large Verifier System

🔹 Publication Date: Published on Sep 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.02208
• PDF: https://arxiv.org/pdf/2509.02208

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: GenCompositor: Generative Video Compositing with Diffusion Transformer

🔹 Publication Date: Published on Sep 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.02460
• PDF: https://arxiv.org/pdf/2509.02460
• Project Page: https://gencompositor.github.io/
• Github: https://github.com/TencentARC/GenCompositor

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: AMBEDKAR-A Multi-level Bias Elimination through a Decoding Approach with Knowledge Augmentation for Robust Constitutional Alignment of Language Models

🔹 Publication Date: Published on Sep 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.02133
• PDF: https://arxiv.org/pdf/2509.02133

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic

🔹 Publication Date: Published on Sep 1

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.01363
• PDF: https://arxiv.org/pdf/2509.01363

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: SQL-of-Thought: Multi-agentic Text-to-SQL with Guided Error Correction

🔹 Publication Date: Published on Aug 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.00581
• PDF: https://arxiv.org/pdf/2509.00581

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: Metis: Training Large Language Models with Advanced Low-Bit Quantization

🔹 Publication Date: Published on Aug 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.00404
• PDF: https://arxiv.org/pdf/2509.00404

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: Fantastic Pretraining Optimizers and Where to Find Them

🔹 Publication Date: Published on Sep 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.02046
• PDF: https://arxiv.org/pdf/2509.02046

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
1
🔹 Title: Benchmarking Optimizers for Large Language Model Pretraining

🔹 Publication Date: Published on Sep 1

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.01440
• PDF: https://arxiv.org/pdf/2509.01440
• Github: https://github.com/epfml/llm-optimizer-benchmark

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: The Gold Medals in an Empty Room: Diagnosing Metalinguistic Reasoning in LLMs with Camlang

🔹 Publication Date: Published on Aug 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.00425
• PDF: https://arxiv.org/pdf/2509.00425

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: MobiAgent: A Systematic Framework for Customizable Mobile Agents

🔹 Publication Date: Published on Aug 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.00531
• PDF: https://arxiv.org/pdf/2509.00531
• Github: https://github.com/IPADS-SAI/MobiAgent/releases/download/v1.0/Mobiagent.apk

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: ViSTA-SLAM: Visual SLAM with Symmetric Two-view Association

🔹 Publication Date: Published on Sep 1

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.01584
• PDF: https://arxiv.org/pdf/2509.01584
• Project Page: https://ganlinzhang.xyz/vista-slam/
• Github: https://github.com/zhangganlin/vista-slam

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: DCPO: Dynamic Clipping Policy Optimization

🔹 Publication Date: Published on Sep 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.02333
• PDF: https://arxiv.org/pdf/2509.02333
• Github: https://github.com/lime-RL/DCPO

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: FastFit: Accelerating Multi-Reference Virtual Try-On via Cacheable Diffusion Models

🔹 Publication Date: Published on Aug 28

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.20586
• PDF: https://arxiv.org/pdf/2508.20586
• Project Page: https://github.com/Zheng-Chong/FastFit
• Github: https://github.com/Zheng-Chong/FastFit

🔹 Datasets citing this paper:
https://huggingface.co/datasets/zhengchong/DressCode-MR

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
2
🔹 Title: ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding

🔹 Publication Date: Published on Aug 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/pdf/2508.21496
• PDF: https://arxiv.org/pdf/2508.21496
• Github: https://github.com/hlsv02/ELV-Halluc

🔹 Datasets citing this paper:
https://huggingface.co/datasets/HLSv/ELV-Halluc

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: DynaGuard: A Dynamic Guardrail Model With User-Defined Policies

🔹 Publication Date: Published on Sep 2

🔹 Paper Links:
• arXiv Page: https://huggingface.co/collections/tomg-group-umd/dynaguard-68af4d916ae81d06ef774523
• PDF: https://arxiv.org/pdf/2509.02563
• Project Page: https://taruschirag.github.io/DynaGuard/
• Github: https://github.com/montehoover/DynaGuard

🔹 Datasets citing this paper:
https://huggingface.co/datasets/tomg-group-umd/DynaBench

🔹 Spaces citing this paper:
https://huggingface.co/spaces/tomg-group-umd/DynaGuard
==================================

For more data science resources:
https://t.iss.one/DataScienceT