Data Science | Machine Learning with Python for Researchers
31.7K subscribers
1.78K photos
102 videos
22 files
2.06K links
Admin: @HusseinSheikho

The Data Science and Python channel is for researchers and advanced programmers

Buy ads: https://telega.io/c/dataScienceT
Download Telegram
🔹 Title: FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark

🔹 Publication Date: Published on Sep 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.09680
• PDF: https://arxiv.org/pdf/2509.09680
• Project Page: https://flux-reason-6m.github.io/
• Github: https://github.com/rongyaofang/prism-bench

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

🔹 Publication Date: Published on Sep 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.09674
• PDF: https://arxiv.org/pdf/2509.09674
• Github: https://github.com/PRIME-RL/SimpleVLA-RL

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

🔹 Publication Date: Published on Sep 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.09265
• PDF: https://arxiv.org/pdf/2509.09265
• Project Page: https://empgseed-seed.github.io/
• Github: https://empgseed-seed.github.io/

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: Gradient-Attention Guided Dual-Masking Synergetic Framework for Robust Text-based Person Retrieval

🔹 Publication Date: Published on Sep 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.09118
• PDF: https://arxiv.org/pdf/2509.09118
• Github: https://github.com/Multimodal-Representation-Learning-MRL/GA-DMS

🔹 Datasets citing this paper:
https://huggingface.co/datasets/Kaichengalex/WebPerson-5M
https://huggingface.co/datasets/Kaichengalex/WebPerson-1M

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: SpatialVID: A Large-Scale Video Dataset with Spatial Annotations

🔹 Publication Date: Published on Sep 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.09676
• PDF: https://arxiv.org/pdf/2509.09676
• Project Page: https://nju-3dv.github.io/projects/SpatialVID/

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering

🔹 Publication Date: Published on Sep 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.09614
• PDF: https://arxiv.org/pdf/2509.09614
• Github: https://github.com/SalesforceAIResearch/LoCoBench

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: Kling-Avatar: Grounding Multimodal Instructions for Cascaded Long-Duration Avatar Animation Synthesis

🔹 Publication Date: Published on Sep 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.09595
• PDF: https://arxiv.org/pdf/2509.09595

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: OmniEVA: Embodied Versatile Planner via Task-Adaptive 3D-Grounded and Embodiment-aware Reasoning

🔹 Publication Date: Published on Sep 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.09332
• PDF: https://arxiv.org/pdf/2509.09332
• Project Page: https://omnieva.github.io/

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: Visual Programmability: A Guide for Code-as-Thought in Chart Understanding

🔹 Publication Date: Published on Sep 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.09286
• PDF: https://arxiv.org/pdf/2509.09286
• Github: https://github.com/Aphelios-Tang/Code-as-Thought

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward

🔹 Publication Date: Published on Sep 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.07430
• PDF: https://arxiv.org/pdf/2509.07430

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: 2D Gaussian Splatting with Semantic Alignment for Image Inpainting

🔹 Publication Date: Published on Sep 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.01964
• PDF: https://arxiv.org/pdf/2509.01964
• Github: https://github.com/hitlhy715/2DGS_inpaint

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: Can Understanding and Generation Truly Benefit Together -- or Just Coexist?

🔹 Publication Date: Published on Sep 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.09666
• PDF: https://arxiv.org/pdf/2509.09666

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

🔹 Publication Date: Published on Sep 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.09372
• PDF: https://arxiv.org/pdf/2509.09372
• Project Page: https://vla-adapter.github.io/
• Github: https://vla-adapter.github.io/

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
2
🔹 Title: Reasoning Introduces New Poisoning Attacks Yet Makes Them More Complicated

🔹 Publication Date: Published on Sep 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.05739
• PDF: https://arxiv.org/pdf/2509.05739

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: Modality Alignment with Multi-scale Bilateral Attention for Multimodal Recommendation

🔹 Publication Date: Published on Sep 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.09114
• PDF: https://arxiv.org/pdf/2509.09114
• Github: https://github.com/rkl71/MambaRec

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: mmBERT: A Modern Multilingual Encoder with Annealed Language Learning

🔹 Publication Date: Published on Sep 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.06888
• PDF: https://arxiv.org/pdf/2509.06888

🔹 Datasets citing this paper:
https://huggingface.co/datasets/jhu-clsp/mmBERT-midtraining-data
https://huggingface.co/datasets/jhu-clsp/mmBERT-pretrain-p2-fineweb2-remaining
https://huggingface.co/datasets/jhu-clsp/mmBERT-pretrain-p3-others
https://huggingface.co/datasets/jhu-clsp/mmBERT-pretrain-p1-fineweb2-langs

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: Cross-Domain Evaluation of Transformer-Based Vulnerability Detection on Open & Industry Data

🔹 Publication Date: Published on Sep 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.09313
• PDF: https://arxiv.org/pdf/2509.09313

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: Towards Better Dental AI: A Multimodal Benchmark and Instruction Dataset for Panoramic X-ray Analysis

🔹 Publication Date: Published on Sep 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.09254
• PDF: https://arxiv.org/pdf/2509.09254

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: All You Need Is A Fuzzing Brain: An LLM-Powered System for Automated Vulnerability Detection and Patching

🔹 Publication Date: Published on Sep 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.07225
• PDF: https://arxiv.org/pdf/2509.07225
• Github: https://o2lab.github.io/FuzzingBrain-Leaderboard

🔹 Datasets citing this paper:
https://huggingface.co/datasets/Kitxuuu/AIXCC-C-Challenge

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
🔹 Title: AU-Harness: An Open-Source Toolkit for Holistic Evaluation of Audio LLMs

🔹 Publication Date: Published on Sep 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.08031
• PDF: https://arxiv.org/pdf/2509.08031
• Project Page: https://au-harness.github.io/
• Github: https://github.com/ServiceNow/AU-Harness

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.iss.one/DataScienceT
3