LISA: Reasoning Segmentation via Large Language Model
New segmentation task: reasoning segmentation. Given a complex, implicit query text, the model must output a segmentation mask.
GitHub: https://github.com/dvlab-research/lisa
Paper: https://arxiv.org/abs/2308.00692v2
Dataset: https://github.com/dvlab-research/lisa#dataset
https://t.iss.one/DataScienceT
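To make the task's input/output contract concrete, here is a minimal, hypothetical sketch; the `ReasoningSegmenter` class and its `segment` method are illustrative placeholders, not LISA's actual API (see the GitHub repo above for the real inference code).

```python
import numpy as np
from PIL import Image

# Hypothetical wrapper illustrating the reasoning-segmentation interface:
# the model receives an image plus an implicit, reasoning-heavy query
# and returns a binary mask over the image pixels.
class ReasoningSegmenter:
    def __init__(self, model_name: str):
        self.model_name = model_name  # e.g. a LISA checkpoint path (placeholder)

    def segment(self, image: Image.Image, query: str) -> np.ndarray:
        # A real implementation would run the multimodal LLM plus a mask decoder.
        # Here we just return an empty mask of the right shape.
        return np.zeros((image.height, image.width), dtype=bool)

if __name__ == "__main__":
    img = Image.new("RGB", (640, 480))
    model = ReasoningSegmenter("lisa-checkpoint")  # placeholder name
    # The query is implicit: it does not name the target object class directly.
    mask = model.segment(img, "the food with the most vitamin C in this fridge")
    print(mask.shape, mask.dtype)  # (480, 640) bool
```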
When training generative models, the training dataset plays a major role in the inference quality of the resulting models.
One good source is MiraData from Tencent: a ready-made dataset with a total video duration of 16,000 hours, designed for training text-to-video generation models. It includes long videos (72.1 seconds on average) with high motion intensity and detailed structured annotations (318 words per video on average).
To assess the dataset's quality, a dedicated benchmark suite, MiraBench, was created as well: 17 metrics covering temporal consistency, in-frame motion, video quality, and other parameters. By these metrics, MiraData outperforms other well-known open datasets, which mostly consist of short videos of variable quality with short descriptions.
#Text2Video #Dataset #ML
https://t.iss.one/DataScienceT
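A minimal sketch of how such metadata could be filtered for long, densely captioned clips; the file name and columns (`miradata_meta.csv`, `duration`, `dense_caption`) are assumptions for illustration, not MiraData's documented schema.

```python
import pandas as pd

# Assumed metadata file and column names (check the MiraData repo for the real schema).
meta = pd.read_csv("miradata_meta.csv")

# Keep clips with at least a minute of footage and a long, detailed caption,
# mirroring the dataset's emphasis on long videos and dense annotations.
selected = meta[
    (meta["duration"] >= 60.0)
    & (meta["dense_caption"].str.split().str.len() >= 200)
]
print(f"{len(selected)} / {len(meta)} clips match the filter")
```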
Adobe unveils HUMOTO, a high-quality #dataset of human-object interactions designed for #motiongeneration, #computervision, and #robotics. It features over 700 sequences (7,875 seconds @ 30FPS) with interactions involving 63 precisely modeled objects and 72 articulated parts: a rich resource for researchers and developers in the field.
#HUMOTO #4DMocap #HumanObjectInteraction #AdobeResearch #AI #MachineLearning #PoseEstimation
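A quick back-of-the-envelope check of the scale implied by the numbers above (total frames and average sequence length follow directly from the quoted figures):

```python
# Scale implied by the announced figures: 7,875 s of capture at 30 FPS, 700+ sequences.
total_seconds = 7_875
fps = 30
num_sequences = 700  # "over 700"; used as a lower bound here

total_frames = total_seconds * fps
avg_seq_seconds = total_seconds / num_sequences

print(f"total frames: {total_frames:,}")                 # 236,250
print(f"avg sequence length: {avg_seq_seconds:.2f} s")   # ~11.25 s (upper bound)
```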
Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing
Summary:
Pico-Banana-400K is a new 400K-image dataset for text-guided image editing, built from real photos. It offers diverse edit types, high quality, and specialized subsets for multi-turn, preference-based, and long-short instruction editing, enabling comprehensive model development.
Publication Date: Oct 22
Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.19808
• PDF: https://arxiv.org/pdf/2510.19808
• GitHub: https://github.com/apple/pico-banana-400k
Models citing this paper:
• https://huggingface.co/eigen-ai-labs/eigen-banana-qwen-image-edit
==================================
For more data science resources:
https://t.iss.one/DataScienceT
#ImageEditing #TextGuidedEditing #Dataset #ComputerVision #AI
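A minimal sketch of splitting such an editing dataset into single-turn and multi-turn subsets; the file path and record fields (`edits.jsonl`, `instruction`, `turns`) are hypothetical, since the actual release layout is documented in the GitHub repo above.

```python
import json
from pathlib import Path

# Hypothetical local export of the edit annotations; the real release format
# (files, field names) is described in the apple/pico-banana-400k repository.
annotations_path = Path("pico_banana_400k/edits.jsonl")

single_turn, multi_turn = [], []
with annotations_path.open() as f:
    for line in f:
        record = json.loads(line)  # assumed fields: "instruction", "turns", "image_id"
        (multi_turn if len(record.get("turns", [])) > 1 else single_turn).append(record)

print(f"single-turn edits: {len(single_turn)}, multi-turn edits: {len(multi_turn)}")
```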
GUI-360: A Comprehensive Dataset and Benchmark for Computer-Using Agents
Summary:
GUI-360 is a large dataset and benchmark for computer-using agents, addressing gaps in real-world tasks and unified evaluation. It contains over 1.2M action steps in Windows apps for GUI grounding, screen parsing, and action prediction. Benchmarking reveals significant shortcomings in current models.
Publication Date: Nov 6
Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.04307
• PDF: https://arxiv.org/pdf/2511.04307
==================================
For more data science resources:
https://t.iss.one/DataScienceT
#AI #ComputerAgents #GUIAgents #Dataset #Benchmark
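To illustrate what a single "action step" for a computer-using agent typically contains, here is a hedged sketch of a record type; the field names and action vocabulary are illustrative, not GUI-360's actual schema.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

# Illustrative record for one agent action step (not GUI-360's actual schema):
# a screenshot, the task instruction, and the grounded action taken on it.
@dataclass
class ActionStep:
    screenshot_path: str                 # path to the captured Windows UI frame
    instruction: str                     # natural-language task being executed
    action_type: str                     # e.g. "click", "type", "scroll" (assumed vocabulary)
    target_bbox: Optional[Tuple[int, int, int, int]] = None  # grounded UI element, if any
    typed_text: Optional[str] = None     # payload for "type" actions

step = ActionStep(
    screenshot_path="frames/0001.png",
    instruction="Save the document as PDF",
    action_type="click",
    target_bbox=(412, 88, 470, 110),
)
print(step)
```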
CATS-V2V: A Real-World Vehicle-to-Vehicle Cooperative Perception Dataset with Complex Adverse Traffic Scenarios
Summary:
CATS-V2V is a new real-world dataset for V2V cooperative perception, focusing on complex adverse traffic scenarios. It provides extensive synchronized sensor data, including LiDAR and cameras, from two vehicles across diverse conditions. This dataset supports autonomous driving research.
Publication Date: Nov 14
Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.11168
• PDF: https://arxiv.org/pdf/2511.11168
==================================
For more data science resources:
https://t.iss.one/DataScienceT
#V2V #AutonomousDriving #CooperativePerception #Dataset #ADAS
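Cooperative perception needs the two vehicles' sensor streams aligned in time. Below is a generic nearest-timestamp pairing sketch; the 50 ms tolerance and toy 10 Hz timestamps are assumptions for illustration, not values from the paper.

```python
import bisect

def pair_by_nearest_timestamp(ego_ts, coop_ts, max_gap_s=0.05):
    """Match each ego-vehicle frame to the closest cooperating-vehicle frame.

    ego_ts, coop_ts: sorted lists of frame timestamps in seconds.
    max_gap_s: drop pairs whose clocks differ by more than this (assumed tolerance).
    """
    pairs = []
    for t in ego_ts:
        i = bisect.bisect_left(coop_ts, t)
        candidates = [j for j in (i - 1, i) if 0 <= j < len(coop_ts)]
        j = min(candidates, key=lambda j: abs(coop_ts[j] - t))
        if abs(coop_ts[j] - t) <= max_gap_s:
            pairs.append((t, coop_ts[j]))
    return pairs

# Toy example: 10 Hz LiDAR sweeps from two vehicles with a slight clock offset.
ego = [0.0, 0.1, 0.2, 0.3]
coop = [0.02, 0.12, 0.21, 0.33]
print(pair_by_nearest_timestamp(ego, coop))
```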
miniF2F-Lean Revisited: Reviewing Limitations and Charting a Path Forward
Summary:
An analysis of miniF2F found that AI systems reached only 36% accuracy, largely due to errors in the problem statements. Correcting these errors produced miniF2F-v2, raising accuracy to 70%. High-quality benchmarks like miniF2F-v2 are crucial for evaluating progress in formal reasoning.
Publication Date: Nov 5
Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.03108
• PDF: https://arxiv.org/pdf/2511.03108
• GitHub: https://github.com/roozbeh-yz/miniF2F_v2
Datasets citing this paper:
• https://huggingface.co/datasets/roozbeh-yz/miniF2F_v2
==================================
For more data science resources:
https://t.iss.one/DataScienceT
#AI #FormalReasoning #Benchmarks #MachineLearning #Dataset
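miniF2F problems are competition-style statements formalized in Lean that a prover must close with a machine-checked proof. The toy statement below only illustrates that format (it is not an actual benchmark problem) and uses plain Lean 4 with no external libraries.

```lean
-- A toy, miniF2F-style goal: state a fact and discharge it with a checked proof.
-- (Illustrative only; real benchmark problems are competition mathematics.)
theorem toy_add_zero (n : Nat) : n + 0 = n := by
  rfl
```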
MicroVQA++: High-Quality Microscopy Reasoning Dataset with Weakly Supervised Graphs for Multimodal Large Language Model
Summary:
MicroVQA++ is a new high-quality microscopy VQA dataset built via a three-stage process, including HiCQA-Graph, a novel filtering method that combines NLI, CLIP, and MLLM signals. The dataset enables strong microscopy reasoning performance for MLLMs.
Publication Date: Nov 14
Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.11407
• PDF: https://arxiv.org/pdf/2511.11407
• GitHub: https://github.com/ieellee/MicroVQA-PlusPlus
==================================
For more data science resources:
https://t.iss.one/DataScienceT
#MLLM #Microscopy #VQA #AIResearch #Dataset
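HiCQA-Graph filters image-caption-QA samples using NLI, CLIP, and MLLM signals. The sketch below shows only the general idea of gating samples on several model-derived scores; the score names, thresholds, and simple conjunctive rule are assumptions for illustration, not the paper's actual graph-based method.

```python
from dataclasses import dataclass
from typing import List

# Illustrative sample with pre-computed signals (not the paper's actual pipeline):
# an NLI entailment score between caption and answer, a CLIP image-text similarity,
# and an MLLM agreement score for the QA pair.
@dataclass
class Sample:
    question: str
    answer: str
    nli_entailment: float   # 0..1
    clip_similarity: float  # cosine similarity, roughly 0..1
    mllm_agreement: float   # 0..1

def keep(sample: Sample,
         nli_thresh: float = 0.7,
         clip_thresh: float = 0.25,
         mllm_thresh: float = 0.5) -> bool:
    # Simple conjunctive gating on all three signals (thresholds are assumed).
    return (sample.nli_entailment >= nli_thresh
            and sample.clip_similarity >= clip_thresh
            and sample.mllm_agreement >= mllm_thresh)

def filter_samples(samples: List[Sample]) -> List[Sample]:
    return [s for s in samples if keep(s)]

if __name__ == "__main__":
    demo = [Sample("What organelle is stained?", "Mitochondria", 0.9, 0.31, 0.8),
            Sample("What organelle is stained?", "Nucleus", 0.2, 0.30, 0.4)]
    print(len(filter_samples(demo)))  # 1
```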