Data Science | Machine Learning with Python for Researchers
31.8K subscribers
2.08K photos
102 videos
22 files
2.36K links
Admin: @HusseinSheikho

The Data Science and Python channel is for researchers and advanced programmers

Buy ads: https://telega.io/c/dataScienceT
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
🌳 Compose Anything is Out! 🌳
#SkyworkAI unveils #SkyReelsA2 β€” a controllable video generation framework that can assemble arbitrary visual elements (e.g., characters, objects, backgrounds) into fully synthesized videos from text prompts.
Code, models, and evaluation benchmark are all released!
πŸ”— Resources:
Review: https://t.ly/MEjzL
Paper: https://arxiv.org/pdf/2504.02436
Project: https://skyworkai.github.io/skyreels-a2.github.io/
Repo: https://github.com/SkyworkAI/SkyReels-A2
πŸ€— Models: https://huggingface.co/Skywork/SkyReels-A2

#AI #VideoGeneration #Multimodal #GenerativeAI #SkyReels #OpenSource

https://t.iss.one/DataScienceT βœ…
Please open Telegram to view this post
VIEW IN TELEGRAM
πŸ‘5
This media is not supported in your browser
VIEW IN TELEGRAM
🐈 TTT Long Video Generation 🐈

πŸ‘‰ A novel architecture for video generation, adapting the #CogVideoX 5B model by incorporating #TestTimeTraining (TTT) layers.
Adding TTT layers into a pre-trained Transformer enables generating a one-minute clip from text storyboards.
Videos, code & annotations released πŸ’™

πŸ”— Review: https://t.ly/mhlTN
πŸ“„ Paper: arxiv.org/pdf/2504.05298
🌐 Project: test-time-training.github.io/video-dit
πŸ’» Repo: github.com/test-time-training/ttt-video-dit

#AI #VideoGeneration #MachineLearning #DeepLearning #Transformers #TTT #GenerativeAI

⭐️ BEST DATA SCIENCE CHANNELS ON TELEGRAM ⭐️
Please open Telegram to view this post
VIEW IN TELEGRAM
πŸ‘2
This media is not supported in your browser
VIEW IN TELEGRAM
NVIDIA introduces Describe Anything Model (DAM)

a new state-of-the-art model designed to generate rich, detailed descriptions for specific regions in images and videos. Users can mark these regions using points, boxes, scribbles, or masks.
DAM sets a new benchmark in multimodal understanding, with open-source code under the Apache license, a dedicated dataset, and a live demo available on Hugging Face.

Explore more below:
Paper: https://lnkd.in/dZh82xtV
Project Page: https://lnkd.in/dcv9V2ZF
GitHub Repo: https://lnkd.in/dJB9Ehtb
Hugging Face Demo: https://lnkd.in/dXDb2MWU
Review: https://t.ly/la4JD

#NVIDIA #DescribeAnything #ComputerVision #MultimodalAI #DeepLearning #ArtificialIntelligence #MachineLearning #OpenSource #HuggingFace #GenerativeAI #VisualUnderstanding #Python #AIresearch

https://t.iss.one/DataScienceT βœ…
Please open Telegram to view this post
VIEW IN TELEGRAM
πŸ‘5