Data Science | Machine Learning with Python for Researchers
Admin: @HusseinSheikho

The Data Science and Python channel is for researchers and advanced programmers

Buy ads: https://telega.io/c/dataScienceT
GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation

3 Apr 2025 · Zhiyuan Yan, Junyan Ye, Weijia Li, Zilong Huang, Shenghai Yuan, Xiangyang He, Kaiqing Lin, Jun He, Conghui He, Li Yuan ·

The recent breakthroughs in OpenAI's #GPT4o model have demonstrated surprisingly good capabilities in image generation and editing, generating significant excitement in the community. This technical report presents a first-look evaluation benchmark (named GPT-ImgEval) that quantitatively and qualitatively diagnoses GPT-4o's performance across three critical dimensions: (1) generation quality, (2) editing proficiency, and (3) world knowledge-informed semantic synthesis. Across all three tasks, GPT-4o demonstrates strong performance, significantly surpassing existing methods in both image generation control and output quality, while also showcasing exceptional knowledge-reasoning capabilities. Furthermore, based on GPT-4o's generated data, we propose a classification-model-based approach to investigate its underlying architecture; our empirical results suggest the model combines an auto-regressive (AR) backbone with a diffusion-based head for image decoding, rather than a VAR-like architecture. We also provide a complete, speculative account of GPT-4o's overall architecture. In addition, we conduct a series of analyses to identify and visualize GPT-4o's specific limitations and the synthetic artifacts commonly observed in its image generation. We also present a comparative study of multi-round image editing between GPT-4o and Gemini 2.0 Flash, and discuss the safety implications of GPT-4o's outputs, particularly their detectability by existing image forensic models. We hope that our work offers valuable insight and provides a reliable benchmark to guide future research, foster reproducibility, and accelerate innovation in the field of image generation and beyond. The code and datasets used for evaluating GPT-4o can be found at https://github.com/PicoTrex/GPT-ImgEval.


Paper: https://arxiv.org/pdf/2504.02782v1.pdf

Code: https://github.com/picotrex/gpt-imgeval

Datasets: MagicBrush, GenEval
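
A rough sketch of the kind of classifier probe described above, assuming one has folders of images from known diffusion-based and auto-regressive decoders (paths, class layout, and hyper-parameters here are illustrative, not the paper's actual setup):

```python
import torch
import torch.nn as nn
from torchvision import datasets, models, transforms

# Placeholder folders: each subfolder holds samples from decoders of a known family,
# e.g. decoder_probe_train/diffusion/ and decoder_probe_train/autoregressive/
tfm = transforms.Compose([transforms.Resize((224, 224)), transforms.ToTensor()])
train_set = datasets.ImageFolder("decoder_probe_train/", transform=tfm)
loader = torch.utils.data.DataLoader(train_set, batch_size=32, shuffle=True)

# Small binary classifier: diffusion-style vs. AR-style decoding artifacts
model = models.resnet18(weights="IMAGENET1K_V1")
model.fc = nn.Linear(model.fc.in_features, 2)
opt = torch.optim.Adam(model.parameters(), lr=1e-4)
loss_fn = nn.CrossEntropyLoss()

model.train()
for images, labels in loader:
    opt.zero_grad()
    loss = loss_fn(model(images), labels)
    loss.backward()
    opt.step()

# Applying the trained probe to GPT-4o outputs then hints at which decoder family
# its images most resemble, which is the spirit of the paper's architecture analysis.
```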

⚡️ BEST DATA SCIENCE CHANNELS ON TELEGRAM 🌟
🍄 4D Mocap Human-Object 🍄

Adobe unveils HUMOTO, a high-quality #dataset of human-object interactions designed for #motiongeneration, #computervision, and #robotics. It features over 700 sequences (7,875 seconds @ 30FPS) with interactions involving 63 precisely modeled objects and 72 articulated parts—a rich resource for researchers and developers in the field.


⚡️ Review: https://t.ly/lCof3
⚡️ Paper: https://lnkd.in/dVVBDd_c
⚡️ Project: https://lnkd.in/dwBcseDf

#HUMOTO #4DMocap #HumanObjectInteraction #AdobeResearch #AI #MachineLearning #PoseEstimation
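
For readers who want to prototype against a dataset like this before official tooling is available, here is a minimal PyTorch wrapper with a purely assumed file layout and key names (not HUMOTO's actual format):

```python
from pathlib import Path
import numpy as np
from torch.utils.data import Dataset

class MocapSequenceDataset(Dataset):
    """Assumes one .npz per sequence holding per-frame arrays sampled at 30 FPS."""

    def __init__(self, root: str):
        self.files = sorted(Path(root).glob("*.npz"))

    def __len__(self):
        return len(self.files)

    def __getitem__(self, idx):
        data = np.load(self.files[idx])
        # "human_pose" and "object_pose" are hypothetical keys for illustration only
        return data["human_pose"], data["object_pose"]
```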

⚡️ BEST DATA SCIENCE CHANNELS ON TELEGRAM 🌟
Forget Coding; start Vibing! Tell AI what you want, and watch it build your dream website while you enjoy a cup of coffee.

Date: Thursday, April 17th at 9 PM IST

Register for FREE: https://lu.ma/4nczknky?tk=eAT3Bi

Limited FREE seats available!
💥 Geo4D: VideoGen 4D Scene 💥

The Oxford VGG unveils Geo4D, a breakthrough in #videodiffusion for monocular 4D reconstruction. Trained only on synthetic data, Geo4D still achieves strong generalization to real-world scenarios. It outputs point maps, depth, and ray maps, setting a new #SOTA in dynamic scene reconstruction. Code is now released!


⚡️ Review: https://t.ly/X55Uj
⚡️ Paper: https://arxiv.org/pdf/2504.07961
⚡️ Project: https://geo4d.github.io/
⚡️ Code: https://github.com/jzr99/Geo4D

#Geo4D #4DReconstruction #DynamicScenes #OxfordVGG #ComputerVision #MachineLearning #DiffusionModels
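
A hedged sketch of how per-frame outputs like these could be consumed; `model` here is a stand-in callable, not the repo's actual API:

```python
import cv2
import torch

def run_monocular_4d(video_path: str, model):
    """Feed video frames to a model assumed to return depth, point, and ray maps."""
    cap = cv2.VideoCapture(video_path)
    outputs = []
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        x = torch.from_numpy(frame).permute(2, 0, 1).float().unsqueeze(0) / 255.0
        with torch.no_grad():
            pred = model(x)  # assumed dict with "depth", "points", "rays" tensors
        outputs.append({k: pred[k].cpu() for k in ("depth", "points", "rays")})
    cap.release()
    return outputs
```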

⚡️ BEST DATA SCIENCE CHANNELS ON TELEGRAM 🌟
🔥 ENTER THE VIP CHANNEL FOR FREE! ENTRY IS FREE FOR 24 HOURS!

LISA TRADER - the most successful trader of 2024. A week ago she finished a marathon in her VIP channel where, starting from $100, she made $2,000 in just two weeks!

Entry to her channel normally costs $1,500; for the next 24 hours, entry is FREE!

JOIN THE VIP CHANNEL NOW!
🔥 General Attention-Based Object Detection 🔥

👉 GATE3D is a novel framework designed specifically for generalized monocular 3D object detection via weak supervision. GATE3D effectively bridges domain gaps by employing consistency losses between 2D and 3D predictions.

👉 Review: https://t.ly/O7wqH
👉 Paper: https://lnkd.in/dc5VTUj9
👉 Project: https://lnkd.in/dzrt-qQV

#3DObjectDetection #Monocular3D #DeepLearning #WeakSupervision #ComputerVision #AI #MachineLearning #GATE3D
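
To make the idea concrete, here is a minimal 2D-3D consistency term in PyTorch; it is a generic formulation in the spirit of the description above, not the paper's exact loss:

```python
import torch
import torch.nn.functional as F

def consistency_loss(centers_3d, centers_2d, K):
    """centers_3d: (N, 3) camera-frame box centers; centers_2d: (N, 2) pixel centers; K: (3, 3) intrinsics."""
    proj = (K @ centers_3d.T).T                       # project 3D centers to the image plane
    proj = proj[:, :2] / proj[:, 2:].clamp(min=1e-6)  # perspective divide
    return F.l1_loss(proj, centers_2d)                # penalize 2D/3D disagreement
```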

⚡️ BEST DATA SCIENCE CHANNELS ON TELEGRAM 🌟
REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers

14 Apr 2025 · Xingjian Leng, Jaskirat Singh, Yunzhong Hou, Zhenchang Xing, Saining Xie, Liang Zheng ·

In this paper we tackle a fundamental question: "Can we train latent diffusion models together with the variational auto-encoder (VAE) tokenizer in an end-to-end manner?" Traditional deep-learning wisdom dictates that end-to-end training is often preferable when possible. However, for latent diffusion transformers, end-to-end training of both the VAE and the diffusion model using the standard diffusion loss is observed to be ineffective, even causing a degradation in final performance. We show that while the diffusion loss is ineffective, end-to-end training can be unlocked through the representation-alignment (REPA) loss, allowing both the VAE and the diffusion model to be jointly tuned during training. Despite its simplicity, the proposed training recipe (REPA-E) shows remarkable performance, speeding up diffusion-model training by over 17x and 45x relative to the REPA and vanilla training recipes, respectively. Interestingly, we observe that end-to-end tuning with REPA-E also improves the VAE itself, leading to improved latent-space structure and downstream generation performance. In terms of final performance, our approach sets a new state of the art, achieving FIDs of 1.26 and 1.83 with and without classifier-free guidance on ImageNet 256x256. Code is available at https://end2end-diffusion.github.io.


Paper: https://arxiv.org/pdf/2504.10483v1.pdf

Code: https://github.com/End2End-Diffusion/REPA-E

Dataset: ImageNet
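
As a rough illustration of the representation-alignment idea (our reading of the abstract, with assumed shapes and a hypothetical projection head, not the official REPA-E code):

```python
import torch
import torch.nn.functional as F

def repa_loss(diffusion_feats, frozen_feats, proj):
    """diffusion_feats: (B, N, D1) intermediate diffusion-transformer features;
    frozen_feats: (B, N, D2) features from a frozen pretrained vision encoder;
    proj: a learnable nn.Linear(D1, D2) projection head."""
    pred = proj(diffusion_feats)
    return 1.0 - F.cosine_similarity(pred, frozen_feats, dim=-1).mean()

# Sketch of the total objective: diffusion_loss + lambda * repa_loss, with gradients
# also flowing into the VAE, which is what makes the training "end-to-end" here.
```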

https://t.iss.one/DataScienceT
Liquid: Language Models are Scalable Multi-modal Generators

5 Dec 2024 · Junfeng Wu, Yi Jiang, Chuofan Ma, Yuliang Liu, Hengshuang Zhao, Zehuan Yuan, Song Bai, Xiang Bai ·

We present Liquid, an auto-regressive generation paradigm that seamlessly integrates visual comprehension and generation by tokenizing images into discrete codes and learning these code embeddings alongside text tokens within a shared feature space for both vision and language. Unlike previous multimodal large language models (MLLMs), Liquid achieves this integration using a single large language model (LLM), eliminating the need for external pretrained visual embeddings such as CLIP. For the first time, Liquid uncovers a scaling law: the performance drop unavoidably caused by the unified training of visual and language tasks diminishes as the model size increases. Furthermore, the unified token space enables visual generation and comprehension tasks to mutually enhance each other, effectively removing the typical interference seen in earlier models. We show that existing LLMs can serve as strong foundations for Liquid, saving 100x in training costs while outperforming Chameleon in multimodal capabilities and maintaining language performance comparable to mainstream LLMs like LLAMA2. Liquid also outperforms models like SD v2.1 and SD-XL (FID of 5.47 on MJHQ-30K), excelling in both vision-language and text-only tasks. This work demonstrates that LLMs such as LLAMA3.2 and GEMMA2 are powerful multimodal generators, offering a scalable solution for enhancing both vision-language understanding and generation. The code and models will be released at https://github.com/FoundationVision/Liquid.


Paper: https://arxiv.org/pdf/2412.04332v2.pdf

Code: https://github.com/foundationvision/liquid
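
A conceptual sketch of the shared-vocabulary idea (not the released Liquid code): extend a text tokenizer with image codebook tokens so one auto-regressive LLM predicts both modalities with the usual next-token loss. The base model and codebook size below are stand-ins.

```python
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("gpt2")    # small stand-in base LLM
model = AutoModelForCausalLM.from_pretrained("gpt2")

codebook_size = 8192                                  # assumed size of the image VQ codebook
image_tokens = [f"<img_{i}>" for i in range(codebook_size)]
tokenizer.add_tokens(image_tokens)
model.resize_token_embeddings(len(tokenizer))

# A training example becomes a single sequence mixing modalities, e.g.
# "a red car <img_17> <img_802> ...", optimized with the standard language-modeling loss.
```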

https://t.iss.one/DataScienceT
NVIDIA introduces the Describe Anything Model (DAM)

DAM is a new state-of-the-art model designed to generate rich, detailed descriptions for specific regions in images and videos; users can mark these regions using points, boxes, scribbles, or masks.
It sets a new benchmark in multimodal understanding, with open-source code under the Apache license, a dedicated dataset, and a live demo available on Hugging Face.

Explore more below:
Paper: https://lnkd.in/dZh82xtV
Project Page: https://lnkd.in/dcv9V2ZF
GitHub Repo: https://lnkd.in/dJB9Ehtb
Hugging Face Demo: https://lnkd.in/dXDb2MWU
Review: https://t.ly/la4JD

#NVIDIA #DescribeAnything #ComputerVision #MultimodalAI #DeepLearning #ArtificialIntelligence #MachineLearning #OpenSource #HuggingFace #GenerativeAI #VisualUnderstanding #Python #AIresearch
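
A small generic helper (not part of the DAM codebase) showing how user prompts such as boxes or point clicks are typically turned into the binary region masks that region-captioning models consume:

```python
import numpy as np

def box_to_mask(h, w, box):
    """box = (x1, y1, x2, y2) in pixels -> uint8 mask of shape (h, w)."""
    x1, y1, x2, y2 = box
    mask = np.zeros((h, w), dtype=np.uint8)
    mask[y1:y2, x1:x2] = 1
    return mask

def point_to_mask(h, w, point, radius=8):
    """Approximate a point click as a small disc mask."""
    yy, xx = np.ogrid[:h, :w]
    cx, cy = point
    return (((yy - cy) ** 2 + (xx - cx) ** 2) <= radius ** 2).astype(np.uint8)

mask = box_to_mask(480, 640, (100, 120, 300, 360))  # example region prompt
```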

https://t.iss.one/DataScienceT
This channel is for programmers, coders, and software engineers.

0️⃣ Python
1️⃣ Data Science
2️⃣ Machine Learning
3️⃣ Data Visualization
4️⃣ Artificial Intelligence
5️⃣ Data Analysis
6️⃣ Statistics
7️⃣ Deep Learning
8️⃣ Programming Languages

https://t.iss.one/addlist/8_rRW2scgfRhOTc0

https://t.iss.one/Codeprogrammer
🌼 SOTA Textured 3D-Guided VTON 🌼

👉 #ALIBABA unveils 3DV-TON, a novel diffusion model for high-quality, temporally consistent video virtual try-on. It generates animatable textured 3D meshes as explicit frame-level guidance, alleviating the issue of models over-focusing on appearance fidelity at the expense of motion coherence. Code & benchmark to be released 💙

👉 Review: https://t.ly/0tjdC
👉 Paper: https://lnkd.in/dFseYSXz
👉 Project: https://lnkd.in/djtqzrzs
👉 Repo: TBA

#AI #3DReconstruction #DiffusionModels #VirtualTryOn #ComputerVision #DeepLearning #VideoSynthesis

https://t.iss.one/DataScienceT 🔗
Forwarded from Python Courses
🚀 LunaProxy - The Most Cost-Effective Residential Proxy
Exclusive benefits for members of this group:
💥 Residential Proxy: as low as $0.77/GB. Use the discount code [lunapro30] when placing an order and save 30% immediately.
✔️ Over 200 million pure IPs | No charge for invalid ones | Success rate > 99.9%
💥 Unlimited Traffic Proxy: enjoy a discount of up to 72%, only $79/day.
✔️ Unlimited traffic | Unlimited concurrency | Bandwidth of over 100 Gbps | Customized services | Save 90% of the cost when collecting AI/LLM data
Join the Luna Affiliate Program and earn a 10% commission. There is no upper limit on the commission, and you can withdraw it at any time.
👉 Take action now: https://www.lunaproxy.com/?ls=data&lk=?01
🚀 Master the Transformer Architecture with PyTorch! 🧠

Dive deep into the world of Transformers with this comprehensive PyTorch implementation guide. Whether you're a seasoned ML engineer or just starting out, this resource breaks down the complexities of the Transformer model, inspired by the groundbreaking paper "Attention Is All You Need".

🔗 Check it out here:
https://www.k-a.in/pyt-transformer.html

This guide offers:

🌟 Detailed explanations of each component of the Transformer architecture.

🌟 Step-by-step code implementations in PyTorch.

🌟 Insights into the self-attention mechanism and positional encoding.

By following along, you'll gain a solid understanding of how Transformers work and how to implement them from scratch.
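
As a taste of what the guide covers, here is a minimal, self-contained sketch of the two pieces it highlights, sinusoidal positional encoding and (single-head) scaled dot-product self-attention:

```python
import math
import torch
import torch.nn.functional as F

def positional_encoding(seq_len, d_model):
    """Sinusoidal positional encoding as in the original Transformer paper."""
    pos = torch.arange(seq_len).unsqueeze(1).float()
    div = torch.exp(torch.arange(0, d_model, 2).float() * (-math.log(10000.0) / d_model))
    pe = torch.zeros(seq_len, d_model)
    pe[:, 0::2] = torch.sin(pos * div)
    pe[:, 1::2] = torch.cos(pos * div)
    return pe

def self_attention(x, w_q, w_k, w_v):
    """Scaled dot-product self-attention for a batch of sequences x: (B, T, D)."""
    q, k, v = x @ w_q, x @ w_k, x @ w_v
    scores = q @ k.transpose(-2, -1) / math.sqrt(q.size(-1))
    return F.softmax(scores, dim=-1) @ v

x = torch.randn(1, 10, 64) + positional_encoding(10, 64)   # add position information
w_q, w_k, w_v = (torch.randn(64, 64) for _ in range(3))
out = self_attention(x, w_q, w_k, w_v)                      # (1, 10, 64)
```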

#MachineLearning #DeepLearning #PyTorch #Transformer #AI #NLP #AttentionIsAllYouNeed #Coding #DataScience #NeuralNetworks


💯 BEST DATA SCIENCE CHANNELS ON TELEGRAM 🌟

🧠💻📊
IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

8 Feb 2025 · Wei Deng, Siyi Zhou, Jingchen Shu, Jinchao Wang, Lu Wang ·

Recently, large language model (#LLM) based text-to-speech (#TTS) systems have gradually become the mainstream in the industry due to their high naturalness and powerful zero-shot voice cloning capabilities. Here, we introduce the IndexTTS system, which is mainly based on the XTTS and Tortoise models, with several novel improvements. Specifically, for Chinese scenarios we adopt a hybrid modeling method that combines characters and pinyin, making the pronunciations of polyphonic and long-tail characters controllable. We also perform a comparative analysis of Vector Quantization (VQ) and Finite-Scalar Quantization (FSQ) for codebook utilization of acoustic speech tokens. To further enhance the effect and stability of voice cloning, we introduce a conformer-based speech conditional encoder and replace the speech-code decoder with BigVGAN2. Compared with #XTTS, IndexTTS achieves significant improvements in naturalness, content consistency, and zero-shot voice cloning. Compared with popular open-source TTS systems such as Fish-Speech, CosyVoice2, FireRedTTS, and F5-TTS, IndexTTS has a simpler training process, more controllable usage, and faster inference speed, while its performance surpasses that of these systems. Our demos are available at https://index-tts.github.io.


Paper: https://arxiv.org/pdf/2502.05512v1.pdf

Code: https://github.com/index-tts/index-tts
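
A toy illustration of the two token schemes compared above, a VQ codebook lookup versus finite-scalar quantization with a straight-through estimator (levels and shapes are arbitrary examples, not the paper's configuration):

```python
import torch

def fsq(z, levels=8):
    """Finite-scalar quantization: bound each latent dim, snap it to `levels` values, straight-through."""
    z = torch.tanh(z)
    zq = torch.round((z + 1) / 2 * (levels - 1)) / (levels - 1) * 2 - 1
    return z + (zq - z).detach()

def vq(z, codebook):
    """Vector quantization: nearest-neighbour lookup in an explicit (K, D) codebook."""
    idx = torch.cdist(z, codebook).argmin(dim=-1)
    return codebook[idx], idx

z = torch.randn(4, 16)
print(fsq(z).shape, vq(z, torch.randn(512, 16))[1].shape)
```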

https://t.iss.one/DataScienceT
LettuceDetect: A Hallucination Detection Framework for RAG Applications

24 Feb 2025 · Ádám Kovács, Gábor Recski ·

Retrieval Augmented Generation (#RAG) systems remain vulnerable to hallucinated answers despite incorporating external knowledge sources. We present LettuceDetect, a framework that addresses two critical limitations of existing hallucination detection methods: (1) the context window constraints of traditional encoder-based methods, and (2) the computational inefficiency of #LLM-based approaches. Building on ModernBERT's extended context capabilities (up to 8k tokens) and trained on the RAGTruth benchmark dataset, our approach outperforms all previous encoder-based models and most prompt-based models, while being approximately 30 times smaller than the best models. LettuceDetect is a token-classification model that processes context-question-answer triples, allowing for the identification of unsupported claims at the token level. Evaluations on the RAGTruth corpus demonstrate an F1 score of 79.22% for example-level detection, a 14.8% improvement over Luna, the previous state-of-the-art encoder-based architecture. Additionally, the system can process 30 to 60 examples per second on a single GPU, making it practical for real-world RAG applications.


Paper: https://arxiv.org/pdf/2502.17125v1.pdf

Code: https://github.com/KRLabsOrg/LettuceDetect

Colab: https://colab.research.google.com/drive/1Ubca5aMaBGdHtJ1rpqj3Ke9SLEr-PaDn?usp=sharing
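
A generic sketch of token-level hallucination flagging with a ModernBERT-style token classifier, as the abstract describes; the checkpoint name and two-label head below are placeholders rather than the released LettuceDetect API:

```python
import torch
from transformers import AutoTokenizer, AutoModelForTokenClassification

name = "answerdotai/ModernBERT-base"   # backbone family cited above; classification head is untrained here
tokenizer = AutoTokenizer.from_pretrained(name)
model = AutoModelForTokenClassification.from_pretrained(name, num_labels=2)

# The detector consumes a context-question-answer triple as one long sequence
text = "Context: ... Question: ... Answer: ..."
enc = tokenizer(text, return_tensors="pt", truncation=True)
with torch.no_grad():
    logits = model(**enc).logits             # (1, seq_len, 2)
unsupported = logits.argmax(-1)[0].bool()    # per-token "hallucinated" flags
```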

https://t.iss.one/DataScienceT
PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

7 Apr 2025 · Zonghang Li, Tao Li, Wenjiao Feng, Mohsen Guizani, Hongfang Yu ·

The emergence of DeepSeek R1 and QwQ 32B has broken through performance barriers for running frontier large language models (#LLMs) on home devices. While consumer hardware is getting stronger and model quantization is improving, existing end-side solutions still demand #GPU clusters, large RAM/VRAM, and high bandwidth, far beyond what a common home cluster can handle. This paper introduces prima.cpp, a distributed inference system that runs 70B-scale models on everyday home devices using a mix of CPU/GPU, low RAM/VRAM, Wi-Fi, and cross-platform support. It uses mmap to manage model weights and introduces piped-ring parallelism with prefetching to hide disk loading. By modeling heterogeneity in computation, communication, disk, memory (and its management behavior), and OS, it optimally assigns model layers to each device's #CPU and GPU, further reducing token latency. An elegant algorithm named Halda is proposed to solve this NP-hard assignment problem. We evaluate prima.cpp on a common four-node home cluster. It outperforms llama.cpp, #exo, and #dllama on 30B+ models while keeping memory pressure below 6%. This brings frontier 30B-70B models, such as #Llama 3, #DeepSeek R1, #Qwen 2.5, and #QwQ, to home assistants, making advanced AI truly accessible to individuals. The code is open source and available at https://github.com/Lizonghang/prima.cpp.


Paper: https://arxiv.org/pdf/2504.08791v1.pdf

Code: https://github.com/lizonghang/prima.cpp
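
To give a feel for the assignment problem, here is a deliberately naive greedy layer-splitting sketch (not the paper's Halda algorithm); device names and numbers are made up:

```python
def assign_layers(n_layers, devices):
    """devices: list of dicts with 'name', relative 'speed', and 'mem_layers' capacity."""
    total_speed = sum(d["speed"] for d in devices)
    plan, remaining = {}, n_layers
    for i, d in enumerate(devices):
        want = round(n_layers * d["speed"] / total_speed)
        take = min(want, d["mem_layers"], remaining)
        if i == len(devices) - 1:            # last device absorbs whatever is left
            take = min(remaining, d["mem_layers"])
        plan[d["name"]] = take
        remaining -= take
    return plan, remaining                   # remaining > 0 means the cluster is too small

devices = [
    {"name": "laptop-gpu", "speed": 4.0, "mem_layers": 40},
    {"name": "desktop-cpu", "speed": 2.0, "mem_layers": 30},
    {"name": "mac-mini", "speed": 1.0, "mem_layers": 20},
]
print(assign_layers(80, devices))            # e.g. ({'laptop-gpu': 40, 'desktop-cpu': 23, 'mac-mini': 17}, 0)
```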

https://t.iss.one/DataScienceT
BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence

22 Nov 2024 · Xuewu Lin, Tianwei Lin, Lichao Huang, Hongyu Xie, Zhizhong Su ·

In embodied intelligence systems, a key component is the 3D perception algorithm, which enables agents to understand their surrounding environments. Previous algorithms primarily rely on point clouds, which, despite offering precise geometric information, still constrain perception performance due to inherent sparsity, noise, and data scarcity. In this work, we introduce a novel image-centric 3D perception model, #BIP3D, which leverages expressive image features with explicit 3D position encoding to overcome the limitations of point-centric methods. Specifically, we leverage pre-trained 2D vision foundation models to enhance semantic understanding, and introduce a spatial enhancer module to improve spatial understanding. Together, these modules enable BIP3D to achieve multi-view, multi-modal feature fusion and end-to-end 3D perception. In our experiments, BIP3D outperforms current state-of-the-art results on the EmbodiedScan benchmark, achieving improvements of 5.69% in the 3D detection task and 15.25% in the 3D visual grounding task.


Paper: https://arxiv.org/pdf/2411.14869v2.pdf

Code: https://github.com/HorizonRobotics/BIP3D

HF: https://huggingface.co/spaces/AGC2024/visual-grounding-2024

Dataset: 10,000 People - Human Pose Recognition Data
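
A loose sketch of the image-centric idea as we read it from the abstract (not the released BIP3D code): attach explicit 3D position information, here per-pixel camera-ray directions, to 2D backbone features before fusing views.

```python
import torch
import torch.nn.functional as F

def add_ray_encoding(feats, rays):
    """feats: (V, C, H, W) per-view 2D features; rays: (V, 3, H, W) unit ray directions."""
    return torch.cat([feats, rays], dim=1)   # (V, C+3, H, W)

def fuse_views(encoded):
    """Simplest possible multi-view fusion: average across the view axis."""
    return encoded.mean(dim=0)               # (C+3, H, W)

feats = torch.randn(4, 256, 32, 32)
rays = F.normalize(torch.randn(4, 3, 32, 32), dim=1)
print(fuse_views(add_ray_encoding(feats, rays)).shape)   # torch.Size([259, 32, 32])
```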

https://t.iss.one/DataScienceT
LLM Engineer’s Handbook (2024)

🚀 Unlock the Future of AI with the LLM Engineer’s Handbook 🚀

Step into the world of Large Language Models (LLMs) with this comprehensive guide that takes you from foundational concepts to deploying advanced applications using LLMOps best practices. Whether you're an AI engineer, NLP professional, or LLM enthusiast, this book offers practical insights into designing, training, and deploying LLMs in real-world scenarios.

Why Choose the LLM Engineer’s Handbook?

Comprehensive Coverage: Learn about data engineering, supervised fine-tuning, and deployment strategies.

Hands-On Approach: Implement MLOps components through practical examples, including building an LLM-powered twin that's cost-effective, scalable, and modular.

Cutting-Edge Techniques: Explore inference optimization, preference alignment, and real-time data processing to apply LLMs effectively in your projects.

Real-World Applications: Move beyond isolated Jupyter notebooks and focus on building production-grade end-to-end LLM systems.


Limited-Time Offer

Originally priced at $55, the LLM Engineer’s Handbook is now available for just $25—a 55% discount! This special offer is available for a limited quantity, so act fast to secure your copy.

Who Should Read This Book?

This handbook is ideal for AI engineers, NLP professionals, and LLM engineers looking to deepen their understanding of LLMs. A basic knowledge of LLMs, Python, and AWS is recommended. Whether you're new to AI or seeking to enhance your skills, this book provides comprehensive guidance on implementing LLMs in real-world scenarios.

Don't miss this opportunity to advance your expertise in LLM engineering. Secure your discounted copy today and take the next step in your AI journey!

Buy book: https://www.patreon.com/DataScienceBooks/shop/llm-engineers-handbook-2024-1582908