Data Science | Machine Learning with Python for Researchers
31.3K subscribers
1.52K photos
102 videos
22 files
1.79K links
Admin: @HusseinSheikho

The Data Science and Python channel is for researchers and advanced programmers

Buy ads: https://telega.io/c/dataScienceT
MIT Introduction to Deep Learning - 2023 is starting soon! MIT Intro to DL is one of the most concise AI courses on the web, covering basic deep learning techniques, architectures, and applications.

The 2023 lectures start in just one day, on Jan 9th!

Link to register:
https://introtodeeplearning.com

MIT Introduction to Deep Learning: the 2022 lectures can be found here:

https://m.youtube.com/playlist?list=PLtBw6njQRU-rwp5__7C0oIVt26ZgjG9NI

https://t.iss.one/DataScienceT
โคโ€๐Ÿ”ฅ3๐Ÿ‘3๐Ÿ†2
Train your ControlNet with diffusers 🧨

ControlNet is a neural network structure that allows fine-grained control of diffusion models by adding extra conditions.

🤗 Hugging Face: https://huggingface.co/blog/train-your-controlnet#

🖥 Github: https://github.com/huggingface/blog/blob/main/train-your-controlnet.md

⏩ ControlNet training example: https://github.com/huggingface/diffusers/tree/main/examples/controlnet

🔥 Fix the Noise: Disentangling Source Feature for Controllable Domain Translation

A new approach for high-quality domain translation with better controllability.

🖥 Github: https://github.com/LeeDongYeun/FixNoise

⏩ Paper: https://arxiv.org/abs/2303.11545v1

💨 Dataset: https://paperswithcode.com/dataset/metfaces

"A panda is playing guitar on times square"

Text2Video-Zero

Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators

Paper: https://arxiv.org/abs/2303.13439
Video Result: video result link
Source code: https://github.com/picsart-ai-research/text2video-zero

Conditional Image-to-Video Generation with Latent Flow Diffusion Models

A new approach to conditional image-to-video (cI2V) generation using novel latent flow diffusion models (LFDM), which synthesize an optical flow sequence in the latent space based on the given condition and use it to warp the given image.
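
The warping step can be illustrated with a toy sketch. This is not the LFDM code: it assumes integer flow vectors, nearest-neighbour sampling, a plain 2D grid in place of a learned latent, and hand-written flows in place of ones sampled from a diffusion model. The names `warp` and `flows` are hypothetical.

```python
def warp(image, flow):
    """Backward-warp a 2D grid: out[y][x] = image[y + dy][x + dx].

    `flow[y][x]` holds an integer (dx, dy) offset. Source coordinates that
    fall outside the grid are clamped to the border.
    """
    h, w = len(image), len(image[0])
    out = []
    for y in range(h):
        row = []
        for x in range(w):
            dx, dy = flow[y][x]
            sx = min(max(x + dx, 0), w - 1)  # clamp column index
            sy = min(max(y + dy, 0), h - 1)  # clamp row index
            row.append(image[sy][sx])
        out.append(row)
    return out


image = [[1, 2, 3], [4, 5, 6]]
# One flow field per output frame; here content moves right by 1, then by 2.
flows = [[[(-shift, 0)] * 3 for _ in range(2)] for shift in (1, 2)]
# Every frame is produced by warping the same given image with its own flow.
frames = [warp(image, flow) for flow in flows]
for frame in frames:
    print(frame)
```

Each frame is warped from the original image rather than from the previous frame, which matches the description above of warping "the given image".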

🖥 Github: https://github.com/nihaomiao/cvpr23_lfdm

⏩ Paper: https://arxiv.org/abs/2303.13744v1

💨 Dataset: https://drive.google.com/file/d/1dRn1wl5TUaZJiiDpIQADt1JJ0_q36MVG/view?usp=share_link

Test of Time: Instilling Video-Language Models with a Sense of Time

GPT-5 will likely have video abilities, but will it have a sense of time? This #CVPR2023 paper by a student at the University of Amsterdam addresses that question and shows how to instil a sense of time into video-language foundation models.

Paper:
https://arxiv.org/abs/2301.02074

Code:
https://github.com/bpiyush/TestOfTime

Project Page:
https://bpiyush.github.io/testoftime-website/

ViperGPT: Visual Inference via Python Execution for Reasoning

ViperGPT is a framework that leverages code-generation models to compose vision-and-language models into subroutines that produce a result for any query.
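
A minimal sketch of that idea follows: a program is generated for the query and then executed against a small vision API. Everything here is a stand-in. `ImagePatch`, `generate_program`, and `answer` are illustrative names only loosely inspired by the project, the "image" is just a list of labels, and the code generator is a hard-coded stub rather than a real model.

```python
class ImagePatch:
    """Toy stand-in for a vision API exposed to generated code."""

    def __init__(self, objects):
        # A real system would wrap pixels; here an "image" is a list of labels.
        self.objects = objects

    def find(self, name):
        """Return one patch per object whose label equals `name`."""
        return [ImagePatch([o]) for o in self.objects if o == name]


def generate_program(query):
    """Stub for the code-generation model: returns Python source for a query.

    A real system would prompt a code model with the API description and the
    query; this stub ignores the query and returns a fixed program.
    """
    return (
        "def execute_command(image):\n"
        "    muffins = image.find('muffin')\n"
        "    kids = image.find('kid')\n"
        "    return len(muffins) // len(kids)\n"
    )


def answer(query, image):
    """Generate a program for the query, then run it on the image."""
    namespace = {}
    exec(generate_program(query), namespace)  # only execute code you trust
    return namespace["execute_command"](image)


image = ImagePatch(["muffin"] * 8 + ["kid"] * 2)
print(answer("How many muffins can each kid have for it to be fair?", image))
```

The point of the design is that the reasoning (counting, dividing) lives in ordinary Python, while perception is delegated to subroutines such as `find`.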


Github:
https://github.com/cvlab-columbia/viper

Paper:
https://arxiv.org/pdf/2303.08128.pdf

Project:
https://paperswithcode.com/dataset/beat

WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research

The authors propose a three-stage processing pipeline for filtering noisy data and generating high-quality captions with the help of ChatGPT.

🖥 Github: https://github.com/xinhaomei/wavcaps

⏩ Paper: https://arxiv.org/abs/2303.17395v1

💨 Dataset: https://paperswithcode.com/dataset/sounddescs

DPF: Learning Dense Prediction Fields with Weak Supervision

🖥 Github: https://github.com/cxx226/dpf

⏩ Paper: https://arxiv.org/abs/2303.16890v1

💨 Dataset: https://paperswithcode.com/dataset/pascal-context

Human Guided Ground-truth Generation for Realistic Image Super-resolution

🖥 Github: https://github.com/chrisdud0257/hggt

⏩ Paper: https://arxiv.org/abs/2303.13069

💨 Dataset: https://paperswithcode.com/dataset/div2k

ImageNet-E: Benchmarking Neural Network Robustness via Attribute Editing

🖥 Github: https://github.com/alibaba/easyrobust/tree/main/benchmarks/imagenet-e

⏩ Paper: https://arxiv.org/abs/2303.17096v1

💨 Dataset: https://paperswithcode.com/dataset/objectnet

โšก๏ธToken Merging for Stable Diffusion

Token Merging (ToMe) speeds up transformers by merging redundant tokens, which means the transformer has to do less work.

pip install tomesd
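
To make the merging idea concrete, here is a minimal pure-Python sketch of similarity-based token merging. It is an illustration only, not the tomesd implementation: the helper name `merge_tokens`, the alternating split into two sets, and the plain averaging are simplifying assumptions loosely based on the bipartite matching idea.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def merge_tokens(tokens, r):
    """Merge up to r redundant tokens (hypothetical helper, toy version).

    Tokens are split alternately into a source set and a destination set.
    Each source token is matched to its most similar destination token, and
    the r best-matched source tokens are averaged into their destinations.
    Note that the output order is: kept source tokens, then destinations.
    """
    src, dst = tokens[0::2], tokens[1::2]
    if not dst:
        return [list(t) for t in tokens]
    # Best destination index and similarity score for every source token.
    matches = []
    for i, s in enumerate(src):
        j = max(range(len(dst)), key=lambda k: cosine(s, dst[k]))
        matches.append((cosine(s, dst[j]), i, j))
    # The r highest-scoring source tokens get merged; the rest are kept.
    to_merge = sorted(matches, reverse=True)[:r]
    merged_src = {i for _, i, _ in to_merge}
    # Accumulate sums and counts so each destination becomes a plain average.
    sums = [list(d) for d in dst]
    counts = [1] * len(dst)
    for _, i, j in to_merge:
        sums[j] = [a + b for a, b in zip(sums[j], src[i])]
        counts[j] += 1
    kept = [list(s) for i, s in enumerate(src) if i not in merged_src]
    merged = [[a / c for a in vec] for vec, c in zip(sums, counts)]
    return kept + merged


tokens = [[1.0, 0.0], [1.0, 0.1], [0.0, 1.0], [0.1, 1.0]]
reduced = merge_tokens(tokens, r=1)
print(len(tokens), "->", len(reduced))
```

Fewer tokens means less work in every later layer, which is where the speed-up comes from; the cost is that merged tokens lose some detail.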

🖥 Github: https://github.com/dbolya/tomesd

⏩ Paper: https://arxiv.org/abs/2303.17604v1

💨 Blog: https://research.facebook.com/blog/2023/2/token-merging-your-vit-but-faster/

โญ๏ธ HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace

Language serves as an interface for LLMs to connect numerous AI models for solving complicated AI tasks!
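
A rough sketch of the "language as an interface" idea is below: a controller turns a request into a structured task plan and dispatches each step to a model chosen by task name. It is a toy under stated assumptions, not the JARVIS code. The `MODELS` registry, `plan_tasks`, `run`, and the `<previous>` placeholder are all hypothetical, and the planner is a fixed stub where a real system would query an LLM.

```python
# Hypothetical registry: task name -> callable standing in for a hosted model.
MODELS = {
    "image-to-text": lambda data: f"caption of {data}",
    "text-to-speech": lambda data: f"audio of '{data}'",
}


def plan_tasks(request):
    """Stub for the LLM planner: maps a request to an ordered task list.

    A real controller would ask an LLM to write this plan; the stub ignores
    the request and returns a fixed two-step plan.
    """
    return [
        {"task": "image-to-text", "input": "photo.jpg"},
        {"task": "text-to-speech", "input": "<previous>"},
    ]


def run(request):
    """Plan the tasks, pick a model per task, and execute them in order."""
    previous, log = None, []
    for step in plan_tasks(request):
        model = MODELS[step["task"]]  # model selection by task name
        data = previous if step["input"] == "<previous>" else step["input"]
        previous = model(data)  # task execution
        log.append(f"{step['task']}: {previous}")
    return previous, log


result, log = run("Describe photo.jpg out loud")
print(result)
```

Because the plan is plain structured text, the controller needs no model-specific glue beyond the registry lookup.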

🖥 Github: https://github.com/microsoft/JARVIS

โฉ Paper: https://arxiv.org/abs/2303.17604v1

WeakTr: Exploring Plain Vision Transformer for Weakly-supervised Semantic Segmentation

🖥 Github: https://github.com/hustvl/weaktr

⏩ Paper: https://arxiv.org/abs/2304.01184v1

💨 Dataset: https://paperswithcode.com/dataset/imagenet

Segment Anything

The Segment Anything Model (SAM) produces high-quality object masks from input prompts such as points or boxes, and it can be used to generate masks for all objects in an image.

🖥 Github: https://github.com/facebookresearch/segment-anything

⭐️ Project: https://segment-anything.com/

⏩ Paper: https://arxiv.org/abs/2304.02643v1

💨 Dataset: https://segment-anything.com/dataset/index.html

Painter → SegGPT: Vision Foundation Models from BAAI

SegGPT is a generalist model for segmenting everything in context.

🖥 Github: https://github.com/baaivision/painter

⏩ Paper: https://arxiv.org/abs/2304.03284v1

⏩ Demo: https://huggingface.co/spaces/BAAI/SegGPT

💨 Dataset: https://paperswithcode.com/dataset/youtube-vos

To watch paid channel content
All you have to do is subscribe to the paid channel. It includes many extensive programming courses, as well as very useful books that are available only in the paid channel.

To request a subscription: talk to @Hussein_Sheikho

Channel link: https://t.iss.one/+LnCmAFJO3tNmYjUy

⭐️⭐️⭐️⭐️⭐️⭐️⭐️⭐️⭐️⭐️⭐️

Some time ago we launched a special bot for downloading scientific, software, and mathematics books. The bot contains more than thirty million books, and new books appear there first. It can also download articles and scientific papers for free.

To request a subscription: talk to @Hussein_Sheikho
โคโ€๐Ÿ”ฅ2