Data Science Machine Learning Data Analysis
38.8K subscribers
3.68K photos
31 videos
39 files
1.28K links
ads: @HusseinSheikho

This channel is for Programmers, Coders, Software Engineers.

1- Data Science
2- Machine Learning
3- Data Visualization
4- Artificial Intelligence
5- Data Analysis
6- Statistics
7- Deep Learning
Download Telegram
πŸ“Œ Pandas Can’t Handle This: How ArcticDB Powers Massive Datasets

πŸ—‚ Category: DATA SCIENCE

πŸ•’ Date: 2025-02-12 | ⏱️ Read time: 14 min read

Python has grown to dominate data science, and its package Pandas has become the go-to…
❀1
πŸ“Œ Branching Out: 4 Git Workflows for Collaborating on ML

πŸ—‚ Category: MACHINE LEARNING

πŸ•’ Date: 2025-02-12 | ⏱️ Read time: 11 min read

It’s been more than 15 years since I finished my master’s degree, but I’m still…
❀1
πŸ“Œ Build a Decision Tree in Polars from Scratch

πŸ—‚ Category: DATA SCIENCE

πŸ•’ Date: 2025-02-12 | ⏱️ Read time: 21 min read

Decision tree algorithms have always fascinated me. They are easy to implement and achieve good…
πŸ“Œ Virtualization & Containers for Data Science Newbies

πŸ—‚ Category: DATA SCIENCE

πŸ•’ Date: 2025-02-11 | ⏱️ Read time: 12 min read

Virtualization makes it possible to run multiple virtual machines (VMs) on a single piece of…
❀1
πŸ“Œ 4-Dimensional Data Visualization: Time in Bubble Charts

πŸ—‚ Category: DATA SCIENCE

πŸ•’ Date: 2025-02-11 | ⏱️ Read time: 9 min read

Bubble charts elegantly compress large amounts of information into a single visualization, with bubble size…
πŸ“Œ Understanding Model Calibration: A Gentle Introduction & Visual Exploration

πŸ—‚ Category: MACHINE LEARNING

πŸ•’ Date: 2025-02-11 | ⏱️ Read time: 16 min read

How Reliable Are Your Predictions? About To be considered reliable, a model must be calibrated…
πŸ“Œ Data vs. Business Strategy

πŸ—‚ Category: DATA SCIENCE

πŸ•’ Date: 2025-02-11 | ⏱️ Read time: 12 min read

There seems to be a consensus that leveraging data, analytics, and AI to create a…
πŸ“Œ Polars vs. Pandas β€” An Independent Speed Comparison

πŸ—‚ Category: DATA ENGINEERING

πŸ•’ Date: 2025-02-11 | ⏱️ Read time: 13 min read

Overview Introduction β€” Purpose and Reasons Speed is important when dealing with large amounts of…
πŸ“Œ Six Ways to Control Style and Content in Diffusion Models

πŸ—‚ Category: MACHINE LEARNING

πŸ•’ Date: 2025-02-10 | ⏱️ Read time: 9 min read

Stable Diffusion 1.5/2.0/2.1/XL 1.0, DALL-E, Imagen… In the past years, diffusion models have showcased stunning…
πŸ“Œ The Gamma Hurdle Distribution

πŸ—‚ Category: DATA SCIENCE

πŸ•’ Date: 2025-02-07 | ⏱️ Read time: 11 min read

Which Outcome Matters? Here is a common scenario : An A/B test was conducted, where…
πŸ“Œ Triangle Forecasting: Why Traditional Impact Estimates Are Inflated (And How to Fix Them)

πŸ—‚ Category: DATA SCIENCE

πŸ•’ Date: 2025-02-07 | ⏱️ Read time: 7 min read

Accurate impact estimations can make or break your business case. Yet, despite its importance, most…
πŸ“Œ I Tried Making my Own (Bad) LLM Benchmark to Cheat in Escape Rooms

πŸ—‚ Category: MACHINE LEARNING

πŸ•’ Date: 2025-02-07 | ⏱️ Read time: 20 min read

Recently, DeepSeek announced their latest model, R1, and article after article came out praising its…
πŸ“Œ Synthetic Data Generation with LLMs

πŸ—‚ Category: LARGE LANGUAGE MODELS

πŸ•’ Date: 2025-02-07 | ⏱️ Read time: 9 min read

Popularity of RAG Over the past two years while working with financial firms, I’ve observed…
πŸ“Œ The Method of Moments Estimator for Gaussian Mixture Models

πŸ—‚ Category: DATA SCIENCE

πŸ•’ Date: 2025-02-07 | ⏱️ Read time: 8 min read

Audio processing is one of the most important application domains of digital signal processing (DSP)…
πŸ“Œ A Comprehensive Guide to LLM Temperature

πŸ—‚ Category: LARGE LANGUAGE MODELS

πŸ•’ Date: 2025-02-07 | ⏱️ Read time: 8 min read

While building my own LLM-based application, I found many prompt engineering guides, but few equivalent…
πŸ“Œ How to Create Network Graph Visualizations in Microsoft PowerBI

πŸ—‚ Category: DATA VISUALIZATION

πŸ•’ Date: 2025-02-07 | ⏱️ Read time: 6 min read

Microsoft PowerBI is a one of the most popular business intelligence (BI) tools, and while…
πŸ“Œ Efficient Metric Collection in PyTorch: Avoiding the Performance Pitfalls of TorchMetrics

πŸ—‚ Category: MACHINE LEARNING

πŸ•’ Date: 2025-02-06 | ⏱️ Read time: 13 min read

Metric collection is an essential part of every machine learning project, enabling us to track…
πŸ“Œ Introduction to Minimum Cost Flow Optimization in Python

πŸ—‚ Category: DATA SCIENCE

πŸ•’ Date: 2025-02-06 | ⏱️ Read time: 21 min read

Minimum cost flow optimization minimizes the cost of moving flow through a network of nodes…
πŸ“Œ A Visual Guide to How Diffusion Models Work

πŸ—‚ Category: MACHINE LEARNING

πŸ•’ Date: 2025-02-06 | ⏱️ Read time: 26 min read

This article is aimed at those who want to understand exactly how diffusion models work,…
πŸ“Œ Ridge Regression: A Robust Path to Reliable Predictions

πŸ—‚ Category: DATA SCIENCE

πŸ•’ Date: 2025-01-30 | ⏱️ Read time: 11 min read

Learn how regularization reduces overfitting and improves model stability in linear regression.