Machine Learning
38.9K subscribers
3.73K photos
31 videos
40 files
1.29K links
Machine learning insights, practical tutorials, and clear explanations for beginners and aspiring data scientists. Follow the channel for models, algorithms, coding guides, and real-world ML applications.

Admin: @HusseinSheikho
Download Telegram
Cheat Sheet for Data Cleaning in Python Pandas

1. Data Overview

import pandas as pd
df = pd.read_csv('data.csv')   # Load CSV
df.head(5)                     # First 5 rows
df.info()                      # Data types and null counts
df.describe()                  # Summary statistics


2. Handling Missing Values and Duplicates

df.isnull().sum()              # Count missing values
df.dropna()                    # Remove rows with nulls
df.fillna(0)                   # Fill missing values with zeros
df.duplicated().sum()          # Count duplicates
df.drop_duplicates()           # Remove duplicates


3. Renaming and Type Conversion

df.rename(columns={'old': 'new'})   # Rename column
df.astype({'col': 'int'})           # Change data type
df.columns.str.strip()              # Remove spaces


4. Filtering and Selection

df.loc[df['col'] > 100]             # Filter by condition
df.iloc[0:5]                        # Select by row index
df['col'].isin(['A', 'B'])          # Filter by values


5. Sorting and Grouping

df.sort_values('col')               # Sort ascending
df.groupby('group')['val'].mean()   # Group and aggregate
df['col'].value_counts()            # Count unique values


6. Column Operations

df['new'] = df['col1'] + df['col2']               # New column
df['col'] = df['col'].apply(lambda x: x*2)        # Apply function
df.drop('col', axis=1)                            # Drop column


7. Saving Cleaned Data

df.to_csv('cleaned.csv', index=False)  # Export to CSV
❀6
πŸ”₯ Trending Repository: free-programming-books

πŸ“ Description: πŸ“š Freely available programming books

πŸ”— Repository URL: https://github.com/EbookFoundation/free-programming-books

🌐 Website: https://ebookfoundation.github.io/free-programming-books/

πŸ“– Readme: https://github.com/EbookFoundation/free-programming-books#readme

πŸ“Š Statistics:
🌟 Stars: 366K stars
πŸ‘€ Watchers:
🍴 Forks: 63.9K forks

πŸ’» Programming Languages: Python - HTML

🏷️ Related Topics:
#education #list #books #resource #hacktoberfest


==================================
🧠 By: https://t.iss.one/DataScienceM
πŸ”₯ Trending Repository: midday

πŸ“ Description: Invoicing, Time tracking, File reconciliation, Storage, Financial Overview & your own Assistant made for Freelancers

πŸ”— Repository URL: https://github.com/midday-ai/midday

🌐 Website: https://midday.ai

πŸ“– Readme: https://github.com/midday-ai/midday#readme

πŸ“Š Statistics:
🌟 Stars: 11.1K stars
πŸ‘€ Watchers: 68
🍴 Forks: 1K forks

πŸ’» Programming Languages: TypeScript - MDX - Rust - CSS - JavaScript - Dockerfile

🏷️ Related Topics:
#finance #typescript #nextjs #tailwind #supabase


==================================
🧠 By: https://t.iss.one/DataScienceM
πŸ”₯1
πŸ”₯ Trending Repository: yt-dlp

πŸ“ Description: A feature-rich command-line audio/video downloader

πŸ”— Repository URL: https://github.com/yt-dlp/yt-dlp

🌐 Website: https://discord.gg/H5MNcFW63r

πŸ“– Readme: https://github.com/yt-dlp/yt-dlp#readme

πŸ“Š Statistics:
🌟 Stars: 124K stars
πŸ‘€ Watchers: 640
🍴 Forks: 9.9K forks

πŸ’» Programming Languages: Python

🏷️ Related Topics:
#python #cli #downloader #youtube_dl #youtube_downloader #sponsorblock #yt_dlp


==================================
🧠 By: https://t.iss.one/DataScienceM
πŸ”₯ Trending Repository: django

πŸ“ Description: The Web framework for perfectionists with deadlines.

πŸ”— Repository URL: https://github.com/django/django

🌐 Website: https://www.djangoproject.com/

πŸ“– Readme: https://github.com/django/django#readme

πŸ“Š Statistics:
🌟 Stars: 84.7K stars
πŸ‘€ Watchers: 2.3k
🍴 Forks: 32.8K forks

πŸ’» Programming Languages: Python - Jinja

🏷️ Related Topics:
#python #django #views #framework #orm #web #apps #templates #models


==================================
🧠 By: https://t.iss.one/DataScienceM
πŸ”₯ Trending Repository: ComfyUI-nunchaku

πŸ“ Description: ComfyUI Plugin of Nunchaku

πŸ”— Repository URL: https://github.com/nunchaku-tech/ComfyUI-nunchaku

🌐 Website: https://nunchaku.tech/docs/ComfyUI-nunchaku/

πŸ“– Readme: https://github.com/nunchaku-tech/ComfyUI-nunchaku#readme

πŸ“Š Statistics:
🌟 Stars: 1.9K stars
πŸ‘€ Watchers: 21
🍴 Forks: 68 forks

πŸ’» Programming Languages: Python

🏷️ Related Topics:
#flux #quantization #diffusion #mlsys #comfyui #genai


==================================
🧠 By: https://t.iss.one/DataScienceM
πŸ”₯ Trending Repository: klipper

πŸ“ Description: Klipper is a 3d-printer firmware

πŸ”— Repository URL: https://github.com/Klipper3d/klipper

πŸ“– Readme: https://github.com/Klipper3d/klipper#readme

πŸ“Š Statistics:
🌟 Stars: 10.7K stars
πŸ‘€ Watchers: 368
🍴 Forks: 5.6K forks

πŸ’» Programming Languages: C - Python - Jupyter Notebook - C++ - Assembly - Makefile

🏷️ Related Topics: Not available

==================================
🧠 By: https://t.iss.one/DataScienceM
❀1
πŸ”₯ Trending Repository: Java

πŸ“ Description: All Algorithms implemented in Java

πŸ”— Repository URL: https://github.com/TheAlgorithms/Java

πŸ“– Readme: https://github.com/TheAlgorithms/Java#readme

πŸ“Š Statistics:
🌟 Stars: 62.8K stars
πŸ‘€ Watchers: 2.2k
🍴 Forks: 20.2K forks

πŸ’» Programming Languages: Java - Dockerfile

🏷️ Related Topics:
#search #java #algorithm #algorithms #sort #data_structures #sorting_algorithms #algorithm_challenges #hacktoberfest #algorithms_datastructures


==================================
🧠 By: https://t.iss.one/DataScienceM
πŸ”₯ Trending Repository: Folo

πŸ“ Description: 🧑 Follow everything in one place

πŸ”— Repository URL: https://github.com/RSSNext/Folo

🌐 Website: https://folo.is

πŸ“– Readme: https://github.com/RSSNext/Folo#readme

πŸ“Š Statistics:
🌟 Stars: 32K stars
πŸ‘€ Watchers: 119
🍴 Forks: 1.5K forks

πŸ’» Programming Languages: TypeScript - Swift - JavaScript - CSS - Kotlin - HTML

🏷️ Related Topics:
#rss #ai #rss_reader #blockchain #reader


==================================
🧠 By: https://t.iss.one/DataScienceM
πŸ”₯ Trending Repository: ai-agents-for-beginners

πŸ“ Description: 11 Lessons to Get Started Building AI Agents

πŸ”— Repository URL: https://github.com/microsoft/ai-agents-for-beginners

🌐 Website: https://microsoft.github.io/ai-agents-for-beginners/

πŸ“– Readme: https://github.com/microsoft/ai-agents-for-beginners#readme

πŸ“Š Statistics:
🌟 Stars: 34.9K stars
πŸ‘€ Watchers: 328
🍴 Forks: 11.1K forks

πŸ’» Programming Languages: Jupyter Notebook - Python - Dockerfile

🏷️ Related Topics:
#ai_agents #autogen #generative_ai #semantic_kernel #ai_agents_framework #agentic_framework #agentic_rag #agentic_ai


==================================
🧠 By: https://t.iss.one/DataScienceM
❀1
πŸ”₯ Trending Repository: verifiers

πŸ“ Description: Verifiers for LLM Reinforcement Learning

πŸ”— Repository URL: https://github.com/willccbb/verifiers

πŸ“– Readme: https://github.com/willccbb/verifiers#readme

πŸ“Š Statistics:
🌟 Stars: 2.1K stars
πŸ‘€ Watchers: 26
🍴 Forks: 267 forks

πŸ’» Programming Languages: Python

🏷️ Related Topics: Not available

==================================
🧠 By: https://t.iss.one/DataScienceM
πŸ”₯ Trending Repository: spotify-downloader

πŸ“ Description: Download your Spotify playlists and songs along with album art and metadata (from YouTube if a match is found).

πŸ”— Repository URL: https://github.com/spotDL/spotify-downloader

🌐 Website: https://spotdl.readthedocs.io/en/latest/

πŸ“– Readme: https://github.com/spotDL/spotify-downloader#readme

πŸ“Š Statistics:
🌟 Stars: 21.3K stars
πŸ‘€ Watchers: 197
🍴 Forks: 1.9K forks

πŸ’» Programming Languages: Python

🏷️ Related Topics:
#python #music #playlists #spotify #mp3 #song #hacktoberfest #youtube_music #song_lyrics #download_music #spotdl #spotdl_cli


==================================
🧠 By: https://t.iss.one/DataScienceM
πŸ”₯ Trending Repository: github-readme-stats

πŸ“ Description: ⚑ Dynamically generated stats for your github readmes

πŸ”— Repository URL: https://github.com/anuraghazra/github-readme-stats

🌐 Website: https://github-readme-stats.vercel.app

πŸ“– Readme: https://github.com/anuraghazra/github-readme-stats#readme

πŸ“Š Statistics:
🌟 Stars: 75.3K stars
πŸ‘€ Watchers: 291
🍴 Forks: 25.7K forks

πŸ’» Programming Languages: JavaScript - Shell

🏷️ Related Topics:
#serverless #dynamic #hacktoberfest #readme_generator #profile_readme #readme_stats


==================================
🧠 By: https://t.iss.one/DataScienceM
πŸ”₯ Trending Repository: nn-zero-to-hero

πŸ“ Description: Neural Networks: Zero to Hero

πŸ”— Repository URL: https://github.com/karpathy/nn-zero-to-hero

πŸ“– Readme: https://github.com/karpathy/nn-zero-to-hero#readme

πŸ“Š Statistics:
🌟 Stars: 15.5K stars
πŸ‘€ Watchers: 319
🍴 Forks: 2.2K forks

πŸ’» Programming Languages: Jupyter Notebook

🏷️ Related Topics: Not available

==================================
🧠 By: https://t.iss.one/DataScienceM
πŸ”₯ Trending Repository: system_prompts_leaks

πŸ“ Description: Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini

πŸ”— Repository URL: https://github.com/asgeirtj/system_prompts_leaks

πŸ“– Readme: https://github.com/asgeirtj/system_prompts_leaks#readme

πŸ“Š Statistics:
🌟 Stars: 9.2K stars
πŸ‘€ Watchers: 200
🍴 Forks: 2K forks

πŸ’» Programming Languages: JavaScript

🏷️ Related Topics:
#ai #gemini #openai #chatbots #prompts #claude #google_deepmind #large_language_models #llm #prompt_engineering #generative_ai #chatgpt #prompt_injection #anthropic


==================================
🧠 By: https://t.iss.one/DataScienceM
❀1
πŸ”₯ Trending Repository: generative-ai-for-beginners

πŸ“ Description: 21 Lessons, Get Started Building with Generative AI

πŸ”— Repository URL: https://github.com/microsoft/generative-ai-for-beginners

πŸ“– Readme: https://github.com/microsoft/generative-ai-for-beginners#readme

πŸ“Š Statistics:
🌟 Stars: 95.7K stars
πŸ‘€ Watchers: 827
🍴 Forks: 50.1K forks

πŸ’» Programming Languages: Jupyter Notebook - Python - JavaScript - TypeScript - Shell - PowerShell

🏷️ Related Topics:
#ai #azure #transformers #openai #gpt #language_model #semantic_search #dall_e #prompt_engineering #llms #generative_ai #generativeai #chatgpt #microsoft_for_beginners


==================================
🧠 By: https://t.iss.one/DataScienceM
πŸ”₯ Trending Repository: awesome-tuis

πŸ“ Description: List of projects that provide terminal user interfaces

πŸ”— Repository URL: https://github.com/rothgar/awesome-tuis

πŸ“– Readme: https://github.com/rothgar/awesome-tuis#readme

πŸ“Š Statistics:
🌟 Stars: 11.1K stars
πŸ‘€ Watchers: 155
🍴 Forks: 430 forks

πŸ’» Programming Languages: Not available

🏷️ Related Topics: Not available

==================================
🧠 By: https://t.iss.one/DataScienceM
✨ Meet BLIP: The Vision-Language Model Powering Image Captioning ✨

πŸ“– Table of Contents Meet BLIP: The Vision-Language Model Powering Image Captioning What Is Image Captioning and Why Is It Challenging? Why It’s Challenging Why Traditional Vision Tasks Aren’t Enough Configuring Your Development Environment A Brief History of Image Captioning Models…...

🏷️ #ComputerVision #DeepLearning #ImageCaptioning #MultimodalAI #Tutorial
❀1
πŸ”₯ Trending Repository: SurfSense

πŸ“ Description: Open Source Alternative to NotebookLM / Perplexity, connected to external sources such as Search Engines, Slack, Linear, Jira, ClickUp, Confluence, Notion, YouTube, GitHub, Discord and more. Join our discord:https://discord.gg/ejRNvftDp9

πŸ”— Repository URL: https://github.com/MODSetter/SurfSense

🌐 Website: https://www.surfsense.net

πŸ“– Readme: https://github.com/MODSetter/SurfSense#readme

πŸ“Š Statistics:
🌟 Stars: 6.7K stars
πŸ‘€ Watchers: 46
🍴 Forks: 507 forks

πŸ’» Programming Languages: Python - TypeScript - MDX - CSS - JavaScript - Dockerfile

🏷️ Related Topics:
#python #chrome_extension #slack #agent #jira #typescript #extension #ai #nextjs #agents #notion #perplexity #rag #fastapi #langchain #ollama #langgraph #nextjs15 #aceternity_ui #notebooklm


==================================
🧠 By: https://t.iss.one/DataScienceM
πŸ”₯ Trending Repository: Seelen-UI

πŸ“ Description: The Fully Customizable Desktop Environment for Windows 10/11.

πŸ”— Repository URL: https://github.com/eythaann/Seelen-UI

πŸ“– Readme: https://github.com/eythaann/Seelen-UI#readme

πŸ“Š Statistics:
🌟 Stars: 10.7K stars
πŸ‘€ Watchers: 45
🍴 Forks: 327 forks

πŸ’» Programming Languages: Rust - TypeScript - CSS - NSIS - SCSS - JavaScript

🏷️ Related Topics:
#windows #wallpaper #web #tools #toolbar #finder #desktop_application #dock #taskbar #tiling_window_manager #multilanguage #topbar #tauri #web_technologies #app_launcher #windows_11 #tauri_app #seelen_ui #seelen


==================================
🧠 By: https://t.iss.one/DataScienceM
πŸ”₯ Trending Repository: Summer2026-Internships

πŸ“ Description: Collection of Summer 2026 tech internships!

πŸ”— Repository URL: https://github.com/SimplifyJobs/Summer2026-Internships

🌐 Website: https://swelist.com/

πŸ“– Readme: https://github.com/SimplifyJobs/Summer2026-Internships#readme

πŸ“Š Statistics:
🌟 Stars: 40.3K stars
πŸ‘€ Watchers: 1.6k
🍴 Forks: 3K forks

πŸ’» Programming Languages: Not available

🏷️ Related Topics:
#university #jobs #internships #interview_preparation


==================================
🧠 By: https://t.iss.one/DataScienceM
❀1