Data Science Jupyter Notebooks
Explore the world of Data Science through Jupyter Notebooks: insights, tutorials, and tools to boost your data journey. Code, analyze, and visualize smarter with every post.
🔥 Trending Repository: free-llm-api-resources

📝 Description: A list of free LLM inference resources accessible via API.

🔗 Repository URL: https://github.com/cheahjs/free-llm-api-resources

📖 Readme: https://github.com/cheahjs/free-llm-api-resources#readme

📊 Statistics:
🌟 Stars: 8.5K stars
👀 Watchers: 138
🍴 Forks: 840 forks

💻 Programming Languages: Python

🏷️ Related Topics:
#ai #gemini #openai #llama #claude #llm


==================================
🧠 By: https://t.iss.one/DataScienceM
🔥 Trending Repository: claude-skills

📝 Description: 65 Specialized Skills for Full-Stack Developers. Transform Claude Code into your expert pair programmer.

🔗 Repository URL: https://github.com/Jeffallan/claude-skills

📖 Readme: https://github.com/Jeffallan/claude-skills#readme

📊 Statistics:
🌟 Stars: 498 stars
👀 Watchers: 6
🍴 Forks: 56 forks

💻 Programming Languages: Python - JavaScript - HTML - Astro - Shell - Makefile

🏷️ Related Topics:
#ai_agents #claude #claude_code #claude_skills #claude_marketplace


==================================
🔹 *DATA SCIENCE – INTERVIEW REVISION SHEET*

*1️⃣ What is Data Science?*
> “Data science is the process of using data, statistics, and machine learning to extract insights and build predictive or decision-making models.”

Difference from Data Analytics:
- Data Analytics → past & present (what/why)
- Data Science → future & automation (what will happen)

*2️⃣ Data Science Lifecycle (Very Important)*
1. Business problem understanding
2. Data collection
3. Data cleaning & preprocessing
4. Exploratory Data Analysis (EDA)
5. Feature engineering
6. Model building
7. Model evaluation
8. Deployment & monitoring
Interview line:
> “I always start from business understanding, not the model.”

*3️⃣ Data Types*
- Structured → tables, SQL
- Semi-structured → JSON, logs
- Unstructured → text, images

*4️⃣ Statistics You MUST Know*
- Central tendency: Mean, Median (prefer the median when outliers exist)
- Spread: Variance, Standard deviation
- Correlation ≠ causation
- Normal distribution
- Skewness (income → right-skewed)
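
To make the mean-versus-median point concrete, here is a minimal sketch using only the standard library. The income figures are made up for illustration, and the skewness formula is the simple moment-based version (mean of cubed z-scores), which is one of several definitions in use.

```python
import statistics

# Hypothetical monthly incomes; the last value is a deliberate outlier.
incomes = [2800, 3000, 3100, 3200, 3400, 3500, 40000]

mean = statistics.mean(incomes)
median = statistics.median(incomes)
stdev = statistics.pstdev(incomes)  # population standard deviation

# Moment-based skewness: mean of cubed z-scores. Positive means right-skewed.
skewness = sum(((x - mean) / stdev) ** 3 for x in incomes) / len(incomes)

print(f"mean={mean:.1f} median={median} skewness={skewness:.2f}")
```

One outlier drags the mean far above the median, while the median stays close to a typical value. That is the intuition worth stating in an interview.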

*5️⃣ Data Cleaning & Preprocessing*
Steps you should say in interviews:
1. Handle missing values
2. Remove duplicates
3. Treat outliers
4. Encode categorical variables
5. Scale numerical data
Scaling:
- Min-Max → bounded range (typically 0 to 1)
- Standardization → zero mean, unit variance (it does not make the data normally distributed)
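
The two scaling options can be sketched in a few lines of plain Python. Min-max maps values into a bounded range, and standardization centers them at mean 0 with standard deviation 1. The function names and the `ages` column below are illustrative, not taken from any library.

```python
import statistics

def min_max_scale(values):
    """Rescale values linearly into the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]  # constant column: nothing to scale
    return [(v - lo) / (hi - lo) for v in values]

def standardize(values):
    """Shift to mean 0 and scale to (population) standard deviation 1."""
    mu = statistics.mean(values)
    sigma = statistics.pstdev(values)
    if sigma == 0:
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]

ages = [18, 25, 32, 47, 60]  # hypothetical feature column
scaled = min_max_scale(ages)
z_scores = standardize(ages)
```

In a real pipeline the minimum, maximum, mean, and standard deviation should be computed on the training split only and then reused on validation and test data.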

*6️⃣ Feature Engineering (Interview Favorite)*
> “Feature engineering is creating meaningful input variables that improve model performance.”
Examples:
- Extract month from date
- Create customer lifetime value
- Binning age groups
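
A rough sketch of the three examples above on made-up customer records. The bin edges are arbitrary, and `total_spend` is only a simple stand-in for customer lifetime value, which in practice usually involves more than summing past orders.

```python
from datetime import date

def age_group(age):
    """Bin an age into a coarse, hypothetical set of groups."""
    if age < 18:
        return "under_18"
    if age < 35:
        return "18_34"
    if age < 55:
        return "35_54"
    return "55_plus"

# Hypothetical raw records: (signup date, age, list of order amounts).
customers = [
    (date(2023, 3, 14), 29, [40.0, 25.5, 60.0]),
    (date(2024, 11, 2), 61, [120.0]),
]

features = [
    {
        "signup_month": signup.month,   # extract month from date
        "age_group": age_group(age),    # binning
        "total_spend": sum(orders),     # simple proxy for lifetime value
    }
    for signup, age, orders in customers
]
```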

*7️⃣ Machine Learning Basics*
- Supervised learning: Regression, Classification
- Unsupervised learning: Clustering, Dimensionality reduction

*8️⃣ Common Algorithms (Know WHEN to use)*
- Regression: Linear regression → continuous output
- Classification: Logistic regression, Decision tree, Random forest, SVM
- Unsupervised: K-Means → segmentation, PCA → dimensionality reduction

*9️⃣ Overfitting vs Underfitting*
- Overfitting → model memorizes training data
- Underfitting → model too simple
Fixes for overfitting:
- Regularization
- More data
- Cross-validation (to detect it and to tune the model)
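
Cross-validation is easier to explain once you have seen how the folds are built. Below is a minimal sketch of k-fold index splitting in plain Python. The helper name `k_fold_indices` is made up for this post, and it uses contiguous folds without shuffling, so shuffle the data first if it is ordered.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_idx, val_idx) pairs for k contiguous folds."""
    # Spread the remainder over the first folds so sizes differ by at most 1.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        val_idx = list(range(start, start + size))
        train_idx = [i for i in range(n_samples) if i < start or i >= start + size]
        yield train_idx, val_idx
        start += size

folds = list(k_fold_indices(n_samples=10, k=3))
for train_idx, val_idx in folds:
    print(len(train_idx), "train rows,", len(val_idx), "validation rows")
```

Each row is used for validation exactly once. A large gap between the training score and the average validation score across folds is the usual sign of overfitting.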

*🔟 Model Evaluation Metrics*
- Classification: Accuracy, Precision, Recall, F1 score, ROC-AUC
- Regression: MAE, RMSE
Interview line:
> “Metric selection depends on the business problem.”
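
As a small illustration of why the metric choice matters, the sketch below computes the classification metrics by hand from a confusion matrix. The labels are invented, and the helper name is not from any library.

```python
def classification_metrics(y_true, y_pred):
    """Compute accuracy, precision, recall, and F1 for binary labels (1 = positive)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)) if (precision + recall) else 0.0
    return {
        "accuracy": (tp + tn) / len(pairs),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }

# Hypothetical labels: two positives, the model finds only one of them.
y_true = [1, 0, 0, 0, 0, 0, 0, 0, 1, 0]
y_pred = [0, 0, 0, 0, 0, 0, 0, 0, 1, 0]
metrics = classification_metrics(y_true, y_pred)
print(metrics)
```

Accuracy looks strong here even though half of the positives are missed, which is why recall or F1 is often the better headline number when positives are rare and costly to miss.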

*1️⃣1️⃣ Imbalanced Data Techniques*
- Class weighting
- Oversampling / undersampling
- SMOTE
- Metric preference: Precision, Recall, F1, ROC-AUC
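
Random oversampling is the simplest of these techniques to sketch. The helper below is a hypothetical illustration that duplicates minority-class rows until every class matches the largest one. SMOTE differs in that it creates new synthetic points by interpolating between minority neighbors instead of copying rows.

```python
import random
from collections import Counter

def random_oversample(rows, labels, seed=0):
    """Duplicate minority-class rows at random until all classes match the largest."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_rows, out_labels = list(rows), list(labels)
    for cls, n in counts.items():
        pool = [r for r, y in zip(rows, labels) if y == cls]
        for _ in range(target - n):
            out_rows.append(rng.choice(pool))
            out_labels.append(cls)
    return out_rows, out_labels

# Hypothetical training data: five negatives, one positive.
rows = [[0.1], [0.2], [0.3], [0.4], [0.5], [0.9]]
labels = [0, 0, 0, 0, 0, 1]
X_bal, y_bal = random_oversample(rows, labels)
print(Counter(y_bal))
```

Resample the training split only. Oversampling before the train/test split leaks copies of the same rows into the evaluation data.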

*1️⃣2️⃣ Python for Data Science*
Core libraries:
- NumPy
- Pandas
- Matplotlib / Seaborn
- Scikit-learn
Must know:
- loc vs iloc
- Groupby
- Vectorization
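
A short pandas sketch of the three must-know items, assuming pandas is installed. The DataFrame contents are invented for illustration.

```python
import pandas as pd

df = pd.DataFrame(
    {"city": ["Pune", "Delhi", "Pune", "Delhi"], "sales": [100, 150, 200, 50]},
    index=["a", "b", "c", "d"],
)

# loc selects by label, iloc selects by integer position.
by_label = df.loc["b", "sales"]   # row labelled "b", column "sales"
by_position = df.iloc[1, 1]       # second row, second column

# Groupby: total sales per city.
totals = df.groupby("city")["sales"].sum()

# Vectorization: one column-wide operation instead of a Python loop.
df["sales_with_tax"] = df["sales"] * 1.18
```

Both lookups return the same cell here because row "b" happens to be at position 1. They diverge as soon as the index is reordered or filtered.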

*1️⃣3️⃣ Model Deployment (Basic Understanding)*
- Batch prediction
- Real-time prediction
- Model monitoring
- Model drift
Interview line:
> “Models must be monitored because data changes over time.”
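
One crude way to picture drift monitoring is to check how far a feature's live mean has moved from its training mean. The sketch below is a toy heuristic with a made-up threshold, not a production drift test. Real monitoring setups often use distribution-level tests instead.

```python
import statistics

def mean_shift_in_std(train_values, live_values):
    """Distance of the live mean from the training mean, in training std units."""
    mu = statistics.mean(train_values)
    sigma = statistics.pstdev(train_values)
    live_mu = statistics.mean(live_values)
    if sigma == 0:
        return 0.0 if live_mu == mu else float("inf")
    return abs(live_mu - mu) / sigma

train = [10, 12, 11, 13, 9, 10, 12]   # hypothetical feature values at training time
live_ok = [11, 10, 12, 11]            # recent values, similar distribution
live_shifted = [18, 20, 19, 21]       # recent values after the data changed

DRIFT_THRESHOLD = 2.0  # hypothetical alert threshold
drifted = mean_shift_in_std(train, live_shifted) > DRIFT_THRESHOLD
print("drift detected:", drifted)
```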

*1️⃣4️⃣ Explain Your Project (Template)*
> “The goal was _. I cleaned the data using _. I performed EDA to identify _. I built a _ model and evaluated it using _. The final outcome was _.”

*1️⃣5️⃣ HR-Style Data Science Answers*
Why data science?
> “I enjoy solving complex problems using data and building models that automate decisions.”
Biggest challenge:
> “Handling messy real-world data.”
Strength:
> “Strong foundation in statistics and ML.”

*🔥 LAST-DAY INTERVIEW TIPS*
- Explain intuition, not math
- Don’t jump to algorithms immediately
- Always connect model → business value
- State assumptions clearly
🔥 Trending Repository: Personal_AI_Infrastructure

📝 Description: Agentic AI Infrastructure for magnifying HUMAN capabilities.

🔗 Repository URL: https://github.com/danielmiessler/Personal_AI_Infrastructure

📖 Readme: https://github.com/danielmiessler/Personal_AI_Infrastructure#readme

📊 Statistics:
🌟 Stars: 7.2K stars
👀 Watchers: 120
🍴 Forks: 1.1K forks

💻 Programming Languages: TypeScript - Vue - Python - Shell - CSS - Handlebars

🏷️ Related Topics:
#productivity #ai #humans #augmentation


==================================
🔥 Trending Repository: rowboat

📝 Description: Open-source AI coworker, with memory

🔗 Repository URL: https://github.com/rowboatlabs/rowboat

🌐 Website: https://www.rowboatlabs.com

📖 Readme: https://github.com/rowboatlabs/rowboat#readme

📊 Statistics:
🌟 Stars: 4.9K stars
👀 Watchers: 38
🍴 Forks: 388 forks

💻 Programming Languages: TypeScript - CSS - MDX - Python - JavaScript - Dockerfile

🏷️ Related Topics:
#productivity #open_source #ai #orchestration #multiagent #agents #ai_agents #llm #generative_ai #chatgpt #opeani #ai_agents_automation #claude_code #agents_sdk #claude_cowork


==================================
🔥 Trending Repository: cinny

📝 Description: Yet another matrix client

🔗 Repository URL: https://github.com/cinnyapp/cinny

🌐 Website: https://cinny.in

📖 Readme: https://github.com/cinnyapp/cinny#readme

📊 Statistics:
🌟 Stars: 2.8K stars
👀 Watchers: 19
🍴 Forks: 385 forks

💻 Programming Languages: TypeScript

🏷️ Related Topics:
#client #reactjs #matrix #hacktoberfest #matrix_client #matrix_org #cinny #cinnyapp


==================================
🔥 Trending Repository: aios-core

📝 Description: Synkra AIOS: AI-Orchestrated System for Full Stack Development - Core Framework v4.0

🔗 Repository URL: https://github.com/SynkraAI/aios-core

🌐 Website: https://github.com/allfluence/aios-core

📖 Readme: https://github.com/SynkraAI/aios-core#readme

📊 Statistics:
🌟 Stars: 291 stars
👀 Watchers: 29
🍴 Forks: 171 forks

💻 Programming Languages: JavaScript - Python - Shell - Handlebars - PLpgSQL - CSS

🏷️ Related Topics:
#nodejs #cli #development #automation #framework #typescript #ai #orchestration #fullstack #agents #ai_agents #claude


==================================
🔥 Trending Repository: MTProxy

📝 Description: No description available

🔗 Repository URL: https://github.com/TelegramMessenger/MTProxy

📖 Readme: https://github.com/TelegramMessenger/MTProxy#readme

📊 Statistics:
🌟 Stars: 5.8K stars
👀 Watchers: 233
🍴 Forks: 994 forks

💻 Programming Languages: C - Makefile

🏷️ Related Topics: Not available

==================================
🔥 Trending Repository: superhuman

📝 Description: No description available

🔗 Repository URL: https://github.com/google-deepmind/superhuman

📖 Readme: https://github.com/google-deepmind/superhuman#readme

📊 Statistics:
🌟 Stars: 268 stars
👀 Watchers: 14
🍴 Forks: 21 forks

💻 Programming Languages: TeX

🏷️ Related Topics: Not available

==================================
🔥 Trending Repository: slime

📝 Description: slime is an LLM post-training framework for RL Scaling.

🔗 Repository URL: https://github.com/THUDM/slime

🌐 Website: https://thudm.github.io/slime

📖 Readme: https://github.com/THUDM/slime#readme

📊 Statistics:
🌟 Stars: 4K stars
👀 Watchers: 16
🍴 Forks: 523 forks

💻 Programming Languages: Python - Shell

🏷️ Related Topics: Not available

==================================
🔥 Trending Repository: DebugSwift

📝 Description: A toolkit to make debugging iOS applications easier 🚀

🔗 Repository URL: https://github.com/DebugSwift/DebugSwift

📖 Readme: https://github.com/DebugSwift/DebugSwift#readme

📊 Statistics:
🌟 Stars: 1.3K stars
👀 Watchers: 7
🍴 Forks: 118 forks

💻 Programming Languages: Swift

🏷️ Related Topics:
#debugger #swift #debugging #ui #networking #log #analytics #analysis #view #cocoapods #sandbox #uikit #debug #performance_analysis #crashlytics #hacktoberfest #leak_detection #logs_analysis #layout_debugger #swift6


==================================
SQL Order of Execution

1 → FROM (Tables selected).
2 → WHERE (Filters applied).
3 → GROUP BY (Rows grouped).
4 → HAVING (Filter on grouped data).
5 → SELECT (Columns selected).
6 → ORDER BY (Sort the data).
7 → LIMIT (Restrict number of rows).
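
To see the logical order in action, here is a small sketch that runs one query against an in-memory SQLite database from Python. The `Dept` column and all rows are invented for this example, and each clause is tagged with its step number from the list above.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Employees (Name TEXT, Dept TEXT, Salary INTEGER)")
conn.executemany(
    "INSERT INTO Employees VALUES (?, ?, ?)",
    [
        ("Asha", "Eng", 120), ("Ben", "Eng", 100), ("Chen", "Eng", 40),
        ("Dina", "Ops", 90), ("Eli", "Ops", 30), ("Fay", "HR", 60),
    ],
)

query = """
SELECT Dept, COUNT(*) AS headcount, AVG(Salary) AS avg_salary  -- 5. SELECT
FROM Employees                                                 -- 1. FROM
WHERE Salary >= 50                                             -- 2. WHERE
GROUP BY Dept                                                  -- 3. GROUP BY
HAVING AVG(Salary) >= 70                                       -- 4. HAVING
ORDER BY avg_salary DESC                                       -- 6. ORDER BY
LIMIT 1                                                        -- 7. LIMIT
"""
rows = conn.execute(query).fetchall()
print(rows)
```

Because WHERE runs before GROUP BY, the two salaries below 50 are dropped before any average is computed. HAVING then filters whole groups, which is why it can use AVG while WHERE cannot.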

Common Queries To Practice ↓

↬ Find the second-highest salary:

SELECT MAX(Salary) FROM Employees WHERE Salary < (SELECT MAX(Salary) FROM Employees);

↬ Find duplicate records:

SELECT Name, COUNT(*)
FROM Emp
GROUP BY Name
HAVING COUNT(*) > 1;
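
Both practice queries can be tried without installing a database, using Python's built-in sqlite3 module. The table contents below are made up, and an ORDER BY is added to the duplicates query only to keep the output order stable.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Second-highest salary: note the tie at the top salary.
conn.execute("CREATE TABLE Employees (Name TEXT, Salary INTEGER)")
conn.executemany(
    "INSERT INTO Employees VALUES (?, ?)",
    [("Asha", 120), ("Ben", 100), ("Chen", 120), ("Dina", 90)],
)
second_highest = conn.execute(
    "SELECT MAX(Salary) FROM Employees "
    "WHERE Salary < (SELECT MAX(Salary) FROM Employees)"
).fetchone()[0]

# Duplicate records: names that appear more than once.
conn.execute("CREATE TABLE Emp (Name TEXT)")
conn.executemany(
    "INSERT INTO Emp VALUES (?)",
    [("Asha",), ("Ben",), ("Asha",), ("Dina",), ("Ben",), ("Asha",)],
)
duplicates = conn.execute(
    "SELECT Name, COUNT(*) FROM Emp GROUP BY Name HAVING COUNT(*) > 1 ORDER BY Name"
).fetchall()

print(second_highest, duplicates)
```

The subquery approach handles ties at the top salary correctly, since it looks for the largest value strictly below the maximum. If every row has the same salary, it returns NULL (None in Python).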


🔥 Trending Repository: zvec

📝 Description: A lightweight, lightning-fast, in-process vector database

🔗 Repository URL: https://github.com/alibaba/zvec

🌐 Website: https://zvec.org/en/

📖 Readme: https://github.com/alibaba/zvec#readme

📊 Statistics:
🌟 Stars: 967 stars
👀 Watchers: 4
🍴 Forks: 56 forks

💻 Programming Languages: C++ - SWIG - Python - C - CMake - ANTLR

🏷️ Related Topics:
#embedded_database #rag #vector_search #ann_search #vectordb


==================================
🔥 Trending Repository: wifi-densepose

📝 Description: Production-ready implementation of InvisPose - a revolutionary WiFi-based dense human pose estimation system that enables real-time full-body tracking through walls using commodity mesh routers

🔗 Repository URL: https://github.com/ruvnet/wifi-densepose

📖 Readme: https://github.com/ruvnet/wifi-densepose#readme

📊 Statistics:
🌟 Stars: 6K stars
👀 Watchers: 39
🍴 Forks: 544 forks

💻 Programming Languages: Python - Rust - JavaScript - Shell - HTML - CSS

🏷️ Related Topics: Not available

==================================
🔥 Trending Repository: unstract

📝 Description: No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents

🔗 Repository URL: https://github.com/Zipstack/unstract

🌐 Website: https://unstract.com

📖 Readme: https://github.com/Zipstack/unstract#readme

📊 Statistics:
🌟 Stars: 6.2K stars
👀 Watchers: 46
🍴 Forks: 588 forks

💻 Programming Languages: Python - JavaScript - Shell - CSS

🏷️ Related Topics:
#unstructured_data #etl_pipeline #llm_platform


==================================
🔥 Trending Repository: letta-code

📝 Description: The memory-first coding agent

🔗 Repository URL: https://github.com/letta-ai/letta-code

🌐 Website: https://docs.letta.com/letta-code

📖 Readme: https://github.com/letta-ai/letta-code#readme

📊 Statistics:
🌟 Stars: 1.1K stars
👀 Watchers: 7
🍴 Forks: 133 forks

💻 Programming Languages: TypeScript

🏷️ Related Topics: Not available

==================================
🔥 Trending Repository: ruby

📝 Description: The Ruby Programming Language

🔗 Repository URL: https://github.com/ruby/ruby

🌐 Website: https://www.ruby-lang.org/

📖 Readme: https://github.com/ruby/ruby#readme

📊 Statistics:
🌟 Stars: 23.3K stars
👀 Watchers: 1.1K
🍴 Forks: 5.6K forks

💻 Programming Languages: Ruby - C - Rust - C++ - Yacc - HTML

🏷️ Related Topics:
#ruby #c #language #programming_language #rust #jit #object_oriented #ruby_language


==================================
🔥 Trending Repository: gogcli

📝 Description: Google Suite CLI: Gmail, GCal, GDrive, GContacts.

🔗 Repository URL: https://github.com/steipete/gogcli

🌐 Website: https://gogcli.sh

📖 Readme: https://github.com/steipete/gogcli#readme

📊 Statistics:
🌟 Stars: 2.7K stars
👀 Watchers: 19
🍴 Forks: 242 forks

💻 Programming Languages: Go - HTML - Shell

🏷️ Related Topics:
#google #gmail #gdrive #gcal #gcontacts


==================================
🔥 Trending Repository: moonshine

📝 Description: Fast and accurate automatic speech recognition (ASR) for edge devices

🔗 Repository URL: https://github.com/moonshine-ai/moonshine

📖 Readme: https://github.com/moonshine-ai/moonshine#readme

📊 Statistics:
🌟 Stars: 3.6K stars
👀 Watchers: 43
🍴 Forks: 175 forks

💻 Programming Languages: C - C++ - Python - Swift - Java - Jupyter Notebook

🏷️ Related Topics: Not available

==================================
🔥 Trending Repository: brave-browser

📝 Description: Brave browser for Android, iOS, Linux, macOS, Windows.

🔗 Repository URL: https://github.com/brave/brave-browser

🌐 Website: https://brave.com

📖 Readme: https://github.com/brave/brave-browser#readme

📊 Statistics:
🌟 Stars: 21.5K stars
👀 Watchers: 388
🍴 Forks: 3K forks

💻 Programming Languages: Not available

🏷️ Related Topics:
#windows #macos #linux #browser #chromium #brave


==================================