Data Science Jupyter Notebooks
12.5K subscribers
Explore the world of Data Science through Jupyter Notebooks: insights, tutorials, and tools to boost your data journey. Code, analyze, and visualize smarter with every post.
🔥 Trending Repository: calibre

📝 Description: The official source code repository for the calibre ebook manager

🔗 Repository URL: https://github.com/kovidgoyal/calibre

🌐 Website: https://calibre-ebook.com

📖 Readme: https://github.com/kovidgoyal/calibre#readme

📊 Statistics:
🌟 Stars: 23.5K stars
👀 Watchers: 385
🍴 Forks: 2.5K forks

💻 Programming Languages: Python - C - C++ - HTML - Shell - XSLT

🏷️ Related Topics:
#python #ebook #epub #kindle #ebook_manager #calibre #ebook_reader #ebooks #ebook_formats #epub_generation


==================================
🧠 By: https://t.iss.one/DataScienceM
🔥 Trending Repository: vibetunnel

📝 Description: Turn any browser into your terminal & command your agents on the go.

🔗 Repository URL: https://github.com/amantus-ai/vibetunnel

🌐 Website: https://vt.sh

📖 Readme: https://github.com/amantus-ai/vibetunnel#readme

📊 Statistics:
🌟 Stars: 3.4K stars
👀 Watchers: 11
🍴 Forks: 223 forks

💻 Programming Languages: TypeScript - Swift - HTML - Shell - JavaScript - Zig

🏷️ Related Topics:
#terminal #remote #vibecoding


==================================
🧠 By: https://t.iss.one/DataScienceM
🔥 Trending Repository: CodexBar

📝 Description: Show usage stats for OpenAI Codex and Claude Code, without having to login.

🔗 Repository URL: https://github.com/steipete/CodexBar

🌐 Website: https://codexbar.app

📖 Readme: https://github.com/steipete/CodexBar#readme

📊 Statistics:
🌟 Stars: 3.6K stars
👀 Watchers: 14
🍴 Forks: 250 forks

💻 Programming Languages: Swift - Shell - JavaScript

🏷️ Related Topics:
#swift #ai #codex #claude_code


==================================
🧠 By: https://t.iss.one/DataScienceM
🔥 Trending Repository: prek

📝 Description: ⚡ Better `pre-commit`, re-engineered in Rust

🔗 Repository URL: https://github.com/j178/prek

🌐 Website: https://prek.j178.dev/

📖 Readme: https://github.com/j178/prek#readme

📊 Statistics:
🌟 Stars: 4.1K stars
👀 Watchers: 13
🍴 Forks: 126 forks

💻 Programming Languages: Rust

🏷️ Related Topics:
#git #pre_commit #git_hooks


==================================
🧠 By: https://t.iss.one/DataScienceM
🔥 Trending Repository: Stable-Video-Infinity

📝 Description: [ICLR 26] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling

🔗 Repository URL: https://github.com/vita-epfl/Stable-Video-Infinity

🌐 Website: https://stable-video-infinity.github.io/homepage/

📖 Readme: https://github.com/vita-epfl/Stable-Video-Infinity#readme

📊 Statistics:
🌟 Stars: 1.6K stars
👀 Watchers: 30
🍴 Forks: 128 forks

💻 Programming Languages: Python - Shell

🏷️ Related Topics:
#dance_generation #long_video_generation #audio_driven_talking_face #video_diffusion_transformers #end_to_end_filming


==================================
🧠 By: https://t.iss.one/DataScienceM
Top 100 Data Science Interview Questions ✅

Data Science Basics
1. What is data science and how is it different from data analytics?
2. What are the key steps in a data science lifecycle?
3. What types of problems does data science solve?
4. What skills does a data scientist need in real projects?
5. What is the difference between structured and unstructured data?
6. What is exploratory data analysis and why do you do it first?
7. What are common data sources in real companies?
8. What is feature engineering?
9. What is the difference between supervised and unsupervised learning?
10. What is bias in data and how does it affect models?

Statistics and Probability
11. What is the difference between mean, median, and mode?
12. What are standard deviation and variance?
13. What is a probability distribution?
14. What is the normal distribution and where is it used?
15. What is skewness and kurtosis?
16. What is correlation vs causation?
17. What is hypothesis testing?
18. What are Type I and Type II errors?
19. What is a p-value?
20. What is a confidence interval?

Data Cleaning and Preprocessing
21. How do you handle missing values?
22. How do you treat outliers?
23. What is data normalization and standardization?
24. When do you use Min-Max scaling vs Z-score?
25. How do you handle imbalanced datasets?
26. What is one-hot encoding?
27. What is label encoding?
28. How do you detect data leakage?
29. What is duplicate data and how do you handle it?
30. How do you validate data quality?

Python for Data Science
31. Why is Python popular in data science?
32. Difference between list, tuple, set, and dictionary?
33. What is NumPy and why is it fast?
34. What is Pandas and where do you use it?
35. Difference between loc and iloc?
36. What are vectorized operations?
37. What is a lambda function?
38. What is a list comprehension?
39. How do you handle large datasets in Python?
40. What are common Python libraries used in data science?

Data Visualization
41. Why is data visualization important?
42. Difference between bar chart and histogram?
43. When do you use box plots?
44. What does a scatter plot show?
45. What are common mistakes in data visualization?
46. Difference between Seaborn and Matplotlib?
47. What is a heatmap used for?
48. How do you visualize distributions?
49. What is dashboarding?
50. How do you choose the right chart?

Machine Learning Basics
51. What is machine learning?
52. Difference between regression and classification?
53. What is overfitting and underfitting?
54. What is train-test split?
55. What is cross-validation?
56. What is bias-variance tradeoff?
57. What is feature selection?
58. What is model evaluation?
59. What is a baseline model?
60. How do you choose a model?

Supervised Learning
61. How does linear regression work?
62. Assumptions of linear regression?
63. What is logistic regression?
64. What is a decision tree?
65. What is a random forest?
66. What is KNN and when do you use it?
67. What is SVM?
68. How does Naive Bayes work?
69. What are ensemble methods?
70. How do you tune hyperparameters?

Unsupervised Learning
71. What is clustering?
72. Difference between K-means and hierarchical clustering?
73. How do you choose the value of K?
74. What is PCA?
75. Why is dimensionality reduction needed?
76. What is anomaly detection?
77. What is association rule mining?
78. What is DBSCAN?
79. What is cosine similarity?
80. Where is unsupervised learning used?

Model Evaluation Metrics
81. What is accuracy and when is it misleading?
82. What are precision and recall?
83. What is F1 score?
84. What is the ROC curve?
85. What is AUC?
86. Difference between confusion matrix metrics?
87. What is log loss?
88. What is RMSE?
89. What metric do you use for imbalanced data?
90. How do business metrics link to ML metrics?
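
Questions 81 to 83 and 89 are easier to answer with numbers in hand. Below is a minimal sketch in plain Python, assuming binary labels where 1 is the positive class; the function name `classification_metrics` and the toy data are illustrative and not taken from the post. It shows how a model that never predicts the positive class can still score high accuracy on imbalanced data.

```python
def classification_metrics(y_true, y_pred):
    """Accuracy, precision, recall and F1 for binary labels (1 = positive)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives

    accuracy = (tp + tn) / len(pairs)
    # Return 0.0 when a denominator is zero (no predicted or no actual positives).
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Toy imbalanced data: 95 negatives, 5 positives.
y_true = [0] * 95 + [1] * 5
# A "model" that always predicts the majority class.
y_pred = [0] * 100

metrics = classification_metrics(y_true, y_pred)
print(metrics)
```

Accuracy comes out at 0.95 while recall and F1 are 0.0, which is why precision, recall, or F1 are usually preferred over accuracy when classes are imbalanced.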
🔥 Trending Repository: qui

📝 Description: A fast, single-binary qBittorrent web UI: manage multiple instances, automate torrent workflows, and cross-seed across trackers.

🔗 Repository URL: https://github.com/autobrr/qui

🌐 Website: https://getqui.com

📖 Readme: https://github.com/autobrr/qui#readme

📊 Statistics:
🌟 Stars: 2.6K stars
👀 Watchers: 8
🍴 Forks: 74 forks

💻 Programming Languages: Go - TypeScript - CSS - Python - Makefile - HTML

🏷️ Related Topics:
#go #golang #qbittorrent #libtorrent #workflows #qbit #cross_seed #cross_seeding


==================================
🧠 By: https://t.iss.one/DataScienceM
🔥 Trending Repository: nanochat

📝 Description: The best ChatGPT that $100 can buy.

🔗 Repository URL: https://github.com/karpathy/nanochat

📖 Readme: https://github.com/karpathy/nanochat#readme

📊 Statistics:
🌟 Stars: 41.4K stars
👀 Watchers: 289
🍴 Forks: 5.4K forks

💻 Programming Languages: Python - Jupyter Notebook - HTML - Shell

🏷️ Related Topics: Not available

==================================
🧠 By: https://t.iss.one/DataScienceM
🔥 Trending Repository: rag-from-scratch

📝 Description: No description available

🔗 Repository URL: https://github.com/langchain-ai/rag-from-scratch

📖 Readme: https://github.com/langchain-ai/rag-from-scratch#readme

📊 Statistics:
🌟 Stars: 6.8K stars
👀 Watchers: 60
🍴 Forks: 1.8K forks

💻 Programming Languages: Jupyter Notebook

🏷️ Related Topics: Not available

==================================
🧠 By: https://t.iss.one/DataScienceM
🔥 Trending Repository: review-prompts

📝 Description: AI review prompts

🔗 Repository URL: https://github.com/masoncl/review-prompts

📖 Readme: https://github.com/masoncl/review-prompts#readme

📊 Statistics:
🌟 Stars: 192 stars
👀 Watchers: 9
🍴 Forks: 29 forks

💻 Programming Languages: Python - Shell

🏷️ Related Topics: Not available

==================================
🧠 By: https://t.iss.one/DataScienceM
🔥 Trending Repository: skills

📝 Description: Skills Catalog for Codex

🔗 Repository URL: https://github.com/openai/skills

📖 Readme: https://github.com/openai/skills#readme

📊 Statistics:
🌟 Stars: 2.6K stars
👀 Watchers: 26
🍴 Forks: 166 forks

💻 Programming Languages: Python - Shell - JavaScript

🏷️ Related Topics: Not available

==================================
🧠 By: https://t.iss.one/DataScienceM
🔥 Trending Repository: ccpm

📝 Description: Project management system for Claude Code using GitHub Issues and Git worktrees for parallel agent execution.

🔗 Repository URL: https://github.com/automazeio/ccpm

🌐 Website: https://automaze.io/ccpm

📖 Readme: https://github.com/automazeio/ccpm#readme

📊 Statistics:
🌟 Stars: 6.5K stars
👀 Watchers: 39
🍴 Forks: 684 forks

💻 Programming Languages: Shell - Batchfile

🏷️ Related Topics:
#project_management #ai_agents #claude #ai_coding #vibe_coding #claude_code


==================================
🧠 By: https://t.iss.one/DataScienceM
🔥 Trending Repository: vm0

📝 Description: the easiest way to run natural language-described workflows automatically

🔗 Repository URL: https://github.com/vm0-ai/vm0

🌐 Website: https://vm0.ai

📖 Readme: https://github.com/vm0-ai/vm0#readme

📊 Statistics:
🌟 Stars: 522 stars
👀 Watchers: 1
🍴 Forks: 20 forks

💻 Programming Languages: TypeScript - MDX - Shell - CSS - Rust - JavaScript

🏷️ Related Topics:
#react #cli #typescript #containers #sandbox #cloudflare #codex #dev_tools #ai_agent #ai_runtime #gemini_cli #agentic_workflow #claude_code #context_engineer #ai_sandbox


==================================
🧠 By: https://t.iss.one/DataScienceM
🎯 Want to Upskill in IT? Try Our FREE 2026 Learning Kits!

SPOTO gives you free, instant access to high-quality, updated resources that help you study smarter and pass exams faster.
✅ Latest Exam Materials:
Covering #Python, #Cisco, #PMI, #Fortinet, #AWS, #Azure, #AI, #Excel, #comptia, #ITIL, #cloud & more!
✅ 100% Free, No Sign-up:
All materials are instantly downloadable

✅ What’s Inside:
・📘 IT Certs E-book: https://bit.ly/3Mlu5ez
・📝 IT Exams Skill Test: https://bit.ly/3NVrgRU
・🎓 Free IT courses: https://bit.ly/3M9h5su
・🤖 Free PMP Study Guide: https://bit.ly/4te3EIn
・☁️ Free Cloud Study Guide: https://bit.ly/4kgFVDs

👉 Join our IT learning circle for resources and support:
https://chat.whatsapp.com/FlG2rOYVySLEHLKXF3nKGB

💬 Want exam help? Chat with an admin now!
wa.link/8fy3x4
🔥 Trending Repository: claude-code-hooks-mastery

📝 Description: Master Claude Code Hooks

🔗 Repository URL: https://github.com/disler/claude-code-hooks-mastery

📖 Readme: https://github.com/disler/claude-code-hooks-mastery#readme

📊 Statistics:
🌟 Stars: 2.3K stars
👀 Watchers: 52
🍴 Forks: 498 forks

💻 Programming Languages: Python - TypeScript

🏷️ Related Topics: Not available

==================================
🧠 By: https://t.iss.one/DataScienceM
🔥 Trending Repository: anki

📝 Description: Anki is a smart spaced repetition flashcard program

🔗 Repository URL: https://github.com/ankitects/anki

🌐 Website: https://apps.ankiweb.net

📖 Readme: https://github.com/ankitects/anki#readme

📊 Statistics:
🌟 Stars: 26.1K stars
👀 Watchers: 349
🍴 Forks: 2.8K forks

💻 Programming Languages: Rust - Python - Svelte - TypeScript - SCSS - Shell

🏷️ Related Topics: Not available

==================================
🧠 By: https://t.iss.one/DataScienceM
🔥 Trending Repository: opentelemetry-collector-contrib

📝 Description: Contrib repository for the OpenTelemetry Collector

🔗 Repository URL: https://github.com/open-telemetry/opentelemetry-collector-contrib

🌐 Website: https://opentelemetry.io

📖 Readme: https://github.com/open-telemetry/opentelemetry-collector-contrib#readme

📊 Statistics:
🌟 Stars: 4.3K stars
👀 Watchers: 62
🍴 Forks: 3.3K forks

💻 Programming Languages: Go - Makefile - Go Template - Shell - Dockerfile - Jinja

🏷️ Related Topics:
#opentelemetry #open_telemetry


==================================
🧠 By: https://t.iss.one/DataScienceM
🔥 Trending Repository: likec4

📝 Description: Visualize, collaborate, and evolve the software architecture with always actual and live diagrams from your code

🔗 Repository URL: https://github.com/likec4/likec4

🌐 Website: https://likec4.dev

📖 Readme: https://github.com/likec4/likec4#readme

📊 Statistics:
🌟 Stars: 1.4K stars
👀 Watchers: 17
🍴 Forks: 117 forks

💻 Programming Languages: TypeScript - MDX - Astro - JavaScript - CSS - Langium

🏷️ Related Topics:
#architecture #diagrams #c4 #architecture_as_code


==================================
🧠 By: https://t.iss.one/DataScienceM
🤖 A tool that builds ML models from a text description

Inside is a complete agent system that automates the whole ML development cycle, from idea to finished solution, without manual fiddling with architecture and pipelines.

How it works:
➖ You describe the task in plain text and provide the data. If necessary, the system infers the schema itself

➖ Under the hood, a group of AI agents does the work: one designs the model, a second writes the code, a third evaluates quality and corrects errors

➖ If data is lacking, the system can generate a synthetic dataset for testing

➖ Ray is supported for parallel model exploration and scaling across cores or clusters

➖ It connects to any cloud or local model via LiteLLM


It's ideal for rapid prototyping and experiments, when getting a working result quickly matters. Get it here:
https://github.com/plexe-ai/plexe
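
The design, code, evaluate loop described above can be pictured with a toy sketch. This is not plexe's API: every name here (`design`, `write_code`, `evaluate`, `build_model`, `Candidate`) is hypothetical, and the three "agents" are stub functions with faked scores, standing in for LLM-backed agents purely to show the control flow.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    plan: str
    code: str
    score: float = 0.0


def design(task: str, feedback: str) -> str:
    """Stub designer agent: proposes a plan, revised when feedback exists."""
    suffix = f" (revised after: {feedback})" if feedback else ""
    return f"baseline model for '{task}'{suffix}"


def write_code(plan: str) -> str:
    """Stub coding agent: turns a plan into a (fake) training script."""
    return f"# training script implementing: {plan}"


def evaluate(code: str, attempt: int, target: float) -> tuple[float, str]:
    """Stub evaluator agent: fakes a score that improves with each attempt."""
    score = min(1.0, 0.5 + 0.2 * attempt)
    feedback = "" if score >= target else "score below target"
    return score, feedback


def build_model(task: str, target: float = 0.8, max_attempts: int = 5) -> Candidate:
    """Loop design -> code -> evaluate until the target score is reached."""
    feedback = ""
    best = None
    for attempt in range(max_attempts):
        plan = design(task, feedback)
        code = write_code(plan)
        score, feedback = evaluate(code, attempt, target)
        candidate = Candidate(plan, code, score)
        if best is None or candidate.score > best.score:
            best = candidate
        if score >= target:
            break  # evaluator is satisfied, stop iterating
    return best


best = build_model("predict customer churn")
print(best.plan)
print(best.score)
```

In a real system each stub would call a model and the evaluator would run the generated code against held-out data; the loop structure is the only part this sketch is meant to convey.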

tags: #useful

➡ @DataScienceN
✅ Data Science Project Ideas

1️⃣ Beginner Friendly Projects
• Exploratory Data Analysis (EDA) on CSV datasets
• Student Marks Analysis
• COVID / Weather Data Analysis
• Simple Data Visualization Dashboard
• Basic Recommendation System (rule-based)

2️⃣ Python for Data Science
• Sales Data Analysis using Pandas
• Web Scraping + Analysis (BeautifulSoup)
• Data Cleaning & Preprocessing Project
• Movie Rating Analysis
• Stock Price Analysis (historical data)

3️⃣ Machine Learning Projects
• House Price Prediction
• Spam Email Classifier
• Loan Approval Prediction
• Customer Churn Prediction
• Iris / Titanic Dataset Classification

4️⃣ Data Visualization Projects
• Interactive Dashboard using Matplotlib/Seaborn
• Sales Performance Dashboard
• Social Media Analytics Dashboard
• COVID Trends Visualization
• Country-wise GDP Analysis

5️⃣ NLP (Text & Language) Projects
• Sentiment Analysis on Reviews
• Resume Screening System
• Fake News Detection
• Chatbot (Rule-based → ML-based)
• Topic Modeling on Articles

6️⃣ Advanced ML / AI Projects
• Recommendation System (Collaborative Filtering)
• Credit Card Fraud Detection
• Image Classification (CNN basics)
• Face Mask Detection
• Speech-to-Text Analysis

7️⃣ Data Engineering / Big Data
• ETL Pipeline using Python
• Data Warehouse Design (Star Schema)
• Log File Analysis
• API Data Ingestion Project
• Batch Processing with Large Datasets

8️⃣ Real-World / Portfolio Projects
• End-to-End Data Science Project
• Business Problem → Data → Model → Insights
• Kaggle Competition Project
• Open Dataset Case Study
• Automated Data Reporting Tool