Data Science Portfolio - Kaggle Datasets & AI Projects | Artificial Intelligence
37.7K subscribers
283 photos
76 files
337 links
Free Datasets For Data Science Projects & Portfolio

Buy ads: https://telega.io/c/DataPortfolio

For Promotions/ads: @coderfun @love_data
Download Telegram
๐Ÿฐ ๐—™๐—ฅ๐—˜๐—˜ ๐—–๐—ผ๐˜‚๐—ฟ๐˜€๐—ฒ๐˜€ ๐—ฏ๐˜† ๐—›๐—ฎ๐—ฟ๐˜ƒ๐—ฎ๐—ฟ๐—ฑ ๐—ฎ๐—ป๐—ฑ ๐—ฆ๐˜๐—ฎ๐—ป๐—ณ๐—ผ๐—ฟ๐—ฑ ๐˜๐—ผ ๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป ๐—”๐—œ๐Ÿ˜

Dreaming of Mastering AI? ๐ŸŽฏ

Harvard and Stanfordโ€”two of the most prestigious universities in the worldโ€”are offering FREE AI courses๐Ÿ‘จโ€๐Ÿ’ป

No hidden fees, no long applicationsโ€”just pure, world-class education, accessible to everyone๐Ÿ”ฅ

๐‹๐ข๐ง๐ค๐Ÿ‘‡:-

https://pdlink.in/3GqHkau

Hereโ€™s your golden ticket to the future!โœ…
๐Ÿ‘1
Important Topics to become a data scientist [Advanced Level]
๐Ÿ‘‡๐Ÿ‘‡

1. Mathematics

Linear Algebra
Analytic Geometry
Matrix
Vector Calculus
Optimization
Regression
Dimensionality Reduction
Density Estimation
Classification

2. Probability

Introduction to Probability
1D Random Variable
The function of One Random Variable
Joint Probability Distribution
Discrete Distribution
Normal Distribution

3. Statistics

Introduction to Statistics
Data Description
Random Samples
Sampling Distribution
Parameter Estimation
Hypotheses Testing
Regression

4. Programming

Python:

Python Basics
List
Set
Tuples
Dictionary
Function
NumPy
Pandas
Matplotlib/Seaborn

R Programming:

R Basics
Vector
List
Data Frame
Matrix
Array
Function
dplyr
ggplot2
Tidyr
Shiny

DataBase:
SQL
MongoDB

Data Structures

Web scraping

Linux

Git

5. Machine Learning

How Model Works
Basic Data Exploration
First ML Model
Model Validation
Underfitting & Overfitting
Random Forest
Handling Missing Values
Handling Categorical Variables
Pipelines
Cross-Validation(R)
XGBoost(Python|R)
Data Leakage

6. Deep Learning

Artificial Neural Network
Convolutional Neural Network
Recurrent Neural Network
TensorFlow
Keras
PyTorch
A Single Neuron
Deep Neural Network
Stochastic Gradient Descent
Overfitting and Underfitting
Dropout Batch Normalization
Binary Classification

7. Feature Engineering

Baseline Model
Categorical Encodings
Feature Generation
Feature Selection

8. Natural Language Processing

Text Classification
Word Vectors

9. Data Visualization Tools

BI (Business Intelligence):
Tableau
Power BI
Qlik View
Qlik Sense

10. Deployment

Microsoft Azure
Heroku
Google Cloud Platform
Flask
Django

I have curated the best interview resources to crack Data Science Interviews
๐Ÿ‘‡๐Ÿ‘‡
https://whatsapp.com/channel/0029Va4QUHa6rsQjhITHK82y

Like if you need similar content ๐Ÿ˜„๐Ÿ‘
๐Ÿ‘3
Forwarded from Generative AI
๐—™๐—ฅ๐—˜๐—˜ ๐—š๐—ผ๐—ผ๐—ด๐—น๐—ฒ ๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป๐—ถ๐—ป๐—ด ๐—ฃ๐—ฎ๐˜๐—ต! ๐—•๐—ฒ๐—ฐ๐—ผ๐—บ๐—ฒ ๐—ฎ ๐—–๐—ฒ๐—ฟ๐˜๐—ถ๐—ณ๐—ถ๐—ฒ๐—ฑ ๐——๐—ฎ๐˜๐—ฎ ๐—”๐—ป๐—ฎ๐—น๐˜†๐˜€๐˜ ๐—ถ๐—ป ๐Ÿฎ๐Ÿฌ๐Ÿฎ๐Ÿฑ๐Ÿ˜

If youโ€™re dreaming of starting a high-paying data career or switching into the booming tech industry, Google just made it a whole lot easier โ€” and itโ€™s completely FREE๐Ÿ‘จโ€๐Ÿ’ป

๐‹๐ข๐ง๐ค๐Ÿ‘‡:-

https://pdlink.in/4cMx2h2

Youโ€™ll get access to hands-on labs, real datasets, and industry-grade training created directly by Googleโ€™s own experts๐Ÿ’ป
๐Ÿ‘2
Please go through this top 5 SQL projects with Datasets that you can practice and can add in your resume

๐Ÿš€1. Web Analytics:
(
https://www.kaggle.com/zynicide/wine-reviews)

๐Ÿš€2. Healthcare Data Analysis:
(
https://www.kaggle.com/cdc/mortality)

๐Ÿ“Œ3. E-commerce Analysis:
(
https://www.kaggle.com/olistbr/brazilian-ecommerce)

๐Ÿš€4. Inventory Management:
(
https://www.kaggle.com/code/govindji/inventory-management)


๐Ÿš€ 5. Analysis of Sales Data:
(
https://www.kaggle.com/kyanyoga/sample-sales-data)

Small suggestion from my side for non tech students: kindly pick those datasets which you like the subject in general, that way you will be more excited to practice it, instead of just doing it for the sake of resume, you will learn SQL more passionately, since itโ€™s a programming language try to make it more exciting for yourself.

Hope this piece of information helps you
๐Ÿ‘2
๐—•๐—ฒ๐˜€๐˜ ๐—ฌ๐—ผ๐˜‚๐—ง๐˜‚๐—ฏ๐—ฒ ๐—–๐—ต๐—ฎ๐—ป๐—ป๐—ฒ๐—น๐˜€ ๐˜๐—ผ ๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป ๐—˜๐˜€๐˜€๐—ฒ๐—ป๐˜๐—ถ๐—ฎ๐—น ๐——๐—ฎ๐˜๐—ฎ ๐—”๐—ป๐—ฎ๐—น๐˜†๐˜๐—ถ๐—ฐ๐˜€ ๐—ฆ๐—ธ๐—ถ๐—น๐—น๐˜€ ๐—ณ๐—ผ๐—ฟ ๐—™๐—ฅ๐—˜๐—˜๐Ÿ˜

Dreaming of becoming a Data Analyst but feel overwhelmed by where to start?๐Ÿ‘จโ€๐Ÿ’ป

Hereโ€™s the truth: YouTube is packed with goldmine content, and the best part โ€” itโ€™s all 100% FREE๐Ÿ”ฅ

๐‹๐ข๐ง๐ค๐Ÿ‘‡:-

https://pdlink.in/4cL3SyM

๐Ÿš€ If Youโ€™re Serious About Data Analytics, You Canโ€™t Sleep on These YouTube Channels!
๐Ÿ‘1
Forwarded from Artificial Intelligence
๐—ง๐—–๐—ฆ ๐—™๐—ฅ๐—˜๐—˜ ๐—–๐—ฒ๐—ฟ๐˜๐—ถ๐—ณ๐—ถ๐—ฐ๐—ฎ๐˜๐—ถ๐—ผ๐—ป ๐—ข๐—ป ๐——๐—ฎ๐˜๐—ฎ ๐— ๐—ฎ๐—ป๐—ฎ๐—ด๐—ฒ๐—บ๐—ฒ๐—ป๐˜ - ๐—˜๐—ป๐—ฟ๐—ผ๐—น๐—น ๐—™๐—ผ๐—ฟ ๐—™๐—ฅ๐—˜๐—˜๐Ÿ˜

Want to know how top companies handle massive amounts of data without losing track? ๐Ÿ“Š

TCS is offering a FREE beginner-friendly course on Master Data Management, and yesโ€”it comes with a certificate! ๐ŸŽ“

๐‹๐ข๐ง๐ค๐Ÿ‘‡:-

https://pdlink.in/4jGFBw0

Just click and start learning!โœ…๏ธ
๐Ÿ‘1
๐Ÿš€ ๐—ฆ๐˜๐—ฟ๐˜‚๐—ด๐—ด๐—น๐—ถ๐—ป๐—ด ๐˜„๐—ถ๐˜๐—ต ๐——๐—ฎ๐˜๐—ฎ ๐—ฆ๐—ฐ๐—ถ๐—ฒ๐—ป๐—ฐ๐—ฒ ๐—œ๐—ป๐˜๐—ฒ๐—ฟ๐˜ƒ๐—ถ๐—ฒ๐˜„๐˜€? ๐—™๐—ผ๐—น๐—น๐—ผ๐˜„ ๐—ง๐—ต๐—ถ๐˜€ ๐—ฅ๐—ผ๐—ฎ๐—ฑ๐—บ๐—ฎ๐—ฝ! ๐Ÿš€

Data Science interviews can be daunting, but with the right approach, you can ace them! If you're feeling overwhelmed, here's a roadmap to guide you through the process and help you succeed:

๐Ÿ” ๐Ÿญ. ๐—จ๐—ป๐—ฑ๐—ฒ๐—ฟ๐˜€๐˜๐—ฎ๐—ป๐—ฑ ๐˜๐—ต๐—ฒ ๐—•๐—ฎ๐˜€๐—ถ๐—ฐ๐˜€:
Master fundamental concepts like statistics, linear algebra, and probability. These are crucial for tackling both theoretical and practical questions.

๐Ÿ’ป ๐Ÿฎ. ๐—ช๐—ผ๐—ฟ๐—ธ ๐—ผ๐—ป ๐—ฅ๐—ฒ๐—ฎ๐—น-๐—ช๐—ผ๐—ฟ๐—น๐—ฑ ๐—ฃ๐—ฟ๐—ผ๐—ท๐—ฒ๐—ฐ๐˜๐˜€:
Build a strong portfolio by solving real-world problems. Kaggle competitions, open datasets, and personal projects are great ways to gain hands-on experience.

๐Ÿง  ๐Ÿฏ. ๐—ฆ๐—ต๐—ฎ๐—ฟ๐—ฝ๐—ฒ๐—ป ๐—ฌ๐—ผ๐˜‚๐—ฟ ๐—–๐—ผ๐—ฑ๐—ถ๐—ป๐—ด ๐—ฆ๐—ธ๐—ถ๐—น๐—น๐˜€:
Coding is key in Data Science! Practice on platforms like LeetCode, HackerRank, or Codewars to boost your problem-solving ability and efficiency. Be comfortable with Python, SQL, and essential libraries.

๐Ÿ“Š ๐Ÿฐ. ๐— ๐—ฎ๐˜€๐˜๐—ฒ๐—ฟ ๐——๐—ฎ๐˜๐—ฎ ๐—ช๐—ฟ๐—ฎ๐—ป๐—ด๐—น๐—ถ๐—ป๐—ด & ๐—ฃ๐—ฟ๐—ฒ๐—ฝ๐—ฟ๐—ผ๐—ฐ๐—ฒ๐˜€๐˜€๐—ถ๐—ป๐—ด:
A significant portion of Data Science work revolves around cleaning and preparing data. Make sure you're comfortable with handling missing data, outliers, and feature engineering.

๐Ÿ“š ๐Ÿฑ. ๐—ฆ๐˜๐˜‚๐—ฑ๐˜† ๐—”๐—น๐—ด๐—ผ๐—ฟ๐—ถ๐˜๐—ต๐—บ๐˜€ & ๐— ๐—ผ๐—ฑ๐—ฒ๐—น๐˜€:
From decision trees to neural networks, ensure you understand how different models work and when to apply them. Know their strengths, weaknesses, and the mathematical principles behind them.

๐Ÿ’ฌ ๐Ÿฒ. ๐—œ๐—บ๐—ฝ๐—ฟ๐—ผ๐˜ƒ๐—ฒ ๐—–๐—ผ๐—บ๐—บ๐˜‚๐—ป๐—ถ๐—ฐ๐—ฎ๐˜๐—ถ๐—ผ๐—ป ๐—ฆ๐—ธ๐—ถ๐—น๐—น๐˜€:
Being able to explain complex concepts in a simple way is essential, especially when communicating with non-technical stakeholders. Practice explaining your findings and solutions clearly.

๐Ÿ”„ ๐Ÿณ. ๐— ๐—ผ๐—ฐ๐—ธ ๐—œ๐—ป๐˜๐—ฒ๐—ฟ๐˜ƒ๐—ถ๐—ฒ๐˜„๐˜€ & ๐—™๐—ฒ๐—ฒ๐—ฑ๐—ฏ๐—ฎ๐—ฐ๐—ธ:
Practice mock interviews with peers or mentors. Constructive feedback will help you identify areas of improvement and build confidence.

๐Ÿ“ˆ ๐Ÿด. ๐—ž๐—ฒ๐—ฒ๐—ฝ ๐—จ๐—ฝ ๐—ช๐—ถ๐˜๐—ต ๐—ง๐—ฟ๐—ฒ๐—ป๐—ฑ๐˜€:
Data Science is a fast-evolving field! Stay updated on the latest techniques, tools, and industry trends to remain competitive.

๐Ÿ‘‰ ๐—ฃ๐—ฟ๐—ผ ๐—ง๐—ถ๐—ฝ: Be persistent! Rejections are part of the journey, but every experience teaches you something new.
Many people still aren't fully utilizing the power of Telegram.

There are numerous channels on Telegram that can help you find the latest job and internship opportunities?

Here are some of my top channel recommendations to help you get started ๐Ÿ‘‡๐Ÿ‘‡

Latest Jobs & Internships: https://t.iss.one/getjobss

Jobs Preparation Resources:
https://t.iss.one/jobinterviewsprep

Web Development Jobs:
https://t.iss.one/webdeveloperjob

Data Science Jobs:
https://t.iss.one/datasciencej

Interview Tips:
https://t.iss.one/Interview_Jobs

Data Analyst Jobs:
https://t.iss.one/jobs_SQL

AI Jobs:
https://t.iss.one/AIjobz

Remote Jobs:
https://t.iss.one/jobs_us_uk

FAANG Jobs:
https://t.iss.one/FAANGJob

Software Developer Jobs: https://t.iss.one/internshiptojobs

If you found this helpful, donโ€™t forget to like, share, and follow for more resources that can boost your career journey!

Let me know if you know any other useful telegram channel

ENJOY LEARNING๐Ÿ‘๐Ÿ‘
๐Ÿ‘1
jscheatsheet.pdf
1.4 MB
Javascript Cheatsheet โœจ๏ธ
๐Ÿ‘1๐Ÿ”ฅ1
Forwarded from Artificial Intelligence
๐Ÿฑ ๐—™๐—ฟ๐—ฒ๐—ฒ ๐—ช๐—ฒ๐—ฏ๐˜€๐—ถ๐˜๐—ฒ๐˜€ ๐˜๐—ผ ๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป ๐—ฃ๐˜†๐˜๐—ต๐—ผ๐—ป ๐—ณ๐—ฟ๐—ผ๐—บ ๐—ฆ๐—ฐ๐—ฟ๐—ฎ๐˜๐—ฐ๐—ต ๐—ถ๐—ป ๐Ÿฎ๐Ÿฌ๐Ÿฎ๐Ÿฑ (๐—ก๐—ผ ๐—œ๐—ป๐˜ƒ๐—ฒ๐˜€๐˜๐—บ๐—ฒ๐—ป๐˜ ๐—ก๐—ฒ๐—ฒ๐—ฑ๐—ฒ๐—ฑ!)๐Ÿ˜

If youโ€™re serious about starting your tech journey, Python is one of the best languages to master๐Ÿ‘จโ€๐Ÿ’ป๐Ÿ‘จโ€๐ŸŽ“

Iโ€™ve found 5 hidden gems that offer beginner tutorials, advanced exercises, and even real-world projects โ€” absolutely FREE๐Ÿ”ฅ

๐‹๐ข๐ง๐ค๐Ÿ‘‡:-

https://pdlink.in/4lOVqmb

Start today, and youโ€™ll thank yourself tomorrow.โœ…๏ธ
๐Ÿ‘1
Machine learning powers so many things around us โ€“ from recommendation systems to self-driving cars!

But understanding the different types of algorithms can be tricky.

This is a quick and easy guide to the four main categories: Supervised, Unsupervised, Semi-Supervised, and Reinforcement Learning.

๐Ÿ. ๐’๐ฎ๐ฉ๐ž๐ซ๐ฏ๐ข๐ฌ๐ž๐ ๐‹๐ž๐š๐ซ๐ง๐ข๐ง๐ 
In supervised learning, the model learns from examples that already have the answers (labeled data). The goal is for the model to predict the correct result when given new data.

๐’๐จ๐ฆ๐ž ๐œ๐จ๐ฆ๐ฆ๐จ๐ง ๐ฌ๐ฎ๐ฉ๐ž๐ซ๐ฏ๐ข๐ฌ๐ž๐ ๐ฅ๐ž๐š๐ซ๐ง๐ข๐ง๐  ๐š๐ฅ๐ ๐จ๐ซ๐ข๐ญ๐ก๐ฆ๐ฌ ๐ข๐ง๐œ๐ฅ๐ฎ๐๐ž:

โžก๏ธ Linear Regression โ€“ For predicting continuous values, like house prices.
โžก๏ธ Logistic Regression โ€“ For predicting categories, like spam or not spam.
โžก๏ธ Decision Trees โ€“ For making decisions in a step-by-step way.
โžก๏ธ K-Nearest Neighbors (KNN) โ€“ For finding similar data points.
โžก๏ธ Random Forests โ€“ A collection of decision trees for better accuracy.
โžก๏ธ Neural Networks โ€“ The foundation of deep learning, mimicking the human brain.

๐Ÿ. ๐”๐ง๐ฌ๐ฎ๐ฉ๐ž๐ซ๐ฏ๐ข๐ฌ๐ž๐ ๐‹๐ž๐š๐ซ๐ง๐ข๐ง๐ 
With unsupervised learning, the model explores patterns in data that doesnโ€™t have any labels. It finds hidden structures or groupings.

๐’๐จ๐ฆ๐ž ๐ฉ๐จ๐ฉ๐ฎ๐ฅ๐š๐ซ ๐ฎ๐ง๐ฌ๐ฎ๐ฉ๐ž๐ซ๐ฏ๐ข๐ฌ๐ž๐ ๐ฅ๐ž๐š๐ซ๐ง๐ข๐ง๐  ๐š๐ฅ๐ ๐จ๐ซ๐ข๐ญ๐ก๐ฆ๐ฌ ๐ข๐ง๐œ๐ฅ๐ฎ๐๐ž:

โžก๏ธ K-Means Clustering โ€“ For grouping data into clusters.
โžก๏ธ Hierarchical Clustering โ€“ For building a tree of clusters.
โžก๏ธ Principal Component Analysis (PCA) โ€“ For reducing data to its most important parts.
โžก๏ธ Autoencoders โ€“ For finding simpler representations of data.

๐Ÿ‘. ๐’๐ž๐ฆ๐ข-๐’๐ฎ๐ฉ๐ž๐ซ๐ฏ๐ข๐ฌ๐ž๐ ๐‹๐ž๐š๐ซ๐ง๐ข๐ง๐ 
This is a mix of supervised and unsupervised learning. It uses a small amount of labeled data with a large amount of unlabeled data to improve learning.

๐‚๐จ๐ฆ๐ฆ๐จ๐ง ๐ฌ๐ž๐ฆ๐ข-๐ฌ๐ฎ๐ฉ๐ž๐ซ๐ฏ๐ข๐ฌ๐ž๐ ๐ฅ๐ž๐š๐ซ๐ง๐ข๐ง๐  ๐š๐ฅ๐ ๐จ๐ซ๐ข๐ญ๐ก๐ฆ๐ฌ ๐ข๐ง๐œ๐ฅ๐ฎ๐๐ž:

โžก๏ธ Label Propagation โ€“ For spreading labels through connected data points.
โžก๏ธ Semi-Supervised SVM โ€“ For combining labeled and unlabeled data.
โžก๏ธ Graph-Based Methods โ€“ For using graph structures to improve learning.

๐Ÿ’. ๐‘๐ž๐ข๐ง๐Ÿ๐จ๐ซ๐œ๐ž๐ฆ๐ž๐ง๐ญ ๐‹๐ž๐š๐ซ๐ง๐ข๐ง๐ 
In reinforcement learning, the model learns by trial and error. It interacts with its environment, receives feedback (rewards or penalties), and learns how to act to maximize rewards.

๐๐จ๐ฉ๐ฎ๐ฅ๐š๐ซ ๐ซ๐ž๐ข๐ง๐Ÿ๐จ๐ซ๐œ๐ž๐ฆ๐ž๐ง๐ญ ๐ฅ๐ž๐š๐ซ๐ง๐ข๐ง๐  ๐š๐ฅ๐ ๐จ๐ซ๐ข๐ญ๐ก๐ฆ๐ฌ ๐ข๐ง๐œ๐ฅ๐ฎ๐๐ž:

โžก๏ธ Q-Learning โ€“ For learning the best actions over time.
โžก๏ธ Deep Q-Networks (DQN) โ€“ Combining Q-learning with deep learning.
โžก๏ธ Policy Gradient Methods โ€“ For learning policies directly.
โžก๏ธ Proximal Policy Optimization (PPO) โ€“ For stable and effective learning.

ENJOY LEARNING ๐Ÿ‘๐Ÿ‘
๐Ÿ‘2
๐—š๐—ผ๐—ผ๐—ด๐—น๐—ฒ ๐—™๐—ฅ๐—˜๐—˜ ๐—”๐—œ ๐—–๐—ฒ๐—ฟ๐˜๐—ถ๐—ณ๐—ถ๐—ฐ๐—ฎ๐˜๐—ถ๐—ผ๐—ป ๐—–๐—ผ๐˜‚๐—ฟ๐˜€๐—ฒ๐˜€๐Ÿ˜

Ever wondered how machines describe images in words?๐Ÿ’ป

Want to get hands-on with cutting-edge AI and computer vision โ€” for FREE?๐ŸŽŠ

๐‹๐ข๐ง๐ค๐Ÿ‘‡:-

https://pdlink.in/42FaT0Y

๐ŸŽฏ Start Learning AI for FREE
๐Ÿ‘2