Python for Data Analysts
47.9K subscribers
503 photos
64 files
318 links
Find top Python resources from global universities, cool projects, and learning materials for data analytics.

For promotions: @coderfun

Useful links: heylink.me/DataAnalytics
Download Telegram
Always remember this one thing in your life β€οΈπŸ‘‡
https://youtube.com/shorts/ir-4oBR9ec0?feature=share
πŸ‘7❀4
πŸ‘15❀4
Easy Python scenarios for everyday data tasks



Scenario 1: Data Cleaning
Question:
You have a DataFrame containing product prices with columns Product and Price. Some of the prices are stored as strings with a dollar sign, like $10. Write a Python function to convert the prices to float.

Answer:
import pandas as pd
data = {
'Product': ['A', 'B', 'C', 'D'],
'Price': ['$10', '$20', '$30', '$40']
}
df = pd.DataFrame(data)

def clean_prices(df):
df['Price'] = df['Price'].str.replace('$', '').astype(float)
return df
cleaned_df = clean_prices(df)
print(cleaned_df)

Scenario 2: Basic Aggregation
Question:
You have a DataFrame containing sales data with columns Region and Sales. Write a Python function to calculate the total sales for each region.

Answer:
import pandas as pd
data = {
'Region': ['North', 'South', 'East', 'West', 'North', 'South', 'East', 'West'],
'Sales': [100, 200, 150, 250, 300, 100, 200, 150]
}
df = pd.DataFrame(data)

def total_sales_per_region(df):
total_sales = df.groupby('Region')['Sales'].sum().reset_index()
return total_sales

total_sales = total_sales_per_region(df)
print(total_sales)

Scenario 3: Filtering Data
Question:
You have a DataFrame containing customer data with columns β€˜CustomerID’, Name, and Age. Write a Python function to filter out customers who are younger than 18 years old.

Answer:
import pandas as pd
data = {
'CustomerID': [1, 2, 3, 4, 5],
'Name': ['Alice', 'Bob', 'Charlie', 'David', 'Eve'],
'Age': [17, 22, 15, 35, 40]
}
df = pd.DataFrame(data)

def filter_customers(df):
filtered_df = df[df['Age'] >= 18]
return filtered_df
filtered_customers = filter_customers(df)
print(filtered_customers)
πŸ‘25❀8
❀13πŸ‘6πŸ‘1
❀12πŸ‘9πŸ₯°2
5 essential Python functions for handling missing data:

πŸ”Ή isna(): Detects missing values in your DataFrame. Identifies NaNs

πŸ”Ή notna(): Detects non-missing values. Filters out the NaNs.

πŸ”Ή interpolate(): Fills missing values using interpolation

πŸ”Ή bfill(): Backward fill. Fills missing values with the next valid observation.

πŸ”Ή ffill(): Forward fill. Fills missing values with the previous valid observation.
πŸ‘14❀10
7 level of writing Python Dictionary



Level 1: Basic Dictionary Creation

Level 2: Accessing and Modifying values

Level 3: Adding and Removing key Values Pairs

Level 4: Dictionary Methods

Level 5: Dictionary Comprehensions

Level 6: Nested Dictionary

Level 7: Advanced Dictionary Operations

I have curated the best interview resources to crack Python Interviews πŸ‘‡πŸ‘‡
https://topmate.io/coding/898340

Hope you'll like it

Like this post if you need more resources like this πŸ‘β€οΈ
πŸ‘7❀2
𝐏𝐲𝐭𝐑𝐨𝐧 𝐈𝐧𝐭𝐞𝐫𝐯𝐒𝐞𝐰 𝐏𝐫𝐞𝐩:

Must practise the following questions for your next Python interview:

1. How would you handle missing values in a dataset?

2. Write a python code to merge datasets based on a common column.

3. How would you analyse the distribution of a continuous variable in dataset?

4. Write a python code to pivot an dataframe.

5. How would you handle categorical variables with many levels?

6. Write a python code to calculate the accuracy, precision, and recall of a classification model?

7. How would you handle errors when working with large datasets?

I have curated the best interview resources to crack Python Interviews πŸ‘‡πŸ‘‡
https://topmate.io/coding/898340

Hope you'll like it

Like this post if you need more resources like this πŸ‘β€οΈ
πŸ‘7❀6
Many people reached out to me saying telegram may get banned in their countries. So I've decided to create WhatsApp channels based on your interests πŸ‘‡πŸ‘‡

Free Courses with Certificate: https://whatsapp.com/channel/0029Vamhzk5JENy1Zg9KmO2g

Jobs & Internship Opportunities:
https://whatsapp.com/channel/0029VaI5CV93AzNUiZ5Tt226

Web Development: https://whatsapp.com/channel/0029VaiSdWu4NVis9yNEE72z

Python Free Books & Projects: https://whatsapp.com/channel/0029VaiM08SDuMRaGKd9Wv0L

Java Resources: https://whatsapp.com/channel/0029VamdH5mHAdNMHMSBwg1s

Coding Interviews: https://whatsapp.com/channel/0029VammZijATRSlLxywEC3X

SQL: https://whatsapp.com/channel/0029VanC5rODzgT6TiTGoa1v

Power BI: https://whatsapp.com/channel/0029Vai1xKf1dAvuk6s1v22c

Programming Free Resources: https://whatsapp.com/channel/0029VahiFZQ4o7qN54LTzB17

Data Science Projects: https://whatsapp.com/channel/0029Va4QUHa6rsQjhITHK82y

Learn Data Science & Machine Learning: https://whatsapp.com/channel/0029Va8v3eo1NCrQfGMseL2D

Don’t worry Guys your contact number will stay hidden!

ENJOY LEARNING πŸ‘πŸ‘
πŸ‘7πŸ‘4❀1
How to create simple pivot table in Python? DataAnalytics

πŸ”Ή Step 1: Import pandas
πŸ”Ή Step 2: Load your DataFrame
πŸ”Ή Step 3: Pivot the DataFrame
πŸ”Ή Step 4: Display the pivoted data
πŸ‘9❀4
Data Structures Notes πŸ“‘
πŸ‘18❀10πŸ‘4
How to Use Python’s range() Function

The range() function generates a sequence of numbers, commonly used for looping a specific number of times or creating numeric lists.

The first number is included, but the last number is excluded.

For example, range(5, 10) will generate numbers from 5 to 9, but not 10.
πŸ‘9
Creating Beautiful Box Plots with Seaborn in Python

A box plot is a simple way to visualise the distribution of a dataset and identify potential outliers. It displays the minimum, first quartile (Q1), median (Q2), third quartile (Q3), and maximum of the data, as well as any outliers. For more details on box plots you can watch my latest video on Insta

πŸ”Ή Step 1: Import Seaborn and load your dataset

πŸ”Ή Step 2: Create a basic box plot
πŸ‘9πŸ‘4❀2