Data Analysis Books | Python | SQL | Excel | Artificial Intelligence | Power BI | Tableau | AI Resources
48.5K subscribers
236 photos
1 video
36 files
396 links
Download Telegram
Essential Skills for Data Analysis โ˜๏ธ
๐Ÿ‘4
CHOOSING THE RIGHT DATA ANALYTICS TOOLS

With so many data analytics tools available,
how do you pick the right one?

The truth isโ€”thereโ€™s no one-size-fits-all answer.
The best tool depends on your needs, your data, and your goals.

Hereโ€™s how to decide:

๐Ÿ”น For Data Exploration & Cleaning โ†’ SQL, Python (Pandas), Excel
๐Ÿ”น For Dashboarding & Reporting โ†’ Tableau, Power BI, Looker
๐Ÿ”น For Big Data Processing โ†’ Spark, Snowflake, Google BigQuery
๐Ÿ”น For Statistical Analysis โ†’ R, Python (Statsmodels, SciPy)
๐Ÿ”น For Machine Learning โ†’ Python (Scikit-learn, TensorFlow)

Ask yourself:
โœ… What type of data am I working with?
โœ… Do I need interactive dashboards?
โœ… Is coding necessary, or do I need a no-code tool?
โœ… What does my team/stakeholder prefer?

The best tool is the one that helps you solve problems efficiently.
โค2๐Ÿ‘2
How do you handle null, 0, and blank values in your data during the cleaning process?

Sometimes interview questions are also based on this topic. Many data aspirants or even some professionals sometimes make the mistake of simply deleting missing values or trying to fill them without proper analysis.This can damage the integrity of the analysis. Itโ€™s essential to ask or find out the reason behind missing values in the data
whether from the project head, client, or through own investigation.

๐˜ผ๐™ฃ๐™จ๐™ฌ๐™š๐™ง:

Handling null, 0, and blank values is crucial for ensuring the accuracy and reliability of data analysis. Hereโ€™s how to approach it:

1. ๐™„๐™™๐™š๐™ฃ๐™ฉ๐™ž๐™›๐™ฎ๐™ž๐™ฃ๐™œ ๐™–๐™ฃ๐™™ ๐™๐™ฃ๐™™๐™š๐™ง๐™จ๐™ฉ๐™–๐™ฃ๐™™๐™ž๐™ฃ๐™œ ๐™ฉ๐™๐™š ๐˜พ๐™ค๐™ฃ๐™ฉ๐™š๐™ญ๐™ฉ:
   - ๐™‰๐™ช๐™ก๐™ก ๐™‘๐™–๐™ก๐™ช๐™š๐™จ: These represent missing or undefined data. Identify them using functions like 'ISNULL' or filters in Power Query.
   - 0 ๐™‘๐™–๐™ก๐™ช๐™š๐™จ: These can be legitimate data points but may also indicate missing data in some contexts. Understanding the context is important.
   - ๐˜ฝ๐™ก๐™–๐™ฃ๐™  ๐™‘๐™–๐™ก๐™ช๐™š๐™จ: These can be spaces or empty strings. Identify them using 'LEN', 'TRIM', or filters.

2. ๐™ƒ๐™–๐™ฃ๐™™๐™ก๐™ž๐™ฃ๐™œ ๐™๐™๐™š๐™จ๐™š ๐™‘๐™–๐™ก๐™ช๐™š๐™จ ๐™๐™จ๐™ž๐™ฃ๐™œ ๐™‹๐™ง๐™ค๐™ฅ๐™š๐™ง ๐™๐™š๐™˜๐™๐™ฃ๐™ž๐™ฆ๐™ช๐™š๐™จ:
   - ๐™‰๐™ช๐™ก๐™ก ๐™‘๐™–๐™ก๐™ช๐™š๐™จ: Typically decide whether to impute, remove, or leave them based on the datasetโ€™s context and the analysis requirements. Common imputation methods include using mean, median, or a placeholder.
   - 0 ๐™‘๐™–๐™ก๐™ช๐™š๐™จ: If 0s are valid data, leave them as is. If they indicate missing data, treat them similarly to null values.

   - ๐˜ฝ๐™ก๐™–๐™ฃ๐™  ๐™‘๐™–๐™ก๐™ช๐™š๐™จ: Convert blanks to nulls or handle them as needed. This involves using 'IF' statements or Power Query transformations.

3. ๐™๐™จ๐™ž๐™ฃ๐™œ ๐™€๐™ญ๐™˜๐™š๐™ก ๐™–๐™ฃ๐™™ ๐™‹๐™ค๐™ฌ๐™š๐™ง ๐™Œ๐™ช๐™š๐™ง๐™ฎ:
   - ๐™€๐™ญ๐™˜๐™š๐™ก: Use formulas like 'IFERROR', 'IF', and 'VLOOKUP' to handle these values.
   - ๐™‹๐™ค๐™ฌ๐™š๐™ง ๐™Œ๐™ช๐™š๐™ง๐™ฎ: Use transformations to filter, replace, or fill null and blank values. Steps like 'Fill Down', 'Replace Values', and custom columns help automate the process.

By carefully considering the context and using appropriate methods, the data cleaning process maintains the integrity and quality of the data.

Hope it helps :)
๐Ÿ‘7
Data Analyst Interview Questions
[Python, SQL, PowerBI]

1. Is indentation required in python?
Ans:
Indentation is necessary for Python. It specifies a block of code. All code within loops, classes, functions, etc is specified within an indented block. It is usually done using four space characters. If your code is not indented necessarily, it will not execute accurately and will throw errors as well.

2. What are Entities and Relationships?
Ans:
Entity:
An entity can be a real-world object that can be easily identifiable. For example, in a college database, students, professors, workers, departments, and projects can be referred to as entities.

Relationships: Relations or links between entities that have something to do with each other. For example โ€“ The employeeโ€™s table in a companyโ€™s database can be associated with the salary table in the same database.

3. What are Aggregate and Scalar functions?
Ans:
An aggregate function performs operations on a collection of values to return a single scalar value. Aggregate functions are often used with the GROUP BY and HAVING clauses of the SELECT statement. A scalar function returns a single value based on the input value.

4. What are Custom Visuals in Power BI?
Ans:
Custom Visuals are like any other visualizations, generated using Power BI. The only difference is that it develops the custom visuals using a custom SDK. The languages like JQuery and JavaScript are used to create custom visuals in Power BI

ENJOY LEARNING ๐Ÿ‘๐Ÿ‘
๐Ÿ‘5
Finance is one of the highest paid domains for Data Science jobs.

Hereโ€™s a complete step by step roadmap to learn Data Science for Finance ๐Ÿ‘‡๐Ÿ‘‡

Step 1: Understand the fundamentals of finance

Step 2: Learn essential programming languages and tools

Step 3: Learn the fundamentals of statistics for Data Science

Step 4: Learn Data Manipulation, Analysis, and Visualization

Step 5: Dive deep into Data Science and Machine Learning Algorithms

Step 6: Learn to work with Financial Data
๐Ÿ‘7
BECOMING A DATA ANALYST IN 2025

Becoming a data analyst doesnโ€™t have to be expensive in 2025.

With the right free resources and a structured approach,
you can become a skilled data analyst.

Hereโ€™s a roadmap with free resources to guide your journey:

1๏ธโƒฃ Learn the Basics of Data Analytics
Start with foundational concepts like:
โ†ณ What is data analytics?
โ†ณ Types of analytics (descriptive, predictive, prescriptive).
โ†ณ Basics of data types and statistics.

๐Ÿ“˜ Free Resources:
1. Intro to Statistics : https://www.khanacademy.org/math/statistics-probability
2. Introduction to Data Analytics by IBM (audit for free) :
https://imp.i384100.net/WyNqoM


2๏ธโƒฃ Master Excel for Data Analysis
Excel is an essential tool for data cleaning, analysis, and visualization.

๐Ÿ“˜ Free Resources:
1. Excel Is Fun (YouTube): https://www.youtube.com/user/ExcelIsFun
2. Chandoo.org: https://chandoo.org/

๐ŸŽฏ Practice: Learn how to create pivot tables and use functions like VLOOKUP, SUMIF, and IF.


3๏ธโƒฃ Learn SQL for Data Queries
SQL is the language of dataโ€”used to retrieve and manipulate datasets.

๐Ÿ“˜ Free Resources:
1. W3Schools SQL Tutorial : https://www.w3schools.com/sql/
2. Mode Analytics SQL Tutorial : https://mode.com/sql-tutorial/

๐ŸŽฏ Practice: Write SELECT, WHERE, and JOIN queries on free datasets.


4๏ธโƒฃ Get Hands-On with Data Visualization
Learn to communicate insights visually with tools like Tableau or Power BI.

๐Ÿ“˜ Free Resources:
1. Tableau Public: https://www.tableau.com/learn/training
2. Power BI Community Blog: https://community.fabric.microsoft.com/t5/Power-BI-Community-Blog/bg-p/community_blog

๐ŸŽฏ Practice: Create dashboards to tell stories using real datasets.

5๏ธโƒฃ Dive into Python or R for Analytics
Coding isnโ€™t mandatory, but Python or R can open up advanced analytics.

๐Ÿ“˜ Free Resources:
1. Googleโ€™s Python Course https://developers.google.com/edu/python
2. R for Data Science (free book) r4ds.had.co.nz

๐ŸŽฏ Practice: Use libraries like Pandas (Python) or dplyr (R) to clean and analyze data.


6๏ธโƒฃ Work on Real Projects
Apply your skills to real-world datasets to build your portfolio.

๐Ÿ“˜ Free Resources:
Kaggle: Datasets and beginner-friendly competitions.
Google Dataset Search: Access datasets on any topic.

๐ŸŽฏ Project Ideas:
Analyze sales data and create a dashboard.
Predict customer churn using a public dataset.


7๏ธโƒฃ Build Your Portfolio and Network
Showcase your projects and connect with others in the field.

๐Ÿ“˜ Tips:
โ†’ Use GitHub to share your work.
โ†’ Create LinkedIn posts about your learning journey.
โ†’ Join forums like r/DataScience on Reddit or LinkedIn groups.

๐Ÿ’ก Start small, use free resources, and keep building.
๐Ÿ’ก Remember: Every small step adds up to big progress.
๐Ÿ‘8
Hey Guys๐Ÿ‘‹,

The Average Salary Of a Data Scientist is 14LPA 

๐๐ž๐œ๐จ๐ฆ๐ž ๐š ๐‚๐ž๐ซ๐ญ๐ข๐Ÿ๐ข๐ž๐ ๐ƒ๐š๐ญ๐š ๐’๐œ๐ข๐ž๐ง๐ญ๐ข๐ฌ๐ญ ๐ˆ๐ง ๐“๐จ๐ฉ ๐Œ๐๐‚๐ฌ๐Ÿ˜

We help you master the required skills.

Learn by doing, build Industry level projects

๐Ÿ‘ฉโ€๐ŸŽ“ 1500+ Students Placed
๐Ÿ’ผ 7.2 LPA Avg. Package
๐Ÿ’ฐ 41 LPA Highest Package
๐Ÿค 450+ Hiring Partners

Apply for FREE๐Ÿ‘‡ :
https://tracking.acciojob.com/g/PUfdDxgHR

( Limited Slots )
โค4๐Ÿ‘1
๐—ช๐—ฎ๐—ป๐˜ ๐˜๐—ผ ๐—ธ๐—ป๐—ผ๐˜„ ๐˜„๐—ต๐—ฎ๐˜ ๐—ต๐—ฎ๐—ฝ๐—ฝ๐—ฒ๐—ป๐˜€ ๐—ถ๐—ป ๐—ฎ ๐—ฟ๐—ฒ๐—ฎ๐—น ๐—ฑ๐—ฎ๐˜๐—ฎ ๐—ฎ๐—ป๐—ฎ๐—น๐˜†๐˜€๐˜ ๐—ถ๐—ป๐˜๐—ฒ๐—ฟ๐˜ƒ๐—ถ๐—ฒ๐˜„?

๐—•๐—ฎ๐˜€๐—ถ๐—ฐ ๐—œ๐—ป๐˜๐—ฟ๐—ผ๐—ฑ๐˜‚๐—ฐ๐˜๐—ถ๐—ผ๐—ป

-Brief introduction about yourself.

-Explanation of how you developed an interest in learning Power BI despite having a chemical background.


๐—ง๐—ผ๐—ผ๐—น๐˜€ ๐—ฃ๐—ฟ๐—ผ๐—ณ๐—ถ๐—ฐ๐—ถ๐—ฒ๐—ป๐—ฐ๐˜†

-Discussion about the tools you are proficient in.

-Detailed explanation of a project that demonstrated your proficiency in these tools.

๐—ฃ๐—ฟ๐—ผ๐—ท๐—ฒ๐—ฐ๐˜ ๐—˜๐˜…๐—ฝ๐—น๐—ฎ๐—ป๐—ฎ๐˜๐—ถ๐—ผ๐—ป

Explain about any Data Analytics Project you did, below are some follow-up questions for sales related data analysis project

Follow-up Question:

Was there any improvement in sales after building the report?

Provide a clear before and after scenario in sales post-report creation.

What areas did you identify where the company was losing sales, and what were your recommendations?

- How do you check the quality of data when it's given to you?
Explain your methods for ensuring data quality.

- How do you handle null values? Describe your approach to managing null values in datasets.


๐—ฆ๐—ค๐—Ÿ ๐—พ๐˜‚๐—ฒ๐˜€๐˜๐—ถ๐—ผ๐—ป๐˜€

-Explain the order in which SQL clauses are executed.

-Write a query to find the percentage of the 18-year-old population.
Details: You are given two tables:
Table 1: Contains states and their respective populations.
Table 2: Contains three columns (state, gender, and population of 18-year-olds).

-Explain window functions and how to rank values in SQL.

- Difference between JOIN and UNION.

-How to return unique values in SQL.

๐—•๐—ฒ๐—ต๐—ฎ๐˜ƒ๐—ถ๐—ผ๐—ฟ๐—ฎ๐—น ๐—ค๐˜‚๐—ฒ๐˜€๐˜๐—ถ๐—ผ๐—ป๐˜€

-Solve a puzzle involving 3 gallons of water in one jar and 2 gallons in another to get exactly 4 gallons.
Step-by-step solution for the water puzzle.

- What skills have you learned on your own? Discuss the skills you self-taught and their impact on your career.

-Describe cases when you showcased team spirit.

-โญ ๐—ฆ๐—ผ๐—ฐ๐—ถ๐—ฎ๐—น ๐— ๐—ฒ๐—ฑ๐—ถ๐—ฎ ๐—”๐—ฝ๐—ฝ ๐—ค๐˜‚๐—ฒ๐˜€๐˜๐—ถ๐—ผ๐—ป
Scenario: Choose any social media app (I choose Discord).
Question: What function/feature would you add to the Discord app, and how would you track its success?

- Rate yourself on Excel, SQL, and Python out of 10.

- What are your strengths in data analytics?

Like if it helps :)
๐Ÿ‘5โค3
Python Interview Questions for data analyst interview

Question 1: Find the top 5 dates when the percentage change in Company A's stock price was the highest.

Question 2: Calculate the annualized volatility of Company B's stock price. (Hint: Annualized volatility is the standard deviation of daily returns multiplied by the square root of the number of trading days in a year.)

Question 3: Identify the longest streaks of consecutive days when the stock price of Company A was either increasing or decreasing continuously.

Question 4: Create a new column that represents the cumulative returns of Company A's stock price over the year.

Question 5: Calculate the 7-day rolling average of both Company A's and Company B's stock prices and find the date when the two rolling averages were closest to each other.

Question 6: Create a new DataFrame that contains only the dates when Company A's stock price was above its 50-day moving average, and Company B's stock price was below its 50-day moving average
๐Ÿ‘6โค1
Data Analyst Interview Questions
[Python, SQL, PowerBI]

1. Is indentation required in python?
Ans:
Indentation is necessary for Python. It specifies a block of code. All code within loops, classes, functions, etc is specified within an indented block. It is usually done using four space characters. If your code is not indented necessarily, it will not execute accurately and will throw errors as well.

2. What are Entities and Relationships?
Ans:
Entity:
An entity can be a real-world object that can be easily identifiable. For example, in a college database, students, professors, workers, departments, and projects can be referred to as entities.

Relationships: Relations or links between entities that have something to do with each other. For example โ€“ The employeeโ€™s table in a companyโ€™s database can be associated with the salary table in the same database.

3. What are Aggregate and Scalar functions?
Ans:
An aggregate function performs operations on a collection of values to return a single scalar value. Aggregate functions are often used with the GROUP BY and HAVING clauses of the SELECT statement. A scalar function returns a single value based on the input value.

4. What are Custom Visuals in Power BI?
Ans:
Custom Visuals are like any other visualizations, generated using Power BI. The only difference is that it develops the custom visuals using a custom SDK. The languages like JQuery and JavaScript are used to create custom visuals in Power BI

ENJOY LEARNING ๐Ÿ‘๐Ÿ‘
๐Ÿ‘4โค1