Guesstimate questions are scary, simply because they really matter for impacting your performance in those all-important interviews โ often for consulting, data analytics or product management. No need to worry; you can do it! In this guide, we are looking at how to approach guesstimate questions with confidence and make what sounds like a guessing game into an opportunity for showcasing our analytical thinking.
https://datasimplifier.com/guesstimate-questions/
https://datasimplifier.com/guesstimate-questions/
๐5
Reminder for all data analyst job seekersโฐ
DA + HR Knowledgeโก๏ธHR Analyst
DA + Sales Knowledgeโก๏ธSales Analyst
DA + Supply Chainโก๏ธSupply chain Analyst
DA + Finance Knowledgeโก๏ธFinance Analyst
DA + Research Knowledgeโก๏ธResearch Analyst
DA + Marketing Knowledgeโก๏ธMarketing Analyst
What does it mean?
โฉBuild more functional / domain knowledge
โฉBy doing more projects & research
Why?
โ To increase your chances of landing a DA job ๐
DA + HR Knowledgeโก๏ธHR Analyst
DA + Sales Knowledgeโก๏ธSales Analyst
DA + Supply Chainโก๏ธSupply chain Analyst
DA + Finance Knowledgeโก๏ธFinance Analyst
DA + Research Knowledgeโก๏ธResearch Analyst
DA + Marketing Knowledgeโก๏ธMarketing Analyst
What does it mean?
โฉBuild more functional / domain knowledge
โฉBy doing more projects & research
Why?
โ To increase your chances of landing a DA job ๐
๐7
10 Data Cleaning Techniques Every Data Analyst Should Master:
1. Handling Missing Data
Use methods like imputation (mean, median, mode) or deletion to handle missing values.
In Python, pandas functions like fillna() or dropna() are useful.
Example:
2. Removing Duplicates
Identify and remove duplicate records to ensure the dataset is accurate. Use
1. Handling Missing Data
Use methods like imputation (mean, median, mode) or deletion to handle missing values.
In Python, pandas functions like fillna() or dropna() are useful.
Example:
df.fillna(df.mean()) replaces missing values with the column mean.2. Removing Duplicates
Identify and remove duplicate records to ensure the dataset is accurate. Use
drop_duplicates() in pandas.๐12๐ฅ2
3. Standardizing Data
Ensure consistency in formatting, such as dates and strings.
Use
4. Handling Outliers
Detect and manage outliers using statistical methods or by creating visuals like box plots. Methods include capping, flooring, or removing outliers.
Example:
Ensure consistency in formatting, such as dates and strings.
Use
str.lower() or pd.to_datetime() for standardization.4. Handling Outliers
Detect and manage outliers using statistical methods or by creating visuals like box plots. Methods include capping, flooring, or removing outliers.
Example:
df = df[(df['column'] >= lower_limit) & (df['column'] <= upper_limit)]๐6๐1
5. Correcting Data Types
Check that all columns have the correct data types for analysis. Use
6. Normalizing and Scaling Data
Normalize or scale data to bring all values into a similar range, which is important for algorithms like K-Means clustering.
Use
Example:
Check that all columns have the correct data types for analysis. Use
astype() in pandas to convert data types.6. Normalizing and Scaling Data
Normalize or scale data to bring all values into a similar range, which is important for algorithms like K-Means clustering.
Use
StandardScaler or MinMaxScaler from scikit-learn.Example:
from sklearn.preprocessing import StandardScaler; df_scaled = StandardScaler().fit_transform(df)๐7
7. Encoding Categorical Variables
Convert categorical data into numerical format using techniques like one-hot encoding or label encoding. Use
Example:
8. Dealing with Inconsistent Data
Identify and correct inconsistencies in data entries, such as typos or inconsistent naming conventions.
Example:
Convert categorical data into numerical format using techniques like one-hot encoding or label encoding. Use
pd.get_dummies() or LabelEncoder.Example:
df_encoded = pd.get_dummies(df, columns=['category'])8. Dealing with Inconsistent Data
Identify and correct inconsistencies in data entries, such as typos or inconsistent naming conventions.
Example:
df['column'] = df['column'].replace({'val1':'value1', 'val2':'value2'})๐9
9. Parsing and Extracting Data
Extract relevant information from complex data types such as strings or dates. Use string methods or regex (regular expressions).
Example:
10. Combining Multiple Data Sources
Merge or concatenate multiple datasets to create a comprehensive dataset.
Use
Example:
Extract relevant information from complex data types such as strings or dates. Use string methods or regex (regular expressions).
Example:
df['year'] = df['date'].dt.year10. Combining Multiple Data Sources
Merge or concatenate multiple datasets to create a comprehensive dataset.
Use
merge() or concat() in pandas.Example:
df_combined = pd.merge(df1, df2, on='key_column')๐8
โ
๐๐ผ๐ฟ๐ฟ๐ฒ๐ฐ๐ ๐๐ฎ๐ ๐๐ผ ๐ฎ๐๐ธ ๐ณ๐ผ๐ฟ ๐ฎ ๐ฟ๐ฒ๐ณ๐ฒ๐ฟ๐ฟ๐ฎ๐น:๐ฉ๐ป
---
Subject: Referral Request for [Position] at [Company Name]
Hi [Recipient's Name]๐,
I hope youโre doing well. Iโm interested in the [Position] at [Company] and noticed you work there. My background in data analytics, particularly in [specific expertise], aligns well with this role.
I understand the interviews will likely focus heavily on technical data analysis skills, and Iโm well-prepared, having worked on numerous projects and effectively used data-driven strategies to address complex challenges.
Here are the details for your reference:
- Job posting: [Job Link]
- Resume: [Resume Link]
- Projects and coding profile:
- GitHub: [GitHub Link]
- [Coding Profile Link] (e.g., [mention ranking/level if impressive])
I assure you that a referral will be highly valued and I will make the most of this opportunity. Iโm also happy to assist you with anything in return.
Any additional suggestion/advice you can provide would be greatly appreciated.
Thanks in advance!
Best,
[Your Full Name]
---
Subject: Referral Request for [Position] at [Company Name]
Hi [Recipient's Name]๐,
I hope youโre doing well. Iโm interested in the [Position] at [Company] and noticed you work there. My background in data analytics, particularly in [specific expertise], aligns well with this role.
I understand the interviews will likely focus heavily on technical data analysis skills, and Iโm well-prepared, having worked on numerous projects and effectively used data-driven strategies to address complex challenges.
Here are the details for your reference:
- Job posting: [Job Link]
- Resume: [Resume Link]
- Projects and coding profile:
- GitHub: [GitHub Link]
- [Coding Profile Link] (e.g., [mention ranking/level if impressive])
I assure you that a referral will be highly valued and I will make the most of this opportunity. Iโm also happy to assist you with anything in return.
Any additional suggestion/advice you can provide would be greatly appreciated.
Thanks in advance!
Best,
[Your Full Name]
๐12
The most powerful data analyst tool?
CTRL + C and CTRL + V
CTRL + C and CTRL + V
๐22๐5๐ซก1
When starting off your data analytics journey you DON'T need to be a SQL guru from the get-go.
In fact, most SQL skills you will only learn on the job with:
- real business problems.
- actual data sets.
- imperfect data architecture.
- other people to collaborate with.
So be kind to yourself, give yourself time to grow and above all...
try to become proficient at SQL rather than perfect.
The rest will take care of itself along the way! ๐
In fact, most SQL skills you will only learn on the job with:
- real business problems.
- actual data sets.
- imperfect data architecture.
- other people to collaborate with.
So be kind to yourself, give yourself time to grow and above all...
try to become proficient at SQL rather than perfect.
The rest will take care of itself along the way! ๐
๐10โค1
Essential Data Analysis Techniques Every Analyst Should Know
1. Descriptive Statistics: Understanding measures of central tendency (mean, median, mode) and measures of spread (variance, standard deviation) to summarize data.
2. Data Cleaning: Techniques to handle missing values, outliers, and inconsistencies in data, ensuring that the data is accurate and reliable for analysis.
3. Exploratory Data Analysis (EDA): Using visualization tools like histograms, scatter plots, and box plots to uncover patterns, trends, and relationships in the data.
4. Hypothesis Testing: The process of making inferences about a population based on sample data, including understanding p-values, confidence intervals, and statistical significance.
5. Correlation and Regression Analysis: Techniques to measure the strength of relationships between variables and predict future outcomes based on existing data.
6. Time Series Analysis: Analyzing data collected over time to identify trends, seasonality, and cyclical patterns for forecasting purposes.
7. Clustering: Grouping similar data points together based on characteristics, useful in customer segmentation and market analysis.
8. Dimensionality Reduction: Techniques like PCA (Principal Component Analysis) to reduce the number of variables in a dataset while preserving as much information as possible.
9. ANOVA (Analysis of Variance): A statistical method used to compare the means of three or more samples, determining if at least one mean is different.
10. Machine Learning Integration: Applying machine learning algorithms to enhance data analysis, enabling predictions, and automation of tasks.
Like this post if you need more ๐โค๏ธ
Hope it helps :)
1. Descriptive Statistics: Understanding measures of central tendency (mean, median, mode) and measures of spread (variance, standard deviation) to summarize data.
2. Data Cleaning: Techniques to handle missing values, outliers, and inconsistencies in data, ensuring that the data is accurate and reliable for analysis.
3. Exploratory Data Analysis (EDA): Using visualization tools like histograms, scatter plots, and box plots to uncover patterns, trends, and relationships in the data.
4. Hypothesis Testing: The process of making inferences about a population based on sample data, including understanding p-values, confidence intervals, and statistical significance.
5. Correlation and Regression Analysis: Techniques to measure the strength of relationships between variables and predict future outcomes based on existing data.
6. Time Series Analysis: Analyzing data collected over time to identify trends, seasonality, and cyclical patterns for forecasting purposes.
7. Clustering: Grouping similar data points together based on characteristics, useful in customer segmentation and market analysis.
8. Dimensionality Reduction: Techniques like PCA (Principal Component Analysis) to reduce the number of variables in a dataset while preserving as much information as possible.
9. ANOVA (Analysis of Variance): A statistical method used to compare the means of three or more samples, determining if at least one mean is different.
10. Machine Learning Integration: Applying machine learning algorithms to enhance data analysis, enabling predictions, and automation of tasks.
Like this post if you need more ๐โค๏ธ
Hope it helps :)
๐16
If you are targeting your first Data Analyst job then this is why you should avoid guided projects
The common thing nowadays is "Coffee Sales Analysis" and "Pizza Sales Analysis"
I don't see these projects as PROJECTS
But as big RED flags
We are showing our SKILLS through projects, RIGHT?
Then what's WRONG with these projects?
Don't think from YOUR side
Think from the HIRING team's side
These projects have more than a MILLION views on YouTube
Even if you consider 50% of this NUMBER
Then just IMAGINE how many aspiring Data Analysts would have created this same project
Hiring teams see hundreds of resumes and portfolios on a DAILY basis
Just imagine how many times they would have seen the SAME titles of projects again and again
They would know that these projects are PUBLICLY available for EVERYONE
You have simply copied pasted the ENTIRE project from YouTube
So now if I want to hire a Data Analyst then how would I JUDGE you or your technical skills?
What is the USE of Pizza or Coffee sales analysis projects for MY company?
By doing such guided projects, you are involving yourself in a big circle of COMPETITION
I repeat, there were more than a MILLION views
So please AVOID guided projects at all costs
Guided projects are good for your personal PRACTICE and LinkedIn CONTENT
But try not to involve them in your PORTFOLIO or RESUME
The common thing nowadays is "Coffee Sales Analysis" and "Pizza Sales Analysis"
I don't see these projects as PROJECTS
But as big RED flags
We are showing our SKILLS through projects, RIGHT?
Then what's WRONG with these projects?
Don't think from YOUR side
Think from the HIRING team's side
These projects have more than a MILLION views on YouTube
Even if you consider 50% of this NUMBER
Then just IMAGINE how many aspiring Data Analysts would have created this same project
Hiring teams see hundreds of resumes and portfolios on a DAILY basis
Just imagine how many times they would have seen the SAME titles of projects again and again
They would know that these projects are PUBLICLY available for EVERYONE
You have simply copied pasted the ENTIRE project from YouTube
So now if I want to hire a Data Analyst then how would I JUDGE you or your technical skills?
What is the USE of Pizza or Coffee sales analysis projects for MY company?
By doing such guided projects, you are involving yourself in a big circle of COMPETITION
I repeat, there were more than a MILLION views
So please AVOID guided projects at all costs
Guided projects are good for your personal PRACTICE and LinkedIn CONTENT
But try not to involve them in your PORTFOLIO or RESUME
๐8โค2
The best way to learn data analytics skills is to:
1. Watch a tutorial
2. Immediately practice what you just learned
3. Do projects to apply your learning to real-life applications
If you only watch videos and never practice, you wonโt retain any of your teaching.
If you never apply your learning with projects, you wonโt be able to solve problems on the job. (You also will have a much harder time attracting recruiters without a recruiter.)
1. Watch a tutorial
2. Immediately practice what you just learned
3. Do projects to apply your learning to real-life applications
If you only watch videos and never practice, you wonโt retain any of your teaching.
If you never apply your learning with projects, you wonโt be able to solve problems on the job. (You also will have a much harder time attracting recruiters without a recruiter.)
โค5๐5๐2
If youโre a data analyst, hereโs what recruiters really want:
Itโs not just about knowing the tools like Power BI, SQL, and Python.
They want to see that you can:
Understand business problems
Communicate your findings clearly
Turn data into useful insights
Make predictions about future trends
Data analysis isnโt just about generating reports; itโs about using data to support your companyโs goals.
Show that you can connect the dots, see the bigger picture, and explain your findings in simple terms.
Itโs not just about knowing the tools like Power BI, SQL, and Python.
They want to see that you can:
Understand business problems
Communicate your findings clearly
Turn data into useful insights
Make predictions about future trends
Data analysis isnโt just about generating reports; itโs about using data to support your companyโs goals.
Show that you can connect the dots, see the bigger picture, and explain your findings in simple terms.
๐4โค1
I have uploaded a lot of free resources on linkedin as well
๐๐
https://www.linkedin.com/company/sql-analysts/
We're just 94 followers away from reaching 100k on LinkedIn! โค๏ธ Join us and be part of this milestone!
๐๐
https://www.linkedin.com/company/sql-analysts/
We're just 94 followers away from reaching 100k on LinkedIn! โค๏ธ Join us and be part of this milestone!
๐8โค4
Data Analysis Books | Python | SQL | Excel | Artificial Intelligence | Power BI | Tableau | AI Resources
I have uploaded a lot of free resources on linkedin as well ๐๐ https://www.linkedin.com/company/sql-analysts/ We're just 94 followers away from reaching 100k on LinkedIn! โค๏ธ Join us and be part of this milestone!
100k followers completed, thanks for the love and support โค๏ธ
๐6โค4
Forwarded from SQL Programming Resources
What's the full form of NoSQL?
Anonymous Quiz
17%
Next Structured Query Language
68%
No Structure Query Language
4%
Non Stop Query Language
11%
Not Only SQL
๐7๐5
Most Demanding Data Analytics Skills!
โณ Dive into the essential skills and tools that are shaping the future of data analytics. From SQL and Python to Tableau and PowerBI, discover which technologies are crucial for advancing your data analysis capabilities.
โณ Explore the importance of machine learning techniques like linear regression, logistic regression, SVM, decision trees, random forests, K-means, and K-nearest neighbors, and how they can enhance your analytical prowess.
โณ Understand why soft skills such as communication, collaboration, critical thinking, and creativity are just as important as technical skills in the data analytics field.
โณ Get a comprehensive overview of the skills and technologies that can propel your career forward and make you a standout in the competitive world of data analytics.
โณ Dive into the essential skills and tools that are shaping the future of data analytics. From SQL and Python to Tableau and PowerBI, discover which technologies are crucial for advancing your data analysis capabilities.
โณ Explore the importance of machine learning techniques like linear regression, logistic regression, SVM, decision trees, random forests, K-means, and K-nearest neighbors, and how they can enhance your analytical prowess.
โณ Understand why soft skills such as communication, collaboration, critical thinking, and creativity are just as important as technical skills in the data analytics field.
โณ Get a comprehensive overview of the skills and technologies that can propel your career forward and make you a standout in the competitive world of data analytics.
๐7