Data Analyst Interview Resources
51.2K subscribers
254 photos
1 video
51 files
317 links
Join our telegram channel to learn how data analysis can reveal fascinating patterns, trends, and stories hidden within the numbers! πŸ“Š

For ads & suggestions: @love_data
Download Telegram
Struggling to stay motivated in your job search?

Try setting input goals first, then shift to output goals once you’re consistent.

Let me explain how this works with a real-life example.

Input Goals vs. Output Goals:

When starting, focus on input goals to build consistency.

For instance, if you're struggling to go to the gym, set a goal to show up every other day rather than aiming to lose 50 pounds.

Once you’re consistent, shift to output goals like losing 5 pounds a month.

Why This Works:

- Focus and Pressure: Output goals create a sense of urgency and focus.
- Efficiency: You find faster and more effective ways to achieve your goals.
- Persistence: Sticking with a strategy until it works builds resilience and problem-solving skills.

Action Time:

1) Start with Input Goals: If you're struggling with consistency, set small, manageable goals to build habits.

2) Shift to Output Goals: Once you’re consistent, set specific, measurable outcomes.

3) Don't Quit: Commit to your goals and find ways to make them work.
πŸ‘12❀4πŸ€”1πŸŽ‰1
Many people ask this common question β€œCan I get a job with just SQL and Excel?” or β€œCan I get a job with just Power BI and Python?”.

The answer to all of those questions is yes.

There are jobs that use only SQL, Tableau, Power BI, Excel, Python, or R or some combination of those.

However, the combination of tools you learn impacts the total number of jobs you are qualified for.

For example, let’s say with just SQL and Excel you are qualified for 10 jobs, but if you add Tableau to that, you are qualified for 50 jobs.

If you have a success rate of landing a job you’re qualified for of 4%, having 5 times as many jobs to go for greatly improves your odds of landing a job.

Does this mean you should go out there and learn every single skill any data analyst job requires?

NO!

It’s about finding the core tools that many jobs want.

And, in my opinion, those tools are SQL, Excel, and a visualization tool.

With these three tools, you are qualified for the majority of entry level data jobs and many higher level jobs.

So, you can land a job with whatever tools you’re comfortable with.

But if you have the three tools above in your toolbelt, you will have many more jobs to apply for and greatly improve your chances of snagging one.
πŸ‘7❀2
𝐋𝐒𝐬𝐭 𝐨𝐟 𝐜𝐨𝐦𝐩𝐚𝐧𝐒𝐞𝐬 𝐭𝐑𝐚𝐭 𝐑𝐒𝐫𝐞 𝐝𝐚𝐭𝐚 𝐚𝐧𝐚π₯𝐲𝐬𝐭𝐬:
TMcKinsey & Company
Boston Consulting Group (BCG)
Bain & Company
Deloitte
PwC
Ernst & Young (EY)
KPMG
Accenture
Google
Amazon
Microsoft
IBM
Oracle
Tiger Analytics
Mu Sigma
Fractal Analytics
EXL Service
ZS Associates
Wells Fargo
Walmart
Target
LTIMindtree
Infosys
TCS (Tata Consultancy Services)
Wipro
HCL Technologies
Capgemini
Cognizant

These companies often hire data analysts to use data for making decisions and planning strategically for their clients.
πŸ‘34
Data Analyst Interview Questions.pdf
81.4 KB
Data Analyst Interview Questions
πŸ‘7❀1
Data Analyst Interview QnA

1. Find avg of salaries department wise from table.

Answer-
SELECT department_id, AVG(salary) AS avg_salary
FROM employees
GROUP BY department_id;


2. What does Filter context in DAX mean?

Answer - Filter context in DAX refers to the subset of data that is actively being used in the calculation of a measure or in the evaluation of an expression. This context is determined by filters on the dashboard items like slicers, visuals, and filters pane which restrict the data being processed.

3. Explain how to implement Row-Level Security (RLS) in Power BI.

Answer - Row-Level Security (RLS) in Power BI can be implemented by:

- Creating roles within the Power BI service.
- Defining DAX expressions that specify the data each role can access.
- Assigning users to these roles either in Power BI or dynamically through AD group membership.

4. Create a dictionary, add elements to it, modify an element, and then print the dictionary in alphabetical order of keys.

Answer -
d = {'apple': 2, 'banana': 5}
d['orange'] = 3 # Add element
d['apple'] = 4 # Modify element
sorted_d = dict(sorted(d.items())) # Sort dictionary
print(sorted_d)


5. Find and print duplicate values in a list of assorted numbers, along with the number of times each value is repeated.

Answer -
from collections import Counter

numbers = [1, 2, 2, 3, 4, 5, 1, 6, 7, 3, 8, 1]
count = Counter(numbers)
duplicates = {k: v for k, v in count.items() if v > 1}
print(duplicates)
πŸ‘27❀2πŸ€”1
Most asked SQL Interview Questions πŸ’―

1.) Explain order of execution of SQL.
2.) What is difference between where and having?
3.) What is the use of group by?
4.) Explain all types of joins in SQL?
5.) What are triggers in SQL?
6.) What is stored procedure in SQL
7.) Explain all types of window functions?
(Mainly rank, row_num, dense_rank, lead & lag)
8.) What is difference between Delete and Truncate?
9.) What is difference between DML, DDL and DCL?
10.) What are aggregate function and when do we use them? explain with few example.
11.) Which is faster between CTE and Subquery?
12.) What are constraints and types of Constraints?
13.) Types of Keys?
14.) Different types of Operators ?
15.) Difference between Group By and Where?
16.) What are Views?
17.) What are different types of constraints?
18.) What is difference between varchar and nvarchar?
19.) Similar for char and nchar?
20.) What are index and their types?
21.) What is an index? Explain its different types.
22.) List the different types of relationships in SQL.
23.) Differentiate between UNION and UNION ALL.
24.) How many types of clauses in SQL?
25.) What is the difference between UNION and UNION ALL in SQL?
26.) What are the various types of relationships in SQL?
27.) Difference between Primary Key and Secondary Key?
28.) What is the difference between where and having?
29.) Find the second highest salary of an employee?
30.) Write retention query in SQL?
31.) Write year-on-year growth in SQL?
32.) Write a query for cummulative sum in SQL?
33.) Difference between Function and Store procedure ?
34.) Do we use variable in views?
35.) What are the limitations of views?

Like this post if you need more πŸ‘β€οΈ

Hope it helps :)
πŸ‘57❀7πŸ‘5πŸŽ‰1
Data Analyst Interview Resources
https://youtu.be/1-T-VBjLpJo?si=fo_RhbXC46Hg-FVE
I have kept the language as English so that everyone can understand. Please bear with my voice & video editing skills as I am pretty new to all this 😁
πŸ‘7❀5
You’re not a failure as a data analyst if:

β€’ It takes you more than two months to land a job (remove the time expectation!)

β€’ Complex concepts don’t immediately sink in

β€’ You use Google/YouTube daily on the job (this is a sign you’re successful, actually)

β€’ You don’t make as much money as others in the field

β€’ You don’t code in 12 different languages (SQL is all you need. Add Python later if you want.)
πŸ‘15πŸŽ‰1πŸ‘Œ1
Essential SQL topics & free resources to practice sql.
πŸ‘‡πŸ‘‡
https://youtu.be/VCZxODefTIs?si=1XB44uv5DIpcJA4K

Please like this video & subscribe my youtube channel so that I can bring more awesome videos. I would really appreciate any feedback in the comments :)
πŸ‘4
Data Analyst Interview QnA

1. Find avg of salaries department wise from table.

Answer-
SELECT department_id, AVG(salary) AS avg_salary
FROM employees
GROUP BY department_id;


2. What does Filter context in DAX mean?

Answer - Filter context in DAX refers to the subset of data that is actively being used in the calculation of a measure or in the evaluation of an expression. This context is determined by filters on the dashboard items like slicers, visuals, and filters pane which restrict the data being processed.

3. Explain how to implement Row-Level Security (RLS) in Power BI.

Answer - Row-Level Security (RLS) in Power BI can be implemented by:

- Creating roles within the Power BI service.
- Defining DAX expressions that specify the data each role can access.
- Assigning users to these roles either in Power BI or dynamically through AD group membership.

4. Create a dictionary, add elements to it, modify an element, and then print the dictionary in alphabetical order of keys.

Answer -
d = {'apple': 2, 'banana': 5}
d['orange'] = 3 # Add element
d['apple'] = 4 # Modify element
sorted_d = dict(sorted(d.items())) # Sort dictionary
print(sorted_d)


5. Find and print duplicate values in a list of assorted numbers, along with the number of times each value is repeated.

Answer -
from collections import Counter

numbers = [1, 2, 2, 3, 4, 5, 1, 6, 7, 3, 8, 1]
count = Counter(numbers)
duplicates = {k: v for k, v in count.items() if v > 1}
print(duplicates)


Like ❀️ & Share the post if you want me to post more similar content. 😊
πŸ‘14❀7πŸ₯°1πŸ‘1πŸŽ‰1
Most Important Python Topics for Data Analyst Interview:

#Basics of Python:

1. Data Types

2. Lists

3. Dictionaries

4. Control Structures:

- if-elif-else

- Loops

5. Functions

6. Practice basic FAQs questions, below mentioned are few examples:

- How to reverse a string in Python?

- How to find the largest/smallest number in a list?

- How to remove duplicates from a list?

- How to count the occurrences of each element in a list?

- How to check if a string is a palindrome?

#Pandas:

1. Pandas Data Structures (Series, DataFrame)

2. Creating and Manipulating DataFrames

3. Filtering and Selecting Data

4. Grouping and Aggregating Data

5. Handling Missing Values

6. Merging and Joining DataFrames

7. Adding and Removing Columns

8. Exploratory Data Analysis (EDA):

- Descriptive Statistics

- Data Visualization with Pandas (Line Plots, Bar Plots, Histograms)

- Correlation and Covariance

- Handling Duplicates

- Data Transformation

#Numpy:

1. NumPy Arrays

2. Array Operations:

- Creating Arrays

- Slicing and Indexing

- Arithmetic Operations

#Integration with Other Libraries:

1. Basic Data Visualization with Pandas (Line Plots, Bar Plots)

#Key Concepts to Revise:

1. Data Manipulation with Pandas and NumPy

2. Data Cleaning Techniques

3. File Handling (reading and writing CSV files, JSON files)

4. Handling Missing and Duplicate Values

5. Data Transformation (scaling, normalization)

6. Data Aggregation and Group Operations

7. Combining and Merging Datasets
πŸ‘25❀7πŸ€”2πŸ‘1πŸ‘Œ1
Power Bi interview prep

1. What is Power BI?

Answer: Power BI is a business analytics tool by Microsoft that helps to visualize data, share insights, and turn raw data into actionable insights through interactive dashboards and reports.

2. What are the components of Power BI?

Answer:
- Power BI Desktop: A tool to create reports and dashboards.

- Power BI Service: An online SaaS (Software as a Service) platform to share and collaborate on reports.

- Power BI Mobile: Apps for mobile devices to view reports on the go.
- Power BI Gateway: Connects on-premise data sources with Power BI Service for real-time updates.

- Power BI Report Server: An on-premise server for hosting Power BI reports.

3. What is a dashboard in Power BI?

Answer: A dashboard is a single-page, often called a canvas, that shows visualizations or tiles based on one or more datasets. It provides a summary view and can be shared with others.

4. What is DAX in Power BI?

Answer: DAX stands for Data Analysis Expressions. It is a formula language used in Power BI to create custom calculations and logic, similar to Excel formulas.

5. What are the different types of filters in Power BI?

Answer:
- Visual-level filters: Apply to a single visualization.

- Page-level filters: Apply to all the visualizations on a single page.

- Report-level filters: Apply to all pages in a report.

6. What is a calculated column in Power BI?

Answer: A calculated column is a new column that you add to a table using a DAX formula. It’s useful when you need to create new data from existing data in your dataset.

7. What is a Power Query?

Answer: Power Query is a data connection technology that allows you to discover, connect, combine, and refine data across a wide range of sources.

8. What is the difference between a calculated column and a measure in Power BI?

Answer:
- Calculated Column: A new column created in a table using a DAX formula. The values are calculated row by row.

- Measure: A calculation performed on data aggregated over many rows. Measures are used in visualizations like totals, averages, and percentages.

9. What are Power BI dataflows?

Answer: Dataflows are a collection of tables created and managed in the Power BI service, where you can ingest, transform, and store data in a cloud environment.

10. What is the use of the Power BI gateway?

Answer: The Power BI Gateway is used to connect on-premise data sources securely with Power BI service, allowing for real-time data refreshes.

11. How do you create a relationship between tables in Power BI?

Answer: In Power BI, you can create relationships between tables by linking columns that have common data (like an ID or name). You do this in the "Model" view by dragging a line between the related columns.

12. What is row-level security (RLS) in Power BI?

Answer: RLS is a feature in Power BI that restricts data access for users based on roles. For example, a user can only see data related to their department.

13. What are the different views in Power BI Desktop?

Answer:
- Report View: Create and view visualizations.

- Data View: View and explore the data in your tables.

- Model View: Create relationships between tables and manage your data model.

14. How can you share reports in Power BI?

Answer: You can share reports in Power BI through the Power BI Service by publishing reports to the web, sharing them directly with others via email, or by creating and sharing dashboards.

15. What is the difference between Power BI and Tableau?

Answer: Both are data visualization tools, but Power BI is more integrated with Microsoft products, offers more affordable pricing, and is easier for users who are already familiar with Microsoft tools. Tableau is known for its advanced visualization capabilities and flexibility but can be more complex and costly.


Join for more: https://whatsapp.com/channel/0029Vai1xKf1dAvuk6s1v22c
πŸ‘28❀8
πŸ‘‰βœ”οΈHere are Data Analytics-related questions along with their answers:

1.Question: What is the purpose of exploratory data analysis (EDA)?

Answer: EDA is used to analyze and summarize data sets, often through visual methods, to understand patterns, relationships, and potential outliers.

2. Question: What is the difference between supervised and unsupervised learning?

Answer: Supervised learning involves training a model on a labeled dataset, while unsupervised learning deals with unlabeled data to discover patterns without explicit guidance.

3.Question: Explain the concept of normalization in the context of data preprocessing.

Answer: Normalization scales numeric features to a standard range, preventing certain features from dominating due to their larger scales.

4. Question: What is the purpose of a correlation coefficient in statistics?

Answer: A correlation coefficient measures the strength and direction of a linear relationship between two variables, ranging from -1 to 1.

5. Question: What is the role of a decision tree in machine learning?

Answer: A decision tree is a predictive model that maps features to outcomes by recursively splitting data based on feature conditions.

6. Question: Define precision and recall in the context of classification models.

Answer: Precision is the ratio of correctly predicted positive observations to the total predicted positives, while recall is the ratio of correctly predicted positive observations to all actual positives.

7. Question: What is the purpose of cross-validation in machine learning?

Answer: Cross-validation assesses a model's performance by dividing the dataset into multiple subsets, training the model on some, and testing it on others, helping to evaluate its generalization ability.

8. Question: Explain the concept of a data warehouse.

Answer: A data warehouse is a centralized repository that stores, integrates, and manages large volumes of data from different sources, providing a unified view for analysis and reporting.

9. Question: What is the difference between structured and unstructured data?

Answer: Structured data is organized and easily searchable (e.g., databases), while unstructured data lacks a predefined structure (e.g., text documents, images).

10. Question: What is clustering in machine learning?

Answer: Clustering is a technique that groups similar data points together based on certain features, helping to identify patterns or relationships within the data.
πŸ‘17❀6
Q. Explain the data preprocessing steps in data analysis.

Ans. Data preprocessing transforms the data into a format that is more easily and effectively processed in data mining, machine learning and other data science tasks.
1. Data profiling.
2. Data cleansing.
3. Data reduction.
4. Data transformation.
5. Data enrichment.
6. Data validation.

Q. What Are the Three Stages of Building a Model in Machine Learning?

Ans. The three stages of building a machine learning model are:

Model Building: Choosing a suitable algorithm for the model and train it according to the requirement

Model Testing: Checking the accuracy of the model through the test data

Applying the Model: Making the required changes after testing and use the final model for real-time projects


Q. What are the subsets of SQL?

Ans. The following are the four significant subsets of the SQL:

Data definition language (DDL): It defines the data structure that consists of commands like CREATE, ALTER, DROP, etc.

Data manipulation language (DML): It is used to manipulate existing data in the database. The commands in this category are SELECT, UPDATE, INSERT, etc.

Data control language (DCL): It controls access to the data stored in the database. The commands in this category include GRANT and REVOKE.

Transaction Control Language (TCL): It is used to deal with the transaction operations in the database. The commands in this category are COMMIT, ROLLBACK, SET TRANSACTION, SAVEPOINT, etc.


Q. What is a Parameter in Tableau? Give an Example.

Ans. A parameter is a dynamic value that a customer could select, and you can use it to replace constant values in calculations, filters, and reference lines.
For example, when creating a filter to show the top 10 products based on total profit instead of the fixed value, you can update the filter to show the top 10, 20, or 30 products using a parameter.
πŸ‘13❀2πŸ‘Œ1
πŸ–₯ SQL Database vs NoSQL Database
πŸ‘8❀2