Data Analyst Interview Resources
51.3K subscribers
254 photos
1 video
51 files
317 links
Join our telegram channel to learn how data analysis can reveal fascinating patterns, trends, and stories hidden within the numbers! πŸ“Š

For ads & suggestions: @love_data
Download Telegram
1. Define the term 'Data Wrangling.

Data Wrangling is the process wherein raw data is cleaned, structured, and enriched into a desired usable format for better decision making. It involves discovering, structuring, cleaning, enriching, validating, and analyzing data. This process can turn and map out large amounts of data extracted from various sources into a more useful format.

2. What are the best methods for data cleaning?

Create a data cleaning plan by understanding where the common errors take place and keep all the communications open. Before working with the data, identify and remove the duplicates. This will lead to an easy and effective data analysis process.Focus on the accuracy of the data. Set cross-field validation, maintain the value types of data, and provide mandatory constraints.Normalize the data at the entry point so that it is less chaotic. You will be able to ensure that all information is standardized, leading to fewer errors on entry.


3. Explain the Type I and Type II errors in Statistics?

In Hypothesis testing, a Type I error occurs when the null hypothesis is rejected even if it is true. It is also known as a false positive.

A Type II error occurs when the null hypothesis is not rejected, even if it is false. It is also known as a false negative.

4. How do you make a dropdown list in MS Excel?

First, click on the Data tab that is present in the ribbon.Under the Data Tools group, select Data Validation.Then navigate to Settings > Allow > List.Select the source you want to provide as a list array.

5. State some ways to improve the performance of Tableau?

Use an Extract to make workbooks run faster.
Reduce the scope of data to decrease the volume of data.
Reduce the number of marks on the view to avoid information overload.
Hide unused fields.
Use Context filters.
Use indexing in tables and use the same fields for filtering.
Remove unnecessary calculations and sheets.
πŸ‘13❀4
1. What are Query and Query language?

A query is nothing but a request sent to a database to retrieve data or information. The required data can be retrieved from a table or many tables in the database.

Query languages use various types of queries to retrieve data from databases. SQL, Datalog, and AQL are a few examples of query languages; however, SQL is known to be the widely used query language.



2. What are Superkey and candidate key?

A super key may be a single or a combination of keys that help to identify a record in a table. Know that Super keys can have one or more attributes, even though all the attributes are not necessary to identify the records.

A candidate key is the subset of Superkey, which can have one or more than one attributes to identify records in a table. Unlike Superkey, all the attributes of the candidate key must be helpful to identify the records.


3. What do you mean by buffer pool and mention its benefits?

A buffer pool in SQL is also known as a buffer cache. All the resources can store their cached data pages in a buffer pool. The size of the buffer pool can be defined during the configuration of an instance of SQL Server.
The following are the benefits of a buffer pool:

Increase in I/O performance
Reduction in I/O latency
Increase in transaction throughput
Increase in reading performance


4. What is the difference between Zero and NULL values in SQL?

When a field in a column doesn’t have any value, it is said to be having a NULL value. Simply put, NULL is the blank field in a table. It can cancel be considered as an unassigned, unknown, or unavailable value. On the contrary, zero is a number, and it is an available, assigned, and known value.
πŸ‘19❀4πŸŽ‰1
Data Analysis with Excel
πŸ‘‡πŸ‘‡
https://t.iss.one/excel_analyst/2

Power BI DAX Functions
πŸ‘‡πŸ‘‡
https://t.iss.one/PowerBI_analyst/2

All about SQL
πŸ‘‡πŸ‘‡
https://t.iss.one/sqlanalyst/29

Python for data analysis
πŸ‘‡πŸ‘‡
https://t.iss.one/pythonanalyst/26

Statistics Book and other useful resources
πŸ‘‡πŸ‘‡
https://t.iss.one/DataAnalystInterview/34
πŸ‘18πŸ‘6πŸ‘Œ2❀1πŸ₯°1πŸ€”1πŸŽ‰1
Hi Guys,

Here are some of the telegram channels which may help you in data analytics journey πŸ‘‡πŸ‘‡

SQL: https://t.iss.one/sqlanalyst

Power BI & Tableau:
https://t.iss.one/PowerBI_analyst

Excel:
https://t.iss.one/excel_analyst

Python:
https://t.iss.one/dsabooks

Jobs:
https://t.iss.one/jobs_SQL

Data Science:
https://t.iss.one/datasciencefree

Artificial intelligence:
https://t.iss.one/machinelearning_deeplearning

Data Engineering:
https://t.iss.one/sql_engineer

Hope it helps :)
πŸ€”7πŸ‘5❀4πŸ₯°2πŸ‘Œ2
πŸ‘12❀4πŸ€”1
These are the top 5 skills (I think) you need as an entry-level data analyst:

1. Excel. It may not be fancy but it's still one of the most used tools in the business world. I can guarantee you will use it at some point.

2. SQL. You may not actually use SQL but it's worth learning. It's the language of databases and gives you a strong foundation for working with other data analysis tools.

3. A data viz tool. Look, I don't care if you learn Power BI, Tableau, or any other data viz tool. You need to be able to communicate insights in a way that makes sense to non-technical people.

4. Communication. This may actually be the most important skill. It doesn't matter if you can analyze data if you can't communicate why that analysis should matter.

5. Problem solving. You use data to answer business questions and...wait for it... solve problems. It's an absolutely essential skill to have.

The best part of this is that you very likely already have 2, if not 3, of these in a pretty good place.

Focus your efforts on the skills that will make a difference.
πŸ‘14❀4πŸ₯°1
Struggling to stay motivated in your job search?

Try setting input goals first, then shift to output goals once you’re consistent.

Let me explain how this works with a real-life example.

Input Goals vs. Output Goals:

When starting, focus on input goals to build consistency.

For instance, if you're struggling to go to the gym, set a goal to show up every other day rather than aiming to lose 50 pounds.

Once you’re consistent, shift to output goals like losing 5 pounds a month.

Why This Works:

- Focus and Pressure: Output goals create a sense of urgency and focus.
- Efficiency: You find faster and more effective ways to achieve your goals.
- Persistence: Sticking with a strategy until it works builds resilience and problem-solving skills.

Action Time:

1) Start with Input Goals: If you're struggling with consistency, set small, manageable goals to build habits.

2) Shift to Output Goals: Once you’re consistent, set specific, measurable outcomes.

3) Don't Quit: Commit to your goals and find ways to make them work.
πŸ‘12❀4πŸ€”1πŸŽ‰1
Many people ask this common question β€œCan I get a job with just SQL and Excel?” or β€œCan I get a job with just Power BI and Python?”.

The answer to all of those questions is yes.

There are jobs that use only SQL, Tableau, Power BI, Excel, Python, or R or some combination of those.

However, the combination of tools you learn impacts the total number of jobs you are qualified for.

For example, let’s say with just SQL and Excel you are qualified for 10 jobs, but if you add Tableau to that, you are qualified for 50 jobs.

If you have a success rate of landing a job you’re qualified for of 4%, having 5 times as many jobs to go for greatly improves your odds of landing a job.

Does this mean you should go out there and learn every single skill any data analyst job requires?

NO!

It’s about finding the core tools that many jobs want.

And, in my opinion, those tools are SQL, Excel, and a visualization tool.

With these three tools, you are qualified for the majority of entry level data jobs and many higher level jobs.

So, you can land a job with whatever tools you’re comfortable with.

But if you have the three tools above in your toolbelt, you will have many more jobs to apply for and greatly improve your chances of snagging one.
πŸ‘7❀2
𝐋𝐒𝐬𝐭 𝐨𝐟 𝐜𝐨𝐦𝐩𝐚𝐧𝐒𝐞𝐬 𝐭𝐑𝐚𝐭 𝐑𝐒𝐫𝐞 𝐝𝐚𝐭𝐚 𝐚𝐧𝐚π₯𝐲𝐬𝐭𝐬:
TMcKinsey & Company
Boston Consulting Group (BCG)
Bain & Company
Deloitte
PwC
Ernst & Young (EY)
KPMG
Accenture
Google
Amazon
Microsoft
IBM
Oracle
Tiger Analytics
Mu Sigma
Fractal Analytics
EXL Service
ZS Associates
Wells Fargo
Walmart
Target
LTIMindtree
Infosys
TCS (Tata Consultancy Services)
Wipro
HCL Technologies
Capgemini
Cognizant

These companies often hire data analysts to use data for making decisions and planning strategically for their clients.
πŸ‘34
Data Analyst Interview Questions.pdf
81.4 KB
Data Analyst Interview Questions
πŸ‘7❀1
Data Analyst Interview QnA

1. Find avg of salaries department wise from table.

Answer-
SELECT department_id, AVG(salary) AS avg_salary
FROM employees
GROUP BY department_id;


2. What does Filter context in DAX mean?

Answer - Filter context in DAX refers to the subset of data that is actively being used in the calculation of a measure or in the evaluation of an expression. This context is determined by filters on the dashboard items like slicers, visuals, and filters pane which restrict the data being processed.

3. Explain how to implement Row-Level Security (RLS) in Power BI.

Answer - Row-Level Security (RLS) in Power BI can be implemented by:

- Creating roles within the Power BI service.
- Defining DAX expressions that specify the data each role can access.
- Assigning users to these roles either in Power BI or dynamically through AD group membership.

4. Create a dictionary, add elements to it, modify an element, and then print the dictionary in alphabetical order of keys.

Answer -
d = {'apple': 2, 'banana': 5}
d['orange'] = 3 # Add element
d['apple'] = 4 # Modify element
sorted_d = dict(sorted(d.items())) # Sort dictionary
print(sorted_d)


5. Find and print duplicate values in a list of assorted numbers, along with the number of times each value is repeated.

Answer -
from collections import Counter

numbers = [1, 2, 2, 3, 4, 5, 1, 6, 7, 3, 8, 1]
count = Counter(numbers)
duplicates = {k: v for k, v in count.items() if v > 1}
print(duplicates)
πŸ‘27❀2πŸ€”1
Most asked SQL Interview Questions πŸ’―

1.) Explain order of execution of SQL.
2.) What is difference between where and having?
3.) What is the use of group by?
4.) Explain all types of joins in SQL?
5.) What are triggers in SQL?
6.) What is stored procedure in SQL
7.) Explain all types of window functions?
(Mainly rank, row_num, dense_rank, lead & lag)
8.) What is difference between Delete and Truncate?
9.) What is difference between DML, DDL and DCL?
10.) What are aggregate function and when do we use them? explain with few example.
11.) Which is faster between CTE and Subquery?
12.) What are constraints and types of Constraints?
13.) Types of Keys?
14.) Different types of Operators ?
15.) Difference between Group By and Where?
16.) What are Views?
17.) What are different types of constraints?
18.) What is difference between varchar and nvarchar?
19.) Similar for char and nchar?
20.) What are index and their types?
21.) What is an index? Explain its different types.
22.) List the different types of relationships in SQL.
23.) Differentiate between UNION and UNION ALL.
24.) How many types of clauses in SQL?
25.) What is the difference between UNION and UNION ALL in SQL?
26.) What are the various types of relationships in SQL?
27.) Difference between Primary Key and Secondary Key?
28.) What is the difference between where and having?
29.) Find the second highest salary of an employee?
30.) Write retention query in SQL?
31.) Write year-on-year growth in SQL?
32.) Write a query for cummulative sum in SQL?
33.) Difference between Function and Store procedure ?
34.) Do we use variable in views?
35.) What are the limitations of views?

Like this post if you need more πŸ‘β€οΈ

Hope it helps :)
πŸ‘57❀7πŸ‘5πŸŽ‰1
Data Analyst Interview Resources
https://youtu.be/1-T-VBjLpJo?si=fo_RhbXC46Hg-FVE
I have kept the language as English so that everyone can understand. Please bear with my voice & video editing skills as I am pretty new to all this 😁
πŸ‘7❀5
You’re not a failure as a data analyst if:

β€’ It takes you more than two months to land a job (remove the time expectation!)

β€’ Complex concepts don’t immediately sink in

β€’ You use Google/YouTube daily on the job (this is a sign you’re successful, actually)

β€’ You don’t make as much money as others in the field

β€’ You don’t code in 12 different languages (SQL is all you need. Add Python later if you want.)
πŸ‘15πŸŽ‰1πŸ‘Œ1
Essential SQL topics & free resources to practice sql.
πŸ‘‡πŸ‘‡
https://youtu.be/VCZxODefTIs?si=1XB44uv5DIpcJA4K

Please like this video & subscribe my youtube channel so that I can bring more awesome videos. I would really appreciate any feedback in the comments :)
πŸ‘4