Data Analyst Interview Resources
51.7K subscribers
255 photos
1 video
53 files
322 links
Join our telegram channel to learn how data analysis can reveal fascinating patterns, trends, and stories hidden within the numbers! ๐Ÿ“Š

For ads & suggestions: @love_data
Download Telegram
Here's one of the most useful SQL keywords you've never heard of:


GROUP BY CUBE

Think of GROUP BY CUBE as the ultimate grouping keyword. It essentially contains GROUP BY ROLLUP and GROUP BY GROUPING SETS inside of it.

How does it work?

Take our example dataset: the Citibike trips dataset. This data contains Citibike trips taken by riders in NYC.

Let's say you're grouping by three different columns: year, month, and station name. You're doing a COUNT() of trips taken.

What are all the possible groupings we could come up with?
1. Total trips overall
2. Trips from each station
3. Trips in each month
4. Trips in each year
5. Trips in each month + each station
6. Trips in each year + each station
7. Trips in each year + each month
8. Trips in each year + each month + each station

GROUP BY CUBE will perform all of these grouping combinations for you automatically.

Usage is super simple. All you need to do is type GROUP BY CUBE along with the names of the columns you'd like to group by.
๐Ÿ‘9
Data Analyst Interview Preparation Guide
๐Ÿ‘11
SQL Interview Ques & ANS ๐Ÿ’ฅ
โค3
๐‡๐จ๐ฐ ๐ญ๐จ ๐ฉ๐ซ๐š๐œ๐ญ๐ข๐œ๐ž ๐๐š๐ญ๐š ๐ฏ๐š๐ฅ๐ข๐๐š๐ญ๐ข๐จ๐ง ๐š๐ฌ ๐š๐ง ๐š๐ฌ๐ฉ๐ข๐ซ๐ข๐ง๐  ๐๐š๐ญ๐š ๐š๐ง๐š๐ฅ๐ฒ๐ฌ๐ญ?

Here's a step-by-step guide for the same:

Step 1๏ธโƒฃ - Download a practice dataset. I'd recommend the Codebasics resume project challenge dataset (as it contains multi-table datasets).

Step 2๏ธโƒฃ - Open your preferred RDBMS tool (SQL server/MySQL). Create a local database to load the dataset.

Step 3๏ธโƒฃ - Import the practice dataset (.xlsx/.csv) into this database by creating the tables (please google if you need help).

Step 4๏ธโƒฃ - Now open Power BI desktop and connect to the local database using the appropriate connector.

Step 5๏ธโƒฃ - Build the dashboard using the questions shared in the resume project challenge.

Step 6๏ธโƒฃ - Now, you can validate the output of your dashboard by writing SQL queries.

Step 7๏ธโƒฃ - Try to write an SQL query for a question asked in the challenge. You need to convert a natural language question into an SQL query.

Step 8๏ธโƒฃ - Compare the query output with the dashboard output and check if the numbers are matching. If they aren't matching, either the query is wrong or the dashboard numbers are wrong. Hence, try to identify the gap.

Step 9๏ธโƒฃ - Repeat the process for every question asked in the challenge.

Thus, you will learn and practice both SQL and Power BI simultaneously.

๐–๐ก๐ฒ ๐ฌ๐ก๐จ๐ฎ๐ฅ๐ ๐ฒ๐จ๐ฎ ๐ญ๐ซ๐ฒ ๐ญ๐ก๐ข๐ฌ ๐ฆ๐ž๐ญ๐ก๐จ๐?

In real-world scenarios, ๐๐š๐ญ๐š ๐ฏ๐š๐ฅ๐ข๐๐š๐ญ๐ข๐จ๐ง is a very important step in every analytics project. One needs to compare the output of the report/dashboard with the data source and then launch it for usage, to avoid discrepancies.

This will help you weed out any mistakes that you have applied in your report/dashboard logic.

Best Telegram Channel for Data Analysts: https://t.iss.one/sqlspecialist
๐Ÿ‘4
Q1: How do you ensure data consistency and integrity in a data warehousing environment?

Ans: I implement data validation checks, use constraints like primary and foreign keys, and ensure that ETL processes have error-handling mechanisms. Regular audits and data reconciliation processes are also set up to ensure data accuracy and consistency.

Q2: Describe a situation where you had to design a star schema for a data warehousing project.

Ans: For a retail sales data warehousing project, I designed a star schema with a central fact table containing sales transactions. Surrounding this were dimension tables like Products, Stores, Time, and Customers. This structure allowed for efficient querying and reporting of sales metrics across various dimensions.

Q3: How would you use data analytics to assess credit risk for loan applicants?

Ans: I'd analyze the applicant's financial history, including credit score, income, employment stability, and existing debts. Using predictive modeling, I'd assess the probability of default based on historical data of similar applicants. This would help in making informed lending decisions.

Q4: Describe a situation where you had to ensure data security for sensitive financial data.

Ans: While working on a project involving customer transaction data, I ensured that all data was encrypted both at rest and in transit. I also implemented role-based access controls, ensuring that only authorized personnel could access specific data sets. Regular audits and penetration tests were conducted to identify and rectify potential vulnerabilities.
๐Ÿ‘6โค1
Q1: How would you analyze data to understand user connection patterns on a professional network? 

Ans: I'd use graph databases like Neo4j for social network analysis. By analyzing connection patterns, I can identify influencers or isolated communities.

Q2: Describe a challenging data visualization you created to represent user engagement metrics. 

Ans: I visualized multi-dimensional data showing user engagement across features, regions, and time using tools like D3.js, creating an interactive dashboard with drill-down capabilities.

Q3: How would you identify and target passive job seekers on LinkedIn? 

Ans: I'd analyze user behavior patterns, like increased profile updates, frequent visits to job postings, or engagement with career-related content, to identify potential passive job seekers.

Q4: How do you measure the effectiveness of a new feature launched on LinkedIn? 


Ans: I'd set up A/B tests, comparing user engagement metrics between those who have access to the new feature and a control group. I'd then analyze metrics like time spent, feature usage frequency, and overall platform engagement to measure effectiveness.
๐Ÿ‘4๐Ÿ‘Œ1
Data Analyst Interview Questions ๐Ÿ‘‡

1.How to create filters in Power BI?

Filters are an integral part of Power BI reports. They are used to slice and dice the data as per the dimensions we want. Filters are created in a couple of ways.

Using Slicers: A slicer is a visual under Visualization Pane. This can be added to the design view to filter our reports. When a slicer is added to the design view, it requires a field to be added to it. For example- Slicer can be added for Country fields. Then the data can be filtered based on countries.
Using Filter Pane: The Power BI team has added a filter pane to the reports, which is a single space where we can add different fields as filters. And these fields can be added depending on whether you want to filter only one visual(Visual level filter), or all the visuals in the report page(Page level filters), or applicable to all the pages of the report(report level filters)


2.How to sort data in Power BI?

Sorting is available in multiple formats. In the data view, a common sorting option of alphabetical order is there. Apart from that, we have the option of Sort by column, where one can sort a column based on another column. The sorting option is available in visuals as well. Sort by ascending and descending option by the fields and measure present in the visual is also available.


3.How to convert pdf to excel?

Open the PDF document you want to convert in XLSX format in Acrobat DC.
Go to the right pane and click on the โ€œExport PDFโ€ option.
Choose spreadsheet as the Export format.
Select โ€œMicrosoft Excel Workbook.โ€
Now click โ€œExport.โ€
Download the converted file or share it.


4. How to enable macros in excel?

Click the file tab and then click โ€œOptions.โ€
A dialog box will appear. In the โ€œExcel Optionsโ€ dialog box, click on the โ€œTrust Centerโ€ and then โ€œTrust Center Settings.โ€
Go to the โ€œMacro Settingsโ€ and select โ€œenable all macros.โ€
Click OK to apply the macro settings.
๐Ÿ‘3