Interview Questions and Answers For Data Analysts
2. Can you describe how you currently use SQL or have used it in the past?
Example Answer: I currently use it in my role to query tables and views to analyze data. I also
use it to create Stored Procedures to automatically send my reports to clients. I work with our
programming team to help optimize scripts in SQL. Lastly, I use SQL to help with acquiring data,
cleaning data, and modeling it for use in our products.
3. Have you ever used a cloud-based platform? If so, how have you used
it?
Example Answer: I currently use Azure, but I have used AWS in the past. I use Azure Data Lake to
store large amounts of data. I also use Databricks to automatically ingest flat files from clients.
Broad Questions
1. What was your most challenging data analyst project?
Example Answer: In March we were submitting MIPS data for one of our largest clients. We
were on a very tight deadline, and one of our contract workers, who was based in Canada, was not
answering emails or calls. We needed him to help analyze a very large data set, and when we
couldn’t reach him we had to find another solution. I ended up working with our Director of
Data Analytics in an all-night session to get the analytics to our client in time to prepare
their MIPS submission. We successfully completed the project and submitted their data on
time.
2. Describe a time when you had to work with a group on a Data Analyst
Project.
Example Answer: We were working with a client in North Carolina who needed a gap analysis
of their current on-prem system compared to their new cloud-based system. We had to
work with a small team in North Carolina as well as our team at our company to get access to
their data from both locations. The team lead on our side gave us our tasks and we worked for
several weeks to give them an in-depth analysis of their data.
4. Describe a time when you’ve made a mistake on a project and how you
handled it.
Example Answer: I was working with a client to get them an analysis of some data they sent us.
At the time we had two servers with almost the same configuration. One was a historical server
and the other was an active server that received a daily feed. I spent a few days on the analysis
and sent it to the client, who emailed back saying the numbers looked off. I quickly realized
that I had accidentally run my analysis on the historical server and not the active server. I
apologized to the client and said I would send him an analysis with the correct data.
Luckily, I had a good relationship with the client, and he said it was fine and asked for the new
analysis as soon as I could provide it. I sent it to him shortly afterward and was still
able to meet the deadline that we had set.
2. How would you write a query that would only select unique records in a
column?
Answer:
SELECT DISTINCT Column
FROM Table
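To try the DISTINCT query above without a database server, here is a minimal sketch using Python's built-in sqlite3 module. The Prescriptions table and its sample rows are hypothetical and exist only for illustration.

```python
import sqlite3

# Hypothetical in-memory table; the name and rows are illustrative only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Prescriptions (Drug_Name TEXT)")
conn.executemany(
    "INSERT INTO Prescriptions (Drug_Name) VALUES (?)",
    [("Aspirin",), ("Ibuprofen",), ("Aspirin",), ("Metformin",)],
)

# DISTINCT collapses duplicate values so each one is returned once.
rows = conn.execute(
    "SELECT DISTINCT Drug_Name FROM Prescriptions ORDER BY Drug_Name"
).fetchall()
unique_drugs = [row[0] for row in rows]
print(unique_drugs)  # ['Aspirin', 'Ibuprofen', 'Metformin']
```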
FROM table
GROUP BY Drug_Name
4. What does GROUP BY (Statement) do in a Query? And why would you use
it?
Example Answer: The GROUP BY statement groups rows that have the same value into summary
rows. It is typically used with aggregate functions (such as COUNT, SUM, or AVG) to summarize
the data in each group in a more organized manner.
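As a rough illustration of GROUP BY paired with an aggregate function, the sketch below counts how many times each drug appears. It uses Python's sqlite3 module; the Prescriptions table and its rows are assumptions made up for the example.

```python
import sqlite3

# Hypothetical sample data: one row per prescription.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Prescriptions (Drug_Name TEXT)")
conn.executemany(
    "INSERT INTO Prescriptions (Drug_Name) VALUES (?)",
    [("Aspirin",), ("Metformin",), ("Aspirin",),
     ("Ibuprofen",), ("Metformin",), ("Aspirin",)],
)

# GROUP BY produces one summary row per distinct Drug_Name;
# COUNT(*) is evaluated separately within each group.
drug_counts = conn.execute(
    """
    SELECT Drug_Name, COUNT(*) AS Times_Prescribed
    FROM Prescriptions
    GROUP BY Drug_Name
    ORDER BY Drug_Name
    """
).fetchall()
print(drug_counts)  # [('Aspirin', 3), ('Ibuprofen', 1), ('Metformin', 2)]
```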
5. I have a column called Drug_Name. I want to look at Drugs that start with
“Aspirin”. How would you only return drug names that start with Aspirin?
(Answer: use the LIKE operator.)
Answer:
SELECT Drug_Name
FROM table
WHERE Drug_Name LIKE 'Aspirin%'
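The prefix match this question asks for can be sketched with Python's sqlite3 module. The Drugs table and its rows are hypothetical. Note that case sensitivity of LIKE varies by database; in SQLite it is case-insensitive for ASCII characters by default.

```python
import sqlite3

# Hypothetical table of drug names, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Drugs (Drug_Name TEXT)")
conn.executemany(
    "INSERT INTO Drugs (Drug_Name) VALUES (?)",
    [("Aspirin",), ("Aspirin 81mg",), ("Baby Aspirin",), ("Ibuprofen",)],
)

# '%' matches any sequence of characters, so 'Aspirin%' means
# "starts with Aspirin". 'Baby Aspirin' does not match because the
# pattern has no leading wildcard.
pattern = "Aspirin%"
rows = conn.execute(
    "SELECT Drug_Name FROM Drugs WHERE Drug_Name LIKE ? ORDER BY Drug_Name",
    (pattern,),
).fetchall()
aspirin_drugs = [row[0] for row in rows]
print(aspirin_drugs)  # ['Aspirin', 'Aspirin 81mg']
```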
Intermediate
1. What is a subquery and can you describe how you would write that?
Example Answer: A Subquery is a query nested inside of a larger query.
I would write it like this:
SELECT *
FROM Table
WHERE UserID IN
(SELECT UserID
FROM Table2)
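A runnable version of this subquery pattern might look like the sketch below, which uses Python's sqlite3 module. The Users and Orders tables stand in for Table and Table2 and are purely hypothetical.

```python
import sqlite3

# Hypothetical tables standing in for Table and Table2.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Users (UserID INTEGER, Name TEXT)")
conn.execute("CREATE TABLE Orders (OrderID INTEGER, UserID INTEGER)")
conn.executemany(
    "INSERT INTO Users VALUES (?, ?)", [(1, "Ana"), (2, "Ben"), (3, "Cy")]
)
conn.executemany(
    "INSERT INTO Orders VALUES (?, ?)", [(100, 1), (101, 3), (102, 3)]
)

# The inner query runs first and returns the UserIDs that have orders;
# the outer query keeps only users whose UserID is in that set.
rows = conn.execute(
    """
    SELECT Name
    FROM Users
    WHERE UserID IN (SELECT UserID FROM Orders)
    ORDER BY UserID
    """
).fetchall()
users_with_orders = [row[0] for row in rows]
print(users_with_orders)  # ['Ana', 'Cy']
```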
2. What is a join and what data would be returned if you use an inner join?
Example Answer:
A join combines rows from two tables into a single output. An inner join returns only the rows
that match (are common) between both tables. For example, if Table 1 has a, b, and c and Table 2
has b and c, only b and c will be returned because a is only in Table 1.
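The a, b, c example above can be checked with a small sketch using Python's sqlite3 module. The table names Table1 and Table2 and the column name val are assumptions chosen for the illustration.

```python
import sqlite3

# Table1 holds a, b, c; Table2 holds b, c (as in the example answer).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Table1 (val TEXT)")
conn.execute("CREATE TABLE Table2 (val TEXT)")
conn.executemany("INSERT INTO Table1 VALUES (?)", [("a",), ("b",), ("c",)])
conn.executemany("INSERT INTO Table2 VALUES (?)", [("b",), ("c",)])

# INNER JOIN keeps only the rows whose join key exists in both tables.
rows = conn.execute(
    """
    SELECT Table1.val
    FROM Table1
    INNER JOIN Table2
    ON Table1.val = Table2.val
    ORDER BY Table1.val
    """
).fetchall()
common_values = [row[0] for row in rows]
print(common_values)  # ['b', 'c']
```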
6. I have 2 tables. One table contains patient information and the other
contains Drug information. In the Patient table we have PatientID,
First_Name, Last_Name, and Disease. In the Drug table we have PatientID,
Dispensed_drug, Date_dispensed. Can you create a Query to return the
PatientID, Disease, and Dispensed_drug?
Answer:
SELECT Patient.PatientID, Disease, Dispensed_drug
FROM Patient
JOIN Drug
ON Patient.PatientID = Drug.PatientID
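The join this question describes can be sketched end to end with Python's sqlite3 module. The column names follow the question; the sample patients and drugs are invented for illustration. PatientID is qualified with the table name because it exists in both tables.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE Patient (PatientID INTEGER, First_Name TEXT, "
    "Last_Name TEXT, Disease TEXT)"
)
conn.execute(
    "CREATE TABLE Drug (PatientID INTEGER, Dispensed_drug TEXT, "
    "Date_dispensed TEXT)"
)
# Made-up sample rows; patient 3 has no dispensed drugs.
conn.executemany(
    "INSERT INTO Patient VALUES (?, ?, ?, ?)",
    [(1, "Ann", "Lee", "Diabetes"),
     (2, "Bob", "Ray", "Hypertension"),
     (3, "Cal", "Fox", "Asthma")],
)
conn.executemany(
    "INSERT INTO Drug VALUES (?, ?, ?)",
    [(1, "Metformin", "2024-01-05"),
     (2, "Lisinopril", "2024-01-06"),
     (1, "Insulin", "2024-02-01")],
)

# A plain JOIN is an inner join: patients without a matching Drug row
# are left out, and patients with several drugs appear once per drug.
patient_drugs = conn.execute(
    """
    SELECT Patient.PatientID, Patient.Disease, Drug.Dispensed_drug
    FROM Patient
    JOIN Drug
    ON Patient.PatientID = Drug.PatientID
    ORDER BY Patient.PatientID, Drug.Dispensed_drug
    """
).fetchall()
for row in patient_drugs:
    print(row)
```

If patients with no dispensed drugs should also appear, a LEFT JOIN from Patient would keep them with a NULL Dispensed_drug.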
Difficult
1. What are sys tables or System tables?
Example Answer: sys.tables is a system catalog view in SQL Server that maintains information on
the tables in a database. For every table added to the database, a row is created in sys.tables.
There is only one row for each table, and it contains information such as the table name, object ID,
create date, modify date, etc. The object ID is unique, so we can use it to join sys.tables
with other system views (such as sys.columns) in order to fetch column details.
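sys.tables and sys.columns are specific to SQL Server, so they cannot be run in a standalone script. As a loose analogue only, the sketch below uses SQLite's own catalog (the sqlite_master table and PRAGMA table_info) through Python's sqlite3 module to show the same idea: list the tables, then look up each table's columns. The Patient and Drug tables are hypothetical.

```python
import sqlite3

# Two hypothetical tables so the catalog has something to describe.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Patient (PatientID INTEGER, Disease TEXT)")
conn.execute("CREATE TABLE Drug (PatientID INTEGER, Dispensed_drug TEXT)")

# sqlite_master holds one row per schema object; filtering on
# type = 'table' is SQLite's rough counterpart to reading sys.tables.
table_names = [
    row[0]
    for row in conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    )
]

# PRAGMA table_info returns one row per column; index 1 is the column
# name. This plays the role of joining to sys.columns in SQL Server.
columns_by_table = {
    name: [col[1] for col in conn.execute(f"PRAGMA table_info({name})")]
    for name in table_names
}
print(table_names)
print(columns_by_table)
```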