Data Analyst - Technical Interview

The document provides instructions and questions for a technical test for a data analyst position. It explains that the test should take around 1 hour total and covers topics like logic, programming, SQL, statistics, and explaining a technical concept simply. The test includes questions about cubes and probabilities, writing a Fibonacci number program, writing SQL queries about user and payment data, statistical measures and distributions, and describing a technical concept to a 4-year old child.

Data Analyst – technical test

These questions are designed to test your logical thinking, data modelling, stats and
programming ability.

Some tips:
- The whole test can be completed in an hour but often takes longer. You should not
spend more than 2 hours in total. (Some guidance is given below on how long we
expect you to take on each question, but this will vary a lot depending on your
skillset.)
- An important skill for a data analyst is validating your work, so please check your
answers (and, where appropriate, show how you have checked them).

Once you have completed the test, please email your answers (you can type them in a
Word document, or use pen and paper and email photos of your answers).

Part 1. Logic (10-20 minutes)


A cube is painted Blue on all six sides. It is divided into 125 (=5 x 5 x 5) equal smaller cubes.

Find:

1. The number of smaller cubes having

a) 3 faces coloured?
b) Exactly 2 faces coloured?
c) Exactly 1 face coloured?
d) 0 faces coloured?

2. All 125 cubes are put into a bag. If a single cube is selected at random from
the bag, find the probability of picking a cube having 1 or more Blue faces.

3. What is the average number of Blue faces on a small cube?

In the situation above, N = 5 (with N^3 = 125).

4. For general N, give a formula for the answer to 1 b): the number of smaller cubes with exactly 2 faces coloured.

5. For what values of N is this formula correct?
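
The tips above ask you to show how you checked your answers. One possible way to check the counts in this part is a brute-force tally, sketched here in Python. The function name `painted_face_counts` is made up for this illustration, and the sketch assumes the small cubes are indexed by integer coordinates from 0 to N-1 along each axis.

```python
from collections import Counter


def painted_face_counts(n):
    """Tally the small cubes of an n x n x n painted cube by number of painted faces."""
    counts = Counter()
    for x in range(n):
        for y in range(n):
            for z in range(n):
                # Along each axis a small cube exposes one painted face if it sits
                # at coordinate 0 and one more if it sits at coordinate n - 1
                # (for n == 1 both conditions hold at once).
                faces = sum((c == 0) + (c == n - 1) for c in (x, y, z))
                counts[faces] += 1
    return counts


def average_painted_faces(n):
    """Average number of painted faces per small cube."""
    counts = painted_face_counts(n)
    return sum(faces * k for faces, k in counts.items()) / n ** 3
```

Calling `painted_face_counts(5)` gives a tally to compare against hand-derived answers. Running it for small values such as N = 1, 2 and 3 is a quick way to probe where a general formula holds.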


Part 2. Programming (10-15 minutes)

Write a program which calculates the sum of the first 10 Fibonacci numbers bigger than 1000.
You may use any programming language you like (R, Python, Java, etc.); please state which
language you are using.
[Reminder: Fibonacci numbers are the following series: 0, 1, 1, 2, 3, 5, 8, 13, 21, 34, … where
each term is the sum of the previous two terms]
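
As an illustration only, here is a minimal Python sketch of one way to structure such a program. The names `fibonacci` and `sum_first_k_above` are hypothetical, and "bigger than 1000" is read as strictly greater than 1000.

```python
def fibonacci():
    """Yield the Fibonacci series 0, 1, 1, 2, 3, 5, ... indefinitely."""
    a, b = 0, 1
    while True:
        yield a
        a, b = b, a + b


def sum_first_k_above(threshold, k):
    """Sum the first k Fibonacci numbers strictly greater than threshold."""
    total = 0
    found = 0
    for value in fibonacci():
        if value > threshold:
            total += value
            found += 1
            if found == k:
                return total


if __name__ == "__main__":
    print(sum_first_k_above(1000, 10))
```

A small case that can be verified by hand, such as `sum_first_k_above(1, 3)` (2 + 3 + 5), is a convenient way to validate the logic before trusting the larger result.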

Part 3. SQL (15-30 minutes)


We have the following tables:

user - table with information about registered students

Column name    Data type     Description
id             integer       Unique id of user
date_joined    timestamp     Date and time of registration
country_code   varchar(2)    Country of user (2-letter country abbreviation)

payment - table with information about payments

Column name      Data type   Description
id               integer     Unique id of payment (one student can have from 0 to X payments)
user_id          integer     Id from table user
payment_amount   float       Paid amount in USD
created_at       timestamp   Date and time of the payment

lesson - table with list of lessons

Possible statuses of lessons:

CONFIRMED - lesson happened successfully
SCHEDULED - for future lessons
CANCELED - for lessons that were canceled

Each lesson appears in the table only once; its status is updated in place.

Column name   Data type      Description
id            integer        Unique id of lesson (one student can have from 0 to X lessons)
user_id       integer        Id from table user
status        varchar(255)   Current status of lesson (can be updated)
created_at    timestamp      Date and time of the lesson
hours         integer        Duration of the lesson in hours

Write SQL queries that return:

1. number of registered users by country
2. % of users who made their first payment within 3 days of registration, by country
3. % of users who made their first payment within 3 days of registration and had 2
confirmed lessons within 7 days of registration, by country
4. % of weekly new users who have never made a payment
5. Advanced level (extra point): write the SQL that returns how many hours of confirmed
lessons a specific user (for example user_id=1) has taken between payments.

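
To show how a query can be validated against a tiny dataset, here is a hedged Python sketch using the standard `sqlite3` module. Everything in it beyond the table and column names above is an assumption: the SQLite dialect (including its `datetime()` function), the made-up sample rows, the choice to use all registered users in a country as the denominator, and the names `build_demo_db`, `USERS_BY_COUNTRY` and `FIRST_PAYMENT_WITHIN_3_DAYS`. It covers only queries 1 and 2.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory SQLite database with made-up rows (not from the test)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE "user" (
            id INTEGER PRIMARY KEY,
            date_joined TIMESTAMP,
            country_code VARCHAR(2)
        );
        CREATE TABLE payment (
            id INTEGER PRIMARY KEY,
            user_id INTEGER,
            payment_amount FLOAT,
            created_at TIMESTAMP
        );
        INSERT INTO "user" VALUES
            (1, '2024-01-01 10:00:00', 'US'),
            (2, '2024-01-02 09:00:00', 'US'),
            (3, '2024-01-03 12:00:00', 'DE');
        INSERT INTO payment VALUES
            (1, 1, 20.0, '2024-01-02 11:00:00'),
            (2, 1, 30.0, '2024-01-20 11:00:00'),
            (3, 2, 15.0, '2024-01-10 08:00:00');
    """)
    return conn


# Query 1: registered users per country.
USERS_BY_COUNTRY = """
    SELECT country_code, COUNT(*) AS registered_users
    FROM "user"
    GROUP BY country_code
    ORDER BY country_code
"""

# Query 2: share of users whose first payment falls within 3 days of registration.
# Users without any payment have a NULL first payment and count as 0.
FIRST_PAYMENT_WITHIN_3_DAYS = """
    SELECT u.country_code,
           100.0 * SUM(CASE WHEN fp.first_payment_at <= datetime(u.date_joined, '+3 days')
                            THEN 1 ELSE 0 END) / COUNT(*) AS pct_users
    FROM "user" AS u
    LEFT JOIN (
        SELECT user_id, MIN(created_at) AS first_payment_at
        FROM payment
        GROUP BY user_id
    ) AS fp ON fp.user_id = u.id
    GROUP BY u.country_code
    ORDER BY u.country_code
"""

if __name__ == "__main__":
    conn = build_demo_db()
    print(conn.execute(USERS_BY_COUNTRY).fetchall())
    print(conn.execute(FIRST_PAYMENT_WITHIN_3_DAYS).fetchall())
```

The table name is quoted as `"user"` because `user` is a reserved word in some databases. The date comparison relies on timestamps being stored as `YYYY-MM-DD HH:MM:SS` text, so other engines would need their own interval syntax.
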
Part 4. Statistics (10-20 minutes)

4.1 What standard measures are a, b and c most likely denoting? Please explain why.

4.2 If you drew n samples from this distribution and measured their mean, then repeated that
many times, how would you expect the distribution of those sample means to differ from
the original distribution?

4.3 Would its standard deviation be bigger, smaller or the same as this distribution’s standard
deviation and why?
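
An answer to 4.2 and 4.3 can be checked empirically with a small simulation. The Python sketch below is one possible approach, not part of the test: the distribution referred to in the questions is not reproduced here, so an exponential distribution with mean 1 stands in for it as an assumption, and the function name `sample_mean_spread` is hypothetical.

```python
import random
import statistics


def sample_mean_spread(n, repeats, seed=0):
    """Draw `repeats` samples of size n and summarise the resulting sample means.

    Returns (mean of the sample means, standard deviation of the sample means).
    The stand-in population is exponential with mean 1 and standard deviation 1.
    """
    rng = random.Random(seed)
    means = []
    for _ in range(repeats):
        sample = [rng.expovariate(1.0) for _ in range(n)]
        means.append(statistics.fmean(sample))
    return statistics.fmean(means), statistics.stdev(means)


if __name__ == "__main__":
    for n in (1, 4, 25, 100):
        centre, spread = sample_mean_spread(n, repeats=5000)
        print(f"n={n:>3}  mean of means={centre:.3f}  sd of means={spread:.3f}")
```

Comparing the printed spread for different values of n against the population's standard deviation of 1 shows how the sample size affects the result. A histogram of the collected means would show how their shape compares with the skewed stand-in distribution.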

4.4 Determine whether the difference in average payment amount between groups A and B is
statistically significant (use the attached dataset: BI_test1.xlsx).

Part 5. Storytelling (10-20 minutes)

Describe an abstract technological concept of your choice (for example: internet, electricity, credit
card, email, Slack, …) to a 4-year-old child. Please make sure you don't use other complex
concepts while describing it, and use plain language to maximize the chances that the
child will be able to understand it. We are going to evaluate your capacity to simplify and explain
complex subjects.

(Minimum 100 words, maximum 400 words)
