Walmart Data Analyst Interview Experience

The document outlines interview questions and answers for a Walmart Data Analyst position, covering topics in Python, Power BI, and SQL. It includes practical coding examples for data manipulation, as well as theoretical explanations of key concepts like data structures, report types, and security measures. Additionally, it discusses the differences between various data handling techniques and tools, providing insights into best practices for data analysis.


Walmart Data Analyst Interview Experience (1-3 Years): CTC 18 LPA
Python

1. Write a Python script to identify unique values in a list and count their occurrences.

Theoretical Explanation

This question tests your understanding of Python data structures like sets and dictionaries:

• Sets: Store only unique elements, so building a set from a list eliminates duplicates.

• Dictionaries: Key-value pairs store the count of each unique element efficiently.

Using these, you can identify the unique elements and count their occurrences.

Code

# Sample list
data = [1, 2, 2, 3, 4, 4, 4, 5]

# Using a set to find unique values
unique_values = set(data)

# Using a dictionary comprehension to count occurrences
occurrences = {value: data.count(value) for value in unique_values}

print("Unique Values:", unique_values)
print("Occurrences:", occurrences)

Output:

Unique Values: {1, 2, 3, 4, 5}

Occurrences: {1: 1, 2: 2, 3: 1, 4: 3, 5: 1}
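Note that `data.count(value)` inside the comprehension re-scans the whole list for every unique value, which is O(n²) overall. A minimal single-pass alternative (not part of the original answer) uses `collections.Counter` from the standard library:

```python
from collections import Counter

data = [1, 2, 2, 3, 4, 4, 4, 5]

# Counter tallies every element in one pass over the list
occurrences = Counter(data)

print("Unique Values:", set(occurrences))
print("Occurrences:", dict(occurrences))
```

`Counter` also gives extras like `most_common()` for free, which is worth mentioning in an interview.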

2. How would you use pandas to merge two datasets and calculate total sales for products with valid promotions?

Theoretical Explanation

• Merging Datasets: Use pandas.merge() to combine two datasets on a common key (e.g., product_id).

• Filtering Promotions: Filter rows where promotions are valid.

• Grouping and Aggregation: Use groupby() to group data by product and compute total sales with an aggregation function such as sum().

Code

import pandas as pd

# Sample datasets
products = pd.DataFrame({
    'product_id': [101, 102, 103, 104],
    'product_name': ['A', 'B', 'C', 'D']
})
sales = pd.DataFrame({
    'product_id': [101, 102, 102, 103, 104],
    'sales': [200, 150, 100, 300, 50],
    'promotion_valid': [True, True, False, False, True]
})

# Merge datasets on product_id
merged_data = pd.merge(products, sales, on='product_id')

# Filter for valid promotions
valid_promotions = merged_data[merged_data['promotion_valid']]

# Group by product_name and calculate total sales
total_sales = valid_promotions.groupby('product_name')['sales'].sum()

print("Total Sales for Valid Promotions:")
print(total_sales)

Output:

Total Sales for Valid Promotions:
product_name
A    200
B    150
D    50
Name: sales, dtype: int64
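The same pipeline can also be written as a single method chain. The explicit `how='inner'` and `validate='one_to_many'` arguments are additions beyond the original snippet: they document the join assumptions and make pandas fail fast if `product_id` is unexpectedly duplicated on the products side. A sketch, using the same sample frames:

```python
import pandas as pd

products = pd.DataFrame({
    'product_id': [101, 102, 103, 104],
    'product_name': ['A', 'B', 'C', 'D']
})
sales = pd.DataFrame({
    'product_id': [101, 102, 102, 103, 104],
    'sales': [200, 150, 100, 300, 50],
    'promotion_valid': [True, True, False, False, True]
})

total_sales = (
    products
    # inner join; validate raises if products has duplicate product_ids
    .merge(sales, on='product_id', how='inner', validate='one_to_many')
    # keep only rows with a valid promotion
    .loc[lambda df: df['promotion_valid']]
    .groupby('product_name')['sales'].sum()
)
print(total_sales)
```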

3. Differences Between Lists, Tuples, Sets, and Dictionaries

Theoretical Explanation
• Lists: Ordered, mutable, allow duplicates. Suitable for sequential data and iteration.

• Tuples: Ordered, immutable, allow duplicates. Used for fixed collections of items.

• Sets: Unordered, mutable, no duplicates. Ideal for membership tests and unique element extraction.

• Dictionaries: Insertion-ordered (Python 3.7+), mutable, key-value pairs. Excellent for fast lookups and associating related data.

Feature            List       Tuple       Set          Dictionary
Ordered            Yes        Yes         No           Yes (3.7+)
Mutable            Yes        No          Yes          Yes
Allows Duplicates  Yes        Yes         No           Keys: No, Values: Yes
Use Case           Iteration  Fixed Data  Unique Data  Key-Value Mapping

Code

# List
my_list = [1, 2, 3, 3]
print("List:", my_list)

# Tuple
my_tuple = (1, 2, 3, 3)
print("Tuple:", my_tuple)

# Set
my_set = {1, 2, 3, 3}
print("Set (No Duplicates):", my_set)

# Dictionary
my_dict = {'a': 1, 'b': 2, 'c': 3}
print("Dictionary:", my_dict)

Output:


List: [1, 2, 3, 3]

Tuple: (1, 2, 3, 3)

Set (No Duplicates): {1, 2, 3}

Dictionary: {'a': 1, 'b': 2, 'c': 3}
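The behavioural differences claimed above (mutability, immutability, duplicate handling, key lookup) can be verified directly. A short sketch:

```python
my_list = [1, 2, 3, 3]
my_tuple = (1, 2, 3, 3)
my_set = {1, 2, 3, 3}
my_dict = {'a': 1, 'b': 2, 'c': 3}

my_list.append(4)               # lists are mutable
assert my_list == [1, 2, 3, 3, 4]

try:
    my_tuple[0] = 9             # tuples are immutable
except TypeError:
    print("tuples cannot be modified")

assert my_set == {1, 2, 3}      # duplicates were dropped on creation
assert my_dict['b'] == 2        # average O(1) lookup by key
```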

POWER BI

1. Difference Between Import and DirectQuery Modes

Theoretical Explanation

• Import Mode:

o Data is imported into Power BI's in-memory model, offering faster performance.

o The report becomes static and doesn’t reflect real-time changes in the
source unless refreshed.

o Suitable for small to medium datasets.

• DirectQuery Mode:

o Data stays in the source system, and queries are sent to fetch data as needed.

o Enables real-time data visualization but may be slower due to dependency on the source system's performance.

o Suitable for large datasets or when real-time updates are critical.


When to Choose:
For large datasets, use DirectQuery to avoid importing and storing massive amounts of data in Power BI. However, it may affect report performance, so ensure the data source is optimized for query execution.

2. Slicers vs Visual-Level Filters

Theoretical Explanation

• Slicers:

o Interactive visuals that allow users to filter data directly on the dashboard.

o They are visible to users and improve interactivity.

o Example: A slicer for "Year" allows selecting specific years to filter all linked
visuals.

• Visual-Level Filters:

o Filters applied to specific visuals rather than the entire page or report.

o Not interactive for end-users but provide control over what data is displayed
in a specific visual.

o Example: A filter applied to a bar chart to display only sales > $10,000.

Impact:
Slicers enhance user interactivity, allowing dynamic filtering, while visual-level filters
provide static control for specific visuals.

3. Row-Level Security (RLS)

Theoretical Explanation

• RLS restricts data access based on roles, ensuring that users or groups see only the
data they are authorized to view.

• Implementation Steps:
1. Define roles in Power BI Desktop: Use DAX expressions to filter data based on
user criteria (e.g., Region = "North").

2. Assign roles in Power BI Service: Map users/groups to the defined roles.

3. Validate: Test roles in Power BI Desktop by simulating different users.

Example:
To restrict regional managers to their own region's data, create a role whose DAX filter matches the signed-in user against a user-to-region mapping table (the column name here is illustrative):

[ManagerEmail] = USERPRINCIPALNAME()

Then assign the regional managers to this role in the Power BI Service.

4. What is a Paginated Report and When to Use It?

Theoretical Explanation

• Paginated Reports:

o Pixel-perfect reports designed for printing or exporting.

o Data is displayed across multiple pages, with precise control over layout.

o Suitable for reports like invoices, billing statements, or regulatory reports where exact formatting is crucial.

• When to Use:

o When you need formatted, printable outputs that may span multiple pages.

o When exporting reports to formats like PDF or Word is essential.

o For operational reports with detailed rows of data.

Example: A paginated report would be ideal for generating monthly sales invoices for a
large number of customers.

SQL

1. Find the Second-Highest Salary in a Department

Theoretical Explanation
• ROW_NUMBER(): Assigns a unique sequential number to each row within a
partition of data.

• DENSE_RANK(): Assigns ranks to rows in a partition, but ties receive the same rank.
There are no gaps in ranks.

To find the second-highest salary in each department, partition data by department_id and
order salaries in descending order, then filter for rank = 2.

Query Using DENSE_RANK()

WITH RankedSalaries AS (
    SELECT
        department_id,
        employee_id,
        salary,
        DENSE_RANK() OVER (PARTITION BY department_id ORDER BY salary DESC) AS salary_rank
    FROM employees
)
SELECT department_id, employee_id, salary
FROM RankedSalaries
WHERE salary_rank = 2;
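A DENSE_RANK query of this shape can be checked end-to-end with Python's built-in sqlite3 module (window functions need SQLite 3.25+, bundled with modern Python). The employee rows below are made-up sample data:

```python
import sqlite3

# In-memory database with illustrative sample rows
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE employees (employee_id INTEGER, department_id INTEGER, salary INTEGER);
    INSERT INTO employees VALUES
        (1, 10, 90000), (2, 10, 80000), (3, 10, 80000),
        (4, 20, 70000), (5, 20, 60000);
""")

rows = conn.execute("""
    WITH RankedSalaries AS (
        SELECT department_id, employee_id, salary,
               DENSE_RANK() OVER (PARTITION BY department_id
                                  ORDER BY salary DESC) AS salary_rank
        FROM employees
    )
    SELECT department_id, employee_id, salary
    FROM RankedSalaries
    WHERE salary_rank = 2
""").fetchall()

# Ties at the second-highest salary are all returned by DENSE_RANK
print(rows)
```

Running this shows why DENSE_RANK is usually preferred over ROW_NUMBER here: both employees tied at 80000 in department 10 are reported.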

2. Calculate Total Transactions Per User for Each Day


Theoretical Explanation

To calculate daily transaction counts for each user:

• Use GROUP BY to group data by user_id and transaction_date.

• Use COUNT() to count the transactions for each group.

Query

SELECT
    user_id,
    transaction_date,
    COUNT(*) AS total_transactions
FROM transactions
GROUP BY user_id, transaction_date
ORDER BY user_id, transaction_date;
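The grouping behaviour can be exercised the same way with sqlite3; the transactions below are made-up sample data:

```python
import sqlite3

# In-memory database with illustrative sample transactions
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE transactions (user_id INTEGER, transaction_date TEXT);
    INSERT INTO transactions VALUES
        (1, '2024-01-01'), (1, '2024-01-01'), (1, '2024-01-02'),
        (2, '2024-01-01');
""")

rows = conn.execute("""
    SELECT user_id, transaction_date, COUNT(*) AS total_transactions
    FROM transactions
    GROUP BY user_id, transaction_date
    ORDER BY user_id, transaction_date
""").fetchall()

# One output row per (user, day) pair, with its transaction count
print(rows)
```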

3. Select Projects with the Highest Budget-Per-Employee Ratio


Theoretical Explanation

This involves:

1. Joining the projects table with the employees table to calculate the number of
employees per project.

2. Calculating the budget-per-employee ratio for each project.

3. Finding the project(s) with the highest ratio.

Assume Tables

• projects(project_id, budget)

• employees(employee_id, project_id)

Query

WITH ProjectEmployeeCount AS (
    SELECT
        p.project_id,
        p.budget,
        COUNT(e.employee_id) AS total_employees
    FROM projects p
    LEFT JOIN employees e ON p.project_id = e.project_id
    GROUP BY p.project_id, p.budget
),
BudgetRatio AS (
    SELECT
        project_id,
        budget,
        total_employees,
        CASE
            WHEN total_employees > 0 THEN budget * 1.0 / total_employees
            ELSE 0
        END AS budget_per_employee
    FROM ProjectEmployeeCount
)
SELECT project_id, budget, total_employees, budget_per_employee
FROM BudgetRatio
WHERE budget_per_employee = (SELECT MAX(budget_per_employee) FROM BudgetRatio);
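This two-CTE pattern can also be verified with sqlite3. The projects and assignments below are made-up sample data; note the `* 1.0` multiplier, which forces floating-point division so an integer budget is not truncated:

```python
import sqlite3

# In-memory database with illustrative sample projects and assignments
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE projects (project_id INTEGER, budget REAL);
    CREATE TABLE employees (employee_id INTEGER, project_id INTEGER);
    INSERT INTO projects VALUES (1, 9000), (2, 8000), (3, 5000);
    INSERT INTO employees VALUES (10, 1), (11, 1), (12, 1), (13, 2);
""")

rows = conn.execute("""
    WITH ProjectEmployeeCount AS (
        SELECT p.project_id, p.budget, COUNT(e.employee_id) AS total_employees
        FROM projects p
        LEFT JOIN employees e ON p.project_id = e.project_id
        GROUP BY p.project_id, p.budget
    ),
    BudgetRatio AS (
        SELECT project_id, budget, total_employees,
               CASE WHEN total_employees > 0
                    THEN budget * 1.0 / total_employees
                    ELSE 0 END AS budget_per_employee
        FROM ProjectEmployeeCount
    )
    SELECT project_id, budget, total_employees, budget_per_employee
    FROM BudgetRatio
    WHERE budget_per_employee = (SELECT MAX(budget_per_employee) FROM BudgetRatio)
""").fetchall()

# Project 2: 8000 budget / 1 employee = 8000 per employee, the highest ratio
print(rows)
```

The LEFT JOIN matters: project 3 has no employees, and an INNER JOIN would silently drop it instead of giving it a ratio of 0.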
