Deloitte Data Engineer

The document outlines various data engineering scenarios involving real-time processing of IoT sensor data, log data, user transactions, and nested JSON datasets. It emphasizes the importance of correctly handling data streams, addressing data skew, and using appropriate aggregation techniques. Additionally, it promotes Prominent Academy's services for preparing candidates for data engineering interviews through mock interviews and personalized coaching.

Uploaded by

ronit.kumar2802


Data engineer

www.prominentacademy.in
Question:

You are given a stream of IoT sensor data with columns sensor_id, timestamp, and value. Detect sensors with values exceeding a threshold (e.g., 100) in real time.

Explanation:
Read streaming data from Kafka.
Parse the data and filter sensors with values exceeding the threshold.
Write the output to the console.
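
The parse-and-filter step above can be sketched in plain Python. This is only a stand-in for the streaming job, not Spark or Kafka API: the message format (one JSON object per message), the function names, and the sample data are all assumptions made for illustration, and the threshold of 100 comes from the question's example.

```python
import json
from typing import Iterable, Iterator, Optional

THRESHOLD = 100.0  # assumed threshold, taken from the question's example


def parse_reading(raw: str) -> Optional[dict]:
    """Parse one JSON message; return None if it is malformed."""
    try:
        record = json.loads(raw)
        return {
            "sensor_id": str(record["sensor_id"]),
            "timestamp": str(record["timestamp"]),
            "value": float(record["value"]),
        }
    except (ValueError, KeyError, TypeError):
        return None


def detect_exceeding(messages: Iterable[str],
                     threshold: float = THRESHOLD) -> Iterator[dict]:
    """Yield readings strictly above the threshold, one message at a time."""
    for raw in messages:
        reading = parse_reading(raw)
        if reading is not None and reading["value"] > threshold:
            yield reading


# Hypothetical sample messages standing in for the Kafka topic.
sample_messages = [
    '{"sensor_id": "s1", "timestamp": "2024-01-01T00:00:00", "value": 98.5}',
    '{"sensor_id": "s2", "timestamp": "2024-01-01T00:00:01", "value": 120.0}',
    'not json',
    '{"sensor_id": "s3", "timestamp": "2024-01-01T00:00:02", "value": 100.0}',
]

alerts = list(detect_exceeding(sample_messages))
for alert in alerts:
    print(alert)  # plays the role of the console sink
```

Malformed messages are dropped rather than raising, since one bad record should not stop a long-running stream. Whether a value equal to the threshold counts as "exceeding" is a design choice; this sketch treats it as not exceeding.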

📞 Don’t wait—call us at +91 98604 38743 today

Your next opportunity is closer than you think. Let’s get you there!
Question:

You are given a stream of log data with columns timestamp, user_id, and action. Calculate the count of actions per user in real time.

Explanation:
Read streaming data from Kafka.
Parse the data and group by user_id and a 1-minute window.
Write the output to the console.

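Common Mistakes:
Not defining the window correctly, for example grouping by the raw timestamp instead of a 1-minute window on it.
Forgetting to start the streaming query with start(), or to keep it running with awaitTermination().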
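
To make the windowing step concrete, here is a minimal plain-Python sketch of counting actions per user in 1-minute tumbling windows. It is not Spark code (in Spark the grouping would typically use the window() function on the timestamp column); the ISO timestamp format, the function names, and the sample events are assumptions for illustration.

```python
from collections import Counter
from datetime import datetime
from typing import Iterable


def window_start(ts: datetime) -> datetime:
    """Truncate a timestamp to the start of its 1-minute tumbling window."""
    return ts.replace(second=0, microsecond=0)


def count_actions(events: Iterable[dict]) -> Counter:
    """Count events per (window start, user_id) pair."""
    counts: Counter = Counter()
    for event in events:
        ts = datetime.fromisoformat(event["timestamp"])
        counts[(window_start(ts), event["user_id"])] += 1
    return counts


# Hypothetical sample events standing in for the parsed Kafka stream.
sample_events = [
    {"timestamp": "2024-01-01T10:00:05", "user_id": "u1", "action": "click"},
    {"timestamp": "2024-01-01T10:00:40", "user_id": "u1", "action": "view"},
    {"timestamp": "2024-01-01T10:00:59", "user_id": "u2", "action": "click"},
    {"timestamp": "2024-01-01T10:01:02", "user_id": "u1", "action": "click"},
]

counts = count_actions(sample_events)
for (start, user_id), n in sorted(counts.items()):
    print(start.isoformat(), user_id, n)  # console sink
```

Grouping by the truncated timestamp rather than the raw one is what makes events from the same minute land in the same bucket; grouping by the raw timestamp would give nearly every event its own group.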

Question:

You are given a dataset of user transactions with columns user_id, transaction_id, and amount. The dataset is heavily skewed on the user_id column. Calculate the total transaction amount per user while handling the skew.

Explanation:
Add a random salt to the skewed key (user_id) to distribute the data evenly.
Perform the first aggregation on the salted key.
Remove the salt and perform a second aggregation to get the final result.

Common Mistakes:
Not addressing data skew, leading to slow performance.
Forgetting to remove the salt in the final aggregation.
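
The two-stage salted aggregation described above can be illustrated in plain Python. This sketch shows only the logic, not Spark API: in a real job each (user_id, salt) group would be processed by a different task, which is where the skew relief comes from. The number of salts, the function name, and the sample data are assumptions.

```python
import random
from collections import defaultdict

NUM_SALTS = 4  # assumed number of salt buckets; tune to the degree of skew


def salted_totals(transactions, num_salts=NUM_SALTS, seed=0):
    """Return (partial sums per salted key, final totals per user)."""
    rng = random.Random(seed)  # seeded only to make the sketch reproducible

    # Stage 1: aggregate on the salted key (user_id, salt).
    partial = defaultdict(float)
    for tx in transactions:
        salt = rng.randrange(num_salts)
        partial[(tx["user_id"], salt)] += tx["amount"]

    # Stage 2: drop the salt and aggregate the partial sums per user.
    totals = defaultdict(float)
    for (user_id, _salt), subtotal in partial.items():
        totals[user_id] += subtotal

    return dict(partial), dict(totals)


# Hypothetical skewed data: user u1 dominates the dataset.
transactions = [
    {"user_id": "u1", "transaction_id": i, "amount": 10.0} for i in range(1000)
] + [
    {"user_id": "u2", "transaction_id": 1000, "amount": 20.0},
    {"user_id": "u2", "transaction_id": 1001, "amount": 5.0},
]

partial, totals = salted_totals(transactions)
print(totals)
```

The final totals do not depend on which salt each row received, because summation is associative; the salt only changes how the work is split in the first stage. Skipping stage 2 would leave several rows per user, which is the "forgetting to remove the salt" mistake listed above.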

Question:

You are given a nested JSON dataset with the following structure:

Write a Spark job to extract the following:

Total revenue per order.
Payment method for each order.

Explanation:
Use explode to flatten the nested items array.
Calculate revenue for each item by multiplying quantity and price.
Group by order_id and payment.method to aggregate the total revenue.

Common Mistakes:
Not using explode to handle nested arrays.
Forgetting to include payment.method in the groupBy.
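
The flatten-then-aggregate logic can be sketched in plain Python. The dataset's actual structure is not reproduced in this document, so the shape used here (order_id, an items array with quantity and price, and a payment object with a method field) is an assumption inferred from the explanation, and the sample orders are hypothetical. The generator plays the role of Spark's explode.

```python
import json
from collections import defaultdict

# Hypothetical input; the field names are inferred from the explanation.
orders_json = """
[
  {"order_id": "o1",
   "items": [{"quantity": 2, "price": 10.0}, {"quantity": 1, "price": 5.5}],
   "payment": {"method": "card"}},
  {"order_id": "o2",
   "items": [{"quantity": 3, "price": 4.0}],
   "payment": {"method": "cash"}}
]
"""


def explode_items(orders):
    """Yield one flat row per item, like exploding the nested items array."""
    for order in orders:
        for item in order["items"]:
            yield {
                "order_id": order["order_id"],
                "payment_method": order["payment"]["method"],
                "revenue": item["quantity"] * item["price"],
            }


def revenue_per_order(orders):
    """Sum item revenue grouped by (order_id, payment method)."""
    totals = defaultdict(float)
    for row in explode_items(orders):
        totals[(row["order_id"], row["payment_method"])] += row["revenue"]
    return dict(totals)


orders = json.loads(orders_json)
result = revenue_per_order(orders)
for (order_id, method), revenue in sorted(result.items()):
    print(order_id, method, revenue)
```

Keeping the payment method in the grouping key is what carries it through to the output; grouping by order_id alone would return the revenue but lose the payment method.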

#AzureSynapse #DataEngineering #InterviewPreparation
#JobReady #MockInterviews #Deloitte #CareerSuccess
#ProminentAcademy

❌ Think your skills are enough?

Think again: these scenario-based data engineering questions could cost you your data engineering job.
In recent interviews at several large MNCs, one of our students faced scenario-based questions related to data engineering, and many candidates struggled to answer them correctly. These questions are designed to test your real-world knowledge and your ability to solve complex data engineering problems.

Unfortunately, many students failed to answer these questions confidently. The truth is, preparation is key, and that’s where Prominent Academy comes in!
We specialize in preparing you for Spark and data engineering interviews by:

✅ Offering scenario-based mock interviews
✅ Providing hands-on training with data engineering features
✅ Optimizing your resume & LinkedIn profile
✅ Giving personalized interview coaching to ensure you’re job-ready

Don’t leave your future to chance!

📞 Call us at +91 98604 38743 and get the interview prep you need to succeed
