EY Mock

The document outlines a set of interview questions for data engineers with 3-4 years of experience, covering topics such as SQL, Python, PySpark, Azure Data Engineering, data modeling, and behavioral scenarios. It includes specific technical questions about SQL functions, data manipulation, and big data concepts, as well as practical scenarios related to data pipeline management and optimization. The questions aim to assess both technical skills and problem-solving abilities in real-world situations.

Uploaded by gupta.ayushi2425

𝗘𝗬 𝗗𝗮𝘁𝗮 𝗘𝗻𝗴𝗶𝗻𝗲𝗲𝗿 𝗜𝗻𝘁𝗲𝗿𝘃𝗶𝗲𝘄 𝗤𝘂𝗲𝘀𝘁𝗶𝗼𝗻𝘀 (𝟯–𝟰 𝗬𝗲𝗮𝗿𝘀 𝗘𝘅𝗽𝗲𝗿𝗶𝗲𝗻𝗰𝗲)

🔹 𝗦𝗤𝗟 & 𝗗𝗮𝘁𝗮 𝗠𝗮𝗻𝗶𝗽𝘂𝗹𝗮𝘁𝗶𝗼𝗻

What is the difference between RANK(), DENSE_RANK(), and ROW_NUMBER()?

How would you find the second highest salary from an employee table?
Write a SQL query to find duplicate records in a table.
What is a CTE? When would you use it over subqueries?
Explain the different types of SQL joins with real-world examples.
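
As a hedged illustration of the ranking and second-highest-salary questions above, the sketch below runs the queries against an in-memory SQLite database from Python. The employee table, its columns, and the sample rows are assumptions made up for this example, and window functions require SQLite 3.25 or later.

```python
import sqlite3


def build_employee_table() -> sqlite3.Connection:
    """Create an in-memory table with made-up rows (illustrative data only)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE employee (name TEXT, salary INTEGER)")
    conn.executemany(
        "INSERT INTO employee (name, salary) VALUES (?, ?)",
        [("Asha", 900), ("Bilal", 900), ("Chen", 800), ("Dev", 700)],
    )
    return conn


def ranking_rows(conn: sqlite3.Connection) -> list:
    """Return (name, salary, RANK, DENSE_RANK, ROW_NUMBER) per employee."""
    query = """
        SELECT name,
               salary,
               RANK()       OVER (ORDER BY salary DESC)       AS rnk,
               DENSE_RANK() OVER (ORDER BY salary DESC)       AS dense_rnk,
               ROW_NUMBER() OVER (ORDER BY salary DESC, name) AS row_num
        FROM employee
        ORDER BY salary DESC, name
    """
    return conn.execute(query).fetchall()


def second_highest_salary(conn: sqlite3.Connection):
    """Highest salary that is strictly below the maximum (None if absent)."""
    query = """
        SELECT MAX(salary)
        FROM employee
        WHERE salary < (SELECT MAX(salary) FROM employee)
    """
    return conn.execute(query).fetchone()[0]


if __name__ == "__main__":
    conn = build_employee_table()
    for row in ranking_rows(conn):
        print(row)
    print("second highest:", second_highest_salary(conn))
```

With the tie at 900, RANK() skips to 3 for the next salary, DENSE_RANK() continues with 2, and ROW_NUMBER() numbers every row uniquely.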

𝗣𝘆𝘁𝗵𝗼𝗻 𝗳𝗼𝗿 𝗗𝗮𝘁𝗮 𝗘𝗻𝗴𝗶𝗻𝗲𝗲𝗿𝗶𝗻𝗴

How would you handle missing values in a large dataset using Python?
Explain the difference between list, tuple, set, and dictionary.
What are Python generators? How are they useful in data pipelines?
How would you optimize a large data transformation using Pandas?
Explain multithreading vs multiprocessing in Python.
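
For the generator and missing-value questions above, a minimal sketch in plain Python may help. It assumes a hypothetical two-column "name,amount" text input and a default fill value of 0.0; the function names are invented for illustration and are not part of any library.

```python
from typing import Iterable, Iterator, List, Optional


def read_records(lines: Iterable[str]) -> Iterator[dict]:
    """Lazily parse 'name,amount' lines; an empty amount becomes None."""
    for line in lines:
        name, amount = line.strip().split(",")
        parsed: Optional[float] = float(amount) if amount else None
        yield {"name": name, "amount": parsed}


def fill_missing(records: Iterable[dict], default: float = 0.0) -> Iterator[dict]:
    """Replace missing amounts with a default value, one record at a time."""
    for record in records:
        if record["amount"] is None:
            record = {**record, "amount": default}
        yield record


def batched(records: Iterable[dict], size: int) -> Iterator[List[dict]]:
    """Group records into lists of at most `size` items."""
    batch: List[dict] = []
    for record in records:
        batch.append(record)
        if len(batch) == size:
            yield batch
            batch = []
    if batch:  # emit the final partial batch
        yield batch


if __name__ == "__main__":
    raw_lines = ["a,1.5", "b,", "c,3"]
    for batch in batched(fill_missing(read_records(raw_lines)), size=2):
        print(batch)
```

Because each stage is a generator, only one batch is held in memory at a time, which is the property that makes generators useful in data pipelines.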

𝗣𝘆𝗦𝗽𝗮𝗿𝗸 & 𝗕𝗶𝗴 𝗗𝗮𝘁𝗮

What is the difference between RDD, DataFrame, and Dataset in PySpark?

How do you handle skewed data in Spark?
What are broadcast variables and accumulators?
Explain the Spark execution flow (Job → Stage → Task).
How do you optimize a PySpark job?
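
One common answer to the skew question above is key salting. The sketch below imitates the idea in plain Python rather than calling the PySpark API: the key names, the salt count of 8, and the round-robin salt (used here instead of a random one so the result is reproducible) are all assumptions for illustration.

```python
from collections import Counter
from typing import List, Tuple


def salt_keys(keys: List[str], num_salts: int) -> List[str]:
    """Append a salt suffix so one hot key is spread over several sub-keys."""
    return [f"{key}_{i % num_salts}" for i, key in enumerate(keys)]


def two_stage_count(keys: List[str], num_salts: int) -> Tuple[Counter, Counter]:
    """Count per salted key first, then merge the partial counts per original key."""
    partial = Counter(salt_keys(keys, num_salts))  # stage 1: salted groups
    final: Counter = Counter()
    for salted_key, count in partial.items():  # stage 2: strip salt and merge
        original_key = salted_key.rsplit("_", 1)[0]
        final[original_key] += count
    return partial, final


if __name__ == "__main__":
    keys = ["hot"] * 1000 + ["cold"] * 10
    partial, final = two_stage_count(keys, num_salts=8)
    print("largest group before salting:", max(Counter(keys).values()))
    print("largest group after salting:", max(partial.values()))
    print("final counts:", dict(final))
```

The largest group a single worker would have to process shrinks because the hot key is split into several salted groups, while the second stage still recovers the correct totals per original key.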

𝗔𝘇𝘂𝗿𝗲 𝗗𝗮𝘁𝗮 𝗘𝗻𝗴𝗶𝗻𝗲𝗲𝗿𝗶𝗻𝗴

What are the key components of Azure Data Factory?

What is the difference between Azure Blob Storage and Azure Data Lake?
How does Azure Databricks integrate with ADF?
What are triggers and pipelines in Azure Data Factory?
Explain Delta Lake. Why is it used?

𝗗𝗮𝘁𝗮 𝗠𝗼𝗱𝗲𝗹𝗶𝗻𝗴 & 𝗪𝗮𝗿𝗲𝗵𝗼𝘂𝘀𝗶𝗻𝗴

What is the difference between OLTP and OLAP?

Explain Star Schema vs Snowflake Schema.
What is data partitioning and bucketing?
What is a Slowly Changing Dimension (SCD) Type 2?
How would you design a data warehouse for a retail chain?
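
To make the SCD Type 2 question above concrete, here is a minimal sketch in plain Python. The CustomerRow fields (valid_from, valid_to, is_current) follow a common convention but are assumptions for this example, as is the apply_scd2 helper; a real warehouse would typically do this with a merge statement.

```python
from dataclasses import dataclass, replace
from datetime import date
from typing import List, Optional


@dataclass(frozen=True)
class CustomerRow:
    """One version of a customer in a hypothetical dimension table."""
    customer_id: int
    city: str
    valid_from: date
    valid_to: Optional[date]
    is_current: bool


def apply_scd2(history: List[CustomerRow], customer_id: int,
               new_city: str, change_date: date) -> List[CustomerRow]:
    """Close the current row and append a new version if the city changed."""
    current = next(
        (r for r in history if r.customer_id == customer_id and r.is_current),
        None,
    )
    if current is not None and current.city == new_city:
        return list(history)  # no change, so history stays as it is

    result = []
    for row in history:
        if row is current:
            # Expire the old version instead of overwriting it.
            result.append(replace(row, valid_to=change_date, is_current=False))
        else:
            result.append(row)
    result.append(CustomerRow(customer_id, new_city, change_date, None, True))
    return result


if __name__ == "__main__":
    history = [CustomerRow(1, "Pune", date(2023, 1, 1), None, True)]
    for row in apply_scd2(history, 1, "Delhi", date(2024, 6, 1)):
        print(row)
```

The old row is kept with an end date, so queries can still see which city applied at any point in time.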

𝗦𝗰𝗲𝗻𝗮𝗿𝗶𝗼-𝗕𝗮𝘀𝗲𝗱 / 𝗕𝗲𝗵𝗮𝘃𝗶𝗼𝗿𝗮𝗹

How do you handle a failed pipeline in production?

Describe a time you optimized a data job and improved performance.
Have you ever dealt with data duplication issues? How did you fix it?
How do you ensure data quality in your ETL processes?
What is your approach to version control and deployment of data pipelines?
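
One possible ingredient of an answer to the failed-pipeline question above is retrying transient failures with exponential backoff before alerting. The sketch below is an assumption-laden illustration: run_with_retries, its parameters, and the flaky step are hypothetical names, not part of Azure Data Factory or any other tool.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def run_with_retries(step: Callable[[], T],
                     max_attempts: int = 3,
                     base_delay: float = 1.0,
                     sleep: Callable[[float], None] = time.sleep) -> T:
    """Run a pipeline step, retrying with exponential backoff on any exception."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(1, max_attempts + 1):
        try:
            return step()
        except Exception:
            if attempt == max_attempts:
                raise  # out of retries: surface the error to the caller
            # Wait base_delay, then 2x, 4x, ... before the next attempt.
            sleep(base_delay * 2 ** (attempt - 1))
    raise RuntimeError("unreachable")


if __name__ == "__main__":
    calls = []

    def flaky_step() -> str:
        """Hypothetical step that fails twice, then succeeds."""
        calls.append(1)
        if len(calls) < 3:
            raise RuntimeError("transient failure")
        return "ok"

    print(run_with_retries(flaky_step, base_delay=0.01))
```

The sleep function is injectable so the backoff schedule can be checked in tests without actually waiting.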
