Spark Questions Asked in Mock Interview

The document lists a series of Spark-related questions commonly asked in mock interviews, covering topics such as executors, slowly changing dimensions (SCD), data handling, and optimization techniques. It includes inquiries about file reading/writing modes, handling NULL values, data frame operations, and various Spark concepts like Medallion Architecture and partitioning. The questions are designed to assess knowledge and practical skills in Spark and data processing.

Uploaded by

Satyajit Ligade


1. There are 10 nodes and 15 cores. How many executors will there be?
2. How will you implement SCD in your project, and which type will you use and why?
3. What are the modes for reading a file?
4. What are the modes for writing in different file formats?
5. What is the query to implement SCD1 and SCD2 in a Delta table?
6. How will you handle NULLs in a DataFrame?
7. How will you read only 4 files from a folder that contains 10 files?
8. How will you replace NULL with NA or with any other value?
9. What is the difference between a left anti join and a left semi join?
10. What is Medallion Architecture?
11. How will you handle or remove duplicates in a DataFrame?
12. What is serialization and what is deserialization?
13. How do you create a Delta table?
14. Memory management in Spark?
15. What is sorting and shuffling?
16. What is salting?
17. How will we handle skewness?
18. Optimization techniques in Spark?
19. Advanced joins in Spark?
20. What is partitionBy and bucketBy (bucketing)?
21. Errors faced in our Airline Project?
22. What is partition pruning and dynamic partition pruning?
23. How will you check for skewness in Spark?
24. How will you check which partition has larger data in it without using the UI?
25. How to remove data from disk and from memory?
26. What is lineage and how is it different from a DAG?
27. Steps to handle an extra comma in a CSV file?
28. Difference between the JSON and Parquet file formats?
29. Why do we use coalesce(1) when writing a Parquet file?
30. Speculative execution?
31. Difference between Spark's union and unionAll and SQL's UNION and UNION ALL?
32. What is a broadcast variable?
33. How will you write exact SQL queries in Spark?
34. What is the spark-submit command?
35. If there is no Python worker, will our PySpark code work?
36. Why can't we use coalesce to increase the number of partitions?
37. What is hash and heap?
38. How does our execution plan switch to AQE?
39. Calculation of the number of executors and cores.
40. Given a 10 GB file and a cluster of 5 executors, how many tasks will be formed?
41. Pivot and unpivot in Spark?
42. How will you flatten the data?
43. How will you extract columns from a JSON file?
44. How will you take a column out of the DataFrame and save it?
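
Note on question 40: the task count for the read stage can be estimated with simple arithmetic. The sketch below is only a rough illustration. It assumes the file is splittable, is read through the DataFrame reader, and that spark.sql.files.maxPartitionBytes is left at its default of 128 MB. It ignores the file open cost and the fact that Spark may pick a smaller split size on clusters with many cores. The function names and the 4 cores per executor are hypothetical, chosen only for this example. The number of executors does not change the number of tasks, only how many run at the same time.

```python
import math

MB = 1024 * 1024
GB = 1024 * MB


def estimate_read_tasks(file_size_bytes, max_partition_bytes=128 * MB):
    """Rough number of input partitions, which equals the tasks in the read stage.

    Assumption: one task per split of at most max_partition_bytes.
    """
    return math.ceil(file_size_bytes / max_partition_bytes)


def estimate_waves(num_tasks, num_executors, cores_per_executor):
    """Rough number of scheduling waves, assuming one task per core at a time."""
    slots = num_executors * cores_per_executor
    return math.ceil(num_tasks / slots)


tasks = estimate_read_tasks(10 * GB)
# cores_per_executor=4 is an assumed value; the question does not state it.
waves = estimate_waves(tasks, num_executors=5, cores_per_executor=4)
print(tasks, waves)  # prints "80 4"
```

Under these assumptions the 10 GB file yields about 80 read tasks, which 5 executors with 4 cores each would process in roughly 4 waves.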
