0% found this document useful (0 votes)
168 views5 pages

Persistent Systems Senior Data Engineer

The document provides a list of common interview questions and answers for the Senior Data Engineer position at Persistent Systems, updated for 2025. Key topics include data frame checks, partitioning methods, SQL operations, and data handling techniques. It serves as a resource for candidates preparing for interviews in data engineering roles.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
168 views5 pages

Persistent Systems Senior Data Engineer

The document provides a list of common interview questions and answers for the Senior Data Engineer position at Persistent Systems, updated for 2025. Key topics include data frame checks, partitioning methods, SQL operations, and data handling techniques. It serves as a resource for candidates preparing for interviews in data engineering roles.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 5

3/7/25, 4:39 PM Top 14 Persistent Systems Senior Data Engineer Interview Questions and Answers 2025 | AmbitionBox

Premium Employer

Persistent Systems Share an Interview Follow (7.2k) Compare

3.5 based on 3.9k Reviews

3.9k 37.7k 598 166 435 22 3


Why join us Reviews Salaries Interviews Jobs Benefits Photos Posts

Filter interviews by Senior Data Engineer Clear (1)

10+ Persistent Systems Senior Data Engineer Interview Questions and Answers
Updated 27 Sep 2024

Search by designation

Asked in Senior Data Engineer Interview

Q1. What is the best approach to finding whether the data frame is empty or not?
Ans. Use the len() function to check the length of the data frame.
Use len() function to get the number of rows in the data frame.
If the length is 0, then the data frame is empty.

Example: if len(df) == 0: print('Data frame is empty')


Add your answer

Asked in Senior Data Engineer Interview

Q2. What is the difference between repartition and Coelsce?


Ans. Repartition increases or decreases the number of partitions in a DataFrame, while Coalesce only decreases the number of partitions.

Repartition can increase or decrease the number of partitions in a DataFrame, leading to a shuffle of data across the cluster.
Coalesce only decreases the number of partitions in a DataFrame without performing a full shuffle, making it more efficient than repartition.

Repartition is typically used when there is a need to increase the number of parti...read more
Add your answer

Spark

Asked in Senior Data Engineer Interview

Q3. Two SQL Codes and Two Python codes like reverse a string ?
Ans. Reverse a string using SQL and Python codes.
In SQL, use the REVERSE function to reverse a string.

In Python, use slicing with a step of -1 to reverse a string.


Add your answer

Algorithms Python SQL

Asked in Senior Data Engineer Interview

Q4. How do you decide on cores and worker nodes?


Ans. Cores and worker nodes are decided based on the workload requirements and scalability needs of the data processing system.
Consider the size and complexity of the data being processed

Evaluate the processing speed and memory requirements of the tasks


Take into account the parallelism and concurrency needed for efficient data processing

Monitor the system performance and adjust cores and worker nodes as needed

Add your answer

Cloud Computing

https://www.ambitionbox.com/interviews/persistent-systems-interview-questions/senior-data-engineer/top-questions?campaign=history_cards 1/5
3/7/25, 4:39 PM Top 14 Persistent Systems Senior Data Engineer Interview Questions and Answers 2025 | AmbitionBox

Discover Persistent Systems


interview dos and don'ts from real …

View 598 interview experiences

Asked in Senior Data Engineer Interview

Q5. Find top 5 countries with highest population in Spark and SQL
Ans. Use Spark and SQL to find the top 5 countries with the highest population.

Use Spark to load the data and perform data processing.


Use SQL queries to group by country and sum the population.

Order the results in descending order and limit to top 5.


Example: SELECT country, SUM(population) AS total_population FROM table_name GROUP BY country ORDER BY total_population DESC LIMIT 5

Add your answer

Spark SQL

Asked in Senior Data Engineer Interview

Q6. What is shuffling? How to Handle Shuffling?


Ans. Shuffling is the process of redistributing data across partitions in a distributed computing environment.
Shuffling is necessary when data needs to be grouped or aggregated across different partitions.

It can be handled efficiently by minimizing the amount of data being shuffled and optimizing the partitioning strategy.

Techniques like partitioning, combiners, and reducers can help reduce the amount of shuffling in MapReduce jobs.

Add your answer

Algorithms

Are these interview questions helpful? Yes No

Asked in Senior Data Engineer Interview

Q7. What is a catalyst optimiser? How it works?


Ans. A catalyst optimizer is a query optimization tool used in Apache Spark to improve performance by generating an optimal query plan.

Catalyst optimizer is a rule-based query optimization framework in Apache Spark.

It leverages rules to transform the logical query plan into a more optimized physical plan.
The optimizer applies various optimization techniques like predicate pushdown, constant folding, and join reordering.

By optimizing the query plan, it reduces the overall execution...read more

Add your answer

Asked in Senior Data Engineer Interview

Q8. How do you handle Incremental data?


Ans. Incremental data is handled by identifying new data since the last update and merging it with existing data.

Identify new data since last update

Merge new data with existing data

Update data warehouse or database with incremental changes


Add your answer

Data Management

Share interview questions and


help millions of jobseekers 🌟

Share interview questions

Asked in Senior Data Engineer Interview

Q9. Using two tables find the different records for different joins
Ans. To find different records for different joins using two tables

Use the SQL query to perform different joins like INNER JOIN, LEFT JOIN, RIGHT JOIN, and FULL JOIN

https://www.ambitionbox.com/interviews/persistent-systems-interview-questions/senior-data-engineer/top-questions?campaign=history_cards 2/5
3/7/25, 4:39 PM Top 14 Persistent Systems Senior Data Engineer Interview Questions and Answers 2025 | AmbitionBox
Identify the key columns in both tables to join on

Select the columns from both tables and use WHERE clause to filter out the different records
Add your answer

SQL

Asked in Senior Data Engineer Interview

Q10. What happens when we enforce schema ?


Ans. Enforcing schema ensures that data conforms to a predefined structure and rules.
Ensures data integrity by validating incoming data against predefined schema

Helps in maintaining consistency and accuracy of data

Prevents data corruption and errors in data processing

Can lead to rejection of data that does not adhere to the schema

Add your answer

Asked in Senior Data Engineer Interview

Q11. How DAG handle Fault tolerance?


Ans. DAGs handle fault tolerance by rerunning failed tasks and maintaining task dependencies.
DAGs rerun failed tasks automatically to ensure completion.

DAGs maintain task dependencies to ensure proper sequencing.

DAGs can be configured to retry failed tasks a certain number of times before marking them as failed.

Add your answer

Asked in Senior Data Engineer Interview

Q12. When we use ssis packages? Difference between union merge


Ans. SSIS packages are used for ETL processes in SQL Server. Union combines datasets vertically, while merge combines them horizontally.

SSIS packages are used for Extract, Transform, Load (ETL) processes in SQL Server.

Union in SSIS combines datasets vertically, stacking rows on top of each other.
Merge in SSIS combines datasets horizontally, matching rows based on specified columns.

Union All in SSIS combines datasets vertically without removing duplicates.

Merge Join in SSIS combine...read more

Add your answer

Asked in Senior Data Engineer Interview

Q13. What is SCD ??


Ans. SCD stands for Slowly Changing Dimension, a concept in data warehousing to track changes in data over time.

SCD is used to maintain historical data in a data warehouse.


There are three types of SCD - Type 1, Type 2, and Type 3.

Type 1 SCD overwrites old data with new data.


Type 2 SCD creates a new record for each change, preserving history.
Type 3 SCD maintains both old and new values in the same record.

SCD is important for tracking changes in dimensions like customer information ...read more
Add your answer

Asked in Senior Data Engineer Interview

Q14. What is ssis? How we use


Ans. SSIS stands for SQL Server Integration Services, a tool provided by Microsoft for data integration and workflow applications.

SSIS is a platform for building high-performance data integration and workflow solutions.
It allows you to create packages that move data from various sources to destinations.
SSIS includes a visual design interface for creating, monitoring, and managing data integration processes.

You can use SSIS to automate tasks such as data extraction, transformation,...read more
Add your answer

SSIS

More about working at Persistent Systems

HQ - Pune, Maharashtra, India IT Services & Consulting 10k-50k Employees (India) Telecom FinTech Healthcare Emerging Technologies

https://www.ambitionbox.com/interviews/persistent-systems-interview-questions/senior-data-engineer/top-questions?campaign=history_cards 3/5
3/7/25, 4:39 PM Top 14 Persistent Systems Senior Data Engineer Interview Questions and Answers 2025 | AmbitionBox
Software Product

Contribute & help others!

Write a review Share interview Contribute salary Add office photos

Interview Process at Persistent


Systems Senior Data Engineer
based on 2 interviews

Interview experience

4.0 Good

View more

Interview Tips & Stories

Ace your next interview with expert


advice and inspiring stories

Explore community

Top Senior Data Engineer Interview


Questions from Similar Companies

LTIMindtree Senior Data Engineer


Interview Questions
3.8 • 36 Interview Questions

TCS Senior Data Engineer Interview


Questions
3.7 • 22 Interview Questions

EPAM Systems Senior Data Engineer


Interview Questions
3.7 • 10 Interview Questions

View all

Recently Viewed

INTERVIEWS INTERVIEWS INTERVIEWS INTERVIEWS INTERVIEWS INTERVIEWS

Accenture EPAM Systems EPAM Systems EPAM Systems LTIMindtree TCS


Senior Data Engineer Senior Data Engineer Senior Data Engineer Senior Data Engineer

8.1k interviews 10 top interview questions 11 interviews 535 interviews 36 top interview questions 22 top interview questions

Home > Interviews > Persistent Systems Interview Questions And Answers > Persistent Systems Senior Data Engineer Interview Questions And Answers >
Top Persistent Systems Senior Data Engineer Interview Questions And Answers
Stay ahead in your career.
Get AmbitionBox app

https://www.ambitionbox.com/interviews/persistent-systems-interview-questions/senior-data-engineer/top-questions?campaign=history_cards 4/5
3/7/25, 4:39 PM Top 14 Persistent Systems Senior Data Engineer Interview Questions and Answers 2025 | AmbitionBox

Helping over 1 Crore job seekers every month in choosing their right fit company

75 Lakh+ 5 Lakh+ 4 Crore+ 1 Cr+


Reviews Interviews Salaries Users/Month

Contribute to help millions

Write a Review Add a salary Share an interview Add Office Photos Add Company Benefits

Users/Jobseekers AmbitionBox Awards

Companies ABECA 2026

Reviews ABECA 2025

Salaries ABECA 2024


Jobs AmbitionBox Best Places to Work 2022

Interviews AmbitionBox Best Places to Work 2021


Salary Calculator Invite employees to rate

Campus Placements

Practice Test

Compare Companies

AmbitionBox Employers

About Us Create a new company

Email Us Update company information

Blog Respond to reviews

FAQ Invite employees to review

Credits AmbitionBox Offering for Employers

AmbitionBox Employers Brochure

Privacy Grievances Terms of Use Summons/Notices Community Guidelines

Get AmbitionBox app

Follow us

Made with ❤️ in India. Trademarks belong to their respective owners. All rights reserved © 2024 Info Edge (India) Ltd.

https://www.ambitionbox.com/interviews/persistent-systems-interview-questions/senior-data-engineer/top-questions?campaign=history_cards 5/5

You might also like