Persistent Systems Senior Data Engineer
10+ Persistent Systems Senior Data Engineer Interview Questions and Answers
Updated 27 Sep 2024
Q1. What is the best approach to finding whether the data frame is empty or not?
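Ans. For a pandas DataFrame, check df.empty or compare len(df) with 0.
len() returns the number of rows in a pandas DataFrame.
If the length is 0, the DataFrame has no rows. A PySpark DataFrame does not support len(); use df.isEmpty() (PySpark 3.3+) or check whether df.head(1) returns an empty list.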
Repartition can increase or decrease the number of partitions in a DataFrame, leading to a shuffle of data across the cluster.
Coalesce only decreases the number of partitions in a DataFrame without performing a full shuffle, making it more efficient than repartition.
Repartition is typically used when there is a need to increase the number of parti...
Spark
Q3. Write two SQL queries and two Python snippets, for example to reverse a string.
Ans. Reverse a string using SQL and Python.
In SQL, use the REVERSE function (available in SQL Server, MySQL, and PostgreSQL), for example SELECT REVERSE('hello');
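The Python half of the answer is not shown above. As a minimal sketch, here are two common ways to reverse a string in Python; the function names are hypothetical and chosen only for illustration.

```python
def reverse_with_slice(text: str) -> str:
    """Reverse a string with an extended slice (step of -1)."""
    return text[::-1]


def reverse_with_join(text: str) -> str:
    """Reverse a string by joining the characters yielded by reversed()."""
    return "".join(reversed(text))


if __name__ == "__main__":
    print(reverse_with_slice("hello"))
    print(reverse_with_join("hello"))
```

Both versions return a new string, since Python strings are immutable. The slice form is the more common idiom.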
Monitor the system performance and adjust cores and worker nodes as needed
Cloud Computing
https://www.ambitionbox.com/interviews/persistent-systems-interview-questions/senior-data-engineer/top-questions?campaign=history_cards 1/5
3/7/25, 4:39 PM Top 14 Persistent Systems Senior Data Engineer Interview Questions and Answers 2025 | AmbitionBox
Q5. Find top 5 countries with highest population in Spark and SQL
Ans. In both Spark and SQL, sort the countries by population in descending order and keep the first five rows.
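A minimal sketch of the SQL side, run through Python's built-in sqlite3 module so it is self-contained. The table name countries, its columns, and the placeholder rows are assumptions made for illustration, not real population figures. In PySpark the same idea would typically be written as df.orderBy(F.col("population").desc()).limit(5), assuming a DataFrame df with a population column.

```python
import sqlite3

# In-memory database with a hypothetical "countries" table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE countries (name TEXT, population INTEGER)")

# Placeholder rows (made-up values, not real data).
rows = [
    ("Country A", 700), ("Country B", 300), ("Country C", 900),
    ("Country D", 100), ("Country E", 500), ("Country F", 800),
    ("Country G", 200),
]
conn.executemany("INSERT INTO countries VALUES (?, ?)", rows)

# Sort by population, largest first, and keep five rows.
top5 = conn.execute(
    "SELECT name, population FROM countries "
    "ORDER BY population DESC LIMIT 5"
).fetchall()

for name, population in top5:
    print(name, population)
```

LIMIT is the SQLite, MySQL, and PostgreSQL spelling; SQL Server uses SELECT TOP 5 instead.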
Spark SQL
It can be handled efficiently by minimizing the amount of data being shuffled and optimizing the partitioning strategy.
Techniques like partitioning, combiners, and reducers can help reduce the amount of shuffling in MapReduce jobs.
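To illustrate how a combiner reduces shuffled data, here is a small pure-Python sketch of a word count. It is not MapReduce or Spark code; the partitions and function names are hypothetical, and the "shuffle" is modelled simply as the list of records that would cross the network.

```python
from collections import Counter

# Hypothetical input: two map-side partitions of words.
partitions = [
    ["spark", "sql", "spark", "spark"],
    ["sql", "spark", "sql"],
]


def shuffle_without_combiner(parts):
    """Emit one (word, 1) record per input word, as a plain mapper would."""
    return [(word, 1) for part in parts for word in part]


def shuffle_with_combiner(parts):
    """Pre-aggregate inside each partition, emitting one record per distinct word."""
    records = []
    for part in parts:
        records.extend(Counter(part).items())
    return records


def reduce_counts(records):
    """Sum the partial counts per word on the reduce side."""
    totals = Counter()
    for word, count in records:
        totals[word] += count
    return dict(totals)


plain = shuffle_without_combiner(partitions)
combined = shuffle_with_combiner(partitions)
print(len(plain), len(combined))
print(reduce_counts(plain) == reduce_counts(combined))
```

The final counts are identical either way; the combiner only changes how many records have to be moved before the reduce step.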
Algorithms
It leverages rules to transform the logical query plan into a more optimized physical plan.
The optimizer applies various optimization techniques like predicate pushdown, constant folding, and join reordering.
Data Management
Q9. Using two tables, find the records returned by different types of join
Ans. Compare what each join type returns for the same two tables
Run SQL queries that perform an INNER JOIN, LEFT JOIN, RIGHT JOIN, and FULL JOIN
Identify the key columns in both tables to join on
Select the columns from both tables and use a WHERE clause (for example, a check that the other table's key IS NULL) to isolate the records that do not match
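The steps above can be sketched with Python's built-in sqlite3 module. The tables left_t and right_t, their columns, and their rows are hypothetical. Only INNER JOIN and LEFT JOIN are used, because RIGHT and FULL JOIN are not available in older SQLite versions; the right-only rows are found by swapping the table order.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE left_t  (id INTEGER, val TEXT);
    CREATE TABLE right_t (id INTEGER, val TEXT);
    INSERT INTO left_t  VALUES (1, 'a'), (2, 'b'), (3, 'c');
    INSERT INTO right_t VALUES (2, 'x'), (3, 'y'), (4, 'z');
""")


def run(sql):
    """Execute a query and return all rows as a list of tuples."""
    return conn.execute(sql).fetchall()


# Rows whose key exists in both tables.
inner_rows = run(
    "SELECT l.id, l.val, r.val FROM left_t l "
    "INNER JOIN right_t r ON l.id = r.id ORDER BY l.id"
)

# Every left row; right-side columns are NULL where there is no match.
left_rows = run(
    "SELECT l.id, l.val, r.val FROM left_t l "
    "LEFT JOIN right_t r ON l.id = r.id ORDER BY l.id"
)

# Anti-joins: keep only the rows that found no partner.
left_only = run(
    "SELECT l.id FROM left_t l "
    "LEFT JOIN right_t r ON l.id = r.id WHERE r.id IS NULL"
)
right_only = run(
    "SELECT r.id FROM right_t r "
    "LEFT JOIN left_t l ON l.id = r.id WHERE l.id IS NULL"
)

print(inner_rows, left_rows, left_only, right_only)
```

Comparing left_rows with inner_rows shows which records a LEFT JOIN adds, and the two anti-join results together are the rows a FULL JOIN would add on top of the INNER JOIN.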
SQL
Can lead to rejection of data that does not adhere to the schema
DAGs can be configured to retry failed tasks a certain number of times before marking them as failed.
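Assuming the DAGs in question are Apache Airflow DAGs (the source does not name the scheduler), retries are usually set through the retries and retry_delay task arguments. A minimal sketch of such settings follows; the values 3 and 5 minutes are illustrative assumptions.

```python
from datetime import timedelta

# Settings commonly passed as default_args to an Airflow DAG,
# so that every task in the DAG inherits them.
default_args = {
    "retries": 3,                         # re-run a failed task up to 3 times
    "retry_delay": timedelta(minutes=5),  # wait between attempts
}

print(default_args)
```

A task is marked as failed only after its retries are exhausted; individual tasks can override these defaults with their own arguments.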
SSIS packages are used for Extract, Transform, Load (ETL) processes in SQL Server.
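Union All in SSIS combines datasets vertically, stacking rows on top of each other; its inputs do not need to be sorted.
Merge in SSIS also combines rows vertically, but it takes exactly two sorted inputs and produces a single sorted output. Matching rows horizontally on specified columns is the job of Merge Join.
SCD is important for tracking changes in dimensions like customer information ...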
SSIS is a platform for building high-performance data integration and workflow solutions.
It allows you to create packages that move data from various sources to destinations.
SSIS includes a visual design interface for creating, monitoring, and managing data integration processes.
You can use SSIS to automate tasks such as data extraction, transformation,...
SSIS
HQ - Pune, Maharashtra, India IT Services & Consulting 10k-50k Employees (India) Telecom FinTech Healthcare Emerging Technologies
Made with ❤️ in India. Trademarks belong to their respective owners. All rights reserved © 2024 Info Edge (India) Ltd.