Python, Pyspark,SQL

The document contains a series of technical questions related to data processing, Spark architecture, and Databricks features. It covers topics such as transformations, data loading techniques, SQL queries, and optimization strategies. Additionally, it includes inquiries about recent projects, workflows, and data management practices.

Uploaded by

Shobhit

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views

Python, Pyspark,SQL

Uploaded by

Shobhit

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

You are on page 1/ 1

1.What is difference between Narrow and wide Transformation?

2.What is spark architecture

3 What is Initial, incremental and delta load?
4.What is the difference between incremental and delta load?
5.In which layer initial and incremental load takes place?
6. Write a code for incremental load
7.Concept of shuffling and stages in spark
8. Given two tables named "orders" and "order_details" with columns (order_id,
customer_id, order_date) and (order_id, product_id, quantity, unit_price), write an
SQL query to find the total revenue generated by each customer in the year 2023.
9.Table A: 1,1 Table 2 : 1,1,1
Write values of full join, inner join, left join, right join
10.Concept of DAG and lineage graph

1) What are workflows in Databricks?

2) What is Unity Catalog in Databricks, and what are its features?
3) What are the 4 Vs of Big Data?
4) If we have 1 driver and 3 workers and a dataset with 100 records, how many
partitions would be created for perfect distribution?
5) What are Spark optimization techniques?
6) Can you explain broadcast join with a real-world scenario?
7) How do you configure cluster settings?
8) Can you describe the domains of projects you've worked on?
9) What is pivoting in the context of data processing?
10) partitioning?

Recent project explanations

Questions related to recent projects:
What is Unity Catalog and how does it differ from Hive Metastore?
How do you maintain logging?
Steps for cost optimization.
Real problems related to concurrency control.
Scenario-based questions:
How would you fetch streaming data every 2 minutes from an API and ingest it into
Databricks? Write a step by step process.
Cross questions included:
Data Cleansing
Handling dirty/corrupt data
Reverse ETL
Medallion Architecture in detail
Cluster Configurations

Trackpad Pro Ver. 5.0 Class 6: WINDOWS 11 & MS OFFICE 2021
From Everand
Trackpad Pro Ver. 5.0 Class 6: WINDOWS 11 & MS OFFICE 2021
Nidhi Arora
No ratings yet
Modern C++ Programming
From Everand
Modern C++ Programming
Orhan Gazi
No ratings yet
SQL: For Beginners: Your Guide To Easily Learn SQL Programming in 7 Days
From Everand
SQL: For Beginners: Your Guide To Easily Learn SQL Programming in 7 Days
i Code Academy
5/5 (4)
Google Cloud Platform for Data Engineering: From Beginner to Data Engineer using Google Cloud Platform
From Everand
Google Cloud Platform for Data Engineering: From Beginner to Data Engineer using Google Cloud Platform
alasdair gilchrist
5/5 (1)
Azure de Interview Question Set Part 1 1710925748
No ratings yet
Azure de Interview Question Set Part 1 1710925748
9 pages
interview questions
No ratings yet
interview questions
3 pages
Ibm Datastage Interview Questions
No ratings yet
Ibm Datastage Interview Questions
3 pages
Informatica and Business Objects Faq's
No ratings yet
Informatica and Business Objects Faq's
9 pages
Data Engineer
No ratings yet
Data Engineer
19 pages
Informatica Qns
No ratings yet
Informatica Qns
10 pages
All Interview
No ratings yet
All Interview
11 pages
Interview Questions
No ratings yet
Interview Questions
6 pages
Basic Interview Questions
No ratings yet
Basic Interview Questions
2 pages
Excelent Scenarios and Faq's of Informatica
0% (1)
Excelent Scenarios and Faq's of Informatica
34 pages
4
No ratings yet
4
2 pages
text 4
No ratings yet
text 4
1 page
Imp QSTN
No ratings yet
Imp QSTN
18 pages
Datastage Interview Questions
No ratings yet
Datastage Interview Questions
22 pages
Web Site Name:: 100 TOP SAP BI Interview Questions and Answers PDF
No ratings yet
Web Site Name:: 100 TOP SAP BI Interview Questions and Answers PDF
3 pages
Interview Questions_Who attended from Batch_12
No ratings yet
Interview Questions_Who attended from Batch_12
6 pages
Powerbi Interview Questions
No ratings yet
Powerbi Interview Questions
6 pages
Ds Ques
No ratings yet
Ds Ques
2 pages
Etl Interview Questions
100% (1)
Etl Interview Questions
4 pages
azure comapny wise question
No ratings yet
azure comapny wise question
68 pages
DATA_ENGINEER QUESTIONS
No ratings yet
DATA_ENGINEER QUESTIONS
3 pages
Interview Questions
No ratings yet
Interview Questions
96 pages
1st Round Interview Questions
No ratings yet
1st Round Interview Questions
5 pages
Oracle Questions:: On Wed, Mar 23, 2011 at 22:24, Arun Kumar Gollamudi Wrote
No ratings yet
Oracle Questions:: On Wed, Mar 23, 2011 at 22:24, Arun Kumar Gollamudi Wrote
9 pages
All Interview Questions Cognos Ibm
No ratings yet
All Interview Questions Cognos Ibm
13 pages
Faqs
No ratings yet
Faqs
2 pages
Selected Questions
No ratings yet
Selected Questions
12 pages
Spark Questions Asked in Mock Interview
No ratings yet
Spark Questions Asked in Mock Interview
2 pages
Oracle Interview Question
No ratings yet
Oracle Interview Question
7 pages
Pawan Kumar Khowal SQL Server Interview Questions Only Set 1 100 Questions
No ratings yet
Pawan Kumar Khowal SQL Server Interview Questions Only Set 1 100 Questions
3 pages
Computer_STD 7_HYE_Ch 2 to 4 RQB_2024_25_1
No ratings yet
Computer_STD 7_HYE_Ch 2 to 4 RQB_2024_25_1
8 pages
What Is The Difference Between Query Transform and SQL Transform in BODI
No ratings yet
What Is The Difference Between Query Transform and SQL Transform in BODI
7 pages
InInformaticaiew Question in in Vesco
No ratings yet
InInformaticaiew Question in in Vesco
3 pages
Int Questions
100% (1)
Int Questions
5 pages
Viva Voce
No ratings yet
Viva Voce
41 pages
Informatica Faqs
No ratings yet
Informatica Faqs
33 pages
Home Work
No ratings yet
Home Work
11 pages
ADF Question only for your practice Ramya
No ratings yet
ADF Question only for your practice Ramya
2 pages
Informatica FAQ's
No ratings yet
Informatica FAQ's
5 pages
PLSQL Interview Questions and Answers: Functions
No ratings yet
PLSQL Interview Questions and Answers: Functions
15 pages
Informatic Question
No ratings yet
Informatic Question
2 pages
Latest Year KPIT Technologies Technical Test Question Paper
No ratings yet
Latest Year KPIT Technologies Technical Test Question Paper
7 pages
My Questions
No ratings yet
My Questions
4 pages
TCS SQL Question
No ratings yet
TCS SQL Question
2 pages
Digital Engineering: Complex System Design
From Everand
Digital Engineering: Complex System Design
S Mathioudakis
No ratings yet
Coding Interview Questions and Answers
From Everand
Coding Interview Questions and Answers
Chinmoy Mukherjee
No ratings yet
GETTING STARTED WITH SQL: Exercises with PhpMyAdmin and MySQL
From Everand
GETTING STARTED WITH SQL: Exercises with PhpMyAdmin and MySQL
Remy Lentzner
No ratings yet
Real-Time Big Data Analytics: Emerging Trends
From Everand
Real-Time Big Data Analytics: Emerging Trends
Trilokesh Khatri
No ratings yet
C++ Data Structures Explained: A Practical Guide with Examples
From Everand
C++ Data Structures Explained: A Practical Guide with Examples
William E. Clark
No ratings yet
100 Puzzles to Learn Data Warehousing
From Everand
100 Puzzles to Learn Data Warehousing
Cristian Scutaru
No ratings yet
Data Structures and Algorithms with Python
From Everand
Data Structures and Algorithms with Python
Aadinath Pothuvaal
No ratings yet
Modern C++ Programming: Including the recent standards C++11, C++17, C++20, C++23
From Everand
Modern C++ Programming: Including the recent standards C++11, C++17, C++20, C++23
Orhan Gazi
No ratings yet
AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
From Everand
AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
Sama Alshatali
No ratings yet
C++ Programming: From Novice to Expert in a Step-by-Step Journey
From Everand
C++ Programming: From Novice to Expert in a Step-by-Step Journey
Ryan Campbell
No ratings yet
Hands-on Cloud Analytics with Microsoft Azure Stack
From Everand
Hands-on Cloud Analytics with Microsoft Azure Stack
Prashila Naik
No ratings yet
Mastering the Art of C++ STL: Unlock the Secrets of Expert-Level Skills
From Everand
Mastering the Art of C++ STL: Unlock the Secrets of Expert-Level Skills
Larry Jones
No ratings yet
Arayan-Raj-Resume
No ratings yet
Arayan-Raj-Resume
2 pages
Wwp Snaplogic Steps
No ratings yet
Wwp Snaplogic Steps
1 page
CV_2024-08-08_Vikash_Kumar
No ratings yet
CV_2024-08-08_Vikash_Kumar
1 page
PREM_ASHISH_CV_DA
No ratings yet
PREM_ASHISH_CV_DA
2 pages

Python, Pyspark,SQL

Uploaded by

Python, Pyspark,SQL

Uploaded by

1.What is difference between Narrow and wide Transformation?

2.What is spark architecture

1) What are workflows in Databricks?

Recent project explanations

You might also like