0% found this document useful (0 votes)

5 views7 pages

SQL Optimization

The document discusses the differences between UNION and UNION ALL in SQL, emphasizing that UNION removes duplicates and is slower, while UNION ALL retains duplicates and is faster. It advises using UNION ALL unless deduplication is necessary, and provides optimization tips to avoid expensive sorting operations. Real-world examples illustrate the performance benefits of using UNION ALL over UNION in large datasets.

Uploaded by

rajakarthick58360

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views7 pages

SQL Optimization

Uploaded by

rajakarthick58360

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

SQL OPTIMIZATION

INTERVIEW
QUESTION
Interview Question:

Q: What’s the difference between UNION and UNION

ALL, and when should you use each?
Asked by: TCS Digital, Cognizant GenC Next

✅ Answer:
 UNION: Removes duplicates → slower due to sorting
 UNION ALL: Keeps duplicates → faster and cheaper
 Prefer UNION ALL unless your business case requires
deduplication

Why it matters:

Both DISTINCT and UNION perform sorting and

deduplication, which are expensive operations —
especially with millions of rows. This leads to unnecessary
CPU usage, memory pressure, and longer runtimes in
large-scale data systems (e.g., Azure Synapse, BigQuery,
Spark).
Optimization Tips:

Use UNION ALL instead of UNION when duplicates

don’t affect results.
Avoid DISTINCT unless you’re solving a real business
need.
Investigate duplicates first — don’t assume they're
there.
Check execution plans to confirm if a sort/shuffle is
involved.
Examples:

Slower Query:
SELECT DISTINCT city FROM customers;

✅ Faster (no deduplication needed):

SELECT city FROM customers;

Union with deduplication:

SELECT city FROM customers_2023
UNION
SELECT city FROM customers_2024;

✅ Union All - better performance:

SELECT city FROM customers_2023
UNION ALL
SELECT city FROM customers_2024;
Best Practice:

 ✅ Prefer UNION ALL over UNION when duplicates

do not impact business logic — it's significantly
faster.

 ❌ Avoid using UNION by default — it performs an

implicit sort and deduplication, which is resource-
intensive.

 ✅ If you must remove duplicates, consider whether

upstream data cleansing or filtering can handle it
instead.

 ✅ Use EXISTS, JOIN, or ROW_NUMBER() +

FILTER for more controlled deduplication when
needed.

 Always check the query plan — UNION often

triggers sort or shuffle operations that slow down
performance, especially in Spark or distributed SQL
engines.
Real-World Example:

You’re combining order data from two e-commerce

platforms:

 orders_amazon → 5 million rows

 orders_ebay → 4 million rows

❌ Inefficient Query:
SELECT order_id, customer_id FROM orders_amazon
UNION
SELECT order_id, customer_id FROM orders_ebay;

Result: Full deduplication via sort → high memory

usage, query runs in ~30 seconds on large clusters

✅ Optimized Query:
SELECT order_id, customer_id FROM orders_amazon
UNION ALL
SELECT order_id, customer_id FROM orders_ebay;
Result: No sort → query runs in ~6 seconds
Improves scalability and reduces cost in cloud
environments like BigQuery, Snowflake, or Azure Synapse

Design and Build Modern Datacentres, A to Z practical guide
From Everand
Design and Build Modern Datacentres, A to Z practical guide
Engineer Said AL Hosni
3/5 (2)
Union Vs Union All
No ratings yet
Union Vs Union All
3 pages
CTE Linkedin Posts
No ratings yet
CTE Linkedin Posts
2 pages
Joining Queries and Functions
No ratings yet
Joining Queries and Functions
11 pages
Join VS Union
No ratings yet
Join VS Union
9 pages
Union Operator
No ratings yet
Union Operator
14 pages
Top 10 Frequently Asked Interview Questions and Answers
No ratings yet
Top 10 Frequently Asked Interview Questions and Answers
12 pages
SQL Using, Union: Session 7 (Week 4)
No ratings yet
SQL Using, Union: Session 7 (Week 4)
25 pages
Lab 2 PDF
100% (1)
Lab 2 PDF
11 pages
SET Operators
No ratings yet
SET Operators
9 pages
SQL Concepts
No ratings yet
SQL Concepts
12 pages
Most Confusing SQL Functions
No ratings yet
Most Confusing SQL Functions
27 pages
PracticalNo10
No ratings yet
PracticalNo10
4 pages
Oracle Class - SET Operators
No ratings yet
Oracle Class - SET Operators
10 pages
Select - Union: Syntax
No ratings yet
Select - Union: Syntax
3 pages
Advance SQL With Rajan Chettri
No ratings yet
Advance SQL With Rajan Chettri
47 pages
Joining Queries and Functions in MySQL
No ratings yet
Joining Queries and Functions in MySQL
46 pages
DB Lab New-08 (v3.5-menahil+SA-2024) - Joins
No ratings yet
DB Lab New-08 (v3.5-menahil+SA-2024) - Joins
3 pages
Lab 08 - SQL (DML) - 03
No ratings yet
Lab 08 - SQL (DML) - 03
14 pages
Assignment - Union and Union All
No ratings yet
Assignment - Union and Union All
3 pages
9 DB Unit 3
No ratings yet
9 DB Unit 3
28 pages
Questionaire Set 2 - SQL 1
No ratings yet
Questionaire Set 2 - SQL 1
4 pages
SQL Questions Important
No ratings yet
SQL Questions Important
6 pages
6 SQL Server 2012 Querying pt1 m06 Slides PDF
No ratings yet
6 SQL Server 2012 Querying pt1 m06 Slides PDF
4 pages
Union: Course Materials May Not Be Reproduced in Whole or in Part Without The Prior Written Permission of IBM
No ratings yet
Union: Course Materials May Not Be Reproduced in Whole or in Part Without The Prior Written Permission of IBM
16 pages
Union Examples
No ratings yet
Union Examples
3 pages
Exam Note SQL
No ratings yet
Exam Note SQL
27 pages
Unit4 E-Commerce
No ratings yet
Unit4 E-Commerce
38 pages
Top SQL Interview Questions
No ratings yet
Top SQL Interview Questions
8 pages
SET Operators in SQL
No ratings yet
SET Operators in SQL
11 pages
Set Operators
No ratings yet
Set Operators
3 pages
Implementation of Queries Using SQL Set Operators
No ratings yet
Implementation of Queries Using SQL Set Operators
5 pages
CNG351 Lecture 10 DML Part 2
No ratings yet
CNG351 Lecture 10 DML Part 2
26 pages
SQL Latest
No ratings yet
SQL Latest
7 pages
Practical Data Strategies and Recipes
From Everand
Practical Data Strategies and Recipes
Tom Henricksen
No ratings yet
Explain UNION and UNION ALL SQL Clause With Example
No ratings yet
Explain UNION and UNION ALL SQL Clause With Example
8 pages
MySQL UNION Explained A Tutorial With Practical Examples For All Skill Levels - Devart Blog
No ratings yet
MySQL UNION Explained A Tutorial With Practical Examples For All Skill Levels - Devart Blog
21 pages
SQL Set Operations Examples
No ratings yet
SQL Set Operations Examples
4 pages
Set Operators
No ratings yet
Set Operators
5 pages
Presentation On MYSQL: Presented By: Rajesh Ramagiri
No ratings yet
Presentation On MYSQL: Presented By: Rajesh Ramagiri
10 pages
Experiment No 8
No ratings yet
Experiment No 8
6 pages
NoSQL For Dummies
From Everand
NoSQL For Dummies
Adam Fowler
No ratings yet
SQL 2
No ratings yet
SQL 2
13 pages
Reviewer SQL Lab2
No ratings yet
Reviewer SQL Lab2
2 pages
Industrial Cases in Simulation Modeling
From Everand
Industrial Cases in Simulation Modeling
James A. Chisman PhD
No ratings yet
W05 Paper CIT 225
No ratings yet
W05 Paper CIT 225
2 pages
Union Query: by Neil A. Basabe
No ratings yet
Union Query: by Neil A. Basabe
18 pages
SQL Cheet Sheet
No ratings yet
SQL Cheet Sheet
10 pages
Lesson 2 SQL NoSQL
No ratings yet
Lesson 2 SQL NoSQL
25 pages
Set Operation
No ratings yet
Set Operation
3 pages
6 SQL Post 3
No ratings yet
6 SQL Post 3
47 pages
SQL Basics Cheat Sheet Letter 02
No ratings yet
SQL Basics Cheat Sheet Letter 02
2 pages
DBMS Unit 3.1
No ratings yet
DBMS Unit 3.1
4 pages
SQL For Beginer
No ratings yet
SQL For Beginer
13 pages
Roadmap 2 ETL Testing - by Himanshu
100% (1)
Roadmap 2 ETL Testing - by Himanshu
56 pages
Lab4 - DML3 - DML4
No ratings yet
Lab4 - DML3 - DML4
6 pages
Advanced Data Selection
No ratings yet
Advanced Data Selection
36 pages
Set Operator Notes
No ratings yet
Set Operator Notes
2 pages
SQL - Data Manipulation Language - Concepts
No ratings yet
SQL - Data Manipulation Language - Concepts
6 pages
06 Joins and Set
No ratings yet
06 Joins and Set
45 pages
Buying Computer Parts
No ratings yet
Buying Computer Parts
4 pages
Mobile GIS Development
No ratings yet
Mobile GIS Development
5 pages
IGCSE Hardware-QP
No ratings yet
IGCSE Hardware-QP
7 pages
Unit I
100% (1)
Unit I
7 pages
B XLZ Z2 RZ ZW9 Q MDEz NZ Y0 MTC 2 MQ
No ratings yet
B XLZ Z2 RZ ZW9 Q MDEz NZ Y0 MTC 2 MQ
2 pages
Module 4.1 - Memory and Data Locality: GPU Teaching Kit
No ratings yet
Module 4.1 - Memory and Data Locality: GPU Teaching Kit
132 pages
Ifsys 8003
No ratings yet
Ifsys 8003
50 pages
Installation Instructions DS550X SW 6.2.0 C1
No ratings yet
Installation Instructions DS550X SW 6.2.0 C1
35 pages
Debug Log
No ratings yet
Debug Log
935 pages
Hands-On Lab 1-2024b
No ratings yet
Hands-On Lab 1-2024b
8 pages
AIT APGFC2 Protocol Support
No ratings yet
AIT APGFC2 Protocol Support
6 pages
Hdv100a1 3903ds
No ratings yet
Hdv100a1 3903ds
1 page
J 8707 Paper Ii
No ratings yet
J 8707 Paper Ii
33 pages
Lab#2 Creating A Database
No ratings yet
Lab#2 Creating A Database
6 pages
DB2 For ZOS Course 2
No ratings yet
DB2 For ZOS Course 2
654 pages
Termux Basic Commands
100% (1)
Termux Basic Commands
2 pages
Kushal Vijay Resume
No ratings yet
Kushal Vijay Resume
1 page
How To Clone A Git Repository - Devconnected
No ratings yet
How To Clone A Git Repository - Devconnected
9 pages
String Manipulation Using Operator Overloading - The Code Gallery
No ratings yet
String Manipulation Using Operator Overloading - The Code Gallery
6 pages
How To Use Index and Match
No ratings yet
How To Use Index and Match
7 pages
Pps Syllabus
No ratings yet
Pps Syllabus
7 pages
TSM Linux BA Client Trouble Shooting Issues
No ratings yet
TSM Linux BA Client Trouble Shooting Issues
750 pages
Reverse Engineering For Beginners by Dennis Yurichev - August 2016
No ratings yet
Reverse Engineering For Beginners by Dennis Yurichev - August 2016
987 pages
RTS Networking - Operation - Meeting - July15 - 2021
No ratings yet
RTS Networking - Operation - Meeting - July15 - 2021
17 pages
Test 2 Multiple Choice Questions: Level 1 Asia Pacific University of Technology and Innovation 2017/ 01
No ratings yet
Test 2 Multiple Choice Questions: Level 1 Asia Pacific University of Technology and Innovation 2017/ 01
8 pages
Detailed Lesson Plan in ICT Excel
100% (3)
Detailed Lesson Plan in ICT Excel
5 pages
Steve Zoerb C+I 336 Logo Where To Get Mswlogo
No ratings yet
Steve Zoerb C+I 336 Logo Where To Get Mswlogo
4 pages
CST202 Computer Organization and Architecture, December 2024
No ratings yet
CST202 Computer Organization and Architecture, December 2024
2 pages
Concurrent Request ORA-20100 Errors in The Request Logs
No ratings yet
Concurrent Request ORA-20100 Errors in The Request Logs
3 pages
SAP ABAP Smartforms & SAPscript Formatting
No ratings yet
SAP ABAP Smartforms & SAPscript Formatting
4 pages

SQL Optimization

Uploaded by

SQL Optimization

Uploaded by

SQL OPTIMIZATION

Q: What’s the difference between UNION and UNION

Both DISTINCT and UNION perform sorting and

Use UNION ALL instead of UNION when duplicates

✅ Faster (no deduplication needed):

Union with deduplication:

✅ Union All - better performance:

 ✅ Prefer UNION ALL over UNION when duplicates

 ❌ Avoid using UNION by default — it performs an

 ✅ If you must remove duplicates, consider whether

 ✅ Use EXISTS, JOIN, or ROW_NUMBER() +

 Always check the query plan — UNION often

You’re combining order data from two e-commerce

 orders_amazon → 5 million rows

Result: Full deduplication via sort → high memory

You might also like