Difference Between Distinct and Group by

Group by and distinct handle duplicate rows differently in distributed databases. Group by only sends unique values between AMPs, while distinct redistributes all rows, which can move more data. However, group by wastes time checking for duplicates that may not exist, while distinct does not have this inefficiency. The document then outlines the step-by-step processes for de-duplication using distinct and group by.

Uploaded by

PavelStrelkov

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

151 views1 page

Difference Between Distinct and Group by

Uploaded by

PavelStrelkov

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 1

Since DISTINCT redistributes the rows immediately, more data may move between the AMPs, where as

GROUP BY that only sends unique values between the AMPs.

So, we can say that GROUP BY sounds more efficient.
But when you assume that data is nearly unique in a table, GROUP BY will spend more time
attempting to eliminate duplicates that do not exist at all.Therefore, it is wasting its time to check for
duplicates the first time. Then, it must redistribute the same amount of data .
Let us see how these steps are used in each case for elimination of Duplicates
(can be found out using explain plan)
DISTINCT
1. It reads each row on AMP
2. Hashes the column value identified in the distinct clause of select statement.
3. Then redistributes the rows according to row value into appropriate AMP
4. Once redistribution is completed , it
a. Sorts data to group duplicates on each AMP
b. Will remove all the duplicates on each amp and sends the original/unique value
P.s: There are cases when "Error : 2646 No more Spool Space " . In such cases try using GROUP BY.
GROUP BY
1. It reads all the rows part of GROUP BY
2. It will remove all duplicates in each AMP for given set of values using "BUCKETS" concept
3. Hashes the unique values on each AMP
4. Then it will re-distribute them to particular /appropriate AMP's
5. Once redistribution is completed , it
a. Sorts data to group duplicates on each AMP
b. Will remove all the duplicates on each amp and sends the original/unique value
Hence it is better to go for
GROUP BY - when Many duplicates
DISTINCT - when few or no duplicates
GROUP BY - SPOOL space is exceeded

MATH 5 - Q1 - Mod1 PDF
78% (49)
MATH 5 - Q1 - Mod1 PDF
25 pages
9TH SSC Trigonometry Paper
100% (2)
9TH SSC Trigonometry Paper
2 pages
SQL Keywords
No ratings yet
SQL Keywords
6 pages
Multiple Row Functions
No ratings yet
Multiple Row Functions
3 pages
Week 7 - Day 1-4 - Grouping Data With GROUP by
No ratings yet
Week 7 - Day 1-4 - Grouping Data With GROUP by
2 pages
Reporting Aggregated Data Using The Group Functions
No ratings yet
Reporting Aggregated Data Using The Group Functions
6 pages
Exp 3
No ratings yet
Exp 3
3 pages
DBMS 5
No ratings yet
DBMS 5
6 pages
SQL For Data Analyst Part - 3
No ratings yet
SQL For Data Analyst Part - 3
8 pages
Lab-12-Manual - (Reporting Aggregated Data Using GROUP BY)
No ratings yet
Lab-12-Manual - (Reporting Aggregated Data Using GROUP BY)
11 pages
10.2 - Chapter 6 - Full Relations Operations
No ratings yet
10.2 - Chapter 6 - Full Relations Operations
22 pages
Techniques Used To Transform Data, Part 1
No ratings yet
Techniques Used To Transform Data, Part 1
12 pages
Data Base Lab 6
No ratings yet
Data Base Lab 6
8 pages
SQL - Using Group by On Multiple Columns - Stack Overflow
No ratings yet
SQL - Using Group by On Multiple Columns - Stack Overflow
4 pages
Aggregate Function
No ratings yet
Aggregate Function
5 pages
Aggregate Functions MCA1B
No ratings yet
Aggregate Functions MCA1B
7 pages
'Jensen': Sum Sum Sum
No ratings yet
'Jensen': Sum Sum Sum
6 pages
Oup Func
No ratings yet
Oup Func
19 pages
57.11 - Distinct - mp4
No ratings yet
57.11 - Distinct - mp4
3 pages
Hacker Rank
No ratings yet
Hacker Rank
20 pages
How Are Analytic Functions Different From Group or Aggregate Functions?
No ratings yet
How Are Analytic Functions Different From Group or Aggregate Functions?
4 pages
Session 5 BIZ
No ratings yet
Session 5 BIZ
69 pages
Teradata - Explain
No ratings yet
Teradata - Explain
5 pages
Exp 6 - 7 - 8
No ratings yet
Exp 6 - 7 - 8
26 pages
Structured Query Language: Next Slide
No ratings yet
Structured Query Language: Next Slide
14 pages
Using The Group Functions
No ratings yet
Using The Group Functions
3 pages
Ex07-Aggregating Data With GROUP BY and ORDER BY
No ratings yet
Ex07-Aggregating Data With GROUP BY and ORDER BY
5 pages
Ora Final Material 2024
No ratings yet
Ora Final Material 2024
41 pages
Session 9 XP
No ratings yet
Session 9 XP
89 pages
8-In-Built Functions, Join and Group by Queries-19-03-2024
No ratings yet
8-In-Built Functions, Join and Group by Queries-19-03-2024
38 pages
Aggregation
No ratings yet
Aggregation
35 pages
Slides
No ratings yet
Slides
19 pages
T3 L2 Groupby
No ratings yet
T3 L2 Groupby
25 pages
Clauses Task
No ratings yet
Clauses Task
12 pages
Multiple-Row Function and Group by Clause
No ratings yet
Multiple-Row Function and Group by Clause
19 pages
GR Ouping D Ata: GROUP BY Clause
No ratings yet
GR Ouping D Ata: GROUP BY Clause
11 pages
4.1 Sorting Data: Function Output
No ratings yet
4.1 Sorting Data: Function Output
2 pages
Experiment 7
No ratings yet
Experiment 7
8 pages
Grade: XII Subject: Computer Science Aggregate Function in SQL
No ratings yet
Grade: XII Subject: Computer Science Aggregate Function in SQL
16 pages
Grouping and Aggregating Data: Module Overview
No ratings yet
Grouping and Aggregating Data: Module Overview
24 pages
SQL To Pandas - Group Aggregations
No ratings yet
SQL To Pandas - Group Aggregations
6 pages
Introduction To Oracle Functions and Group by Clause
100% (2)
Introduction To Oracle Functions and Group by Clause
62 pages
CH 2.3 - Aggregate Functions
No ratings yet
CH 2.3 - Aggregate Functions
4 pages
Module 10 Summarizing Data
No ratings yet
Module 10 Summarizing Data
32 pages
Grouping and Aggregating Data
No ratings yet
Grouping and Aggregating Data
21 pages
Queries
No ratings yet
Queries
12 pages
Database SQL Aggregate Functions
No ratings yet
Database SQL Aggregate Functions
14 pages
SQL Aggregate Functions
No ratings yet
SQL Aggregate Functions
9 pages
Last Minute Revision For IP Board Exam - MySQL
No ratings yet
Last Minute Revision For IP Board Exam - MySQL
3 pages
Assignment 5
No ratings yet
Assignment 5
15 pages
Grouping and Aggregating Data
No ratings yet
Grouping and Aggregating Data
15 pages
W3 R4 Aggregate
No ratings yet
W3 R4 Aggregate
16 pages
Grouping and Summarizing Data
No ratings yet
Grouping and Summarizing Data
34 pages
Exp 3
No ratings yet
Exp 3
2 pages
Chapter 7 - Querying Using SQL
No ratings yet
Chapter 7 - Querying Using SQL
32 pages
Aggregating and Grouping Example
No ratings yet
Aggregating and Grouping Example
25 pages
4 Group by Clause, Having Clause, Multiple Row (Or Group or Aggregate) Functions
100% (1)
4 Group by Clause, Having Clause, Multiple Row (Or Group or Aggregate) Functions
17 pages
Warehouse and SQL QUESTIONS
No ratings yet
Warehouse and SQL QUESTIONS
14 pages
SELECT DISTINCT Store - Name FROM Store - Information: Result
No ratings yet
SELECT DISTINCT Store - Name FROM Store - Information: Result
7 pages
Aggregate Functions
No ratings yet
Aggregate Functions
14 pages
50 most powerful Excel Functions and Formulas
From Everand
50 most powerful Excel Functions and Formulas
Andrei Besedin
4/5 (1)
201 Mind Boggling Problems In Mathematics
From Everand
201 Mind Boggling Problems In Mathematics
Srijit Mondal
No ratings yet
Effectively Moving S As Data Into Tera Data
No ratings yet
Effectively Moving S As Data Into Tera Data
74 pages
Tip Sheet Connecting To Tera Data
No ratings yet
Tip Sheet Connecting To Tera Data
1 page
Secondary Index
No ratings yet
Secondary Index
4 pages
Shared Disk vs. Shared Nothing
No ratings yet
Shared Disk vs. Shared Nothing
17 pages
Full Download The Future of HRD, Volume I: Innovation and Technology Mark Loon PDF
100% (2)
Full Download The Future of HRD, Volume I: Innovation and Technology Mark Loon PDF
76 pages
Lift Manuals - Manuale Delle Parti - CHASSIS, MAST, OPTIONS & INTERNAL HOSING - PDF Tav 4 Ver
No ratings yet
Lift Manuals - Manuale Delle Parti - CHASSIS, MAST, OPTIONS & INTERNAL HOSING - PDF Tav 4 Ver
3 pages
Rebranding and Revitalisation
100% (1)
Rebranding and Revitalisation
7 pages
FULL Version Testbank Coordinate Geometry For JEE Advanced 3rd Edition G Tewani Multiple Formats
No ratings yet
FULL Version Testbank Coordinate Geometry For JEE Advanced 3rd Edition G Tewani Multiple Formats
409 pages
Subtitle
No ratings yet
Subtitle
4 pages
6sn1118 0dh23 0aa1 Manual
100% (1)
6sn1118 0dh23 0aa1 Manual
485 pages
Pre-Schwarzian and Schwarzian Norm Estimates For Subclasses of Univalent Functions
No ratings yet
Pre-Schwarzian and Schwarzian Norm Estimates For Subclasses of Univalent Functions
19 pages
AE 814 Compliance of Draft Construction Stage Report For TMP (PKG-I To III)
No ratings yet
AE 814 Compliance of Draft Construction Stage Report For TMP (PKG-I To III)
12 pages
Hardening
No ratings yet
Hardening
7 pages
Pediatric Demyelinating Diseases of The Central Nervous System and Their Mimics
100% (1)
Pediatric Demyelinating Diseases of The Central Nervous System and Their Mimics
338 pages
B1 Final Test SpeakingTestFormat
No ratings yet
B1 Final Test SpeakingTestFormat
4 pages
General Biology Chapter 2 Assignment
No ratings yet
General Biology Chapter 2 Assignment
2 pages
Todorov Theory
No ratings yet
Todorov Theory
1 page
5th Grade Gmo Plan
No ratings yet
5th Grade Gmo Plan
1 page
Case Study On Starbucks Coffee
No ratings yet
Case Study On Starbucks Coffee
14 pages
Home Sweet Compromise
No ratings yet
Home Sweet Compromise
7 pages
Assignment - 2 (Google in China)
100% (1)
Assignment - 2 (Google in China)
5 pages
Updates in Hyperkalemia: Outcomes and Therapeutic Strategies
No ratings yet
Updates in Hyperkalemia: Outcomes and Therapeutic Strategies
7 pages
Forrester - Enabling Smarter Procurement
No ratings yet
Forrester - Enabling Smarter Procurement
15 pages
Homework 6: Math 308 Due: 8 March
No ratings yet
Homework 6: Math 308 Due: 8 March
3 pages
Fundamentals For Nursing
100% (1)
Fundamentals For Nursing
5 pages
Advanced AutoCAD 2022 Exercise Workbook For Windows Cheryl R Shrock Steve Heather Download PDF
100% (2)
Advanced AutoCAD 2022 Exercise Workbook For Windows Cheryl R Shrock Steve Heather Download PDF
40 pages
GD121 Spare Parts Old
No ratings yet
GD121 Spare Parts Old
647 pages
Math 6 March 23 Quarter 3 Speed
No ratings yet
Math 6 March 23 Quarter 3 Speed
34 pages
Acyfar 3 Answer Key Q1andq2 T2ay2324
No ratings yet
Acyfar 3 Answer Key Q1andq2 T2ay2324
3 pages
OSS Engine Parts Section
No ratings yet
OSS Engine Parts Section
28 pages
General Description: ISO 17987/LIN 2.x/SAE J2602 Transceiver
100% (1)
General Description: ISO 17987/LIN 2.x/SAE J2602 Transceiver
24 pages
The Process of Photosynthesis
No ratings yet
The Process of Photosynthesis
2 pages

Difference Between Distinct and Group by

Uploaded by

Difference Between Distinct and Group by

Uploaded by

Since DISTINCT redistributes the rows immediately, more data may move between the AMPs, where as

GROUP BY that only sends unique values between the AMPs.

You might also like