0% found this document useful (0 votes)

11 views8 pages

GroupByHavinginSQL

The document explains the use of SQL commands GROUP BY and HAVING for data aggregation and filtering. It details how to group data, apply aggregate functions, and filter results using HAVING to handle conditions on aggregated data. Additionally, it outlines the order of execution for SQL commands, clarifying the differences between WHERE and HAVING clauses.

Uploaded by

ranaswarnadeep

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views8 pages

GroupByHavinginSQL

Uploaded by

ranaswarnadeep

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

Use GROUP BY and HAVING in SQL

Aggregation is another name for summarizing your data points to get a single value.
For example, calculating the mean or the minimum. Sometimes, aggregating all your
data will result in a value that isn't useful.

For example, if you are exploring buying behavior in your store, and the people who come in
are a mix of poor students and rich professionals, it will be more informative to calculate the
mean spend for those groups separately. That is, you need to aggregate the amount spent,
grouped by different customer segments.

Using SQL GROUP BY

GROUP BY is a SQL command commonly used to aggregate the data to get insights from it.
There are three phases when you group data:

 Split: the dataset is split up into chunks of rows based on the values of the
variables we have chosen for the aggregation
 Apply: Compute an aggregate function, like average, minimum and maximum,
returning a single value
 Combine: All these resulting outputs are combined in a unique table. In this way,
we’ll have a single value for each modality of the variable of interest.
SQL GROUP BY Example 1

We can begin by showing a simple example of GROUP BY. Suppose we want to find the top
ten countries with the highest number of Unicorn companies.

SELECT * FROM companies

It also would be nice to order the results in decreasing order based on the number of
companies

SELECT country, COUNT(*) AS n_companies

FROM companies

GROUP BY country

ORDER BY n_companies DESC

LIMIT 10

Here we have the results. You will probably not be surprised to find the US, China, and India
in the ranking. Let’s explain the decision behind this query:

 First, notice that we used COUNT(*) to count the rows for each group, which
corresponds to the country. In addition, we also used the SQL alias to rename the
column into a more explainable name. This is possible by using the keyword AS,
followed by the new name. COUNT is covered in more depth in the COUNT() SQL
FUNCTION tutorial.
 The fields were selected from the table companies, where each row corresponds to
a Unicorn company.
 After, we need to specify the column name after GROUP BY to aggregate the data
based on the country.
 ORDER BY is required to visualize the countries in the right order, from the highest
number to the lower number of companies.
 We limit the results to 10 using LIMIT, which is followed by the number of rows
you want in the results.
SQL GROUP BY Example 2

Now, we will analyze the table with the sales. For each order number, we have the type of
client, the product line, the quantity, the unit price, the total, etc.

his time, we are interested in finding the average price per unit, the total number of orders,
and the total gain for each product line:

SELECT

product_line,

AVG(unit_price) AS avg_price,

SUM(quantity) AS tot_pieces,

SUM(total) AS total_gain

FROM sales

GROUP BY product_line

ORDER BY total_gain DESC

 Instead of counting the number of rows, we have the AVG() function to obtain the
average price and the SUM() function to calculate the total number of orders and the
total gain for each product line.
 As before, we specify the column initially dividing the dataset into chunks. Then
the aggregation functions will allow us to obtain a row per each modality of the
product line.
 This time, ORDER BY is optional. It was included to highlight how the higher total
gains are not always proportional to higher average prices or total pieces.

The Limitations of WHERE

Let’s take the previous example again. Now, we want to put a condition to the query: we only
want to filter for the total number of orders higher than 40,000. Let's try the WHERE clause:

SELECT

product_line,

AVG(unit_price) AS avg_price,

SUM(quantity) AS tot_pieces,

SUM(total) AS total_gain

FROM sales

WHERE SUM(total) > 40000

GROUP BY product_line

ORDER BY total_gain DESC

This query will return the following error:

This error’s not possible to pass aggregated functions in the WHERE clause. We need a new
command to solve this issue.

Using SQL HAVING

Like WHERE, the HAVING clause filters the rows of a table. Whereas WHERE tried to filter the
whole table, HAVING filters rows within each of the groups defined by GROUP BY

SQL HAVING Example 1

Here's the previous example again, replacing the word WHERE with HAVING.

SELECT

product_line,

AVG(unit_price) AS avg_price,

SUM(quantity) AS tot_pieces,

SUM(total) AS total_gain

FROM sales

GROUP BY product_line

HAVING SUM(total) > 40000

ORDER BY total_gain DESC

This time it will produce three rows. The other product lines didn’t match the criterion, so we
passed from six results to three.

What else do you notice from the query? We didn’t pass the column alias to HAVING, but the
aggregation of the original field. Are you asking yourself why? You’ll unravel the mystery in
the next example.

SQL HAVING Example 2

As the last example, we will use the table called product_emissions, which contains the
emission of the products provided by the companies.

This time, we are interested in showing the average product carbon footprint (pcf) for each
company that belongs to the industry group “Technology Hardware & Equipment.”
Moreover, it would be helpful to see the number of products for each company to understand
if there is some relationship between the number of products and the carbon footprint. We
also again use HAVING to extract companies with an average carbon footprint of over 100.

SELECT pe.company, count(product_name) AS n_products, avg(carbon_footprint_pcf) AS

avg_carbon_footprint_pcf

FROM product_emissions AS pe

WHERE industry_group = 'Technology Hardware & Equipment'

GROUP BY pe.company, industry_group

having avg_carbon_footprint_pcf>100

ORDER BY n_products

An error appeared after trying to use the alias. For the HAVING clause, the new column’s name
doesn’t exist, so it won’t be able to filter the query. Let’s correct the request:

SELECT pe.company, count(product_name) AS n_products, avg(carbon_footprint_pcf) AS

avg_carbon_footprint_pcf

FROM product_emissions AS pe

WHERE industry_group = 'Technology Hardware & Equipment'

GROUP BY pe.company, industry_group

having avg(carbon_footprint_pcf)>100

ORDER BY n_products

This time, the condition worked, and we can visualize the results from the table. We just
learned that column aliases can’t be used in HAVING because this condition is applied before
the SELECT. For this reason, it cannot recognize the fields from the new names.
SQL Order of Execution

This is the order of the commands while writing the query:

SELECT

FROM

WHERE

GROUP BY

HAVING

ORDER BY

But there is a question you need to ask yourself. In what order do SQL commands execute?
As humans, we often take for granted that the computer reads and interprets SQL from top to
down. But the reality is different from what it might look like. This is the right order of
execution:

FROM

WHERE

GROUP BY

HAVING

SELECT

ORDER BY

LIMIT

So, the query processor doesn’t start from SELECT, but it begins by selecting which tables to include,
and SELECT is executed after HAVING. This explains why HAVING doesn’t allow the use of ALIAS,
while ORDER BY doesn’t have problems with it. In addition to this aspect, this order of execution clarifies
the reason why HAVING is used together with GROUP BY to apply conditions on aggregated data,
while WHERE cannot.

02 GROUP by Statements
No ratings yet
02 GROUP by Statements
67 pages
SQL Notes Basic To Advanced (SQL Clauses)
No ratings yet
SQL Notes Basic To Advanced (SQL Clauses)
10 pages
SQL-QUERYING & MANIPULATING DATA USING ORDERBY GROUPBY AND HAVING
No ratings yet
SQL-QUERYING & MANIPULATING DATA USING ORDERBY GROUPBY AND HAVING
9 pages
20761A_09
No ratings yet
20761A_09
21 pages
Aggregation
No ratings yet
Aggregation
8 pages
Lecture 6 New - SQL Data Manipulation Language (Intermediate)
No ratings yet
Lecture 6 New - SQL Data Manipulation Language (Intermediate)
34 pages
BDPA - U1A2 - Guerrero Hernández Nidia Nicolle
No ratings yet
BDPA - U1A2 - Guerrero Hernández Nidia Nicolle
13 pages
17-SQL (GROUP BY & HAVING Clause)
No ratings yet
17-SQL (GROUP BY & HAVING Clause)
16 pages
dbms lab 4
No ratings yet
dbms lab 4
7 pages
Part 4 - Grouping Data and Subqueries
No ratings yet
Part 4 - Grouping Data and Subqueries
12 pages
GROUP BY and HAVING Clause in SQL Article DataCamp
No ratings yet
GROUP BY and HAVING Clause in SQL Article DataCamp
10 pages
Group by Clause
No ratings yet
Group by Clause
19 pages
Lab06 Mysql
No ratings yet
Lab06 Mysql
9 pages
SQL Statements With Aggregation and Filtering
No ratings yet
SQL Statements With Aggregation and Filtering
13 pages
Practical 7
No ratings yet
Practical 7
3 pages
Department of Computer Science Bahria University, Islamabad
No ratings yet
Department of Computer Science Bahria University, Islamabad
4 pages
Worksheet4 Aggregation
No ratings yet
Worksheet4 Aggregation
3 pages
SQL 3 - Group by Clause
No ratings yet
SQL 3 - Group by Clause
30 pages
Dbms Lab 3
No ratings yet
Dbms Lab 3
10 pages
Self-Notes: Data Manipulation Using SQL
No ratings yet
Self-Notes: Data Manipulation Using SQL
4 pages
Grouping and Aggregating Data
No ratings yet
Grouping and Aggregating Data
15 pages
Chapter 4. Intermediate SQL: Objectives
No ratings yet
Chapter 4. Intermediate SQL: Objectives
25 pages
T3_L2_GROUPBY
No ratings yet
T3_L2_GROUPBY
25 pages
Grouping
No ratings yet
Grouping
10 pages
Aggregation: Takes Values in Multiple Rows of Data and Returns One Value
No ratings yet
Aggregation: Takes Values in Multiple Rows of Data and Returns One Value
11 pages
'Jensen': Sum Sum Sum
No ratings yet
'Jensen': Sum Sum Sum
6 pages
Sqltutorialbasiccommands 130310014513 Phpapp01
No ratings yet
Sqltutorialbasiccommands 130310014513 Phpapp01
19 pages
Data Base Lab 6
No ratings yet
Data Base Lab 6
8 pages
Chapter 7 - Querying Using SQL
No ratings yet
Chapter 7 - Querying Using SQL
32 pages
Lab 10
No ratings yet
Lab 10
12 pages
Aggregation
No ratings yet
Aggregation
35 pages
Module 2 Introduction to SQL
No ratings yet
Module 2 Introduction to SQL
22 pages
SQL Notes
100% (1)
SQL Notes
42 pages
Learn SQL_ Aggregate Functions Cheatsheet _ Codecademy
No ratings yet
Learn SQL_ Aggregate Functions Cheatsheet _ Codecademy
3 pages
chp04 05 More SQL
No ratings yet
chp04 05 More SQL
23 pages
Database SQL Aggregate Functions
No ratings yet
Database SQL Aggregate Functions
14 pages
MySQL Activity 3
No ratings yet
MySQL Activity 3
9 pages
8-In-Built Functions, Join and Group by Queries-19-03-2024
No ratings yet
8-In-Built Functions, Join and Group by Queries-19-03-2024
38 pages
4 Group by Clause, Having Clause, Multiple Row (Or Group or Aggregate) Functions
100% (1)
4 Group by Clause, Having Clause, Multiple Row (Or Group or Aggregate) Functions
17 pages
Lab 5 Muhammad Abdullah (1823-2021)
No ratings yet
Lab 5 Muhammad Abdullah (1823-2021)
6 pages
Week+2SQL
No ratings yet
Week+2SQL
7 pages
CS232L-Lab#03
No ratings yet
CS232L-Lab#03
15 pages
SQL
No ratings yet
SQL
14 pages
CNG351 Lecture 10 DML Part 1 (1)
No ratings yet
CNG351 Lecture 10 DML Part 1 (1)
19 pages
SELECT03
No ratings yet
SELECT03
8 pages
Database Query Using SQL
No ratings yet
Database Query Using SQL
23 pages
exp 5 group by,having, orderby
No ratings yet
exp 5 group by,having, orderby
6 pages
Aggregate Functions
No ratings yet
Aggregate Functions
11 pages
Calculating Aggregates - Aggregate Functions Cheatsheet - Codecademy
No ratings yet
Calculating Aggregates - Aggregate Functions Cheatsheet - Codecademy
3 pages
SQL - OrderBy - GroupBy - Having - 01-Feb-23 PDF
No ratings yet
SQL - OrderBy - GroupBy - Having - 01-Feb-23 PDF
7 pages
Lab 7 - (Queries II)
No ratings yet
Lab 7 - (Queries II)
8 pages
Learn SQL_ Aggregate Functions Cheatsheet _ Codecademy
No ratings yet
Learn SQL_ Aggregate Functions Cheatsheet _ Codecademy
2 pages
basic functions of mysql (2)
No ratings yet
basic functions of mysql (2)
39 pages
Group by & Having Clause
No ratings yet
Group by & Having Clause
29 pages
Group and Aggregation Introduction
No ratings yet
Group and Aggregation Introduction
21 pages
Chapter 11
No ratings yet
Chapter 11
35 pages
Lab # 04 Implementation of SQL Functions
No ratings yet
Lab # 04 Implementation of SQL Functions
18 pages
Week11 Relational Algebra & SQL - Aggregation and Grouping Operation
No ratings yet
Week11 Relational Algebra & SQL - Aggregation and Grouping Operation
23 pages
DBMS Lab Manual
From Everand
DBMS Lab Manual
Jitendra Patel
1.5/5 (3)
Excel Techniques
From Everand
Excel Techniques
Online Trainees
2/5 (1)
Weekly Overview Measurement
No ratings yet
Weekly Overview Measurement
5 pages
EE-733 (Solid State Devices) : Physical Foundations
No ratings yet
EE-733 (Solid State Devices) : Physical Foundations
4 pages
Physics Resources v3 Abridged
50% (2)
Physics Resources v3 Abridged
9 pages
Experiment 1: MPLAB and Instruction Set Analysis 1: Objectives
No ratings yet
Experiment 1: MPLAB and Instruction Set Analysis 1: Objectives
20 pages
Transformers Can Do Bayesian Inference
No ratings yet
Transformers Can Do Bayesian Inference
23 pages
Instant Download Empirical Asset Pricing Models Jau-Lian Jeng PDF All Chapters
100% (3)
Instant Download Empirical Asset Pricing Models Jau-Lian Jeng PDF All Chapters
52 pages
Music Philosophy Paper
No ratings yet
Music Philosophy Paper
6 pages
Chapter 2
No ratings yet
Chapter 2
60 pages
MID TERM,CLASS 11, ECONOMICS,2023-24
No ratings yet
MID TERM,CLASS 11, ECONOMICS,2023-24
5 pages
NLC Attendance - Elementary Class Week 3
No ratings yet
NLC Attendance - Elementary Class Week 3
16 pages
CIA 3 Coaching Material
No ratings yet
CIA 3 Coaching Material
9 pages
R Programming
No ratings yet
R Programming
77 pages
KS Maths Testfoundation Non-Calculator
No ratings yet
KS Maths Testfoundation Non-Calculator
6 pages
Module 1 - Number System
No ratings yet
Module 1 - Number System
20 pages
PHY 301 Classical Mechanics Shaheen
No ratings yet
PHY 301 Classical Mechanics Shaheen
2 pages
Math MDL TMS320
No ratings yet
Math MDL TMS320
66 pages
6th Sem Open Elective III Syllabus - Final
No ratings yet
6th Sem Open Elective III Syllabus - Final
52 pages
Complementary&suplementary Angles LP
No ratings yet
Complementary&suplementary Angles LP
4 pages
GATE Progress Tracker DA
No ratings yet
GATE Progress Tracker DA
8 pages
Etabs Tutorial
100% (4)
Etabs Tutorial
27 pages
Digital Systems Testing and Testable Design Abramovici 1990
No ratings yet
Digital Systems Testing and Testable Design Abramovici 1990
659 pages
Formative Teaching Methods
No ratings yet
Formative Teaching Methods
18 pages
Perfeksionisme, Harga Diri, Dan Kecenderungan Depresi Pada Remaja Akhir
No ratings yet
Perfeksionisme, Harga Diri, Dan Kecenderungan Depresi Pada Remaja Akhir
14 pages
Work Energy Power
No ratings yet
Work Energy Power
47 pages
Systems Engineering For Ship Concept Design
No ratings yet
Systems Engineering For Ship Concept Design
9 pages
IYPT Problemas 2020
No ratings yet
IYPT Problemas 2020
1 page
Como Hacer Un Micromouse
No ratings yet
Como Hacer Un Micromouse
21 pages
RRB PO Prelims Memory Based Paper Held On 05 August 2023 Shift 1
No ratings yet
RRB PO Prelims Memory Based Paper Held On 05 August 2023 Shift 1
23 pages
[Number Theory] Lecture 03 - Induction and Pigeonhole Principles
No ratings yet
[Number Theory] Lecture 03 - Induction and Pigeonhole Principles
9 pages
Math 7 Q1 Week 9
No ratings yet
Math 7 Q1 Week 9
3 pages

GroupByHavinginSQL

Uploaded by

GroupByHavinginSQL

Uploaded by

Use GROUP BY and HAVING in SQL

Using SQL GROUP BY

SELECT * FROM companies

SELECT country, COUNT(*) AS n_companies

ORDER BY n_companies DESC

ORDER BY total_gain DESC

The Limitations of WHERE

WHERE SUM(total) > 40000

ORDER BY total_gain DESC

Using SQL HAVING

SQL HAVING Example 1

HAVING SUM(total) > 40000

ORDER BY total_gain DESC

SQL HAVING Example 2

SELECT pe.company, count(product_name) AS n_products, avg(carbon_footprint_pcf) AS

WHERE industry_group = 'Technology Hardware & Equipment'

SELECT pe.company, count(product_name) AS n_products, avg(carbon_footprint_pcf) AS

WHERE industry_group = 'Technology Hardware & Equipment'

GROUP BY pe.company, industry_group

This is the order of the commands while writing the query:

You might also like