100% found this document useful (1 vote)

348 views15 pages

Window Functions

The document discusses window functions in SQL Server. It begins by explaining that window functions work on a group of rows and return an aggregated value for each row, unlike aggregate functions which return a single value for the entire table. It then describes the three types of window functions: aggregate, ranking, and value. Examples are provided to demonstrate aggregate window functions like SUM(), AVG(), MIN(), MAX(), and COUNT(). Ranking window functions like RANK(), DENSE_RANK(), ROW_NUMBER(), and NTILE() are explained. The syntax and usage of window functions is also covered using examples.

Uploaded by

chenna kesava

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

100% found this document useful (1 vote)

348 views15 pages

Window Functions

Uploaded by

chenna kesava

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 15

SQL Server Window Functions

We are all well-known for the regular aggregate function that performs calculations on
the table and works with a GROUP BY clause. However, only a small percentage of SQL
users use Window functions, and these are functions that work on a group of rows and
display a single aggregated value for every row. This article will discuss in detail the
window functions in SQL Server.

What is window functions?

Window functions are used to perform a calculation on an aggregate value based on
a set of rows and return multiple rows for each group. The window word represents the
group of rows on which the function will be operated. This function performs a
calculation in the same way that the aggregate functions would perform. Unlike
aggregate functions that operate on the entire table, Window functions do not give a
result to be combined into a single row. It means that window functions work on a
group of rows and return a total value for each row. As a result, each row retains its
distinct identity.

The below pictorial representations explain the difference of aggregate function and
window function in SQL Server:

Window Functions Types

SQL Server categorizes the window functions into mainly three types:

1. Aggregate Window Functions: These functions operated on multiple rows and

Examples of such functions are SUM(), MAX(), MIN(), AVG(), COUNT(), etc.
2. Ranking Window Functions: These functions ranks each row of a partition in a
table. Example of such functions are RANK(), DENSE_RANK(), ROW_NUMBER(),
NTILE(), etc.
3. Value Window Functions: These functions are locally represented by a power
series. Example of such functions are LAG(), LEAD(), FIRST_VALUE(), LAST_VALUE(),
etc.

Syntax

The following are the basic syntax for using a window function:

1. window_function_name([ALL] expression)
2. OVER (
3.     [partition_defintion]
4.     [order_definition]
5. )

Parameter Explanation

Let us understand the arguments used in the above syntax:

window_function: It indicates the name of your window function.

ALL: It is an optional keyword that is used to count all values along with duplicates. We
cannot use the DISTINCT keyword in window functions.

Expression: It is the name of the column or expression on which window functions is

operated. In other words, it is the column name for which we calculate an aggregated
value.

OVER: It specifies the window clauses for aggregate functions. It mainly contains two
expressions partition by and order by, and it always has an opening and closing
parenthesis even there is no expression.

PARTITION BY: This clause divides the rows into partitions, and then a window function
is operated on each partition. Here we need to provide columns for the partition after
the PARTITION BY clause. If we want to specify more than one column, we must
separate them by a comma operator. SQL Server will group the entire table when this
clause is not specified, and values will be aggregated accordingly.
ORDER BY: It is used to specify the order of rows within each partition. When this clause
is not defined, SQL Server will use the ORDER BY for the entire table.

Example
Let us understand the concept of window function through an example. First, we will
create a table named "product_sales" using the following statement:

1. CREATE TABLE Product_Sales(
2.     Emp_Name VARCHAR(45) NOT NULL,
3.     Year INT NOT NULL,
4.     Country VARCHAR(45) NOT NULL,
5.     Prod_name VARCHAR(45) NOT NULL,
6.     Sales DECIMAL(12,2) NOT NULL,
7.     PRIMARY KEY(Emp_Name, Year)
8. );

Next, we will fill records into this table using the INSERT statement as below:

1. INSERT INTO Product_Sales(Emp_Name, Year, Country, Prod_name, Sales)
2. VALUES('Mike Johnson', 2017, 'Britain', 'Laptop', 10000),
3. ('Mike Johnson', 2018, 'Britain', 'Laptop', 15000),
4. ('Mike Johnson', 2019, 'Britain', 'TV', 20000),
5. ('Mary Greenspan', 2017, 'Australia', 'Computer', 15000),
6. ('Mary Greenspan', 2018, 'Australia', 'Computer', 10000),
7. ('Mary Greenspan', 2019, 'Australia', 'TV', 20000),
8. ('Nancy Jackson', 2017, 'Canada', 'Mobile', 20000),
9. ('Nancy Jackson', 2018, 'Canada', 'Calculator', 1500),
10. ('Nancy Jackson', 2019, 'Canada', 'Mobile', 25000);

We can verify the inserted records using the SELECT statement. We will see the below
output:
Now we will demonstrate all window functions using this table.

Aggregate Window Function

SUM()

It is an aggregate function that performs the addition of the specified field for a
specified group or the entire table when we have not specified any group. Here we will
examine this function in both ways, either regular aggregate function or window
aggregate function.

The below statement explains the regular aggregate function that adds the order
amount for each country:

1. SELECT Country, SUM(Sales) AS total_amount
2. FROM Product_Sales GROUP BY Country;

Executing the statement, we see that this function groups multiple rows into a single
output row. It causes individual rows to lose their identity.

The below statement explains the window aggregate function that maintains the row
identity. It also displays the aggregated value for each row.

1. SELECT Emp_Name, Year, Country, Prod_name, Sales, SUM(Sales)
2. OVER(PARTITION BY Country) as grand_total
3. FROM Product_Sales;
Executing the query will return the below output. Here we can see that it aggregates the
data for each country and displays the sum of total sales for each of them. It also inserts
another column for the total sales as grand_total so that each row retains its identity.

AVG()

This function returns the average value of the specified column. It works in exactly the
same way with a window function.

The below example will produce the average sales for each country and each year.
Here we have specified more than one average by specifying multiple fields in the
partition list.

1. SELECT Emp_Name, Year, Country, Prod_name, Sales, AVG(Sales)
2. OVER(PARTITION BY Country, YEAR(Year)) as avg_sales_amount
3. FROM Product_Sales;

Executing the statement will return the below output where we can see that on average,
we have received a sale amount of 15000 for Australia country.
MIN()

This function returns the minimum value for a specified group. When we have not
defined the group, it will return the minimum value for the entire table.

The below example will return the smallest sales amount for each country:

1. SELECT Emp_Name, Year, Country, Prod_name, Sales, MIN(Sales)
2. OVER(PARTITION BY Country) AS minimum_sales_amount
3. FROM Product_Sales;

Executing the query will produce the below output where we can see the minimum sales
amount for each country:

MAX()

This function returns the maximum value for a specified group. When we have not
defined the group, it will return the maximum value for the entire table.

The below example will return the highest sales amount for each country:

1. SELECT Emp_Name, Year, Country, Prod_name, Sales, MAX(Sales)
2. OVER(PARTITION BY Country) AS minimum_sales_amount
3. FROM Product_Sales;

Executing the query will produce the below output where we can see the highest sales
amount for each country:
COUNT()

The count function will return the total number of rows or records present in the table
or group. The regular aggregate function uses the DISTINCT keyword not to count the
duplicates rows. But the window count function does not support this keyword. If we
use this keyword with the window function, SQL Server throws an error.

Suppose we want to see how many employees order the product in the year 2018. We
cannot directly count all employees as the same employee has ordered multiple
products in the same year.

For example,

COUNT(emp_name) will produce the incorrect result because it can count duplicates

also. COUNT(DISTINCT emp_name) will produce the correct result because it always
counts each employee only once.

This statement is executed successfully as it is a regular aggregate function:

1. SELECT Country, COUNT(DISTINCT Emp_name) As number_of_employees
2. FROM Product_Sales
3. GROUP BY Country;

This statement will produce an error as it is a window aggregate function:

1. SELECT Country, COUNT(DISTINCT Emp_name)
2. OVER(PARTITION BY Country) As number_of_employees
3. FROM Product_Sales;

Here is the error:

The below statement will return the total product sales in each country using the
window count function:

1. SELECT Emp_Name, Year, Country, Prod_name, Sales, COUNT(Prod_name)
2. OVER(PARTITION BY Country) As total_product
3. FROM Product_Sales;

Here is the result:

Ranking Window Functions

The RANKING function ranks the values in a defined column and categorizes them
based on their rank. The following are the ranking functions supported in SQL Server:

RANK(), DENSE_RANK(), ROW_NUMBER(), and NTILE(). Let us discuss each function in

detail based on the table named "rank_demo" that contains the below data:

RANK()
It's used to generate a unique rank for each row in a table based on the specified
value. If this function gets the two records with the same value, it will assign the same
rank to both records and skip the next ranking. For example, if rank 2 has two identical
values, the rank function provides the same rank 2 to both records and skip the next
rank 3. Now, the next rank will be assigned with rank 4.

The below statement explains the RANK() function by assigning numbering to

each row based on the city:

1. SELECT first_name, last_name, city,
2. RANK () OVER (ORDER BY city) AS Rank_No
3. FROM rank_demo;

This query returns the below output where we see that the same rank (2) is assigned to
two identical records having equal city names. The next number in the ranking will be its
previous rank plus a number of duplicate numbers, i.e. 4.

DENSE_RANK()

It works the same as the RANK() function except that it does not skip any rank. It always
assigns rank in consecutive order. It means that when two records are found equal, this
function will assign the same rank to both records and the next rank being the next
sequential number.

The below query explains this function practically to assign a rank number for
each row based on the city:

1. SELECT first_name, last_name, city,
2. DENSE_RANK() OVER (ORDER BY city) AS Rank_No
3. FROM rank_demo;
This query will return the below output where we can see that the duplicate values have
the same rank, and the next rank is given to the next record without skipping a rank
value.

ROW_NUMBER()

It is used to assign a unique sequential number to each record within the partition. It
always starts with one and increases by one until all the records in a partition are not
reached. It will be reset when one partition ranking is completed and goes to the next
partition.

Example of ROW_NUMBER() without PARTITION BY

The below query assigns the number to each row based on the city:

1. SELECT first_name, last_name, city,
2. ROW_NUMBER() OVER (ORDER BY city) AS Rank_No
3. FROM rank_demo;

It returns the following output:

Example of ROW_NUMBER() with PARTITION BY

The below statement partition the table based on the city, which means the row number
is reset for each city and restarts at 1 again. It is also ordering the records on the basis of
the first_name column.

1. SELECT first_name, last_name, city,
2. ROW_NUMBER() OVER (PARTITION BY city ORDER BY first_name) AS Rank_No
3. FROM rank_demo;

It returns the below output:

NTILE()

This window function distributes rows into a pre-defined number (N) of

approximately equal groups. Each row group is assigned a rank depending on the
defined condition, and the numbering begins with the first group. It enables us to
determine which percentile (or quartile, or other subdivision) a particular row belongs
to. It implies that if we have 20 records and want to divide them into five quartiles based
on a specific value field, we can easily do so and see how many rows are in each
quartile.

The following statement will divide the table into 3 quartiles based on the city
column:

1. SELECT first_name, last_name, city,
2. NTILE(3) OVER ( ORDER BY city) AS Rank_No
3. FROM rank_demo;

Executing the statement will return the below output where we see each group have
three quartiles:
PERCENT_RANK()

This function evaluates a percentile rank (relative rank) for rows within a partition of a
result set. It gives the result between 0 and 1. If it finds the NULL value, it treats them as
the lowest possible value.

This function evaluates the rank with the help of the below formula for each record:

1. (rank-1) / ( total_rows-1)

Here, rank indicates the numbering of each row returns by rank() function, and
total_rows are the total number of rows found in the partition.

The following example will calculate the rank value for each row order by country
name:

1. SELECT Year, Prod_name, Country, Sales,
2. PERCENT_RANK() OVER(PARTITION BY Year ORDER BY Country) AS my_rank
3. FROM Product_Sales;

Executing the statement will return the expected output:

Value Window Functions
SQL Server used this function to get the first, last, previous, and next values in a table. It
mainly contains these functions: LAG(), LEAD(), FIRST_VALUE(), and LAST_VALUE().

LEAD and LAG Function

The LEAD and LAG functions are used to get the preceding and succeeding values of
specified rows from the current row within its partition.

Let us take the above product_sales table to demonstrate these functions. The

following example returns the sales and next sales detail of each employee. It first
split the result set based on the year and then sorted each partition using the country
column. After that, we have to use the LEAD() function on each partition to get the next
sales detail.

1. SELECT Year, Prod_name, Country, Sales,
2. LEAD(Sales,1) OVER (PARTITION BY Year ORDER BY Country) AS Next_Sale
3. FROM Product_Sales;

Executing the statement will display the expected result:

The following example returns the sales and previous sales detail of each
employee. It first split the result set based on the year and then sorted each partition
using the country column. After that, we have to use the LAD() function on each
partition to get the previous sales detail.

1. SELECT Year, Prod_name, Country, Sales,
2. LAG(Sales, 1) OVER (PARTITION BY Year ORDER BY Country) AS Previous_Sale
3. FROM Product_Sales;
Executing the statement will display the expected result:

FIRST_VALUE() and LAST_VALUE()

These functions are used to find the first and last record in the table or a partition if the
PARTITION BY clause is specified. Here we should note that these functions are
mandatory to use the ORDER BY clause. Let us see how these functions work in SQL
Server through practical examples.

The following example will find the first and last sales of each country in a given
table:

1. SELECT Year, Prod_name, Country, Sales,
2. FIRST_VALUE(Sales) OVER(PARTITION BY Country ORDER BY Country) first_sale,
3. LAST_VALUE(Sales) OVER(PARTITION BY Country ORDER BY Country) last_sale

4. FROM Product_Sales;

Executing the query will display the expected result as shown below:
Conclusion

This article will explain all window functions used in the SQL Server that work on a set of
rows and return a single aggregated value for every row.

Course Description, CSE Dept, National University, Bangladesh
30% (10)
Course Description, CSE Dept, National University, Bangladesh
30 pages
Huawei HCIA-AI V3.0 Exam - H13-311 - V3.0 Free Exam Questions
100% (2)
Huawei HCIA-AI V3.0 Exam - H13-311 - V3.0 Free Exam Questions
2 pages
24 StoredProcs
No ratings yet
24 StoredProcs
6 pages
Leetcode SQL QnA 1693149052
No ratings yet
Leetcode SQL QnA 1693149052
60 pages
Window Functions in SQL (Slides)
No ratings yet
Window Functions in SQL (Slides)
24 pages
Snowflake Mini Project
No ratings yet
Snowflake Mini Project
7 pages
SQL Vs PySpark 1678871778
No ratings yet
SQL Vs PySpark 1678871778
8 pages
Best Practices For Bucketing in Spark SQL - by David Vrba - Towards Data Science
No ratings yet
Best Practices For Bucketing in Spark SQL - by David Vrba - Towards Data Science
27 pages
53 SQL Questions-Answers
No ratings yet
53 SQL Questions-Answers
89 pages
Select As From: Firstname (First Name) Employeedetail
100% (1)
Select As From: Firstname (First Name) Employeedetail
7 pages
ITEC 1010 Final Exam Review
No ratings yet
ITEC 1010 Final Exam Review
6 pages
Windowing Functions
No ratings yet
Windowing Functions
54 pages
SQL Queries Gcreddy
No ratings yet
SQL Queries Gcreddy
11 pages
SQL Interview Questions For A Data Engineer
No ratings yet
SQL Interview Questions For A Data Engineer
11 pages
Pyspark Interview 1738079940
No ratings yet
Pyspark Interview 1738079940
6 pages
SQL Statement Tunning
No ratings yet
SQL Statement Tunning
19 pages
Snowflake
No ratings yet
Snowflake
11 pages
Top Pyspark InterviewQuestions
No ratings yet
Top Pyspark InterviewQuestions
21 pages
21.streams in Snowflake
No ratings yet
21.streams in Snowflake
8 pages
Window Function in Pyspark
100% (1)
Window Function in Pyspark
8 pages
Azure Data Engineering Interview Q & A - Topicwise
100% (1)
Azure Data Engineering Interview Q & A - Topicwise
57 pages
External Tables
No ratings yet
External Tables
105 pages
DBT Flow
No ratings yet
DBT Flow
15 pages
Big Query Optimization Document
No ratings yet
Big Query Optimization Document
10 pages
Analaytical Function-Pravin
No ratings yet
Analaytical Function-Pravin
24 pages
Create Temporary, Permanent & Transient Table
No ratings yet
Create Temporary, Permanent & Transient Table
2 pages
Interview Questions
No ratings yet
Interview Questions
2 pages
Teradata SQL Performance Tuning Case Study Part II
0% (1)
Teradata SQL Performance Tuning Case Study Part II
37 pages
Spark With Python Notes
No ratings yet
Spark With Python Notes
206 pages
Incremental Loading For Dimension Table
100% (1)
Incremental Loading For Dimension Table
3 pages
DWH BASICS Interview Questions
No ratings yet
DWH BASICS Interview Questions
29 pages
Spark Interview Q&A
No ratings yet
Spark Interview Q&A
31 pages
Join Stage
No ratings yet
Join Stage
14 pages
Hive Cheat Sheet - Quick Reference
No ratings yet
Hive Cheat Sheet - Quick Reference
19 pages
Star and Snowflake Schemas
No ratings yet
Star and Snowflake Schemas
4 pages
Snowflake Prctice1
No ratings yet
Snowflake Prctice1
51 pages
17.views and MaterializedViews
No ratings yet
17.views and MaterializedViews
13 pages
SQL Notes by Krishna Reddy
No ratings yet
SQL Notes by Krishna Reddy
49 pages
SQL For Everyone (Definitive Guide)
No ratings yet
SQL For Everyone (Definitive Guide)
10 pages
Tuning SQL Queries - Oracle
100% (1)
Tuning SQL Queries - Oracle
27 pages
DBMS SQL Practice Questions Shivani
No ratings yet
DBMS SQL Practice Questions Shivani
10 pages
Python Pandas Cheatsheety
No ratings yet
Python Pandas Cheatsheety
7 pages
Pyspark Practice - Databricks
No ratings yet
Pyspark Practice - Databricks
66 pages
PySpark VS SQL Interview Questions
100% (1)
PySpark VS SQL Interview Questions
16 pages
Sssis Interview Questins
No ratings yet
Sssis Interview Questins
7 pages
PYTHON Notes by Devaraj
100% (1)
PYTHON Notes by Devaraj
40 pages
Interview Series ADF Part-1
No ratings yet
Interview Series ADF Part-1
17 pages
Spark SQL Optimization
No ratings yet
Spark SQL Optimization
29 pages
SQL Sub Queries
75% (4)
SQL Sub Queries
5 pages
PostgreSQL Cheat Sheet - Hackr - Io
No ratings yet
PostgreSQL Cheat Sheet - Hackr - Io
90 pages
OLTP
No ratings yet
OLTP
12 pages
Oracle Analytical Functions 1
No ratings yet
Oracle Analytical Functions 1
18 pages
Data Warehousing Interview Questions - by Shobha Bhagwat - Medium
No ratings yet
Data Warehousing Interview Questions - by Shobha Bhagwat - Medium
9 pages
Pyspark Study Material
No ratings yet
Pyspark Study Material
5 pages
Pyspark - SQL Module
No ratings yet
Pyspark - SQL Module
132 pages
Master Pyspark Zero To Hero 1738689679
No ratings yet
Master Pyspark Zero To Hero 1738689679
102 pages
Snowflake Document
No ratings yet
Snowflake Document
21 pages
VBA MACRO Course Training Syllabus PDF
No ratings yet
VBA MACRO Course Training Syllabus PDF
1 page
Windows Function
No ratings yet
Windows Function
27 pages
Window Function SQL
No ratings yet
Window Function SQL
2 pages
Window Functions and Syntax (Slides)
No ratings yet
Window Functions and Syntax (Slides)
14 pages
Window Functions
No ratings yet
Window Functions
30 pages
SQL (Window Function)
No ratings yet
SQL (Window Function)
6 pages
Twinmotion
No ratings yet
Twinmotion
25 pages
DISC112 - Computer and Problem Solving - Mahira Ilyas Spring 2020
No ratings yet
DISC112 - Computer and Problem Solving - Mahira Ilyas Spring 2020
5 pages
Archana K Raghunath 12yrs Management Role
No ratings yet
Archana K Raghunath 12yrs Management Role
4 pages
Chapter 6. Decision and Control Statements: 6.1 If Statement
No ratings yet
Chapter 6. Decision and Control Statements: 6.1 If Statement
12 pages
4559 (7) System Analysis FMECA
No ratings yet
4559 (7) System Analysis FMECA
59 pages
Introduccion A La Programacion
No ratings yet
Introduccion A La Programacion
10 pages
1.3-Comments, Identifiers and Keywords
No ratings yet
1.3-Comments, Identifiers and Keywords
6 pages
Cinema 4D Shortcut Keys
No ratings yet
Cinema 4D Shortcut Keys
12 pages
NoSQL Paper 1
No ratings yet
NoSQL Paper 1
14 pages
Luke Richardson Resume
No ratings yet
Luke Richardson Resume
2 pages
Rabbi CV
No ratings yet
Rabbi CV
2 pages
85XX+ User Manual PDF
No ratings yet
85XX+ User Manual PDF
109 pages
Mobility Whitepaper Mobile Application Testing 1012 1
No ratings yet
Mobility Whitepaper Mobile Application Testing 1012 1
13 pages
Chapter 2.1 - Review On Exponents
No ratings yet
Chapter 2.1 - Review On Exponents
2 pages
Ayu 2024 Resume Template
No ratings yet
Ayu 2024 Resume Template
2 pages
PDEng Thesis Duriji Dugarte Manoukian
No ratings yet
PDEng Thesis Duriji Dugarte Manoukian
89 pages
Serial Protocol Specification-1
No ratings yet
Serial Protocol Specification-1
35 pages
TLE Module Q1
No ratings yet
TLE Module Q1
69 pages
M.E.VLSI Design and Embedded Systems
No ratings yet
M.E.VLSI Design and Embedded Systems
58 pages
H10 LibraryReference PDF
No ratings yet
H10 LibraryReference PDF
5 pages
Report Abap Alv
No ratings yet
Report Abap Alv
10 pages
CV - Afriansyah Putra Nasution Updated
100% (1)
CV - Afriansyah Putra Nasution Updated
3 pages
Fiber Connectivity: Intrafacility Fiber Cable (IFC) Assemblies
No ratings yet
Fiber Connectivity: Intrafacility Fiber Cable (IFC) Assemblies
2 pages
Data Privacy and Security Best Practices
No ratings yet
Data Privacy and Security Best Practices
2 pages
Plug in Gait
No ratings yet
Plug in Gait
70 pages
ABB Formula Air
No ratings yet
ABB Formula Air
98 pages
Mac Keyboard SHORTCATS
No ratings yet
Mac Keyboard SHORTCATS
8 pages

Window Functions

Uploaded by

Window Functions

Uploaded by

SQL Server Window Functions

What is window functions?

Window Functions Types

1. Aggregate Window Functions: These functions operated on multiple rows and

Let us understand the arguments used in the above syntax:

window_function: It indicates the name of your window function.

Expression: It is the name of the column or expression on which window functions is

Aggregate Window Function

COUNT(emp_name) will produce the incorrect result because it can count duplicates

This statement is executed successfully as it is a regular aggregate function:

This statement will produce an error as it is a window aggregate function:

Here is the error:

Here is the result:

Ranking Window Functions

RANK(), DENSE_RANK(), ROW_NUMBER(), and NTILE(). Let us discuss each function in

The below statement explains the RANK() function by assigning numbering to

Example of ROW_NUMBER() without PARTITION BY

It returns the following output:

Example of ROW_NUMBER() with PARTITION BY

It returns the below output:

This window function distributes rows into a pre-defined number (N) of

Executing the statement will return the expected output:

LEAD and LAG Function

Let us take the above product_sales table to demonstrate these functions. The

Executing the statement will display the expected result:

FIRST_VALUE() and LAST_VALUE()

You might also like