SQL - Statistical Functions
Last Updated: 19 Dec, 2024
SQL statistical functions are essential tools for extracting meaningful insights from databases. These functions enable users to perform statistical calculations on numeric data. Whether you need averages, sums, counts, or measures of variability, they support efficient data analysis within the SQL environment.
In this article, we’ll explore the most commonly used SQL statistical functions such as AVG(), SUM(), COUNT(), MIN(), MAX(), STDDEV(), VAR(), and more. We will also provide practical examples to demonstrate their usage.
What are SQL Statistical Functions?
Statistics is a branch of mathematics that deals with data collection, analysis, interpretation, presentation, and organization. It involves the use of mathematical techniques to extract meaningful information from data. Statistics is widely used in various fields such as business, economics, social science, medicine, and engineering.
A statistical function is a mathematical function that processes a dataset and returns a summary value that describes it, for example the mean, sum, minimum, maximum, or standard deviation.
Statistical Functions in SQL
Here are some common statistical functions in SQL:
Function | Output |
---|---|
AVG() | Calculates the average value of a numeric column. |
SUM() | Calculates the sum of values in a numeric column. |
COUNT() | Counts the number of rows in a result set or the number of non-null values in a column. |
MIN() | Returns the minimum value in a column. |
MAX() | Returns the maximum value in a column. |
VAR() / VARIANCE() | Calculates the variance of a numeric column. Whether this is the population or the sample variance depends on the database: MySQL's VARIANCE() is the population variance, while SQL Server's VAR() and PostgreSQL's VARIANCE() are the sample variance. |
STDDEV() / STDDEV_POP() | Calculates the standard deviation of a numeric column. STDDEV_POP() is always the population standard deviation; STDDEV() is the population value in MySQL and the sample value in PostgreSQL. |
CORR() | Calculates the correlation coefficient between two numeric columns. |
COVAR_POP() | Calculates the population covariance between two numeric columns. |
PERCENTILE_CONT() | Calculates a specified percentile value for a numeric column. |
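The five basic aggregates in the table can be tried without a database server. The following is a minimal sketch that uses Python's built-in sqlite3 module as a stand-in database; the rows are made up for illustration, since the article's table contents are not reproduced here.

```python
import sqlite3

# Hypothetical sample rows (not the article's actual data).
rows = [(1, "Asha", 82), (2, "Ben", 75), (3, "Chen", 91), (4, "Dina", 68)]

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE studentDetails (studentID INTEGER, name TEXT, marks INTEGER)"
)
conn.executemany("INSERT INTO studentDetails VALUES (?, ?, ?)", rows)

# A single SELECT can return several aggregates at once.
summary = conn.execute(
    "SELECT AVG(marks), SUM(marks), COUNT(*), MIN(marks), MAX(marks) "
    "FROM studentDetails"
).fetchone()

print(summary)  # (average, total, row count, lowest, highest)
```

SQLite only ships the basic aggregates; the variance, standard deviation, correlation, covariance, and percentile functions are not built in, so a server such as PostgreSQL, Oracle, MySQL, or SQL Server is needed to run those queries directly.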
Statistical Functions With Examples
We have four tables in our database: studentDetails, employees, sales_data, and financial_data.
[Image: studentDetails table]
[Image: employees table]
[Image: sales_data table]
[Image: financial_data table]
1. AVG() Function
Calculates the average (arithmetic mean) of the values in a numeric column. NULL values are ignored.
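One detail worth seeing in practice is how AVG() treats missing values. The sketch below, again using Python's sqlite3 module with made-up rows, suggests that the average is the sum divided by the number of non-NULL values, not by the number of rows.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE studentDetails (studentID INTEGER, marks INTEGER)")
# Hypothetical rows; student 3 has no mark recorded (NULL).
conn.executemany(
    "INSERT INTO studentDetails VALUES (?, ?)",
    [(1, 80), (2, 90), (3, None), (4, 70)],
)

avg_marks, sum_marks, non_null, all_rows = conn.execute(
    "SELECT AVG(marks), SUM(marks), COUNT(marks), COUNT(*) FROM studentDetails"
).fetchone()

# AVG divides by the 3 non-NULL marks, not by the 4 rows.
print(avg_marks)             # average reported by the database
print(sum_marks / non_null)  # same value, computed by hand
print(sum_marks / all_rows)  # smaller value you would get if NULL counted as a row
```

If missing marks should count as zero, wrapping the column as AVG(COALESCE(marks, 0)) is one common way to express that.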
Syntax:
SELECT AVG(column_name) FROM table_name;
Example Query:
SELECT AVG(marks) AS average_marks FROM studentDetails;
Output:
[Image: AVG_MARKS]
2. SUM() Function
Calculates the total of the values in a numeric column. NULL values are ignored.
Syntax:
SELECT SUM(column_name) FROM table_name;
Example Query:
SELECT SUM(marks) AS total_marks FROM studentDetails;
Output:
[Image: Sum of marks]
3. COUNT() Function
Counts rows. COUNT(*) returns the number of rows in the result set, while COUNT(column_name) returns the number of non-null values in that column.
Syntax:
SELECT COUNT(*) FROM table_name;
SELECT COUNT(column_name) FROM table_name;
Example Query:
SELECT COUNT(studentID) AS total_students FROM studentDetails;
Output:
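[Image: Count of students]
Example Query:
SELECT COUNT(*) FROM studentDetails;
COUNT(*) returns the number of rows in the table, including rows that contain NULL values. Add a WHERE clause to count only the rows that meet a specified condition.
Output:
[Image: count all rows]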
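The different forms of COUNT() can be compared side by side. This is a small sketch using Python's sqlite3 module; the rows and the threshold of 85 are illustrative assumptions, not values from the article.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE studentDetails (studentID INTEGER, marks INTEGER)")
# Hypothetical rows: one NULL mark and one repeated mark.
conn.executemany(
    "INSERT INTO studentDetails VALUES (?, ?)",
    [(1, 80), (2, 90), (3, None), (4, 90)],
)

count_rows, count_marks, distinct_marks = conn.execute(
    "SELECT COUNT(*), COUNT(marks), COUNT(DISTINCT marks) FROM studentDetails"
).fetchone()

# A WHERE clause restricts which rows are counted; the NULL mark never matches.
high_scorers = conn.execute(
    "SELECT COUNT(*) FROM studentDetails WHERE marks >= 85"
).fetchone()[0]

print(count_rows, count_marks, distinct_marks, high_scorers)
```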
4. MAX() Function
Returns the highest value in a column. NULL values are ignored.
Syntax:
SELECT MAX(column_name) FROM table_name;
Example Query:
SELECT MAX(marks) AS highest_marks FROM studentDetails;
Output:
[Image: Maximum marks]
5. MIN() Function
Returns the lowest value in a column. NULL values are ignored.
Syntax:
SELECT MIN(column_name) FROM table_name;
Example Query:
SELECT MIN(marks) AS lowest_marks FROM studentDetails;
Output:
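[Image: Minimum marks]
6. VAR() / VARIANCE() Function
Calculates the variance of a numeric column. The function name and its meaning vary by database: SQL Server uses VAR() for the sample variance and VARP() for the population variance; MySQL's VARIANCE() returns the population variance, while PostgreSQL's VARIANCE() returns the sample variance. Where available, VAR_POP() and VAR_SAMP() state the intent unambiguously.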
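The difference between population and sample variance is only the divisor. The following sketch computes both by hand in plain Python on made-up marks, and cross-checks them against the standard library's statistics module; it illustrates the arithmetic, not any particular database's implementation.

```python
import statistics

marks = [82, 75, 91, 68]  # hypothetical values, not the article's data

mean = sum(marks) / len(marks)
squared_deviations = sum((m - mean) ** 2 for m in marks)

# Population variance divides by n (what VAR_POP computes).
population_variance = squared_deviations / len(marks)
# Sample variance divides by n - 1 (what VAR_SAMP computes).
sample_variance = squared_deviations / (len(marks) - 1)

print(population_variance, statistics.pvariance(marks))
print(sample_variance, statistics.variance(marks))
```

On small tables the two results differ noticeably, so it is worth checking which one a given database returns for VARIANCE() or VAR().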
Syntax:
SELECT VAR(column_name) FROM table_name;
Example Query:
SELECT VARIANCE(marks) AS variance_marks FROM studentDetails;
Output:
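[Image: Variance marks]
7. STDDEV() / STDDEV_POP() Function
Calculates the standard deviation of a numeric column, which is the square root of the variance. STDDEV_POP() always returns the population standard deviation. STDDEV() depends on the database: it is the population value in MySQL and the sample value in PostgreSQL, where STDDEV_SAMP() is the explicit name.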
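SQLite has no built-in standard deviation, which makes it a convenient place to see what such an aggregate does internally. The sketch below registers a user-defined aggregate through sqlite3's create_aggregate; the class name StdDevPop and the sample rows are assumptions made for this illustration.

```python
import math
import sqlite3


class StdDevPop:
    """User-defined aggregate: population standard deviation."""

    def __init__(self):
        self.values = []

    def step(self, value):
        # Called once per row; skip NULLs as built-in aggregates do.
        if value is not None:
            self.values.append(value)

    def finalize(self):
        # Called once at the end; return NULL for an empty group.
        if not self.values:
            return None
        mean = sum(self.values) / len(self.values)
        variance = sum((v - mean) ** 2 for v in self.values) / len(self.values)
        return math.sqrt(variance)


conn = sqlite3.connect(":memory:")
conn.create_aggregate("STDDEV_POP", 1, StdDevPop)
conn.execute("CREATE TABLE studentDetails (studentID INTEGER, marks INTEGER)")
conn.executemany(
    "INSERT INTO studentDetails VALUES (?, ?)",
    [(1, 82), (2, 75), (3, 91), (4, 68)],  # hypothetical rows
)

stddev_marks = conn.execute(
    "SELECT STDDEV_POP(marks) AS stddev_marks FROM studentDetails"
).fetchone()[0]
print(stddev_marks)
```

Changing the divisor in finalize from len(self.values) to len(self.values) - 1 would turn this into the sample standard deviation.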
Syntax:
SELECT STDDEV(column_name) FROM table_name;
Example Query:
SELECT STDDEV(marks) AS stddev_marks FROM studentDetails;
Output:
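[Image: Standard deviation for marks]
8. PERCENTILE_CONT() Function
Calculates a specified percentile value for a numeric column, interpolating between adjacent values when the percentile falls between two rows. A fraction of 0.5 gives the median. The WITHIN GROUP form shown here works in PostgreSQL and Oracle; SQL Server additionally requires an OVER clause, and MySQL does not provide this function.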
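The "continuous" part of PERCENTILE_CONT refers to linear interpolation between neighbouring values. Here is a minimal sketch of that calculation in plain Python; the helper name percentile_cont and the salary figures are illustrative assumptions, and the code mirrors the commonly documented definition rather than any specific database's source.

```python
def percentile_cont(values, fraction):
    """Interpolated percentile of the non-NULL values, for 0 <= fraction <= 1."""
    ordered = sorted(v for v in values if v is not None)
    if not ordered:
        return None
    # Fractional position in the sorted list, counting from 0.
    position = fraction * (len(ordered) - 1)
    lower = int(position)
    upper = min(lower + 1, len(ordered) - 1)
    weight = position - lower
    # Blend the two neighbouring values according to the fractional part.
    return ordered[lower] + weight * (ordered[upper] - ordered[lower])


salaries = [40000, 52000, 61000, 75000]  # hypothetical salaries

median_salary = percentile_cont(salaries, 0.5)
print(median_salary)  # halfway between the two middle salaries
```

With an even number of rows the median is not an actual row value, which is the main difference from PERCENTILE_DISC, which always returns a value present in the column.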
Syntax:
SELECT PERCENTILE_CONT(0.5) WITHIN GROUP (ORDER BY column_name) FROM table_name;
Example Query:
SELECT PERCENTILE_CONT(0.5) WITHIN GROUP (ORDER BY salary) AS median_salary
FROM employees;
Output:
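[Image: Median salary of employees]
9. CORR() Function
Calculates the Pearson correlation coefficient between two numeric columns. The result ranges from -1 (perfect negative linear relationship) to 1 (perfect positive linear relationship). CORR() is available in PostgreSQL and Oracle; MySQL and SQL Server do not provide it as a built-in function.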
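For databases without CORR(), or simply to see what the coefficient measures, the calculation can be sketched in plain Python. The helper name corr and the sample figures are assumptions for this illustration; pairs where either value is missing are skipped, as a SQL aggregate would do.

```python
import math


def corr(xs, ys):
    """Pearson correlation of paired values, ignoring pairs with a None."""
    pairs = [(x, y) for x, y in zip(xs, ys) if x is not None and y is not None]
    n = len(pairs)
    if n == 0:
        return None
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    co_deviation = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    spread_x = sum((x - mean_x) ** 2 for x, _ in pairs)
    spread_y = sum((y - mean_y) ** 2 for _, y in pairs)
    if spread_x == 0 or spread_y == 0:
        return None  # undefined when one column is constant
    return co_deviation / math.sqrt(spread_x * spread_y)


sales = [100, 200, 300, 400]  # hypothetical figures
profit = [12, 19, 33, 41]

correlation_coefficient = corr(sales, profit)
print(correlation_coefficient)
```

A value close to 1 here would suggest that profit rises almost linearly with sales in this made-up data.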
Syntax:
SELECT CORR(column1, column2) FROM table_name;
Example Query:
SELECT CORR(sales, profit) AS correlation_coefficient
FROM sales_data;
Output:
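[Image: Correlation coefficient between 'sales' and 'profit']
10. COVAR_POP() Function
Calculates the population covariance between two numeric columns, that is, the average product of each pair's deviations from the two column means. A positive value means the columns tend to move together; a negative value means they tend to move in opposite directions. COVAR_POP() is available in PostgreSQL and Oracle; the sample counterpart is COVAR_SAMP().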
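As with variance, the population and sample covariance differ only in the divisor. The following plain-Python sketch shows both; the helper names and the revenue and expense figures are illustrative assumptions.

```python
def _co_deviation(xs, ys):
    """Sum of products of deviations from the means, plus the pair count."""
    pairs = [(x, y) for x, y in zip(xs, ys) if x is not None and y is not None]
    n = len(pairs)
    if n == 0:
        return None, 0
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    return sum((x - mean_x) * (y - mean_y) for x, y in pairs), n


def covar_pop(xs, ys):
    total, n = _co_deviation(xs, ys)
    return None if n == 0 else total / n  # divide by n


def covar_samp(xs, ys):
    total, n = _co_deviation(xs, ys)
    return None if n < 2 else total / (n - 1)  # divide by n - 1


revenue = [100, 200, 300]   # hypothetical figures
expenses = [60, 120, 180]

population_covariance = covar_pop(revenue, expenses)
sample_covariance = covar_samp(revenue, expenses)
print(population_covariance, sample_covariance)
```

Unlike the correlation coefficient, covariance is expressed in the product of the two columns' units, so its magnitude is hard to compare across datasets.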
Syntax:
SELECT COVAR_POP(column1, column2) FROM table_name;
Example Query:
SELECT COVAR_POP(revenue, expenses) AS population_covariance
FROM financial_data;
Output:
[Image: Population covariance between revenue and expenses]
Conclusion
In SQL, statistical functions help analyze and summarize data in the database. Whether you are counting occurrences, calculating totals, finding averages, or measuring variance, these functions play a vital role in extracting meaningful information from a dataset. They extend SQL's analytical capabilities and make it a valuable tool for businesses and analysts working with relational databases.