Poorly written SQL queries can result in slow performance, high resource costs, locking and blocking issues, and unhappy users. The following are common practices that can be used to write efficient queries.
1. Use Indexes Wisely
Indexes let the database look up rows quickly instead of scanning the entire table. For example, suppose queries frequently filter on customer_id, as in the following query:
SELECT * FROM orders WHERE customer_id = 123;
Creating an index on that column helps:
CREATE INDEX idx_orders_customer_id ON orders(customer_id);
With customer_id indexed, the SELECT above can locate matching rows through the index and will typically run much faster on a large table.
Types of Indexes:
- Primary Index: Created automatically on the primary key; enforces uniqueness and provides efficient access.
- Secondary Index: Created on non-primary-key columns to improve query performance; must be created explicitly.
- Clustered Index: Determines the physical order of rows in the table, so there can be only one per table. In some databases the primary key index is clustered automatically.
- Non-Clustered Index: Stored separately from the table and contains pointers to the rows; a table can have several.
Best Practices:
- Index columns that are frequently used in WHERE clauses, JOIN conditions, or ORDER BY clauses.
- Avoid over-indexing. Every index takes additional storage and must be maintained on each write, which slows down INSERT, UPDATE, and DELETE operations.
- Regularly monitor and analyze index usage to find unused or missing indexes.
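The effect of an index can be checked rather than assumed. The following is a minimal sketch using Python's built-in sqlite3 module with an in-memory database; the table layout, the sample data, and the helper name plan are assumptions made for illustration, and other engines print their plans in a different format.

```python
import sqlite3

def plan(conn, sql, params=()):
    """Return the human-readable steps of SQLite's plan for a query."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    return [row[3] for row in rows]  # the 4th column holds the description

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (order_id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL)"
)
# Hypothetical sample data: 5000 orders spread over 500 customers.
conn.executemany(
    "INSERT INTO orders (customer_id, amount) VALUES (?, ?)",
    [(i % 500, i * 1.5) for i in range(5000)],
)

query = "SELECT * FROM orders WHERE customer_id = ?"
plan_before = plan(conn, query, (123,))  # no index yet: the table is scanned

conn.execute("CREATE INDEX idx_orders_customer_id ON orders(customer_id)")
plan_after = plan(conn, query, (123,))   # the plan now goes through the index

print(plan_before)
print(plan_after)
```

The first plan reports a scan of orders, while the second names idx_orders_customer_id. The exact wording of the plan text varies between SQLite versions.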
2. Avoid SELECT *: Choose Only Required Columns
Using SELECT * can hurt query performance, especially with large tables or joins. The database engine retrieves every column, even the ones you don't need, which increases memory usage, slows down data transfer, and can prevent the optimizer from answering the query from an index alone. Example:
Avoid this:
SELECT * FROM products;
Use this instead:
SELECT product_id, product_name, price FROM products;
Why This Helps:
- Reduces I/O load and memory usage.
- Lets the optimizer read only the columns it needs, for example from a covering index.
- Makes queries more readable and maintainable.
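One way to see the optimizer benefit is to compare plans when an index contains every requested column. The sketch below uses SQLite through Python's sqlite3 module; the description column, the index idx_products_name_price, the added WHERE filter, and the 'Widget' value are all assumptions introduced only for this illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE products ("
    "product_id INTEGER PRIMARY KEY, product_name TEXT, price REAL, description TEXT)"
)
# Hypothetical index that holds both product_name and price.
conn.execute("CREATE INDEX idx_products_name_price ON products(product_name, price)")

def plan(sql):
    """Join the plan steps SQLite reports for a query into one string."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql).fetchall()
    return " | ".join(row[3] for row in rows)

# SELECT * needs description, which only the table has.
star_plan = plan("SELECT * FROM products WHERE product_name = 'Widget'")

# The narrow column list can be served from the index alone.
narrow_plan = plan(
    "SELECT product_id, product_name, price FROM products WHERE product_name = 'Widget'"
)

print(star_plan)
print(narrow_plan)
```

In this setup the narrow query is reported as using a covering index, while SELECT * still has to visit the table for the remaining column.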
3. Limit Rows with WHERE and LIMIT
Fetching more rows than needed is a common issue. Even if your app only uses 10 rows, the query might retrieve thousands, slowing things down. Use the WHERE clause to filter data precisely and LIMIT to restrict the number of rows returned. LIMIT is supported by MySQL, PostgreSQL, and SQLite; SQL Server uses TOP and Oracle uses FETCH FIRST for the same purpose.
Example:
SELECT name FROM customers
WHERE country = 'USA'
ORDER BY signup_date DESC
LIMIT 50;
Benefits:
- Reduces CPU and network load.
- Avoids returning excessive data during analysis or validation.
- Great for previewing transformations and testing queries.
4. Write Efficient WHERE Clauses
The WHERE clause is one of the most important parts of an SQL query because it filters rows based on conditions. However, how you write it can significantly impact performance. A common mistake is applying functions or operations directly to column values in the WHERE clause, which can prevent the database from using indexes and slow down query execution.
Poor:
SELECT * FROM employees WHERE YEAR(joining_date) = 2022;
Why this is bad: The YEAR() function must be evaluated for every row's joining_date, so an ordinary index on joining_date cannot be used and the database falls back to a full table scan.
Optimized:
SELECT * FROM employees
WHERE joining_date >= '2022-01-01' AND joining_date < '2023-01-01';
Optimization Tips:
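- Don't apply functions to columns in the condition (LOWER(column), YEAR(column), etc.).
- Avoid arithmetic on the column side of a comparison: write salary = 95000 rather than salary + 5000 = 100000.
- Rewrite conditions so the bare column is compared against a constant or a range, which lets the database use available indexes.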
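The difference between the two forms can be observed in a query plan. This is a small sketch on SQLite via Python's sqlite3 module, under a few assumptions: SQLite has no YEAR() function, so strftime('%Y', ...) stands in for it, dates are stored as ISO-8601 text, and the sample rows and the index name are made up for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employees (id INTEGER PRIMARY KEY, name TEXT, joining_date TEXT)")
conn.execute("CREATE INDEX idx_employees_joining_date ON employees(joining_date)")
conn.executemany(
    "INSERT INTO employees (name, joining_date) VALUES (?, ?)",
    [("Ana", "2021-11-30"), ("Ben", "2022-01-01"), ("Cy", "2022-12-31"), ("Di", "2023-01-01")],
)

# Function wrapped around the column (stand-in for YEAR(joining_date) = 2022).
wrapped = "SELECT name FROM employees WHERE strftime('%Y', joining_date) = '2022'"
# Same filter expressed as a range on the bare column.
ranged = (
    "SELECT name FROM employees "
    "WHERE joining_date >= '2022-01-01' AND joining_date < '2023-01-01'"
)

def first_step(sql):
    """Return the first step of SQLite's plan for the query."""
    return conn.execute("EXPLAIN QUERY PLAN " + sql).fetchall()[0][3]

wrapped_plan = first_step(wrapped)
ranged_plan = first_step(ranged)
wrapped_rows = sorted(conn.execute(wrapped).fetchall())
ranged_rows = sorted(conn.execute(ranged).fetchall())

print(wrapped_plan)
print(ranged_plan)
```

Both queries return the same employees, but only the range form is planned as a search on idx_employees_joining_date; the wrapped form is planned as a scan.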
5. Avoid Functions on Indexed Columns
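Using SQL functions (like UPPER(), LOWER(), DATE()) on indexed columns can prevent the database from using indexes, leading to slower queries. Where possible, store the value in a normalized form (for example, emails always in one case) so that the column can be compared directly.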
Bad (pseudocode):
SELECT * FROM users WHERE UPPER(email) = '[email protected]';
Good (pseudocode):
SELECT * FROM users WHERE email = '[email protected]';
Why This Matters:
- Preserves index usability
- Keeps search efficient
- Prevents unnecessary computation on large datasets
6. Use Joins Smartly
Always join only the tables you need and filter data before joining whenever possible. Use INNER JOIN instead of OUTER JOIN when you don't need unmatched records.
Example:
SELECT u.name, o.amount
FROM users u
JOIN orders o ON u.user_id = o.user_id
WHERE o.amount > 100;
Why This Matters:
- Reduces join processing time
- Prevents accidental Cartesian products, which occur when tables are joined without an ON condition
- Helps the planner optimize the execution path
7. Avoid N+1 Query Problems
N+1 happens when your app makes one query to get a list, then runs additional queries in a loop to get related data. Always aim to fetch related data in one query using JOINs.
Bad (pseudocode):
SELECT * FROM users;
-- For each user: SELECT * FROM orders WHERE user_id = ?
Good (pseudocode):
SELECT u.user_id, u.name, o.order_id, o.amount
FROM users u
JOIN orders o ON u.user_id = o.user_id;
Why This Matters:
- Reduces database calls
- Improves latency and throughput
- Keeps the database from getting overloaded
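The two access patterns can be compared side by side from application code. The sketch below uses Python's sqlite3 module with made-up sample rows; the function names are hypothetical. It uses LEFT JOIN rather than the plain JOIN shown above so that users without orders still appear, matching what the loop version returns.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE users  (user_id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE orders (order_id INTEGER PRIMARY KEY, user_id INTEGER, amount REAL);
    INSERT INTO users  VALUES (1, 'Ana'), (2, 'Ben'), (3, 'Cy');
    INSERT INTO orders VALUES (10, 1, 50.0), (11, 1, 75.0), (12, 2, 20.0);
""")

def orders_n_plus_one(conn):
    """One query for the users, then one more query per user."""
    queries = 1
    result = {}
    for user_id, name in conn.execute("SELECT user_id, name FROM users").fetchall():
        rows = conn.execute(
            "SELECT order_id, amount FROM orders WHERE user_id = ? ORDER BY order_id",
            (user_id,),
        ).fetchall()
        queries += 1
        result[name] = rows
    return result, queries

def orders_single_query(conn):
    """One LEFT JOIN; users without orders are kept with an empty list."""
    sql = """
        SELECT u.name, o.order_id, o.amount
        FROM users u
        LEFT JOIN orders o ON u.user_id = o.user_id
        ORDER BY u.user_id, o.order_id
    """
    result = {}
    for name, order_id, amount in conn.execute(sql):
        result.setdefault(name, [])
        if order_id is not None:  # NULL order columns mean the user has no orders
            result[name].append((order_id, amount))
    return result, 1

looped, looped_queries = orders_n_plus_one(conn)
joined, joined_queries = orders_single_query(conn)
print(looped_queries, joined_queries)
```

With three users the loop version issues four queries and the join version issues one, and both build the same mapping. The gap grows linearly with the number of users.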
8. Use EXISTS Instead of IN (for Subqueries)
When checking existence, EXISTS can be more efficient than IN, especially if the subquery returns a large dataset.
Bad (pseudocode):
SELECT name FROM customers
WHERE customer_id IN (SELECT customer_id FROM orders);
Good (pseudocode):
SELECT name FROM customers
WHERE EXISTS (
SELECT 1 FROM orders WHERE orders.customer_id = customers.customer_id
);
Why This Matters:
- EXISTS can stop searching as soon as one match is found
- Some query planners optimize EXISTS better than IN, although many modern engines generate the same plan for both
- Avoids materializing a large subquery result in memory
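Before swapping one form for the other, it is worth confirming that both return the same rows on your data. A minimal check with Python's sqlite3 module might look like the following; the sample customers and orders are invented for the example. Whether EXISTS is actually faster depends on the engine and should be verified with its execution plan.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE orders    (order_id INTEGER PRIMARY KEY, customer_id INTEGER);
    INSERT INTO customers VALUES (1, 'Ana'), (2, 'Ben'), (3, 'Cy');
    INSERT INTO orders    VALUES (10, 1), (11, 1), (12, 3);
""")

# IN form: the subquery produces the full list of customer ids with orders.
with_in = conn.execute("""
    SELECT name FROM customers
    WHERE customer_id IN (SELECT customer_id FROM orders)
    ORDER BY name
""").fetchall()

# EXISTS form: a correlated check per customer.
with_exists = conn.execute("""
    SELECT name FROM customers
    WHERE EXISTS (
        SELECT 1 FROM orders WHERE orders.customer_id = customers.customer_id
    )
    ORDER BY name
""").fetchall()

print(with_in)
print(with_exists)
```

Both queries list only the customers that have at least one order, and a customer with several orders appears once.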
9. Avoid Wildcards at the Start of LIKE
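Using % at the beginning of a LIKE pattern prevents an ordinary B-tree index from being used, because the index is ordered by the start of the value, and this leads to a full table scan. Note that the two queries below match different rows (names ending in "john" versus names starting with "john"); the point is that only the prefix form can use the index.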
Bad (pseudocode):
SELECT * FROM users WHERE name LIKE '%john';
Good (pseudocode):
SELECT * FROM users WHERE name LIKE 'john%';
Why This Matters:
- Keeps searches index-friendly
- Speeds up pattern matching
- Reduces scan overhead
10. Denormalize When Necessary
While normalization keeps data clean, excessive joins can slow down read-heavy queries. Denormalization (storing some redundant data) can help in scenarios where read performance matters more than a strictly normalized structure.
Example:
- Store the total order amount in the orders table instead of calculating it with JOINs on every read.
Why This Matters:
- Reduces join complexity
- Speeds up frequent reads
- Useful for analytics or dashboards
11. Use the Query Execution Plan
Every major DBMS has a way to show the execution plan (like EXPLAIN in MySQL/PostgreSQL). The plan shows how the SQL engine will process your query, including which indexes it uses and whether it scans whole tables.
Example:
EXPLAIN SELECT * FROM orders WHERE user_id = 42;
Why This Matters:
- Helps identify full table scans
- Reveals if indexes are used
- Guides optimization decisions
12. Use UNION ALL Instead of UNION (if possible)
UNION removes duplicates, which requires an extra sort or hash step over the combined result. If duplicates cannot occur or are acceptable, UNION ALL is faster.
Bad (pseudocode):
SELECT col FROM table1
UNION
SELECT col FROM table2;
Good (pseudocode):
SELECT col FROM table1
UNION ALL
SELECT col FROM table2;
Why This Matters:
- Avoids unnecessary sorting
- Faster result merging
- Better for performance on large sets
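The two operators are only interchangeable when duplicates do not matter, which a tiny example makes concrete. This sketch runs on SQLite through Python's sqlite3 module with made-up values in table1 and table2.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE table1 (col INTEGER);
    CREATE TABLE table2 (col INTEGER);
    INSERT INTO table1 VALUES (1), (2), (2);
    INSERT INTO table2 VALUES (2), (3);
""")

# UNION: duplicates are removed from the combined result.
union_rows = conn.execute(
    "SELECT col FROM table1 UNION SELECT col FROM table2 ORDER BY col"
).fetchall()

# UNION ALL: every row from both inputs is kept.
union_all_rows = conn.execute(
    "SELECT col FROM table1 UNION ALL SELECT col FROM table2 ORDER BY col"
).fetchall()

print(union_rows)      # [(1,), (2,), (3,)]
print(union_all_rows)  # [(1,), (2,), (2,), (2,), (3,)]
```

If downstream code relies on distinct values, switching to UNION ALL changes the result, so the faster operator is only a safe replacement when that is not the case.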
13. Avoid SELECT Inside Loops (In Applications)
Don't run a query inside a loop in your app if you can write one efficient query that retrieves all needed data.
Bad (pseudocode):
for user_id in ids:
    # One round trip per id: N queries for N ids.
    cursor.execute("SELECT name FROM users WHERE id = ?", (user_id,))
Good (pseudocode):
SELECT id, name FROM users WHERE id IN (1, 2, 3, 4);
Why This Matters:
- Reduces round trips to the database
- Lower latency
- Better network and CPU usage
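From application code, the single IN query is usually built with one bound parameter per id rather than by pasting values into the SQL string. The following is a sketch using Python's sqlite3 module; the function name fetch_names and the sample users are assumptions for the example.

```python
import sqlite3

def fetch_names(conn, ids):
    """Fetch names for many ids in one round trip using a parameterized IN list."""
    ids = list(ids)
    if not ids:
        return {}  # skip the query entirely; an empty IN () is rejected by many engines
    placeholders = ", ".join("?" for _ in ids)  # one "?" per id, values stay bound
    sql = f"SELECT id, name FROM users WHERE id IN ({placeholders})"
    return dict(conn.execute(sql, ids).fetchall())

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)")
conn.executemany(
    "INSERT INTO users VALUES (?, ?)",
    [(1, "Ana"), (2, "Ben"), (3, "Cy"), (4, "Di")],
)

names = fetch_names(conn, [1, 3, 4])
print(names)
```

Only the placeholders are interpolated into the SQL text, so the ids themselves are still passed as parameters. Database engines cap the number of bound parameters per statement, so very long id lists may need to be split into batches.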
14. Partition Large Tables
Partitioning breaks a large table into smaller, more manageable chunks. When a query filters on the partition key, the database can skip the partitions that cannot contain matching rows and scan only the relevant data.
Example:
- Partition sales data by year or region
Why This Matters:
- Speeds up scans on large datasets
- Helps with parallelism and archiving
- Makes indexes more effective within partitions
15. Optimize ORDER BY and GROUP BY
Sorting and grouping can be expensive operations. Always limit the number of rows being sorted or grouped and use indexes that match the ORDER BY columns if possible.
Bad (pseudocode):
SELECT * FROM orders ORDER BY created_at;
Good (pseudocode):
SELECT order_id, amount FROM orders WHERE created_at >= '2023-01-01' ORDER BY created_at;
Why This Matters:
- Reduces memory load for sorting
- Makes sorting and grouping faster
- Enhances user-facing query performance
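Whether a sort is actually performed can be read from the execution plan. The sketch below uses SQLite via Python's sqlite3 module; the index name idx_orders_created_at and the helper plan_steps are assumptions for the example, and other engines describe the sort step differently.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (order_id INTEGER PRIMARY KEY, amount REAL, created_at TEXT)"
)

query = (
    "SELECT order_id, amount FROM orders "
    "WHERE created_at >= '2023-01-01' ORDER BY created_at"
)

def plan_steps(sql):
    """Return the list of steps in SQLite's plan for the query."""
    return [row[3] for row in conn.execute("EXPLAIN QUERY PLAN " + sql)]

# Without an index, SQLite has to sort the filtered rows in a temporary structure.
steps_without_index = plan_steps(query)

# With an index on created_at, rows can be read in order and no sort is needed.
conn.execute("CREATE INDEX idx_orders_created_at ON orders(created_at)")
steps_with_index = plan_steps(query)

print(steps_without_index)
print(steps_with_index)
```

The first plan includes a temporary B-tree step for the ORDER BY, while the second walks idx_orders_created_at and has no separate sort step.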