SQL Query Optimization

SQL query optimization involves choosing indexes, data types, and schemas to improve query performance. The database server analyzes queries to generate efficient execution plans that may use indexes to retrieve data from tables. Indexes like B-Trees and hash indexes provide fast data retrieval but slower writes. Choosing optimal data types, indexes, and limiting null values makes queries easier for the database engine to optimize. Covering indexes containing all needed columns can improve performance by avoiding lookups to the main table.

Uploaded by

Ganesh


SQL Query Optimization and Indexing

Along the way

How is a DB query executed?
Schema optimization
Execution Plan
Indexing (Types of Indices)
Using indices
Lock Contention
Covering Indices
The DB Engine

How does a database server run a query?

[Flowchart: Server process -> SQL query -> analysis -> new query? -> execution plan (optimizer) -> CPU? -> table scan or index?]

Require high performance?

Good optimized schema
Give indexes for specific queries
Tradeoffs!

What's an Index?

A data structure
Faster retrieval
Slower inserts

Also

Denormalized DB = faster for some queries + slower for others

Choosing optimal Data Types

Smaller is better
Use less space
Require fewer CPU cycles

Simple is good
Integers are easier to compare than characters
E.g., use MySQL's built-in types for dates and times rather than strings

Avoid NULL if possible

It is harder for MySQL to optimize queries that refer to nullable columns

DATETIME and TIMESTAMP store the same kind of data, but TIMESTAMP uses less storage and covers a much smaller range (1970 to 2038)

String Types
VARCHAR and CHAR types
How they are stored on disk depends on the storage engine
The format usually differs between disk, memory and what the storage engine returns after retrieval

VARCHAR

Uses only as much space as it needs

Uses 1 or 2 extra bytes to store the length
1 byte if the column's maximum length is 255 bytes or less, 2 bytes if it is more
So, with a single-byte character set, VARCHAR(10) uses up to 11 bytes and VARCHAR(1000) uses up to 1002 bytes
Improves performance because it saves space
Variable-length rows can grow on update, which means more work!
Use VARCHAR when the maximum column length is much larger than the average length and updates are rare
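
The length-prefix arithmetic above can be sketched in a few lines of Python. This is only an illustration: the helper name varchar_storage_bytes is hypothetical, and it assumes a single-byte character set (latin1) and ignores any per-row overhead a storage engine adds.

```python
def varchar_storage_bytes(value: str, max_len: int) -> int:
    """Approximate bytes needed for `value` in a VARCHAR(max_len) column.

    Assumption: single-byte character set (latin1), so one character = one byte.
    """
    if len(value) > max_len:
        raise ValueError("value is longer than the column definition allows")
    data = value.encode("latin1")
    # The prefix size depends on the column's maximum length, not on the value.
    prefix = 1 if max_len <= 255 else 2
    return len(data) + prefix


print(varchar_storage_bytes("hello", 10))        # 5 data bytes + 1 length byte
print(varchar_storage_bytes("x" * 10, 10))       # a full VARCHAR(10)
print(varchar_storage_bytes("x" * 1000, 1000))   # a full VARCHAR(1000)
```

A short value in a wide column still pays the 2-byte prefix, which is why the 11 and 1002 figures are upper bounds rather than fixed sizes.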

CHAR
Fixed length
For data that changes frequently, CHAR is better than VARCHAR
For very short columns, CHAR(1) takes 1 byte and VARCHAR(1) takes 2 bytes (single-byte character set)
The siblings of CHAR and VARCHAR are the BINARY and VARBINARY data types
They store binary strings, so values are compared as bytes rather than as characters

Comparing random strings

Strings produced by MD5(), SHA1() or UUID()

Each new value is distributed arbitrarily over a large space
They can slow INSERT queries because each new value goes into a random location in the indexes
They slow some SELECT queries because logically adjacent rows are widely dispersed on disk and in memory

If you do store UUID values, you should remove the dashes or, even better, convert the UUID values to 16-byte numbers with UNHEX() and store them in a BINARY(16) column. You can retrieve the values in hexadecimal format with the HEX() function.
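
To make the size difference concrete, here is a small Python sketch of the same conversion done on the application side. It uses the standard uuid module rather than MySQL's UNHEX() and HEX(); the helper names and the sample UUID are illustrative assumptions.

```python
import uuid


def uuid_to_binary16(text: str) -> bytes:
    """Turn a dashed UUID string into the 16 raw bytes a BINARY(16) column would hold."""
    return uuid.UUID(text).bytes


def binary16_to_hex(raw: bytes) -> str:
    """Render the 16 bytes as uppercase hex, similar to what HEX() returns."""
    return raw.hex().upper()


sample = "6ccd780c-baba-1026-9564-5b8c656024db"  # arbitrary sample value
raw = uuid_to_binary16(sample)

print(len(sample), len(raw))   # 36 characters as text, 16 bytes as binary
print(binary16_to_hex(raw))    # 6CCD780CBABA102695645B8C656024DB
```

The binary form is less than half the size of the text form, so both the column and every index on it get smaller.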

IP Address

The usual choice is VARCHAR(15)

But an IPv4 address is really an unsigned 32-bit integer, not a string
The dotted-quad notation is just a way for humans to read it easily
MySQL provides the INET_ATON() and INET_NTOA() functions to convert between the two representations
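
The two representations can be illustrated in Python with the standard ipaddress module. The functions below are hypothetical helpers named after the MySQL functions for readability; they are not the MySQL implementations.

```python
import ipaddress


def inet_aton(dotted: str) -> int:
    """Dotted-quad string -> unsigned 32-bit integer."""
    return int(ipaddress.IPv4Address(dotted))


def inet_ntoa(number: int) -> str:
    """Unsigned 32-bit integer -> dotted-quad string."""
    return str(ipaddress.IPv4Address(number))


print(inet_aton("192.168.1.1"))   # 3232235777
print(inet_ntoa(3232235777))      # 192.168.1.1
```

The integer fits in a 4-byte unsigned column, compared with up to 15 characters plus a length byte for the string form.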

The Execution Plan

Every SQL query is broken down into a series of execution steps called operators
Each operator performs a basic operation such as insert, search, scan, update or aggregation
There are two kinds of operators: logical operators and physical operators
Logical operators: describe at a conceptual level how the query will be executed
Physical operators: the actual routines that perform the action

PARSE
Checks syntax
A query processor tree is the output of the parse

OPTIMIZE
Calculates cost and produces an estimated plan and an actual plan

DATA STATISTICS (used by the optimizer)
1. How many rows?
2. Unique data?
3. Does the table span more than one page?

EXECUTE
Execution is done as per the plan

Into Indexing
TYPES
B-Tree Indexes
Hash Indexes

B-Tree
We use the term "B-Tree" for these indexes because
that's what MySQL uses in CREATE TABLE and other
statements
All the values are stored in order, and each leaf page is
the same distance from the root

Leaf nodes have pointers to the indexed data instead of pointers to other pages
Because B-Trees store the indexed columns in order,
they're useful for searching for ranges of data
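
Why ordered storage helps range searches can be shown with a sorted Python list and the bisect module. This is a stand-in for the idea, not a B-Tree implementation: a real index is a paged tree, and the key values here are made-up sample data.

```python
import bisect

# Keys kept in sorted order, as a B-Tree index keeps its indexed values.
sorted_keys = ["Babu", "Chandran", "Joseph", "Raj"]


def range_lookup(keys, low, high):
    """Return every key k with low <= k <= high using two binary searches."""
    start = bisect.bisect_left(keys, low)    # first position with key >= low
    end = bisect.bisect_right(keys, high)    # first position with key > high
    return keys[start:end]                   # the matches are contiguous


print(range_lookup(sorted_keys, "C", "K"))   # ['Chandran', 'Joseph']
```

Because the keys are sorted, the matching entries sit next to each other, so the lookup finds the start of the range and reads forward instead of checking every key.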

Hash Indexes
Built on hash tables and useful only for exact lookups that use every column in the index
In MySQL, only the Memory storage engine supports explicit hash indexes
The engine computes a hash code of the indexed columns and stores it in the hash table with a pointer to each row
E.g.:

CREATE TABLE testhash (
    fname VARCHAR(50) NOT NULL,
    lname VARCHAR(50) NOT NULL,
    KEY USING HASH(fname)
) ENGINE=MEMORY;

containing this data:

mysql> SELECT * FROM testhash;
+---------+----------+
| fname   | lname    |
+---------+----------+
| Darshan | Raj      |
| Bijesh  | Chandran |
| Jophin  | Joseph   |
| Vivek   | Babu     |
+---------+----------+

Suppose the index uses a hash function f(), which returns the following values:

f('Darshan') = 2323
f('Bijesh') = 7437
f('Jophin') = 8784
f('Vivek') = 2458

The index's data structure will look like this:

Slot    Value
2323    Pointer to row 1
2458    Pointer to row 4
7437    Pointer to row 2
8784    Pointer to row 3

A hash index on a TINYINT will be the same size as a hash index on a large character column, because the index stores only the short hash values, not the indexed values themselves.
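
The slot table above can be mimicked with a Python dict, which gives a feel for how an exact lookup works. This is a toy model: f() simply returns the made-up hash values from the example, and the row list stands in for the table; it says nothing about how the Memory engine is implemented.

```python
# Rows of the example table, numbered from 1 as in the slot table.
rows = [
    ("Darshan", "Raj"),
    ("Bijesh", "Chandran"),
    ("Jophin", "Joseph"),
    ("Vivek", "Babu"),
]

# Hypothetical hash values taken from the example, not a real hash function.
EXAMPLE_HASHES = {"Darshan": 2323, "Bijesh": 7437, "Jophin": 8784, "Vivek": 2458}


def f(fname):
    """Return the example hash value for fname, or None if it has none."""
    return EXAMPLE_HASHES.get(fname)


# Build the index: slot (hash value) -> list of row numbers.
hash_index = {}
for row_number, (fname, _lname) in enumerate(rows, start=1):
    hash_index.setdefault(f(fname), []).append(row_number)


def lookup(fname):
    """Exact-match lookup: hash the value, follow the pointers, recheck the row."""
    matches = []
    for row_number in hash_index.get(f(fname), []):
        row = rows[row_number - 1]
        if row[0] == fname:  # recheck, since different values can share a slot
            matches.append(row)
    return matches


print(sorted(hash_index.items()))
print(lookup("Jophin"))
```

The index holds only the hash values and row pointers, so it can answer equality lookups but has no ordering to support range scans or sorting.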

Non-Clustered Indexes

Data rows are stored in no particular order
The logical ordering is specified by the index
Typically created on columns used in JOIN, WHERE and ORDER BY clauses
Good for tables whose values are modified frequently

Clustered Indexes
Data blocks arranged in order to match the index
Only one clustered index possible on a given table
Faster retrieval when data is accessed in ascending or descending order of the index

MS SQL Server creates non-clustered indices by default when CREATE INDEX is given.

Using indices
Indexing the primary key
Usually indexed automatically to make retrieval efficient
The most effective access path
Indexes on other columns or combinations of columns are secondary indexes, which improve data retrieval performance

Secondary indexes
Indexes on columns other than the primary key column
Create secondary indexes on tables that have more reads than writes
Essentially a copy of the table that contains only the fields specified in the index

Rule of thumb: don't put more than 4 fields in an index or more than 5 indexes on a table. You are inviting trouble otherwise!

Index column order does matter!

A multi-column index is not useful if the lookup does not start from the leftmost of the indexed columns.
You can't skip columns in the index.
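
The leftmost-prefix rule can be observed with a small experiment. The sketch below uses Python's built-in sqlite3 module as a stand-in for MySQL, so the plan text is SQLite's EXPLAIN QUERY PLAN output rather than MySQL's EXPLAIN; the table and index names are made up for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE people ("
    "id INTEGER PRIMARY KEY, last_name TEXT, first_name TEXT, dob TEXT, notes TEXT)"
)
# One multi-column index; column order is (last_name, first_name, dob).
conn.execute("CREATE INDEX idx_name_dob ON people (last_name, first_name, dob)")


def plan(sql):
    """Return SQLite's plan description for a query (last column of each row)."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql).fetchall()
    return "; ".join(row[-1] for row in rows)


# Starts from the leftmost column: the index can be used.
leftmost = plan("SELECT * FROM people WHERE last_name = 'Raj'")
# Does not start from the leftmost column: falls back to scanning the table.
not_leftmost = plan("SELECT * FROM people WHERE first_name = 'Darshan'")
# Skips first_name: only the last_name part of the index helps.
skipped = plan("SELECT * FROM people WHERE last_name = 'Raj' AND dob = '1990-01-01'")

print(leftmost)
print(not_leftmost)
print(skipped)
```

In the third query the condition on dob still has to be checked row by row, because the index entries are ordered by first_name before dob.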

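Join vs. Subquery

A join is faster when there are fewer tables
A join is faster when the tables hold less data
A subquery can be faster when there are many tables, since joining many tables is expensive
A subquery can be faster when the tables hold huge amounts of data
These are rules of thumb: the optimizer may rewrite a subquery as a join, so check the actual plan with EXPLAIN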

Explaining EXPLAIN

A way to obtain information about how MySQL executes a SELECT statement
Syntax: EXPLAIN SELECT select_options
Returns a row of information for each table used in the SELECT statement
This is the information MySQL gives for each table:

id | select_type | table | type | possible_keys | key | key_len | ref | rows | Extra

id: the select identifier
select_type: the type of SELECT (SIMPLE, PRIMARY, UNION, DEPENDENT UNION, SUBQUERY, DEPENDENT SUBQUERY, etc.)
table: the table to which the row of output refers
type: the join type (important)
possible_keys: the indexes that could be used for the query
key: the index actually chosen for the query
rows: the estimated number of rows to be examined
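
A quick way to get a feel for reading a plan is to compare it before and after adding an index. The sketch below does this with Python's sqlite3 module as a stand-in: SQLite's EXPLAIN QUERY PLAN prints a one-line description instead of MySQL's column layout, and the table, data and index name are invented for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE user (user_id INTEGER PRIMARY KEY, name TEXT, status INTEGER)")
conn.executemany(
    "INSERT INTO user (user_id, name, status) VALUES (?, ?, ?)",
    [(i, f"user{i}", i % 10) for i in range(1, 1001)],
)


def explain(sql):
    """Return the plan descriptions SQLite reports for a query."""
    return [row[-1] for row in conn.execute("EXPLAIN QUERY PLAN " + sql)]


query = "SELECT name FROM user WHERE status = 9"

before = explain(query)                                  # no index on status yet
conn.execute("CREATE INDEX idx_status ON user (status)")
after = explain(query)                                   # index is now available

print("before:", before)
print("after: ", after)
```

In MySQL the same change would typically show up in the type, key and rows columns: a full scan with no key before, and the new index with a much smaller row estimate afterwards.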

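Lock Contention

If the status column is not indexed

1) DELETE FROM user WHERE status = 9
   Fully scans the user table, deleting each row where status = 9
2) UPDATE user SET status = 9 WHERE user_id = 100

[Diagram: user table with columns user_id (PK), name and status, holding user_id 100 through 100000]

What happens if query 1 does not lock the row with user_id = 100?
DATA CONSISTENCY IS BROKEN!
So query 1 has to lock the rows it scans, and query 2 has to wait.

If the status column is indexed

1) DELETE FROM user WHERE status = 9
2) UPDATE user SET status = 9 WHERE user_id = 100

[Diagram: index on status (diagram values 100, 101, 12345, 100000) pointing into the user table with columns user_id (PK), name and status; sample names Roger, Rafael and Andy]

1) and 2) can run in parallel (CONCURRENCY IMPROVED)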

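Covering Index

An index that contains all the columns a query needs, so the query can be answered from the index without a lookup to the main table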
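
A covering index holds every column a query reads, so the engine never has to visit the table itself. The sketch below shows the effect with Python's sqlite3 module as a stand-in for MySQL (where EXPLAIN would report "Using index" in the Extra column); the table and index names are assumptions made for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE user (user_id INTEGER PRIMARY KEY, name TEXT, status INTEGER, bio TEXT)"
)
# The index contains status and name, but not bio.
conn.execute("CREATE INDEX idx_status_name ON user (status, name)")


def plan(sql):
    """Return SQLite's plan description for a query."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql).fetchall()
    return "; ".join(row[-1] for row in rows)


# Needs only status and name: the index alone can answer it.
covered = plan("SELECT name FROM user WHERE status = 9")
# Also needs bio: the index finds the rows, then the table must be read.
not_covered = plan("SELECT name, bio FROM user WHERE status = 9")

print(covered)
print(not_covered)
```

Adding bio to the select list is enough to lose the benefit, which is why covering indexes are usually designed around specific, frequently run queries.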

DB Engine

The underlying software component that a database management system (DBMS) uses to create, read, update and delete (CRUD) data in a database
MySQL has InnoDB and MyISAM
InnoDB = transactional
MyISAM = non-transactional

InnoDB creates a clustered index for every table. If the table has a primary key, that is the clustered index. If not, InnoDB uses the first unique index whose columns are all NOT NULL; if there is none, it generates a hidden six-byte row ID and makes that the clustered index.
All InnoDB indexes are B-Trees. The leaf nodes of the primary key index hold the row data.


THANK YOU
