Database Optimization
June 9, 2009

Kyle Lee
Ayoka, L.L.C.

A brief overview of database optimization techniques for the database developer. Database optimization
techniques include RDBMS query execution strategies, cost estimation, join performance, the proper
application of indexing, formulating intelligent queries in the context of a single-server RDBMS
environment, and illustration of automated optimization tools.

© 2009 Ayoka, L.L.C.

Table of Contents
Overview
Types of Indexes
    Crafting a Clustered Index
    Unclustered Indexes
SQL Server Automated Index Recommendations
Cost Estimation for SELECT Operations
    Linear Search
    Binary Search
    Primary Index / Hash
Cost Estimation of JOIN Operations
    Nested Loop
    Single Loop (using an index)
    Sort-Merge Join

Basic Database Optimization

Introduction
Database management systems are pervasive in the modern world. The notion of a persistent,
redundant, and highly distributable library of information has become the single most important
concept in our information technology repertoire. In fact, virtually every human being in the Western
world interacts with a database management system of some kind on a daily basis—often without using
a personal computer at any time throughout the day.

With millions of data transactions taking place every second, it comes as little surprise that database
optimization is an area of key research for academic institutions and corporate research and
development departments. From a software company’s perspective, the relational database most often
serves as the core of data‐driven software applications, and lack of database optimization in such a key
area can incur significant costs to both the client and the company.

The purpose of this paper is to discuss basic database optimization using mathematical cost estimation
for different types of queries, a review of join performance, and the effects of various physical access
structures on specific query examples. The intended audience should be familiar with SQL and basic
relational database concepts – typically an experienced database developer. Specific examples will be
given in the context of MS SQL Server 2005, but the concepts they illustrate will be general enough to
apply to any SQL‐supporting relational DBMS (Database Management System).

After reviewing the paper, the reader will hopefully have a better understanding of how RDBMSs
formulate execution strategies for complex queries and be able to use this knowledge to retrieve
information at a lower cost.

Indexes
Overview
A database index is a physical access structure for a database table that functions much as the name
would suggest: it is a sorted file that informs the database of where records are physically located on the
disk. To get the idea of what an index does, consider a textbook. In order to find a particular section,
the reader can either start reading the book and keep reading until he finds what he is seeking, or,
alternatively, he can consult the table of contents and go directly to the desired section. A database
index functions much like a textbook index does.

Adding appropriate indexes to large tables is the single most important part of database optimization, as
we will see when we discuss some examples of cost estimation. Creating a single index for a large table
with no existing indexes can reduce a query’s execution time by an order of magnitude. As an example,
consider the following scenario. Say we have a database table called EMPLOYEE with 100,000 records.
Assume that we wish to perform the following simple query on the table and that no indexes exist on
the table:

SELECT FirstName, LastName FROM EMPLOYEE WHERE EmpID = 12345;

In order to find the employee record with the appropriate EmpID, the database must potentially scan
through all 100,000 employee records to return the correct result. This type of scan is referred to as a
full table scan.

Fortunately, a database developer can create an index on the EmpID column to prevent such scans from
occurring. Additionally, if this field has a UNIQUE constraint, the DBMS can implement the index as a
hash table, with each employee ID hashing to the desired record's physical disk address. Scanning
thus becomes unnecessary, and the record is located in roughly constant time. (SQL Server itself stores
its indexes as B-trees, so a lookup there costs a few page reads rather than exactly one, but the
comparison with a full table scan still holds.) After the database developer adds this index, the DBMS
can locate the employee record with EmpID 12345 directly instead of examining up to 100,000 records.
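
The difference between the two access paths can be sketched in a few lines of Python. This is only an
in-memory illustration under assumed data: the list stands in for an unindexed table, the dictionary
stands in for a hash index on EmpID, and the names `employees`, `full_table_scan`, `emp_id_index`, and
`index_lookup` are hypothetical rather than part of any DBMS.

```python
# Hypothetical in-memory stand-in for the EMPLOYEE table: 100,000 rows, EmpID 1..100000.
employees = [
    {"EmpID": i, "FirstName": f"First{i}", "LastName": f"Last{i}"}
    for i in range(1, 100_001)
]

def full_table_scan(rows, emp_id):
    """Examine rows one by one, as a DBMS must when no index exists on EmpID."""
    examined = 0
    for row in rows:
        examined += 1
        if row["EmpID"] == emp_id:
            return row, examined
    return None, examined

# A dictionary plays the role of a hash index: EmpID -> position of the row.
emp_id_index = {row["EmpID"]: pos for pos, row in enumerate(employees)}

def index_lookup(rows, index, emp_id):
    """Jump straight to the row through the hash index; one row is examined."""
    pos = index.get(emp_id)
    return (rows[pos], 1) if pos is not None else (None, 0)

scan_row, scan_examined = full_table_scan(employees, 12345)
idx_row, idx_examined = index_lookup(employees, emp_id_index, 12345)
print(scan_examined, idx_examined)
```

With the EmpIDs stored in ascending order, the scan examines 12,345 rows before it finds the match, and
all 100,000 rows when the EmpID does not exist. The indexed lookup examines one row.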

Types of Indexes
Indexes fall into one of two categories: clustered and unclustered. The primary distinction between the
two categories is that an unclustered index does not affect the actual ordering of the records on disk,
whereas a clustered index does. Because clustered indexing affects the physical ordering of the records,
there can be at most one clustered index for each table. The same restriction does not apply to
unclustered indexes, so we can create as many as disk space allows (although this is not necessarily a
good idea, as we’ll see in a moment).

Crafting a Clustered Index

An SQL database developer should be aware that the clustered index is the most important index for
any table, so it should be crafted with care. As a general guideline, every table should have at least a
clustered index. In some cases, clustered indexes are created automatically. In SQL Server, for instance,
defining a primary key for a table will by default create a clustered index on the key columns. However,
once a clustered index has been created for a primary key, its columns can no longer be edited. If the
database developer wishes to choose which columns appear in the clustered index, it may be better to
remove the primary key and create an unclustered index, with the unique constraint enabled, on the
former primary key column. This leaves the database developer free to create a more custom-tailored
clustered index.

Indeed, creating a proper clustered index is important, as not all clustered indexes are created equal.
Take, for example, a phone book. If the listings are ordered by city, then by last name, then by first
name, it is not difficult to find out how many phone numbers are listed for a particular city. If, however,
the listings are ordered by last name, then first name, then city, we would have to go through the whole
phone book and count every listing with the desired city—not a fun task for anyone.

The same idea applies to databases. Order the columns in your clustered index based on your business
logic. Group them in some way that makes sense for your software application. If your queries most
frequently search for all customers who have a particular type of vehicle, set the vehicle column as
the first column in your clustered index and the customer ID as the second. Customers are then grouped
both logically and physically (on disk) by their vehicle type, so each such query reads one contiguous
run of records. Crafting the index around your most frequent queries in this way is a central part of
database optimization.
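
The effect of column order can be observed with a small experiment. The sketch below uses Python's
built-in `sqlite3` module purely as a stand-in: SQLite is not SQL Server, its ordinary indexes are not
clustered, and the `listing` table, the `idx_listing` index, and the helper `plan_for_city_count` are all
hypothetical. It asks the query planner how it would count the listings for one city under each of the
two phone-book orderings.

```python
import sqlite3

def plan_for_city_count(index_columns):
    """Return SQLite's query-plan text for counting one city's listings."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE listing (city TEXT, last_name TEXT, first_name TEXT, phone TEXT)"
    )
    # Hypothetical index; only the column order differs between the two runs.
    conn.execute(f"CREATE INDEX idx_listing ON listing ({index_columns})")
    rows = conn.execute(
        "EXPLAIN QUERY PLAN SELECT COUNT(*) FROM listing WHERE city = ?",
        ("Arlington",),
    ).fetchall()
    conn.close()
    # The human-readable plan step is the last column of each row.
    return " | ".join(row[-1] for row in rows)

city_first = plan_for_city_count("city, last_name, first_name")
city_last = plan_for_city_count("last_name, first_name, city")
print(city_first)
print(city_last)
```

With the city as the leading column the planner reports a SEARCH on the index, that is, it jumps to the
first matching entry. With the city as the last column it reports a SCAN, which is the database
equivalent of reading the whole phone book.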

Figure 1

Figure 1 shows an existing clustered index for a Customer table in SQL Server 2005. Since our database
keeps track of customer records for many different car dealerships, the customers are grouped by the
dealership and then by their customer ID for optimal selection performance.

Unclustered Indexes
From the perspective of the database developer, an unclustered database index does not seem at its
surface to be very different from a clustered index. As stated above, the real difference is that a
clustered index will reorder the records on disk, whereas an unclustered index will not. Indeed, the
same grouping ideas apply just as much to unclustered indexes as they do to clustered indexes, so we
will not go over them again.

Unlike the clustered index, there is no theoretical limit on the number of unclustered indexes that can
exist for a specific database table. There are, however, two important caveats. The first is that indexes
must be stored on disk, and generally, the larger the table, the larger the database index. Disk space
consumption can be of concern in many production environments, so the creation of indexes must be
balanced against resource scarcity.

The second, and more important, caveat is that indexes must be updated whenever the tables are
modified, whether through column updates, insertions, or deletions. For large indexes (i.e., for large
tables) these index updates are expensive and time-consuming.

The database developer therefore faces a balancing act: indexes speed up access time enormously, but they
slow down data modification. If your software application rarely performs UPDATE, INSERT, and DELETE
operations, indexes will not incur significant data modification penalties beyond the cost of their
initial creation. For applications that perform frequent updates, it is usually better to tailor the
indexes carefully and create as few of them as possible. The database developer can use the reporting
utilities in the DBMS to determine which queries are performed most frequently, along with their average
CPU times, to help make this determination.

SQL Server Automated Index Recommendations

SQL Server 2005 and SQL Server 2008 come with a utility called the Database Engine Tuning Advisor. In
SQL Server Management Studio, the database developer can highlight the query to optimize and then
click the button labeled “Database Engine Tuning Advisor.” A window should appear similar to the
following:

Figure 2

The Tuning Options tab lets the database developer choose whether the advisor may replace existing
indexes or should only consider adding new ones. The database developer should evaluate the existing
indexes to determine whether any of them should be dropped.

Once the configuration options are set, clicking the “Start Analysis” button will begin the tuning. Once
analysis is complete, a window with recommendations will be displayed along with the estimated
performance improvement for the query being optimized. These recommendations include:

1) The creation of indexes.

2) The creation of statistics.

Recommendations may be applied automatically via the Actions > Apply Recommendations menu item.

Cost Estimation
Cost estimation is the process of applying a meaningful and consistent measure of execution cost to a
particular query. Various metrics can be used for this purpose, but the most common and most relevant
metric is the number of block accesses required by the query. Since disk I/O is such an expensive
operation in terms of time consumption, our goal is to minimize the number of block accesses as much
as possible while not sacrificing functionality.

Cost Estimation for SELECT Operations

For a given SELECT query, the DBMS has a number of possible execution strategies. Here are a few that
we will discuss (the list is not complete):

1) Linear search (brute force)

2) Binary search
3) Using a primary database index / hash key to retrieve a single record

Assumptions:

Field                                        Value
Query                                        SELECT * FROM EMPLOYEE WHERE EmpID = 125
Number of EMPLOYEE records (r)               100,000
Number of disk blocks (b)                    10,000
Blocking factor (bfr) (records per block)    10

Linear Search
The DBMS must use a linear search when no database index exists on the selection condition (e.g., the
EmpID). This is precisely the type of operation that we want to prevent with proper indexing.

Given our assumptions, the cost of a linear search for this query would be:

• $b/2$ block accesses on average if the record exists

• $b$ block accesses if the record does not exist

So, for the query above, $b/2 = 10{,}000/2 = 5{,}000$ block accesses.
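
As a quick check of these figures, the following Python sketch evaluates the linear-search estimate and
compares it with a brute-force average over every possible position of the record. The function name
`linear_search_cost` is an illustrative assumption; the value of `b` comes from the assumptions table.

```python
def linear_search_cost(num_blocks: int, record_exists: bool) -> float:
    """Estimated block accesses for a linear search over num_blocks blocks."""
    return num_blocks / 2 if record_exists else float(num_blocks)

b = 10_000  # number of EMPLOYEE disk blocks

# Brute-force check: if the record sits in block k (1-based), k blocks are read.
exact_average = sum(range(1, b + 1)) / b

print(linear_search_cost(b, True))    # estimate when the record exists
print(linear_search_cost(b, False))   # estimate when it does not
print(exact_average)                  # exact mean over all block positions
```

The exact mean is (b + 1)/2 = 5,000.5, so b/2 is a close approximation for any file of realistic size.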

Binary Search
The DBMS might employ a binary search on an index with non‐unique entries. Say, for instance, that the
database developer has a nonclustered index that groups by city, then first name, then last name, and
they need to retrieve all customers who live in a certain city. The DBMS can use a binary search to find
the first matching city record and then retrieve all subsequent city records by traversing down the
database index row by row.

The cost of performing a binary search here is the same as that of a binary search anywhere
else, namely $\log_2 b$ block accesses.

For the query above, we have $\log_2 b = \log_2 10{,}000 \approx 13.3$, or 14 block accesses after rounding up.
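
The 14-access figure can be checked by counting probes directly. The sketch below, in which
`probes_to_find` is a hypothetical helper and each probe is assumed to cost one block access, runs a
textbook binary search over block positions 0 to b - 1 and records the worst case over every possible
target block.

```python
import math

def probes_to_find(target_block: int, num_blocks: int) -> int:
    """Count the block reads a binary search makes before it lands on target_block."""
    lo, hi = 0, num_blocks - 1
    probes = 0
    while lo <= hi:
        mid = (lo + hi) // 2
        probes += 1  # one block access per probe
        if mid == target_block:
            return probes
        if mid < target_block:
            lo = mid + 1
        else:
            hi = mid - 1
    raise ValueError("target_block is outside the file")

b = 10_000
worst_case = max(probes_to_find(t, b) for t in range(b))
estimate = math.ceil(math.log2(b))
print(worst_case, estimate)
```

For b = 10,000 the worst case is 14 probes, which matches the rounded-up logarithm.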

Primary Index / Hash

If the column is unique (like a primary key, for example) then the database index can be implemented as
a hash table. Such an index on the EmpID column would allow the database developer to hash directly
to the correct employee record in constant time.

In general, static and linear hashes have a cost of 1 block access.

For SELECT queries with composite selection conditions, the cost must be evaluated based on the
index structure that exists for each column involved in the selection condition, which goes into
more detail than this paper is designed to cover. Nevertheless, even for our very simple example query,
we can mathematically verify an enormous performance improvement from using a good database index.

Cost Estimation of JOIN Operations

Estimating the cost of join operations is a little more complicated than doing so for SELECT operations
because we have to consider the type of indexing available for each table that participates in the join.
We must also factor in the cost of writing the result file, but since the size of the result file remains
constant despite altering the execution strategy, we can ignore it for the basis of comparison.

Assume that in addition to the EMPLOYEE table, we also have a DEPARTMENT table with a DeptNum field
indicating the department's number.

Other assumptions:

Field                                          Value
Number of EMPLOYEE records ($r_E$)             100,000
Number of EMPLOYEE disk blocks ($b_E$)         10,000
Blocking factor ($bfr$) (records per block)    5
Number of DEPARTMENT records ($r_D$)           25
Number of DEPARTMENT disk blocks ($b_D$)       5

The query we wish to evaluate is:

1) SELECT * FROM EMPLOYEE AS E, DEPARTMENT AS D WHERE E.DeptNum = D.DeptNum;

We will compare the following execution strategies for this query:

1) Nested‐loop join (brute force)

2) Single loop join (using an index)
3) Sort‐merge join

Nested Loop
The nested-loop strategy is typically used when no indexes exist on either table for the join column (in
this case, DeptNum). The DBMS has two options here: it can use either EMPLOYEE or DEPARTMENT in the outer
loop, and the choice is very important. In general, the table with the smaller number of records should
go in the outer loop, which can drastically reduce the number of loop iterations. The DBMS will
make this determination automatically, but it is helpful to be aware of what's going on behind the
scenes when writing queries.

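Using EMPLOYEE as the outer loop (with $b_E$ and $b_D$ the numbers of EMPLOYEE and DEPARTMENT disk blocks):

$b_E + (b_E \times b_D) = 10{,}000 + (10{,}000 \times 5) = 60{,}000$ block accesses.

Using DEPARTMENT as the outer loop:

$b_D + (b_D \times b_E) = 5 + (5 \times 10{,}000) = 50{,}005$ block accesses.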

Using DEPARTMENT as the outer loop cut the number of block accesses down by about 10,000, which is
a substantial improvement.
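
The two nested-loop figures can be reproduced with a short Python sketch. It simply mirrors the
arithmetic in the text (the outer file is read once, and the whole inner file is read once per outer
block); the function name `nested_loop_cost` is a hypothetical label, not a DBMS routine.

```python
def nested_loop_cost(outer_blocks: int, inner_blocks: int) -> int:
    """Block accesses for a nested-loop join: read the outer file once,
    and read the whole inner file once per outer block."""
    return outer_blocks + outer_blocks * inner_blocks

b_E = 10_000  # EMPLOYEE disk blocks
b_D = 5       # DEPARTMENT disk blocks

employee_outer = nested_loop_cost(b_E, b_D)
department_outer = nested_loop_cost(b_D, b_E)
print(employee_outer, department_outer, employee_outer - department_outer)
```

The difference between the two orderings is 9,995 block accesses, and it comes entirely from the first
term: the product of the two block counts is the same either way.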

Single Loop (using an index)

If a primary index exists on one of the tables for the join column, then the DBMS can use this index and
perform the join with a single loop. So for instance, if a hash-type index exists for the DeptNum column
of the DEPARTMENT table, the DBMS can hash directly to the matching record rather than search for it, so
an inner loop is unnecessary. Still, the choice of which table to loop over is important. Assume that
an index exists for DeptNum for both the EMPLOYEE and DEPARTMENT tables.

Note: $h$ is the number of block accesses required to retrieve a record given its hash value. Here $h = 1$.

Using EMPLOYEE as the outer loop:

$b_E + (r_D \times h) = 10{,}000 + (25 \times 1) = 10{,}025$ block accesses.

Using DEPARTMENT as the outer loop:

$b_D + (r_E \times h) = 5 + (100{,}000 \times 1) = 100{,}005$ block accesses.
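
To make the mechanics of a single-loop join concrete, here is a minimal Python sketch. It is an
illustration only: the sample rows are invented, a dictionary stands in for the hash index on
DEPARTMENT.DeptNum, and `single_loop_join` is a hypothetical helper rather than anything a DBMS exposes.

```python
# Hypothetical sample rows; the tables in the text hold 100,000 and 25 records.
employees = [
    {"EmpID": 1, "LastName": "Lee", "DeptNum": 10},
    {"EmpID": 2, "LastName": "Ng", "DeptNum": 20},
    {"EmpID": 3, "LastName": "Diaz", "DeptNum": 10},
]
departments = [
    {"DeptNum": 10, "DeptName": "Sales"},
    {"DeptNum": 20, "DeptName": "Service"},
]

# Hash index on DEPARTMENT.DeptNum (unique, so one record per key).
dept_index = {d["DeptNum"]: d for d in departments}

def single_loop_join(outer_rows, inner_index, join_column):
    """Loop over one table only; probe the other table's hash index per row."""
    joined, probes = [], 0
    for row in outer_rows:
        probes += 1
        match = inner_index.get(row[join_column])
        if match is not None:
            joined.append({**row, **match})
    return joined, probes

result, probes = single_loop_join(employees, dept_index, "DeptNum")
for r in result:
    print(r["EmpID"], r["LastName"], r["DeptName"])
print("index probes:", probes)
```

There is no inner loop: each row of the table being looped over triggers exactly one index probe.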

Sort-Merge Join
With this strategy, the DBMS will use a traditional merge on the two sorted files. If the two files are
already sorted on the join column, then the cost is simply

$b_E + b_D = 10{,}000 + 5 = 10{,}005$ block accesses.

If the files are unsorted, the cost of sorting must be factored into the equation. We approximate the
cost of sorting as $b \log_2 b$ for a file with $b$ disk blocks.

$b_E + b_D + b_E \log_2 b_E + b_D \log_2 b_D \approx 10{,}000 + 5 + 140{,}000 + 10 = 150{,}015$ block accesses.
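
The sort-merge estimate can be sketched the same way. In the Python below, `sort_cost` and
`sort_merge_cost` are hypothetical helper names, and rounding each logarithm up to a whole number is an
assumption made for this sketch.

```python
import math

def sort_cost(num_blocks: int) -> int:
    """Approximate sorting cost b * log2(b), with log2(b) rounded up."""
    return num_blocks * math.ceil(math.log2(num_blocks))

def sort_merge_cost(b_left: int, b_right: int, already_sorted: bool) -> int:
    """Block accesses for a sort-merge join of two files."""
    merge = b_left + b_right  # one pass over each sorted file
    if already_sorted:
        return merge
    return merge + sort_cost(b_left) + sort_cost(b_right)

b_E, b_D = 10_000, 5
print(sort_merge_cost(b_E, b_D, already_sorted=True))
print(sort_merge_cost(b_E, b_D, already_sorted=False))
```

Rounding both logarithms up gives 150,020 for the unsorted case rather than 150,015, because the figure
in the text takes the logarithm for the five-block DEPARTMENT file as 2. The difference is immaterial
next to the 140,000 accesses spent sorting EMPLOYEE.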

Conclusion
After reading this paper, the database developer will hopefully have a better understanding of basic
database optimization techniques and how the DBMS formulates execution strategies for different types
of queries, even though the provided examples are very limited in scope. The database developer
should also understand the importance of creating well‐tuned indexes and what criteria go into
selecting the columns for indexing.

References
The cost estimation examples provided in this document were modified from examples provided in
Fundamentals of Database Systems 5th edition by Doctors Ramez Elmasri of the University of Texas at
Arlington and Shamkant Navathe of the Georgia Institute of Technology.
