Index On The Search Key, and Heap Files With An Unclustered Hash Index. Briefly Discuss The

This document discusses database indexing and performance. It includes: 1) Questions about sorting data, clustered vs unclustered indexes, and different file organizations for various database operations. 2) A scenario involving professors and departments where indexes would help optimize specific queries. 3) Questions about hash indexing, linear hashing, and its performance benefits over tree indexes for tables with few inserts and frequent lookups by item ID. 4) An assignment to implement a student database with indexing to improve query performance, and report on the findings.

Uploaded by

Faizan Bashir Sidhu

Part 1: Concepts and Principles

Question 1
A. Briefly explain the three main alternatives for storing information in a data entry of an
index.

B. Define clustered index, and discuss the relation between the three alternatives and
clustered/unclustered indexes.

Question 2
Consider the following file organizations: sorted files, heap files with an unclustered tree
index on the search key, and heap files with an unclustered hash index. Briefly discuss the
suitability of each of these file organizations for performing the following operations: file
scans, range selections, inserts, and deletes.

Question 3
A. Briefly describe the two internal organizations for heap files (using lists versus
directory of pages).
B. Explain which organization you would choose if records are variable in length.

Question 4
Compare ISAM and B+ Tree indexes. Briefly explain their differences in handling Search,
Insert and Delete, and discuss when you would use ISAM and when you would use a B+
Tree index.

Question 5
Does the final structure of a B+ tree depend on the order in which the terms are added to
it? Explain your answer using an illustrative example.

Question 6
Explain how extendible hashing uses a directory of buckets and discuss the global depth
of the index and local depth of a bucket.
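
As a rough illustration of the directory mechanism this question refers to, the sketch below models an extendible hash index in memory. It is a simplification under stated assumptions: keys are distinct integers, the low-order bits of the key serve as the hash value, and each bucket holds two keys. The names Bucket and ExtendibleHash are hypothetical and are not taken from the assignment or from any DBMS.

```python
class Bucket:
    def __init__(self, local_depth, capacity):
        self.local_depth = local_depth  # bits this bucket's keys agree on
        self.capacity = capacity
        self.keys = []


class ExtendibleHash:
    """Toy in-memory model; assumes distinct integer keys."""

    def __init__(self, capacity=2):
        self.global_depth = 1
        self.capacity = capacity
        # The directory always has 2**global_depth slots.
        self.directory = [Bucket(1, capacity), Bucket(1, capacity)]

    def _slot(self, key):
        # The last global_depth bits of the key select the directory slot.
        return key & ((1 << self.global_depth) - 1)

    def search(self, key):
        return key in self.directory[self._slot(key)].keys

    def insert(self, key):
        bucket = self.directory[self._slot(key)]
        if len(bucket.keys) < bucket.capacity:
            bucket.keys.append(key)
            return
        # The bucket is full. Double the directory only when the bucket
        # already uses every bit the directory distinguishes.
        if bucket.local_depth == self.global_depth:
            self.directory = self.directory + self.directory
            self.global_depth += 1
        # Split the bucket on one additional bit.
        bucket.local_depth += 1
        image = Bucket(bucket.local_depth, self.capacity)
        high_bit = 1 << (bucket.local_depth - 1)
        for i, b in enumerate(self.directory):
            if b is bucket and i & high_bit:
                self.directory[i] = image
        # Redistribute the old keys plus the new one; this may split again.
        pending, bucket.keys = bucket.keys + [key], []
        for k in pending:
            self.insert(k)


table = ExtendibleHash(capacity=2)
for key in range(8):
    table.insert(key)
print(table.global_depth, len(table.directory))  # prints: 2 4
```

In this run the directory doubles once (global depth 1 to 2), while the second split reuses the existing directory because the overflowing bucket's local depth was still below the global depth.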

Part 2: Design Considerations for Application

Question 1

Consider the following relations:

Professor (profid: integer, name: varchar, salary: integer, age: integer, depid: integer)
Department (did: integer, budget: integer, location: varchar, mgr_eid: integer)

Salaries range from $30,000 to $100,000, ages vary from 20 to 80, each department has about 20
employees on average, there are 10 locations, and budgets vary from $100,000 to $1 million.
You can assume uniform distributions of values.

For each of the following queries, what index would you choose to speed up the query? If your
database system does not consider index-only plans (i.e., data records are always retrieved even
if enough information is available in the index entry), how would your answer change? Explain
briefly.

A. Query1: Print name, age, and salary for all professors.

B. Query2: Find the dids of departments that are located in Edmonton and have a budget of
more than $150,000.

Question 2

The CVT Company is a leader in the manufacture of work clothes. You are hired as database
administrator for the company and your IT supervisor asked you to solve a retrieval speed
problem they used to have with a large file for item records. Your supervisor mentioned that they
have sorted the file but the problem didn’t improve, so they need to create a B+ tree index to
solve the problem. Your supervisor outlined the way to do it: “The best way to accomplish this
task is to scan the file, record by record, inserting each one using the B+ tree insertion
procedure.” Being a fresh graduate, you noticed that since the file is already sorted there is a
better way to do it.

a. What performance and storage utilization problems are there with your
supervisor’s approach?
b. Explain how the bulk-loading algorithm provides a better alternative than the
proposed scheme.
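
To make the idea of bulk loading concrete, the following is a minimal sketch that builds the levels of a B+ tree bottom-up from keys that are already sorted. It is an illustration only: the function name bulk_load, the node capacities, and the representation of an internal node as the list of the smallest keys of its children are all assumptions made for this example. A real bulk loader would normally honour a fill factor and minimum-occupancy rules, which this sketch ignores.

```python
def bulk_load(sorted_keys, leaf_capacity=4, fanout=4):
    """Build B+ tree levels bottom-up from already-sorted keys.

    Returns a list of levels: levels[0] is the leaf level and levels[-1]
    contains the single root node. Internal nodes are simplified to the
    list of smallest keys found under each of their children.
    """
    if not sorted_keys:
        return [[[]]]
    # Pack the leaves left to right in one sequential pass over the input.
    leaves = [sorted_keys[i:i + leaf_capacity]
              for i in range(0, len(sorted_keys), leaf_capacity)]
    levels = [leaves]
    # Group up to `fanout` children under each parent until one node remains.
    while len(levels[-1]) > 1:
        children = levels[-1]
        parents = [[child[0] for child in children[i:i + fanout]]
                   for i in range(0, len(children), fanout)]
        levels.append(parents)
    return levels


levels = bulk_load(list(range(20)))
print(len(levels[0]), levels[-1])  # prints: 5 [[0, 16]]
```

Each leaf is written once and filled completely, and no root-to-leaf traversal or node split takes place, which is the contrast with inserting the records one at a time.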

Question 3

Your team in charge of database administration was discussing different alternatives for indexing
your organization’s databases. Some tables in one database have very few insertions but they are
used intensively by different services to check for information about items using the item_ID
number. While many of your colleagues proposed using a tree index, you argued for a Hash
index for these tables because it provides an average-case search cost of only slightly more than
one disk I/O. The team leader agrees to adopt your solution but has asked you to write a short
explanation for two questions:

a. How does Linear Hashing provide an average-case search cost of only slightly
more than one disk I/O, given that overflow buckets are part of its data
structure? (6 marks)
b. If a Linear Hashing index using Alternative (1) for data entries contains 10,000
records, with 10 records per page and an average storage utilization of 80 percent,
what is the worst-case cost for an equality search? Under what conditions would
this cost be the actual search cost? (6 marks)
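
The page count implied by the figures in part (b) can be checked with a few lines of arithmetic. This is only a sketch of the calculation: treating the total page count as an upper bound on the length of a single overflow chain assumes that every record could hash to the same bucket, which is an interpretation and not something the question states.

```python
import math

records = 10_000
records_per_page = 10
utilization = 0.80

# Average number of records actually stored on each page.
avg_records_per_page = records_per_page * utilization
# Total number of pages (primary plus overflow) needed for all records.
total_pages = math.ceil(records / avg_records_per_page)
print(total_pages)  # prints: 1250
```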

Part 3: Implementation Case

Consider the following database schema with the following relations:

Student (SID, Name, Address, Telephone, Age)

Course (CourseNo, Title, Department, NumberOfCredits, CourseFees)
Registration (SID, CourseNo, startDate, CompleteDate, Grade)

Consider the following queries:

o List the student numbers and names of students who received a grade greater than or
equal to 70% in the course “COMP418,” sorted by age ascending.
o List the course numbers and titles of courses that have more than 10 students
getting a grade lower than 50. (Use GROUP BY CourseNo and COUNT(SID).)
o List the course numbers and titles of courses whose course fees are between 400
and 600 dollars.
o List all courses in the database.
o Update all the course fees by adding 6 dollars to each course.
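
One possible way to phrase these queries in SQL is sketched below. It uses Python's built-in sqlite3 module purely as a stand-in so that the example is self-contained; the assignment itself calls for PostgreSQL or another listed DBMS. The column types, the composite primary key on Registration, the course titles, and all sample rows are assumptions invented for the illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE Student (SID INTEGER PRIMARY KEY, Name TEXT, Address TEXT,
                      Telephone TEXT, Age INTEGER);
CREATE TABLE Course (CourseNo TEXT PRIMARY KEY, Title TEXT, Department TEXT,
                     NumberOfCredits INTEGER, CourseFees REAL);
CREATE TABLE Registration (SID INTEGER REFERENCES Student(SID),
                           CourseNo TEXT REFERENCES Course(CourseNo),
                           startDate TEXT, CompleteDate TEXT, Grade REAL,
                           PRIMARY KEY (SID, CourseNo));
""")

# Made-up sample data: 12 students, where student i is 40 - i years old.
conn.executemany("INSERT INTO Student VALUES (?, ?, '', '', ?)",
                 [(i, f"S{i}", 40 - i) for i in range(1, 13)])
conn.executemany("INSERT INTO Course VALUES (?, ?, 'CS', 3, ?)",
                 [("COMP418", "Database Management", 500.0),
                  ("COMP200", "Intro to Databases", 300.0)])
# Every student scores 40 in COMP200; three students also take COMP418.
conn.executemany("INSERT INTO Registration VALUES (?, 'COMP200', '', '', 40)",
                 [(i,) for i in range(1, 13)])
conn.executemany("INSERT INTO Registration VALUES (?, 'COMP418', '', '', ?)",
                 [(1, 90), (2, 65), (3, 75)])

q1 = conn.execute("""
    SELECT s.SID, s.Name
    FROM Student s JOIN Registration r ON r.SID = s.SID
    WHERE r.CourseNo = 'COMP418' AND r.Grade >= 70
    ORDER BY s.Age ASC""").fetchall()

q2 = conn.execute("""
    SELECT c.CourseNo, c.Title
    FROM Course c JOIN Registration r ON r.CourseNo = c.CourseNo
    WHERE r.Grade < 50
    GROUP BY c.CourseNo, c.Title
    HAVING COUNT(r.SID) > 10""").fetchall()

q3 = conn.execute("""
    SELECT CourseNo, Title FROM Course
    WHERE CourseFees BETWEEN 400 AND 600""").fetchall()

q4 = conn.execute("SELECT * FROM Course").fetchall()

# The update runs last so that the fee range query above sees original fees.
conn.execute("UPDATE Course SET CourseFees = CourseFees + 6")
fees = conn.execute(
    "SELECT CourseNo, CourseFees FROM Course ORDER BY CourseNo").fetchall()
```

The predicates in the WHERE clauses (CourseNo, Grade, CourseFees) and the join columns are the natural places to start when deciding which indexes might support each query.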

Your task is to implement this database using PostgreSQL or any other DBMS from the following
list (Oracle, MySQL, DB2, SQL Server), and then compare the performance of the system before
and after creating the indexes. Make sure that you create indexes that support the
queries.

o You should use test data to identify performance issues: the more data, the better.
Make sure there is sufficient test data in your system to be able to run queries that
can return at least a dozen rows of data even when using the queries. Unless there
is a fair amount of test data, you will not be able to see much difference in query
execution times.
o Decide on the type of indexing that would be most appropriate. This will require
you to read about the different indexing options in your DBMS. The PostgreSQL 9
manual on the subject is available
at http://www.postgresql.org/docs/9.1/interactive/indexes.html. Most DBMSs,
including PostgreSQL, provide an 'ANALYZE,' 'EXPLAIN' or similar command
that can be used to help tune your database and make recommendations on
indexing that you may find very useful.
o Check the performance of the queries before adding indexes. If using PostgreSQL,
you will likely find the EXPLAIN command useful in accomplishing this. There is
a visual EXPLAIN tool available as part of some versions of PgAdmin that you
may want to try. Information on reading and interpreting the results as well as on
how to use the tool is available at
http://www.postgresonline.com/journal/index.php?/archives/27-Reading-PgAdmin-Graphical-Explain-Plans.html
o Add your indexes. You will probably be using the CREATE INDEX command for
this, but do feel free to use other DBMS tools if they are available. For a better
analysis you may want to add one index at a time and check performance changes
between each addition to discover the cumulative effects of each index.
o Check the performance again, and record the results. If you have the time and
inclination, it would be informative to experiment more with your DBMS to
discover what differences different kinds of indexes make to different queries. If
you have enough test data, you may find considerable differences in performance
as a result.
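
The before-and-after procedure described in the steps above can be sketched as follows. The example uses Python's built-in sqlite3 module and SQLite's EXPLAIN QUERY PLAN only as a self-contained stand-in; in PostgreSQL the analogous check would use EXPLAIN or EXPLAIN ANALYZE. The table contents, the row count, and the index name idx_course_fees are assumptions made for the illustration, and actual timings will vary by machine.

```python
import sqlite3
import time

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Course (CourseNo TEXT, Title TEXT, CourseFees REAL)")
# Synthetic test data: 100,000 courses with fees cycling through 0..1999.
conn.executemany(
    "INSERT INTO Course VALUES (?, ?, ?)",
    [(f"C{i:06d}", f"Course {i}", float(i % 2000)) for i in range(100_000)],
)

QUERY = "SELECT CourseNo, Title FROM Course WHERE CourseFees BETWEEN 400 AND 600"


def plan(sql):
    # The last column of each EXPLAIN QUERY PLAN row is a readable description.
    return " | ".join(row[-1] for row in conn.execute("EXPLAIN QUERY PLAN " + sql))


def timed(sql):
    # Return the number of rows fetched and the elapsed wall-clock seconds.
    start = time.perf_counter()
    rows = conn.execute(sql).fetchall()
    return len(rows), time.perf_counter() - start


plan_before = plan(QUERY)
count_before, secs_before = timed(QUERY)

conn.execute("CREATE INDEX idx_course_fees ON Course (CourseFees)")

plan_after = plan(QUERY)
count_after, secs_after = timed(QUERY)

print("before:", plan_before, f"{secs_before:.4f}s")
print("after: ", plan_after, f"{secs_after:.4f}s")
```

Recording both the plan text and the elapsed time for each query, once before and once after each CREATE INDEX, gives the raw material for the comparison table requested in the report.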

Write a short report (1-2 pages maximum) that summarizes your findings during the experiment.
The report should include:

o A description of your implementation of the database. Include the SQL code for
implementing the tables. How many records did you enter in each table?
o A description of the execution time of the queries before creating the indexes.
o A description of the created indexes, and a justification of why you think those
indexes would improve the system performance for the specified queries.
o A table comparing the execution times before and after indexing for each query.
