HW4 Solutions

This document contains a homework assignment with nine questions on database systems topics, including block allocation for dense and sparse indexes, B-trees, extensible and linear hashing, grid files, and multiple-key indexes. The questions ask the student to calculate the number of blocks needed for different data structures, determine minimum node occupancy in B-trees, find a probability in extensible hashing, and identify the buckets to examine for range and nearest-neighbor queries in a grid file. Detailed answers are provided for each question.

Uploaded by

Nikhil Gupta


CSE 562

Database Systems
HW 4
Total Marks 100

Your Name:
Your email ID:
Your UB Person ID:

1. Suppose blocks hold either three records, or ten key-pointer pairs. As a function
of n, the number of records, how many blocks do we need to hold a data file and: (a)
A dense index (b) A sparse index?

Answers:

a) For a dense index we need a key-pointer pair for each record, and so will need $n/10$ blocks.
For the data, we will need $n/3$ blocks, and so the total number of blocks is $n/3 + n/10 = 13n/30$.

b) For the sparse index we need a key-pointer pair for each data block, and so will
need $n/30$ blocks. For the data, we will need $n/3$ blocks, and so the total number of blocks is $n/3 + n/30 = 11n/30$.

2. Repeat Problem 1 if blocks can hold up to 30 records or 200 key-pointer pairs, but
neither data nor index blocks are allowed to be more than 80% full.

Answers:

a) For a dense index we need a key-pointer pair for each record, and so will need $\frac{n}{200 \cdot 0.8}$ blocks.
For the data, we will need $\frac{n}{30 \cdot 0.8}$ blocks, and so the total number of blocks is $\frac{23n}{480}$.

b) For the sparse index we need a key-pointer pair for each data block, and so will
need $\frac{n}{30 \cdot 0.8 \cdot 200 \cdot 0.8}$ blocks. For the data, we will need $\frac{n}{30 \cdot 0.8}$ blocks, and so the total number of
blocks is $\frac{161n}{3840}$.
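The block counts in Problems 1 and 2 can be checked with exact rational arithmetic. The following is only a sketch for illustration: the function name `total_blocks` and its parameters are hypothetical, and it assumes n is large enough that rounding up to whole blocks can be ignored, as the answers above do.

```python
from fractions import Fraction

def total_blocks(records_per_block, pairs_per_block, fill=Fraction(1), dense=True):
    """Return the coefficient c such that data file + index occupy c * n blocks.

    fill is the maximum allowed fraction of a block that may be used.
    """
    # Data blocks needed per record.
    data = Fraction(1) / (records_per_block * fill)
    # A dense index has one entry per record; a sparse index one per data block.
    entries = Fraction(1) if dense else data
    # Index blocks needed per record.
    index = entries / (pairs_per_block * fill)
    return data + index

# Problem 1: 3 records or 10 key-pointer pairs per block, blocks may be full.
print(total_blocks(3, 10, dense=True))     # 13/30
print(total_blocks(3, 10, dense=False))    # 11/30

# Problem 2: 30 records or 200 pairs per block, at most 80% full.
print(total_blocks(30, 200, Fraction(4, 5), dense=True))    # 23/480
print(total_blocks(30, 200, Fraction(4, 5), dense=False))   # 161/3840
```

Using `Fraction` rather than floating point keeps the coefficients in the same form as the written answers.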

3. Suppose that blocks can hold either ten records or 99 keys and 100 pointers. Also
assume that the average B-tree node is 70% full; i.e., it will have 69 keys and 70
pointers. We can use B-trees as part of several different structures. For each structure
described below, determine
(i) the total number of blocks needed for a 1,000,000-record file, and
(ii) the average number of disk I/O’s to retrieve a record given its search key. You
may assume nothing is in memory initially, and the search key is the primary key for
the records.

a) The data file is a sequential file, sorted on the search key, with 10 records per block.
The B-tree is a dense index.
b) The same as (a), but the data file consists of records in no particular order, packed
10 to a block.
c) The same as (a), but the B-tree is a sparse index.
d) Instead of the B-tree leaves having pointers to data records, the B-tree leaves hold
the records themselves. A block can hold ten records, but on average, a leaf block is
70% full; i.e., there are seven records per leaf block.
e) The data file is a sequential file, and the B-tree is a sparse index, but each primary
block of the data file has one overflow block. On average, the primary block is full,
and the overflow block is half full. However, records are in no particular order within
a primary block and its overflow block.

answer:
1. (Part a) We would need $1000000/10 = 100000$ blocks for the data + $\lceil 1000000/69 \rceil = 14493$ blocks for
the leaf nodes + $\lceil 14493/70 \rceil = 208$ blocks for the next B-tree level, $\lceil 208/70 \rceil = 3$ for the next
level, and one block for the root. The total would be 114705 blocks. We would need 5
I/O’s (4 for the B-tree levels + data page).
2. (Part b) Same as (a).
3. (Part c) We would need $1000000/10 = 100000$ blocks for the data + $\lceil 100000/69 \rceil = 1450$ blocks for the
leaf nodes + $\lceil 1450/70 \rceil = 21$ blocks for the next B-tree level, and one block for the root.
The total would be 101472 blocks. We would need 4 I/O’s (3 for the B-tree levels +
data page).
4. (Part d) We would need $\lceil 1000000/7 \rceil = 142858$ blocks for the leaf nodes + $\lceil 142858/70 \rceil = 2041$ blocks
for the next level + $\lceil 2041/70 \rceil = 30$ blocks for the next level, and one block for the root.
The total would be 144930 blocks. We would need 4 I/O’s (4 for the B-tree levels).
5. (Part e) We would need $\lceil 1000000/15 \rceil = 66667$ blocks for the primary data + 66667 for the overflow
blocks, $\lceil 66667/69 \rceil = 967$ blocks for the leaf nodes + $\lceil 967/70 \rceil = 14$ blocks for the next B-tree
level, and one block for the root. The total would be 134316 blocks. We would need 3
I/O’s for the B-tree levels + an average of $1 + 1 \cdot \frac{1}{3}$ for the data blocks, for a total of $4\frac{1}{3}$.
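The level-by-level counts in Problem 3 follow one pattern: divide the number of entries by the leaf capacity, then keep dividing by 70 pointers per node until a single root remains. The sketch below reproduces the totals under that reading; the helper names `ceil_div` and `btree_levels` are hypothetical and chosen only for this illustration.

```python
def ceil_div(a, b):
    """Integer ceiling of a / b, avoiding floating-point rounding."""
    return -(-a // b)

def btree_levels(leaf_entries, per_leaf=69, pointers_per_node=70):
    """Blocks per B-tree level, leaves first, ending with a single root."""
    levels = [ceil_div(leaf_entries, per_leaf)]
    while levels[-1] > 1:
        levels.append(ceil_div(levels[-1], pointers_per_node))
    return levels

data_blocks = 1_000_000 // 10

# (a) dense index: one leaf entry per record.
dense = btree_levels(1_000_000)
print(dense, data_blocks + sum(dense))            # [14493, 208, 3, 1] 114705

# (c) sparse index: one leaf entry per data block.
sparse = btree_levels(data_blocks)
print(sparse, data_blocks + sum(sparse))          # [1450, 21, 1] 101472

# (d) leaves hold the records themselves, 7 per leaf on average.
clustered = btree_levels(1_000_000, per_leaf=7)
print(clustered, sum(clustered))                  # [142858, 2041, 30, 1] 144930

# (e) sparse index over primary blocks; 15 records per primary + overflow pair.
primary = ceil_div(1_000_000, 15)
overflow_case = btree_levels(primary)
print(overflow_case, 2 * primary + sum(overflow_case))   # [967, 14, 1] 134316
```

The number of B-tree I/O's in each case is the length of the returned list; the data-block accesses are added on top as described in the answers.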

4. What are the minimum numbers of keys and pointers in B-tree (i) interior nodes
and (ii) leaves, when:
a) n = 10; i.e., a block holds 10 keys and 11 pointers.
b) n = 11; i.e., a block holds 11 keys and 12 pointers.

answer:

a) 5 keys and 6 pointers for the interior nodes, 5 keys and 5 pointers in the leaf nodes.

b) 5 keys and 6 pointers for the interior nodes, 6 keys and 6 pointers in the leaf nodes.
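The answers to Problem 4 are consistent with the usual B-tree occupancy rules: an interior node needs at least ⌈(n+1)/2⌉ pointers (and one fewer key), and a leaf needs at least ⌊(n+1)/2⌋ keys with one data pointer per key. The sketch below encodes those rules as an assumption; the function name `btree_minimums` is hypothetical, and the leaf count excludes the pointer to the next leaf.

```python
def btree_minimums(n):
    """Minimum (keys, pointers) for interior nodes and for leaves.

    n is the maximum number of keys per block (a block holds n + 1 pointers).
    """
    interior_pointers = (n + 2) // 2      # ceil((n + 1) / 2)
    interior_keys = interior_pointers - 1
    leaf_keys = (n + 1) // 2              # floor((n + 1) / 2)
    leaf_pointers = leaf_keys             # one data pointer per key
    return (interior_keys, interior_pointers), (leaf_keys, leaf_pointers)

print(btree_minimums(10))   # ((5, 6), (5, 5))
print(btree_minimums(11))   # ((5, 6), (6, 6))
```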

5. In an extensible hash table with n records per block, what is the probability that
an overflowing block will have to be handled recursively; i.e., all members of the block
will go into the same one of the two blocks created in the split?

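answer: In order for all members of the block to go to the same created block, they must
have the same bit at the (j + 1)st position. The probability that all n+1 records have one
particular bit value at that position is $\left(\frac{1}{2}\right)^{n+1}$. Since the shared bit may be either 0 or 1,
the probability that all n+1 records agree is $2 \cdot \left(\frac{1}{2}\right)^{n+1} = \left(\frac{1}{2}\right)^{n}$.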
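A brute-force enumeration can serve as a sanity check for Problem 5. The sketch below assumes the (j + 1)st bits of the n + 1 records are independent and equally likely to be 0 or 1; the function name `same_side_probability` is hypothetical. Each specific pattern (all zeros, or all ones) has probability (1/2)^(n+1), and the enumeration counts both patterns.

```python
from fractions import Fraction
from itertools import product

def same_side_probability(n):
    """Probability that all n + 1 records share the same (j + 1)st bit."""
    outcomes = list(product((0, 1), repeat=n + 1))
    # Count assignments in which every record has the same bit value.
    all_equal = sum(1 for bits in outcomes if len(set(bits)) == 1)
    return Fraction(all_equal, len(outcomes))

for n in (2, 3, 4):
    print(n, same_side_probability(n))
# 2 1/4
# 3 1/8
# 4 1/16
```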

6. Suppose keys are hashed to four-bit sequences, as in our examples of extensible
and linear hashing in this section. However, also suppose that blocks can hold three
records, rather than the two-record blocks of our examples. If we start with a hash
table with two empty blocks (corresponding to 0 and 1), show the organization after
we insert records with hashed keys:
a) 0000, 0001, ..., 1111, and the method of hashing is extensible hashing.
b) 0000, 0001, ..., 1111, and the method of hashing is linear hashing with a capacity
threshold of 100%.
c) 1111, 1110, ..., 0000, and the method of hashing is extensible hashing.
d) 1111, 1110, ..., 0000, and the method of hashing is linear hashing with a capacity
threshold of 75%.

answer:

1. (Part a, extensible hashing)

Bucket  Records
000     0000, 0001
001     0010, 0011
010     0100, 0101
011     0110, 0111
100     1000, 1001
101     1010, 1011
110     1100, 1101
111     1110, 1111

2. (Part b, linear hashing) i = 3, n = 6, r = 16

Bucket  Primary block       Overflow block
000     0000, 1000
001     0001, 1001
010     0010, 0110, 1010    1110
011     0011, 0111, 1011    1111
100     0100, 1100
101     0101, 1101

3. (Part c, extensible hashing)

Bucket  Records
000     0001, 0000
001     0011, 0010
010     0101, 0100
011     0111, 0110
100     1001, 1000
101     1011, 1010
110     1101, 1100
111     1111, 1110

4. (Part d, linear hashing) i = 3, n = 8, r = 16

Bucket  Records
000     1000, 0000
001     1001, 0001
010     1010, 0010
011     1011, 0011
100     1100, 0100
101     1101, 0101
110     1110, 0110
111     1111, 0111
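The linear-hashing results for parts (b) and (d) of Problem 6 can be reproduced with a short simulation. The sketch below assumes the textbook-style scheme in which a key is placed by its low-order i bits, a key whose bucket number does not exist yet goes to the bucket with the leading bit cleared, and one bucket is added whenever r exceeds threshold × n × capacity. The function name `linear_hash` is hypothetical, and each bucket is modeled as one list, so a list longer than the block capacity stands for a primary block plus overflow.

```python
from fractions import Fraction

def linear_hash(keys, capacity, threshold):
    """Insert keys into a linear hash table; return (i, n, r, buckets)."""
    i, n, r = 1, 2, 0               # bits in use, buckets, records
    buckets = [[], []]
    for key in keys:
        m = key & ((1 << i) - 1)    # low-order i bits
        if m >= n:                  # bucket m does not exist yet
            m -= 1 << (i - 1)       # clear the leading bit
        buckets[m].append(key)
        r += 1
        if r > threshold * n * capacity:
            if n == (1 << i):       # all i-bit bucket numbers are in use
                i += 1
            source = n - (1 << (i - 1))     # bucket that splits into bucket n
            mask = (1 << i) - 1
            old = buckets[source]
            buckets[source] = [k for k in old if (k & mask) == source]
            buckets.append([k for k in old if (k & mask) != source])
            n += 1
    return i, n, r, buckets

def show(result):
    i, n, r, buckets = result
    print(f"i = {i}, n = {n}, r = {r}")
    for number, records in enumerate(buckets):
        print(format(number, f"0{i}b"), [format(k, "04b") for k in records])

show(linear_hash(range(16), capacity=3, threshold=Fraction(1)))               # part (b)
show(linear_hash(range(15, -1, -1), capacity=3, threshold=Fraction(3, 4)))    # part (d)
```

Passing the threshold as a `Fraction` keeps the comparison exact; with these inputs the simulation ends with i = 3, n = 6 for part (b) and i = 3, n = 8 for part (d), matching the tables above.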

7. Suppose we store a relation R (x,y) in a grid file. Both attributes have a range of
values from 0 to 1000. The partitions of this grid file happen to be uniformly spaced;
for x there are partitions every 20 units, at 20, 40, 60, and so on, while for y the
partitions are every 50 units, at 50, 100, 150, and so on.
a) How many buckets do we have to examine to answer the range query
SELECT * FROM R
WHERE 310 < x AND x < 400 AND 520 < y AND y < 730
b) We wish to perform a nearest-neighbor query for the point (110,205). We begin
by searching the bucket with lower-left corner at (100,200) and upper-right corner at
(120,250), and we find that the closest point in this bucket is (115,220). What other
buckets must be searched to verify that this point is the closest?

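answer: (a) 25. The x range overlaps the 5 partitions between 300 and 400, and the y range
overlaps the 5 partitions between 500 and 750, so 5 · 5 = 25 buckets must be examined.
(b) The distance from (110,205) to (115,220) is $\sqrt{5^2 + 15^2} \approx 15.8$, so the other buckets
that need to be examined, identified by their lower-left corners, are: (80,200),
(120,200), (80,150), (100,150), (120,150).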

8. Suppose we have a relation R (x ,y ,z), where the pair of attributes x and y together
form the key. Attribute x ranges from 1 to 100, and y ranges from 1 to 1000. For each
x there are records with 100 different values of y, and for each y there are records
with 10 different values of x. Note that there are thus 10,000 records in R. We wish
to use a multiple-key index that will help us to answer queries of the form:
SELECT z
FROM R
WHERE x = C AND y = D;
where C and D are constants. Assume that blocks can hold ten key-pointer pairs, and
we wish to create dense indexes at each level, perhaps with sparse higher-level indexes
above them, so that each index starts from a single block. Also assume that initially
all index and data blocks are on disk.
a) How many disk I/O’s are necessary to answer a query of the above form if the first
index is on x?
b) How many disk I/O’s are necessary to answer a query of the above form if the
first index is on y?
c) Suppose you were allowed to buffer 11 blocks in memory at all times. Which blocks
would you choose, and would you make x or y the first index, if you wanted to minimize
the number of additional disk I/O’s needed?

answer:

1. For the dense index on x we would need 100/10 = 10 blocks, and so for the sparse
index of x we would need 1 block. Therefore, we need two disk I/O’s to get to the
y index. For each x there are 100 y values and so the dense index on y will need 10
blocks, and the sparse index will need one block. We would need two disk I/O’s to get
to the y value. The total number of I/O’s is then 2+2+1(for the data) = 5.
2. For the dense index on y we would need 1000/10 = 100 blocks, and so for the sparse
index on y we would need two levels (10 blocks and 1 block). Therefore, we need 3 disk
I/O’s to get to the x index. For each y there are 10 values of x and so we just need a
one block dense index for x. The total disk I/O’s is then 3+1+1 (for the data) = 5.
3. We would want to buffer the top 11 blocks of the tree (root + 10 intermediate blocks
of the next level after the root). This means that picking x as the first index is better
since the whole index would be in memory and the queries where predicate x = C is
false could be answered without any additional I/O’s.
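The I/O counts in parts (a) and (b) of Problem 8 come from counting index levels: a dense level with one key-pointer pair per distinct value, followed by sparse levels until a single block remains. The sketch below shows that count; the function name `index_levels` is hypothetical, and the fanout of ten pairs per block is taken from the problem statement.

```python
def index_levels(entries, fanout=10):
    """Levels (one block read each) from a single root block down to the dense level."""
    levels = 1
    blocks = -(-entries // fanout)      # blocks in the dense level (ceiling division)
    while blocks > 1:
        blocks = -(-blocks // fanout)   # one sparse level above the previous one
        levels += 1
    return levels

# (a) x first: 100 distinct x values, then 100 y values for the chosen x, then the data block.
x_first = index_levels(100) + index_levels(100) + 1
# (b) y first: 1000 distinct y values, then 10 x values for the chosen y, then the data block.
y_first = index_levels(1000) + index_levels(10) + 1

print(x_first, y_first)   # 5 5
```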

9. For the structure of Problem 8, how many disk I/O’s are required to answer the
range query in which 20 < x < 35 and 200 < y < 350? Assume data is distributed
uniformly; i.e., the expected number of points will be found within any given range.

answer: To evaluate $20 \le x \le 35$ we need to read the root, then 3 blocks for the range (11-20,
21-30, 31-40). That’s 4 I/O’s. For each of the qualifying x values we need to evaluate
$200 \le y \le 350$, which means reading the root block and then 3 blocks for the range (101-200,
201-300, 301-400). The total is then 4 + 1·4 + 10·4 + 10·4 = 88.
