Project #2 - Extendible Hash Index
CMU 15-445/645
Overview
The second programming project is to implement a disk-backed hash table for the BusTub DBMS. Your
hash table is responsible for fast data retrieval without having to search through every record in a database
table.
You will need to implement a hash table using the extendible hashing scheme.
This index comprises a directory page that contains pointers to bucket pages. The table will access pages
through your buffer pool from Project #1. The table contains a directory page that stores all the metadata
for the table and buckets. Your hash table needs to support bucket splitting/merging for full/empty buckets,
and directory expansion/contraction for when the global depth must change.
You will need to complete the following tasks in your hash table implementation:
Page Layouts
Extendible Hashing Implementation
Concurrency Control
Project Specification
Like the first project, we are providing you with stub classes that contain the API that you need to implement. You should not modify the signatures of the pre-defined functions in these classes. If you do, it will break the test code that we will use to grade your assignment, and you will end up getting no credit for the project. If a class already contains certain member variables, you should not remove them. You may, however, add private helper functions and member variables to these classes in order to correctly realize the functionality.
The correctness of the extendible hash table index depends on the correctness of your buffer pool
implementation. We will not provide solutions for the previous programming projects.
To support reading/writing hash table buckets on top of pages, you will implement two Page classes to
store the data of your hash table. This is meant to teach you how to allocate memory from the
BufferPoolManager as pages.
The bucket_page_ids_ array maps bucket ids to page_id_t ids. The ith element in bucket_page_ids_ is the
page_id for the ith bucket.
You must implement your Hash Table Directory Page in the designated files. You are only allowed to modify
the directory file ( src/include/storage/page/hash_table_directory_page.h) and its corresponding source
file ( src/storage/page/hash_table_directory_page.cpp). The Directory and Bucket pages are fully separate
from the LinearProbeHashTable's Header and Block pages, so please make sure that you are editing the
correct files.
`occupied_` : The ith bit of `occupied_` is 1 if the ith index of `array_` has ever been occupied.
`readable_` : The ith bit of `readable_` is 1 if the ith index of `array_` holds a readable value.
`array_` : The array that holds the key-value pairs.
The number of slots available in a Hash Table Bucket Page depends on the types of the keys and values
being stored. You only need to support fixed-length keys and values. The size of keys/values will be the
same within a single hash table instance, but you cannot assume that they will be the same for all
instances (e.g., hash table #1 can have 32-bit keys and hash table #2 can have 64-bit keys).
You must implement your Hash Table Bucket Page in the designated files. You are only allowed to modify
the header file ( src/include/storage/page/hash_table_bucket_page.h ) and its corresponding source file
( src/storage/page/hash_table_bucket_page.cpp ). The Directory and Bucket pages are fully separate from
the LinearProbeHashTable's Header and Block pages, so please make sure that you are editing the correct
files.
Each Hash Table Directory/Bucket page corresponds to the content (i.e., the byte array data_) of a memory page fetched by the buffer pool. Every time you read or write a page, you must first fetch the page from the buffer pool using its unique page_id, then reinterpret_cast it to either a directory or a bucket page, and unpin the page after any reading or writing operations.
We have provided various helpers, or documentation suggesting helper functions. The only functions you
must implement are as follows.
You are free to design and implement additional new functions as you see fit. However, you must be
careful to watch for name collisions. These should be rare, but would arise in Gradescope as a compiler
error.
Your hash table must support both unique and non-unique keys. Duplicate values for the same key are not
allowed. This means that (key_0, value_0) and (key_0, value_1) can exist in the same hash table, but not
(key_0, value_0) and (key_0, value_0) . The Insert method only returns false if it tries to insert an
existing key-value pair.
Your hash table implementation must hide the details of the key/value type and associated comparator, like
this:
template <typename KeyType, typename ValueType, typename KeyComparator>
class ExtendibleHashTable {
// ---
};
`KeyType`: The type of each key in the hash table. This will only be GenericKey; the actual size of GenericKey is specified and instantiated with a template argument and depends on the data type of the indexed attribute.
`ValueType`: The type of each value in the hash table. This will only be 64-bit RID.
`KeyComparator`: The class used to compare whether two KeyType instances are less/greater-than
each other. These will be included in the KeyType implementation files. Variables with the
`KeyComparator` type are essentially functions; for instance, given two keys `KeyType key1` and
`KeyType key2`, and a key comparator `KeyComparator cmp`, you can compare the keys via
`cmp(key1, key2)`.
Note that you can equality-test ValueType instances simply using the == operator.
Directory Indexing
When inserting into your hash index, you will want to use the least-significant bits for indexing into the
directory. Of course, it is possible to use the most-significant bits correctly, but using the least-significant
bits makes the directory expansion operation much simpler.
Splitting Buckets
You must split a bucket if there is no room for insertion. You may split as soon as the bucket becomes full, if you find that easier. However, the reference solution splits only when an insertion would overflow a page, so you may find that the provided API is more amenable to this approach. As always, you are welcome to factor out your own internal API.
Merging Buckets
Merging must be attempted when a bucket becomes empty. There are ways to merge more aggressively by
checking the occupancy of buckets and their split images, but these expensive checks and extra merges
can increase thrashing.
To keep things relatively simple, we provide the following rules for merging:
1. Only empty buckets can be merged.
2. Buckets can only be merged with their split image if their split image has the same local depth.
3. Buckets can only be merged if their local depth is greater than 0.
If you are confused about a "split image," please review the algorithm and code documentation. The concept falls out quite naturally.
Directory Growing
There are no fancy rules for this. You either have to grow the directory, or you don't.
Directory Shrinking
Only shrink the directory if the local depth of every bucket is strictly less than the global depth of the directory. There are other conditions under which one could shrink the directory, but this check is trivial to implement, since we keep the local depths in the directory page.
Performance
An important performance detail is to take write locks and latches only when they are needed. Always taking write locks will inevitably time out on Gradescope.
In addition, one potential optimization is to factor out your own scan types over the bucket pages, which can avoid repeated scans in certain cases. You will find that checking many things about a bucket page often involves a full scan, so you can potentially collect all this information in one pass.
You will need latches on each bucket so that when one thread is writing to a bucket, other threads are not reading or modifying that bucket as well. You should also allow multiple readers to read the same bucket at the same time.
You will need to latch the whole hash table when you need to split or merge buckets, and when global
depth changes.
There are two latches to be aware of in this project. The first is table_latch_ in extendible_hash_table.h ,
which takes latches on the extendible hash table. This comes from the RWLatch class in
src/include/common/rwlatch.h . As you can see in the code, it is backed by std::mutex . The second is the
built-in page latching functionality in src/include/storage/page.h. This is what you must use to protect
your bucket pages. Note that to take a read-lock on the table_latch_ you call RLock from RWLatch.h , but to
take a read-lock on a bucket page you must reinterpret_cast<Page *> to a page pointer, and call the
RLatch method from page.h.
We suggest you look at the extendible hash table class, look at its members, and analyze exactly which
latches will allow which behavior. We also suggest that you do the same for the bucket pages.
Project 4 will explore concurrency control through locking with the LockManager in LockManager.h . You do
not need the LockManager at all for this project.
Transaction Pointer
You can simply pass nullptr as the Transaction pointer argument when it is required. This Transaction object comes from src/include/concurrency/transaction.h. It provides methods to store the pages on which you have acquired latches while traversing the hash table. You do not need to do this to pass the tests.
You must pull the latest changes on our BusTub repo for test files and other supplementary files we
have provided for you. Run `git pull public master`.
Compilation Issues
If you have trouble compiling, try deleting your build directory and making a fresh one. Running make clean
does not completely reset the compilation process, so starting from scratch with cmake .. can help.
We encourage you to use gdb to debug your project if you are having problems. Here is a nice reference for useful commands in gdb. Make sure you've run cmake with debug flags on so that gdb has debugging symbols to use.
Important: These tests are only a subset of all the tests that we will use to evaluate and grade your project. You should write additional test cases on your own to check the complete functionality of your implementation. In addition, if Valgrind times out, then it's possible that your implementation is not efficient enough. If your buffer pool manager implementation was slow, then you might need to debug the BPM as well.
Formatting
Your code must follow the Google C++ Style Guide. We use Clang to automatically check the quality of
your source code. Your project grade will be zero if your submission fails any of these checks.
Execute the following commands to check your syntax. The format target will automatically correct your code. The check-lint and check-clang-tidy targets will print errors and instruct you on how to fix them to conform to our style guide.
$ make -j format
$ make -j check-lint
$ make -j check-clang-tidy
If you have issues with any of these commands, it may relate to either your python installation, or file
permissions on your remote files. Please try to read the error, understand the problem, and resolve it by
doing things like inspecting and editing your $PATH , managing permissions with chmod , and managing
installed packages with brew or apt .
Development Hints
Instead of using printf or cout statements for debugging, use the LOG_* macros for logging information like
this:
LOG_INFO("# Pages: %d", num_pages);
LOG_DEBUG("Fetching page %d", page_id);
To enable logging in your project, you will need to reconfigure it like this:
$ mkdir build
$ cd build
$ cmake -DCMAKE_BUILD_TYPE=DEBUG ..
$ make -j
The different logging levels are defined in src/include/common/logger.h. After enabling logging, the logging
level defaults to LOG_LEVEL_INFO . Any logging method with a level that is equal to or higher than
LOG_LEVEL_INFO (e.g., LOG_INFO , LOG_WARN , LOG_ERROR ) will emit logging information. Note that you will need
to add #include "common/logger.h" to any file that you want to use the logging infrastructure.
Method Signatures
Please do not change any of the method signatures for any of the provided methods. You are free to add
your own helper methods.
For example, consider the Remove function in extendible_hash_table.h:
bool Remove(Transaction *transaction, const KeyType &key, const ValueType &value);
Also notice the HASH_TABLE_TYPE template on the Remove function, which is defined in extendible_hash_table.h. To add your own helper method:
1. Add function signature to the header file. Use private functions whenever possible. If you need to add
a public function, be wary of name conflicts with the grading harness. You can preempt this by
making your function names unique in some way, but there should typically not be conflicts.
2. Add the function to the .cpp file with templates. You will probably need a template before the
function name like TEMPLATE::FunctionName. If you are using KeyType, ValueType, and/or
KeyComparator, you will need the aforementioned template above the function signature.
Post all of your questions about this project on Piazza. Do not email the TAs directly with questions.
Grading Rubric
Each project submission will be graded based on the following criteria:
1. Does the submission successfully execute all of the test cases and produce the correct answer?
2. Does the submission execute without any memory leaks?
Note that we will use additional test cases that are more complex and go beyond the sample test cases
that we provide you.
Late Policy
See the late policy in the syllabus.
Submission
After completing the assignment, you can submit your implementation to Gradescope. Since we have two checkpoints for this project, you will need to submit them separately through the following link:
https://www.gradescope.com/courses/286490/
src/include/buffer/lru_replacer.h
src/buffer/lru_replacer.cpp
src/include/buffer/buffer_pool_manager_instance.h
src/buffer/buffer_pool_manager_instance.cpp
src/include/buffer/parallel_buffer_pool_manager.h
src/buffer/parallel_buffer_pool_manager.cpp
src/include/storage/page/hash_table_directory_page.h
src/storage/page/hash_table_directory_page.cpp
src/include/storage/page/hash_table_bucket_page.h
src/storage/page/hash_table_bucket_page.cpp
src/include/container/hash/extendible_hash_table.h
src/container/hash/extendible_hash_table.cpp
Alternatively, running this zip command from your working directory (aka bustub, bustub-private, etc.) will
create a zip archive called project2-submission.zip that you can submit to Gradescope. You can also put
this command in a bash file and run the bash file to make things easier for you.
$ zip project2-submission.zip src/include/buffer/lru_replacer.h \
src/buffer/lru_replacer.cpp \
src/include/buffer/buffer_pool_manager_instance.h \
src/buffer/buffer_pool_manager_instance.cpp \
src/include/buffer/parallel_buffer_pool_manager.h \
src/buffer/parallel_buffer_pool_manager.cpp \
src/include/storage/page/hash_table_directory_page.h \
src/storage/page/hash_table_directory_page.cpp \
src/include/storage/page/hash_table_bucket_page.h \
src/storage/page/hash_table_bucket_page.cpp \
src/include/container/hash/extendible_hash_table.h \
src/container/hash/extendible_hash_table.cpp
You can submit your answers as many times as you like and get immediate feedback.
Collaboration Policy
Every student has to work individually on this assignment.
Students are allowed to discuss high-level details about the project with others.
Students are not allowed to copy the contents of a white-board after a group meeting with other
students.
Students are not allowed to copy the solutions from another colleague.
WARNING: All of the code for this project must be your own. You may not copy source code from
other students or other sources that you find on the web. Plagiarism will not be tolerated. See CMU's
Policy on Academic Integrity for additional information.