Lecture 14

The document discusses database storage concepts including file organization using pages and records, indexing, memory hierarchy, and buffer management. File organization involves storing records in pages with various options like heap files and sorted files. Buffer management caches frequently used pages in memory using policies like LRU to reduce disk I/O.

Uploaded by

Faruk Karagoz

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views

Lecture 14

Uploaded by

Faruk Karagoz

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 69

CSE 412 Database Management

Lecture 14 Database Storage

Jia Zou
Arizona State University

1
Overview
• File Organization
• Files of records
• Page Formats
• Record Formats
• Indexing
• Memory hierarchy
• Buffer management
Files
• FILE: A collection of pages, each containing a collection of records.
• Must support:
• insert/delete/modify record
• read a particular record (specified using record id)
• scan all records (possibly with some conditions on the records to be retrieved)
Alternative File Organization
• Several options (w/ trade-offs):
• Heap files: Suitable when typical access is a file scan retrieving all records.
• Sorted Files:
Later
• Index File Organizations:
Heap File using Lists
• The header page id and heap file name must be stored someplace.
• Each page contains 2 `pointers’ plus data.
Heap File using Lists
• The header page id and Heap file name must be stored someplace.
• Each page contains 2 `pointers’ plus data.

Any problems?
Heap File Using a Page Directory

• The entry for a page can

include the number of free
bytes on the page.
• The directory is a collection of
pages; linked list
implementation is just one
alternative.
• Much smaller than linked
list of all Heap File pages!
The Problem
• How would you store records on a page/ file, such that
• you can point to them
• you can insert/delete records with few disk accesses
Fixed-Length Records
• A Packed approach
Fixed-Length Records
• Insertion?
Fixed-Length Records
• How about deletes?
Fixed-Length Records
• How about deletes?

Bad - we have too much to

reorganize/update
Another Solution for Fixed-Length Records
• Slots+Bitmaps

✔ insertions
✔ deletions
Variable Length Records
• Slotted Page

• pack them
• keep ptrs to them
• rec-id = <page-id, slot#>
• mark start of free space
Record Formats: Fixed Length
• Information about field types same for all records in a file; stored in
system catalogs.
• Finding i’th field done via arithmetic.
Record Formats
• Fixed length records: straightforward - store info in catalog
• Variable length records: encode the length of each field?
• Store the length
• Use delimiter
Variable Length records
• Two alternative formats (# fields is fixed):

Pros and Cons?

Variable Length records
• Two alternative formats (# fields is fixed):

More popular!
Overview
• File Organization
• Files of records
• Page Formats
• Record Formats
• Indexing
• Memory hierarchy
• Buffer management
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
The Storage Hierarchy
• Main memory (RAM) for currently
used data.
• Disk for the main database
(secondary storage).
• Tapes for archiving older versions of
the data (tertiary storage).
Overview
• File Organization
• Files of records
• Page Formats
• Record Formats
• Indexing
• Memory hierarchy
• Buffer management
Motivation of Initial RDBMS architecture: Disk
is relatively VERY slow
• READ: disk-> main memory (RAM)
• WRITE: main memory (RAM) -> disk
• Both are high-cost operations, relative to in-memory operations, so
must be planned carefully
Rules of Thumbs
• Memory access much faster than disk I/O (~ 1000x)
• “Sequential” I/O faster than “random” I/O (~ 10x)
• seek time: moving arms to position disk head on track
• rotational delay: waiting for block to rotate under head
Dominating
• transfer time: actually moving data to/from disk surface
• SSD?
• Similar sequential and random
• Reading is much faster than writing
Disk Arrays: RAID (Redundant Array of
Inexpensive Disks)

Mean time to failure

Why not store it all in memory?
• Costs too much.
• disk: ~$0.1/Gb; memory: ~$5~10/Gb
• High-end Databases today in the 10-100 TB range
• Approx 60% of the cost of a production system is in the disks
• Main memory is volatile.
• Note: some specialized systems do store entire database in main
memory.
Can we leverage OS for DB storage
management?

OS virtual memory
OS file system
Can we leverage OS for DB storage
management?
• Unfortunately, OS often gets in the way of DBMS
• DBMS needs to do things “its own way”
• Control over buffer replacement policy
• LRU not always best (some times worst!)
• Control over flushing data to disk
• Write-ahead logging (WAL) protocol requires flushing log entries to disk
Overview
• File Organization
• Files of records
• Page Formats
• Record Formats
• Indexing
• Memory hierarchy
• Buffer management
Organize Disk Space into Pages
• A table is stored as one or more files, a file contains one or more
pages
• Higher levels call upon this layer to:
• allocate/de-allocate a page
• read/write a page
• Best if requested pages are stored sequentially on disk! Higher levels
don’t need to know if/ how this is done, nor how free space is
managed.
Buffer Management
Pinned or
Unpinned
Buffer Management
• Data must be in RAM for DBMS to operate on it!
• Buffer Mgr hides the fact that not all data is in RAM
When a Page is Requested ...
• Buffer pool information table contains: NOT FOUND <?,?,?>
• If requested page is not in pool and the pool is not full:
• Read requested page into chosen frame
• Pin the page and return its address
• If requested page is not in pool and the pool is full:
• Choose an (un-pinned) frame for replacement
• If frame is “dirty”, write it to disk
• Read requested page into chosen frame
• Pin the page and return its address
• Buffer pool information table now contains:

• Unpin it when you finish using the page

Buffer Replacement Policy
• Frame is chosen for replacement by a replacement policy:
• Least-recently-used (LRU), MRU, Clock, etc.
• Policy -> big impact on # of I/O ’s; depends on the access pattern
LRU Replacement Policy
• Least Recently Used (LRU)
• for each page in buffer pool, keep track of time last unpinned
• replace the frame which has the oldest (earliest) time
• very common policy: intuitive and simple
• Problems?
LRU Replacement Policy
• Problem: Sequential Flooding
• LRU + repeated sequential scans.
• # buffer frames < # pages in file means each page request causes an I/O. MRU
much better in this situation (but not in all situations, of course).
Sequential Flooding – Illustration
How LRU work?
How LRU work?
How LRU work?
How will MRU Work?
How will MRU work?
How will MRU work?
How will MRU work?
How will MRU work?
Advanced Paging Algorithm
• Greedy-dual
• Locality Set
• Clock
Summary
• Buffer manager brings pages into RAM.
• Very important for performance
• Page stays in RAM until released by requestor.
• Written to disk when frame chosen for replacement (which is sometime after
requestor releases the page).
• Choice of frame to replace based on replacement policy.
Conclusions
• Memory hierarchy
• Disks: (>1000x slower) thus
• pack info in blocks
• try to fetch nearby blocks (sequentially)
• Buffer management: very important
• LRU, MRU, etc
• Record organization: Slotted page

DBMS Storage and Indexing
No ratings yet
DBMS Storage and Indexing
90 pages
Storing Data: Disks and Files: (R&G Chapter 9)
No ratings yet
Storing Data: Disks and Files: (R&G Chapter 9)
39 pages
Block Diagram of A DBMS: (R&G Chapter 9)
No ratings yet
Block Diagram of A DBMS: (R&G Chapter 9)
6 pages
Lecture15 Fall
No ratings yet
Lecture15 Fall
102 pages
Review: (R&G Chapter 9) - Aren't Databases Great? - Relational Model - SQL
No ratings yet
Review: (R&G Chapter 9) - Aren't Databases Great? - Relational Model - SQL
7 pages
The Bare Basics: Storing Data On Disks and Files
No ratings yet
The Bare Basics: Storing Data On Disks and Files
33 pages
Layers of A DBMS: Query Optimization Query Processor Query
No ratings yet
Layers of A DBMS: Query Optimization Query Processor Query
15 pages
Journey of Byte: Lecture 4: Basic Concepts of DBMS 25.10.2016
No ratings yet
Journey of Byte: Lecture 4: Basic Concepts of DBMS 25.10.2016
8 pages
Chapter 11: Indexing and Storage: Modified From: Database System Concepts, 6 Ed
No ratings yet
Chapter 11: Indexing and Storage: Modified From: Database System Concepts, 6 Ed
53 pages
Disk Organization
No ratings yet
Disk Organization
29 pages
ch1
No ratings yet
ch1
39 pages
Database Management Systems, R. Ramakrishnan and J. Gehrke 1
No ratings yet
Database Management Systems, R. Ramakrishnan and J. Gehrke 1
32 pages
DBMS Internals: How Does It All Work?
No ratings yet
DBMS Internals: How Does It All Work?
94 pages
DIsk BFR
No ratings yet
DIsk BFR
26 pages
DBMS-UNIT-6 R16 (1)
No ratings yet
DBMS-UNIT-6 R16 (1)
16 pages
DBMS Storage and Indexing
No ratings yet
DBMS Storage and Indexing
80 pages
Unit 5
No ratings yet
Unit 5
185 pages
Lecture 17
No ratings yet
Lecture 17
24 pages
VND - Ms Powerpoint&Rendition 1
No ratings yet
VND - Ms Powerpoint&Rendition 1
118 pages
Storage and File Structures: Goals
No ratings yet
Storage and File Structures: Goals
13 pages
File and File Structure: Overview of Storage Device
No ratings yet
File and File Structure: Overview of Storage Device
29 pages
DBMS Indexing and Storage
No ratings yet
DBMS Indexing and Storage
53 pages
Storing Data: Disks and Files
No ratings yet
Storing Data: Disks and Files
29 pages
File Storage and Indexing: Lesson 13 Cs 3200 Kathleen Durant PHD
No ratings yet
File Storage and Indexing: Lesson 13 Cs 3200 Kathleen Durant PHD
46 pages
Storing Data: Disks and Files
No ratings yet
Storing Data: Disks and Files
29 pages
File Organization
No ratings yet
File Organization
47 pages
Layers of a DBMS
No ratings yet
Layers of a DBMS
38 pages
Disks, Memories & Buffer Management: "The Two Offices of Memory Are Collection and Distribution." - Samuel Johnson
No ratings yet
Disks, Memories & Buffer Management: "The Two Offices of Memory Are Collection and Distribution." - Samuel Johnson
28 pages
31 File Structures
No ratings yet
31 File Structures
20 pages
1 - Disk Storage - Ch13
No ratings yet
1 - Disk Storage - Ch13
31 pages
02 Storage (1)
No ratings yet
02 Storage (1)
104 pages
Storing Data: Disks and Files: Why Not Store Everything in Main Memory?
No ratings yet
Storing Data: Disks and Files: Why Not Store Everything in Main Memory?
10 pages
Notes 03 - Database Storage - I
No ratings yet
Notes 03 - Database Storage - I
42 pages
03 Storage1
No ratings yet
03 Storage1
4 pages
Lecture16 Fall
No ratings yet
Lecture16 Fall
81 pages
Lecture Data Storage
No ratings yet
Lecture Data Storage
28 pages
ADBMS Answer Bank
No ratings yet
ADBMS Answer Bank
90 pages
15 Storage Manager
No ratings yet
15 Storage Manager
5 pages
cbab1dd746579df2cf20fe5027fbf95a_MIT6_830F10_lec07b
No ratings yet
cbab1dd746579df2cf20fe5027fbf95a_MIT6_830F10_lec07b
5 pages
Storage and Index: Chapter 8, 9
No ratings yet
Storage and Index: Chapter 8, 9
29 pages
Disk Storage, Basic File Structures, and Hashing: Database Design Database Design
No ratings yet
Disk Storage, Basic File Structures, and Hashing: Database Design Database Design
13 pages
Ch4-Data Storage and Indexing
No ratings yet
Ch4-Data Storage and Indexing
116 pages
6 Data Storage and Querying
100% (1)
6 Data Storage and Querying
58 pages
Disk
No ratings yet
Disk
49 pages
03-Storage1 Notes
No ratings yet
03-Storage1 Notes
4 pages
Chapter 6- - Copy
No ratings yet
Chapter 6- - Copy
62 pages
Chapter 6-
No ratings yet
Chapter 6-
62 pages
Dbms - Unit 5 Notes
No ratings yet
Dbms - Unit 5 Notes
30 pages
Lecture 01 - File Storage - Part 1
No ratings yet
Lecture 01 - File Storage - Part 1
48 pages
Files
No ratings yet
Files
26 pages
6 Storage
No ratings yet
6 Storage
13 pages
Database Management System Chapter 3
No ratings yet
Database Management System Chapter 3
19 pages
File Structure and Indexing
No ratings yet
File Structure and Indexing
18 pages
Chapter 4 Summery
No ratings yet
Chapter 4 Summery
14 pages
Database Management System Chapter 1
No ratings yet
Database Management System Chapter 1
53 pages
Files,Pages, Records
No ratings yet
Files,Pages, Records
56 pages
01 Disks Files
No ratings yet
01 Disks Files
30 pages
Lecture 17
No ratings yet
Lecture 17
52 pages
Lecture 18
No ratings yet
Lecture 18
49 pages
Lecture7 Fall
No ratings yet
Lecture7 Fall
53 pages
Lecture13 Part1
No ratings yet
Lecture13 Part1
25 pages
Lecture9 Fall
No ratings yet
Lecture9 Fall
86 pages
Lecture12 Part 1
No ratings yet
Lecture12 Part 1
18 pages
Lecture5 Fall
No ratings yet
Lecture5 Fall
53 pages
Lecture10 Fall
No ratings yet
Lecture10 Fall
57 pages
Dabase Management l2
No ratings yet
Dabase Management l2
101 pages
Lecture 20
No ratings yet
Lecture 20
64 pages
Cse 412
No ratings yet
Cse 412
72 pages

Lecture 14

Uploaded by

Lecture 14

Uploaded by

CSE 412 Database Management

Lecture 14 Database Storage

• The entry for a page can

Bad - we have too much to

Pros and Cons?

Mean time to failure

• Unpin it when you finish using the page

You might also like