Lecture 11: Transactions & Logging


Transactions

Exam Grades

Stats (out of 90+2)
○ Median: 76.5, Mean: 73.9, High: 92, Standard Deviation: 10.5
○ Regrade requests open for 1 week (until next Tuesday)

Feedback Themes
- TAs were ultra-responsive during the "24 hours"
- Most liked the 24-hour flex time
- Test was too long; we'll recalibrate for the final
Example: SQL Writes

UPDATE Product
SET Price = Price - 1.99
WHERE pname = 'Gizmo'

INSERT INTO SmallProduct(name, price)
SELECT pname, price    -- reads from another table
FROM Product
WHERE price <= 0.99

DELETE FROM Product
WHERE price <= 0.99
How? Example: Mobile Game

[Figure: Game App → User Events → DB → Real-Time DBMS → Report & Share / Business-Product Analysis]

App designer (recap lectures):
  Q1: 1000 users/sec? Q2: Offline? Q3: Support v1, v1' versions?
Systems designer:
  Q7: How to model/evolve game data? Q8: How to scale to millions of users?
  Q9: When machines crash, how to restore game state gracefully?
Product/Biz designer:
  Q4: Which user cohorts? Q5: Next features to build? Q6: Predict ads demand? Experiments to run?



Today's Lecture
1. Why Transactions?
2. Transactions
3. Properties of Transactions: ACID
4. Logging
Example: Unpack an ATM DB Transaction

Read Balance        vs.   Read Balance
Give money                Update Balance
Update Balance            Give money

Visa does > 60,000 TXNs/sec with users & merchants.

Want your $4 Starbucks transaction to wait for a stranger's $10k bet in Las Vegas?
⇒ Transactions can (1) be quick or take a long time, and (2) be unrelated to you.
Transactions are at the core of
-- payment, stock market, banks, ticketing
-- Gmail, Google Docs (e.g., multiple people editing)
Example: Monthly bank interest transaction 'T-Monthly-423'

[Figure: Money table before, and Money table @4:29 am day+1, after the run]

Monthly interest: 10%
4:28 am: starts run on 100M bank accounts
Takes 24 hours to run

UPDATE Money
SET Balance = Balance * 1.1
Example: Monthly bank interest transaction, performance

Cost to update all data:
  100M bank accounts → 100M seeks? (worst case)
  (@10 msec/seek, that's 1 million secs)

Problem 1: SLOW :(
Example, with crash: 'T-Monthly-423'

[Figure: Money table before, and Money table @10:45 am with unknown (??) tuple values]

Monthly interest: 10%
4:28 am: starts run on 100M bank accounts
Takes 24 hours to run
Network outage at 10:29 am; system access restored at 10:45 am

Did T-Monthly-423 complete? Which tuples are bad?
Case 1: T-Monthly-423 crashed
Case 2: T-Monthly-423 completed, and account 4002 deposited $20 at 10:45 am

Problem 2: WRONG :(
Roadmap: primary data structures/algorithms

LOGS → LOCKS → Big Scale → ?????
Today's Lecture
1. Why Transactions?
2. Properties of Transactions: ACID
3. Logging
Transactions: Basic Definition

A transaction ("TXN") is a sequence of one or more operations (reads or writes)
which reflects a single real-world transition.

In the real world, a TXN either happened completely or not at all
(e.g., you withdrew $100 from the bank. Or you didn't.)

START TRANSACTION
UPDATE Product
SET Price = Price - 1.99
WHERE pname = 'Gizmo'
COMMIT
Transactions in SQL

• In “ad-hoc” SQL, each statement = one transaction

• In a program, multiple statements can be grouped together as a transaction

START TRANSACTION
UPDATE Bank SET amount = amount - 100
WHERE name = 'Bob'
UPDATE Bank SET amount = amount + 100
WHERE name = 'Joe'
COMMIT
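
To make the grouping concrete, here is a minimal sketch in Python using the standard-library sqlite3 module (the Bank table and names match the slide; the database file name is hypothetical):

import sqlite3

conn = sqlite3.connect("bank.db")
try:
    with conn:  # opens a transaction; COMMITs on success, ROLLs BACK on exception
        conn.execute("UPDATE Bank SET amount = amount - 100 WHERE name = 'Bob'")
        conn.execute("UPDATE Bank SET amount = amount + 100 WHERE name = 'Joe'")
    # Reaching here means both updates committed atomically.
except sqlite3.Error:
    # Neither update took effect: the whole TXN rolled back as a unit.
    raise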
Motivation for Transactions
Grouping user actions (reads & writes) into transactions helps with two goals:

1. Recovery & Durability: Keep the data consistent and durable,
   despite system crashes, users canceling TXNs part way, etc.
   This lecture!
   Idea: use LOGS, with support to "commit" or "rollback" TXNs.

2. Concurrency: Get better performance by parallelizing TXNs
   without creating 'bad data', despite slow disk writes and reads.
   Next lecture!
   Idea: use LOCKS to run several user TXNs concurrently.


Example 1: Protection against crashes / aborts

Scenario: Make a CheapProduct table from the Product table

Client 1:
INSERT INTO CheapProduct(name, price)
SELECT pname, price
FROM Product          ← Crash / abort!
WHERE price <= 0.99

DELETE FROM Product
WHERE price <= 0.99

What goes wrong?

Client 1:
START TRANSACTION
INSERT INTO CheapProduct(name, price)
SELECT pname, price
FROM Product
WHERE price <= 0.99

DELETE FROM Product
WHERE price <= 0.99
COMMIT

Now we'd be fine! We'll see how / why this lecture.


Example 2: Multiple users, single statements

Client 1: [at 10:01 am]         Client 2: [at 10:01 am]

UPDATE Product                  UPDATE Product
SET Price = Price - 1.99        SET Price = Price * 0.5
WHERE pname = 'Gizmo'           WHERE pname = 'Gizmo'

Two managers attempt to discount products at the same time.
What could go wrong?

Client 1:                       Client 2:
START TRANSACTION               START TRANSACTION
UPDATE Product                  UPDATE Product
SET Price = Price - 1.99        SET Price = Price * 0.5
WHERE pname = 'Gizmo'           WHERE pname = 'Gizmo'
COMMIT                          COMMIT

Now it works like a charm; we'll see how / why next lecture…
3. Properties of Transactions

What you will learn about in this section:
1. Atomicity
2. Consistency
3. Isolation
4. Durability
ACID: Atomicity

• TXN is all or nothing


• Commits: all the changes are made

• Aborts: no changes are made


ACID: Consistency

• The tables must always satisfy user-specified integrity constraints


• E.g., Account number is unique, Sum of debits and of credits is 0

• How consistency is achieved:


• Programmer writes a TXN to go from one consistent state to a
consistent state
• System makes sure that the TXN is atomic (e.g., if EXCEPTION, rolls
back)
ACID: Isolation

• A TXN executes concurrently with other TXNs

• Effect of TXNs is the same as TXNs running one after another

Conceptually,
• similar to OS “sandboxes”
• E.g. TXNs can’t observe each other’s “partial updates”
ACID: Durability

• The effect of a TXN must persist after the TXN


• And after the whole program has terminated
• And even if there are power failures, crashes, etc.

• ⇒ Write data to durable IO (e.g., disk)


ACID Summary

• Atomic
• State shows either all the effects of TXN, or none of them
• Consistent
• TXN moves from a state where integrity holds, to another where integrity
holds
• Isolated
• Effect of TXNs is the same as TXNs running one after another
• Durable
• Once a TXN has committed, its effects remain in the database
A Note: ACID is one popular option!

• Many debates over ACID, both historically and currently

• Some "NoSQL" DBMSs relax ACID

• In turn, "NewSQL" now reintroduces ACID compliance to NoSQL-style DBMSs…

⇒ Usually, it depends on what consistency and performance your application needs.

ACID is an extremely important & successful paradigm, but still debated!
4. Atomicity & Durability via Logging

Conceptual Idea: Trip to Europe

1. Make a TODO list; buy tickets
2. Actual visit (much longer than buying tickets)
Recall (on disks)

▹ Sequential reads FASTER than random reads


▹ Sequential writes (aka “appends”) FASTER than random writes

Big Idea: LOGs (aka log files, TODO lists, or ledgers)

▹ Any value that changes? Append to LOG!
  ■ The LOG is a compact "todo" list of data updates
▹ Intuition:
  ■ Data pages: (a) update in RAM (fast), (b) update on disk later (slow)
  ■ LOGs: (c) append a "todo" entry to the LOG, and (d) control when you flush the LOG to disk

Many kinds of LOGs. We'll study a few key ones!


What you will learn about in this section:

1. How to make/use LOGs?

2. How to make it fast? (Mess with memory and disk)
Basic Idea: (Physical) Logging

Idea:
• The log consists of an ordered list of Update Records
• Each log record contains UNDO information for every update:
  <TransactionID, &reference (e.g., key), old value, new value>

What does the DB do?
• Owns the log "service" for all applications/transactions
• Appends to the log; flushes when necessary (force-writes to disk)

This is sufficient to UNDO any transaction!
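
As a concrete illustration, here is a minimal Python sketch of such an update record and the backwards UNDO walk (the UpdateRecord name, fields, and sample values are hypothetical, mirroring the <TransactionID, &reference, old value, new value> format above):

from dataclasses import dataclass

@dataclass
class UpdateRecord:
    txn_id: str        # e.g., 'T-Monthly-423'
    key: str           # &reference: which tuple was changed
    old_value: float   # UNDO information
    new_value: float   # also enough to REDO

# The DB appends one record per tuple update to an ordered log:
log = []
log.append(UpdateRecord("T-Monthly-423", "acct-4002", 200.0, 220.0))

# To UNDO a transaction: walk the log backwards, restoring old values.
data = {"acct-4002": 220.0}
for rec in reversed(log):
    if rec.txn_id == "T-Monthly-423":
        data[rec.key] = rec.old_value
print(data)  # -> {'acct-4002': 200.0}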


Example, full run: monthly bank interest transaction 'T-Monthly-423'

[Figure: Money table before, Money table @4:29 am day+1, and the WA Log @4:29 am day+1,
showing Update Records followed by a Commit Record]

Monthly interest: 10%
4:28 am: starts run on 100M bank accounts
Takes 24 hours to run

START TRANSACTION
UPDATE Money
SET Amt = Amt * 1.10
COMMIT
Example, with crash: TXN 'T-Monthly-423'

[Figure: Money table @10:45 am with unknown (??) tuple values, and the WA Log @10:29 am]

Monthly interest: 10%
4:28 am: starts run on 100M bank accounts
Takes 24 hours to run
Network outage at 10:29 am; system access restored at 10:45 am

Did T-Monthly-423 complete? Which tuples are bad?
Case 1: T-Monthly-423 crashed
Case 2: T-Monthly-423 completed, and account 4002 deposited $20 at 10:45 am

Can you infer the answers from the log records (shown in red in the figure)?
Example, recovery: monthly bank interest transaction

[Figure: Money table @10:45 am, Money table after recovery, and the WA Log @10:29 am]

System recovery (after 10:45 am), as sketched in code below:

1. Roll back uncommitted transactions
   - Restore old values from the WAL (if any)
   - Notify developers about the aborted TXN
2. Redo recent committed transactions (with their new values)
3. Back in business; redo (any pending) transactions
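
A self-contained sketch of the rollback-then-redo idea in Python (the tuple-based record format is hypothetical, chosen to keep the example short; real recovery code also handles checkpoints, notification, pending TXNs, etc.):

# Records: ('UPDATE', txn_id, key, old, new) or ('COMMIT', txn_id).
def recover(log, data):
    committed = {r[1] for r in log if r[0] == "COMMIT"}

    # 1. Roll back uncommitted TXNs: walk backwards, restoring old values.
    for r in reversed(log):
        if r[0] == "UPDATE" and r[1] not in committed:
            _, _, key, old, _ = r
            data[key] = old

    # 2. Redo committed TXNs: walk forwards, re-applying new values
    #    (their data pages may not have reached disk before the crash).
    for r in log:
        if r[0] == "UPDATE" and r[1] in committed:
            _, _, key, _, new = r
            data[key] = new
    return data

# Usage: T1 committed; T2 has no COMMIT record, so it is rolled back.
log = [("UPDATE", "T1", "A", 7, 13), ("COMMIT", "T1"),
       ("UPDATE", "T2", "B", 5, 99)]
print(recover(log, {"A": 7, "B": 99}))  # -> {'A': 13, 'B': 5}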
What you will learn about in this section:

1. How to make/use LOGs?

2. ⇒ How to make it fast? (Mess with memory and disk)
A picture of logging

[Figure: Main Memory holds pages A=7, B=5 and the Log buffer; Data on Disk holds A=7; Log on Disk]

The log is a file (like any data table):
1. Pages are updated in RAM
2. Flushed as DB blocks on disk (sequential I/O)

"Flushing to disk" = writing to disk from main memory
A picture of logging

T: R(A=7), W(A=13)    [T reads A=7, writes A=13]

Update Record: <Tid, &A, 7, 13>

[Figure: Main Memory now holds A=13, B=5 and the Log buffer; Data on Disk still holds A=7]

The operation is recorded in the update log in main memory!
Why do we need logging for atomicity?

• Could we just write TXN updates to disk only once the whole TXN completes?
• Then, if we abort / crash and the TXN is not complete, it has no effect: atomicity!
• With unlimited memory and time, this could work…

• ⇒ We need to log partial results of TXNs because of:
  • Memory constraints (e.g., billions of updates)
  • Time constraints (what if one TXN takes very long?)

We need to write partial results to disk!
…And so we need a LOG to (maybe) undo these partial results!
What is the correct way to LOG to disk?

• We’ll look at the Write-Ahead Logging (WAL) protocol

• We’ll see why it works by looking at other protocols which are incorrect!

Remember: the key idea is to ensure durability while maintaining our ability to "undo"!
Write-Ahead Logging (WAL): TXN Commit Protocol

T: R(A), W(A)    Update Record: <Tid, &A, 7, 13>    COMMIT record

Commit after we've written the log to disk, but before we've written the data to disk…

[Figure: Main Memory holds A=13, B=5 and Log-RAM; Data on Disk holds A=7; Log-Disk]

OK, Commit!
Write-Ahead Logging (WAL) Commit Protocol

Commit after we've written the log to disk, but before we've written the data to disk… this is WAL!

[Figure: Main Memory holds A=13; Data on Disk still holds A=7; Log-Disk now holds <Tid, &A, 7, 13>]

OK, Commit!

If we crash now, is T durable? Yes: USE THE LOG to restore A=13!
Write-Ahead Logging (WAL)

Algorithm: WAL

For each tuple update, write an Update Record into LOG-RAM.

Follow two flush rules for the LOG:
● Rule 1: Flush the Update Record to LOG-Disk before the
  corresponding data page goes to storage        → Durability
● Rule 2: Before a TXN commits,
  - flush all its Update Records to LOG-Disk
  - flush its COMMIT Record to LOG-Disk          → Atomicity

A transaction is committed once its COMMIT record is on stable storage.

Timeline:
Rule 1: for each tuple update, the flush of the Update Record to LOG-Disk
happens before the corresponding data flush.
Rule 2: before a TXN COMMIT, all its records are flushed to LOG-Disk.
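
Here is a minimal Python sketch of the two flush rules (the WALManager name and in-memory lists are hypothetical stand-ins for the log file and data pages; real WALs also handle page buffering, checkpoints, etc.):

class WALManager:
    """Toy write-ahead log enforcing the two flush rules."""
    def __init__(self):
        self.log_ram = []    # Update/COMMIT records buffered in RAM
        self.log_disk = []   # stand-in for the log file on disk
        self.data_disk = {}  # stand-in for data pages on disk

    def update(self, txn_id, key, old, new):
        self.log_ram.append(("UPDATE", txn_id, key, old, new))

    def flush_log(self):
        self.log_disk.extend(self.log_ram)  # sequential append
        self.log_ram.clear()

    def flush_data_page(self, key, value):
        self.flush_log()             # Rule 1: the update record reaches
        self.data_disk[key] = value  # LOG-Disk before the data page does

    def commit(self, txn_id):
        self.log_ram.append(("COMMIT", txn_id))
        self.flush_log()             # Rule 2: all update records, then the
                                     # COMMIT record, hit LOG-Disk before
                                     # we report "committed"

wal = WALManager()
wal.update("T1", "A", 7, 13)
wal.commit("T1")  # T1 is now durable; the data page can follow lazily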
Incorrect Commit Protocol #1

T: R(A), W(A)    A: 7 → 13

Let's try committing before we've written either the data or the LOG to disk…

[Figure: Main Memory holds A=13, B=5 and Log-RAM; Data on Disk holds A=7; Log-Disk is empty]

OK, Commit!

If we crash now, is T durable? No: we lost T's update!
Incorrect Commit Protocol #2

T: R(A), W(A)    A: 7 → 13

Let's try committing after we've written the data but before we've written the LOG to disk…

[Figure: Main Memory holds A=13, B=5 and Log-RAM; Data on Disk holds A=13; Log-Disk is empty]

OK, Commit!

If we crash now, is T durable? Yes! Except… how do we know whether T was committed?
We lose that information without the log!
Example, performance: monthly bank interest transaction

[Figure: Money table before, Money table @4:29 am day+1, and the WAL @4:29 am day+1]

Cost to update all data:
  100M bank accounts → 100M seeks? (worst case)
  (@10 msec/seek, that's 1 million secs)

Cost to append to the log:
  1 seek to get to the 'end of log'
  + write 100M log entries sequentially (fast!!! < 10 sec)
  [Lazily update the data on disk later, when convenient.]

Speedup for TXN commit: 1 million secs vs 10 secs!!!
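
As a back-of-envelope check (the seek time is from the slide; the record size and sequential bandwidth are assumed figures for illustration):

$$10^8 \ \text{seeks} \times 10\ \mathrm{ms/seek} = 10^6\ \mathrm{s} \approx 11.6\ \text{days}$$

versus, assuming roughly 50-byte log records and ~500 MB/s of sequential disk bandwidth:

$$1\ \text{seek} + \frac{10^8 \times 50\ \mathrm{B}}{500\ \mathrm{MB/s}} \approx 10\ \mathrm{s}$$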
Logging Summary

• If the DB says a TXN committed, the TXN's effects remain even after a database crash

• The DB can undo actions, which gives us atomicity

• This is only half the story…
