
Distributed Systems

16 Distributed Shared Memory

July 15, 2009
Gerd Liefländer
System Architecture Group

© 2009 Universität Karlsruhe, System Architecture Group


Schedule of Today
 Motivation & Introduction
 Potential Problems with DSM
 Design of DSM
 Single versus Multiple Copy DSM
 Structure of DSM
 Synchronization Model
 Consistency Model
 Update Propagation
 Implementation of DSM
 Examples of DSM
 Literature

Distributed Shared Memory
See textbook Coulouris et al.:
“Distributed Systems” Ch. 16 or 18
(depending on the edition)



Distributed Shared Memory
 Distributed Shared Memory (DSM) allows applications
running on separate computers to share data or
address ranges without the programmer having to
deal with message passing
 Instead, the underlying technology (HW or MW) sends the
messages needed to keep the DSM consistent (or relatively
consistent) between computer nodes
 DSM allows applications that used to operate on the
same computer to be easily adapted to operate on
multiple computers

What is a DSM?
 DSM is a special kind of DDS: this time, parts of main
memory are distributed and shared
 Applications should see no difference between a local and
a remote memory access (apart from additional delays)
 Processes should see all writes by other processes
(as fast as possible)
 DSM design and implementation must provide
access transparency
 DSM is not suitable for all situations, e.g. client/server
applications

Motivation
Why DSM? (compare: why shared memory in local systems?)
 Some programmers want a single programming concept for
distributed applications, without all the IPC machinery
What’s easier?
 Sharing data no longer requires explicit IPC (even
though a DSM is itself based upon IPC)
 History has shown that distributed applications based on
explicit IPC tend to have more bugs and larger code
than DSM applications
 Shared memory was the fastest form of collaboration in
local systems, as long as there were only few conflicting
operations
Why DSM?
 Better portability of distributed application programs
 Natural transition from sequential to distributed applications
 Better performance of some applications
 Data locality, on-demand data movement, and larger RAMs
reduce network traffic due to remote paging
 However, ping-pong paging due to false sharing etc. must
be avoided

 Flexible communication environment
 Sender and receiver need not know each other
 They need not even coexist at the same time

 Ease of process migration
 Migration is completed simply by transferring the
corresponding PCB (including its ASCB) to the destination

DSM Implementations
 Hardware
 Mainly used by SMPs: the hardware resolves LOAD and
STORE instructions by communicating with remote
memory as well as local memory

 Paged virtual memory


 Pages of virtual memory get the same set of
addresses for each program in the DSM system
 This only works for computers with common data
and paging formats
 This implementation does not put extra structure
requirements on the program since it is just a
series of bytes.

DSM Implementations (2)
 Middleware
 DSM is provided by some languages and
middleware without hardware or paging support
 For this implementation, the programming
language, underlying system libraries, or
middleware send the messages to keep the data
synchronized between programs so that the
programmer does not have to.

Typical Applications of DSM


 Multiple processes sharing
 memory-mapped files (already in MULTICS)
 large global data, e.g. matrices in parallel
numeric applications

 DSM has to track (see the sketch below)
 how many replicas currently exist and
 where the current replicas are mapped

 Some DSMs offer
 a single copy of each page, read-only as well as read/write
 a single copy of each read/write page, but replicated
read-only pages
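
As a rough illustration of the bookkeeping above, here is a minimal Python
sketch of a replica directory; all names (ReplicaDirectory, add_replica, …)
are invented for this example, not taken from any concrete DSM:

    # Tracks, per page, which nodes hold a replica and which node owns
    # the authoritative copy.
    class ReplicaDirectory:
        def __init__(self):
            self.replicas = {}   # page id -> set of node ids holding a copy
            self.owner = {}      # page id -> node id with the master copy

        def add_replica(self, page, node):
            self.replicas.setdefault(page, set()).add(node)
            self.owner.setdefault(page, node)   # first mapper becomes owner

        def invalidate_others(self, page, writer):
            # Before a write, drop all replicas except the writer's.
            self.replicas[page] = {writer}
            self.owner[page] = writer

    directory = ReplicaDirectory()
    directory.add_replica(page=5, node=1)
    directory.add_replica(page=5, node=2)          # read-only replica on node 2
    directory.invalidate_others(page=5, writer=2)  # node 2 is about to write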
Efficiency of DSM
 DSM systems can perform almost as well as
equivalent message-passing programs for
systems that run on N ≈ 10 or fewer nodes

 There are many factors that affect the
efficiency of a DSM, e.g.
 implementation
 design approach
 memory consistency model

Architecture of a DSM

Distributed Shared Memory Abstract Layer

[Figure: a DSM abstraction layer (DSM1 … DSMk) spans the local
memories of all nodes; each node’s CPUs access their local memory,
and the nodes are connected by an interconnection medium (network).]
Page-Based DSM

[Figure: three nodes, each with physical local memory; pages 3, 6,
and 9 of the mapped shared memory reside on different nodes. A
mapping manager per node (centralized or distributed) maps the
shared pages of the DSM into the local memories.]

Who is Sharing Memory in a DSM?
 A multi-threaded task whose KLTs have migrated to
n > 1 nodes of the DS
 Thread programmers know about the shared data and have
to avoid write/write conflicts as usual, using critical sections
 Challenge: we have to ensure that each KLT can do its
read and write operations on shared data without too much
delay, i.e. handling a remote page fault should not
take significantly more time than handling a local page fault

 Multi-process applications that have one or more
data segments in common
 It is convenient when a programmer can specify how
specific parts of these segments are implemented with
respect to sharing
No Usage of DSM
 Typical client/server applications do not profit
much from DSM, because the clients often see the
resources offered by a server as an abstract data
type that can be accessed via RPC or RMI

 Furthermore, a server is often not interested in letting
unknown (potentially malicious) clients access its data,
i.e. in this case sharing might be too dangerous for
security reasons

Basic Concept

[Figure: the distributed shared memory exists only virtually. Each
node consists of CPUs, local memory, an MMU, and a page manager;
all nodes are connected by a communication network and offer the
interface data = read(address) and write(address, data).]

 The local pager must know the current location of an unmapped page, or
 the local pager must know the location of a centralized super-pager
responsible for tracking all page/frame locations of the DSM

Potential Problems with DSM



Main Issues
 Memory coherence and access synchronization
 Strict, sequential, causal, weak, and release consistency models
 Data location and access
 Broadcasting, centralized data locator, fixed distributed data locator,
and dynamic distributed data locator
 Replacement strategy
 LRU or FIFO, and using secondary store or the memory space of other
nodes (COMA)
 Thrashing (due to false sharing, i.e. ping-pong effect)
 How to prevent a block from being exchanged back and forth between
two nodes over and over again

Granularity
Granularity = amount of data sent with each update
 If granularity is too fine and a large amount of
contiguous data is updated, the overhead of sending
many small update messages can reduce efficiency
 Fine granularity: less false sharing, but more network traffic
(e.g. objects in Orca & Linda)

 If granularity is too coarse, a whole page (or more)
is sent for an update to a single byte, again
reducing efficiency
 Coarse granularity: more false sharing, but less network traffic
(e.g. pages in Ivy)

Granularity Problem

1. False sharing
2. Thrashing (due to ping-pong)

[Figure: several objects O1, O2, O3 packed into one transfer unit;
the granularity spectrum ranges from a byte over a variable and an
object (~0.5 KB) up to a page and beyond (~64 KB).]
Granularity in a Page-Based DSM
 A typical standard page size of 4 KB might be too small to
host a typical shared data object
 However, using super pages might not pay off
 Migrating a super page requires bandwidth, and it might be
difficult to find a fitting memory hole for the super page
 Furthermore, the larger the super-page size, the larger the
potential internal fragmentation
 In a DS there are some applications that might run
faster when using pages smaller than 4 KB
 A 4 KB page might contain too many different objects
  false sharing, i.e. the ping-pong paging effect due
to conflicting activities at different nodes

Thrashing in Single Copy DSM

Example: two processes p1 and p2 on different nodes share one
page that contains the variable a (initially 9).

 p1 issues write(a,10): instead of copying the new value from
one node to the other, we migrate the complete page, i.e. we
handle a “remote page fault”; afterwards a = 10 on p1’s node
 p2 then issues write(a,11): the page migrates back to p2’s
node; afterwards a = 11 there
 Every alternating write moves the whole page back and forth
between the two nodes
Thrashing in Single Copy DSM (false sharing)

Example: p1 only writes to object a, p2 only writes to object b;
however, both objects lie in the same mapping/migration unit
(e.g. page). Although a and b may be completely independent,
each write migrates the page forwards and backwards between
the two nodes.
Thrashing in Multi Copy DSM

Example: p1 reads a, p2 writes a; both nodes have replicas of the
page containing a (initially 9). After p2’s write(a,11), the copy a’
on node 1 is no longer valid; a must be copied from node 2 to
node 1 before p1’s next read.

Remark:
Before writing to a data item in a replicated page, we must invalidate
all replicas. We must solve similar problems as with coherent caches
in an SMP.
Data Items Laid out over Pages

[Figure: data items A, B, and C laid out over pages n and n+1;
A and B share page n, while C straddles the boundary between
page n and page n+1.]

 Danger of false sharing when process 1 accesses data item A
and process 2 accesses data item B concurrently

 Danger of two page faults in case data item C is located on
two different pages
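
To make these layout hazards concrete, here is a small Python sketch;
the page size and the item layout are invented to mirror the figure:

    PAGE_SIZE = 4096  # illustrative page size

    def pages_of(start, size, page_size=PAGE_SIZE):
        # Return the set of page numbers an item [start, start+size) touches.
        return set(range(start // page_size, (start + size - 1) // page_size + 1))

    # Hypothetical layout: A and B share page n, C straddles pages n and n+1.
    layout = {"A": (0, 1024), "B": (1024, 3072), "C": (4000, 200)}
    pages = {name: pages_of(*span) for name, span in layout.items()}

    assert pages["A"] & pages["B"]   # false-sharing danger: same page
    assert len(pages["C"]) == 2      # item C can cause two page faults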
Design of DSM
Single Copy versus Multiple Copy DSM
Structure of DSM
Synchronization Model
Consistency Model
Update Options



Two DSM Principles

 Single copy, i.e. without replication
 If entity = page  implement remote paging,
i.e. instead of swapping to and from a local disk,
swap via the network or from a (local/)remote disk

 Multiple copies, i.e. with replication
 If entity = page, there are no problems with replicated
read-only pages, but we must deal with
reader/writer problems:
1. Single copy for read-write pages
2. N > 1 copies of read-write pages with an additional owner
bit, i.e. we must enforce that all writes are done on all
copies in the same order
Structure of DSM
 Byte oriented
 An access to a part of a byte-oriented DSM
corresponds to an access to a virtual memory, e.g.
Ivy & Mether
 Object oriented
 DSM is a collection of objects
 Operations are the methods of the object type;
e.g. Orca automatically serializes all methods of
the same object
 Constant data
 No updates, only new versions

Synchronization Model
 To enable synchronization on a byte-oriented DSM or
synchronized methods in an object-oriented DSM, we
have to provide solutions for mutual exclusion (a sketch
of the first option follows below)
 Centralized lock manager
 Token manager
 Distributed CS managers
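
A minimal Python sketch of a centralized lock manager, the simplest of the
three options; all names are invented for this illustration:

    from collections import defaultdict, deque

    class CentralLockManager:
        # One node serializes all acquire/release requests per lock and
        # queues the waiters, which yields mutual exclusion.
        def __init__(self):
            self.holder = {}                   # lock name -> node id
            self.waiters = defaultdict(deque)  # lock name -> queue of node ids

        def acquire(self, lock, node):
            # Returns True if granted immediately, otherwise queues the node.
            if lock not in self.holder:
                self.holder[lock] = node
                return True
            self.waiters[lock].append(node)
            return False

        def release(self, lock, node):
            assert self.holder[lock] == node
            if self.waiters[lock]:
                self.holder[lock] = self.waiters[lock].popleft()  # grant to next
            else:
                del self.holder[lock]

    mgr = CentralLockManager()
    assert mgr.acquire("page5", node=1)
    assert not mgr.acquire("page5", node=2)   # node 2 is queued
    mgr.release("page5", node=1)              # lock passes to node 2
    assert mgr.holder["page5"] == 2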

Update Propagation
Write-Update
Write-Invalidate



Write-Update
 Suppose a process has write permission for a page; it
updates a “data item” on it locally

 Updates are propagated via multicast to all replicas that
currently have a copy of this “data item”, i.e. the page

 Replica managers update the corresponding data items in
order to allow reads that are as consistent as possible

 In practice you try to allow multiple writes to a page in a
row by the same process, since otherwise the overhead is
too high

 Furthermore, if possible you propagate only the update
differences of the page to the other replicas
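
A hedged Python sketch of write-update propagation; the class and its
replica layout are invented, and the multicast is modeled as a loop over
the replica holders:

    class WriteUpdateDSM:
        def __init__(self, nodes):
            self.nodes = nodes     # node id -> {page id: bytearray}

        def write(self, writer, page, offset, value):
            # Apply the write to every replica ("multicast" the update).
            for memory in self.nodes.values():
                if page in memory:
                    memory[page][offset] = value

        def read(self, reader, page, offset):
            # Reads are always local: the replicas were kept up to date.
            return self.nodes[reader][page][offset]

    nodes = {1: {5: bytearray(16)}, 2: {5: bytearray(16)}}  # page 5 replicated
    dsm = WriteUpdateDSM(nodes)
    dsm.write(writer=1, page=5, offset=0, value=7)
    assert dsm.read(reader=2, page=5, offset=0) == 7  # update visible remotely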

Example Write-Update

[Figure: three processes on a common time axis. The first process
executes a := 7; b := 7. A second process executes
if (a = 7) then b := b + 1; … if (b = 8) then print(“after”).
A third process executes if (b = a) then print(“before”).
The writes reach the other replicas as update messages after some
propagation delay.]
Implementing Sequential Consistency
Write-Update

[Figure: a client that wants to write 1. requests the block from a
node holding it, 2. replicates the block locally, and 3. updates all
copies, so every node ends up with the new copy of the block.]
Write-Invalidate
 Before writing to a data item, the process multicasts an
invalidation message to all replicas that currently host
that data item, announcing the upcoming update

 As long as the process is writing, all other processes
accessing that data item will be “blocked”

 Updates are sent whenever a process wants to read
a data item that had been invalidated in the past

 Reading valid local data items occurs with no delay
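
The following Python sketch models this protocol under invented names;
the multicast is again a loop, and fetch-on-read stands in for the update
messages:

    class WriteInvalidateDSM:
        def __init__(self):
            self.copies = {}    # (node, page) -> bytearray
            self.valid = set()  # (node, page) pairs that are up to date
            self.owner = {}     # page -> node with the authoritative copy

        def map_page(self, node, page, size=16):
            self.copies[(node, page)] = bytearray(size)
            self.valid.add((node, page))
            self.owner.setdefault(page, node)

        def write(self, writer, page, offset, value):
            for (node, p) in list(self.valid):     # multicast invalidation
                if p == page and node != writer:
                    self.valid.discard((node, p))
            self.copies[(writer, page)][offset] = value
            self.owner[page] = writer

        def read(self, reader, page, offset):
            if (reader, page) not in self.valid:   # stale: fetch on demand
                src = self.owner[page]
                self.copies[(reader, page)] = bytearray(self.copies[(src, page)])
                self.valid.add((reader, page))
            return self.copies[(reader, page)][offset]

    dsm = WriteInvalidateDSM()
    dsm.map_page(node=1, page=5)
    dsm.map_page(node=2, page=5)
    dsm.write(writer=2, page=5, offset=0, value=9)    # invalidates node 1's copy
    assert dsm.read(reader=1, page=5, offset=0) == 9  # refetched from the owner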

Implementing Sequential Consistency
Write-Invalidation

[Figure: a client that wants to write 1. requests the block, 2.
replicates it locally as the new copy, and 3. invalidates all other
copies of the block.]
Implement “Consistent” DSM
Single Copy DSM
Multiple Copy DSM



Implement a Single Copy DSM

 Page-based virtual memory management
 MMU with page-based address translation

 Shared memory segment(s), i.e. its (their)
virtual address range(s) can be mapped at
different nodes

 A page is never mapped to more than one node
Implement a Single Copy DSM

 Local access:
 mapped page with presence bit set in the corresponding
local page-frame table (PFT)
 perform read/write accesses only on local RAM

 Remote access:
 presence bit in the local PFT is empty
 remote access  page fault
 pager gets the page from the remote node
 set the presence bit
 repeat the memory access

 DSM is coherent if
 page transfer operations are atomic
 no node crashes occur

A minimal sketch of this fault-driven page migration follows below.
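
The sketch below is a deliberately simplified Python model of single-copy
paging (names invented): every page is mapped on exactly one node; an
access elsewhere raises a "page fault", the page migrates, and the access
is repeated:

    PAGE_SIZE = 4096

    class SingleCopyDSM:
        def __init__(self, num_pages, home_node):
            self.frames = {p: bytearray(PAGE_SIZE) for p in range(num_pages)}
            self.mapped_at = {p: home_node for p in range(num_pages)}

        def _ensure_local(self, node, page):
            if self.mapped_at[page] != node:   # presence bit empty: page fault
                self.mapped_at[page] = node    # migrate page, unmap at source

        def read(self, node, address):
            page, off = divmod(address, PAGE_SIZE)
            self._ensure_local(node, page)     # may trigger a page migration
            return self.frames[page][off]      # then repeat the access

        def write(self, node, address, value):
            page, off = divmod(address, PAGE_SIZE)
            self._ensure_local(node, page)
            self.frames[page][off] = value

    dsm = SingleCopyDSM(num_pages=8, home_node=0)
    dsm.write(node=1, address=5 * PAGE_SIZE, value=42)    # page 5 migrates to 1
    assert dsm.read(node=2, address=5 * PAGE_SIZE) == 42  # ... then to node 2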
Simple Map Protocol in a DSM

[Figure: a virtual address space of 8 pages; 4 pages are mapped to
node 1 and 4 pages to node 2. Each node’s page-frame table (PFT)
records per page a frame number and a presence bit.]

1. An access to page 5 on node 1 delivers a page fault 
2. request to node 2 
3. delete the presence bit of page 5 in node 2
Mapping Protocol in a DSM (continued)

[Figure: the same PFTs during the page transfer.]

4. Reply with page no. 5, having deleted the PFT2 entry 
6. migrate that page into RAM1  7. map it into PFT1
Mapping Protocol in a DSM (continued)

[Figure: the same PFTs after the transfer.]

9. By migrating, page 5 was deleted at node 2 
10. repeat the access to page 5 
11. page-fault commit message to node 2
Summary Single Copy DSM
 Linear consistency if we use a central coordinator
sequencing all accesses, however with poor performance
 No concurrent read access to the same page
 Ping-pong paging between nodes occurs too often,
especially if code of different threads of a distributed task
is located on the same page

 False sharing
 2 data objects in the same page used by different activities
 Mutual page stealing when threads write to that page

 What can we do?
 Reduce the consistency requirements
 Implement a multiple copy DSM to support concurrent reads

Multi Copy DSM



Linear Consistency?
 Each read should deliver the value of the
latest write operation
 Problem
 No synchronized exact global time
 No unambiguous sequence if clock synchronization
is too coarse; still, we can achieve linear consistency
 We must resolve read/write and write/write
conflicts between concurrent processes
 A memory access is far shorter than the minimal
time deviation

~ Strict Consistent DSM?
 Use a single copy DSM (see slides before)
 Whenever a page is accessed, it first must
migrate to the accessing node, but there are no
read/write or write/write conflicts
 However, many additional page migrations, e.g.
with concurrent reads from the same page
 How to know where a page is currently located?
 Every node must know the current location of each
mapped page, or you use a central super-pager
 Use a shadow page table
 Whenever the mapping of a page changes, you have
to change the corresponding page tables at all
involved nodes
~ Strict Consistent DSM?
 Where to migrate a page when there are
concurrent accesses?
 Need a consensus on the sequence of operations
(easy with a central coordinator, otherwise
additional overhead)
 Real parallel operations only on different pages
 No distinction between read/write (RW) & read-only
(RO) pages, no support for concurrent reads
from different RO pages

Multi Copy DSM
 Assume: non-modifying code 

 Code pages are similar to read-only pages,
i.e. their content will never change

 Once copied to the needed location, they can
stay there until the application has finished,
without any additional overhead

 Changes might only happen when a thread
migrates to another location

Multi Copy DSM
 In the following we focus on potentially shared read-
write data pages
 To distinguish between READ-ONLY and READ-WRITE pages
there is a permanent control bit PRW per page
 If PRW == 1, a write to a READ-ONLY page will cause an
exception of type: address violation
 The very first time a potentially READ-WRITE page is
mapped, it is initialized with a “temporal” control bit
TRW = 1, indicating that at the node where this page
is mapped, each process/KLT can write to this page
 TRW == 0 means that at the involved node no write
access to P is temporarily allowed
Multi Copy Consistent DSM

 A remote read (of a non-local page)  page fault
 Copy the page from the current page owner, i.e. don’t delete the
page in the owner’s RAM
 Prevent writes on both sides, set TRW = 0, i.e. any new write 
page-fault exception

 Whenever some node Lj tries to write to page P
 Copy P from the replica with TRW == 1, which must be the
one with the most recent writes
 Invalidate all replicas, i.e. delete their PFT entries & empty the
corresponding mapped page frames
 If there is no replica with TRW == 1, all replicas (also
your local one) are identical and up to date
 Set local TRW = 1 and repeat your write operation at Lj

A minimal sketch of this protocol follows below.
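
The following Python sketch mirrors the protocol above under invented
names; the TRW bit is stored per (node, page) replica:

    class MultiCopyDSM:
        def __init__(self):
            self.copies = {}  # (node, page) -> (data, trw_bit)

        def read_fault(self, node, page):
            owner = next(n for (n, p) in self.copies if p == page)
            data, _ = self.copies[(owner, page)]
            for key in [k for k in self.copies if k[1] == page]:
                d, _ = self.copies[key]
                self.copies[key] = (d, 0)           # prevent writes on all sides
            self.copies[(node, page)] = (bytes(data), 0)

        def write_fault(self, node, page):
            recent = [n for (n, p), (_, trw) in self.copies.items()
                      if p == page and trw == 1 and n != node]
            if recent:                              # fetch the freshest version
                data, _ = self.copies[(recent[0], page)]
            else:                                   # all replicas are identical
                data, _ = self.copies[(node, page)]
            for key in [k for k in self.copies if k[1] == page and k[0] != node]:
                del self.copies[key]                # invalidate all other replicas
            self.copies[(node, page)] = (bytes(data), 1)  # local TRW = 1, retry

    dsm = MultiCopyDSM()
    dsm.copies[(1, 5)] = (bytes(16), 1)  # node 1 owns page 5, writable
    dsm.read_fault(2, 5)                 # node 2 gets a replica, TRW cleared
    dsm.write_fault(1, 5)                # node 1 invalidates node 2, TRW = 1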

Multi Copy Consistent DSM

[Figure: PFT1 and PFT2 as in the single-copy example, now with
three fields per entry: frame number, presence bit, and the
read-only bit TRW.]
Implement Consistent DSM (remote read)

[Figure: node 1 reads page 5: page-fault request to node 2 
node 2 replies with page 5  node 1 maps a replica, TRW is
cleared on both sides  the read is repeated.]
Implement Consistent DSM (remote write)

[Figure: node 1 writes page 5: invalidate request to node 2 
node 2 commits the invalidation and drops its replica  node 1
sets TRW = 1  the write is repeated.]
Analysis of Linear Consistent DSM

 How did we achieve ~strict consistency?
 Only one process/node can write to the same
page at the same instant of time
 This only works efficiently when writes are rare
and multiple writes at one side are collected,
otherwise ping-pong paging occurs

Sequential Consistent DSM
 Problem
 No linkage between real time and operations
 Find some sequential ordering of the operations on all nodes
 Ordering might conflict with the application, e.g. a process
awaiting a certain value of a coordination variable

 Implementation
 Write operations have to be visible to all processes in the
same order
 Duration of a write
 Example: forge the latest write into the next write
 Add an owner flag per PFT entry

Implement Sequential Consistent DSM

[Figure: the PFT entries now carry a frame number, a presence
bit, a read-only bit, and an owner bit; initially each node is the
owner of its 4 present pages.]
Implement Sequential Consistent DSM (remote read)

[Figure: node 1 reads page 5: page-fault request  node 2
replies with page 5  the read is repeated.]

 The owner does not change its PFT
 The new replica is not the owner, i.e. its owner bit is not set
Implement Sequential Consistent DSM (remote write)

[Figure: node 1 writes page 5: request for new ownership 
node 2 commits the deletion of its ownership  the write is
repeated.]

 Change the old owner’s read-only bit and owner flag
 Give the ownership to the writer
Drawbacks of Sequential Consistent DSM

 What to do when 2 concurrent writes are initiated?
 See assignments

 Overhead per access is still expensive
 Per first read you must copy the remote page to the target node
 Per write on a non-owner node you must delete the
ownership of the owner node and shift it to the writer node,
having copied the page before

 How to propagate the updates of the owner’s side to
all other outdated copies?

 How to prevent reading stale data?

Implementing Sequential Consistency
Replicated and Migrating Data Blocks

[Figure: three nodes; block x is cached on node 1 and duplicated
in node 2’s cache, node 3 caches blocks b and m; the memories
hold x, y (node 1), a, b (node 2), and m, n (node 3).]

Then what happens if node 2 updates x?
Implementing Sequential Consistency
Read/Write Request

[State diagram over the block states Unused, Nil, Read-only,
Read-owned, and Writable; the transitions recoverable from the
figure are listed here, followed by a sketch below.]

 Unused  Read-only: read (read a copy from the owner)
 Nil  Read-owned: read (read from memory and get an ownership)
 Read-only  Writable: write (invalidate others if they have a copy
and get an ownership)
 Read-owned  Writable: write (invalidate others if they have a copy)
 Read-only / Read-owned / Writable  Nil: write-invalidate from
another node
 any state  Unused: replacement
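
A Python sketch of this per-block state machine; the state and event names
come from the slide, but the exact wiring is a reconstruction of the figure,
so treat it as an approximation:

    TRANSITIONS = {
        ("unused",     "read"):       "read-only",   # read a copy from the owner
        ("nil",        "read"):       "read-owned",  # read memory, get ownership
        ("read-only",  "write"):      "writable",    # invalidate others, own it
        ("read-owned", "write"):      "writable",    # invalidate others
        ("read-only",  "invalidate"): "nil",
        ("read-owned", "invalidate"): "nil",
        ("writable",   "invalidate"): "nil",
    }

    def next_state(state, event):
        if event == "replacement":        # any state can be evicted
            return "unused"
        return TRANSITIONS.get((state, event), state)

    s = "unused"
    s = next_state(s, "read")             # -> read-only
    s = next_state(s, "write")            # -> writable
    s = next_state(s, "invalidate")       # -> nil
    assert s == "nil"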
Implementing Sequential Consistency
Locating Data – Fixed Distributed-Server Algorithms

Each processor manages the ownership records for a fixed subset
of the addresses:

Processor 0 manages: addr 0  owner P0, addr 1  P0, addr 2  P2
Processor 1 manages: addr 3  owner P1, addr 4  P2, addr 5  P0
Processor 2 manages: addr 6  owner P2, addr 7  P1, addr 8  P2

[Figure: a read request for addr 2 is sent to its fixed manager
(location search), which knows the owner; the block is then
replicated from the owner, leaving addr 2 read-only at the reader
and read-owned at the owner.]
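
A hedged Python sketch of the fixed distributed-server idea (names
invented): ownership records are statically partitioned over the
processors, so one message to the fixed manager finds the owner:

    NUM_ADDRS_PER_MANAGER = 3

    def manager_of(addr):
        # Fixed partition: P0 manages addresses 0-2, P1 3-5, P2 6-8.
        return addr // NUM_ADDRS_PER_MANAGER

    owner_tables = [                  # one ownership table per manager
        {0: "P0", 1: "P0", 2: "P2"},  # kept by P0
        {3: "P1", 4: "P2", 5: "P0"},  # kept by P1
        {6: "P2", 7: "P1", 8: "P2"},  # kept by P2
    ]

    def locate(addr):
        # One message to the fixed manager returns the current owner.
        return owner_tables[manager_of(addr)][addr]

    assert locate(2) == "P2"   # P0 manages addr 2's record; P2 owns the block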
Implementing Sequential Consistency
Locating Data – Dynamic Distributed-Server Algorithms

Each processor keeps per address only a hint about the probable
owner; a fault request is forwarded along the chain of probable
owners until the real owner is reached.

 Breaking the chain of nodes:
 when the node receives an invalidation
 when the node relinquishes ownership
 when the node forwards a fault request
  the node then points to the new owner

[Figure: probable-owner tables on processors 0–2; a read request
for addr 2 is forwarded along the probable-owner chain (location
search), the block is replicated from the real owner, and the hints
along the path are updated.]
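
A small Python sketch of probable-owner chains (invented names): locating
follows the hints to the real owner and then compresses the path, which is
exactly the chain breaking described above:

    probable = {"P0": "P1", "P1": "P2", "P2": "P2"}  # P2 is the real owner

    def locate(start):
        path, node = [], start
        while probable[node] != node:   # follow hints until the self-reference
            path.append(node)
            node = probable[node]
        for hop in path:                # break the chain: point to the new owner
            probable[hop] = node
        return node

    assert locate("P0") == "P2"
    assert probable["P0"] == "P2"       # hint updated after the search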
Replacement Strategy

 Which block to replace
 Non-usage based (e.g. FIFO)
 Usage based (e.g. LRU)
 A mix of both (e.g. Ivy, sketched below):
 Unused/Nil: replaced with the highest priority
 Read-only: the second priority
 Read-owned: the third priority
 Writable: the lowest priority, with LRU among them

 Where to place a replaced block
 Invalidating a block if other nodes have a copy
 Using secondary store
 Using the memory space of other nodes
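
A minimal Python sketch of Ivy-style victim selection as described above
(names invented): evict Unused/Nil first, then Read-only, then Read-owned,
and pick the least recently used among the Writable blocks:

    PRIORITY = {"unused": 0, "nil": 0, "read-only": 1,
                "read-owned": 2, "writable": 3}

    def pick_victim(blocks):
        # blocks: list of (block id, state, last-use timestamp); lower
        # priority value = evicted first, ties broken by LRU.
        return min(blocks, key=lambda b: (PRIORITY[b[1]], b[2]))[0]

    blocks = [("A", "writable", 10), ("B", "read-only", 50), ("C", "writable", 5)]
    assert pick_victim(blocks) == "B"   # a read-only copy goes before writables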

Thrashing
 Thrashing:
 Two or more processes try to write to the same shared block
 An owner keeps writing to its block while it is shared by two or more
reader processes
 The larger a block, the higher the chance of false sharing that causes
thrashing
 Solutions:
 Allow a process to prevent a block from being accessed by other
processes, using a lock
 Allow a process to hold a block for a certain amount of time
 Apply a different coherence algorithm to each block
 What do those solutions require users to do?
 Are there any perfect solutions?

Literature
 B. Bershad et al.: “The Midway Distributed Shared Memory
System”, IEEE, 1993
 N. Carriero, D. Gelernter: “The S/Net’s Linda Kernel”, ACM
Trans. on Comp. Sys., 1986
 J. Cordsen: “Virtueller gemeinsamer Speicher” (virtual shared
memory), PhD thesis, TU Berlin, 1996
 M. Dubois et al.: “Synchronization, Coherence and Event
Ordering in Multiprocessors”, IEEE Computer, 1988
 K. Li: “Shared Virtual Memory on Loosely Coupled
Multiprocessors”, PhD thesis, Yale, 1986
 D. Mosberger: “Memory Consistency Models”, Tech.
Report, Univ. of Arizona, 1993
 B. Nitzberg: “DSM: A Survey of Issues and Algorithms”,
IEEE Computer Magazine, 1993
Appendix:
Review Consistency Models
Another Notation
See Coulouris et al.



Processes Accessing Shared Data

Process 1:                    Process 2:
br := b;                      a := a + 1;
ar := a;                      b := b + 1;
if (ar ≥ br) then
    print(“OK”);

 a & b are initialized with 0
 Suppose process 2 runs first, then process 1
 We expect that process 1 always prints OK
 However, the update propagation of the DSM might deliver the
updates to process 1 in reverse order, i.e. ar = k but br = k + 1
Interleaved Operations

Time        Process 1                    Process 2
            br := b;            (read)
                                         a := a + 1;   (write)
            ar := a;            (read)
                                         b := b + 1;   (write)
            if (ar ≥ br) then
                print(“OK”);

 An allowed interleaving under sequential consistency
Strict Consistency
 Wi(x, a): processor i writes a to variable x
 bRi(x): processor i reads b from variable x
 Any read on x must return the value of the most
recent write on x

Strict consistency:
P2 writes W2(x, a); every subsequent read immediately returns
the new value: aR1(x), aR3(x), aR1(x)

Not strict consistency:
P2 writes W2(x, a), but P1’s next read still returns the old value,
nilR1(x); only later do the reads aR1(x) and aR3(x) return a
Linear & Sequential Consistency
 Linear consistency: operations of each individual process
appear to all processes in the same order as they happen
(i.e. in real-time order)
 Sequential consistency: operations of each individual
process appear in the same order to all processes

Linear consistency:
W2(x, a) before W3(x, b); P1 and P4 first read aR1(x), aR4(x),
then bR1(x), bR4(x), matching the real-time order of the writes

Sequential consistency:
W2(x, a) before W3(x, b), but all processes agree on the reversed
order: bR1(x), bR4(x) first, then aR4(x), aR1(x)

FIFO and Processor Consistency
 FIFO consistency: writes by a single process are visible to all
other processes in the order in which they were issued
 Processor consistency: FIFO consistency + all writes to the
same memory location must be visible in the same order

[Figure: two example histories with writes by P2 and P3 to x, y,
and z, and reads by P1; under FIFO consistency P1 may observe
the writes of different processes interleaved arbitrarily, while
processor consistency additionally forces all writes to the same
location into one order agreed by all.]

Causal Consistency
 Causally related writes must be visible to all processes in the same
order; concurrent writes may be propagated in a different order

[Figure: in the causal history, W2(x, a) is read by P3 (aR3(x))
before P3 issues W3(x, b), so the two writes are causally related
and every process observes a before b; the concurrent write
W2(x, c) may be observed in different orders. In the non-causal
history, P1 and P4 observe the causally related writes a and b in
different orders.]

Weak Consistency
 Accesses to synchronization variables must obey sequential
consistency
 All previous writes must be completed before an access to a
synchronization variable
 All previous accesses to synchronization variables must be
completed before an access to a non-synchronization variable

[Figure: two histories with writes W2(x, a), W2(x, b), W2(y, c),
synchronization operations S1, S2, S3, and reads by P4; in the
weakly consistent history P4 may read stale values before it
synchronizes, but after synchronizing it reads bR4(x) and cR4(y);
in the non-weak history a read after synchronization still returns
the old value a.]

Release Consistency
 Accesses to acquire and release variables obey processor
consistency
 Previous acquires requested by a process must be completed
before the process performs a data access
 All previous data accesses performed by a process must be
completed before the process performs a release

P1: Acq1(L); W1(x, a); W1(x, b); Rel1(L)
P2: Acq2(L); bR2(x); Rel2(L)
P3: aR3(x)

P2 acquires L and is guaranteed to read the latest value b; P3
reads without acquiring L and may still observe the old value a.
