
Thread-Level Parallelism and Synchronization Issues
Data Races and Synchronization
• Two memory accesses form a data race if they come from different threads, target the same location, at least one is a write, and they occur one after another (nothing guarantees which runs first)
• If there is a data race, the result of the program can vary by chance (which thread ran first?)
• Avoid data races by synchronizing writes and reads to get deterministic behavior
• Synchronization is done by user-level functions that rely on hardware synchronization instructions
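As a concrete illustration (a sketch, not from the slides): in this minimal C/pthreads program, two threads increment a shared counter with no synchronization, so the two read-modify-write sequences race and the final value varies from run to run.

  #include <pthread.h>
  #include <stdio.h>

  static long counter = 0;                /* shared location, no lock */

  static void *worker(void *arg) {
      for (int i = 0; i < 1000000; i++)
          counter++;                      /* read-modify-write: races with the other thread */
      return NULL;
  }

  int main(void) {
      pthread_t t1, t2;
      pthread_create(&t1, NULL, worker, NULL);
      pthread_create(&t2, NULL, worker, NULL);
      pthread_join(t1, NULL);
      pthread_join(t2, NULL);
      printf("counter = %ld\n", counter); /* expected 2000000, but updates are lost */
      return 0;
  }

Compiled with cc -pthread, the printed value is usually well below 2000000 and differs across runs, which is exactly the non-deterministic behavior the slide describes.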
Lock and Unlock Synchronization
• Lock used to create a region (critical section) where only one thread can operate
• Given shared memory, use a memory location as the synchronization point: a lock or semaphore
• Thread reads the lock to see if it must wait, or if it is OK to go into the critical section (and then sets the lock to locked)
  – 0 => lock is free / open / unlocked / lock off
  – 1 => lock is set / closed / locked / lock on
[Diagram] Set the lock → critical section (only one thread gets to execute this section of code at a time, e.g., to change shared variables) → unset the lock
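This lock/unlock pattern is what a mutex provides. A minimal sketch using the POSIX pthreads API (an API choice of this example; the slide describes the pattern, not a particular library):

  #include <pthread.h>
  #include <stdio.h>

  static pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;
  static long shared = 0;

  static void *worker(void *arg) {
      for (int i = 0; i < 1000000; i++) {
          pthread_mutex_lock(&lock);      /* set the lock (wait if already set) */
          shared++;                       /* critical section: one thread at a time */
          pthread_mutex_unlock(&lock);    /* unset the lock */
      }
      return NULL;
  }

  int main(void) {
      pthread_t t1, t2;
      pthread_create(&t1, NULL, worker, NULL);
      pthread_create(&t2, NULL, worker, NULL);
      pthread_join(t1, NULL);
      pthread_join(t2, NULL);
      printf("shared = %ld\n", shared);   /* always 2000000: no lost updates */
      return 0;
  }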
Possible Lock/Unlock Implementation
• Lock (a.k.a. busy wait):

        addiu $t1,$zero,1      ; t1 = 1 means Locked
  Loop: lw    $t0,lock($s0)    ; load lock
        bne   $t0,$zero,Loop   ; loop if locked
  Lock: sw    $t1,lock($s0)    ; Unlocked, so lock

• Unlock:

        sw    $zero,lock($s0)
Possible Lock Problem
Time runs downward; the two threads interleave as follows:

  Thread 1                     Thread 2
  addiu $t1,$zero,1
  Loop: lw $t0,lock($s0)
                               addiu $t1,$zero,1
                               Loop: lw $t0,lock($s0)
  bne $t0,$zero,Loop
                               bne $t0,$zero,Loop
  Lock: sw $t1,lock($s0)
                               Lock: sw $t1,lock($s0)

Both threads read 0 from the lock, so both fall through the branch and both store 1: each thread thinks it has set the lock. Exclusive access is not guaranteed!
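The same failure expressed in C (a sketch, not from the slides): when the test and the set are separate ordinary loads and stores, both threads can pass the test before either performs the store.

  /* BROKEN: the test (load) and the set (store) are separate,
     non-atomic steps, just like the lw/bne/sw sequence above. */
  static volatile int lock = 0;

  void broken_acquire(void) {
      while (lock != 0)    /* lw + bne: both threads can read 0 here... */
          ;                /* busy wait */
      lock = 1;            /* sw: ...then both store 1 and both proceed */
  }

  void broken_release(void) {
      lock = 0;
  }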
Help! Hardware Synchronization
• Hardware support is required to prevent an interloper (a thread on another core, or another thread on the same core) from changing the value
  – Atomic read/write memory operation
  – No other access to the location is allowed between the read and the write
• Could be a single instruction
  – E.g., atomic swap of register ↔ memory (sketched below)
  – Or an atomic pair of instructions
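C11 exposes the atomic-swap idea as atomic_exchange; a lock built on it might look like the following sketch (the slide describes the hardware primitive, not this particular API):

  #include <stdatomic.h>

  static atomic_int lock = 0;

  void swap_acquire(atomic_int *l) {      /* e.g., swap_acquire(&lock); */
      /* Atomically swap 1 into the lock and get the old value back;
         no other access to the location can occur between the read
         and the write. Old value 0 means the lock was free and is
         now ours; 1 means someone else holds it, so try again. */
      while (atomic_exchange(l, 1) != 0)
          ;                               /* busy wait */
  }

  void swap_release(atomic_int *l) {
      atomic_store(l, 0);                 /* unset the lock */
  }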
Test-and-Set
• In a single atomic operation:
  – Test to see if a memory location is set (contains a 1)
  – Set it (to 1) if it isn't (it contained a zero when tested)
  – Otherwise indicate that the set failed, so the program can try again
  – No other instruction can modify the memory location, including another test-and-set instruction
• Useful for implementing lock operations (sketched below)
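C11 provides test-and-set directly as atomic_flag_test_and_set. A minimal spin lock built on it, as a sketch assuming C11 atomics are available:

  #include <stdatomic.h>

  static atomic_flag lock = ATOMIC_FLAG_INIT;   /* starts unset (0) */

  void tas_acquire(atomic_flag *l) {
      /* Atomically sets the flag to 1 and returns its previous value:
         0 (false) means the set succeeded and we hold the lock;
         1 (true) means it was already set, so the set failed: retry. */
      while (atomic_flag_test_and_set(l))
          ;                                     /* busy wait */
  }

  void tas_release(atomic_flag *l) {
      atomic_flag_clear(l);                     /* unset the lock */
  }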
Multithreading on Multicore
• Basic idea: processor resources are expensive and should not be left idle
• Long latency to memory on a cache miss?
• Hardware switches threads to bring in other useful work while waiting for the cache miss
• Cost of a thread context switch must be much less than the cache-miss latency
• Put in redundant hardware so context doesn't have to be saved on every thread switch:
  – PC, registers
• Attractive for apps with abundant TLP
Concluding
• Sequential software is slow software
  – Multiprocessors are the only path to higher performance
• A multiprocessor (multicore) uses shared memory (a single address space) for TLP
• Cache coherency keeps data coherent:
  1. Snooping protocols
  2. Directory-based protocols
  – False sharing is a concern in cache coherence
• Synchronization via hardware primitives
