Lecture 25
This document is copyright (C) Stanford Computer Science and Nick Troccoli, licensed under
Creative Commons Attribution 2.5 License. All rights reserved.
Based on slides and notes created by John Ousterhout, Jerry Cain, Chris Gregg, and others.
NOTICE RE UPLOADING TO WEBSITES: This content is protected and may not be shared, uploaded, or distributed without express written permission.
Key question: How do hardware advances impact the design of operating systems?
CS111 Topic 4: Virtual Memory
Modern Technologies and OSes - How do hardware advances impact the design of operating systems?
Learning Goals
• Learn about multicore CPUs and how they change scheduling and lock implementations
• Understand the benefits and drawbacks of flash storage and how flash storage can impact filesystem design
Plan For Today
• Example 1: Multicore CPUs
• Example 2: Flash Storage
Plan For Today
• Example 1: Multicore CPUs
• Multicore scheduling
• Multicore locks
• Example 2: Flash Storage
Multicore CPUs
• True multitasking: multiple cores let us run multiple threads simultaneously
• Starting in the mid-2000s, multicore processors became more common in consumer devices
• The OS manages these cores - new challenges!
Multicore CPUs
[Picture of a Snapdragon processor]
Multicore Challenges
OS management of multiple cores surfaces new challenges:
• Example: how does scheduling work with multiple CPUs?
• Example: how can we implement mutexes when there are multiple CPUs?
Plan For Today
• Example 1: Multicore CPUs
• Multicore scheduling
• Multicore locks
• Example 2: Flash Storage
Scheduling
Key Question: How does the operating system decide which thread to run next (e.g., when there are many ready threads)?
Previously: First-Come-First-Serve, Round-Robin, SRPT, Priority-Based
What about when we have multiple cores to schedule threads on? (Assume all cores are equal.)
Multicore Scheduling
Initial idea: one ready queue shared by k cores
• Share the ready queue data structure across cores, with a lock to synchronize access
• One dispatcher per core
• Separate timer interrupts for each core
• Run the k highest-priority threads on the k cores
• When a new thread is marked "ready", compare its priority against the lowest-priority running thread; preempt if the new thread has higher priority
• This works fine for 2 cores but breaks down with many more cores. What is the main bottleneck with this approach when used with many cores?
Respond on PollEv: pollev.com/cs111 or text CS111 to 22333 once to join.
Multicore Scheduling
The single ready queue is a huge bottleneck - cores must wait for access!
Better: give each core its own ready queue. An idle core can "steal" work from a busy core's queue, but moving a thread to a new core discards the cache state it built up on its old core.
Tension between work stealing (want to move threads often) and core affinity (don't want to move threads often)
Gang Scheduling
How should we approach scheduling if one process has several threads?
• Threads may be coordinating / exchanging info
• "Gang scheduling" - run all of the process's threads together on different cores
• Why? Thread progress may be intertwined. E.g., one thread holds a lock and then gets descheduled; another thread runs but soon needs to wait for that same lock.
Multicore Scheduling
In general, these systems all have good and bad situations - e.g., the Linux scheduler had problems for many years; it is better now, but still has some issues with load balancing and with moving threads too rapidly between cores.
Plan For Today
• Example 1: Multicore CPUs
• Multicore scheduling
• Multicore locks
• Example 2: Flash Storage
Single-Core Locks
So far: our Mutex implementation relied on disabling interrupts to prevent race conditions.

class Lock {
  int locked = 0;
  ThreadQueue q;
};

void Lock::lock() {
  IntrGuard guard;
  if (!locked) {
    locked = 1;
  } else {
    q.add(currentThread);
    blockThread();
  }
}

void Lock::unlock() {
  IntrGuard guard;
  if (q.empty()) {
    locked = 0;
  } else {
    unblockThread(q.remove());
  }
}
Multicore Locks
Problem: this only works on single-core processors! With multiple cores, even if interrupts are disabled, some other thread could be running on another core.
How do we approach this on multicore systems?
• Turn off all other cores? Not a great option.
Key Idea: we must use a (small) amount of busy waiting (!!). We need a mechanism for cores to sync up before proceeding, and setting/checking a shared value is the only option.
• There's no other way to synchronize with the other cores; until we have synchronized, we can't even put a thread to sleep.
Multicore Locks, V1
class Lock {
  int locked = 0;
  ThreadQueue q;
  int sync = 0;
};

void Lock::lock() {
  // try to change sync from 0 to 1
  while (true) {
    int old = sync;
    sync = 1;
    if (old == 0) break;
  }
  // we are the only one proceeding now
  if (!locked) {
    locked = 1;
    sync = 0;
  } else {
    q.add(currentThread);
    sync = 0;
    blockThread();
  }
}
Multicore Locks, V1
class Lock {
  int locked = 0;
  ThreadQueue q;
  std::atomic<int> sync{0};
};

void Lock::lock() {
  // try to change sync from 0 to 1
  while (sync.exchange(1)) {}
  // we are the only one proceeding now
  if (!locked) {
    locked = 1;
    sync = 0;
  } else {
    q.add(currentThread);
    sync = 0;
    blockThread();
  }
}

exchange: atomically read a memory value, replace it with a given value, and return the old value.
Busy waiting is unavoidable! However, it's very short - just long enough to manipulate the lock structure.
Multicore Locks, V1

class Lock {
  int locked = 0;
  ThreadQueue q;
  std::atomic<int> sync{0};
};

void Lock::lock() {
  while (sync.exchange(1)) {}
  if (!locked) {
    locked = 1;
    sync = 0;
  } else {
    q.add(currentThread);
    sync = 0;        // <-- air gap: sync released, but we haven't blocked yet
    blockThread();
  }
}

void Lock::unlock() {
  while (sync.exchange(1)) {}
  if (q.empty()) {
    locked = 0;
  } else {
    unblockThread(q.remove());
  }
  sync = 0;
}

Problem: there's an air gap between releasing sync and blocking. Another thread could call unlock() in that gap, unblocking us before we have blocked - and then we block forever.
Multicore Locks
We won't worry about these details, but a few more steps/tweaks are needed (specifically, tweaking how we block to fix this race condition, and continuing to use IntrGuard to disable interrupts). (See the optional slides at the end if you're interested!)
Plan For Today
• Example 1: Multicore CPUs
• Example 2: Flash Storage
Flash Storage
• Much faster than hard disks: no moving parts (no delays from platters/heads!), smaller, faster
• Flash storage became more common with the rise of mobile devices, and is nowadays common in PCs too
• Can buy separately, or some devices have non-removable storage (e.g., many mobile devices)
• New opportunities and challenges in designing filesystems for flash - it has its own quirks
[Picture of a Samsung 980 Pro SSD, which is a small chip/board with a connector on the right side to insert into a computer or other device.]
Flash Storage Quirks
Quirk #1: Writing Data: flash storage doesn't support simply writing arbitrary data to a portion of the storage. Instead, it supports two operations that, combined, allow us to write data:
• Erase: set all bits of an erase unit to 1. The storage is divided up into erase units, typically 256 KB in size.
• Write: modify one page; can only clear bits to 0. The storage is also divided up into pages, typically 512 bytes or 4 KB in size.
Flash Storage Quirks
Quirk #2: Wear-out: after an erase unit has been erased many times, it no longer reliably stores data (!). Typically around 100K erase cycles.
Wear Leveling: we want erase units to wear out at the same rate everywhere (rather than having some parts wear out before others). One family of ideas: move "hot" (short-lived) and "cold" (long-lived) data around to even out storage usage.
Flash Storage and Filesystem Design
• A common approach has been to abstract away these quirks by including software in the flash storage that makes it look like a hard disk.
• "Flash Translation Layer" (FTL) - software that manages the flash device, built into the drive, typically mimicking a disk interface (read/write blocks)
• The OS has no visibility into erase units, etc. - it looks like a disk! Virtualization.
• Advantage: can use existing filesystem software
• Disadvantages: sacrifices performance, no direct access to the raw hardware, unnecessary layers / duplication
• Lots of interesting questions about what filesystems would look like if designed with flash storage in mind, without an FTL.
• Other storage technologies in the future?
Recap
• Example 1: Multicore CPUs
• Multicore scheduling
• Multicore locks
• Example 2: Flash Storage

Lecture 25 takeaway: Operating systems and hardware changes are tightly intertwined; multicore processors and flash storage provide two examples of the impact of hardware changes on OS implementations.
Extra Slides
Multicore Locks, V2
Somehow, we need to block and then unlock the lock??
• Key insight: we don't need to block prior to unlocking the lock; we just need to be marked as blocked.
• Solution (awkward): change the interface of our thread scheduler/dispatcher to let us separately mark a thread as blocked and then context switch. (Linux does something like this.)
Multicore Locks, V2
class Lock {
  int locked = 0;
  ThreadQueue q;
  std::atomic<int> sync{0};
};

void Lock::lock() {
  while (sync.exchange(1)) {}
  if (!locked) {
    locked = 1;
    sync = 0;
  } else {
    q.add(currentThread);
    currentThread->state = BLOCKED;
    sync = 0;
    blockThreadIfNecessary();
  }
}

void Lock::unlock() {
  while (sync.exchange(1)) {}
  if (q.empty()) {
    locked = 0;
  } else {
    unblockThread(q.remove());
  }
  sync = 0;
}
Multicore Locks, Final Version
One last change - we must disable interrupts.
• E.g., if the timer fires right after we acquire sync (the internal spin flag), another thread trying to get it would just busy wait, wasting resources.

void Lock::lock() {
  while (sync.exchange(1)) {}
  if (!locked) {
    locked = 1;
    sync = 0;
  } else {
    q.add(currentThread);
    currentThread->state = BLOCKED;
    sync = 0;
    blockThreadIfNecessary();
  }
}
Multicore Locks, Final Version
class Lock {
  int locked = 0;
  ThreadQueue q;
  std::atomic<int> sync{0};
};

void Lock::lock() {
  IntrGuard guard;
  while (sync.exchange(1)) {}
  if (!locked) {
    locked = 1;
    sync = 0;
  } else {
    q.add(currentThread);
    currentThread->state = BLOCKED;
    sync = 0;
    blockThreadIfNecessary();
  }
}

void Lock::unlock() {
  IntrGuard guard;
  while (sync.exchange(1)) {}
  if (q.empty()) {
    locked = 0;
  } else {
    unblockThread(q.remove());
  }
  sync = 0;
}