
Kernel Synchronization in Linux

(Chap. 5 in Understanding the Linux Kernel)
J. H. Wang
Sep. 29, 2011
Outline
• Kernel Control Paths
• When Synchronization is not Necessary
• Synchronization Primitives
• Synchronizing Accesses to Kernel Data
Structures
• Examples of Race Condition Prevention
Kernel Control Paths
• Linux kernel: like a server that answers requests
– Parts of the kernel are run in an interleaved way
• A kernel control path: a sequence of instructions executed in Kernel Mode on behalf of the current process
– Triggered by system calls, interrupts, or exceptions
– Lighter than a process (less context)
Example Kernel Control Paths
• Three CPU states are considered
– Running a process in User Mode (User)
– Running an exception or a system call handler
(Excp)
– Running an interrupt handler (Intr)
Kernel Preemption
• Preemptive kernel: a process running in kernel
mode can be replaced by another process while
in the middle of a kernel function
• The main motivation for making a kernel
preemptive is to reduce the dispatch latency of the
user mode processes
– Delay between the time they become runnable and
the time they actually begin running
• The kernel can be preempted only when it is
executing an exception handler (in particular a
system call) and the kernel preemption has not
been explicitly disabled
When Synchronization is Necessary
• A race condition can occur when the outcome of a computation depends on how two or more interleaved kernel control paths are nested
• The goal is to identify and protect the critical regions in exception handlers, interrupt handlers, deferrable functions, and kernel threads
– On a single CPU, a critical region can be implemented by disabling interrupts while accessing the shared data
– If the same data is shared only by the service routines of system calls, a critical region can be implemented by disabling kernel preemption while accessing the shared data
• Things are more complicated on multiprocessor systems
– Different synchronization techniques are necessary
When Synchronization is not Necessary
• The same interrupt cannot occur until the
handler terminates
• Interrupt handlers and softirqs are non-
preemptable, non-blocking
• A kernel control path performing interrupt
handling cannot be interrupted by a kernel
control path executing a deferrable function or a
system call service routine
• Softirqs cannot be interleaved
Synchronization Primitives
Technique | Description | Scope
Per-CPU variables | Duplicate a data structure among the CPUs | All CPUs
Atomic operation | Atomic read-modify-write instruction | All CPUs
Memory barrier | Avoid instruction reordering | Local CPU
Spin lock | Lock with busy wait | All CPUs
Semaphore | Lock with blocking wait (sleep) | All CPUs
Seqlocks | Lock based on an access counter | All CPUs
Local interrupt disabling | Forbid interrupt handling on a single CPU | Local CPU
Local softirq disabling | Forbid deferrable functions on a single CPU | Local CPU
Read-copy-update (RCU) | Lock-free access to shared data structures through pointers | All CPUs
Per-CPU Variables
• The simplest and most efficient synchronization
technique consists of declaring kernel variables as per-
CPU variables
– an array of data structures, one element per CPU in the system
– A CPU should not access the elements of the array corresponding to the other CPUs
• While per-CPU variables provide protection against
concurrent accesses from several CPUs, they do not
provide protection against accesses from asynchronous
functions (interrupt handlers and deferrable functions)
• Per-CPU variables are prone to race conditions caused
by kernel preemption, both in uniprocessor and
multiprocessor systems
Functions and Macros for the Per-CPU Variables
Macro/function name | Description
DEFINE_PER_CPU(type, name) | Statically allocates a per-CPU array
per_cpu(name, cpu) | Selects the element for the given CPU of the per-CPU array
__get_cpu_var(name) | Selects the local CPU's element of the per-CPU array
get_cpu_var(name) | Disables kernel preemption, then selects the local CPU's element of the per-CPU array
put_cpu_var(name) | Enables kernel preemption
alloc_percpu(type) | Dynamically allocates a per-CPU array
free_percpu(pointer) | Releases a dynamically allocated per-CPU array
per_cpu_ptr(pointer, cpu) | Returns the address of the element for the given CPU of the per-CPU array
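A minimal sketch (not from the slides) of how these macros might be used on a 2.6-era kernel; the variable name pkt_count and the counting function are illustrative:

    #include <linux/percpu.h>

    /* Statically allocate one unsigned long per CPU */
    DEFINE_PER_CPU(unsigned long, pkt_count);

    static void count_packet(void)
    {
            /* get_cpu_var() disables kernel preemption, so the kernel
             * control path cannot migrate to another CPU between the
             * read and the write of its own element */
            get_cpu_var(pkt_count)++;
            put_cpu_var(pkt_count);    /* re-enable kernel preemption */
    }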
Atomic Operations
• Atomic 80x86 instructions
– Instructions that make zero or one aligned memory access
– Read-modify-write instructions (such as inc or dec)
– Read-modify-write instructions whose opcode is prefixed by the lock byte (0xf0)
– Assembly instructions whose opcode is prefixed by a rep byte (0xf2, 0xf3) are not atomic
• atomic_t type: an atomically accessible counter (a 32-bit integer on the 80x86)
• Atomic operations in Linux:
• Atomic operations in Linux:
Function Description
atomic_read(v) Return *v
atomic_set(v,i) set *v to i
atomic_add(i,v) add i to *v
atomic_sub(i,v) subtract i from *v
atomic_sub_and_test(i,v) subtract i from *v and return 1 if result is 0
atomic_inc(v) add 1 to *v
atomic_dec(v) subtract 1 from *v
atomic_dec_and_test(v) subtract 1 from *v and return 1 if result is 0
atomic_inc_and_test(v) add 1 to *v and return 1 if result is 0
atomic_add_negative(i,v) add i to *v and return 1 if result is negative
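A short illustrative sketch of a reference counter built on these operations; the names refcnt, get_ref, and put_ref are made up for the example, and the header location varies by kernel version:

    #include <linux/kernel.h>
    #include <asm/atomic.h>     /* <linux/atomic.h> in later kernels */

    static atomic_t refcnt = ATOMIC_INIT(1);

    static void get_ref(void)
    {
            atomic_inc(&refcnt);                 /* add 1 to *v atomically */
    }

    static void put_ref(void)
    {
            /* subtract 1 and test: returns 1 only for the one path that
             * drops the counter to 0, with no window for races */
            if (atomic_dec_and_test(&refcnt))
                    printk(KERN_INFO "last reference dropped\n");
    }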
Atomic Bit Handling Functions
Function Description
test_bit(nr, addr) return the nrth bit of *addr
set_bit(nr, addr) set the nrth bit of *addr
clear_bit(nr, addr) clear the nrth bit of *addr
change_bit(nr, addr) invert the nrth bit of *addr
test_and_set_bit(nr, addr) set nrth bit of *addr and return old value
test_and_clear_bit(nr, addr) clear nrth bit of *addr and return old value
test_and_change_bit(nr, addr) invert nrth bit of *addr and return old value
atomic_clear_mask(mask, addr) clear all bits of addr specified by mask
atomic_set_mask(mask, addr) set all bits of addr specified by mask
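For example, a one-bit "busy" flag could be claimed and released atomically as in this sketch; the bitmap and bit number are illustrative:

    #include <linux/bitops.h>

    #define FLAG_BUSY 0

    static unsigned long flags_word;   /* illustrative bitmap */

    static int try_claim(void)
    {
            /* atomically set bit 0 and return its old value:
             * only the first caller sees the bit still clear */
            return !test_and_set_bit(FLAG_BUSY, &flags_word);
    }

    static void release_claim(void)
    {
            clear_bit(FLAG_BUSY, &flags_word);
    }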
Memory Barriers
• When dealing with synchronization, instruction reordering must be avoided
• A memory barrier primitive ensures that the operations placed before the primitive are finished before starting the operations placed after the primitive
• On the 80x86, the following kinds of instructions act as memory barriers:
– All instructions that operate on I/O ports
– All instructions prefixed by the lock byte
– All instructions that write into control registers, system registers, or debug registers
– A few special instructions, e.g., iret
– The lfence, sfence, and mfence instructions (Pentium 4 and later)
Memory Barriers in Linux
Macro Description
mb() Memory barrier for MP and UP
rmb() Read memory barrier for MP, UP
wmb() Write memory barrier for MP, UP
smp_mb() Memory barrier for MP only
smp_rmb() Read memory barrier for MP only
smp_wmb() Write memory barrier for MP only
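A classic producer/consumer sketch showing where the barriers go; the code is illustrative, not from the slides, and smp_wmb()/smp_rmb() come from <asm/system.h> on 2.6-era kernels:

    static int data;
    static int ready;

    static void producer(void)
    {
            data = 42;
            smp_wmb();        /* write barrier: data must be visible before ready */
            ready = 1;
    }

    static int consumer(void)
    {
            if (!ready)
                    return -1;
            smp_rmb();        /* read barrier: do not read data before ready */
            return data;
    }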
Spin Locks
• Spin locks are a special kind of lock designed to work in a multiprocessor environment
– The waiting kernel control path busy-waits (spins) instead of sleeping
– Convenient when the protected region is small, so the lock is held only for a very short time
– Represented by the spinlock_t structure
• slock: 1 – unlocked, <=0 – locked
• break_lock: flag signaling that a process is busy waiting for the lock
Protecting Critical Regions with Several Locks
Spin Lock Macros
Macro | Description
spin_lock_init() | Set the spin lock to 1 (unlocked)
spin_lock() | Cycle until the spin lock becomes 1, then set it to 0 (locked)
spin_unlock() | Set the spin lock to 1
spin_unlock_wait() | Wait until the spin lock becomes 1
spin_is_locked() | Return 0 if the spin lock is set to 1
spin_trylock() | Set the spin lock to 0 (locked), and return 1 if the lock is obtained
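A minimal usage sketch of the macros above; the counter and its lock are illustrative:

    #include <linux/spinlock.h>

    static spinlock_t counter_lock;
    static unsigned long counter;

    static void counter_setup(void)
    {
            spin_lock_init(&counter_lock);   /* set to 1 (unlocked) */
    }

    static void counter_bump(void)
    {
            spin_lock(&counter_lock);        /* busy-wait until the lock is free */
            counter++;                       /* critical region */
            spin_unlock(&counter_lock);
    }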
Read/Write Spin Locks
• Introduced to increase the amount of concurrency in the kernel
– Many kernel control paths may read concurrently, but only one may write at a time
• rwlock_t structure
– lock field: 32 bits
• A 24-bit counter (bits 0-23): number of kernel control paths currently reading the protected data (in two's complement)
• An "unlocked" flag (bit 24)
• Macros
– read_lock(), read_unlock()
– write_lock(), write_unlock()
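A sketch of reader and writer paths protected by a read/write spin lock; the table array and function names are illustrative:

    #include <linux/spinlock.h>

    static rwlock_t table_lock;     /* initialized with rwlock_init(&table_lock) */
    static int table[16];

    static int table_lookup(int i)
    {
            int v;

            read_lock(&table_lock);      /* many readers may hold the lock at once */
            v = table[i];
            read_unlock(&table_lock);
            return v;
    }

    static void table_update(int i, int v)
    {
            write_lock(&table_lock);     /* the writer waits for all readers to drain */
            table[i] = v;
            write_unlock(&table_lock);
    }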
Read/Write Spin Locks
Seqlock
• Seqlocks introduced in Linux 2.6 are
similar to read/write spin locks
– except that they give a much higher priority
to writers
– a writer is allowed to proceed even when
readers are active
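A sketch of the usual seqlock pattern with the reader retry loop; the time_stamp variable and function names are illustrative:

    #include <linux/seqlock.h>

    static seqlock_t time_lock;               /* initialized with seqlock_init() */
    static unsigned long long time_stamp;

    static void time_writer(unsigned long long now)
    {
            write_seqlock(&time_lock);        /* the writer never waits for readers */
            time_stamp = now;
            write_sequnlock(&time_lock);
    }

    static unsigned long long time_reader(void)
    {
            unsigned long long v;
            unsigned int seq;

            do {
                    seq = read_seqbegin(&time_lock);  /* sample the sequence counter */
                    v = time_stamp;
            } while (read_seqretry(&time_lock, seq)); /* retry if a writer ran meanwhile */

            return v;
    }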
Read-Copy Update
• Read-copy update (RCU): another synchronization
technique designed to protect data structures
that are mostly accessed for reading by several
CPUs
– RCU allows many readers and many writers to
proceed concurrently
– RCU is lock-free
• Key ideas
– Only data structures that are dynamically allocated
and referenced via pointers can be protected by RCU
– No kernel control path can sleep inside a critical
section protected by RCU
• Macros
– rcu_read_lock()
– rcu_read_unlock()
– call_rcu()
• RCU
– New in Linux 2.6
– Used in networking layer and VFS
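A sketch of the typical RCU read/update pattern for a structure reached via a pointer. Besides the macros listed on the slide, it uses rcu_dereference(), rcu_assign_pointer(), and the kernel's call_rcu(); the struct and function names are illustrative, and writers are assumed to be serialized by some other lock:

    #include <linux/kernel.h>
    #include <linux/rcupdate.h>
    #include <linux/slab.h>

    struct config {
            int value;
            struct rcu_head rcu;
    };

    static struct config *cur_cfg;      /* dynamically allocated, reached via pointer */

    static int cfg_read(void)
    {
            int v;

            rcu_read_lock();                        /* no sleeping inside this region */
            v = rcu_dereference(cur_cfg)->value;
            rcu_read_unlock();
            return v;
    }

    static void cfg_free(struct rcu_head *head)
    {
            kfree(container_of(head, struct config, rcu));
    }

    static void cfg_update(int v)       /* writers must already exclude each other */
    {
            struct config *new_cfg = kmalloc(sizeof(*new_cfg), GFP_KERNEL);
            struct config *old_cfg = cur_cfg;

            if (!new_cfg)
                    return;
            new_cfg->value = v;
            rcu_assign_pointer(cur_cfg, new_cfg);   /* publish the new copy */
            call_rcu(&old_cfg->rcu, cfg_free);      /* free the old copy after readers finish */
    }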
Semaphores
• Two kinds of semaphores
– Kernel semaphores: by kernel control paths
– System V IPC semaphores: by user processes
• Kernel semaphores
– struct semaphore
• count
• wait
• sleepers
– down(): to acquire a kernel semaphore (similar to wait)
– up(): to release a kernel semaphore (similar to signal)
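A minimal sketch of a kernel semaphore used as a mutex; the cfg_sem name and functions are illustrative, and the header moved to <linux/semaphore.h> in later kernels:

    #include <asm/semaphore.h>

    static struct semaphore cfg_sem;     /* count, wait, sleepers fields, as above */
    static int cfg_value;

    static void cfg_setup(void)
    {
            sema_init(&cfg_sem, 1);      /* count = 1: behaves as a mutex */
    }

    static void cfg_write(int v)
    {
            down(&cfg_sem);              /* acquire; may sleep, so never in interrupt context */
            cfg_value = v;
            up(&cfg_sem);                /* release; wakes up one sleeper, if any */
    }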
Read/Write Semaphores
• Similar to read/write spin locks
– except that waiting processes are suspended instead of spinning
• struct rw_semaphore
– count
– wait_list
– wait_lock
• init_rwsem()
• down_read(), down_write(): acquire a read/write
semaphore
• up_read(), up_write(): release a read/write semaphore
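An illustrative sketch in the style of mmap_sem; the names are made up:

    #include <linux/rwsem.h>

    static struct rw_semaphore map_sem;   /* initialized with init_rwsem(&map_sem) */
    static int mapping_count;

    static int map_count_read(void)
    {
            int n;

            down_read(&map_sem);      /* concurrent readers; waiters sleep instead of spinning */
            n = mapping_count;
            up_read(&map_sem);
            return n;
    }

    static void map_add(void)
    {
            down_write(&map_sem);     /* exclusive access for the writer */
            mapping_count++;
            up_write(&map_sem);
    }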
Completions
• Introduced to solve a subtle race condition in multiprocessor systems between up() and the release of the semaphore data structure
– Otherwise similar to semaphores
• struct completion
– done
– wait
• complete(): corresponding to up()
• wait_for_completion(): corresponding to
down()
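A sketch of the usual pattern: one path waits, another signals; the io_done name is illustrative:

    #include <linux/completion.h>

    static DECLARE_COMPLETION(io_done);     /* done = 0, empty wait queue */

    static void io_finished(void)           /* e.g., called when the work is done */
    {
            complete(&io_done);             /* the up()-like operation */
    }

    static void io_wait(void)
    {
            wait_for_completion(&io_done);  /* the down()-like operation: sleeps until complete() */
    }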
Local Interrupt Disabling
• Interrupts can be disabled on a CPU with
cli instruction
– local_irq_disable() macro
• Interrupts can be enabled by sti
instruction
– local_irq_enable() macro
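A sketch of protecting data shared with an interrupt handler on the local CPU; the function name is illustrative:

    static void touch_irq_shared_data(void)
    {
            local_irq_disable();    /* cli: no interrupt handler can run on this CPU */
            /* ... access data also used by an interrupt handler ... */
            local_irq_enable();     /* sti */
    }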
Disabling/Enabling Deferrable
Functions
• Deferrable functions (“softirqs” and tasklets)
• The kernel sometimes needs to disable
deferrable functions without disabling
interrupts
– local_bh_disable() macro
– local_bh_enable() macro
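And the corresponding sketch for data shared with a softirq or tasklet; again the function name is illustrative:

    #include <linux/interrupt.h>

    static void touch_softirq_shared_data(void)
    {
            local_bh_disable();     /* deferrable functions stay off on this CPU */
            /* ... access data also used by a softirq or tasklet ... */
            local_bh_enable();      /* interrupts themselves were never disabled */
    }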
Synchronizing Accesses to Kernel
Data Structures
• Rule of thumb for kernel developers:
– Always keep the concurrency level as high as
possible in the system
– The concurrency level depends on two factors:
• The number of I/O devices that operate
concurrently
• The number of CPUs that do productive work
• A shared data structure consisting of a
single integer value can be updated by
declaring it as an atomic_t type and by
using atomic operations
• Inserting an element into a shared linked
list is never atomic since it consists of at
least two pointer assignments
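For example, a shared list insertion needs a lock around the (at least) two pointer assignments; a sketch with illustrative names:

    #include <linux/list.h>
    #include <linux/spinlock.h>

    struct item {
            struct list_head node;
            int key;
    };

    static LIST_HEAD(item_list);
    static spinlock_t list_lock;     /* initialized with spin_lock_init(&list_lock) */

    static void item_insert(struct item *it)
    {
            spin_lock(&list_lock);            /* the pointer updates must not interleave */
            list_add(&it->node, &item_list);
            spin_unlock(&list_lock);
    }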
Choosing among Spin Locks, Semaphores, and Interrupt Disabling
Kernel control paths | UP protection | MP further protection
Exceptions | Semaphore | None
Interrupts | Local interrupt disabling | Spin lock
Deferrable functions | None | None or spin lock
Exceptions + interrupts | Local interrupt disabling | Spin lock
Exceptions + deferrable functions | Local softirq disabling | Spin lock
Interrupts + deferrable functions | Local interrupt disabling | Spin lock
Exceptions + interrupts + deferrable functions | Local interrupt disabling | Spin lock
Interrupt-aware Spin Lock Macros
• spin_lock_irq(l), spin_unlock_irq(l)
• spin_lock_bh(l), spin_unlock_bh(l)
• spin_lock_irqsave(l,f), spin_unlock_irqrestore(l,f)
• read_lock_irq(l), read_unlock_irq(l)
• read_lock_bh(l), read_unlock_bh(l)
• write_lock_irq(l), write_unlock_irq(l)
• write_lock_bh(l), write_unlock_bh(l)
• read_lock_irqsave(l,f), read_unlock_irqrestore(l,f)
• write_lock_irqsave(l,f), write_unlock_irqrestore(l,f)
• read_seqbegin_irqsave(l,f), read_seqretry_irqrestore(l,f)
• write_seqlock_irqsave(l,f), write_sequnlock_irqrestore(l,f)
• write_seqlock_irq(l), write_sequnlock_irq(l)
• write_seqlock_bh(l), write_sequnlock_bh(l)
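A sketch of the most common case, spin_lock_irqsave()/spin_unlock_irqrestore(), for data shared between process context and an interrupt handler; the names are illustrative:

    #include <linux/spinlock.h>

    static spinlock_t dev_lock;      /* initialized with spin_lock_init(&dev_lock) */
    static int dev_state;

    static void update_from_process_context(void)
    {
            unsigned long flags;

            /* take the lock and disable local interrupts, saving their previous
             * state, so the handler below cannot deadlock on the same lock */
            spin_lock_irqsave(&dev_lock, flags);
            dev_state++;
            spin_unlock_irqrestore(&dev_lock, flags);
    }

    static void update_from_interrupt_handler(void)
    {
            spin_lock(&dev_lock);    /* local interrupts are already disabled here */
            dev_state++;
            spin_unlock(&dev_lock);
    }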
Examples of Race Condition
Prevention
• Reference counters: an atomic_t counter associated with
a specific resource
• The global kernel lock (a.k.a. big kernel lock, or BKL)
– lock_kernel(), unlock_kernel()
– Mostly used in early kernel versions; kept in Linux 2.6 to protect old code (related to the VFS and several file systems)
• Memory descriptor read/write semaphore
– mmap_sem field in mm_struct
• Slab cache list semaphore
– cache_chain_sem semaphore
• Inode semaphore
– i_sem field
• When a program uses two or more semaphores, the potential for deadlock is present, because two different paths could end up waiting for each other
– Linux has few problems with deadlocks on semaphore requests, since each path usually acquires just one semaphore
– However, a few cases, such as the rmdir() and rename() system calls, require two semaphore requests
– To avoid such deadlocks, the semaphore requests are performed in a predefined address order
Thanks for Your Attention!
