
Programming with

Shared Memory
Nguyễn Quang Hùng
Outline
 Introduction
 Shared memory multiprocessors
 Constructs for specifying parallelism
 Creating concurrent processes
 Threads
 Sharing data
 Creating shared data
 Accessing shared data
 Language constructs for parallelism
 Dependency analysis
 Shared data in systems with caches
 Examples
 Pthreads example
 Exercises
Introduction
 This section focuses on programming for shared memory systems (e.g. SMP architectures).
 The discussion mainly covers:
 Multi-process programming: Unix/Linux fork(), wait()…
 Multithreaded programming: IEEE Pthreads, Java Thread…
Multiprocessor system
 Multiprocessor systems: two types
 Shared memory multiprocessor.
 Message-passing multicomputer.
 See the book “Parallel Programming: Techniques and Applications Using Networked Workstations and Parallel Computers”.
 Shared memory multiprocessor:
 SMP-based architectures: IBM RS/6000, IBM Blue Gene supercomputer, etc.

Read more & report: the IBM RS/6000 machine.
http://www-1.ibm.com/servers/eserver/pseries/hardware/whitepapers/power4.html
http://docs.hp.com/en/B6056-96002/ch01s01.html
Shared memory multiprocessor
system
 Based on SMP architecture.
 Any memory location can be accessible by any of
the processors.
 A single address space exists, meaning that each
memory location is given a unique address within
a single range of addresses.
 Generally, shared memory programming is more convenient, although it requires the programmer to control access to shared data (using critical sections: semaphores, locks, monitors…).
Shared memory multiprocessor
using a single bus
[Diagram: processors, each with a cache, connected by a single bus to the memory modules.]
• A small number of processors, perhaps up to 8.
• The bus is used by one processor at a time; bus contention increases with the number of processors.
Shared memory multiprocessor
using a crossbar switch
[Figure: IBM POWER4 chip logical view. Source: www.ibm.com]
Several alternatives for
programming shared memory
multiprocessors
 Using library routines with an existing sequential programming language.
 Multi-process programming: fork(), execv()…
 Multithread programming: IEEE Pthreads library, Java Thread (http://java.sun.com).

 Using a completely new programming language for parallel programming – not popular.
 High Performance Fortran, Fortran M, Compositional C++…

 Modifying the syntax of an existing sequential programming language to create a parallel programming language.

 Using an existing sequential programming language supplemented with compiler directives for specifying parallelism.
 OpenMP (http://www.openmp.org).
Multi-processes programming
 Operating systems are often based upon the notion of a process.

 Processor time is shared between processes, switching from one process to another. Switching might occur at regular intervals or when an active process becomes delayed.

 This offers the opportunity to de-schedule processes that are blocked from proceeding for some reason, e.g. waiting for an I/O operation to complete.

 The concept could be used for parallel programming. It is not much used because of the overhead, but fork/join concepts are used elsewhere.
FORK-JOIN construct
[Diagram: the main program executes successive FORK operations, each spawning a new process; each spawned process later meets a matching JOIN that merges it back into the main program.]
UNIX System Calls
 No join routine - use exit() and wait()

 SPMD model
...
pid = fork();                                   /* fork */
/* Code to be executed by both child and parent */
if (pid == 0) exit(0); else wait(0);            /* join */
...
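A minimal, self-contained sketch of the fork/join pattern above (the printed messages are illustrative, not from the original slides):

#include <stdio.h>
#include <stdlib.h>
#include <sys/wait.h>
#include <unistd.h>

int main(void) {
    pid_t pid = fork();                          /* fork: create child process */
    if (pid < 0) { perror("fork"); return 1; }

    if (pid == 0) {
        printf("child:  pid=%d\n", getpid());    /* work done by the child  */
        exit(0);                                 /* child terminates (join) */
    } else {
        printf("parent: pid=%d\n", getpid());    /* work done by the parent */
        wait(NULL);                              /* parent waits for child  */
    }
    return 0;
}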
UNIX System Calls (2)
 SPMD model: master-workers model.
…
pid = fork();
if (pid == 0) {
    /* Code to be executed by slave process */
} else {
    /* Code to be executed by master process */
}
if (pid == 0) exit(0); else wait(0);
...
Process vs thread
Process: a completely separate program with its own variables, stack, and memory allocation.

Threads: share the same memory space and global variables between routines.
IEEE Pthreads (1)
 IEEE Portable Operating System Interface,
POSIX, sec. 1003.1 standard
Executing a Pthread thread
Main program:
    pthread_create(&thread1, NULL, proc1, &arg);
    …
    pthread_join(thread1, &status);

Thread1:
    proc1(&arg)
    {
        …
        return(*status);
    }
The pthread_create() function
 #include <pthread.h>
 int pthread_create(
        pthread_t *threadid,
        const pthread_attr_t *attr,
        void *(*start_routine)(void *),
        void *arg);

 The pthread_create() function creates a new thread, storing an identifier for the new thread in the location pointed to by threadid.
The pthread_join() function
 #include <pthread.h>
 void pthread_exit(void *retval);
 int pthread_join(pthread_t threadid,
void **retval);

 The pthread_join() function suspends the calling thread until the thread specified by threadid terminates. The other thread’s return value is stored into the location pointed to by retval, if retval is not NULL.
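A minimal sketch putting pthread_create() and pthread_join() together (the routine name and messages are illustrative; link with -lpthread):

#include <pthread.h>
#include <stdio.h>

/* Thread routine: receives an int* argument and returns it unchanged. */
void *proc1(void *arg) {
    int *value = (int *) arg;
    printf("thread received %d\n", *value);
    return arg;
}

int main(void) {
    pthread_t thread1;
    int arg = 42;
    void *status;

    pthread_create(&thread1, NULL, proc1, &arg);     /* spawn the thread        */
    pthread_join(thread1, &status);                  /* wait for it to finish   */
    printf("thread returned %d\n", *(int *) status); /* read its return value   */
    return 0;
}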
Detached threads
 It may be that a thread does not need to know when a thread it creates terminates, in which case a join is not needed.
 Threads that are not joined are called detached threads.
 When detached threads terminate, they are destroyed and their resources are released.
Pthread detached threads
[Diagram: the main program calls pthread_create() several times with an attribute parameter specifying a detached thread; each detached thread runs to termination independently, with no join.]
The pthread_detach() function
 #include <pthread.h>
 int pthread_detach(pthread_t threadid);

• Puts a running thread into the detached state.
• You can no longer synchronize on the termination of thread threadid using pthread_join().
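A short sketch of creating a thread that is detached from the start by using a thread attribute (an alternative to calling pthread_detach() afterwards; names are illustrative):

#include <pthread.h>
#include <stdio.h>
#include <unistd.h>

void *worker(void *ignored) {
    printf("detached worker running\n");
    return NULL;   /* resources are released automatically on termination */
}

int main(void) {
    pthread_t tid;
    pthread_attr_t attr;

    pthread_attr_init(&attr);
    pthread_attr_setdetachstate(&attr, PTHREAD_CREATE_DETACHED);
    pthread_create(&tid, &attr, worker, NULL);   /* this thread cannot be joined */
    pthread_attr_destroy(&attr);

    sleep(1);      /* crude wait so the worker can run before main exits */
    return 0;
}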
Thread cancellation
 #include <pthread.h>
 int pthread_cancel(pthread_t thread);
 int pthread_setcancelstate(int state, int *oldstate);
 int pthread_setcanceltype(int type, int *oldtype);
 void pthread_testcancel(void);

• The pthread_cancel() function allows the current thread to cancel another thread, identified by thread.
• Cancellation is the mechanism by which a thread can terminate the execution of another thread. More precisely, a thread can send a cancellation request to another thread. Depending on its settings, the target thread can then either ignore the request, honor it immediately, or defer it until it reaches a cancellation point.
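A sketch of deferred cancellation (the default type): the worker loops and polls for cancellation with pthread_testcancel(); the loop body and timings are illustrative assumptions:

#include <pthread.h>
#include <stdio.h>
#include <unistd.h>

void *worker(void *ignored) {
    for (;;) {
        /* ... do a unit of work ... */
        pthread_testcancel();          /* explicit cancellation point */
    }
    return NULL;
}

int main(void) {
    pthread_t tid;
    pthread_create(&tid, NULL, worker, NULL);
    sleep(1);
    pthread_cancel(tid);               /* send a cancellation request */
    pthread_join(tid, NULL);           /* wait until it actually ends */
    printf("worker cancelled\n");
    return 0;
}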
Other Pthreads functions
 #include <pthread.h>
 int pthread_atfork(void (*prepare)(void), void (*parent)(void),
void (*child)(void));
Thread pools
 Master-Workers Model:
 A master thread controls a collection of worker threads.
 Dynamic thread pools.
 Static thread pools.
 Threads can communicate through shared locations or
signals.
Statement execution order
 Single processor: Processes/threads typically executed until blocked.
 Multiprocessor: Instructions of processes/threads interleaved in
time.
Example
Process 1 Process 2
Instruction 1.1 Instruction 2.1
Instruction 1.2 Instruction 2.2
Instruction 1.3 Instruction 2.3
 Several possible orderings, including
Instruction 1.1
Instruction 1.2
Instruction 2.1
Instruction 1.3
Instruction 2.2
Instruction 2.3
assuming instructions cannot be divided into smaller interruptible steps.
Statement execution order (2)
 If two processes were to print messages, for
example, the messages could appear in different
orders depending upon the scheduling of
processes calling the print routine.
 Worse, the individual characters of each message
could be interleaved if the machine instructions of
instances of the print routine could be interleaved.
Compiler/Processor optimization
 Compiler and processor reorder instructions for optimization.
 Example: the statements
a = b + 5;
x = y + 4;
could be compiled to execute in reverse order:
x = y + 4;
a = b + 5;
and still be logically correct.
It may be advantageous to delay the statement a = b + 5 because a previous instruction currently being executed in the processor needs more time to produce the value for b. It is very common for processors to execute machine instructions out of order for increased speed.
Thread-safe routines
 Routines are thread safe if they can be called from multiple threads simultaneously and always produce correct results.
 Standard I/O is thread safe:
 printf(): prints messages without interleaving the characters.
 NOT thread-safe functions:
 System routines that return the time may not be thread safe (see the localtime_r sketch after this list).
 Routines that access shared data may require
special care to be made thread safe.
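For instance, the classic time-conversion routine localtime() returns a pointer to a shared static buffer and is therefore not thread safe; POSIX provides localtime_r(), which writes into a caller-supplied buffer instead. A minimal sketch (function name print_current_time is illustrative):

#include <stdio.h>
#include <time.h>

void print_current_time(void) {
    time_t now = time(NULL);
    struct tm result;                 /* caller-owned buffer, one per thread */

    /* localtime(&now) would return a pointer to shared static storage;
       localtime_r() is the re-entrant, thread-safe variant. */
    localtime_r(&now, &result);

    printf("%02d:%02d:%02d\n", result.tm_hour, result.tm_min, result.tm_sec);
}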
SHARING DATA
SHARING DATA
 Every processor/thread can directly access shared variables and data structures, rather than having to pass data in messages.
 Solution for critical sections:
 Lock
 Mutex
 Semaphore
 Condition variables
 Monitor
Creating shared data
 UNIX processes: each process has its own virtual
address space within the virtual memory
management system.
 Shared memory system calls allow processes to attach
a segment of physical memory to their virtual memory
space.
 shmget() – creates a shared memory segment and returns its identifier (see the sketch after this slide).
 shmat() – attaches the segment and returns the starting address of the data segment.
 It is NOT necessary to create shared data items explicitly when using threads.
 Global variables are available to all threads.
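A hedged sketch of sharing a counter between a parent and a child process via the shmget()/shmat() calls above (key, size, and permissions are illustrative assumptions):

#include <stdio.h>
#include <stdlib.h>
#include <sys/ipc.h>
#include <sys/shm.h>
#include <sys/wait.h>
#include <unistd.h>

int main(void) {
    /* Create a shared memory segment large enough for one int. */
    int shmid = shmget(IPC_PRIVATE, sizeof(int), IPC_CREAT | 0600);
    if (shmid < 0) { perror("shmget"); return 1; }

    /* Attach the segment into this process's address space. */
    int *shared = (int *) shmat(shmid, NULL, 0);
    *shared = 0;

    if (fork() == 0) {          /* child inherits the attached segment */
        *shared = 42;
        exit(0);
    }
    wait(NULL);
    printf("value written by child: %d\n", *shared);

    shmdt(shared);                       /* detach the segment        */
    shmctl(shmid, IPC_RMID, NULL);       /* remove the segment        */
    return 0;
}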
Accessing shared data
 Accessing shared data needs careful control.
 Consider two processes, each of which is to add one to a shared data item, x. It is necessary for the contents of location x to be read, x + 1 computed, and the result written back to the location:

Instruction      Process 1          Process 2
x = x + 1;       read x             read x
                 compute x + 1      compute x + 1
                 write to x         write to x
(time increases downward)
Conflict in accessing shared
variable
[Diagram: Process 1 and Process 2 each read the shared variable x, add 1, and write the result back; with an unfortunate interleaving, one of the increments is lost. A runnable sketch of this race follows below.]
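A hedged Pthreads sketch that usually exposes this lost-update race (the iteration count is illustrative; without a mutex around the increment, the final total is typically well below 2,000,000):

#include <pthread.h>
#include <stdio.h>

#define ITERATIONS 1000000

int x = 0;                          /* shared variable */

void *increment(void *ignored) {
    for (int i = 0; i < ITERATIONS; i++) {
        x = x + 1;                  /* read, compute, write: not atomic */
    }
    return NULL;
}

int main(void) {
    pthread_t p1, p2;
    pthread_create(&p1, NULL, increment, NULL);
    pthread_create(&p2, NULL, increment, NULL);
    pthread_join(p1, NULL);
    pthread_join(p2, NULL);
    printf("x = %d (expected %d)\n", x, 2 * ITERATIONS);
    return 0;
}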
Critical section
 A mechanism for ensuring that only one process
accesses a particular resource at a time is to establish
sections of code involving the resource as so-called
critical sections and arrange that only one such
critical section is executed at a time

 This mechanism is known as mutual exclusion.

 This concept also appears in operating systems.
Locks
 Simplest mechanism for ensuring mutual exclusion
of critical sections.
 A lock is a 1-bit variable that is a 1 to indicate that
a process has entered the critical section and a 0 to
indicate that no process is in the critical section.
 Operates much like that of a door lock:
 A process coming to the “door” of a critical section and
finding it open may enter the critical section, locking the
door behind it to prevent other processes from entering.
Once the process has finished the critical section, it
unlocks the door and leaves.
Control of critical sections through
busy waiting
Process 1:
    while (lock == 1) do_nothing;   /* finds lock == 0 and enters immediately */
    lock = 1;
    /* critical section */
    lock = 0;

Process 2:
    while (lock == 1) do_nothing;   /* spins while Process 1 holds the lock */
    lock = 1;
    /* critical section */
    lock = 0;
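Note that the read-then-set sequence above is itself a race: two processes can both see lock == 0 and enter together. Real spin locks use an atomic test-and-set. A hedged sketch using C11 atomics (not from the slides):

#include <stdatomic.h>

atomic_flag lock = ATOMIC_FLAG_INIT;

void enter_critical_section(void) {
    /* Atomically set the flag and return its previous value;
       keep spinning while someone else already holds it. */
    while (atomic_flag_test_and_set(&lock))
        ;  /* busy wait */
}

void leave_critical_section(void) {
    atomic_flag_clear(&lock);
}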
Pthreads lock functions
 Pthreads implements locks with mutually exclusive lock variables (mutex variables).
pthread_mutex_t mutex1;
pthread_mutex_init(&mutex1, NULL);
……
pthread_mutex_lock(&mutex1);     /* only one thread at a time can enter the
                                    critical section; the others wait here */

/* Critical section code here */

pthread_mutex_unlock(&mutex1);   /* only the thread that locked the mutex can
                                    unlock it; otherwise an error results */
IEEE Pthreads example
 Calculating sum of an array a[ ].
 N threads are created, each taking numbers from the list to add to its partial sum. When all numbers have been taken, the threads add their partial results to a shared location, sum.
 The shared location global_index is used by each thread to select the next element of a[ ].
 After the index is read, it is incremented in preparation for the next element to be read. The result location is sum, as before; it must also be shared and its access protected by a lock.
IEEE Pthreads example (2)
 Calculating sum of an array a[ ].

[Diagram: the shared array a[ ], the shared index global_index used to select the next element, and the shared result location sum. Code at page 254 of the textbook.]
IEEE Pthreads example (3)
#include <stdio.h>
#include <pthread.h>
#define ARRAY_SIZE 1000
#define NUM_THREADS 10

// Global variables, shared data
int a[ARRAY_SIZE];
int global_index = 0;
int sum = 0;

pthread_mutex_t mutex1;                    // mutually exclusive lock variable
pthread_t worker_threads[NUM_THREADS];
IEEE Pthreads example (4)
// Worker thread
void *worker(void *ignored) {
    int local_index, partial_sum = 0;
    do {
        pthread_mutex_lock(&mutex1);
        local_index = global_index;
        global_index++;
        pthread_mutex_unlock(&mutex1);
        if (local_index < ARRAY_SIZE) {
            partial_sum += a[local_index];
        }
    } while (local_index < ARRAY_SIZE);
    pthread_mutex_lock(&mutex1);
    sum += partial_sum;
    pthread_mutex_unlock(&mutex1);
    return NULL;
}
IEEE Pthreads example (5)
void master() {
    int i;
    // Initialize mutex
    pthread_mutex_init(&mutex1, NULL);
    init_data();
    create_workers(NUM_THREADS);
    // Join threads
    for (i = 0; i < NUM_THREADS; i++) {
        if (pthread_join(worker_threads[i], NULL) != 0) {
            perror("Pthread join fails");
        }
    }
    printf("The sum of 1 to %i is %d\n", ARRAY_SIZE, sum);
}
IEEE Pthreads example (6)
void init_data() {
    int i;
    for (i = 0; i < ARRAY_SIZE; i++) { a[i] = i + 1; }
}

// Create some worker threads
void create_workers(int n) {
    int i;
    for (i = 0; i < n; i++) {
        if (pthread_create(&worker_threads[i], NULL, worker, NULL) != 0) {
            perror("Pthreads create fails");
        }
    }
}
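The slides do not show an entry point or how the program is built; a minimal sketch, assuming the functions above live in one source file (file and binary names are illustrative):

int main(void) {
    master();    /* initializes the mutex, fills a[ ], creates and joins the workers */
    return 0;
}

Compile and run with something like: gcc sum.c -o sum -lpthread && ./sum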
Java multithread programming
 A class that extends the java.lang.Thread class.
 A class that implements the java.lang.Runnable interface.
// A sample Runner class
public class Runner extends Thread {
    String name;

    public Runner(String name) {
        this.name = name;
    }

    public void run() {
        int N = 10;
        for (int i = 0; i < N; i++) {
            System.out.println("I am " + this.name + " runner at " + i + " km.");
            try { Thread.sleep(100); } catch (InterruptedException e) { }
        }
    }

    public static void main(String[] args) {
        Runner hung = new Runner("Hung");
        Runner minh = new Runner("Minh");
        Runner ken = new Runner("Ken");
        hung.start();
        minh.start();
        ken.start();
        System.out.println("Hello World!");
    } // End main
}
Language Constructs for
Parallelism
Language Constructs for Parallelism -
Shared Data
Shared Data:
 Shared memory variables might be declared as shared with, say:
    shared int x;

Par Construct:
    par {
        S1;
        S2;
        .
        .
        Sn;
    }

    par {
        proc1();
        proc2();
    }
Forall Construct
 Keywords: forall or parfor.
 To start multiple similar processes together:
    forall (i = 0; i < N; i++) {
        S1;
        S2;
        …
        Sm;
    }
 generates n processes, each consisting of the statements forming the body of the for loop, S1, S2, …, Sm. Each process uses a different value of i.

 Example:
    forall (i = 0; i < 5; i++)
        a[i] = 0;
 clears a[0], a[1], a[2], a[3], and a[4] to zero concurrently.
Dependency analysis
 To identify which processes could be executed
together.
 Example: can see immediately in the code
forall (i = 0; i < 5; i++)
a[i] = 0;
 that every instance of the body is independent of
other instances and all instances can be executed
simultaneously.
 However, it may not always be that obvious. A parallelizing compiler needs an algorithmic way of recognizing dependencies.
Bernstein's Conditions
 Set of conditions sufficient to determine whether two
processes can be executed simultaneously. Given:
 Ii is the set of memory locations read (input) by process Pi.
 Oj is the set of memory locations written (output) by process Pj.
 For two processes P1 and P2 to be executed
simultaneously, inputs to process P1 must not be part of
outputs of P2, and inputs of P2 must not be part of
outputs of P1; i.e.,
 I1 ∩ O2 = φ
 I2 ∩ O1 = φ
 where φ is the empty set. The sets of outputs of each process
must also be different; i.e.,
 O1 ∩ O2 = φ
 If the three conditions are all satisfied, the two processes
can be executed concurrently.
Example
 Example: suppose the two statements are (in C)
 a = x + y;
 b = x + z;
 We have
 I1 = (x, y) O1 = (a)
 I2 = (x, z) O2 = (b)
 and the conditions
 I1 ∩ O2 = φ
 I2 ∩ O1 = φ
 O1 ∩ O2 = φ
 are satisfied. Hence, the statements a = x + y and b = x +
z can be executed simultaneously.
OpenMP
 An accepted standard developed in the late 1990s by a
group of industry specialists.
 Consists of a small set of compiler directives, augmented
with a small set of library routines and environment
variables using the base language Fortran and C/C++.
 The compiler directives can specify such things as the par
and forall operations described previously.
 Several OpenMP compilers available.
 Exercise: read more & report:
 http://www.openmp.org
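As an illustration (not from the slides), the forall loop shown earlier corresponds roughly to an OpenMP parallel for directive in C; a minimal sketch:

#include <omp.h>
#include <stdio.h>

#define N 5

int main(void) {
    int a[N];

    /* Each iteration is independent, so the loop can be split among threads. */
    #pragma omp parallel for
    for (int i = 0; i < N; i++) {
        a[i] = 0;
    }

    printf("cleared %d elements using up to %d threads\n", N, omp_get_max_threads());
    return 0;
}

Compiled with an OpenMP flag such as gcc -fopenmp.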
Shared Memory Programming
Performance Issues
 Shared data in systems with caches
 Cache coherence protocols
 False Sharing:
 Solution: compiler to alter the layout of the data stored in the
main memory, separating data only altered by one processor
into different blocks.
 High performance programs should have as few critical sections as possible, since their use can serialize the code.
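A hedged sketch of the false-sharing fix mentioned above: padding per-thread counters so each one sits in its own cache line (the 64-byte line size, names, and iteration count are assumptions, not from the slides):

#include <pthread.h>
#include <stdio.h>

#define CACHE_LINE 64        /* assumed cache line size in bytes */
#define NUM_THREADS 4

/* Each counter is padded so two threads never write to the same cache line. */
struct padded_counter {
    long value;
    char pad[CACHE_LINE - sizeof(long)];
};

struct padded_counter counters[NUM_THREADS];

void *work(void *arg) {
    int id = *(int *) arg;
    for (long i = 0; i < 10000000; i++)
        counters[id].value++;        /* no false sharing between threads */
    return NULL;
}

int main(void) {
    pthread_t tid[NUM_THREADS];
    int ids[NUM_THREADS];
    for (int i = 0; i < NUM_THREADS; i++) {
        ids[i] = i;
        pthread_create(&tid[i], NULL, work, &ids[i]);
    }
    for (int i = 0; i < NUM_THREADS; i++)
        pthread_join(tid[i], NULL);
    printf("counter 0 = %ld\n", counters[0].value);
    return 0;
}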
Sequential Consistency
 Formally defined by Lamport (1979):
 A multiprocessor is sequentially consistent if the result of any execution is the same as if the operations of all the processors were executed in some sequential order, and the operations of each individual processor occur in this sequence in the order specified by its program.
 i.e. the overall effect of a parallel program is not
changed by any arbitrary interleaving of
instruction execution in time.
Sequential consistency (2)
[Diagram: processors (programs) issuing operations to a shared memory.]
Sequential consistency (2)
 Writing a parallel program for a system which is known
to be sequentially consistent enables us to reason about
the result of the program. For example:
Process P1:
    data = new;
    flag = TRUE;

Process P2:
    while (flag != TRUE) { };
    data_copy = data;

We expect data_copy to be set to new, because we expect the statement data = new to be executed before flag = TRUE, and the statement while (flag != TRUE) { } to be executed before data_copy = data. This ensures that Process 2 reads the new data produced by Process 1; Process 2 will simply wait for the new data to be produced.
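Modern hardware and compilers do not guarantee sequential consistency by default; a hedged C11 sketch (names are illustrative, not from the slides) of how the same flag/data pattern can be made safe with sequentially consistent atomics:

#include <stdatomic.h>

int data;
atomic_bool flag = false;

/* Process/thread 1 */
void producer(void) {
    data = 42;                         /* write the data first            */
    atomic_store(&flag, true);         /* seq_cst store: publishes data   */
}

/* Process/thread 2 */
int consumer(void) {
    while (!atomic_load(&flag))        /* seq_cst load: waits for flag    */
        ;                              /* busy wait                       */
    return data;                       /* guaranteed to see the new value */
}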