Simplified Design Flow: (A Picture From Ingo Sander)
Hardware architecture
So far, we have only talked about single-processor systems
Concurrency implemented by scheduling
Distributed Systems
Processors loosely connected by a low-speed network, e.g. CAN, Ethernet, Token Ring, etc.
[Figure: a complete distributed system connected by a LAN]
Task Assignment
In the design flow:
First, the application is partitioned into tasks or task graphs.
At some stage, the execution times, communication costs, data
and control dependencies of all the tasks become known.
Task Assignment
The task models used in task assignment can vary in complexity, depending on what is considered or ignored (a sketch of such a task record follows after this list):
Communication costs
Data and control dependencies
Resource requirements, e.g. WCET, memory, etc.
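As a rough illustration (not from the slides), such a task model could be captured in a record like the one below; the field names are assumptions chosen to mirror the attributes listed above.

from dataclasses import dataclass, field
from typing import Dict, List

# Hypothetical task record for task assignment; field names are assumptions.
@dataclass
class Task:
    name: str
    wcet: float                                   # worst-case execution time
    period: float                                 # period / minimum inter-arrival time
    memory: int = 0                               # resource requirement, e.g. bytes
    predecessors: List[str] = field(default_factory=list)      # data/control dependencies
    comm_cost: Dict[str, float] = field(default_factory=dict)  # cost of sending data to successors

    @property
    def utilization(self) -> float:
        return self.wcet / self.period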
Today's plan
Why multiprocessor?
OS etc.
Task Assignment
Multiprocessor scheduling
(semi-)partitioned scheduling
global scheduling
Hardware: Trends
Multicore requires parallel applications
[Figure: performance (log scale) vs. year, showing single-core performance leveling off around now]
4 cores in notebooks
12 cores in servers
The AMD 12-core Magny-Cours consumes less energy than previous generations with 6 cores
Embedded Systems
4 cores in ARM11 MPCore embedded processors
What next?
Manycores (hundreds of cores) are predicted to arrive within a few years, e.g. at Ericsson
[Figure: a multicore chip with several CPUs, each with a private L1 cache, sharing an L2 cache and limited bandwidth to off-chip memory]
Multiprocessor scheduling
"Given a set J of jobs where job ji has length li and a number of processors mi,
what is the minimum possible time required to schedule all jobs in J on m
processors such that none overlap?"
Wikipedia
That is, design a schedule such that the response time of the last task (the makespan) is minimized
(Alternatively, given M processors and N tasks,
find a mapping from tasks to processors such that
all the tasks are schedulable)
The problem is NP-complete
It is also known as the load balancing problem
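Since the problem is NP-complete, it is typically attacked with heuristics. Below is a minimal sketch (an illustration, not an algorithm from the slides) of the classic greedy rule: assign the longest remaining job to the currently least-loaded processor.

import heapq

def lpt_schedule(job_lengths, m):
    # Greedy "longest processing time first" heuristic: each job, longest
    # first, goes to the least-loaded of the m processors.
    loads = [(0.0, p) for p in range(m)]          # (current load, processor id)
    heapq.heapify(loads)
    assignment = {}
    for job, length in sorted(enumerate(job_lengths), key=lambda x: -x[1]):
        load, p = heapq.heappop(loads)
        assignment[job] = p
        heapq.heappush(loads, (load + length, p))
    makespan = max(load for load, _ in loads)
    return makespan, assignment

print(lpt_schedule([4, 3, 3, 2, 2], m=2))         # makespan 8.0 here; the optimum is 7

As the example output shows, the heuristic does not always reach the optimal makespan.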
Multiprocessor scheduling
static and dynamic task assignment
Partitioned scheduling
Static task assignment
Each task may only execute on a fixed processor
No task migration
Semi-partitioned scheduling
Static task assignment
Each instance of a task (or part of it) is assigned to a fixed processor
A task instance, or a part of it, may migrate
Global scheduling
Dynamic task assignment
Any instance of any task may execute on any processor
Task migration
Multiprocessor Scheduling
Global Scheduling
Partitioned Scheduling
Partitioned Scheduling with Task Splitting
[Figure: three variants side by side; global scheduling has one shared waiting queue of new and ready tasks feeding cpu 1-3, partitioned scheduling has a separate queue per CPU, and partitioned scheduling with task splitting additionally splits one task across two CPUs]
Underlying causes
Bin-packing/NP-hard problems
Multiple resources, e.g. caches, bandwidth
The root of all evil in global scheduling (Liu, 1969):
The simple fact that a task can use only one processor even
when several processors are free at the same time adds a
surprising amount of difficulty to the scheduling of multiple
processors.
Dhall's effect: with RM, DM and EDF, some low-utilization task sets can be unschedulable regardless of how many processors are used (see the worked example after this list).
Hard-to-find critical instant: a critical instant does not
always occur when a task arrives at the same time as
all its higher-priority tasks.
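The following numeric check illustrates the classic construction behind Dhall's effect; the concrete values of m and eps are assumptions chosen for illustration: m light tasks with short periods combined with one heavy task whose period is only slightly longer.

eps, m = 0.01, 4
light = [(2 * eps, 1.0)] * m        # m tasks: C = 2*eps, T = 1 (higher RM priority)
heavy = (1.0, 1.0 + eps)            # one task: C = 1, T = 1 + eps (lowest RM priority)

# At time 0 every task releases an instance. Under global RM the m light
# tasks occupy all m processors until t = 2*eps, so the heavy task has
# only 1 + eps - 2*eps < 1 time units left before its deadline.
slack = heavy[1] - 2 * eps - heavy[0]
total_u = sum(c / t for c, t in light) + heavy[0] / heavy[1]
print(f"slack = {slack:.3f}, total utilization = {total_u:.2f} on {m} CPUs")

The deadline is missed (negative slack) even though total utilization is barely above one processor's worth, and adding more processors does not help.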
[Figure: example timelines on processors P1 and P2, showing a critical section and the resulting blocking]
[Figure (from Uppsala, RTAS 2010): bar chart of guaranteed utilization bounds (%) for partitioned, global, and task-splitting scheduling with fixed and dynamic priorities; the bounds shown range from 38% to 69.3%, citing OPODIS08, TPDS05, ECRTS03, RTSS04 and RTCSA06]
Multiprocessor Scheduling
Global Scheduling
[Figure: global scheduling keeps all new and ready tasks in one shared waiting queue, from which they are dispatched to cpu 1, cpu 2 or cpu 3]
Global scheduling
All ready tasks are kept in a global queue
When selected for execution, a task can be dispatched to any processor, even after having been preempted (a minimal dispatcher sketch follows after this slide)
Disadvantages:
Few results from single-processor scheduling can be used
No optimal algorithms are known, except under idealized assumptions (e.g. Pfair scheduling)
Poor resource utilization for hard timing constraints
No more than 50% resource utilization can be guaranteed for hard RT tasks
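To make the dispatching rule concrete, here is a minimal sketch of one global fixed-priority dispatching step, assuming a lower number means higher priority; it is an illustration only, not an algorithm from the slides.

import heapq

def dispatch(ready_queue, processors):
    # One step of global fixed-priority dispatching: the highest-priority
    # ready task runs on an idle processor, or preempts the lowest-priority
    # running task if that task has lower priority.
    # ready_queue: heap of (priority, name); processors: cpu id -> (priority, name) or None.
    while ready_queue:
        prio, name = heapq.heappop(ready_queue)
        idle = [cpu for cpu, running in processors.items() if running is None]
        if idle:
            processors[idle[0]] = (prio, name)
            continue
        victim = max(processors, key=lambda cpu: processors[cpu][0])
        if processors[victim][0] > prio:
            heapq.heappush(ready_queue, processors[victim])   # preempted task re-enters the queue
            processors[victim] = (prio, name)
        else:
            heapq.heappush(ready_queue, (prio, name))         # nothing to preempt
            break
    return processors

cpus = {1: None, 2: (3, "B"), 3: (5, "C")}
queue = [(2, "A"), (4, "D")]
heapq.heapify(queue)
print(dispatch(queue, cpus))    # A runs on cpu 1, D preempts C on cpu 3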
Partition-Based Scheduling
Partitioned Scheduling
[Figure: partitioned scheduling with a separate local queue for each of cpu 1, cpu 2 and cpu 3]
Partitioned scheduling
Two steps:
Determine a mapping of tasks to processors
Perform run-time scheduling
Bin-packing algorithms
The problem concerns packing objects of varying sizes into boxes (bins), with the objective of minimizing the number of bins used.
Heuristic solutions: Next Fit and First Fit (a First Fit sketch follows below)
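A minimal First Fit sketch (an illustration assuming unit-capacity bins, not code from the slides): each object is placed in the first bin it fits into, and a new bin is opened only when no existing bin has room.

def first_fit(sizes, capacity=1.0):
    # First Fit heuristic: put each object into the first bin with room,
    # opening a new bin when none fits. Returns the list of bins.
    bins = []
    for size in sizes:
        for b in bins:
            if sum(b) + size <= capacity:
                b.append(size)
                break
        else:
            bins.append([size])
    return bins

print(first_fit([0.5, 0.7, 0.3, 0.4, 0.1]))   # [[0.5, 0.3, 0.1], [0.7], [0.4]]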
Rate-Monotonic-First-Fit (RMFF):
[Dhall and Liu, 1978]
First, sort the tasks in the order of increasing periods.
Task Assignment
All tasks are assigned in the First Fit manner starting from
the task with highest priority
A task can be assigned to a processor if all the tasks assigned to that processor remain RM-schedulable, i.e. the total utilization of the tasks assigned to the processor is bounded by n(2^(1/n) - 1), where n is the number of tasks assigned.
(One may also use the Precise test to get a better assignment!)
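Putting the pieces together, the following is a rough sketch of RMFF as described above (details such as tie-breaking and failure handling are assumptions); tasks are (WCET, period) pairs and the Liu and Layland bound n(2^(1/n) - 1) is used as the admission test.

def rmff(tasks, num_cpus):
    # Rate-Monotonic-First-Fit sketch: sort tasks by increasing period
    # (RM priority order) and place each one on the first processor where
    # the Liu & Layland utilization bound still holds.
    cpus = [[] for _ in range(num_cpus)]
    for wcet, period in sorted(tasks, key=lambda t: t[1]):
        for assigned in cpus:
            n = len(assigned) + 1
            utilization = sum(c / t for c, t in assigned) + wcet / period
            if utilization <= n * (2 ** (1 / n) - 1):
                assigned.append((wcet, period))
                break
        else:
            return None        # no processor can accept the task under this test
    return cpus

print(rmff([(1, 4), (2, 5), (2, 10), (4, 20)], num_cpus=2))
# [[(1, 4), (2, 5)], [(2, 10), (4, 20)]]

Replacing the utilization bound with the precise (exact) test mentioned above would typically pack more tasks per processor.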
Partitioned scheduling
Advantages:
Most techniques for single-processor scheduling
are also applicable here
Disadvantages:
Cannot exploit/share all unused processor time
May have very low guaranteed utilization, bounded by 50% (see the example below)
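The 50% bound can be seen from a standard construction (a well-known illustration; the specific numbers below are assumptions): with m processors and m + 1 tasks each of utilization just above 1/2, no two tasks fit on one processor, so one task is always left over.

m, eps = 4, 0.01
tasks = [0.5 + eps] * (m + 1)     # m + 1 tasks, each needing just over half a processor
print(sum(tasks) / m)             # ~0.64 of the platform here; approaches 0.5 as m grows
# No two tasks fit together (0.51 + 0.51 > 1), so only m of the m + 1
# tasks can be partitioned: the set is unschedulable even though it uses
# only slightly more than half of the total processing capacity.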