0% found this document useful (0 votes)

254 views24 pages

Reduced Instruction Set Computers (RISC) : William Stallings, Computer Organization and Architecture, 9 Edition

The document discusses Reduced Instruction Set Computers (RISC), which use a smaller set of basic instructions compared to Complex Instruction Set Computers (CISC). RISC architectures aim to have instructions with predictable costs and consistent performance by using techniques like large register files, compiler optimizations, and careful pipeline design. Key characteristics of RISC machines include optimizing for common operations in high-level languages, large register files accessed quickly, and pipelines designed for predictable instruction costs.

Uploaded by

Anh Nguyễn

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

254 views24 pages

Reduced Instruction Set Computers (RISC) : William Stallings, Computer Organization and Architecture, 9 Edition

Uploaded by

Anh Nguyễn

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 24

+

Reduced Instruction
Chapter 15 Set Computers
(RISC)
William Stallings, Computer Organization and Architecture, 9th Edition
Introduction
Two trends in CPU architecture:
 CISC: Complex Instruction Set Computing/Computer such as IBM
System/360, PDP-11, Motorola 6809, 68000, Intel 8080, x86,… CPU
is set up to execute many instructions.

 RISC: Reduced Instruction Set Computing/Computer: Idea: All

complex instruction is association of some basic instructions. So, a
smaller set of basic instructions is needed. Examples:
Sun UltraSPARC microprocessor

 More details:

http://en.wikipedia.org/wiki/Complex_instruction_set_computing

http://en.wikipedia.org/wiki/Reduced_instruction_set_computing
Comparing several RISC and
Non-RISC Systems

Scalar processor: CPU processes one datum at a time

Vector processor: CPU processes multiple data items at a time
Superscalar processor: Architecture implements a form of parallelism called
instruction-level parallelism within a single processor.
(Wiki)
Objectives
After studying this chapter, you should be able to:
 Provide an overview research results on instruction execution
characteristics that motivated the development of the RISC approach.

 Summarize the key characteristics of RISC machines.

 Understand the design and performance implications of using a large

 Discuss the implication of a RISC architecture for pipeline design and

performance.

 List and explain key approaches to pipeline optimization on a RISC

machine.
Contents

 15.1- Instruction Execution Characteristics

 15.2- The Use of a Large Register File
 15.3- Compiler-Based Register Optimization
 15.4- Reduced Instruction Set Architecture
 15.5- RISC Pipelining
 15.8- RISC Versus CISC Controversy
15.1- Instruction High-level languages (HLLs)
Allow the programmer to express
algorithms more concisely

Execution
Allow the compiler to take care of
details that are not important in the
programmer’s expression of algorithms Requirements
Often support naturally the use of

Characteristics structured programming and/or object-

oriented design

Semantic gap
Execution The difference
sequencing between the
Determines the operations provided
control and pipeline in HLLs and those
organization provided in computer
architecture

Operands used
The types of operands Operations performed
and the frequency of their
Determine the functions
use determine the
to be performed by the
memory organization for
processor and its
storing them and the Responses from architecture
addressing modes for
interaction with memory
accessing them
Operations and Operands are used:
Statistic

Statement
Procedure Call:
Arguments and Local Scalar Variables

Statistic

Scalar variable: Simple variable storing only one value

+
Implications
 HLLs can best be supported by optimizing performance
of the most time-consuming features of typical HLL
programs
 Three elements characterize RISC architectures:
 Use a large number of registers or use a compiler to optimize
register usage
 Careful attention needs to be paid to the design of instruction
pipelines
 Instructions should have predictable costs and be consistent with a
high-performance implementation
+
15.2- The Use of a Large Register File
Registers are accessed faster than cache or memory
 More registers are used

Software Solution Hardware Solution

 Requires compiler to  More registers

allocate registers
 Thus more variables will
 Allocatesbased on most be in registers
used variables in a given
time
 Requiressophisticated
(complex) program
analysis
+ Overlapping Register Windows
The use of register
Register windows is a group windows is a technique to
of registers which are used to improve the performance of a
pass arguments between particularly common
procedure calls. operation, the procedure call.
This was one of the main
design features of the
original Berkeley
RISC design, which would
later be commercialized as
the SPARC, AMD Am29000,
and Intel i960 (Wiki).
Circular Buffer
Organization
of Overlapped
Windows

A called B; B called C; C called D

The procedure D is active
Overlapped registers are used between callings.
A curcular chain of register references is created

If the procedure F makes

preparation to call another
procedure, registers of A are
conflicted and an interrupt
must be thrown  N
windows permits N-1 calls
only
+
Global Variables

 Variables declared as global in an HLL can be assigned memory

locations by the compiler and all machine instructions that reference
these variables will use memory reference operands
 However, for frequently accessed global variables this scheme is inefficient

 Alternative is to incorporate a set of global registers in the processor

 These registers would be fixed in number and available to all procedures
 A unified numbering scheme can be used to simplify the instruction format

 There is an increased hardware burden (gánh nặng) to accommodate

(supply) the split in register addressing

 In addition, the linker (a part of compiler) must decide which global

variables should be assigned to registers
Large-Register-File vs. Cache
+
Referencing a
Very fast
Scalar

slower
12.5- RISC Pipelining
Instruction pipelining is often used to enhance performance.
Most instructions in RISC are register to register.

Instruction cycle: two stages:

• I: Instruction fetch.
• E: Execute, ALU operation, Input and output are registers.

Load and store operations, three stages:

I: Instruction fetch.
E: Execute. Calculates memory address.
D: (direction) Memory. Register-to-memory or memory-to-
register operation.
The Effects of Pipelining: An Example

NOOP: No operation  Wait

+ Optimization of Pipelining
 Delayed branch
 Does not take effect until after execution of following instruction
 This location immediately following the branch is the delay slot  Insert the
instruction NOOP

 Delayed Load
 Register to be target is locked by processor
 Continue execution of instruction stream until register required
 Idle until load is complete
 Re-arranging instructions can allow useful work while loading

 Loop Unrolling (mở rộng vòng lặp)

 Replicate body of loop a number of times
 Iterate loop fewer times
 Reduces loop overhead
 Increases instruction parallelism
 Improved register, data cache, or TLB locality
Table 15.8: Normal and Delayed Branch
Target of JUMP is delayed  ADD is executed
before STORE

After 102 is To regularize the Increased

executed, the pipeline, a performance is
next NOOP is achieved only if
instruction to inserted after the instructions at
be executed this branch 101 and 102 are
is 105 (previous slide) interchanged.
+
Use of
the
Delayed
Branch

Program in the
table 15.6
Loop Unrolling Twice Example

Compiler technique to
improve instruction
parallelism is loop
unrolling .
Unrolling can improve
the performance by:
Reducing loop overhead,
increasing instruction
parallelism by improving
pipeline performance,
improving register, data
cache, or TLB locality

Number of loops
decreases 2 times
+15.8-RISC versus CISC Controversy
 Quantitative – So sánh định lượng
 Compare program sizes and execution speeds of programs on RISC and
CISC machines that use comparable technology

 Qualitative – so sánh chất lượng

 Examine issues of high level language support and use of VLSI real estate
(very large scale integration chip)

 Problems with comparisons:

 No pair of RISC and CISC machines that are comparable in life-cycle cost,
level of technology, gate complexity, sophistication of compiler, operating
system support, etc.
 No definitive set of test programs exists
 Difficult to separate hardware effects from complier effects
 Most comparisons done on “toy” rather than commercial products
 Most commercial devices advertised as RISC possess a mixture of RISC
and CISC characteristics

Chưa biết mèo nào cắn mèo nào!

Controversy: tranh luận The battle has no end!
+
Exercises
 15.1 What are some typical distinguishing characteristics of RISC
organization?

 15.2 Briefly explain the two basic approaches used to minimize

 15.3 If a circular register buffer is used to handle local variables for

nested procedures, describe two approaches for handling global
variables.

 15.4 What are some typical characteristics of a RISC instruction set

architecture?

 15.5 What is a delayed branch?

+ Summary Reduced Instruction Set
Computers
(RISC)
Chapter 15
 Instructionexecution  RISC pipelining
characteristics  Pipelining with regular
 Operations instructions
 Operands  Optimization of pipelining
 Procedure calls
 Implications
 Compiler-based register
optimization
 Theuse of a large register file
 Register windows
 RISC versus CISC controversy
 Global variables
 Large register file versus cache

Past Paper 10015 21 - Q
No ratings yet
Past Paper 10015 21 - Q
20 pages
Introduction To Common Lisp
No ratings yet
Introduction To Common Lisp
32 pages
William Stallings Computer Organization and Architecture 9 Edition
No ratings yet
William Stallings Computer Organization and Architecture 9 Edition
40 pages
Instruction-Level Parallelism and Superscalar Processors
No ratings yet
Instruction-Level Parallelism and Superscalar Processors
22 pages
Pps Previous Papers
No ratings yet
Pps Previous Papers
7 pages
Lecture 8 Mealy and Moore Machines
No ratings yet
Lecture 8 Mealy and Moore Machines
13 pages
Chapter 1 Digital Systems and Binary Numbers
No ratings yet
Chapter 1 Digital Systems and Binary Numbers
63 pages
Chapter 7: Deadlocks: Silberschatz, Galvin and Gagne ©2013 Operating System Concepts - 9 Edition
No ratings yet
Chapter 7: Deadlocks: Silberschatz, Galvin and Gagne ©2013 Operating System Concepts - 9 Edition
33 pages
Elmasri 6e - ISM 15
No ratings yet
Elmasri 6e - ISM 15
11 pages
Model Answers - HW1 PDF
No ratings yet
Model Answers - HW1 PDF
6 pages
Notes Prepared For: Mohammed Waseem Raza
No ratings yet
Notes Prepared For: Mohammed Waseem Raza
130 pages
Advanced Operating System (HW3) Department: IITA Student ID: BIA110004 Name: 哈瓦尼
No ratings yet
Advanced Operating System (HW3) Department: IITA Student ID: BIA110004 Name: 哈瓦尼
5 pages
Chapter 6: Synchronization Tools: Silberschatz, Galvin and Gagne ©2018 Operating System Concepts - 10 Edition
No ratings yet
Chapter 6: Synchronization Tools: Silberschatz, Galvin and Gagne ©2018 Operating System Concepts - 10 Edition
61 pages
ASM1 Programing
No ratings yet
ASM1 Programing
14 pages
Basic TOC 2022 - Solution of CFL - TM & Backside Test
No ratings yet
Basic TOC 2022 - Solution of CFL - TM & Backside Test
163 pages
Operating Systems: Internals and Design Principles: Uniprocessor Scheduling
No ratings yet
Operating Systems: Internals and Design Principles: Uniprocessor Scheduling
46 pages
Computer Organization
No ratings yet
Computer Organization
77 pages
Divide and Conquer Strategy
No ratings yet
Divide and Conquer Strategy
33 pages
Advanced Computer Architecture 2
No ratings yet
Advanced Computer Architecture 2
17 pages
Computer Architecture
100% (2)
Computer Architecture
46 pages
Moore and Mealy Machines: By: Engr - Syed Atir Iftikhar
No ratings yet
Moore and Mealy Machines: By: Engr - Syed Atir Iftikhar
21 pages
Operating System Exercises - Chapter 11-Sol
No ratings yet
Operating System Exercises - Chapter 11-Sol
4 pages
Chapter 05
No ratings yet
Chapter 05
19 pages
Chapter 3 Boolean Anlgebra and Logi Gates
No ratings yet
Chapter 3 Boolean Anlgebra and Logi Gates
59 pages
Advance Computer Archtecture CS501
100% (1)
Advance Computer Archtecture CS501
442 pages
Deadlock Assignment
No ratings yet
Deadlock Assignment
6 pages
Os Lab Manual - 0 PDF
No ratings yet
Os Lab Manual - 0 PDF
56 pages
Digital Logic Design Lab 7
No ratings yet
Digital Logic Design Lab 7
8 pages
Vietnamese Sign Language Detection Using Mediapipe
No ratings yet
Vietnamese Sign Language Detection Using Mediapipe
4 pages
Toc Unit-1 Part-2
No ratings yet
Toc Unit-1 Part-2
23 pages
Quiz Cea201 at Fptu
No ratings yet
Quiz Cea201 at Fptu
32 pages
Microprocessor and Interfacing Techniques: (Course Code: CET208A) Credits-3
No ratings yet
Microprocessor and Interfacing Techniques: (Course Code: CET208A) Credits-3
147 pages
Computer Organization July 2005 Old
No ratings yet
Computer Organization July 2005 Old
2 pages
2016 Complete Symbolic Simulation of SystemC Models Efficient Formal Verification of Finite Non-Terminating Programs
No ratings yet
2016 Complete Symbolic Simulation of SystemC Models Efficient Formal Verification of Finite Non-Terminating Programs
172 pages
Address Decoder For PC
No ratings yet
Address Decoder For PC
19 pages
Tcs Paper Soln
No ratings yet
Tcs Paper Soln
37 pages
C Programming Question Bank Answers
No ratings yet
C Programming Question Bank Answers
14 pages
Cs433 Fa12 Hw4 Sol Correct
No ratings yet
Cs433 Fa12 Hw4 Sol Correct
14 pages
Raptor Tasks
No ratings yet
Raptor Tasks
22 pages
CPDS Lab Manual
No ratings yet
CPDS Lab Manual
107 pages
Vending Machine - Computer Architecture
No ratings yet
Vending Machine - Computer Architecture
11 pages
Exercises 02
No ratings yet
Exercises 02
6 pages
Computer Networks Questions Solution
No ratings yet
Computer Networks Questions Solution
169 pages
Computer Science Department: Majlis Arts and Science College, Puramannur
No ratings yet
Computer Science Department: Majlis Arts and Science College, Puramannur
20 pages
Computer Peripherals & Interfacing
No ratings yet
Computer Peripherals & Interfacing
128 pages
Computer Networking: A Top Down Approach: A Note On The Use of These PPT Slides
No ratings yet
Computer Networking: A Top Down Approach: A Note On The Use of These PPT Slides
75 pages
HighPerformanceComputing DS
No ratings yet
HighPerformanceComputing DS
2 pages
DDCA Ch4 VHDL
No ratings yet
DDCA Ch4 VHDL
35 pages
Lab 9 Report
0% (1)
Lab 9 Report
11 pages
Chapter 4 (Processors and Memory Hierarchy)
100% (1)
Chapter 4 (Processors and Memory Hierarchy)
17 pages
Central Processing Unit: 6-2 General Register Organization
No ratings yet
Central Processing Unit: 6-2 General Register Organization
6 pages
Cap2 - Digital Systems - Numeric Systems and Codes
No ratings yet
Cap2 - Digital Systems - Numeric Systems and Codes
34 pages
Design Analysis and Algorithm
100% (1)
Design Analysis and Algorithm
78 pages
2 S Complement
No ratings yet
2 S Complement
4 pages
Handout 4: Iii. Turing Machines
No ratings yet
Handout 4: Iii. Turing Machines
12 pages
PIAIC AIoT Q1 Syllabus Final
No ratings yet
PIAIC AIoT Q1 Syllabus Final
3 pages
BCA (GEN) - 1st - SEM - Syllabus 2024 - 27 Batch
No ratings yet
BCA (GEN) - 1st - SEM - Syllabus 2024 - 27 Batch
16 pages
Chapter 2:instructions: Language of The Computer
No ratings yet
Chapter 2:instructions: Language of The Computer
81 pages
Slot26 CH15 ReduceInstructionSetComputers 24 Slides
No ratings yet
Slot26 CH15 ReduceInstructionSetComputers 24 Slides
24 pages
66ff99f56191750bb9ff1bc2 - COA9e - CH15 ReduceInstructionSetComputers 24 Slides
No ratings yet
66ff99f56191750bb9ff1bc2 - COA9e - CH15 ReduceInstructionSetComputers 24 Slides
24 pages
Emu 2.0 Documentation
No ratings yet
Emu 2.0 Documentation
3 pages
Design of 64-Bit Decode Stage For VLIW Processor Architecture
No ratings yet
Design of 64-Bit Decode Stage For VLIW Processor Architecture
3 pages
Nptel Cao Imp Questions
No ratings yet
Nptel Cao Imp Questions
58 pages
Fundamentals of Microcontroller and Its Application: Unit N0.1
No ratings yet
Fundamentals of Microcontroller and Its Application: Unit N0.1
16 pages
DTM Micro Project
No ratings yet
DTM Micro Project
27 pages
CN 320: Microprocessor and Microcontroller Systems: Lecture I-Introduction
No ratings yet
CN 320: Microprocessor and Microcontroller Systems: Lecture I-Introduction
35 pages
Risc & Sisc Characteristics
No ratings yet
Risc & Sisc Characteristics
9 pages
Berkeley RISC
No ratings yet
Berkeley RISC
6 pages
Unit-5 Model Questions
No ratings yet
Unit-5 Model Questions
6 pages
Lecture-14 (Co-Processor and Multi-Core Processor)
No ratings yet
Lecture-14 (Co-Processor and Multi-Core Processor)
24 pages
MPMC Notes 18.05.2024
No ratings yet
MPMC Notes 18.05.2024
124 pages
UM10120
No ratings yet
UM10120
297 pages
Difference Between RISC and CISC Architecture
No ratings yet
Difference Between RISC and CISC Architecture
4 pages
SAQA - 14917 - Learner Guide
No ratings yet
SAQA - 14917 - Learner Guide
30 pages
Hardware Architecture of Embedded Systems
100% (1)
Hardware Architecture of Embedded Systems
68 pages
Unit-I - : School of Electrical & Electronics Engineering Department of Electronics & Instrumentation
No ratings yet
Unit-I - : School of Electrical & Electronics Engineering Department of Electronics & Instrumentation
190 pages
U2 - ARM Processor
No ratings yet
U2 - ARM Processor
85 pages
MPMC Unit-3
No ratings yet
MPMC Unit-3
40 pages
The 5 Love Languages of Children by Gary Chapman and Ross Campbell
No ratings yet
The 5 Love Languages of Children by Gary Chapman and Ross Campbell
89 pages
C4 - Central Processing Unit
No ratings yet
C4 - Central Processing Unit
22 pages
Computer Organization and Architecture: Ashok Kumar Turuk
No ratings yet
Computer Organization and Architecture: Ashok Kumar Turuk
54 pages
Guess Paper - 2014 Class - Xi Subject - Computer Science: Other Educational Portals
No ratings yet
Guess Paper - 2014 Class - Xi Subject - Computer Science: Other Educational Portals
3 pages
Embedded 2
No ratings yet
Embedded 2
203 pages
Computer Science Paper 3
No ratings yet
Computer Science Paper 3
16 pages
Emerging Trends in Electrronics MCQs
No ratings yet
Emerging Trends in Electrronics MCQs
78 pages
Introduction To IBM Power Level 1 Quiz - Attempt Review
No ratings yet
Introduction To IBM Power Level 1 Quiz - Attempt Review
13 pages
Archimedes Operating System
100% (1)
Archimedes Operating System
320 pages
Embedded System Module 1
No ratings yet
Embedded System Module 1
57 pages
The ARM Is A 32
No ratings yet
The ARM Is A 32
32 pages
Unit 1 MPU Organization PDF
No ratings yet
Unit 1 MPU Organization PDF
96 pages

Reduced Instruction Set Computers (RISC) : William Stallings, Computer Organization and Architecture, 9 Edition

Uploaded by

Reduced Instruction Set Computers (RISC) : William Stallings, Computer Organization and Architecture, 9 Edition

Uploaded by

+

 RISC: Reduced Instruction Set Computing/Computer: Idea: All

Scalar processor: CPU processes one datum at a time

 Summarize the key characteristics of RISC machines.

 Understand the design and performance implications of using a large

 Discuss the implication of a RISC architecture for pipeline design and

 List and explain key approaches to pipeline optimization on a RISC

 15.1- Instruction Execution Characteristics

Characteristics structured programming and/or object-

Scalar variable: Simple variable storing only one value

Software Solution Hardware Solution

 Requires compiler to  More registers

A called B; B called C; C called D

If the procedure F makes

 Variables declared as global in an HLL can be assigned memory

 Alternative is to incorporate a set of global registers in the processor

 There is an increased hardware burden (gánh nặng) to accommodate

 In addition, the linker (a part of compiler) must decide which global

Instruction cycle: two stages:

Load and store operations, three stages:

NOOP: No operation  Wait

 Loop Unrolling (mở rộng vòng lặp)

After 102 is To regularize the Increased

 Qualitative – so sánh chất lượng

 Problems with comparisons:

Chưa biết mèo nào cắn mèo nào!

 15.2 Briefly explain the two basic approaches used to minimize

 15.3 If a circular register buffer is used to handle local variables for

 15.4 What are some typical characteristics of a RISC instruction set

 15.5 What is a delayed branch?

You might also like