Department of Computer Science and Engineering Subject Name: Advanced Computer Architecture Code: Cs2354
PART-A

1. Give a few essential features of RISC architecture.
RISC-based machines focused designers' attention on two critical performance techniques: the exploitation of instruction-level parallelism (initially through pipelining and later through multiple instruction issue) and the use of caches (initially in simple forms and later using more sophisticated organizations and optimizations). RISC-based computers raised the performance bar, forcing prior architectures to keep up or disappear. (or / both)
RISC architectures are characterized by a few key properties, which dramatically simplify their implementation:
- All operations on data apply to data in registers and typically change the entire register (32 or 64 bits per register).
- The only operations that affect memory are loads and stores, which move data from memory to a register or from a register to memory, respectively. Loads and stores that transfer less than a full register (e.g., a byte, 16 bits, or 32 bits) are often available.
- The instruction formats are few in number, with all instructions typically being one size.
These simple properties lead to dramatic simplifications in the implementation of pipelining, which is why these instruction sets were designed this way. (ref: text book, 4th edn., Appendix A, page A-4)

2. Power-sensitive designs will avoid fixed-field decoding. Why?
In a RISC architecture, register specifiers sit at fixed locations in the instruction, so the registers can be read in parallel with decoding; this technique is known as fixed-field decoding. With this method we may read a register that the instruction does not actually use. This does not help performance, but it does not hurt it either. In a power-sensitive design, however, it wastes energy on the unnecessary register read. (ref: text book, 4th edn., Appendix A, page A-6)
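The fixed-field decoding idea in Q2 can be sketched in a few lines. The bit positions below assume the 32-bit MIPS R-type layout; the function name and register-file representation are invented for illustration. The key point is that both source registers are read before the opcode is known to need them:

```python
# Toy illustration of fixed-field decoding; field positions follow the
# 32-bit MIPS R-type format (opcode/rs/rt/rd), names are assumptions.
REGS = [0] * 32  # toy register file

def decode_fixed_field(instr_word):
    """Extract register specifiers from fixed bit positions and read
    both source registers in parallel with decode -- even when the
    opcode turns out not to need them."""
    opcode = (instr_word >> 26) & 0x3F
    rs = (instr_word >> 21) & 0x1F
    rt = (instr_word >> 16) & 0x1F
    rd = (instr_word >> 11) & 0x1F
    # Registers are read unconditionally, before the opcode has been
    # examined: harmless for performance, wasteful for power.
    rs_val, rt_val = REGS[rs], REGS[rt]
    return opcode, rs, rt, rd, rs_val, rt_val
```

Reading the register file on every decode, used or not, is exactly the energy cost a power-sensitive design tries to avoid.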
Powered By www.technoscriptz.com
4. Give an example of the result forwarding technique used to minimize data hazard stalls. Is forwarding a software technique?
No, forwarding (also called bypassing) is a hardware technique: the ALU result is fed back from the pipeline registers directly to the ALU inputs, so a dependent instruction need not wait for the register write. Example (the classic pair from the text book):
DADD R1, R2, R3
DSUB R4, R1, R5
Without forwarding, DSUB would stall until DADD writes R1 in its WB stage; with forwarding, DADD's EX result is bypassed straight to DSUB's EX stage and no stall is needed.
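A toy model of how forwarding removes data hazard stalls. The cycle counts assume the classic 5-stage MIPS pipeline (IF ID EX MEM WB) with a split-cycle register file; the function and tuple layout are invented for illustration:

```python
# Toy stall calculator for a classic 5-stage pipeline (IF ID EX MEM WB).
# Instructions are (kind, dest, sources) tuples; the layout is an assumption.
def stalls_between(producer, consumer, distance, forwarding):
    """Stall cycles the consumer needs when it reads a register the
    producer writes, with the two instructions `distance` apart."""
    kind, dest, _ = producer
    _, _, srcs = consumer
    if dest not in srcs:
        return 0          # no RAW hazard, no stall
    if forwarding:
        # ALU results forward EX->EX (a gap of 1 suffices); a load's
        # value exists only after MEM, so a load-use pair needs gap 2.
        needed_gap = 2 if kind == "load" else 1
    else:
        # Must wait for WB; with the register file written in the first
        # half and read in the second half of a cycle, a gap of 3 works.
        needed_gap = 3
    return max(0, needed_gap - distance)

dadd = ("alu", "R1", ("R2", "R3"))   # DADD R1,R2,R3
dsub = ("alu", "R4", ("R1", "R5"))   # DSUB R4,R1,R5 (reads R1)
```

With forwarding the back-to-back ALU pair costs no stalls; without it, the same pair stalls two cycles, and only a load-use pair still needs one stall even with forwarding.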
5. Give a sequence of code that has true dependence, anti-dependence and control dependence in it.
True dependence: instructions 1, 2 (R0); antidependence: instructions 3, 4 (R1); output dependence: instructions 2, 3 (F4) and instructions 4, 5 (R1).

6. What is the flaw in the 1-bit branch prediction scheme?
Even a branch that is almost always taken is mispredicted twice per loop execution: once on the exit iteration, when the branch falls through and flips the prediction bit, and once more on the first iteration of the next execution, when the flipped bit still predicts not taken. The prediction accuracy can therefore be noticeably worse than the branch's taken frequency.
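The flaw in the 1-bit scheme (Q6) can be demonstrated in a few lines. The 90%-taken loop branch below is a made-up example:

```python
# 1-bit predictor: remember only the last outcome and predict it again.
def count_mispredictions(outcomes, last_taken=True):
    misses = 0
    for taken in outcomes:
        if taken != last_taken:   # prediction = most recent outcome
            misses += 1
        last_taken = taken        # the single bit flips on every miss
    return misses

# A loop branch taken 9 times, then not taken, per loop execution.
execution = [True] * 9 + [False]
# In steady state the 1-bit scheme misses twice per execution (the exit
# iteration plus the first iteration of the next execution): a 20% miss
# rate on a branch that is 90% taken.
```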
8. What is trace scheduling? Which type of processors use this technique?
Trace scheduling is useful for processors with a large number of issues per clock, where conditional or predicated execution is inappropriate or unsupported, and where simple loop unrolling may not be sufficient by itself to uncover enough ILP to keep the processor busy. Trace scheduling is a way to organize the global code motion process so as to simplify the code scheduling, by incurring the costs of possible code motion on the less frequent paths.

There are two steps to trace scheduling. The first step, called trace selection, tries to find a likely sequence of basic blocks whose operations will be put together into a smaller number of instructions; this sequence is called a trace. Loop unrolling is used to generate long traces, since loop branches are taken with high probability. Once a trace is selected, the second step, called trace compaction, tries to squeeze the trace into a small number of wide instructions. Trace compaction is code scheduling; hence, it attempts to move operations as early as it can in a sequence (trace), packing the operations into as few wide instructions (or issue packets) as possible.

Trace scheduling is used in VLIW processors to exploit ILP.
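The trace selection step can be illustrated on a toy control-flow graph. The block names and branch probabilities below are invented; the idea is simply to follow the most probable successor greedily, producing the block sequence that trace compaction will then schedule as a unit:

```python
# Toy trace selection: each basic block maps to a list of
# (successor, probability) pairs; the CFG itself is a made-up example.
def select_trace(cfg, entry):
    """Follow the most likely successor from `entry`, stopping at an
    exit block or when a block repeats (i.e., at a loop back edge)."""
    trace, block = [], entry
    while block is not None and block not in trace:
        trace.append(block)
        succs = cfg.get(block, [])
        block = max(succs, key=lambda s: s[1])[0] if succs else None
    return trace

cfg = {
    "A": [("B", 0.9), ("C", 0.1)],  # the A->B path dominates
    "B": [("D", 1.0)],
    "C": [("D", 1.0)],
    "D": [],
}
```

Here the selected trace is A, B, D; code motion hoisted across the A-to-B branch pays its compensation cost only on the rare path through C.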
9. List some of the advanced techniques for instruction delivery and speculation.
1. Multiple issue (use of a multiple-issue processor)
2. Register renaming
3. Reorder buffer (ROB)
4. Speculation techniques
5. Value prediction
10. Mention a few limits on instruction-level parallelism.
1. Limitations on the Window Size and Maximum Issue Count
2. Realistic Branch and Jump Prediction
3. The Effects of Finite Registers
4. The Effects of Imperfect Alias Analysis
PART-B

Explain how Scheduling and Structuring Code for Parallelism is done in VLIW / EPIC processors. (8 marks)
1. Discuss the static and dynamic branch prediction techniques with suitable examples and diagrams. (16 marks)
Section 2.3 in the text book. Should explain the following: Static Branch Prediction; Dynamic Branch Prediction (2-bit prediction) and Branch-Prediction Buffers; Correlating Branch Predictors; and Tournament Predictors.
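The 2-bit scheme at the heart of a branch-prediction buffer entry can be sketched as a single saturating counter; a real buffer indexes many such counters by the low-order bits of the branch address. The 90%-taken loop branch below is a made-up example:

```python
# One 2-bit saturating counter (states 0-3; predict taken when >= 2).
# A real branch-prediction buffer holds many such counters, indexed by
# the low-order bits of the branch address; this models a single entry.
def count_mispredictions_2bit(outcomes, counter=3):
    misses = 0
    for taken in outcomes:
        if (counter >= 2) != taken:
            misses += 1
        # Saturating update: toward 3 on taken, toward 0 on not taken.
        counter = min(3, counter + 1) if taken else max(0, counter - 1)
    return misses

# Loop branch taken 9 times then not taken, per execution: the counter
# only dips from 3 to 2 on the exit iteration, so the next execution's
# first iteration is still predicted taken -- one miss per execution
# instead of the two a 1-bit scheme suffers.
execution = [True] * 9 + [False]
```

Requiring two consecutive mispredictions before the prediction changes is exactly what fixes the 1-bit scheme's double-miss-per-loop flaw.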
Or
Explain Dynamic Scheduling Using Tomasulo's Approach. (16 marks)
Section 2.4 in the text book. Should explain the following: the basic structure of a MIPS floating-point unit using Tomasulo's algorithm, with an example.

2. Discuss the essential features of the Intel IA-64 Architecture and the Itanium Processor. (16 marks)
Section G.6 of Appendix G. Should discuss the following: the Intel IA-64 Instruction Set Architecture (the IA-64 Register Model; Instruction Format and Support for Explicit Parallelism; Instruction Set Basics; Predication and Speculation Support) and the Itanium 2 Processor (Functional Units and Instruction Issue).

Or
Write short notes on:
a. Hardware versus Software Speculation (section 3.4, pages 169-171 in the text book) (6 marks)
To speculate extensively, we must be able to disambiguate memory references. This is difficult to do at compile time for integer programs that contain pointers. In a hardware-based scheme, dynamic run-time disambiguation of memory addresses is done using the techniques we saw earlier for Tomasulo's algorithm. This disambiguation allows us to move loads past stores at run time. Support for speculative memory references can help overcome the conservatism of the compiler, but unless such approaches are used carefully, the overhead of the recovery mechanisms may swamp the advantages.
Hardware-based speculation works better when control flow is unpredictable and when hardware-based branch prediction is superior to software-based branch prediction done at compile time. These properties hold for many integer programs. For example, a good static predictor has a misprediction rate of about 16% for four major integer SPEC92 programs, whereas a hardware predictor has a misprediction rate of under 10%. Because speculated instructions may slow down the computation when the prediction is incorrect, this difference is significant. One result of this difference is that even statically scheduled processors normally include dynamic branch predictors.

Hardware-based speculation maintains a completely precise exception model even for speculated instructions. Recent software-based approaches have added special support to allow this as well.

Hardware-based speculation does not require compensation or bookkeeping code, which is needed by ambitious software speculation mechanisms.

Compiler-based approaches may benefit from the ability to see further in the code sequence, resulting in better code scheduling than a purely hardware-driven approach.

Hardware-based speculation with dynamic scheduling does not require different code sequences to achieve good performance for different implementations of an architecture. Although this advantage is the hardest to quantify, it may be the most important in the long run. Interestingly, this was one of the motivations for the IBM 360/91. On the other hand, more recent explicitly parallel architectures, such as IA-64, have added flexibility that reduces the hardware dependence inherent in a code sequence.

The major disadvantage of supporting speculation in hardware is the complexity and additional hardware resources required. This hardware cost must be evaluated against both the complexity of a compiler for a software-based approach and the amount and usefulness of the simplifications in a processor that relies on such a compiler.
b. ILP Support to Exploit Thread-Level Parallelism (section 3.5, pages 172-179 in the text book) (10 marks) (out of syllabus!)