I. Extending Project 2: Designs Over The Budget Will Get 0 Point

This document provides instructions for Project 3 in the EE557 Fall 2016 course. Students are tasked with iteratively redesigning a baseline processor's microarchitectural blocks within given transistor count and area budgets to maximize performance across four benchmarks. Acceptable blocks to modify include branch predictors, caches, queues, and functional units. Students must submit their final configuration file, area/transistor reports, and a project report detailing their design process and intermediate/final results. Performance and report quality will be graded against budgets and other student submissions.

Uploaded by

nikhilnarang

University of Southern California

Department of Electrical Engineering

EE557 Fall 2016
Instructor: Michel Dubois
Sections: 30630R and 30628D
Project #3, Due: 5 PM, Tuesday, November 29th
TOTAL SCORE: / 10

I. Extending Project 2
Project 3 builds on the experience you gained in Project 2 configuring architectural simulators. In this
project your goal is to redesign the baseline processor by changing several micro-architectural blocks,
such as branch predictors, register update units, etc., to improve the performance of the baseline
processor. You will iteratively search for the best design choice for each of the
micro-architectural blocks by exploring the design space using simulations. Again, this task can be
accomplished without modifying any code, simply (and intelligently) by changing
the simulation parameters in the configuration file, as you already did in Project 2.
Unless otherwise stated, every detail in Project 3 stays the same as in Project 2. In particular, the
simulator and benchmark locations, the baseline configuration, and all other project environments are
identical to Project 2.

II. Project Description
In this project you are given a MAXIMUM transistor and area budget. Your goal is to change any
combination of the micro-architectural blocks listed below to achieve the best performance for four
benchmark programs. We will measure the performance as:
$$\text{Performance} = \frac{\sum_{\text{benchmarks}} \#\ \text{of committed instructions}}{\sum_{\text{benchmarks}} (\#\ \text{of cycles} \times \text{clock cycle period})} \quad \text{(in MIPS)}$$

The four benchmark programs are bitcnt, equake, bzip2, and art, as described below in the Project Environment section.
For instance, if 1 million instructions are committed for each benchmark, the simulation cycle counts of
the four benchmarks are 1, 2, 3, and 4 million cycles, and the clock cycle time is 1 ns, then the performance is
computed as follows:
$$\frac{(1 + 1 + 1 + 1)\ \text{million instructions}}{(1 + 2 + 3 + 4)\ \text{million cycles} \times 1\ \text{ns}} = \frac{4\ \text{million instructions}}{10 \times 10^{-3}\ \text{seconds}} = 400\ \text{MIPS}$$
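The performance metric above can be sketched in a few lines of Python. This is only an illustrative helper, not part of the project tools: the function name aggregate_mips and its arguments are hypothetical, and the instruction and cycle counts are assumed to be copied by hand from the SimpleScalar result files.

```python
def aggregate_mips(instructions, cycles, clock_period_ns):
    """Aggregate MIPS over several benchmarks (hypothetical helper).

    instructions    -- committed instruction count per benchmark
    cycles          -- simulated cycle count per benchmark, same order
    clock_period_ns -- clock cycle period in nanoseconds
    """
    if len(instructions) != len(cycles):
        raise ValueError("need one cycle count per benchmark")
    total_instructions = sum(instructions)
    total_time_ns = sum(cycles) * clock_period_ns
    # 1 instruction per nanosecond equals 1000 million instructions per second.
    return total_instructions / total_time_ns * 1000.0


# The worked example from the text: 1 million instructions per benchmark,
# 1, 2, 3 and 4 million cycles, and a 1 ns clock period.
example_mips = aggregate_mips(
    [1_000_000] * 4,
    [1_000_000, 2_000_000, 3_000_000, 4_000_000],
    1.0,
)
print(f"{example_mips:.1f} MIPS")
```

Note that the metric sums instructions and execution times across all four benchmarks before dividing, so it is not the average of the four per-benchmark MIPS values.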

The transistor count budget, which includes every component, and the area budget are given below. Your design is
NOT allowed to exceed either of them. Both will be measured with the Real Estate Estimator tool.
Designs over the budget will receive 0 points.

Transistor count: 200 million
Area: 25 mm^2
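As a quick sanity check before each simulation run, the two limits can be encoded as below. This is a minimal sketch: the function name within_budget is hypothetical, and the totals are assumed to be read manually from the Real Estate Estimator spreadsheet.

```python
MAX_TRANSISTORS = 200_000_000  # transistor budget stated in the text
MAX_AREA_MM2 = 25.0            # area budget stated in the text, in mm^2


def within_budget(transistors, area_mm2):
    """Return True only if BOTH limits are respected (hypothetical helper).

    A design that sits exactly on a limit does not exceed it, so it passes.
    """
    return transistors <= MAX_TRANSISTORS and area_mm2 <= MAX_AREA_MM2
```

A design must pass both checks at once: staying under the area limit does not compensate for exceeding the transistor limit, or vice versa.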
You are allowed to change only the following micro-architectural blocks. For instance, you can increase
or decrease the sizes of the components, change the cache associativities, or change the cache
replacement policies.

Dynamic Branch Predictor [1]
Branch Target Buffer
Size of Return Address Stack
Machine Width (issue/decode/commit per cycle)
Instruction Fetch Queue Size
Register Update Unit Size [2,3] (must be equal to or larger than 32 entries)
Load/Store Queue Size
Number of Integer ALUs and Multiplier/Divider Units
Number of Floating-point ALUs and Multiplier/Divider Units
Number of Memory Ports
Caches (Size, Associativity, Replacement Algorithm, Block Size) [2,4]

[1] The perfect branch predictor is not allowed.
[2] Remember that when you change your RUU or cache structures, the number of read and write ports will
be affected. So, each time you change one of those, you need to check the estimator tool for any
change in the number of ports, and then use CACTI to compute access times and latencies.
[3] The RUU size must be equal to or larger than 32 entries. Any number under 32 is NOT allowed.
[4] The address space is assumed to be 42 bits, and the number of bits per tag ("Nr. of Bits per Tag" in
CACTI) should be calculated based on the cache size and structure.
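The tag width mentioned in the last note can be sketched with the usual set-associative address split (tag, set index, block offset). This is an illustration under assumptions: the function name tag_bits is hypothetical, sizes are assumed to be powers of two, and it is assumed that the CACTI field expects only the address tag, without status bits such as valid or dirty.

```python
def tag_bits(cache_bytes, block_bytes, associativity, address_bits=42):
    """Tag width for a set-associative cache (hypothetical helper).

    tag = address bits - set index bits - block offset bits
    """
    sets = cache_bytes // (block_bytes * associativity)
    for value in (block_bytes, sets):
        # Both the block size and the number of sets must be powers of two.
        if value < 1 or value & (value - 1):
            raise ValueError("block size and number of sets must be powers of two")
    offset_bits = block_bytes.bit_length() - 1  # log2(block size)
    index_bits = sets.bit_length() - 1          # log2(number of sets)
    return address_bits - index_bits - offset_bits


# Illustrative configuration only: 16KB, 2-way, 32-byte blocks.
# 256 sets -> 8 index bits, 5 offset bits, 42 - 8 - 5 = 29 tag bits.
print(tag_bits(16 * 1024, 32, 2))
```

Increasing the associativity at a fixed cache size reduces the number of sets, so the tag grows by one bit each time the associativity doubles.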

Please keep in mind that as you increase or decrease some of the sizes, your CPU clock period and the
access times of your memory structures will be affected. Obviously, accessing a 16KB L1 cache should be
much faster than accessing a 1MB L1 cache! So you should appropriately adjust the latency of any structure
that is affected. Again, use the CACTI tool to come up with latency estimates.
We will use CACTI, SimpleScalar, and the Real Estate Estimator, which we already used in Project 2.
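One common way to turn a CACTI access time into a latency for the configuration file is to round it up to a whole number of clock cycles. The sketch below assumes that convention; follow the procedure from Project 2 if it differs. The function name latency_in_cycles and the sample numbers are hypothetical.

```python
import math


def latency_in_cycles(access_time_ns, clock_period_ns):
    """Round an access time up to whole clock cycles (assumed convention)."""
    ratio = access_time_ns / clock_period_ns
    # The small epsilon keeps floating-point noise (for example
    # 11.000000000000002) from adding a spurious extra cycle.
    return max(1, math.ceil(ratio - 1e-9))


# Illustrative numbers only: a 1.3 ns access with a 0.5 ns clock takes 3 cycles.
print(latency_in_cycles(1.3, 0.5))
```

Because the result depends on the clock period, every latency in the configuration has to be recomputed whenever the clock period changes, not only the latency of the structure that was resized.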

Basic Project Steps
Here are the new steps for doing this project:
First, repeat steps 1-6 of Project 2.
1. In this step you will look at the result files generated by the SimpleScalar simulation tool and
decide which of the allowed micro-architectural blocks you want to change. Keep in mind that
you cannot exceed the area and transistor count limits specified above when you increase the
structure sizes. Also, make sure that the clock cycle time and latencies are appropriately adjusted to reflect the
new structure sizes. So be clever about which structure to change and by how much. Since the
SimpleScalar result file already contains various block access counts, cache misses, hits, etc., there is no need
to change the code.
2. Once you change one or more micro-architectural parameters, redo steps 1 through 6 of Project
#2, as necessary. Look at the new MIPS rating of your enhanced processor
configuration. Compare it with all prior configurations. Iterate these steps until you think you have the
world's best processor.
3. Finally, you will generate a report that shows how you iterated through the design space and why
you made those design choices. Support your choices with charts and compelling arguments.
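The iteration described in these steps is easier to report if every configuration you try is logged with its results. The following is a minimal sketch of such a log, assuming you enter the numbers by hand after each run; the class and function names are hypothetical, and the sample entries are placeholders, not measured results.

```python
from dataclasses import dataclass


@dataclass
class Iteration:
    label: str        # short description of what was changed
    mips: float       # aggregate MIPS from the simulations
    transistors: int  # total from the Real Estate Estimator
    area_mm2: float   # total from the Real Estate Estimator


def best_feasible(iterations, max_transistors=200_000_000, max_area_mm2=25.0):
    """Return the highest-MIPS iteration that respects both budgets, or None."""
    feasible = [
        it for it in iterations
        if it.transistors <= max_transistors and it.area_mm2 <= max_area_mm2
    ]
    return max(feasible, key=lambda it: it.mips) if feasible else None


# Placeholder numbers for illustration only.
log = [
    Iteration("baseline", 100.0, 150_000_000, 20.0),
    Iteration("larger L1", 120.0, 190_000_000, 24.0),
    Iteration("much larger L2", 150.0, 260_000_000, 31.0),
]
print(best_feasible(log).label)
```

Keeping the over-budget attempts in the log is deliberate: they document why a direction was abandoned, which is the kind of reasoning the report asks for.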

III. Project Environment
The project environment is the same as that of Project 2, except that you need to copy the additional
benchmarks and inputs from the class directory. Please copy the following benchmarks, with all other
necessary files, into your directory in addition to bzip2 and art, which were used in Project 2:

Executables                 Input Files    Commands
bitcnt (/ee557d/mibench)    --             bitcnts 1125000
equake (/ee557d/spec2k)     equake.in      equake < equake.in

For all benchmarks, we will limit our simulations to only 50 million instructions. We will fast-forward
through the first 300 million instructions. To do this, set the following parameters in your configuration files
or include them in your command line parameters.

IV. Project Submission
You must submit your final configuration file (FirstnameLastname_Proj3.conf), your Excel sheet from the
Real Estate Estimator tool (FirstnameLastname_Proj3.xls), and an electronic copy of your project report
(FirstnameLastname_Proj3.pdf) by the due date on the Den class page. The report must include the following:
1. Front page
a) Title: EE557 Fall 2016 Project #3 Report; b) Name: <your name>; c) your email address; d)
affiliation (optional), 1 pt.
2. Section 1. Design Process
Description and discussion of your design and iteration process: for example, what design
progress and iterations you made to approach your final design, what results you observed,
and how those observations affected your next design iteration - at least half a page, 2 pts.
3. Section 2. Intermediate Results
a) Intermediate average MIPS rates in a graph; b) RUU access times estimated from CACTI with
converted cycle times in a table; c) transistor count and area estimates from Real Estate Estimator in
two graphs; d) cache miss rates for all caches in a table for 3 intermediate iterations, 2 pts.
4. Section 3. Final Design
a) MIPS rate; b) cycle time; c) area from Real Estate Estimator; d) transistor count from Real Estate
Estimator; e) cache latencies; f) cache miss rates in a table, 1 pt.
Please keep all of your shell scripts and SimpleScalar config files, as the TA may ask you to
submit or run them.

V. Grading
Your final design will be evaluated based on the following criteria:
1. Report (6 pts.)
From the report pdf.
2. Performance (4 pts.)
This part is evaluated by ranking the overall MIPS of all students. 0 points will be given for a
mismatch between the reported MIPS and the MIPS obtained by running the submitted config file.

0 points will be given to designs over the transistor count or the area budget.

Like other assignments, this project must be done INDIVIDUALLY!
Similar designs will be scrutinized.
