0% found this document useful (0 votes)

131 views13 pages

Project

This document describes a final project for a course on designing a 32-bit pipelined CPU. Students will implement the CPU using different adder circuits - carry ripple adder, carry lookahead adder, carry skip adder, and carry select adder. Source code and testbenches are provided. Students will perform logic synthesis, post-synthesis simulation, and compare results to the RTL simulation to verify correctness before physical synthesis and implementation. The goal is to determine the maximum operating frequency after placement and routing.

Uploaded by

Kaushik Reddy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

131 views13 pages

Project

Uploaded by

Kaushik Reddy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

ECE429 Fall 2014 Prof.

Ken Choi

ECE 429 Final Project

Fall 2014
(Report Due Date: 12/08 (Monday) at 10:00 AM in US-Central time)
*Note: Please submit the source codes and the report to Blackboard

Case Study for 32-bit Pipelined CPU design with New ALU Architecture
Project Policy: This final project will be done individually.
Copying source codes or report will call for ECE disciplinary action strictly .

1. Introduction
This handout describes the final project for ECE 429. The objective of this project is to
understand a 32-bit Pipelined Central Processing Unit (CPU). As the name of the design
declares, the word length of the data used in the circuits is 32 bits. Furthermore, since this
circuit is pipelined, more than one instruction can be executed simultaneously. The operation of
the circuit is synchronized by an externally set clock signal. Also the instruction signals for
addressing the memory file, selecting the Arithmetic Logic Unit (ALU) operands and specifying
the operation of the ALU are also external. The correct synchronization of those signals with the
critical data path delay of the circuit that will determine the minimum operating period is one of
the objectives of the project.

2. Circuit Description
An overview of the primary building blocks and signals of the CPU is shown in Fig. 1. As shown
in Fig. 1, the primary building blocks are the memory file and the ALU. The external clock signal
synchronizes the capture and release of data within the memory file block. The circuit is
pipelined and each instruction is explained in two clock cycles. In the first clock cycle, the
two decoders are used to decode the external address selection signals used for specifying the
contents of the memory file that should be read at the memory ports in each clock cycle.
Additionally, multiplexer blocks are used to select the operands for the ALU. In the next clock
cycle, the ALU executes the specified operation. The ALU results can be read from the outside
of the CPU through a tri-state buffer, based upon the value of the externally specified OEN
(output enable) signal. Finally the ALU results can be written back in the memory file in the
word specified by the Address B.

Figure 1. Overview of the primary blocks and signals

1/13
ECE429 Fall 2014 Prof. Ken Choi

3. Memory File
The memory file of this design stores 32 32-bit words. There are two read ports in the
memory file and one write port. The words to be read in each clock cycle are specified by the
external 5-bit words address A and Address B. The internal configuration of the memory file is
illustrated below in Fig. 2.

As illustrated in Fig. 2, the primary storing element within the memory file is a D-register. The
output of each D-register is connected to the two output ports of the memory file through
tri-state buffers. The tri-state buffers are enabled by the decoded address (signals A and B) and
the contents of the D-registers appear at the output ports A and B respectively.

Furthermore, the value of address B specifies the word address in the memory file where the
results of the ALU computation are stored in the second cycle of instruction execution. The
writing of the ALU results within the memory file is synchronized by the clock signal.

Figure 2. Memory file configuration

4. Arithmetic Logic Unit (ALU)

The ALU of the circuit has two operands A and B and an implement the following eight
functions:
- A * B : multiplication
- A + B : addition
- A - B : subtraction
- B - A : subtraction
- A or B : logic OR function
- A and B : logic AND function
- A xor B : logic XOR function
- A xnor B : logic XNOR function

As illustrated in Fig. 1 the operand of the CPU can be selected among the following:
- Operand A: Operand A can be selected between the read port A of the memory file and the
externally defined data in. The selection is done by the external signal ASEL.
- Operand B: Operand B can be selected between the read port B of the memory file and the
logic zero value. The selection is done by the external signal BSEL.

2/13
ECE429 Fall 2014 Prof. Ken Choi

Internally the ALU has three primary operation blocks: the multiplier, the adder and the
logic function block. These blocks are illustrated in Fig 3. The multiplier can be implemented
as 32-by-32 array-based multiplier. The multiplier executes the multiplication function. Notice,
however, that the result of the multiplication operation is 64 bits. Therefore, in order to store
the multiplication result back to memory file we need two clock cycles. Therefore the
multiplication instruction is executed in 3 clock cycles. This is made possible by pipelining the
multiplier unit in order to produce the 16 least significant bits (LSB) of the result in once clock
cycle and the following 16 most significant bits (MSB) in the next cycle. Notice however, that
the instruction immediately following a multiplication operation should select the MSB of the
multiplier at the output of the ALU. It should also specify the storage address of the MSB bits in
the memory file.

Figure 3. Block diagram of the ALU circuit

The adder circuit within the ALU is a 32-bit adder/subtractor circuit. It executes the addition
and subtraction operations. The selection of the operation is done by the two externally defined
operation select signals OPSEL. The same signals are used for specifying the operation
executed within the logic function block of the ALU. The final output of the ALU is specified by
the output select OUTSEL signals that control the final 4-to-1 multiplexer within the ALU.

Finally, the ALU creates a control output signal that can be used externally of the CPU:
- Adder overflow - the signal is 1 if there is an adder overflow.

5. Synchronization
To better understand the circuit synchronization sequence described below, please refer to Fig 1.
The operation of the CPU is synchronized by the external clock signal.

An instruction to be executed by the CPU is determined by the external control signals. An

example of an instruction word is illustrated in Fig. 4. After the clock switches high the
instruction is applied (fetched) to the signals that control the CPU operation. Since each
instruction is executed in two steps, some of these control signals need to be stored at the
internal registers of the CPU. On the first step of the instruction, these signals will specify the
contents of the memory file that will be read from the read ports A and B. Also they will specify
the operands of the ALU. In the second cycle of the operation the control signals will determine
the operation to be executed internally the ALU, and the value of the ALU output.

3/13
ECE429 Fall 2014 Prof. Ken Choi

Figure 4. Instruction word contents

The results of the ALU will be available of the circuit if the OEN signal is set, and it will also be
written back in the memory file. The address in memory where the ALU result is written is
specified by the address B value. The data will be written that word at the next positive edge of
the clock signal.

The control signals are set after a positive edge of the clock signal and should not change
before the next positive edge. The period of the clock signal is determined by the longest path
of data within the circuit.

Case Study-1
32-bit CPU design with Different Adders - Carry Ripple Adder, Carry
Lookahead Adder, Carry Skip Adder, and Carry Select Adder
(Source codes are provided)
6. Introduction: Case Study-1
We provide the soruce verilog code and test bench for the cpu design with Carry Ripple Adder
(cpu_CRA.v), Carry Lookahead Adder (cpu_CLA.v), Carry Skip Adder (cpu_CSA.v),
Carry Select Adder (cpu_CSeA.v), and testbench verilog (tb_cpu.v) so that you can do
the logical synthesis and physical synthesis by using IIT-ECE429 ASIC flow. You can follow the
standard cell based flow to synthesize and layout the CPU design. Please refer to the tutorial IV
for detailed information which we already conducted in Lab. 9.

6.1 RTL Simulation

We must ensure that there is no bug in the design before synthesis. We verify the correctness
by running testing testbench. The whole cpu design in Verilog is provided, cpu_xxx.v, and a
testbench for verifying the CPU, tb_cpu.v, where we test functionality for store, read, addition,
and subtraction.

The testbench (tb_cpu.v) we provided tests the following instruction set:

[0] STORE -10

[1] STORE 10
[2] STORE 30
[20] STORE 50
[0][1] ADD
[1][2] ADD
[0][20] SUBTRACT
[1] READ
[2] READ
[20] READ

6.2 Logic Synthesis and Post-Synthesis Simulation

For logic synthesis with Synopsis DC, please set the initial desired clock frequency 30 MHZ (You
can figure out max frequency later in your report. What is the maximum frequency after P&R?)
Once finished, you should check the report files cell.rep and timing.rep for the area and timing
4/13
ECE429 Fall 2014 Prof. Ken Choi

of the design. Moreover, you will obtain the mapped circuit in cpu.vh. You should simulate it
with the Verilog models of the standard cells, i.e. osu05_stdcells.v, and compare the result with
the RTL simulation. The command is:

verilog osu05_stdcells.v tb_cpu.v cpu.vh

6.3 Place & Route and Post-P&R Simulation

Now, we are ready to run place and route using Cadence SOC Encounter. It wil take some time
for the tool to finish the automatic layout generation.

Once finished, you should check the final timing report in timing.rep.5.final in order to verify if
all the circuit timings are met. Moreover, you will obtain the circuit netlist in final.v, which
contains necessary buffers and inverters to overcome the interconnect delays in the signal
propagation network and the clock distribution network. You should simulate it with the Verilog
models of the standard cells, i.e. osu05_stdcells.v, and compare the result with the RTL
simulation and the post-synthesis simulation. The command is:

verilog osu05_stdcells.v tb_cpu.v final.v (with four adders design)

6.4 Explanation about the Different Adders used in Case Study-1

VLSI adders are critically important in digital designs since they are untilized in ALUs, memory
addressing, cryptography, and floating-point units. Since adders are often responsible for
setting the minimum clock cycle time in a processor, they can be critical to any improvements
seen at the VLSI level. In this project, we will examine four different adder architectures in
32-bit CPU design. We start from Ripple Carry Adder, which provides one of the simplest types
of carry-propagate adder designs, and then Carry Lookahead Adder, which is improved by
having the carries precomputed ahead of time. And, we move to Carry Skip Adder and Carry
Select Adder, both of which are attempt to obtain some of the improvements by trying to limit
the number of gates it has at the expense of some delay.

The four adder architectures that will be implemented in this project are listed below:

Carry Ripple Adder (Source code is provided, cpu_CRA.v)

Carry Lookahead Adder (Source code is provided, cpu_CLA.v)
Carry Skip Adder (Source code is provided, cpu_CSA.v)
Carry Select Adder (Source code is provided, cpu_CSeA.v)

Here are the brief description about the adder architectures.

Carry Ripple Adder

An n-bit CRA is formed by concatenating n FAs in cascade, with the cary output from one full
adder connected to the carry input of the next full adder. The carries are connected in a cahin
through the full adders as shown in Figure 5. (See module fa32 in the cpu_CRA.v)

Figure 5 Carry Ripple Adder

5/13
ECE429 Fall 2014 Prof. Ken Choi

Carry Lookahead Adder

The Carry Lookahead circuits are special logic circuit that can dramatically reduce the time to
perform addition at the price of more complex hardware. The carry Lookahead design can be
obtained by a transformation of the ripple carry design in which the carry logic over fixed
groups of bits of the adder is reduced to two-level logic. An example is shown for 4-bit adder
group in Figure 6. We can extend the 4-bit design to 32 bit (See module cla32 in the
cpu_CLA.v).

A=1011
B=0110
A+B=10001

Figure 6 4-bit Carry Lookahead Adder

where
C1 g 0 p0C0
C2 g1 p1 ( g 0 p0C0 ) g1 p1 g 0 p1 p0C0
C3 g 2 p2 ( g1 p1 ( g 0 p0C0 )) g 2 p2 g1 p2 p1 g 0 p2 p1 p0C0
*Note: pi Ai Bi (or Ai Bi ) and gi Ai Bi

Carry Skip Adder

In the Carry Skip Adder, the operands are divided into blocked of r bits blocks. Within each
block, a ripple carry adder can be utilized to produce the sum bits and a carryout bit for the
block. Carry Skip Adder is based on observation that carry process can skip stage for which xi
yi (that is pi = xiyi = 1). Each group generates Group Carry-Propagate=1 if all pi=1 in each
group. An example for 4-bit Carry Skip Adder strcuture is given in Figure 7, where
Co ,3 Ci ,0 when Block Propagate is equal to 1, i.e. pi=1 for all four bits.

Figure 7 4-bit Carry Skip Adder

6/13
ECE429 Fall 2014 Prof. Ken Choi

Carry Select Adder

The Carry Select Adder divides the operands to be added into r bit blocks similar to Carry Skip
Adder. For each block, two r-bit Ripple Carry Adders operates in parallel to form 2 sets of sum
bits and carry out signals. Each Ripple Carry Adder has two sets of hard-coded carry-in signals.
One Ripple Carry Adder has a carry-in of 0, whereas, the other has a carry-in of 1. An example
of 4-bit Carry Select Adder is shown in Figure 8.

Figure 8 4-bit Carry Select Adder

The upper half is implented by two independent 4-bit adders, one whose carry-in is hardwired
to 0, another whose carry-in is hardwired to 1. In parallel, these compute two alternative sums.
The carry-out from the previous 4-bit adder block controls multiplexers that select between the
two alternative sums. Following the same methodology, the two alternative carry-outs are
selected by carry-out from the previous block controling a multiplexer that selects the
appropriate carry-out for the next block. A structure of 16-bit Carry Select Adder blocks is
shown in Figure 9.

Figure 9 16-bit Carry Select Adder

7/13
ECE429 Fall 2014 Prof. Ken Choi

6.5 Report Submission for Case Study-1

For each cpu design with differrent adders:

1. Generate the display screenshot or the text output of the RTL simulation and the screenshot
from simvision with provided test bench (tb_cpu.v).
2. Synthesize the design and summarize cell.rep and timing.rep.
3. Provide the display screenshot or the text output of the post-synthesis simulation and the
screenshot from simvision.
4. Summarize timing.rep.5.final. What is the maximum clock frequency this circuit can run.
5. Provide the display screenshot or the text output of the post-P&R simulation and the
screenshot from simvision.
6. Generate a new test bech file (tb_test.v) for the following instruction set.

[0] STORE 5
[1] STORE AAAA_AAAA
[2] STORE 5555_5555
[3] STORE 0000_000A
[4] STORE 0000_0001
[5] STORE FFFF_FFFF
[6] STORE 0000_00C8
[7] STORE 0000_012C
[8] STORE 0000_0001
[9] STORE AAAA_AAAB
[10] STORE 5555_5555
[2][0] ADD
[1][2] ADD
[6][7] ADD
[0][3] ADD
[5][4] SUB
[5][8] ADD
[2][0] SUB
[9][10] ADD
[7] READ
[3] READ
[1] READ

7. Provide the display screenshot and the text output of the RTL simulation and the screenshot
from simvision for each cpu desgin (cpu_CRA.v, cpu_CLA.v, cpu_CSA.v, cpu_CSeA.v) with
the new generated test bench (tb_test.v).

8. Fill out the following performance comparison table after synthesis and analyze the results
(explain the reasons of your comparison results).

CRA CLA CSA CSeA

5555_5555 + 5
AAAA_AAAA + 5555_5555
Calculate the Path Delay for Each 0000_00C8 + 0000_012C
Operation (Post-Synthesis 5 + 0000_000A
Gate-Level Delay) FFFF_FFFF - 0000_0001
FFFF_FFFF + 0000_0001
5555_5555 - 5
AAAA_AAAB + 5555_5555

8/13
ECE429 Fall 2014 Prof. Ken Choi

Case Study-2
32-bit CPU design with New ALU Architecture
(Source codes are provided)
7. Introduction: Case Study-2: Comparator Design in the ALU for the 32-bit CPU
In this project, we will add a 32-bit comparator block into the ALU design.

The function of a 32-bit comparator in Verilog is shown in Table 1. Suppose we have two 32-bit
inputs (we assume them to be unsigned in this project) A and B. Since the result of comparing
them can be A > B , A = B and A < B. So two bits are needed to represent the comparison
result (two outputs f1 and f0). Note that when f1 = 1, it means two integers are equal.
Otherwise, f0 is used to determine the relation of A and B.

f1 f0
A>B 0 1
A<B 0 0
A=B 1 0

In this project, you are going to design the 32-bit comparator in a structural way. First of all,
we will explain the structure by using 4-bit comparator. Then we will give the structure view of
the 32-bit comparator, and you are supposed to finish the Verilog coding according to the
structure.

7.1 The structure of 4-bit comparator

Fig. 10 Structure of 4-bit comparator

The structure of 4-bit comparator is shown in Fig. 10. It is designed in a tree structure. At the
bottom level (Level 2), there are 4 one bit comparators. Each of them is used to compare the
corresponding bit in A and B. The meaning of the output f1 and f0 are the same as the meaning
in Fig. 10 (f1f0=10 means a=b, f1f0=00 means a<b, f1f0=01 means a>b).

Notice that the final comparison result depends on the comparison result of the most significant
bit which has determined the relation of the two integers. Take the 4 bit comparator shown in

9/13
ECE429 Fall 2014 Prof. Ken Choi

Fig. 10 for example. If the results from MSB A[3] and B[3] has shown that A[3] > B[3] or A[3]
< B[3] (in other words, f1 = 0), then it means A > B or A < B. On the other hand, if A[3] =
B[3] (f1 = 1), then we have to refer to the comparison result of next significant bit A[2] and
B[2]. If A[2] and B[2] are equal, we have to compare A[1] and B[1], and so on. If all the 4 bits
are equal (f1 from all the four one bit comparators are all 1s), then A = B. In fact, rather than
comparing bits from MSB to LSB sequentially, we can do the comparison in parallel in order to
save time, as we can see from Fig. 10. Remember that the left part of f1 and f0 results always
have higher priority than the right part of the f1 and f0. To be more specific, for the component
of mux_4to2 in Fig. 10, if hi_f1 = 0 which means the relation of A and B has already been
determined, then its output f1 and f0 should be consistent with hi_f1 and hi_f0. Otherwise, f1
and f0 should be consistent with lo_f1 and lo_f0.

From Fig. 10, we can see that the number of mux_4to2 is 3 which is equal to 4 1 and the
level of the tree is 3 which is equal to log2(4) + 1. More generally, if two N-bit (N is the power
of 2) unsigned integers are compared, then the tree comparator will be (log2(N) + 1) levels,
and it will consists of N -1 mux_4to2 and N one_bit_comp.

7.2 The structure view of the 32-bit comparator

The structure of the 32-bit comparator is shown in Fig. 11. You are supposed to finish the
Verilog coding of this structure and include it in the ALU design. There should be three modules
in your Verilog code: one_bit_comp, mux_4to2, and tree_comp. The definition part of each
module is included in file cpu_comp.v, and they are listed in Fig. 12. You should finish the
code in order to complete your new cpu design.

Fig. 11 Structure of 32-bit comparator

10/13
ECE429 Fall 2014 Prof. Ken Choi

Fig. 12: Codes to be finished

7.3 The new ALU design

After adding the 32-bit comparator to the ALU design, the new ALU will look like Fig. 13. Note
that we should extend the two bit output {f1, f0} to 32-bit result. Moreover, the original 4 to 1
MUX in the ALU should be changed to a 5 to 1 MUX, and its select signal OUTSEL should be 3
bits now. The ALU design has already been modified so that you can focus on the comparator
design.

11/13
ECE429 Fall 2014 Prof. Ken Choi

Fig. 13 Block diagram of the new ALU circuit

7.4 Report Submission for Case Study-2

For the cpu design with CLA adder:

1. Generate a new test bech file (tb_test_comp.v) for the following instruction set.

2. Provide the display screenshot or the text output of the RTL simulation and the screenshot
from simvision with the test bench (tb_test_comp.v).
12/13
ECE429 Fall 2014 Prof. Ken Choi

3. Synthesize the design and summarize cell.rep and timing.rep.

4. Provide the display screenshot or the text output of the post-synthesis simulation and the
screenshot from simvision.
5. Summarize timing.rep.5.final. What is the maximum clock frequency this circuit can run.
6. Provide the display screenshot or the text output of the post-P&R simulation and the
screenshot from simvision.

Good luck!

13/13

ECE 485 Computer Organization and Design: D I Mips CPU M D
No ratings yet
ECE 485 Computer Organization and Design: D I Mips CPU M D
32 pages
The Processor: (Datapath and Pipelining)
No ratings yet
The Processor: (Datapath and Pipelining)
144 pages
Semester_project
No ratings yet
Semester_project
3 pages
Exp 2
No ratings yet
Exp 2
10 pages
Cpu Design
No ratings yet
Cpu Design
8 pages
4 Bit Cpu Report
No ratings yet
4 Bit Cpu Report
16 pages
pipeline mips
No ratings yet
pipeline mips
28 pages
Pipelined Processor Design
No ratings yet
Pipelined Processor Design
28 pages
Lab 4
0% (1)
Lab 4
21 pages
Embedded Systems 220 Control Unit Design Notes: PC: PC + 1 PC: PC PC: Operand PC: PC + Operand
No ratings yet
Embedded Systems 220 Control Unit Design Notes: PC: PC + 1 PC: PC PC: Operand PC: PC + Operand
3 pages
MOD 4-I Simple Computer - Bottom Up Implementation
No ratings yet
MOD 4-I Simple Computer - Bottom Up Implementation
11 pages
Design and Characterization of A CMOS 8-Bit Microprocessor Data Path
No ratings yet
Design and Characterization of A CMOS 8-Bit Microprocessor Data Path
6 pages
Ch#4 Part 1, 2,34
No ratings yet
Ch#4 Part 1, 2,34
70 pages
Embedded Microprocessor Lecture 2
No ratings yet
Embedded Microprocessor Lecture 2
15 pages
MAKAUT Class Notes For Engineering
No ratings yet
MAKAUT Class Notes For Engineering
8 pages
Central Processing Unit
No ratings yet
Central Processing Unit
14 pages
UNIT 3
No ratings yet
UNIT 3
40 pages
8085 Microprocessor Architecture
No ratings yet
8085 Microprocessor Architecture
44 pages
8 Bit Microprocessor: Mayank Bhatnagar Vaibhav Mahimkar Dominic Alphonse
No ratings yet
8 Bit Microprocessor: Mayank Bhatnagar Vaibhav Mahimkar Dominic Alphonse
25 pages
ALU MISSDV0716 G.kaladhar Design Plan
No ratings yet
ALU MISSDV0716 G.kaladhar Design Plan
16 pages
CS2100: Single Cycle Implementation of MIPS Standard (Computer Organisation)
No ratings yet
CS2100: Single Cycle Implementation of MIPS Standard (Computer Organisation)
16 pages
D 32-CPU: Esign of A Bit Single Cycle
No ratings yet
D 32-CPU: Esign of A Bit Single Cycle
11 pages
Lab1 15
No ratings yet
Lab1 15
5 pages
Unit 4 Notes
No ratings yet
Unit 4 Notes
15 pages
MODULE-5
No ratings yet
MODULE-5
21 pages
Design of RISC Processor Using VHDL and Cadence
No ratings yet
Design of RISC Processor Using VHDL and Cadence
10 pages
8085 Architecture: Control Unit
No ratings yet
8085 Architecture: Control Unit
10 pages
Single Cycle
No ratings yet
Single Cycle
28 pages
Register File
No ratings yet
Register File
6 pages
ALU Design
No ratings yet
ALU Design
19 pages
Week 3 - Four Instruction CPU
No ratings yet
Week 3 - Four Instruction CPU
8 pages
Unit 4 Notes
No ratings yet
Unit 4 Notes
15 pages
DDCO - Module 5 - Part 1
No ratings yet
DDCO - Module 5 - Part 1
9 pages
Lec 21
No ratings yet
Lec 21
33 pages
CAO Unit 3 Notes
No ratings yet
CAO Unit 3 Notes
20 pages
Lecture 14 Building a Datapath Extended
No ratings yet
Lecture 14 Building a Datapath Extended
40 pages
Report On Special Assignment "Implementation of 8-Bit ALU On SPARTAN-3"
100% (6)
Report On Special Assignment "Implementation of 8-Bit ALU On SPARTAN-3"
18 pages
The Functional Block Diagram of 8085A Is Shown in Fig.4.1
No ratings yet
The Functional Block Diagram of 8085A Is Shown in Fig.4.1
9 pages
Single Cycle Datapath PDF
No ratings yet
Single Cycle Datapath PDF
30 pages
single-cycle-vs-multi-cycle-cpu
No ratings yet
single-cycle-vs-multi-cycle-cpu
11 pages
ASOGWA SOCHIOMA C.-Architecture
No ratings yet
ASOGWA SOCHIOMA C.-Architecture
12 pages
Chap 2
No ratings yet
Chap 2
70 pages
CA04_2024S2_printout
No ratings yet
CA04_2024S2_printout
31 pages
Group 43
No ratings yet
Group 43
10 pages
Microprocessor Lab Manual
No ratings yet
Microprocessor Lab Manual
85 pages
COA Module 2 Notes
No ratings yet
COA Module 2 Notes
46 pages
Unit - 3 The Processor Organization Structure
No ratings yet
Unit - 3 The Processor Organization Structure
16 pages
L-3 8 Bits Microprocessor
No ratings yet
L-3 8 Bits Microprocessor
12 pages
Lab 4: 8-Bit Arithmetic Logic Unit (ALU) Purpose: EEL 4712 - Fall 2004
No ratings yet
Lab 4: 8-Bit Arithmetic Logic Unit (ALU) Purpose: EEL 4712 - Fall 2004
4 pages
MIPS Single Cycle Processor Design
No ratings yet
MIPS Single Cycle Processor Design
59 pages
Slide 5
No ratings yet
Slide 5
31 pages
KAIST cs311 05 Proc I
No ratings yet
KAIST cs311 05 Proc I
28 pages
Unit Iii Microprocessors
No ratings yet
Unit Iii Microprocessors
31 pages
Unit 1 Lect 3
No ratings yet
Unit 1 Lect 3
16 pages
21CS401-CA-UNIT-3
No ratings yet
21CS401-CA-UNIT-3
39 pages
Deco M3
No ratings yet
Deco M3
46 pages
Computer Science II Essentials
From Everand
Computer Science II Essentials
Randall Raus
No ratings yet
Preliminary Specifications: Programmed Data Processor Model Three (PDP-3) October, 1960
From Everand
Preliminary Specifications: Programmed Data Processor Model Three (PDP-3) October, 1960
Digital Equipment Corporation
No ratings yet
Pic® Micro Principles V11
From Everand
Pic® Micro Principles V11
Clive W. Humphris
No ratings yet
Pic® Micro Principles Teachers Pack V11
From Everand
Pic® Micro Principles Teachers Pack V11
Clive W. Humphris
No ratings yet
Rockwell Automation TechED 2018 - SY10 - Lab Manual - Integrating CENTERLINE® Motor Control Centers With Studio 5000® and IntelliCENTER® Software PDF
100% (1)
Rockwell Automation TechED 2018 - SY10 - Lab Manual - Integrating CENTERLINE® Motor Control Centers With Studio 5000® and IntelliCENTER® Software PDF
110 pages
ESPA Interface Module BSL-333
No ratings yet
ESPA Interface Module BSL-333
2 pages
FGPA 2 Ahmad
No ratings yet
FGPA 2 Ahmad
15 pages
Centralized vs. Distributed Messaging Systems
No ratings yet
Centralized vs. Distributed Messaging Systems
4 pages
PE4576 - Technical Summary For Multilink Universal and Multilink Universal FX
No ratings yet
PE4576 - Technical Summary For Multilink Universal and Multilink Universal FX
6 pages
Crash Log From 12/26/15
No ratings yet
Crash Log From 12/26/15
29 pages
How To Analyze The FDR Output in Siebel Versions 7.7.x, 7.8.x and 8. (ID 473939.1)
No ratings yet
How To Analyze The FDR Output in Siebel Versions 7.7.x, 7.8.x and 8. (ID 473939.1)
11 pages
Build Info
No ratings yet
Build Info
17 pages
Reference Architecture
100% (1)
Reference Architecture
8 pages
Desktop Support Engineer
No ratings yet
Desktop Support Engineer
2 pages
Modern Programming Tools - and Techniques III - Shrivastava - Ibrg
No ratings yet
Modern Programming Tools - and Techniques III - Shrivastava - Ibrg
362 pages
Implementing Pfunctions
No ratings yet
Implementing Pfunctions
15 pages
Handout InLab1-LabA
No ratings yet
Handout InLab1-LabA
2 pages
Drive Installation Guide Leaflet v3
No ratings yet
Drive Installation Guide Leaflet v3
2 pages
Python and Data Structures Roadmap
No ratings yet
Python and Data Structures Roadmap
14 pages
The All New Switch Book (Dragged)
No ratings yet
The All New Switch Book (Dragged)
1 page
System Design Using FPGAs - Google Forms
No ratings yet
System Design Using FPGAs - Google Forms
5 pages
Case Study On Unix
No ratings yet
Case Study On Unix
6 pages
Specifications Compaq 500B Micro Tower PC - HP Customer Care (United States - English)
No ratings yet
Specifications Compaq 500B Micro Tower PC - HP Customer Care (United States - English)
2 pages
Firmware Interface Table Bios Specification r1p2
No ratings yet
Firmware Interface Table Bios Specification r1p2
17 pages
Food Ordering App in Android Development
No ratings yet
Food Ordering App in Android Development
46 pages
Tcs HR Question and Answers
No ratings yet
Tcs HR Question and Answers
7 pages
Major Incident Manager ITIL in San Antonio TX Resume Randall Nepsund
No ratings yet
Major Incident Manager ITIL in San Antonio TX Resume Randall Nepsund
3 pages
Basic Operating System Concept and Its Services
No ratings yet
Basic Operating System Concept and Its Services
10 pages
03 - Central - Analógica - Morley - DXc4
No ratings yet
03 - Central - Analógica - Morley - DXc4
4 pages
Mobile Programming
No ratings yet
Mobile Programming
13 pages
QVSTSUser Guide V3
No ratings yet
QVSTSUser Guide V3
85 pages
AJAX Interview Questions & Answers
No ratings yet
AJAX Interview Questions & Answers
7 pages
Untitled
No ratings yet
Untitled
37 pages
2.1.2 Structure Chart (MT-L)
No ratings yet
2.1.2 Structure Chart (MT-L)
8 pages

Project

Uploaded by

Project

Uploaded by

ECE429 Fall 2014 Prof.

ECE 429 Final Project

Figure 1. Overview of the primary blocks and signals

Figure 2. Memory file configuration

4. Arithmetic Logic Unit (ALU)

Figure 3. Block diagram of the ALU circuit

An instruction to be executed by the CPU is determined by the external control signals. An

Figure 4. Instruction word contents

6.1 RTL Simulation

The testbench (tb_cpu.v) we provided tests the following instruction set:

[0] STORE -10

6.2 Logic Synthesis and Post-Synthesis Simulation

verilog osu05_stdcells.v tb_cpu.v cpu.vh

6.3 Place & Route and Post-P&R Simulation

verilog osu05_stdcells.v tb_cpu.v final.v (with four adders design)

6.4 Explanation about the Different Adders used in Case Study-1

Carry Ripple Adder (Source code is provided, cpu_CRA.v)

Here are the brief description about the adder architectures.

Carry Ripple Adder

Figure 5 Carry Ripple Adder

Carry Lookahead Adder

Carry Skip Adder

Figure 7 4-bit Carry Skip Adder

Carry Select Adder

Figure 8 4-bit Carry Select Adder

Figure 9 16-bit Carry Select Adder

6.5 Report Submission for Case Study-1

CRA CLA CSA CSeA

7.1 The structure of 4-bit comparator

Fig. 10 Structure of 4-bit comparator

7.2 The structure view of the 32-bit comparator

Fig. 11 Structure of 32-bit comparator

Fig. 12: Codes to be finished

7.3 The new ALU design

Fig. 13 Block diagram of the new ALU circuit

7.4 Report Submission for Case Study-2

3. Synthesize the design and summarize cell.rep and timing.rep.

You might also like