0% found this document useful (0 votes)

101 views9 pages

LAB 5 - Implementing An ALU: Goals

The document describes implementing an arithmetic logic unit (ALU) in Verilog. It involves: 1) Drawing a block diagram of the ALU based on its operations. 2) Implementing the ALU in Verilog based on the block diagram. 3) Evaluating the speed and FPGA resource utilization of the implemented ALU using Vivado.

Uploaded by

Blmjdb Abdelhafid

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

101 views9 pages

LAB 5 - Implementing An ALU: Goals

Uploaded by

Blmjdb Abdelhafid

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

LAB 5 – Implementing an ALU

Goals
 Implement an Arithmetic Logic Unit (ALU) in Verilog.
 Learn how to evaluate the speed and FPGA resource utilization of a circuit in
Vivado.

To Do
 Draw a block level diagram of the MIPS 32-bit ALU, based on the description in
the textbook.
 Implement the ALU using Verilog.
 Synthesize the ALU and evaluate speed and FPGA resource utilization.
 Follow the instructions. Paragraphs that have a gray background like the current
paragraph denote descriptions that require you to do something.
 To complete the lab you have to show your work to an assistant before the
deadline, there is nothing to hand in. The required tasks are clearly marked with
gray background throughout this document. All other tasks are optional but highly
recommended. You can ask the assistants for feedback on the optional tasks.

Introduction
So far, we implemented fairly small circuits using Verilog. In this exercise we tackle
something more formidable, which is the heart of a processor – the arithmetic logic unit
(ALU). We implement an ALU that is similar to the one described in Section 5.2.4 of the
H&H textbook. This ALU is part of the small micro-controller we will build in the later
exercises.
This exercise is split over two labs: 5 and 6. In this lab (5), we write HDL description of
the ALU, and in the second lab (6) we will verify that it works correctly using a
testbench.
Until now, we have neglected to investigate performance-related numbers such as the
delay and area (i.e., FPGA resource utilization) of the circuit. We were simply happy as
long as our circuit worked. To be fair, the circuits we designed were very small, and did
not really warrant much investigation.
In this exercise, we build a decently-sized circuit, so we are also interested in how fast the
circuit is able to perform the arithmetic operations and what fraction of the available
FPGA resources it occupies. We will also try to see whether our coding style has an effect
on the speed and FPGA resource utilization.

Part 1 – Designing an ALU

We will design an ALU that can perform a subset of the ALU operations of a full MIPS
ALU. You can refer to Appendix B of the H&H textbook to see the full set of operations

1
that MIPS can support. In this exercise, we develop an ALU that takes two 32-inputs A
and B and will be able to execute the following seven instructions:
add, sub, slt, and, or, xor, nor
The ALU generates a 32-bit output that we call ‘Result’ and an additional 1-bit flag
‘Zero’ that will be set to ‘logic-1’ if all the bits of ‘Result’ are 0. The different operations
will be selected by a 4-bit control signal called ‘AluOp’ according to the following table.

AluOp (3:0) Mnemonic Result = Description

0000 add A + B Addition

0010 sub A B Subtraction

0100 and A and B Logical and

0101 or A or B Logical or

0110 xor A xor B Exclusive or

0111 nor A nor B Logical nor

1010 slt (A B)[31] Set less than

Others n.a. Don’t care

Table 1. Summary of the ALU control

(Note 1: You should extend the result of slt to 32 bits (i.e., 32’b0 or 32’b1).)
(Note 2: And, or, xor, nor are bitwise operations.)
Just to give an example when the ‘AluOp’ input is 0101, the function
Result = A or B;
should be calculated. It is easy to see that there are many values of ‘AluOp’ for which no
operation is defined. It is not very important what the circuit does when ‘AluOp’ has
these values, since the ‘Result’ will simply be ignored in these cases. You can use this to
your advantage to simplify the circuit.
Right now, the described operations may look random, but once we learn more about the
MIPS instruction set architecture, these choices will make more sense.

Designing the Block diagram

First, you need to draw a block diagram of the ALU, like the one seen in Figure 5.15 of
the H&H textbook. This exercise is based on a (more or less) real example; there will not
be a clear textbook ‘best’ solution for the circuit.

2
The following is one approach to analyze what is needed and come up with a block
diagram. You are free to follow this example or come up with your own ideas. It is just
important that you think about how the circuit should be implemented.
Let us first examine the different operations. You should see that we have two types of
instructions. The three instructions add, sub, slt require arithmetic operations, whereas
the four remaining and, or, xor, nor are bitwise operations. Now let us look at Table 1
and determine for which values of AluOp we perform an operation from which type. It
should be clear that when AluOp[2] is logic-0, we have an arithmetic operation and when
AluOp[2] is logic-1, we select a logic operation. This means that the output of either type
can be selected by a 2-input multiplexer that is controlled by AluOp[2]. Figure 1 depicts
an ALU design that includes a separate logic block (i.e., arithmetic part and logic part) for
each type of operation.

Figure 1. A possible division for the ALU

Now we can take a look at the two types individually. For the logic part, AluOp[1:0]
selects one of the 4 simple bitwise operations. In the arithmetic part, we realize that we
have an addition (add) or a subtraction (sub, slt). We can see that AluOp[1] is logic-0 for
additions and logic-1 for subtractions. This could allow us to build a structure like the one
in Figure 5.15 of the H&H textbook to design an adder-subtracter (controlled with
AluOp[1] instead of F[2]). Figure 2 shows such a design.

3
Figure 2. Possible organization for the adder subtracter in ALU
There is one more thing left, depending on the AluOp[3] we can select whether we take
only the most significant bit (logic-1, slt instruction), or we take the output as it is. We
show an example design in Figure 3.

Figure 3. A possible organization to implement slt

Draw a block diagram that will implement the ALU operations listed in Table 1. You are
free to decide how to implement the ALU and do not have to base the block diagram on
the above explanations. You may use arbitrary size adders, multiplexers, logic gates,
zero/sign extend, comparators and shifters.

Part 2 - Implementation
Once we have a good block diagram it is straightforward to implement the circuit in
Verilog. Replace each block with a Verilog description and use the signal names in the
block diagram.
Start Vivado and create a new project (you can call it Lab5). Make sure to select
“xc7a35tcpg236-1” as your FPGA since otherwise you cannot download the bitstream of
your design to the Basys 3 board. Implement the ALU based on your block diagram.
Synthesize and implement your design. (We do not transfer the design to FPGA in this

4
lab, therefore we do not provide you a constraints file. Thus, the implementation will run
correctly, but the bitstream generation will fail.)
Hint 1: You can use 32’b0 to represent a 32-bit zero.
Hint 2: In Verilog, you can concatenate multiple bits together using curly braces {}. For
example: {2’b10, 1’b1} results in 3’b101.
At this point we really do not know if our circuit functions properly. Unlike the other
exercises we cannot verify that our circuit works by directly trying it out since there are
too many input bits. Instead, we use a testbench to verify the functionality in the next lab
(Lab 6).
Until now, we have always verified our circuits by exhaustively testing them. Assume
that we can test 1 input every second, how long would it take us to test our ALU by trying
each and every possible input combination. Please consider only the 7 valid combinations
for the AluOp in Table 1. Provide the calculations.

Part 3 – The Performance of the Circuit

Until now, we did not evaluate the speed and area of our implementation. In this lab, we
will learn to check the speed (i.e., max frequency our circuit can run at) and area (i.e.,
FPGA resource utilization).

We provide instructions for evaluating speed and area using Vivado. In Vivado, after
running Implementation, go to ‘Window → Project Summary’. It shows a window similar
to the one shown in Figure 4. The design summary window provides many of the
important design parameters (e.g., the Timing and Utilization panes).

5
Figure 4. Design Summary window (example)

In the Utilization pane (left bottom area), click on the “Table” button in the “Post-
Implementation” tab. The size of the circuit is expressed in terms of the fraction of the
total available resources of the FPGA that used for the design. For instance, the above
example uses 108 out of 20800 Look-up Tables (LUTs) of the FPGA, which is less than
1% of the total.
Getting the timing report in Vivado is slightly more complicated . Design tools such as
Vivado are not really able to come up with the best possible circuit implementation for a
given Verilog description as the placement and routing procedures are computationally
expensive. Instead, the tools try to come up with a circuit that satisfies the given user
constraints. In other words, you need to tell Vivado, “here is the description of the circuit,
and I want you to implement this description so that it works with a clock frequency of
50 MHz”. Vivado tries to satisfy this constraint and reports whether or not it has achieved
it. In the Project Summary, it has a section of ‘Timing’, which lists how many of the
timing paths violate the given timing constraints. In the above example it is shown as NA
(not available). That is because we did not set any timing constraints, so Vivado cannot
report the timing. We will add a timing constraint to set the maximum delay that we
would like our ALU to have.

Adding Simple Timing Constraints

All user constraints are included in an XDC file that we have previously used for
connecting the input/output ports of the top module to the FPGA pins. Make sure to add
an XDC file into your project. If we know how to express timing constraints, we could

6
just go ahead and type in the constraint in a text editor like we did for determining the
pins. We can also use a GUI based tool to edit the same file.
In the exercises we always use fairly simple circuits, and adjust the requirements so that
exercises can be done easily. In real life, we sometimes need to add many different
constraints to get a working circuit. This is why the constraint editor is slightly complex.

In the Flow Navigator, click on “Implementation → Open Implemented Design → Edit

Timing Constraints”. In the newly opened “Timing Constraints” tab, click in the left tree
view on “Exceptions → Set Maximum Delay” and add a new constraint by clicking on the
green plus sign. A new window will pop up as shown in Figure 5. Set “Specify path
delay” to 20ns, “From” and “To” to “*” and, click OK.

Figure 5. Constraints for the ALU

This tells Vivado that you want to take a maximum of 20 ns to propagate a signal from
any input to any output. Press “Ctrl+S” to save the file and if necessary, create an
additional constraint profile. You will see that a XDC file has been added to the design. If
you open the file with a text editor, you will notice that it includes a simple line:
set_max_delay from * to * 20.000
If you know how the constraint can be expressed, it is usually much easier (and faster) to
type in the constraints in a text editor. However, it is not always easy to figure out what
exactly to type.

7
Since we have constrained our design, we can re-run the implementation to generate the
timing report, which we can see in the Project Summary.
After implementing the design, you should see values in the ‘Timing’ pane in the ‘Project
Summary’. You should see that your constraint of 20 ns was achieved. The slack (around
1 ns in this case) is the difference between the delay that the circuit actually has and the
constraint 20 ns .
More detailed reports can be found in “Taskbar → Window → Reports”. For the timing
report, select in its tree structure "Implementation -> Route Design -> Timing Summary
Report". The report provides you the slow paths. You see from which input pin each path
begins, which locations it goes through, and where it ends. At each step you see how
much delay comes due to a logic operation and routing.

Investigate the different reports to find the answers for the questions below. Show the
assistants your result in this part.
Number of 4 Input LUTs
Number of bonded IOBs
Which pin of the FPGA is the output ‘zero’ connected?
(pin name)
Where does the longest path start from
Where does the longest path end
How long is the longest path
How much of the longest path is routing
How many levels of logic is in the longest path

Last Words
It is possible to design a digital circuit without first developing a block diagram on paper.
However, it is always easier to write a hardware description of a circuit that exists as a
block diagram. After all, the ‘hardware description’ is just a translation of the circuit idea
into the syntax of the specific language.
Synthesis tools can convert your hardware idea into a working circuit and can report
performance on all related numbers. However, if you do not have an expectation of the
architecture and the performance, you cannot judge whether or not these are good
numbers.
In class, we have learned that usually adders are the most critical elements when it comes
to determining the performance of an arithmetic circuit. A high-performance adder can be
a costly block. In our example, three operations (add, sub, slt) are based on an adder. A

8
naive implementation would have a separate adder for each of these operations, resulting
in a relatively large circuit. We should make sure that all three operations are realized by
sharing one adder (at least if we are concerned about the area cost of the circuit).
Modern synthesis tools are quite sophisticated and do most of the work for you.
Moreover, they are continuously improving. Chances are very good that they
automatically figure out what is the best implementation for your code. Unfortunately,
they are far from perfect, and for larger designs with complex functionality (in designs
where things matter), experienced design engineers are still indispensable.

8 Bit ALU Design in Modelsim Using Verilog With Code and Test Bench
60% (5)
8 Bit ALU Design in Modelsim Using Verilog With Code and Test Bench
17 pages
Class 12 Sumita Arora C++ ch08 Pointers PDF
No ratings yet
Class 12 Sumita Arora C++ ch08 Pointers PDF
18 pages
Introduction
No ratings yet
Introduction
25 pages
65nm CMOS Process Data Sheet
0% (1)
65nm CMOS Process Data Sheet
1 page
1511 MAX Ds
No ratings yet
1511 MAX Ds
4 pages
Report On Special Assignment "Implementation of 8-Bit ALU On SPARTAN-3"
100% (6)
Report On Special Assignment "Implementation of 8-Bit ALU On SPARTAN-3"
18 pages
Chapter 4 Linked Stacks and Queues
No ratings yet
Chapter 4 Linked Stacks and Queues
56 pages
One Bit Arithmetic Logic Unit
No ratings yet
One Bit Arithmetic Logic Unit
77 pages
Implementation of 8 Bit Alu in Fpga: EX - NO. 1 DATE: 11-2-2010
No ratings yet
Implementation of 8 Bit Alu in Fpga: EX - NO. 1 DATE: 11-2-2010
109 pages
Standards and Best Practices On Process Server and WID
No ratings yet
Standards and Best Practices On Process Server and WID
36 pages
Avrdude Doc 5.11.1
No ratings yet
Avrdude Doc 5.11.1
42 pages
Objectives
No ratings yet
Objectives
15 pages
Ec 115 Extens Columns
No ratings yet
Ec 115 Extens Columns
34 pages
ALU Project Documentation
33% (3)
ALU Project Documentation
42 pages
Chapter 3,4
No ratings yet
Chapter 3,4
12 pages
Cover Pages
No ratings yet
Cover Pages
4 pages
ASM8085
No ratings yet
ASM8085
15 pages
Log
No ratings yet
Log
67 pages
Single Cycle Impentation
No ratings yet
Single Cycle Impentation
52 pages
Sybex CCNA 640-802: Chapter 7: Managing A Cisco Internetwork
No ratings yet
Sybex CCNA 640-802: Chapter 7: Managing A Cisco Internetwork
33 pages
Project 12
No ratings yet
Project 12
16 pages
Copyno1 Lastone February (Final)
No ratings yet
Copyno1 Lastone February (Final)
153 pages
Boom Box Manual Appendix Drawings: Seismic
No ratings yet
Boom Box Manual Appendix Drawings: Seismic
13 pages
DDCA Lab05
No ratings yet
DDCA Lab05
3 pages
Chapter 1,2
No ratings yet
Chapter 1,2
7 pages
Computer Architecture: Nguyễn Trí Thành
No ratings yet
Computer Architecture: Nguyễn Trí Thành
101 pages
Labsheet10 Functions
No ratings yet
Labsheet10 Functions
2 pages
Lab 3 Manual24
No ratings yet
Lab 3 Manual24
7 pages
Lab 4 1 6
No ratings yet
Lab 4 1 6
6 pages
Tute 6: Q.1 Q.3 Q.4 (A) Q. (5) Q.6 (A)
No ratings yet
Tute 6: Q.1 Q.3 Q.4 (A) Q. (5) Q.6 (A)
5 pages
Upgrade From Jinit To Jre 6U29 For 11i
No ratings yet
Upgrade From Jinit To Jre 6U29 For 11i
4 pages
KashifBari CV
No ratings yet
KashifBari CV
4 pages
Alu PRJCT Report
No ratings yet
Alu PRJCT Report
15 pages
Computer Science 306 - Assignment 1 - Creating A Simple ALU-1
No ratings yet
Computer Science 306 - Assignment 1 - Creating A Simple ALU-1
8 pages
Cmos Vlsi Design Lab 4: Controller Design: 1. Standard Cell Library
No ratings yet
Cmos Vlsi Design Lab 4: Controller Design: 1. Standard Cell Library
8 pages
TELKOMNIKA Journal Paper
No ratings yet
TELKOMNIKA Journal Paper
9 pages
SCG 5x Ve DG
No ratings yet
SCG 5x Ve DG
33 pages
L7 - L9 MIPS Datapath Single Cycle Datapath
No ratings yet
L7 - L9 MIPS Datapath Single Cycle Datapath
30 pages
The Design of Arithmetic Logic Unit Based On ALM
No ratings yet
The Design of Arithmetic Logic Unit Based On ALM
5 pages
ORA-06553: PLS-306: Wrong Number or Types of Arguments in Call To 'FUNC1'
No ratings yet
ORA-06553: PLS-306: Wrong Number or Types of Arguments in Call To 'FUNC1'
2 pages
Command List-81
No ratings yet
Command List-81
3 pages
Log
No ratings yet
Log
16 pages
Unit2 San Intelligent Storage System
No ratings yet
Unit2 San Intelligent Storage System
9 pages
Dbms Assignment 2
No ratings yet
Dbms Assignment 2
5 pages
L12 Sglcycle Datapath
No ratings yet
L12 Sglcycle Datapath
69 pages
Sabu Verilog
No ratings yet
Sabu Verilog
25 pages
B22EE087
No ratings yet
B22EE087
7 pages
Fenris Debug
No ratings yet
Fenris Debug
40 pages
AVA-183MP
No ratings yet
AVA-183MP
9 pages
Dokumen - Tips - 8 Bit Alu Report
No ratings yet
Dokumen - Tips - 8 Bit Alu Report
21 pages
Fahad 2019
No ratings yet
Fahad 2019
5 pages
Ece C
No ratings yet
Ece C
70 pages
Brief History of The Notebook
No ratings yet
Brief History of The Notebook
12 pages
CS F342 ComputerArchitecture Lab3
No ratings yet
CS F342 ComputerArchitecture Lab3
12 pages
Ava 183P DG+
No ratings yet
Ava 183P DG+
6 pages
Ava 24a D+
No ratings yet
Ava 24a D+
6 pages
Lab 8 Report
No ratings yet
Lab 8 Report
28 pages
C Programming Exercises
100% (1)
C Programming Exercises
11 pages
CMA-62
No ratings yet
CMA-62
5 pages
Design and Implementation of An 8-Bit ALU Based On
No ratings yet
Design and Implementation of An 8-Bit ALU Based On
6 pages
EHC-24L
No ratings yet
EHC-24L
5 pages
Lec 30
No ratings yet
Lec 30
19 pages
and Install The Windows ADK - Microsoft Learn
No ratings yet
and Install The Windows ADK - Microsoft Learn
6 pages
N-Instage Final Assigment Instructions (HFC-N)
No ratings yet
N-Instage Final Assigment Instructions (HFC-N)
2 pages
PMA3-14LN
No ratings yet
PMA3-14LN
4 pages
3.2.8 Packet Tracer - Investigate A VLAN Implementation
No ratings yet
3.2.8 Packet Tracer - Investigate A VLAN Implementation
2 pages
Log
No ratings yet
Log
2 pages
AMP-75
No ratings yet
AMP-75
2 pages
AMP-77
No ratings yet
AMP-77
2 pages
CMA-83LN
No ratings yet
CMA-83LN
5 pages
MC Labmanual
No ratings yet
MC Labmanual
17 pages
Amp 15
No ratings yet
Amp 15
2 pages
Arithmetic Logic Unit (ALU)
No ratings yet
Arithmetic Logic Unit (ALU)
11 pages
AVA-183P
No ratings yet
AVA-183P
4 pages
Lab 13
No ratings yet
Lab 13
10 pages
SS1
No ratings yet
SS1
3 pages
S3
No ratings yet
S3
3 pages
FPGA-Based-System-Design LAB JOURNAL 2
No ratings yet
FPGA-Based-System-Design LAB JOURNAL 2
56 pages
Module D - Latch (C, D, Q) Input C, D
No ratings yet
Module D - Latch (C, D, Q) Input C, D
2 pages
Install Panorama On KVM
No ratings yet
Install Panorama On KVM
8 pages
Test 3
No ratings yet
Test 3
2 pages
Test 4
No ratings yet
Test 4
2 pages
Lab1 15
No ratings yet
Lab1 15
5 pages
ALU
No ratings yet
ALU
5 pages
Team Name Groupbips: Group B
No ratings yet
Team Name Groupbips: Group B
32 pages
Lab 6
No ratings yet
Lab 6
11 pages
ICS155B Lab Assignments: Lab 4 Design of MIPS Datapath
No ratings yet
ICS155B Lab Assignments: Lab 4 Design of MIPS Datapath
3 pages
D Latch1
No ratings yet
D Latch1
1 page
Lab 4: 8-Bit Arithmetic Logic Unit (ALU) Purpose: EEL 4712 - Fall 2004
No ratings yet
Lab 4: 8-Bit Arithmetic Logic Unit (ALU) Purpose: EEL 4712 - Fall 2004
4 pages
Namal University, Mianwali: Department of Computer Science
No ratings yet
Namal University, Mianwali: Department of Computer Science
12 pages
Verilog Lab RDD
No ratings yet
Verilog Lab RDD
5 pages
Lab05 PDF
No ratings yet
Lab05 PDF
3 pages
Q.1) A) - Explain (1) ALU: Definition
No ratings yet
Q.1) A) - Explain (1) ALU: Definition
16 pages
LAB 5 - Implementing An ALU Goals
No ratings yet
LAB 5 - Implementing An ALU Goals
9 pages
S Rawat
No ratings yet
S Rawat
49 pages
Lab4 Sheet Aludesign
No ratings yet
Lab4 Sheet Aludesign
7 pages
VLSI Lab Report 8
No ratings yet
VLSI Lab Report 8
24 pages
Assignment 3
No ratings yet
Assignment 3
3 pages
Lab 3 Summer 2013
No ratings yet
Lab 3 Summer 2013
8 pages
Alu 32 Bit
No ratings yet
Alu 32 Bit
6 pages
Digital Circut Design - Project
No ratings yet
Digital Circut Design - Project
14 pages
Lab 8
No ratings yet
Lab 8
3 pages
Tribhuwan University Institute of Engineering Purwanchal Campus, Dharan
No ratings yet
Tribhuwan University Institute of Engineering Purwanchal Campus, Dharan
5 pages
Downloading The Software
No ratings yet
Downloading The Software
5 pages
Vlsi Lab Manule
No ratings yet
Vlsi Lab Manule
33 pages
Design of Combinational Logic: Full Adder, Adder, and ALU: 1 Today's Goal
No ratings yet
Design of Combinational Logic: Full Adder, Adder, and ALU: 1 Today's Goal
4 pages
Lab 4
No ratings yet
Lab 4
6 pages
James Smith - Build Your Own Web Server From Scratch in Node - JS - Learn Network Programming, HTTP, and WebSocket by Coding A Web Server (2024)
100% (2)
James Smith - Build Your Own Web Server From Scratch in Node - JS - Learn Network Programming, HTTP, and WebSocket by Coding A Web Server (2024)
132 pages
Digital Circuit Simulation Using Excel
From Everand
Digital Circuit Simulation Using Excel
Anthony Mazzurco
No ratings yet
Exploring BeagleBone: Tools and Techniques for Building with Embedded Linux
From Everand
Exploring BeagleBone: Tools and Techniques for Building with Embedded Linux
Derek Molloy
4/5 (1)
Test 2
No ratings yet
Test 2
1 page
An Introduction To Digital Design
From Everand
An Introduction To Digital Design
Jason King
2/5 (1)

LAB 5 - Implementing An ALU: Goals

Uploaded by

LAB 5 - Implementing An ALU: Goals

Uploaded by

LAB 5 – Implementing an ALU

Part 1 – Designing an ALU

AluOp (3:0) Mnemonic Result = Description

0000 add A + B Addition

0010 sub A ­ B Subtraction

0100 and A and B Logical and

0101 or A or B Logical or

0110 xor A xor B Exclusive or

0111 nor A nor B Logical nor

1010 slt (A ­ B)[31] Set less than

Others n.a. Don’t care

Table 1. Summary of the ALU control

Designing the Block diagram

Figure 1. A possible division for the ALU

Figure 3. A possible organization to implement slt

Part 3 – The Performance of the Circuit

Adding Simple Timing Constraints

In the Flow Navigator, click on “Implementation → Open Implemented Design → Edit

Figure 5. Constraints for the ALU

You might also like

0010 sub A B Subtraction

1010 slt (A B)[31] Set less than