0% found this document useful (0 votes)

11 views13 pages

Lab 6

cpen211

Uploaded by

fbadawi05

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views13 pages

Lab 6

cpen211

Uploaded by

fbadawi05

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

DEPARTMENT OF ELECTRICAL AND COMPUTER ENGINEERING

UNIVERSITY OF BRITISH COLUMBIA

CPEN 211 Computer Systems I, Fall 2024
Lab 6: Finite State Machine Controller for the “Simple RISC Machine”
Week of Nov 18 to 22 (your code must be submitted by 9:59 PM the evening before your lab session)

1 Introduction
In this lab you extend your datapath from Lab 5 to automate the process of executing instructions on your
datapath. If you did not complete Lab 5 you can use someone else’s Lab 5 solution as a starting point for
this lab, provided both you and the person sharing their code have received a mark for Lab 5 and both of
you register the borrowing on the http://cpen211.ece.ubc.ca/cwl/student_register_peer_help.php website as
outlined in the CPEN 211 Academic Integrity Policy and you MUST also explicitly mention this fact in
CONTRIBUTIONS.txt. If you do use someone else’s Lab 5 it is recommended you use it to help you fix
errors or missing functionality in your own Lab 5.
In this lab you (1) add a finite state machine controller to automate the process of setting control inputs
to the datapath; and, (2) adding an instruction register to provide some inputs to your finite state machine. In
the lab procedure outlined in Section 4 you implement these two additions together. To help you understand
the required changes Section 2 provides an explanation for why the changes are needed. It does this by
considering preliminary and incomplete designs. Reading the material in Section 2 will help you “under-
stand the system” (the first rule of debugging). Next, Section 3 introduces the six instructions you need to
implement for this lab. Section 4 specifies the changes you need to make in this lab. Sections 5, 6 and 7
describe the lab marking scheme, submission and demo procedures.
2 Tutorial: How to control a datapath with a finite state machine
In this section we consider a sequence of incomplete designs to see how each part we are adding in this lab
helps. Do not write (System)Verilog for the designs in this section.
2.1 Controller for a single, fixed instruction (ADD R2, R5, R3)
To execute “ADD R2, R5, R3” on the datapath in Lab 5 requires four clock cycles and manually setting the
control inputs. A faster way to set the control inputs is using a finite state machine. Figure 1 illustrates a
preliminary state machine design for implementing “ADD R2, R5, R3”. The inputs to the state machine
are reset, s and the clock (not shown). The outputs of the FSM are inputs to the Lab 5 datapath. In Figure 1
any datapath input not shown as an output of a given state is 0 in that state. For example, when in state
GetA the datapath input loadc is assumed to be 0 even though it is not explicitly shown inside the circle
representing state GetA. Later, in Section 4.2, you will design your own state machine. When you do,
remember you need to set all outputs in every state (including zero and don’t care outputs).

Figure 1: Finite state machine for “ADD R2, R5, R3” (do not build).

In Figure 1, after reset the state machine waits for a start signal s in state Wait. Here a value of 1 on

CPEN 211 - Lab 6 1 of 13

s indicates we should execute the sequence of four steps that implements “ADD R2, R5, R3”, where each
step takes one clock cycle (e.g., one rising edge of clk).
Figure 2 illustrates the timing of a circuit combining the controller in Figure 1 with the data path from
Lab 5 would behave assuming R5 initially contains 16’h13. ModelSim performs functional simulation
in which the output of flip-flops and combinational logic occur after an infinitesimally small (but non-zero)
delay so that values will appear to change right at the rising edge of clk. To understand Figure 2 it is
important to remember the output of flip-flops and combinational logic in a real circuit do not actually
change instantly after the rising edge of the clock in a real circuit.
In detail the behavior in Figure 2 is as follows: When in state Wait and s is 1, the rising edge of clk at the
start of Cycle n causes state to change to GetA. The change in state causes readnum to be set to 5 (shown
as “3’d5”) and loada is set to 1. At the next rising edge of the clock, between Cycle n and Cycle n+1, the
contents of register R5 are copied to Register A and simultaneously state changes to GetB. The state change
to GetB causes readnum to change from 5 to 3. If you were to look at this waveform in ModelSim the slight
delays in the change of state, readnum, loada, loadb and A after the rising edge of clk would not be shown.
The lack of such a delay in ModelSim waveforms leads some students to worry that readnum changing at
this rising edge could cause the contents of R3 to get copied into Register A (instead of R5). Provided you
have written Verilog that follows the synthesis rules taught in class, this incorrect behavior will not happen
in the real circuit or your ModelSim simulations for two reasons: First, the flip-flops inside of register A
start to copy their D input to their Q output at the rising edge of clk between Cycle n and Cycle n+1. Due
to circuit delays, we do not see the output of register A change until some time after the rising edge but the
copying started at the rising edge. Second, again reflecting how the real circuit operates the flip-flop outputs
connected to state change slightly after the rising edge of clk and the state machine outputs readnum, loada
and loadb change slightly after the change in state due to the delay of combinational logic. Thus, loada and
readnum are still equal to 1 and 5 when the update of register A starts so the contents of R5 are (correctly)
copied to A before the change in readnum during Cycle n+1 can cause a problem.

Figure 2: Register A updated on same rising edge of clk that state changes from GetA to GetB.

2.2 Controller for a more general “ADD” instruction

What if we want to be able to use different registers besides R2, R5, and R3 while performing addition? We
can introduce some programmability by adding an instruction register. Figure 3 illustrates the datapath from
Lab 5 combined with a state machine and instruction register. In this figure, the block labeled “Instruction

CPEN 211 - Lab 6 2 of 13

Register” contains three 3-bit fields: Rd, Rn and Rm. Each 3-bit field is used to specify the name of one of the
eight registers inside the register file inside the datapath. The block labeled “FSM Controller” implements
the (revised) state machine illustrated at the top of Figure 3. This state machine uses the datapath and
instruction register to implement “ADD Rd, Rn, Rm”. For example, to execute “ADD R2, R5, R3” we
would set Rd=3’b010, Rn=3’b101, and Rm=3’b011.
Compared with Figure 1 the output of the states for the state machine in Figure 3 have been modified.
Instead of directly specifying constant values for readnum and writenum the state machine now has an
output called “nsel”. The signal nsel is used as the select input of a three input multiplexer. This multiplexer
selects which of the 3-bit values Rd, Rn or Rm inside “Instruction Register” should be driven to readnum
and writenum (we used the same multiplexer for readnum and writenum as at any given time we are either
reading or writing).

Figure 3: A partial instruction register (do NOT build).

2.3 Controller for more than one instruction

Next, we consider how to extend the design above to execute multiple types of instructions. For example,
suppose we want to be able to execute “MOV R3, #42” as well as “ADD R2, R5, R3”? We can do this
by extending the instruction register to include an opcode field indicating the type of instruction. Figure 4
illustrates a modified design where the instruction register now includes a 1-bit opcode field specifying
whether to perform addition (opcode=0) or move immediate (opcode=1). To support the MOV instruction,
the instruction register is also extended to include some bits, labeled Immediate, used to specify the value
to copy into the register named by Rd during the MOV instruction. For example, for “MOV R3, #42”
Immediate would contain the value 42 represented as a binary number. The top part of Figure 4 shows
we extended our state machine to include a state Decode whose role is choose between performing the
sequence of four steps required for an ADD instruction versus the single step required for this MOV. To enable
the correct transition out of Decode note that we made the value of opcode an input to the state machine.

CPEN 211 - Lab 6 3 of 13

We also connected the 16-bit Immediate field of the new instruction register to the datapath_in portion of
the datapath.

Figure 4: Supporting both MOV and ADD instructions (do NOT build).

Our design can now support execution of multiple types of instruction and the registers used by an
instruction can be varied after the hardware is built. At this point the instruction to execute is encoded with
26-bits: One bit for the opcode, 9-bits total for the three 3-bit register specifiers and 16-bits for the constant
value. For example, the operation “ADD R2, R5, R3” would be encoded as:
0 010 101 011 0000000000000000

Different computer instruction set architectures (ISAs), such as x86 and ARM, represent a given operation,
such as addition, using a different encoding (pattern of 1’s and 0’s). For ARM processors, each instruction is
encoded in 32-bits. For x86, different instructions may be encoded using a different number of bits (between
8 and 120 bits). For Lab 6 through 7 we provide an encoding for the Simple RISC Machine ISA in which
each instruction is encoded in just 16-bits. The portion of the Simple RISC Machine ISA that you will
implement in Lab 6 is shown in Table 1, which is explained below. Using this encoding is required by the
autograder for Lab 6.
3 The Simple RISC Machine Instruction Set Architecture
This section introduces the six instructions you will implement in Lab 6.

CPEN 211 - Lab 6 4 of 13

3.1 Assembly Syntax and Encoding
Each row of Table 1 defines an instruction of the Simple RISC Machine ISA that you will implement in
Lab 6. Note the encodings in this table differ from ARM.
Table 1 is divided into three main sections. The first column, labeled “Assembly Syntax”, is a human-
readable textual representation of each instruction. In assembly format each Simple RISC Machine instruc-
tion starts with an opcode mneumonic. For example, in “MOV Rn,#<im8>” the opcode mneumonic is MOV.
We use this representation because it is easier to remember that MOV means “move a value from somewhere
to a register” than it is to remember what “110” means. Here, “110” is the value under the heading “op-
code” in the next column of Table 1. Returning to the Assembly Syntax column we see that the registers
used by each instruction are also given a mneumnic representation. For example, for “MOV Rn,#<im8>” the
instruction will move a value into the register Rn, where Rn can be R0, R1, ... R7. Recall these are the names
of registers inside the register file that you built in Lab 5. Some instructions also include other information
when written in assembly format using the Assembly Syntax. For example, in “MOV Rn,#<im8>” the part
“#<im8>” represents an 8-bit binary number encoding a value between -128 and 127 (in decimal). The
8-bit value is stored inside the instruction itself. Values used by a program that are encoded inside instruc-
tions like this are typically referred to as an “immediate operand” in most instruction set architectures. In
“MOV Rd,Rm{,<sh_op>}” the portion “{,<sh_op>}” is an optional shift operation (use “LSL#1”, “LSR#1”,
or “ASR#1” for <sh_op>). This part of the instruction indicates what you want the shifter you built in Lab 5
to do when executing the instruction.
When programming in assembly syntax in Lab 7 you will write instructions using the notation in the
first column of Table 1, but you replace Rn, Rm, and Rd with the names of specific registers between R0 and
R7. Similarly, you will replace <im8> with a signed number between -128 and 127 and replace <sh_op>
with one of LSL#1, LSR#1, or ASR#1.
The last column, labeled “Operation”, specifies precisely what the instruction should do using the dat-
apath from Lab 5. For example, for “MOV Rn,#<im8>” this column contains “R[Rn] = sx(im8)”. The
“sx()” part tells us that the 8-bit immediate value #<im8> should be sign-extended to 16-bits. That means
we interpret the 8-bit value as a 2’s complement number and if the most significant bit of the 8-bit value,
(bit 7) has the value 1, then we consider the 8-bit number to be negative. When converting the number to
16-bits, we fill in the upper 8-bits with all 1’s. Similarly, if bit 7 was 0, we would fill in the upper 8-bits
with all 0’s. The portion “R[Rn] =” indicates that this sign extended value should be placed into the 16-bit
register identified by Rn. Note that the steps specified under the “Operation” column can take more than one
clock cycle. The notation used in this part of the table is summarized as follows:

• Rn, Rd, Rm are 3-bit register number specifiers.

• im8 is an 8-bit immediate operand encoded as part of the instruction.
• <sh_op> and sh are 2-bit immediate operands encoded as part of the instruction. The value of sh
is used to control the shifter you built in Lab 5.
• sx(f) sign extends the immediate value f to 16-bits.
• sh_Rm is the value of Rm after passing through the shifter connected to the Bin input to the ALU.
• status is a 3-bit status register. The three bits are called Z, V and N which stand for zero, overflow
and negative, respectively.
• f(x) is a 3-bit value, corresponding to the three bits called Z, V and N in the status register, indicating
whether x is zero, whether the calculation caused an overflow, and/or whether x is a negative 2’s
complement value.
• R[x] refers to the 16-bit value stored in register x.

CPEN 211 - Lab 6 5 of 13

“Simple RISC Machine” 16-bit encoding
Assembly Syntax (see text) Operation (see text)
15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0
Move Instructions opcode op 3b 8b
MOV Rn,#<im8> 1 1 0 1 0 Rn im8 R[Rn] = sx(im8)
MOV Rd,Rm{,<sh_op>} 1 1 0 0 0 0 0 0 Rd sh Rm R[Rd] = sh_Rm
ALU Instructions opcode ALUop 3b 3b 2b 3b
ADD Rd,Rn,Rm{,<sh_op>} 1 0 1 0 0 Rn Rd sh Rm R[Rd]=R[Rn]+sh_Rm
CMP Rn,Rm{,<sh_op>} 1 0 1 0 1 Rn 0 0 0 sh Rm status=f(R[Rn]-sh_Rm)
AND Rd,Rn,Rm{,<sh_op>} 1 0 1 1 0 Rn Rd sh Rm R[Rd]=R[Rn]&sh_Rm
MVN Rd,Rm{,<sh_op>} 1 0 1 1 1 0 0 0 Rd sh Rm R[Rd]= ~sh_Rm

Table 1: Assembly instructions introduced in Lab 6 (this table is explained in Section 3)

Finally, the set of columns under the heading “Simple RISC Machine” 16-bit encoding in Table 1 specify
exactly how an instruction in assembly format should be converted into the 1’s and 0’s that will be placed
into the instruction register. For example, the instruction “ADD R4, R0, R1” is encoded as the 16-bit value:
101 00 000 100 00 001

Bits 15, 14, and 13 of each instruction is a special “operation code” or “opcode” that identifies the basic
operation the instruction performs. Thus, the first three bits above, 101, represents the “opcode” used for
the instruction “ADD R4, R0, R1”. The opcode specifies the type of operation performed by the instruction.
Instructions with opcode 101 are defined in the Simple RISC Machine ISA to be “ALU instructions”. ALU
instructions read two registers perform an operation on the values in these registers using the ALU and then
write the result back to a register in the register file. For such instructions the next two bits, Bit 12 and Bit
11, specify the ALUop input to the ALU. In this example these bits have the value 00, which corresponds to
the addition operation for the ALU you designed in Lab 5. The next three bits indicate the middle register
(called Rn). In our example, this is R0, so the next three bits are 000. The next three bits encode the
destination register that will be written by the instruction. In our example, this is R4. For ALU instructions
the next two bits are the “shift” input to the shifter in your Lab 5 datapath. The final three bits specify the
other register that is read by the instruction. In this example, that is R1, which is encoded as 001.
Rn, Rd, Rm are 3-bit numbers that refer to one of the eight 16-bit registers inside of the register file.
in the first column “{,<sh_op>}” is an optional shift operation (use “LSL#1”, “LSR#1”, or “ASR#1” for
<sh_op>); sx() means sign extend (described below); sh_Rm is the 16-bit value resulting from shifting
Rm using the code “sh” (bits 3:4) as input to the shifter from Lab 5.
3.2 Instruction Descriptions
Next, we briefly summarize the operation of each instruction. The first instruction in Table 1, “MOV Rn, #<im8>”,
takes bits 0 to 7 of the instruction (labeled “im8”) and “sign extends” these bits to a 16-bit value. Recall that
in 2’s complement the most significant bit is a 1 if the number is negative and it is 0 if the number is positive.
We can take an 8-bit positive number (with bit 7 equal to zero) and make a 16-bit positive number with the
same value by simply concatenating 8 bits that are all zero. Similarly, we can take an 8-bit negative number
(with bit 7 equal to 1) and make a 16-bit negative number with the same value by concatenating 8-bits with
all 1’s. E.g., consider sign extending the number 3 from 8-bits to 16-bits:
8-bit representation of 3 16-bit representation of 3
00000011 0000000000000011

Similarly, consider sign extending -3 from 8-bits to 16-bits:

8-bit representation of -3 16-bit representation of -3

CPEN 211 - Lab 6 6 of 13

11111101 1111111111111101

After performing this sign extension, the MOV instruction writes the resulting 16-bit sign value to one of
the eight 16-bit registers inside the register file. It identifies which of the 8 16-bit registers inside the register
file to write using the 3-bit 8 to 10 of the instruction (labeled Rn). Recall with 3-bits we can uniquely
identify 8 things since 23 = 8. We return to discuss the second version of MOV further below.
The second MOV instruction, “MOV Rd, Rm{,<sh_op>}” reads Rm into datapath register B and then
sets asel=1 to select the 16-bit zero input for the Ain input to the ALU. Since ALUop is “00” the ALU adds
the zero on Ain to the shifted value of Rm on Bin and places the result in register C. The result is then
written to Rd.
The next four instructions in Table 1 are called ALU instructions because their main purpose is to use
the ALU you built in Lab 5. Such instructions are the main “workhorses” of any general-purpose computer
design. The ADD Rd,Rn,Rm{,<sh_op>} instruction reads the contents of register Rm, optionally shifts the
value one bit to the left (for example, “ADD Rd,Rn,Rm, LSL#1”), one bit to the right without sign extension
(“ADD Rd,Rn,Rm, LSR#1”) or with sign extension (“ADD Rd,Rn,Rm, ASR#1”). Then adds the result to
Rn and places the sum in Rd. For example, if R0 contains 25 and R1 contains 50, then after executing the
instruction “ADD R2, R1, R0” the contents of R2 would be 75.
The ADD instruction reads register Rn into register A and reads register Rm into register B. Bits 11 to
12 of the instruction register are directly fed to the ALUop input to the ALU. Since these bits are “00” for
ADD instructions ALUop will be “00” which corresponds to addition. So, the Ain and Bin inputs to the
ALU will be added together by the ALU. The operand in register B is shifted as specified by bits 3 and 4
that are fed directly from the instruction register into the “shift” input of your datapath from Lab 5.
The AND instruction is very similar to ADD. However, both the CMP and MVN instructions, while
using the ALU, are different. CMP is the only instruction that should update the three status bits. For CMP
we use the ALUop for subtraction however, we are only interested in the value of the status outputs of the
ALU. E.g., we can use CMP to check if the value in R1 and R2 are equal by subtracting R2 from R1 and
checking if the result is zero using the Z status flag. As with ADD and AND we can shift the contents of
the B register. In Lab 7 we will add branch instructions that read the status register after it is set by a CMP
instruction. For MVN we perform a bitwise NOT on the contents of Rm. As with the other ALU operations
we can shift the value in the B register.
4 Lab Procedure
The changes for this lab are broken into two stages.
4.1 Stage 1: Datapath Modifications
Extend the mux on the data input to your register file to have the four inputs illustrated on the right in
Figure 5. The sximm8 input (which stands for sign extended 8-bit immediate) will eventually be driven by
the Instruction Decoder you add in Stage 2 below. You will use this input to the mux when implementing
control logic for the “MOV Rn,#<im8>” instruction in Table 1. Here mdata is the 16-bit output of a memory
block you will be adding in Lab 7. Next, sximm8 is a 16-bit sign extended version of the 8-bit value in the
lower 8-bits of the instruction register. Next, PC is an 8-bit “program counter” input that will be explained
and used in the Lab 7 Bonus. However, to avoid introducing bugs later, it is recommended you add a 16-
bit mdata and 8-bit PC inputs to your datapath module in Lab 6 and connect them to the 4-input 16-bit
multiplexer as shown in Lab 6. For Lab 6 you can “assign” zero to mdata and PC.
Next, modify the mux input to Bin as shown in Figure 6. Here sximm5 is a 16-bit variable you should
declare in datapath. Here sximm5 stands for “sign extended 5-bit immediate”. We will connect sximm5 to
another block in Stage 2.
Next, extend the status register to three bits. One bit should represent a “zero flag”, which was what
“status” represented in Lab 5. Another bit should represent a “negative flag” and be set to 1’b1 if the most

CPEN 211 - Lab 6 7 of 13

Figure 5: Modification to Lab 5 Datapath: Input to Register File

Figure 6: Modification to Lab 5 Datapath: “Bin” Multiplexer

significant bit of the main 16-bit ALU result is 1. The final bit represents an overflow flag. You should
compute signed overflow as described in Section 10.3 of Dally. In Lab 7 you will use the status flags to
support “if” statements and “loops” in C.
4.2 Stage 2: Datapath Controller
The next step is to add an instruction register, an instruction decoder block, and finally design a state machine
to control your datapath. Inside a file cpu.sv create a module cpu to instantiate and connect together these
three components along with your datapath. A significant portion of your Lab 6 mark will be determined
using an auto grader (see Section 5). To avoid losing marks your top level module must be called cpu,
follow the specification below, and be in a file named cpu.sv:
module cpu(clk,reset,s,load,in,out,N,V,Z,w);
input clk, reset, s, load;
input [15:0] in;
output [15:0] out;
output N, V, Z, w;

Figure 7 illustrates how the various components of your design should be connected within your cpu
module. Your state machine is connected to the rest of the circuit. The instruction currently being executed

CPEN 211 - Lab 6 8 of 13

is stored in the 16-bit Instruction Register. The instruction register and nsel output of the state machine are
inputs to the Instruction Decoder block, which is described below.
The value on in should be copied into your instruction register on the rising edge of clk if load is set
to 1. If load is 0, the contents of your instruction register should NOT change. On the rising edge of clk
if reset is 1 your state machine should go to a reset state. After being reset, your state machine should
not perform any computations until s is set to 1 and there is a rising edge of clk, much like in the example
state machine in Figure 4. The value on out should be the contents of register C inside your datapath, which
is useful for the demo on your DE1-SoC (the autograder will not check the value of “out” since it can
directly inspect the contents of the register file). The outputs N, V and Z should provide the value of the
negative, overflow and zero status register bits. As shown in Table 1, these three flags are set only by the
CMP instruction. The N flag should be set to 1 if the 16-bit result of the subtraction performed by the CMP
instruction is negative, regardless of whether an overflow occurred. The output w should be set to 1 if your
state machine is in the reset state and is waiting for s to be set to 1 and set to 0 otherwise (i.e., when in any
other state). After executing an instruction, your state machine should return to this state. See the example
state machine in Figure 4 (NOTE: do not use the instruction register design from Figure 4 as it does not
correspond to the required instruction encodings in Table 1).

Figure 7: Your cpu module should contain your finite state machine controller (Controller), an instruction
register, an instruction decoder and a datapath. In this figure, TBD means “to be determined” (by you!).

The purpose of the Instruction Decoder block is to extract information from the instruction register that
can be used to help control the datapath. The Instruction Decoder block in Figure 7 should implement the
logic shown in Figure 8. Your state machine should drive any datapath inputs not set by the decoder block
(e.g., labeled “TBD” in Figure 7).
The output of the state machine should be the settings of all the inputs to the datapath, the signal nsel
used to select which register to connect to readnum and writenum, and the w output used to indicate to the
autograder that your state machine is (or is not) in the wait state. The inputs to the state machine are clk,
reset, the start signal s the opcode and op fields of the current instruction in the instruction register.
How should you design your state machine? The examples shown in Figure 3 and 4 used a Moore
type finite state machine where the output depends only on the current state. You can do the same or, if you
want, use a Mealy state machine.
Regardless of which type of state machine or the coding style you use, the best way to design your state
machine is in stages. In the first step, get your state machine to work for a single instruction from Table 1.
Pick an instruction from Table 1 and think through what steps you need to perform with your datapath from

CPEN 211 - Lab 6 9 of 13

Figure 8: Instruction Decoder

Lab 5 to perform the steps listed in the “Operation” column. One way to do this is by referring to Figure 1 in
the Lab 5 handout. Work out the number of clock cycles it takes to perform the data operations required for
the instruction. Each cycle will require an additional state beyond the “Wait” state shown in the example in
Figure 3. After the last state required to execute the instruction on your datapath, your state machine should
return to the “Wait” state as shown in the example in Figure 3. In the “Wait” state make sure your w output
is 1 so the autograder (or your own test bench) knows the computer has finished executing the instruction
and is ready to execute a new instruction.
Once you have very carefully tested your first instruction is working, both in ModelSim and on your
DE1-SoC, you should check that version into your revision control system (e.g., git) in case you make a
change that breaks that first instruction while modifying your state machine to support a second instruction.
When adding the second instruction, you should add a “Decode” state like that shown in Figure 4. You
should reuse this Decode state when adding any subsequent instructions. After adding the Decode state,
figure out the states corresponding to the steps required to execute the new instruction on your datapath.
You will add additional states (e.g., like WriteImm in Figure 4) for this new instruction. Now the “Decode”
state has two potential next states. To decide which next state your state machine should go to from “Decode”
use the opcode and op values used for encoding the instruction you are adding (find these in Table 1). See
Figure 4 for a simplified example showing how to determine which state to go to after “Decode”.
After adding each instruction test that both the new instruction and the prior instructions work (both in
ModelSim and on your DE1-SoC). Unless you are VERY confident in your Verilog coding abilities, you
should NOT attempt to code up a state machine for all instructions before doing any testing. If you do, you
will spend much more time trying to figure out the source of even a single bug than you would have by
testing each additional instruction as you add it. Since you need to show a testbench as part of the marking
scheme, why not create it as you go and use it to help you save time by catching bugs early?
To reduce the complexity of your state machine you may want to see if you can find ways to reuse states
added for earlier instructions when adding a new instruction. However, this is not required.
The input to your top level module is the encoded instruction. Thus, to test your overall design you will
first want to create some simple programs that you can input to your instruction register one at a time. To
do that, first write a textual assembly code representation and only then encode each instruction into into 1’s
and 0’s using Table 1. As an example, the following test case.

CPEN 211 - Lab 6 10 of 13

MOV R0, #7 // this means, take the absolute number 7 and store it in R0
MOV R1, #2 // this means, take the absolute number 2 and store it in R1
ADD R2, R1, R0, LSL#1 // this means R2 = R1 + (R0 shifted left by 1) = 2+14=16

can be encoded as:

1101000000000111
1101000100000010
1010000101001000

For full marks on the lab you need to encode additional instructions. You can use the lab6_top.v file we
provide to test your design on the DE1-SoC. This is a modified version of lab5_top.v and works in a
similar way.
5 Marking Scheme
If you have a partner both of you must be in attendance during the demo. You must include at least one
line of comments per always block, assign statement or module instantiation and in test benches you must
include one line of comments per test case saying what the test is for and what the expected outcome is. For
your state machine include one comment per state summarizing the datapath operations and one comment
per state transition explaining when the transition occurs.
You will lose marks if your github repo does not contain a quartus project file, modelsim project file,
or programming (.sof) file. You will also lose marks if your repo is missing any source code (whether
synthesizable or testbench code) or waveform format files.
If you used someone else’s Lab 5 code you are still responsible for being able to explain how it works. If
your submission includes AI generated code you must include an AI.txt file with enough detail to reproduce
code that looks like your submission in case of later concern about the true provenance of your code (i.e., a
suspicion of cheating).
Your mark will be computed as the sum of the following rubric:

Stage 1 changes [1 Mark] For your Stage 1 code in datapath.sv and being able to explain the associated
(System)Verilog to the TA. You may also lose marks here for lack of commenting in your code or if you
have no testbench in datapath_tb.sv for your Stage 1 changes.

Stage 2 changes [3 Marks] Your state machine must include sufficient comments. During your marking
session you must be able to explain your code in detail when asked, and demonstrate that your state machine
works using your submitted testbench with ModelSim and your submitted lab6_wave.do file. Your mark
for this part will be:

3/3 If your state machine in cpu.sv implements all instructions in Table 1 and you demonstrate a detailed
knowledge of how your state machine works when asked by your TA, your submission contains a set
of very convincing test cases of your own devising in cpu_tb.sv including at least three tests for each
instruction in Table 1. Each test should be designed to test a different part of your design that might
have an error and you should be able to explain to your TA what potential error or mistake in coding
that design is meant to catch. It is your responsibility to think of what might go wrong when coding
your state machine and combining it with your datapath and how the tests might catch those errors. For
full marks here you should be able to demonstrate some test cases using gate-level simulation (possibly
using a second testbench in cpu_tb.sv).

2/3 If your state machine implements all the instructions from Table 1, but you have less than three tests
for each instruction in your cpu_tb.v or you cannot provide examples of the types of bugs that the tests
might catch.

CPEN 211 - Lab 6 11 of 13

1/3 If your state machine does not implement all of the instructions in Table 1 or your state machine in-
cludes very few comments or comments that are not very meaningful, or you do not include a testbench.
0/3 If you cannot explain your code or there is no code submitted for this part.

DE1-SoC Demo [2 Marks] For demonstrating your CPU works on your DE1-SoC using a test case of your
own devising involving some of the LEDs on the DE1-SoC. To get 2/2 marks here this test case MUST work
AND use ALL of the instructions in Table 1. You will get 1/2 here if this test case works and it uses at least
three different types of instructions from Table 1 (but not all of them). If you cannot get a test case involving
at least three different types of instructions from Table 1 your mark will be 0/2.

Autograder [4 Marks] Finally, four marks will be assigned objectively by an auto-grader that will test your
cpu module using a variety of inputs. To ensure your code is compatible with our autograder you should
both ensure you can download your working design to your DE1-SoC AND be sure you can simulate
the lab6_check testbench module we provide in lab6_autograder_check.v provided on Piazza. You
should be sure you get the message “INTERFACE OK” in the ModelSim transcript window when you do this.
Please note that the message “INTERFACE OK” does NOT ensure your Lab 6 submission will pass any of
our autograder tests, but if you do not get this “INTERFACE OK” message you will get 0/4 for this part. Your
autograder mark may be manually reduced to 0/4 for this portion if you cannot explain to your TA
how your code works to their satisfaction.

Your cpu module must follow the specification in Section 4.2 and moreover, the output of the
registers inside your register file must be accessible via the hierarchical names cpu.DP.REGFILE.R0
through names cpu.DP.REGFILE.R7. Thus, your datapath must be instantiated with the instance name DP
inside your cpu module and inside of your datapath module your register file must have the instance name
REGFILE, inside of your register file, the 16-bit registers R0 through R7 must be accessible on signals (wire
or reg) called R0 through R7. To ensure this you may need to make minor changes to your datapath from
Lab 5. Your mark for this part will depend upon how many instructions pass our test cases and will be:

4/4 If every single type of instruction in Table 1 passes all of the auto-graders test cases.
3/4 If all but one type of instruction in Table 1 passes all of the auto-graders test cases. This means for
example, if you did not have time to get one of the instructions in Table 1 working, but you got all the
other instructions working (as judged by our autograder), then you would get this mark.
2/4 If all but two types of instruction in Table 1 passes all of the auto-graders test cases. This means for
example, if you did not have time to get two of the instructions in Table 1 working, but you got all the
other instructions working (as judged by our autograder), then you would get this mark.
1/4 If all but three types of instruction in Table 1 passes all of the auto-graders test cases. This means
for example, if you did not have time to get three of the instructions in Table 1 working, but you got all
the other instructions working (as judged by our autograder), then you would get this mark.
0/4 If four or more types of instruction in Table 1 each fail at least one of the auto-graders test cases.
This means for example, if you did not have time to get four of the instructions in Table 1 working, but
you got all the other instructions working (as judged by our autograder), then you would get this mark.

IMPORTANT: Check your submission repo on github.com carefully as you will lose marks if your
github submission does not contain a Quartus Project File (.qpf) and the associated Quartus Settings File

CPEN 211 - Lab 6 12 of 13

(.qsf) that indicates which Verilog files are part of your project. This .qsf file is created by Quartus when you
create a project. It is typically named <top_leve_module_name>.qsf and contains (among others)
lines indicating which (System)Verilog files are to be synthesized. If you open up this .qsf file you should see
lines that look like the following. The key part is that these line contain “VERILOG_FILE”. The autograder
will use your .qsf file to determine which Verilog files should be synthesized together. To be sure, note the
above .qsf file is not the file DE1_SoC.qsf we provided in Lab 3 for importing DE1-SoC pin assignments.
Also remember to include your Modelsim Project File (.mpf) and your programming (.sof) file, (.vo) file for
gate-level simulation testing, and any waveform (.do) files.
6 Lab Submission
If you work with a partner or you borrowed someone’s Lab 5 code (even if only to debug your Lab 5
code) your submission MUST include a file called “‘CONTRIBUTIONS.txt” that describes each student’s
contributions to each file that was added or modified. If either partner contributed less than one third to the
solution (e.g., in lines of code), you must state this in your CONTRIBUTIONS file and inform the instructor
by sending email to mailto:[email protected]. Note that submitted files may be stored on servers outside
of Canada. Thus, you may omit personal information (e.g., your name, SN) from your files and refer to
“Partner 1” and “Partner 2” in CONTRIBUTIONS. Submit your code using github classroom.
If you used any AI tools to help write code you must include a file AI.txt describing their use in sufficient
detail that we can reproduce your code (or something very similar) in case of any concern about the prov-
inance of your submitted code (e.g., if your code looks similar to other student submissions). A reminder
that sharing of AI prompts is not permitted.
7 Lab Demonstration Procedure
As with Lab 3 to 5, your TA will have your submitted code with them and have setup a “TA marking station”
where you will go when it is your turn to be marked. Be sure to bring your DE1-SoC in case the TA does
not have theirs and/or they need to mark multiple groups in parallel.
8 Hints and Tips
You may find it helpful to use the Verilog include directive to define constants used in multiple files.
After getting your Verilog to compile in ModelSim, but before running any simulations in ModelSim, it
is worth try to compile your synthesizable modules in Quartus just to look at the warnings. Quartus provides
useful warnings for many “silly mistakes” that ModelSim happily ignores. If you see no suspicious warnings
in Quartus, then move on to simulating your testbench in ModelSim.
When using “$stop;” in a testbench and running with “run -all” in ModelSim a source window will
pop-up when “$stop” is reached. If you use a text editor other than ModelSim to edit your files (e.g., vi
or emacs), make sure to close this window before restarting simulation. If you for any reason modify your
testbench outside of ModelSim (perhaps to add a test case) and you then restart simulation you will get
a long set of about 50 pop-ups saying the file was modified outside of ModelSim. If you forget and this
happens, note you can likely close these roughly 50 dialogs faster then restarting ModelSim by clicking on
“Skip Messages” then selecting “Reload” repeatedly.

CPEN 211 - Lab 6 13 of 13

SIFANG CSC-101 - V1.00 - Line Protection IED Technical Manual - 2011-10
No ratings yet
SIFANG CSC-101 - V1.00 - Line Protection IED Technical Manual - 2011-10
333 pages
Cheat Sheet PDF
No ratings yet
Cheat Sheet PDF
1 page
Lab 4
0% (1)
Lab 4
21 pages
Lab Sheet 3 Final
100% (1)
Lab Sheet 3 Final
9 pages
Mini Project Iot
No ratings yet
Mini Project Iot
43 pages
Operating System Concepts
No ratings yet
Operating System Concepts
9 pages
Unit 5a - CPU Design
100% (1)
Unit 5a - CPU Design
64 pages
Introduction To Power Electronics
No ratings yet
Introduction To Power Electronics
9 pages
Verilog Lab9
No ratings yet
Verilog Lab9
10 pages
Intro To Sequential Logic: CS 64: Computer Organization and Design Logic Lecture #14 Winter 2019
No ratings yet
Intro To Sequential Logic: CS 64: Computer Organization and Design Logic Lecture #14 Winter 2019
24 pages
Exam 1
No ratings yet
Exam 1
8 pages
EE7
No ratings yet
EE7
4 pages
Verilog Lab9-1
No ratings yet
Verilog Lab9-1
10 pages
Lab 11
No ratings yet
Lab 11
3 pages
FSM Design
No ratings yet
FSM Design
15 pages
Verilog Programs by Guruprasad PDF
No ratings yet
Verilog Programs by Guruprasad PDF
25 pages
Approaches To Digital System Design
No ratings yet
Approaches To Digital System Design
32 pages
HDL Lasthope
No ratings yet
HDL Lasthope
2 pages
A 8-Bit Sequential Multiplier
No ratings yet
A 8-Bit Sequential Multiplier
6 pages
Floating Point Instructions, Program Control: Soon Tee Teoh CS 147
No ratings yet
Floating Point Instructions, Program Control: Soon Tee Teoh CS 147
17 pages
EO 301 Notes
No ratings yet
EO 301 Notes
31 pages
Interview Questions-Digital Design PDF
No ratings yet
Interview Questions-Digital Design PDF
9 pages
220 PracticeProblems 8 MultiCycleDP Sol
No ratings yet
220 PracticeProblems 8 MultiCycleDP Sol
34 pages
Pre-Lab4 Digital Design
No ratings yet
Pre-Lab4 Digital Design
9 pages
MIPS Single Cycle Processor Design
No ratings yet
MIPS Single Cycle Processor Design
59 pages
Unit - 5 (FSMS)
No ratings yet
Unit - 5 (FSMS)
37 pages
LC 3b Project
No ratings yet
LC 3b Project
11 pages
Micro Processor Design-Resumen
No ratings yet
Micro Processor Design-Resumen
7 pages
Material On Embedded System 3
No ratings yet
Material On Embedded System 3
29 pages
ECE222 DP1 Binary Multiplier
No ratings yet
ECE222 DP1 Binary Multiplier
8 pages
Lecture 18: Designing A Central Processor Unit 2: The Controller
No ratings yet
Lecture 18: Designing A Central Processor Unit 2: The Controller
8 pages
127
No ratings yet
127
6 pages
Additionalnotes Group Projectassignment Clo3
No ratings yet
Additionalnotes Group Projectassignment Clo3
29 pages
Lab 8
No ratings yet
Lab 8
20 pages
Etherchannel in GNS3
No ratings yet
Etherchannel in GNS3
3 pages
Digital Electronics and Computer Architecture ELEC40003 EXAM PAPER
No ratings yet
Digital Electronics and Computer Architecture ELEC40003 EXAM PAPER
7 pages
ABB Ability Digital Solution: Slide 1
No ratings yet
ABB Ability Digital Solution: Slide 1
14 pages
10 17 13 PDF
No ratings yet
10 17 13 PDF
13 pages
Lectures Section 3 Modified
No ratings yet
Lectures Section 3 Modified
73 pages
VHDL FSM UNIT 5 ET&T 7th Sem
No ratings yet
VHDL FSM UNIT 5 ET&T 7th Sem
22 pages
Final - Sample (1) NS
No ratings yet
Final - Sample (1) NS
6 pages
SET - ARM - Inst
No ratings yet
SET - ARM - Inst
4 pages
Must Also Perform The Necessary Arithmetic Using Signed 2's Complement Notation and Show The Results Indicating Overflow, If Any
No ratings yet
Must Also Perform The Necessary Arithmetic Using Signed 2's Complement Notation and Show The Results Indicating Overflow, If Any
18 pages
COA Module 3 QB Complete Solutions
No ratings yet
COA Module 3 QB Complete Solutions
20 pages
Instruction Formats: - Memory and Branch Instructions
No ratings yet
Instruction Formats: - Memory and Branch Instructions
17 pages
Computer Architecture I: Digital Design
No ratings yet
Computer Architecture I: Digital Design
39 pages
Lec3 Part2
No ratings yet
Lec3 Part2
28 pages
MOD 4-I Simple Computer - Bottom Up Implementation
No ratings yet
MOD 4-I Simple Computer - Bottom Up Implementation
11 pages
FSMD Application
No ratings yet
FSMD Application
42 pages
Lec19 FSM
No ratings yet
Lec19 FSM
14 pages
FPGA Based Wireless Mobile Jammer
No ratings yet
FPGA Based Wireless Mobile Jammer
6 pages
Embedded Linux: Using Pee Wee Linux
100% (1)
Embedded Linux: Using Pee Wee Linux
31 pages
Advanced Microcontroller: A Laboratory Manual For
No ratings yet
Advanced Microcontroller: A Laboratory Manual For
84 pages
Embedded Systems 220 Control Unit Design Notes: PC: PC + 1 PC: PC PC: Operand PC: PC + Operand
No ratings yet
Embedded Systems 220 Control Unit Design Notes: PC: PC + 1 PC: PC PC: Operand PC: PC + Operand
3 pages
Optional Hardware Solution
No ratings yet
Optional Hardware Solution
25 pages
MCQ Question - DSD - End Sem - DR O P Das
No ratings yet
MCQ Question - DSD - End Sem - DR O P Das
30 pages
Lab 4
No ratings yet
Lab 4
9 pages
Lab 6 VXL
No ratings yet
Lab 6 VXL
15 pages
Introduction To Verilog
No ratings yet
Introduction To Verilog
5 pages
ASIS RIA Article Eng
No ratings yet
ASIS RIA Article Eng
2 pages
Mustgather: Collecting Troubleshooting Data For Notes 8.0X Standard Configuration Issues
No ratings yet
Mustgather: Collecting Troubleshooting Data For Notes 8.0X Standard Configuration Issues
3 pages
AN2548 Application Note: Using The STM32F0/F1/Lx Series DMA Controller
No ratings yet
AN2548 Application Note: Using The STM32F0/F1/Lx Series DMA Controller
15 pages
Basic Circuit Analysis
No ratings yet
Basic Circuit Analysis
11 pages
EE 533 Verilog Design: Siddharth Bhargav
No ratings yet
EE 533 Verilog Design: Siddharth Bhargav
25 pages
Altium Designer Summer 09
0% (1)
Altium Designer Summer 09
3 pages
Building The Up
No ratings yet
Building The Up
30 pages
Demunit 4
No ratings yet
Demunit 4
53 pages
Snes-Flash - Rev-C Bom - Pcbway Community
No ratings yet
Snes-Flash - Rev-C Bom - Pcbway Community
1 page
STD03N: Darlington Transistor With Built-In Compensation Diodes
No ratings yet
STD03N: Darlington Transistor With Built-In Compensation Diodes
8 pages
VxWorks Is A Proprietary Real
No ratings yet
VxWorks Is A Proprietary Real
3 pages
CMake Lists
No ratings yet
CMake Lists
42 pages
Lab 14
No ratings yet
Lab 14
11 pages
(Ebook) Hughes Electrical & Electronic Technology, 10th Edition by Edward Hughes John Hiley, Keith Brown, Ian McKenzie Smith ISBN 9780132060110, 0132060116 Download
100% (1)
(Ebook) Hughes Electrical & Electronic Technology, 10th Edition by Edward Hughes John Hiley, Keith Brown, Ian McKenzie Smith ISBN 9780132060110, 0132060116 Download
61 pages
Synopsis THIRD EYE FOR THE BLIND
No ratings yet
Synopsis THIRD EYE FOR THE BLIND
23 pages
Si5351A/MS5351M 160m-10m VFO Kit
No ratings yet
Si5351A/MS5351M 160m-10m VFO Kit
2 pages
World University of Bangladesh: Objective
No ratings yet
World University of Bangladesh: Objective
3 pages
A Guide To Transmission Line Impedance Advanced PCB Design Blog Cadence
No ratings yet
A Guide To Transmission Line Impedance Advanced PCB Design Blog Cadence
13 pages
Balwinder Singh Sandhu
No ratings yet
Balwinder Singh Sandhu
4 pages
Project Working With AWS Lambda
No ratings yet
Project Working With AWS Lambda
19 pages
CC Gigamon Product Comparison
No ratings yet
CC Gigamon Product Comparison
4 pages
Packet Scheduling in Multipath TCP Fundamentals Lessons and Opportunities
No ratings yet
Packet Scheduling in Multipath TCP Fundamentals Lessons and Opportunities
13 pages
Резюме Программист Shodlik Shomuratov от 20-06-2023 22-12
No ratings yet
Резюме Программист Shodlik Shomuratov от 20-06-2023 22-12
2 pages
Sanjeetbhushan 7 Automation
No ratings yet
Sanjeetbhushan 7 Automation
1 page
CamilaMoreira - It Resume - 2nd Draft
No ratings yet
CamilaMoreira - It Resume - 2nd Draft
1 page
Interview Questions for IBM Mainframe Developers
From Everand
Interview Questions for IBM Mainframe Developers
Robert Wingate
1/5 (1)
A Friendly Introduction to MATLAB Programming
From Everand
A Friendly Introduction to MATLAB Programming
Orhan Gazi
No ratings yet
Projects With Microcontrollers And PICC
From Everand
Projects With Microcontrollers And PICC
Guillermo Perez Guillen
5/5 (1)
Practical Reverse Engineering: x86, x64, ARM, Windows Kernel, Reversing Tools, and Obfuscation
From Everand
Practical Reverse Engineering: x86, x64, ARM, Windows Kernel, Reversing Tools, and Obfuscation
Bruce Dang
No ratings yet
Computer Science II Essentials
From Everand
Computer Science II Essentials
Randall Raus
No ratings yet
WAN TECHNOLOGY FRAME-RELAY: An Expert's Handbook of Navigating Frame Relay Networks
From Everand
WAN TECHNOLOGY FRAME-RELAY: An Expert's Handbook of Navigating Frame Relay Networks
Mamta Devi
No ratings yet

Lab 6

Uploaded by

Lab 6

Uploaded by

DEPARTMENT OF ELECTRICAL AND COMPUTER ENGINEERING

UNIVERSITY OF BRITISH COLUMBIA

CPEN 211 - Lab 6 1 of 13

2.2 Controller for a more general “ADD” instruction

CPEN 211 - Lab 6 2 of 13

Figure 3: A partial instruction register (do NOT build).

2.3 Controller for more than one instruction

CPEN 211 - Lab 6 3 of 13

CPEN 211 - Lab 6 4 of 13

• Rn, Rd, Rm are 3-bit register number specifiers.

CPEN 211 - Lab 6 5 of 13

Table 1: Assembly instructions introduced in Lab 6 (this table is explained in Section 3)

Similarly, consider sign extending -3 from 8-bits to 16-bits:

CPEN 211 - Lab 6 6 of 13

CPEN 211 - Lab 6 7 of 13

Figure 6: Modification to Lab 5 Datapath: “Bin” Multiplexer

CPEN 211 - Lab 6 8 of 13

CPEN 211 - Lab 6 9 of 13

CPEN 211 - Lab 6 10 of 13

can be encoded as:

CPEN 211 - Lab 6 11 of 13

CPEN 211 - Lab 6 12 of 13

CPEN 211 - Lab 6 13 of 13

You might also like