0% found this document useful (0 votes)

12 views25 pages

12 - Floating Point Instructions

Uploaded by

ranbir singh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views25 pages

12 - Floating Point Instructions

Uploaded by

ranbir singh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 25

Floating-Point Instructions

1
Readings and Exercises
• ARMv8 Instruction Set Overview:
▪ Section 5.7
• ARM Procedure Call Standard
▪ Section 5.1.2

2
Objective
At the end of this section, you will
1. Work with single-precision and double-precision
ARM instructions

3
Floating-Point Numbers

REGISTERS AND INSTRUCTIONS

4
Floating-Point Numbers
• Are real numbers
▪ 16.889, 1.11e-13, -0.001

• Require different set of registers

• Require more specialized instructions

5
Floating-Point Registers
• ARMv8 has 32 128-bit FP registers
▪ S registers use the low-order 32 bits to hold single-
precision FP numbers
• s0 - s31
▪ D registers use the low-order 64 bits to hold double-
precision FP numbers
• d0 - d31
▪ The high-order 64 bits are only used when using
SIMD instructions
• Not covered in this course
6
Floating-Point Registers (cont’d)
• These registers are loaded or stored like the
general-purpose registers
▪ Eg: ldr s0, [base_r, offset_r] // read 4 bytes

▪ Eg: str d1, [x29, 16] // write 8 bytes

• Use the .single or .double pseudo-ops to

allocate and initialize memory for a FP number
▪ Use the prefix 0r to specify a “real” value
▪ Can specify exponents using E notation

7
Floating-Point Registers (cont’d)
▪ Eg:
.data
a_m: .single 0r5.0
b_m: .double 0r5.33e-18
array_m: .single 0r2.5, 0r3.5, 0r4.5

• In gdb:
▪ Use p/f to print the contents of a FP register
• Eg: p/f $d0
• Eg: p/f $s30

8
Floating-Point Registers (cont’d)
▪ Use x/wf to examine a single in memory
• Eg: x/wf <a_m address> shows 5
• x/wx shows the hex representation
▪ Eg: shows 0x40a00000
▪ Use x/gf or x/gx to examine a double in memory
• Eg: x/gf <b_m address> shows
5.3300000000000001e-18

9
Basic Floating-Point Instructions
• Arithmetic instructions
▪ Use registers only (no immediates)
▪ Have two versions:
• Single precision: use S registers
• Double precision: use D registers
▪ Addition
• fadd Sd, Sn, Sm
▪ Sd = Sn + Sm
• fadd Dd, Dn, Dm
▪ Dd = Dn + Dm
10
Basic Floating-Point Instructions
(cont’d)
▪ Subtraction
• fsub Sd, Sn, Sm
▪ Sd = Sn - Sm
• fsub Dd, Dn, Dm
▪ Dd = Dn - Dm
▪ Multiplication
• fmul Sd, Sn, Sm
▪ Sd = Sn * Sm
• fmul Dd, Dn, Dm
▪ Dd = Dn * Dm

11
Basic Floating-Point Instructions
(cont’d)
▪ Multiply-negate
• fnmul Sd, Sn, Sm
▪ Sd = -(Sn * Sm)
• fnmul Dd, Dn, Dm
▪ Dd = -(Dn * Dm)
▪ Division
• fdiv Sd, Sn, Sm
▪ Sd = Sn / Sm
• fdiv Dd, Dn, Dm
▪ Dd = Dn / Dm

12
Basic Floating-Point Instructions
(cont’d)
▪ Multiply-add
• fmadd Sd, Sn, Sm, Sa
▪ Sd = Sa + (Sn * Sm)
• fmadd Dd, Dn, Dm, Da
▪ Dd = Da + (Dn * Dm)
▪ Multiply-subtract
• fmsub Sd, Sn, Sm, Sa
▪ Sd = Sa - (Sn * Sm)
• fmsub Dd, Dn, Dm, Da
▪ Dd = Da - (Dn * Dm)

13
Basic Floating-Point Instructions
(cont’d)
▪ Absolute value
• fabs Sd, Sn
▪ Sd = abs(Sn)
• fabs Dd, Dn
▪ Dd = abs(Dn)
▪ Negation
• fneg Sd, Sn
▪ Sd = -Sn
• fneg Dd, Dn
▪ Dd = -Dn

14
Basic Floating-Point Instructions
(cont’d)
• Move instructions
▪ Register
• fmov Sd, Sn
▪ Moves 32 bits from Sn to Sd
• fmov Dd, Dn
▪ Moves 64 bits from Dn to Dd
• Variants exist for moving data between general-purpose
and FP registers

15
Basic Floating-Point Instructions
(cont’d)
▪ Immediate
• Can move a limited set of FP numbers into a register
• Form:
▪ fmov Sd, #fpimm
▪ fmov Dd, #fpimm
• #fpimm:
▪ Encoded with 1 sign bit, 4 bits of fraction, and a 3-bit exponent
▪ Must be expressible as: ±n / 16 × 2r
• n in range: 16 to 31
• r in range: -3 to +4
• Eg: fmov s0, 0.25

16
Basic Floating-Point Instructions
(cont’d)
• Conversion instructions
▪ fcvt Dd, Sn
• Converts single-precision FP in Sn to double-precision FP
in Dd
▪ fcvt Sd, Dn
• Converts double in Dn to single in Sd, rounding as
necessary
• Precision may be lost since we use fewer bits to encode Sd

17
Basic Floating-Point Instructions
(cont’d)
▪ fcvtns Wd, Sn
▪ fcvtns Xd, Sn
• Converts single in Sn to nearest signed 32-bit or 64-bit
signed integer in Wd or Xd
▪ fcvtns Wd, Dn
▪ fcvtns Xd, Dn
• Converts double to nearest signed 32-bit or 64-bit signed
integer
▪ fcvtnu converts to unsigned integers

18
Basic Floating-Point Instructions
(cont’d)
▪ scvtf Sd, Wn
▪ scvtf Sd, Xn
• Converts signed 32-bit or 64-bit signed integer in Wn or Xn
to a single
▪ scvtf Dd, Wn
▪ scvtf Dd, Xn
• Converts signed integer to a double
▪ ucvtf converts unsigned integers to floats

19
Basic Floating-Point Instructions
(cont’d)
• Compare instructions
▪ Forms:
• fcmp Sn, Sm
• fcmp Sn, 0.0
• fcmp Dn, Dm
• fcmp Dn, 0.0
▪ Like the integer cmp instruction, these set the
condition flags (NZCV)
• Normally followed by a conditional branch

20
Basic Floating-Point Instructions
(cont’d)
• Eg: assembly code to divide 7.5 by 2.0
.data
x_m: .double 0r7.5
y_m: .double 0r2.0
z_m: .double 0r0.0

.text
.balign 4
.global main
main: stp x29, x30, [sp, -16]!
mov x29, sp

adrp x19, x_m // get address of x

add x19, x19, :lo12:x_m
ldr d0, [x19] // load x into d0

21
Basic Floating-Point Instructions
(cont’d)
adrp x19, y_m // get address of y
add x19, x19, :lo12:y_m
ldr d1, [x19] // load y into d1

fdiv d2, d0, d1 // x / y

adrp x19, z_m // get address of z

add x19, x19, :lo12:z_m
str d2, [x19] // store result in z

end: nop // breakpoint

ldp x29, x30, [sp], 16

ret

22
Floating-Point Arguments
• Are passed into a subroutine using registers d0 –
d7 (for doubles) and/or s0 – s7 (for singles)
▪ Are in addition to the x (w) registers used to pass in
integers or pointers
▪ Stack memory is used if there are more than 8 FP
arguments
▪ The subroutine is free to overwrite these registers

23
Floating-Point Arguments (cont’d)
• Registers d8 - d15 are callee-saved registers
▪ If used in a subroutine, the subroutine must save and
restore their values on the stack
▪ Only the bottom 64-bits of the 128-bit register need to
be preserved
• Registers d0 - d7 and d16 - d31 can be
overwritten by a subroutine
▪ The caller is responsible for saving/restoring these if
they need to be preserved over a subroutine call
24
Floating-Point Return Values
• A double FP number is returned from a
subroutine in d0
• A single is returned in s0

General EBS Setup
No ratings yet
General EBS Setup
119 pages
ELEC1601 Week 6 2023
No ratings yet
ELEC1601 Week 6 2023
61 pages
Computer Architecture - Lab 7: Floating Point Arithmetic On MIPS
100% (1)
Computer Architecture - Lab 7: Floating Point Arithmetic On MIPS
10 pages
COA Solved Model Paper
No ratings yet
COA Solved Model Paper
36 pages
ch-5 ASM
No ratings yet
ch-5 ASM
22 pages
U1 - 8051 ALP Instructions
No ratings yet
U1 - 8051 ALP Instructions
70 pages
3 - ARMv8-A Architecture
No ratings yet
3 - ARMv8-A Architecture
67 pages
Chapter 6 - Using Floating-Poin
No ratings yet
Chapter 6 - Using Floating-Poin
24 pages
Module - 1 8086 Instruction Sets
No ratings yet
Module - 1 8086 Instruction Sets
27 pages
Fix and Floting Systems
No ratings yet
Fix and Floting Systems
28 pages
Instruction-Set-Of-8086 PPT-2-46
No ratings yet
Instruction-Set-Of-8086 PPT-2-46
45 pages
Lecture01 Intro
No ratings yet
Lecture01 Intro
67 pages
Assembly Book
No ratings yet
Assembly Book
512 pages
Lecture 6
No ratings yet
Lecture 6
11 pages
Instruction Set of 8086
100% (1)
Instruction Set of 8086
47 pages
312s16 - Floating Point Arithmetic
No ratings yet
312s16 - Floating Point Arithmetic
31 pages
EE234 - Lec - 03
No ratings yet
EE234 - Lec - 03
45 pages
312s16 - Floating Point Arithmetic
No ratings yet
312s16 - Floating Point Arithmetic
31 pages
Falcon-E: Introduction: (I.e., 4 Byte Chunks)
No ratings yet
Falcon-E: Introduction: (I.e., 4 Byte Chunks)
61 pages
UNIt 1 - ARRAY - Class - Notes - PPT
100% (1)
UNIt 1 - ARRAY - Class - Notes - PPT
34 pages
Slides Lab 92 Up
No ratings yet
Slides Lab 92 Up
4 pages
Basic Instructions
No ratings yet
Basic Instructions
24 pages
Module 2.3
No ratings yet
Module 2.3
142 pages
Final Exam Comp Org
No ratings yet
Final Exam Comp Org
4 pages
Lecture5 INSTRUCTIONS MICROPROCESSOR APLICATIONS
No ratings yet
Lecture5 INSTRUCTIONS MICROPROCESSOR APLICATIONS
58 pages
Đỗ Ngọc Đức - Ititiu22034 - Ca - lab7
No ratings yet
Đỗ Ngọc Đức - Ititiu22034 - Ca - lab7
3 pages
MP Unit2
No ratings yet
MP Unit2
106 pages
FPU Instructions
No ratings yet
FPU Instructions
3 pages
Sehs3317 L4
No ratings yet
Sehs3317 L4
53 pages
Floating-Point Multiplication Unit With 16-Bit Significant and 8-Bit Exponent
No ratings yet
Floating-Point Multiplication Unit With 16-Bit Significant and 8-Bit Exponent
6 pages
Floating Point Instructions: Ray Seyfarth
No ratings yet
Floating Point Instructions: Ray Seyfarth
18 pages
Programming in Assembly Language
100% (1)
Programming in Assembly Language
9 pages
List of Interrrupts Used: List of Assembler Directives Used: List of Macros Used: List of Procedures Used: Algorithm: Flowchart
No ratings yet
List of Interrrupts Used: List of Assembler Directives Used: List of Macros Used: List of Procedures Used: Algorithm: Flowchart
6 pages
ARM Instruction Set
100% (1)
ARM Instruction Set
75 pages
FPU-Instructions Cheat Sheet
No ratings yet
FPU-Instructions Cheat Sheet
2 pages
FALLSEM2021-22 CSE2006 ETH VL2021220104026 Reference Material I 16-11-2021 23-A-8087-Coprocessor Instructions-Programming
No ratings yet
FALLSEM2021-22 CSE2006 ETH VL2021220104026 Reference Material I 16-11-2021 23-A-8087-Coprocessor Instructions-Programming
51 pages
5.MHN ARM InstructionSet
No ratings yet
5.MHN ARM InstructionSet
44 pages
Useful x86 Instructions This Is A Very Small Subset of The Available In-Structions But Should Be Enough For Your Pur - Poses
No ratings yet
Useful x86 Instructions This Is A Very Small Subset of The Available In-Structions But Should Be Enough For Your Pur - Poses
31 pages
Csa Final
No ratings yet
Csa Final
7 pages
ACOS 4.1.4 Web Application Firewall Guide: For A10 Thunder™ Series and AX™ Series 21 February 2018
No ratings yet
ACOS 4.1.4 Web Application Firewall Guide: For A10 Thunder™ Series and AX™ Series 21 February 2018
182 pages
Arithmetic Instructions
No ratings yet
Arithmetic Instructions
100 pages
Lecture 08
No ratings yet
Lecture 08
17 pages
Addressing Modes
No ratings yet
Addressing Modes
4 pages
Module4 Part1
No ratings yet
Module4 Part1
30 pages
FALLSEM2024-25 BECE204L TH VL2024250104330 2024-10-01 Reference-Material-I
No ratings yet
FALLSEM2024-25 BECE204L TH VL2024250104330 2024-10-01 Reference-Material-I
28 pages
Instruction Set
No ratings yet
Instruction Set
69 pages
Cao
No ratings yet
Cao
13 pages
SC0x Week3 PP6 Excel Step by Step
No ratings yet
SC0x Week3 PP6 Excel Step by Step
14 pages
QRC0007 VFP
No ratings yet
QRC0007 VFP
2 pages
Arminstructionset 201124050104
No ratings yet
Arminstructionset 201124050104
26 pages
Lab 02
No ratings yet
Lab 02
7 pages
Computer Hardware Servicing 7 & 8
100% (3)
Computer Hardware Servicing 7 & 8
4 pages
DLX Instruction Set Description Notation
No ratings yet
DLX Instruction Set Description Notation
14 pages
Software Validation and Verification Plan
No ratings yet
Software Validation and Verification Plan
4 pages
14 Assembly Instructions
No ratings yet
14 Assembly Instructions
9 pages
2.1 2.2 8086 Addressing Modes and Instruction Set
No ratings yet
2.1 2.2 8086 Addressing Modes and Instruction Set
55 pages
CortexM4 FPU
No ratings yet
CortexM4 FPU
14 pages
The DLX Instruction Set
No ratings yet
The DLX Instruction Set
13 pages
Exception Handling: M. Krishna Kumar MM/M4/LU11/V1/2004 1
No ratings yet
Exception Handling: M. Krishna Kumar MM/M4/LU11/V1/2004 1
33 pages
How To Install CyberPanel On Ubuntu 20
No ratings yet
How To Install CyberPanel On Ubuntu 20
14 pages
Spim Instruction Set: Instructions and Pseudoinstructions
No ratings yet
Spim Instruction Set: Instructions and Pseudoinstructions
6 pages
b1 EXAM
No ratings yet
b1 EXAM
6 pages
EOS Attribute Identifiers
No ratings yet
EOS Attribute Identifiers
4 pages
QRC0007C VFP PDF
No ratings yet
QRC0007C VFP PDF
2 pages
Re - Transaction Confirmation
No ratings yet
Re - Transaction Confirmation
2 pages
Solvd PRBLM Co
No ratings yet
Solvd PRBLM Co
3 pages
RWD Uperform 3.0 - Administration
No ratings yet
RWD Uperform 3.0 - Administration
235 pages
Chapter 5 v8.0 ME
No ratings yet
Chapter 5 v8.0 ME
107 pages
Ebook Riversand PIM
No ratings yet
Ebook Riversand PIM
16 pages
Computer Project (Repaired)
No ratings yet
Computer Project (Repaired)
78 pages
Janisha Sethi cv.2
No ratings yet
Janisha Sethi cv.2
1 page
It Help Desk Resume Objective Examples
100% (1)
It Help Desk Resume Objective Examples
5 pages
Wafl PDF
No ratings yet
Wafl PDF
36 pages
Demystifying The Number of Vcpus For Optimal Workload Performance
No ratings yet
Demystifying The Number of Vcpus For Optimal Workload Performance
12 pages
Installer Debug
No ratings yet
Installer Debug
36 pages
Summary of Aict Project
No ratings yet
Summary of Aict Project
12 pages
Entry Exit Manual
No ratings yet
Entry Exit Manual
25 pages
Kazim Usman-Teaching PDF
No ratings yet
Kazim Usman-Teaching PDF
2 pages
Dept of CSE Even Sem Routine 2024-2025-3
No ratings yet
Dept of CSE Even Sem Routine 2024-2025-3
2 pages
Mailerlite - QA Engineer Assignment
No ratings yet
Mailerlite - QA Engineer Assignment
7 pages
Website Development Proposal: Project: Client
No ratings yet
Website Development Proposal: Project: Client
10 pages
Jam Alarm
No ratings yet
Jam Alarm
5 pages
Final Course Project - University Course Management System UCMS in Java
No ratings yet
Final Course Project - University Course Management System UCMS in Java
2 pages
Lumitester Smart
No ratings yet
Lumitester Smart
2 pages
Curiculum Vitae
No ratings yet
Curiculum Vitae
1 page
Case Study Scenario
No ratings yet
Case Study Scenario
2 pages
3D Granny Squares: 100 Crochet Patterns for Pop-Up Granny Squares
From Everand
3D Granny Squares: 100 Crochet Patterns for Pop-Up Granny Squares
Caitie Moore
5/5 (12)
Scandinavian Dart Tournament Results: Denmark, Finland, Iceland, Norway, and Sweden
From Everand
Scandinavian Dart Tournament Results: Denmark, Finland, Iceland, Norway, and Sweden
Nigel Boeg
No ratings yet
10+2 Level Mathematics For All Exams GMAT, GRE, CAT, SAT, ACT, IIT JEE, WBJEE, ISI, CMI, RMO, INMO, KVPY Etc.
From Everand
10+2 Level Mathematics For All Exams GMAT, GRE, CAT, SAT, ACT, IIT JEE, WBJEE, ISI, CMI, RMO, INMO, KVPY Etc.
Shubhankar Paul
No ratings yet
NorthWestNet NUSIRG Internet Guide
From Everand
NorthWestNet NUSIRG Internet Guide
NorthWestNet
No ratings yet

12 - Floating Point Instructions

Uploaded by

12 - Floating Point Instructions

Uploaded by

Floating-Point Instructions

REGISTERS AND INSTRUCTIONS

• Require different set of registers

▪ Eg: str d1, [x29, 16] // write 8 bytes

• Use the .single or .double pseudo-ops to

adrp x19, x_m // get address of x

fdiv d2, d0, d1 // x / y

adrp x19, z_m // get address of z

end: nop // breakpoint

ldp x29, x30, [sp], 16

You might also like