0% found this document useful (0 votes)

48 views22 pages

ARM Cortex-A9 ARM V7-A A Programmer's Perspective

The ARM Cortex-A9 processor contains a barrel shifter located before the ALU. It can shift one of the ALU operands by up to 32 bits. Load and store instructions in ARM allow data to be transferred between registers and memory. Memory addresses can be modified with offsets or indexing to access data at specific memory locations. Common load and store instructions include LDR, STR, LDRH, STRH, LDRB, and STRB.

Uploaded by

Rayane Gadelha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

48 views22 pages

ARM Cortex-A9 ARM V7-A A Programmer's Perspective

Uploaded by

Rayane Gadelha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 22

ARM Cortex-A9

ARM v7-A

A programmer’s perspective
Part 2
ARM Instructions 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0
cond Opcode S Rn Rd Rs Opcode2 Rm
L
Immd8
General Format
Immd12
Inst Rd, Rn, Rm, Rs
Immd24
Inst Rd, Rn, #0ximm
31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0
cond Opcode S Op1 Dest. Op. Acc. Opcode2 Op2
L

Instruction Classes

Data Processing (largest class): ADD, AND, BIC, CMP, EOR, MOV, ORR, RSB, SUB, TEQ, TST
ASR, ASL, LSL, LSR, ROR, MLA, MLS, MUL, PKH, SDIV, SXT

Branch Instructions: B, BL, BX

Load/Store: LDR, LDRB, LDRW, LDRD, STR, STRB, STRH, STRD, LDM, LDMIA, LDMDA,
LDMDB, LDMIB, STM, STMIA, STMDA, STMDB, STMIB, POP, PUSH

Plus exception handling, coprocessor calls, SIMD, floating point, vector

Loads and Stores FFFC 0000
FC00 0000
OCM
QSPI
Memory Controller
R0: GPR
32 R1: GPR
F890 0000 CPU Addr R2: GPR
F800 1000 PS Data
R3: GPR
32 R4: GPR
F800 0000 SCLR Cache R5: GPR
Data
E100 0000 SMC R6: GPR
CS R7: GPR
E000 0000 IOP R8: GPR
D_IN
BFFF FFFF WE R9: GPR
Registers
R10: GPR

Ports
Port 1 OE DST_Addr R11: GPR
DST_Clk R12: GPR
AXI bus to R13: SP
FPGA R14: LR
R15: PC
8000 0000 Register File
Instruction
7FFF FFFF
Cache
Port 0
AXI bus to
FPGA
4000 0000
3FFF FFFF Immd_Op
Main Memory

1GByte DDR SRC1_Sel

Main Memory
SRC2_Sel

0000 0000 OP1 OP2

ALU_Func Sel ALU

External Source
4 Gbyte Address Space ALU_Status Status
Out
CPUClk
Controller
ARM CPU
Load and Store

32-bit external addresses (4 Gbytes). ZYNQ supports 30-bit external addresses (1 Gbyte).
Loads and stores can operate on words (4 consecutive bytes), half words (2 bytes), or bytes.
Operands residing in memory must be loaded into a register before they can be used, and
new values stored in main memory must be stored from a register.
Three sets of instructions that interact with main memory

- Single register data transfer (LDR / STR)

- Block register data transfer (LDM / STM)
- Single data swap (SWP)

NO memory-to-memory data operations

Load and Store

Basic load and store operations are: LDR/STR, LDRH/STRH, LDRB/STRB, LDRD/STRD

All load/store instructions require a base address pointer placed in a GPR (Rn). Rn is a
“pointer”, or a 32-bit memory address. Square brackets [] designate a pointer.

LDR Rt, [Rn] @ Load Rt with location pointed at by Rn

STR Rt, [Rn] @ Store Rt at location pointed at by Rn

The base address can be modified with an offset applied before access. Examples:

LDR Rt, [Rn,#<imm>] @ Load Rt with location pointed at by Rn + imm value

STR Rt, [Rn,#- <imm>] @ Store Rt at location pointed at by Rn - imm value

Imm is a 12-bit “immediate” value; if omitted, default is 0. A minus sign subtracts the
immediate value, a plus sign (or no sign) adds the immediate value.
Load and Store

Load at store operations can use the PC to load or store “literal”

LDR Rt, label @ Load Rt with location at label (can be +/- 4096 from PC)

Base address can be modified with an offset stored in a 2nd GPR.

LDR Rt, [Rn, Rm] @ Load Rn from location [Rn + Rn]
STR Rt, [Rn, -Rm] @ Store Rn at location [Rn - Rm]
Load and Store with Indexing

Indexing means modifying the base address, and writing the modified value back into the
base address register, as a part of the execution cycle.
Pre-indexing means doing a transfer first, then updating Rn. Pre-indexing is used
when a ‘!’ is added to the end of any load instruction.

LDR Rt, [Rn, #4]! @ Rt <- [Rn + 4], then Rn is updated with address that was used
STR Rt, [Rn, Rm]! @ Rt <- [Rn + Rm], then Rn is updated with address that was used
Post-indexing means updating Rn first, then doing the transfer. Post-indexing is used
when only the base address is enclosed in square bracket.

LDR Rt, [Rn], #4 @ Rn updated with (Rn+4), then Rt loaded from new address
STR Rt, [Rn], Rm @ Rn is updated with (Rn + Rm), then Rt loaded from new address

Indexing provides a powerful tool for working with regular memory structures like arrays.
A quick aside… the barrel shifter Memory Controller
R0: GPR
32 R1: GPR

ARMs shifter is located in one Addr

Data
R2: GPR
R3: GPR
R4: GPR
32
operand data path, in front of the Data Cache R5: GPR
R6: GPR

ALU. CS R7: GPR

R8: GPR
D_IN
R9: GPR
WE Registers
R10: GPR
Shifts can occur as a part of almost OE DST_Addr
DST_Clk
R11: GPR
R12: GPR
R13: SP
any instruction R14: LR
R15: PC

Instruction Register File

Shift instructions (LSR, LSL, ASR, ASL) Cache
work as expected:
Immd_Op
LSR Rd, Rm, #imm SRC1_Sel

LSLS Rd, Rn, Rm SRC2_Sel

5 Barrel
Immd_Op
Shifter
Same with Rotate (ROR):
OP1 OP2
RORS Rd, Rm, #imm ALU
ALU_Func Sel
ROR Rd, Rn, Rm ALU_Status Status

CPUClk Out
Controller
ARM CPU
Load and Store using Barrel Shifter
The shifter can also modify the base address used for load and store operations. A 5-bit
immediate field in the opcode encodes shift amount. No extra CPU time is needed.

LDR Rt, [Rn, Rm, LSL #0x4]

STR Rt, [Rn, Rm, LSR #0x3]!
Load and Store

Additional load and store commands operate on lists of registers. More information in the
Arm Architectural Reference starting on page A8-396.
LDMIA/LDMFD
LDMDA/LDMFA
LDMDB/LDMEA
LDMIB/LDMED
STMIA/STMEA
STMDA/STMED
STMDB/STMFD
STMIB/STMFA
STMDB R13, {r4, r5, r6, r7, r8, r9, r10, r11, r12, r13, r14}
LDMIA R13, {r4, r5, r6, r7, r8, r9, r10, r11, r12, r13, r14}
Load and Store

No instruction can load a 32-bit immediate constant into a register without performing a
data load from memory (ARM does not embed immediate in the instruction stream).
8 or 12 bit immediates can be loaded and rotated to give a wider range of numbers that can
be generated from immediates.
MOV r0, #0x40, 26 @ load #0x1000 into R0
The pseudo-instruction shown can move any declared constant into a register:
.set myconstant, 0xAAAA5555
LDR r0, =myconstant @ load #0xAAAA5555 into R0
The pseudo-instruction shown below can also be used. In this case, the assembler will use a
MOV or MVN instruction if possible; otherwise, it will create a constant and then load it:
LDR r0, =0xFFFF0C0C
PC-relative Load and Store

The PC can be used as the base address.

LDR R0, [PC, #4] @ load R0 from location PC + 4

The PC points 8 bytes ahead of executing instructing (why)?

Calculating relative addresses can be tedious. An ARM pseudo-instruction can help.

ADR Rd, label @ load Rd from location label

The assembler turns this instruction into the following (the assembler calculates offset)

ADD PC, #offset

MOV

A MOV instruction can move data between registers, or from an immediate to a register. A
MVN instruction also moves information, but does a bit-wise negation in the process.

MOV Rd, Rm @ Rd <= Rm.

MOVS Rd, Rm @ Rd <= Rm and updates status bits.
MOV Rd, #0xFFF @ Rd <= FFF (up to 12-bit immediates can be used).
MOVW Rd, #0xFFFF @ Rd <= FFFF (up to 16-bit immediates can be used)
MVN Rd, Rs @ Rd <= NOT Rm (bit-wise inversion).
MOVT Rd, #0xAA @ Top halfword of Rd <= AA; bottom half unchanged

There are other flavors of MOV instructions to move data into special registers and to
coprocessors. You can get more information about these move instructions from the text
book of from the Arm Architectural Reference starting on page A8-484.
Branch

Conditional and unconditional braches in program execution create loops or if-then

constructs. “Branch with Link” (BL) additionally copies the PC to the Link register, so a
subroutine return can resume execution immediately after where it was called. “Branch and
Exchange” braches to an address stored in a register.

B <label> @ Unconditional branch to instruction after label

BNE <label> @ Branch to instruction after label
BL <label> @ Branch to label and copy PC into R14 (the LR)
BX LR @ Branch to location stored in LR (R14)

Examples:
Loop_point: @ label MOV R0, #10
LDR r0, [r1] Loop_point1: SUBS r0, r0, #1
B loop_point BNE loop_point
ADD

The ADD and ADC (add with carry) instructions add the contents in two 32-bit “source”
registers and place the result in a 32-bit “destination” register (the destination register can
be the same as one of the sources).

ADD r0, r1, r2 @ r0 <= r1 + r2

ADC r0, r1, r2 @ r0 <= r1 + r2 + C
ADCS r3, r2, #0xABC @ r3 <= r2 + 0xABC + C and set status bits (up to 12 bit imm)
ADDEQ r0, r1, r2 @ r0 <= r1 + r2 if Z bit is set
ADD r0, r1, r2, LSL #0x4 @ r0 <= r1 + r2 shifted left four bits
ADD r0, r1, r2, ASR r3 @ r0 <= r1 + r2 shifted left (sign ext.) by number in r3
ADD r1, r1, #1 @ increment value in r1

More information about add instructions in text book and the Arm Architectural Reference
starting on page A8-300.
Subtract

The SUB and SBC (subtract with NOT of carry bit), and reverse-subtract RSB and RSC
instructions subtract the contents of one 32-bit register from another register, and place the
result in a 32-bit destination register (the destination register can be the same as one of the
sources).

SUB r0, r1, r2 @ r0 <= r1 - r2

SBCS r0, r1, r2 @ r0 <= r1 - r2 - Not C, and update status bits
SUB r0, r1, #0xABC @ r0 <= r1 - 0xABC (up to 12 bit immediate)
SUBNE r0, r1, r2 @ r0 <= r1 - r2 if Z bit is not set
SUB r0, r1, r2, LSL #0x4 @ r0 <= r1 - r2 shifted left four bits
RSB r0, r1, r2 @ r0 <= r2 – r1
SUB r2, r2, #1 @ Decrement value in r2

More information about sub instructions in text book and the Arm Architectural Reference.
Shift
The ARM can do arithmetic and logical shifts by up to 32 bits. Arithmetic shifts are right-shift
and sign-extended (i.e, the sign bit is placed in all vacated bits). Logical shift right or left (LSR
or LSL) can shift up to 32 bits, and a ‘0’ will be placed in all vacated bits. The number of bits
to shift can be an immediate or placed in a register.

ASR r0, r1 ,#0x04 @ r0 <= r1 contents shifted right four bits with sign extend
ASR r0, r1, r2 @ r0 <= r1 contents shifted right by the number of bits in the bottom
of r2, sign extends, and places result in r0
LSL r0, r1, #0x06 @ r0 <= r1 shifted left 6 bits with 0 fill
LSR r0, r1, r2 @ r0 <= r1 shifted right by number of bits in bottom of r2, 0 fill
Comparisons
CMP compares by subtracting the operands, updating the status bits, and discarding the
results. No need to use “S” mnemonic extension. CMN adds operands, updates status, and
discards results. TST (test) ANDs operand 1 with operand 2, updates status bits, and discards
results. TEQ (test equivalence) XORs operand 1 with operand 2, updates status bits, and
discards results.
CMP r0, #0ximm @ Subtract imm (12 bit) from r0, discard results, update status bits
CMP r0, r1 @ Subtract r1 from r0, discard results, update status bits
CMP r0, r1, LSL #0x06 @ Subtract left-shifted r1 from r0, discard results, update status bits
CMP r0, r1, ROR r2 @ Subtract r1 (right-rotated by number in r2) from r0, discard, update
CMN r0, #0ximm @ Add imm to r0, discard results, update status bits
CMN r0, r1 @ Add r1 and r0, discard results, update status bits
TST r0, #1 @ ANDs r0 with 1, discard results, update status bits (why?)
TEQ r2, r3 @ Bitwise XOR of r2 and r3, discard results, updates status bits
TEQ r2, #5 @ Bitwise XOR of r2 and 5 (Z = 1 if equal)
Logical Operations
The AND, BIC, EOR, and ORR data processing operators all use the same general addressing
modes as the other data processing instructions.

AND r0, r1, #0ximm @ Bitwise AND r1 with 12-bit immediate and write result to r0
ANDS r0, r1, r2 @ Bitwise AND r1 with r2, write result to r0, update status bits
AND r0, r1, r2 LSR r3 @ Bitwise AND r1 with left-shifted r2, write result to r0
BIC r0, r1, #0ximm @ Bitwise AND r1 with inverse of imm, write to r0, update status bits
BIC r0, r1, r2, LSL 0x#2 @ Bitwise AND r1 with left-shifted r2, write to r0, update status bits
EORS r0, r1, #0ximm @ Bitwise XOR r1 with 12-b imm, write result to r0, update status bits
EOR r0, r1, r2, LSR r3 @ Bitwise XOR r1 with r2 shifted left by r3, write to r0
ORR r2, r3, #0xAA @ Bitwise XOR of r3 and AA, write to r2, update status bits
Multiplication
ARM has three multiplication instructions: MUL to multiply two 32-bit registers, MLA
(multiply and accumulate) to multiply two 32-bit registers and add a third register; and MLS
(multiply and subtract) to multiply two registers and subtract the result from a third register.
For each instruction, the destination register holds the least-significant 32 bits of the result.

MUL r0, r1, r2 @ r0 <= r1 x r2

MUL r0, r1 @ r0 <= r1 x r0
MULS r0, r1, r2 @ r0 <= r1 x r2 and status bits are updated
MLA r0, r1, r2, r3 @ r0 <= r1 x r2 + r3
MLAS r0, r1, r2, r3 @ r0 <= r3 - r1 x r2
MLS r0, r1, r2, r3

HCIE-R&S Huawei Certified Internetwork Expert-Routing and Switching Material V1.1
No ratings yet
HCIE-R&S Huawei Certified Internetwork Expert-Routing and Switching Material V1.1
1,212 pages
Unix PPT Lesson
75% (4)
Unix PPT Lesson
70 pages
Grade Estimation Using Surpac
No ratings yet
Grade Estimation Using Surpac
58 pages
Candle Making: Leaflet NO
100% (4)
Candle Making: Leaflet NO
16 pages
PS1 Solutions PDF
100% (1)
PS1 Solutions PDF
3 pages
KRT V3.5 Eng
No ratings yet
KRT V3.5 Eng
103 pages
PMDG 737 Flows + FS2CREW PDF
100% (1)
PMDG 737 Flows + FS2CREW PDF
15 pages
How To Build A Stacking Harness
100% (2)
How To Build A Stacking Harness
5 pages
Black Sand, Tellurides Sulfides - For Recreational Gold Prospecting
100% (2)
Black Sand, Tellurides Sulfides - For Recreational Gold Prospecting
4 pages
2.RGP Corneal Lens
No ratings yet
2.RGP Corneal Lens
13 pages
Design of Horizontal Axis Tidal Turbines
No ratings yet
Design of Horizontal Axis Tidal Turbines
8 pages
Chs 10 - Lesson 2
No ratings yet
Chs 10 - Lesson 2
43 pages
Intro S4HANA Using Global Bike Exercises PP Fiori en v4.2
No ratings yet
Intro S4HANA Using Global Bike Exercises PP Fiori en v4.2
16 pages
9100 Manual
No ratings yet
9100 Manual
11 pages
Class Xii Latest (Ii) Updated Checklist
No ratings yet
Class Xii Latest (Ii) Updated Checklist
36 pages
Weld Consumable Calculator, Butt and Fillet Welds
No ratings yet
Weld Consumable Calculator, Butt and Fillet Welds
7 pages
Cribbage Rules1
No ratings yet
Cribbage Rules1
5 pages
Ionic Equilibrium DPP
No ratings yet
Ionic Equilibrium DPP
33 pages
Production of Ceramic Foam Filters For Molten Meta
No ratings yet
Production of Ceramic Foam Filters For Molten Meta
5 pages
Sample
No ratings yet
Sample
14 pages
Transactions of The Indian Institute of Metals Guidelines To Authors
No ratings yet
Transactions of The Indian Institute of Metals Guidelines To Authors
5 pages
IDS Syllabus
No ratings yet
IDS Syllabus
3 pages
2013PDE5247 - Mohit Rajput
No ratings yet
2013PDE5247 - Mohit Rajput
87 pages
Solutions: Homework 5 Partial Differential Equations M 104228
No ratings yet
Solutions: Homework 5 Partial Differential Equations M 104228
4 pages
Evolution of Object Approach: C Programming Language
No ratings yet
Evolution of Object Approach: C Programming Language
35 pages
Math 2
No ratings yet
Math 2
17 pages
CWS19产品资料英文
No ratings yet
CWS19产品资料英文
7 pages
Algebra 2 Homework Help Answers
100% (1)
Algebra 2 Homework Help Answers
7 pages
CSP2101 Scripting Languages Assignment 3 - Software Based Solution
No ratings yet
CSP2101 Scripting Languages Assignment 3 - Software Based Solution
8 pages
CE Topic 2 & 3
No ratings yet
CE Topic 2 & 3
2 pages
Principles: Life and Work
From Everand
Principles: Life and Work
Ray Dalio
4/5 (648)
The Glass Castle: A Memoir
From Everand
The Glass Castle: A Memoir
Jeannette Walls
4.5/5 (1856)
The Unwinding: An Inner History of the New America
From Everand
The Unwinding: An Inner History of the New America
George Packer
4/5 (45)
The Emperor of All Maladies: A Biography of Cancer
From Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
4.5/5 (298)
The Yellow House: A Memoir (2019 National Book Award Winner)
From Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
4/5 (100)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
From Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brené Brown
4/5 (1175)
Angela's Ashes: A Memoir
From Everand
Angela's Ashes: A Memoir
Frank McCourt
4.5/5 (943)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
From Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
3.5/5 (2289)
The Perks of Being a Wallflower
From Everand
The Perks of Being a Wallflower
Stephen Chbosky
4.5/5 (4103)
Shoe Dog: A Memoir by the Creator of Nike
From Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
4.5/5 (629)
Steve Jobs
From Everand
Steve Jobs
Walter Isaacson
4.5/5 (1139)
Team of Rivals: The Political Genius of Abraham Lincoln
From Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
4.5/5 (244)
The Light Between Oceans: A Novel
From Everand
The Light Between Oceans: A Novel
M.L. Stedman
4.5/5 (815)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
From Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
3.5/5 (233)
Rise of ISIS: A Threat We Can't Ignore
From Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
3.5/5 (144)
Fear: Trump in the White House
From Everand
Fear: Trump in the White House
Bob Woodward
3.5/5 (836)
Sing, Unburied, Sing: A Novel
From Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
4/5 (1267)
Manhattan Beach: A Novel
From Everand
Manhattan Beach: A Novel
Jennifer Egan
3.5/5 (919)
Her Body and Other Parties: Stories
From Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
4/5 (903)
John Adams
From Everand
John Adams
David McCullough
4.5/5 (2546)
The Outsider: A Novel
From Everand
The Outsider: A Novel
Stephen King
4/5 (2886)
Little Women
From Everand
Little Women
Louisa May Alcott
4.5/5 (2369)

ARM Cortex-A9 ARM V7-A A Programmer's Perspective

Uploaded by

ARM Cortex-A9 ARM V7-A A Programmer's Perspective

Uploaded by

ARM Cortex-A9

Branch Instructions: B, BL, BX

Plus exception handling, coprocessor calls, SIMD, floating point, vector

1GByte DDR SRC1_Sel

0000 0000 OP1 OP2

ALU_Func Sel ALU

- Single register data transfer (LDR / STR)

NO memory-to-memory data operations

LDR Rt, [Rn] @ Load Rt with location pointed at by Rn

LDR Rt, [Rn,#<imm>] @ Load Rt with location pointed at by Rn + imm value

Load at store operations can use the PC to load or store “literal”

Base address can be modified with an offset stored in a 2nd GPR.

ARMs shifter is located in one Addr

ALU. CS R7: GPR

Instruction Register File

LSLS Rd, Rn, Rm SRC2_Sel

LDR Rt, [Rn, Rm, LSL #0x4]

The PC can be used as the base address.

The PC points 8 bytes ahead of executing instructing (why)?

ADR Rd, label @ load Rd from location label

ADD PC, #offset

MOV Rd, Rm @ Rd <= Rm.

Conditional and unconditional braches in program execution create loops or if-then

B <label> @ Unconditional branch to instruction after label

ADD r0, r1, r2 @ r0 <= r1 + r2

SUB r0, r1, r2 @ r0 <= r1 - r2

MUL r0, r1, r2 @ r0 <= r1 x r2

You might also like