ARM Processor

The document discusses the ARM processor architecture. It originated from Acorn Computers' RISC Machine design. ARM designs processor cores and licenses them to partners who manufacture and sell chips containing ARM cores. ARM is a leading provider of 32-bit embedded RISC microprocessors, commonly used in applications like mobile phones, automotive systems, and more. The document describes the evolution of ARM architectures over time, from ARMv1 to newer versions, and provides details on the ARM programming model, instruction set, pipeline design, and more.

ARM Processor

Dr. B. Thiyaneswaran
Associate Professor,
Department of ECE,
Sona College of Technology
ARM Basics
• The ARM processor core originated at Acorn Computers.
• ARM originally stood for Acorn RISC Machine.
• ARM designs the ARM range of RISC processor
cores.
• ARM licenses its core designs to semiconductor
partners, who fabricate and sell chips to their
customers.
(ARM does not fabricate silicon itself.)
Leading provider of 32-bit embedded RISC
microprocessors:
• 75% of market.
• High performance
• Low power consumption
• Low system cost
Solutions for:
• Embedded real-time systems for mass storage, automotive,
industrial and networking applications.
• Secure applications: smartcards and SIMs.
• Open platforms running complex operating systems.
• ARMv1
First version of the ARM processor, 26-bit addressing,
no multiply or coprocessor support.
• ARMv2
ARM2, the first commercial chip. Included 32-bit result
multiply instructions and coprocessor support.
• ARMv2a
ARM3 chip with on-chip cache. Added an atomic load and
store (swap) instruction and cache management.
• ARMv3
ARM6, 32-bit addressing, virtual memory support.
ARM Processor Core
Current low-end ARM core for applications like digital
mobile phones
TDMI
T: Thumb, 16-bit instruction set
D: on-chip Debug support, enabling the processor to halt in
response to a debug request
M: enhanced Multiplier, yielding a full 64-bit result with
high performance
I: EmbeddedICE hardware
Von Neumann architecture
3-stage pipeline
ARM Core Diagram
ARM Inheritance
Used features of RISC:
• A load and store architecture.
• Fixed-length 32-bit instructions.
• 3-address instruction formats.
Unused features:
• Register windows.
• Delayed branches.
• Single-cycle execution of all instructions.
ARM Programmer Model
• Visible registers.
• Invisible registers -> not significant here.

R0-R12: General-purpose 'scratch pad' registers into which data and
addresses are loaded.

R13: Stack pointer (SP). It points to the stack.

R14: Link register (LR). It is used whenever there is a procedure call or an
interrupt, that is, a branch to another location. The value of the PC is saved
in the link register and the PC takes on the new branch address, so the link
register stores the return address.

R15: Acts as the program counter (PC).
Current Program Status Register (CPSR)
Operating Modes

Modes:
1. User -> Unprivileged mode.
2. FIQ (Fast Interrupt Request) -> Entered on a high-priority interrupt.
3. IRQ (Interrupt Request) -> Entered on a low-priority interrupt.
4. Supervisor -> Entered on reset and when an SWI (software interrupt) instruction is executed.
5. Abort -> Used to handle memory access violations.
6. Undef -> Used to handle undefined instructions.
7. System -> Privileged mode with the same register access as
User mode.
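The seven modes above are selected by the low five bits of the CPSR. The sketch below decodes that field in Python. The bit patterns are the classic ARM mode encodings; they do not appear in the slides and are included here as an assumption, and the helper names `decode_mode` and `is_privileged` are purely illustrative.

```python
# Mode field encodings (CPSR bits 4..0) for the seven classic ARM modes.
# These values are not listed in the slides; they are assumed from the
# classic ARM architecture.
MODE_BITS = {
    0b10000: "User",
    0b10001: "FIQ",
    0b10010: "IRQ",
    0b10011: "Supervisor",
    0b10111: "Abort",
    0b11011: "Undef",
    0b11111: "System",
}


def decode_mode(cpsr: int) -> str:
    """Return the mode name held in the low five bits of a CPSR value."""
    return MODE_BITS.get(cpsr & 0x1F, "reserved")


def is_privileged(cpsr: int) -> bool:
    """Every defined mode except User is privileged."""
    return decode_mode(cpsr) not in ("User", "reserved")


if __name__ == "__main__":
    for value in (0x10, 0xD3, 0x1F):
        print(hex(value), decode_mode(value), is_privileged(value))
```

Masking with 0x1F ignores the flag and interrupt-disable bits, so any full CPSR value can be passed in directly.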
Saved Program Status
Register (SPSR)

• There are 5 SPSRs.
• Each one corresponds to an exception
mode of operation (FIQ, IRQ, Supervisor,
Abort, Undef).
• When an exception such as an interrupt
occurs, the current CPSR value is saved
into the SPSR of the corresponding mode.
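The save-and-restore role of the SPSR can be sketched as a small Python model. This is only an illustration: the dictionary layout and the function names are hypothetical, and real hardware also switches the banked registers and sets the link register, which is left out here.

```python
# The five exception modes that each own an SPSR.
EXCEPTION_MODES = ("FIQ", "IRQ", "Supervisor", "Abort", "Undef")


def make_core(cpsr):
    """Build a toy core state: one CPSR plus one SPSR slot per exception mode."""
    return {"cpsr": cpsr, "spsr": {mode: None for mode in EXCEPTION_MODES}}


def enter_exception(core, mode, new_cpsr):
    """On exception entry the current CPSR is copied into that mode's SPSR."""
    core["spsr"][mode] = core["cpsr"]
    core["cpsr"] = new_cpsr


def return_from_exception(core, mode):
    """On return the saved value is copied back into the CPSR."""
    core["cpsr"] = core["spsr"][mode]


if __name__ == "__main__":
    core = make_core(0x10)             # assumed: User mode, interrupts enabled
    enter_exception(core, "IRQ", 0x92)  # assumed: IRQ mode with IRQs masked
    print(hex(core["spsr"]["IRQ"]), hex(core["cpsr"]))
    return_from_exception(core, "IRQ")
    print(hex(core["cpsr"]))
```

User and System modes have no SPSR slot in this model, which matches the count of five given above.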
ARM Memory system
• 32-bit memory.
• 8-bit / 16-bit / 32-bit access flexibility.
• Supports both little-endian and big-endian formats.
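As a quick illustration of the two byte orders, the sketch below uses Python's standard `struct` module to lay out the same 32-bit word both ways and then reads it back at 8-bit and 16-bit width. The sample value 0x11223344 and the function names are arbitrary choices for the example.

```python
import struct

WORD = 0x11223344  # arbitrary sample value


def as_little_endian(word):
    """Least significant byte is stored at the lowest address."""
    return struct.pack("<I", word)


def as_big_endian(word):
    """Most significant byte is stored at the lowest address."""
    return struct.pack(">I", word)


if __name__ == "__main__":
    little = as_little_endian(WORD)
    big = as_big_endian(WORD)
    print("little-endian bytes:", little.hex())
    print("big-endian bytes:   ", big.hex())
    # Byte and halfword reads from the little-endian image at address 0.
    print("byte at 0:    ", hex(little[0]))
    print("halfword at 0:", hex(struct.unpack("<H", little[:2])[0]))
```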
Load and Store architecture
• Direct memory-to-memory operations, as found in CISC
processors, are not allowed.
• For addition, subtraction or any other operation,
the operands and the result must be in registers.
• Data in memory has to be loaded into
registers first (LOAD) -> registers to ALU -> ALU to
registers.
• Register data is then stored to memory (STORE).
Load and Store architecture
Data Processing instruction:
Uses registers for its operand values. Results
are also stored in registers (never directly in
memory).
Data Transfer instruction:
Even a memory-to-memory copy goes through registers:
Location-1 -> Register, then Register -> Location-2.
Control Flow Instruction:
Control flow requires jumping to a different
address.
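A toy Python model can make the load and store discipline concrete: memory is touched only by the load and store helpers, and the add works on registers alone. The addresses, register numbers and helper names are made up for this example.

```python
# Toy machine state: a word-addressed memory and 16 registers.
memory = {0x100: 7, 0x104: 5, 0x108: 0}
regs = [0] * 16


def ldr(rd, address):
    """LOAD: copy a word from memory into a register."""
    regs[rd] = memory[address]


def str_(rd, address):
    """STORE: copy a register into memory."""
    memory[address] = regs[rd]


def add(rd, rn, rm):
    """Data processing: operands and result are registers only (32-bit wrap)."""
    regs[rd] = (regs[rn] + regs[rm]) & 0xFFFFFFFF


# memory[0x108] = memory[0x100] + memory[0x104], done the load/store way.
ldr(0, 0x100)
ldr(1, 0x104)
add(2, 0, 1)
str_(2, 0x108)

if __name__ == "__main__":
    print(memory[0x108])
```

The single high-level addition needs four steps here, which is the cost a load and store architecture accepts in return for simple, regular instructions.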
3 Stage ARM Organization
• The register bank, which stores the processor
state.
• It has two read ports and one write port.
• Plus an additional read port and an additional
write port that give special access to r15, the
program counter.
(The additional write port on r15 allows it to be
updated as the instruction fetch address is incremented,
and the read port allows instruction fetch to resume
after a data address has been issued.)
• The barrel shifter -> can shift or rotate
one operand by any number of bits.
• The ALU -> performs arithmetic and logic functions.
• The address register and incrementer -> select
and hold all memory addresses.
• The data registers, which hold data passing to
and from memory.
• The instruction decoder and associated
control logic.
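The barrel shifter's operations can be sketched in Python on 32-bit values. This is a behavioural sketch only: it assumes a shift amount in the range 0 to 31 and ignores the carry flag, and the function names simply follow the usual ARM shift mnemonics.

```python
MASK = 0xFFFFFFFF  # keep every result within 32 bits


def lsl(value, amount):
    """Logical shift left: zeros enter from the right."""
    return (value << amount) & MASK


def lsr(value, amount):
    """Logical shift right: zeros enter from the left."""
    return (value & MASK) >> amount


def asr(value, amount):
    """Arithmetic shift right: the sign bit is replicated from the left."""
    value &= MASK
    if value & 0x80000000:
        value -= 1 << 32  # reinterpret as a negative Python integer
    return (value >> amount) & MASK


def ror(value, amount):
    """Rotate right: bits leaving the right re-enter on the left."""
    amount %= 32
    value &= MASK
    return ((value >> amount) | (value << (32 - amount))) & MASK


if __name__ == "__main__":
    print(hex(lsl(1, 3)), hex(lsr(0x80000000, 31)))
    print(hex(asr(0x80000000, 31)), hex(ror(0x12345678, 8)))
```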
3 Stage Pipeline Organization
• 3-stage pipeline: Fetch – Decode - Execute
• Three-cycle latency, one instruction per cycle
throughput

Instruction   Cycle t   t+1      t+2       t+3       t+4
i             Fetch     Decode   Execute
i+1                     Fetch    Decode    Execute
i+2                              Fetch     Decode    Execute
FETCH:
Instruction is fetched from memory and placed in the
instruction pipeline.
DECODE:
The instruction is decoded and the data path control
signals prepared for the next cycle. In this stage the
instruction 'owns' the decode logic but not the data path.
EXECUTE:
The instruction 'owns' the data path. The register bank is
read, an operand shifted, the ALU result generated and
written back into a destination register.
***At any one time, three different instructions may occupy
each of these stages, so the hardware in each stage has to
be capable of independent operation.
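The overlap described above can be reproduced with a few lines of Python. The sketch assumes an ideal pipeline with no stalls and single-cycle instructions; `schedule` and `total_cycles` are illustrative names, and cycle numbers start at 0 rather than t.

```python
STAGES = ("Fetch", "Decode", "Execute")


def schedule(n_instructions, stages=STAGES):
    """Map each instruction index to {cycle: stage} for an ideal pipeline."""
    return {
        i: {i + offset: name for offset, name in enumerate(stages)}
        for i in range(n_instructions)
    }


def total_cycles(n_instructions, n_stages=len(STAGES)):
    """Cycles to finish n_instructions (n_instructions >= 1) with no stalls."""
    return n_stages + n_instructions - 1


if __name__ == "__main__":
    plan = schedule(3)
    for cycle in range(total_cycles(3)):
        busy = {f"i+{i}": plan[i][cycle] for i in plan if cycle in plan[i]}
        print(f"cycle {cycle}: {busy}")
```

From the third cycle onward all three stages are busy with different instructions, which is where the one-instruction-per-cycle throughput comes from.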
Multiple cycle instruction in 3 stage
5 stage pipeline

Performance of the system depends on:

CPI: average clock cycles per instruction.
fclk: clock frequency.

To increase fclk:
• The logic in each pipeline stage has to be simplified.
• The number of pipeline stages has to be increased.

To reduce CPI:
• Instructions that take more than one cycle in the
3-stage pipeline may be re-implemented to take fewer.
• Pipeline stalls caused by data dependencies may be
reduced.
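The two quantities combine in the standard relation: execution time = (instruction count x CPI) / fclk. The slides do not state this formula explicitly, so treat the sketch below as an assumption about what they intend; the instruction count, CPI values and clock rates are made-up sample numbers.

```python
def execution_time(n_instructions, cpi, f_clk_hz):
    """Seconds to run a program: instruction count x CPI / clock frequency."""
    return n_instructions * cpi / f_clk_hz


if __name__ == "__main__":
    baseline = execution_time(1_000_000, 1.5, 50e6)     # assumed sample values
    faster_clock = execution_time(1_000_000, 1.5, 100e6)
    lower_cpi = execution_time(1_000_000, 1.2, 50e6)
    print(baseline, faster_clock, lower_cpi)
```

Raising fclk and lowering CPI both shorten the run time, which is why the list above treats them as the two levers for performance.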
Bottleneck of the ARM 3 stage
• The 3-stage ARM is a von Neumann (stored
program) machine: instructions and data share a
single memory, so instruction fetches and data
accesses compete for it. This is critical when
memory bandwidth is limited.

More than one 32-bit word would have to be read
from memory in a single cycle.
Need for 5 stage Pipeline
• Packing complex instruction processing into a single stage means the
stage either cannot complete in time or requires a longer clock period.

Remember: this is the major issue with the 3-stage pipeline that stops us
from increasing fclk.

Solution:
• Break instruction processing into 5 stages by splitting the execute
stage into 3 stages.
• Increase the clock speed fclk so that each stage completes within one
clock cycle.
5 stage Pipeline

• Fetch.
• Decode.
• Execute.
• Buffer / Data.
• Write Back.
5 stage Pipeline
Fetch:
The instruction is fetched from memory and placed in the instruction
pipeline.
Decode:
The instruction is decoded and register operands read from the register file.
Register bank has 3 Read ports -> ARM instructions can source all their
operands in one cycle.
Execute:
An operand is shifted and the ALU result is generated. If the instruction is
a load or store, the memory address is computed in the ALU.
Buffer/data:
Data memory is accessed if required. Otherwise the ALU result is simply
buffered for one clock cycle to give the same pipeline flow for all instructions.
Write-back:
The results generated by the instruction are written back to the register file,
including any data loaded from memory.
Comparative Clock analysis of 3 & 5 stage Pipeline

Analysis for 3 instructions:

3 Stage (low clock speed):
5 cycles x 500ms = 2500ms (2.5s).

5 Stage (high clock speed):
7 cycles x 250ms = 1750ms (1.75s).
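The cycle counts in this comparison follow from an ideal pipeline needing (stages + instructions - 1) cycles. A minimal Python check, assuming no stalls and using the 500 ms and 250 ms clock periods from the analysis (`pipeline_time_ms` is an illustrative name):

```python
def pipeline_time_ms(n_instructions, n_stages, clock_period_ms):
    """Total time for an ideal, stall-free pipeline."""
    cycles = n_stages + n_instructions - 1
    return cycles * clock_period_ms


if __name__ == "__main__":
    print("3 stage:", pipeline_time_ms(3, 3, 500), "ms")
    print("5 stage:", pipeline_time_ms(3, 5, 250), "ms")
```

The 5-stage pipeline needs two more cycles for the same three instructions, but the shorter clock period more than makes up for it.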
ARM 5 Stage (ARM9 TDMI)
Operations…..
STR (Store – Register Data Path activity)
Branching Instruction
ARM Instruction Set

• Data Processing instructions.

• Data Transfer instructions.

• Branching instructions.
Data Processing Instruction
• All operands are 32 bits wide and come from
registers or are specified as literals in the instruction
itself.
• The result, if there is one, is 32 bits wide and is
placed in a register. (There is an exception here: long
multiply instructions produce a 64-bit result).
• Each of the operand registers and the result register
are independently specified in the instruction. That
is, the ARM uses a '3-address' format for these
instructions.
Ex: ADD R0, R1, R2
Arithmetic Instructions
Arithmetic

Bit wise operation

Comparison instructions

Immediate operands

ADD r3, r3, #1

AND r8, r7, #&ff
Shifted register operands

ADD r3, r2, r1, LSL #3 ; r3 := r2 + (r1 LSL #3), i.e. r2 + 8 x r1

LSL: logical shift left.
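The effect of the immediate and shifted-register forms can be checked with a short Python sketch. The functions model only the 32-bit result, not the condition flags, and the names `add_lsl` and `and_imm` are invented for the example.

```python
MASK = 0xFFFFFFFF  # results wrap to 32 bits


def add_lsl(rn, rm, shift):
    """Model of ADD rd, rn, rm, LSL #shift: rd = rn + (rm << shift)."""
    return (rn + ((rm << shift) & MASK)) & MASK


def and_imm(rn, immediate):
    """Model of AND rd, rn, #immediate."""
    return rn & immediate & MASK


if __name__ == "__main__":
    # ADD r3, r2, r1, LSL #3 with r2 = 10 and r1 = 2 gives 10 + 2 * 8.
    print(add_lsl(10, 2, 3))
    # AND r8, r7, #&ff keeps only the low byte of r7.
    print(hex(and_imm(0x12345678, 0xFF)))
```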

Arithmetic Instructions
Multiplication

SMULL R0, R1, R2, R3 ; signed: R0 <- lower 32 bits, R1 <- upper 32 bits of R2 x R3
UMULL R0, R1, R2, R3 ; unsigned: R0 <- lower 32 bits, R1 <- upper 32 bits of R2 x R3

The ARM instruction set described here has no division instruction.
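How a 64-bit product is split across two registers can be sketched in Python. Both functions return the pair (low word, high word), matching the operand order RdLo, RdHi of the long multiply instructions; the function names are illustrative and flags are not modelled.

```python
MASK = 0xFFFFFFFF


def to_signed(value):
    """Reinterpret a 32-bit pattern as a two's-complement integer."""
    value &= MASK
    return value - (1 << 32) if value & 0x80000000 else value


def umull(rm, rs):
    """Unsigned 32 x 32 -> 64 multiply, returned as (low word, high word)."""
    product = (rm & MASK) * (rs & MASK)
    return product & MASK, (product >> 32) & MASK


def smull(rm, rs):
    """Signed 32 x 32 -> 64 multiply, returned as (low word, high word)."""
    product = (to_signed(rm) * to_signed(rs)) & ((1 << 64) - 1)
    return product & MASK, product >> 32


if __name__ == "__main__":
    low, high = umull(0xFFFFFFFF, 2)
    print(hex(low), hex(high))
    low, high = smull(0xFFFFFFFF, 2)  # 0xFFFFFFFF is -1 when signed
    print(hex(low), hex(high))
```

The same bit pattern gives different high words in the two cases, which is why separate signed and unsigned instructions exist.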

Data Transfer instructions
Register indirect, Single register Load & Store

Base plus offset addressing (pre indexed)

Base plus offset addressing (Post indexed)
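The difference between the two base plus offset modes is when the offset is applied. The Python sketch below assumes the usual ARM behaviour: pre-indexed addressing adds the offset before the access (updating the base only if write-back is requested), while post-indexed addressing accesses the base address first and then updates the base. The memory contents, register numbers and function names are made up for the example.

```python
def ldr_pre_indexed(memory, regs, rd, rn, offset, writeback=False):
    """Like LDR rd, [rn, #offset] (with '!' when writeback is True)."""
    address = regs[rn] + offset
    regs[rd] = memory[address]
    if writeback:
        regs[rn] = address


def ldr_post_indexed(memory, regs, rd, rn, offset):
    """Like LDR rd, [rn], #offset: access first, then update the base."""
    regs[rd] = memory[regs[rn]]
    regs[rn] += offset


if __name__ == "__main__":
    memory = {0x100: 11, 0x104: 22}
    regs = {0: 0, 1: 0x100}
    ldr_pre_indexed(memory, regs, 0, 1, 4)
    print(regs[0], hex(regs[1]))   # loads from 0x104, base unchanged
    ldr_post_indexed(memory, regs, 0, 1, 4)
    print(regs[0], hex(regs[1]))   # loads from 0x100, base becomes 0x104
```

Post-indexing suits walking through an array, since each load leaves the base register pointing at the next element.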

Data Transfer instructions
8 bit operation:
Control Transfer instructions
Examples
Thumb Decompressor
Thumb Properties
• The Thumb code requires 70% of the space of
the ARM code.
• The Thumb code uses 40% more instructions
than the ARM code.
• With 32-bit memory, the ARM code is 40%
faster than the Thumb code.
• With 16-bit memory, the Thumb code is 45%
faster than the ARM code.
• Thumb code uses 30% less external memory
power than ARM code.
Thumb Instructions
Thumb Instructions
