Cache Memory Explained

Uploaded by

beautart00

Cache memory

Cache memory is a chip-based computer component that makes retrieving data from the
computer's memory more efficient. It acts as a temporary storage area that the computer's
processor can retrieve data from easily. This temporary storage area, known as a cache, is more
readily available to the processor than the computer's main memory source, typically some form
of DRAM.

Cache memory is sometimes called CPU (central processing unit) memory because it is typically
integrated directly into the CPU chip or placed on a separate chip that has a
separate bus interconnect with the CPU. Therefore, it is more accessible to the processor, and
able to increase efficiency, because it's physically close to the processor.

In order to be close to the processor, cache memory needs to be much smaller than main
memory. Consequently, it has less storage space. It is also more expensive than main memory, as
it is a more complex chip that yields higher performance.

What it sacrifices in size and price, it makes up for in speed. Cache memory operates between 10
and 100 times faster than RAM, requiring only a few nanoseconds to respond to a CPU request.

The name of the actual hardware that is used for cache memory is high-speed static random
access memory (SRAM). The name of the hardware that is used in a computer's main memory is
dynamic random access memory (DRAM).

Cache memory is not to be confused with the broader term cache. Caches are temporary stores of
data that can exist in both hardware and software. Cache memory refers to the specific hardware
component that allows computers to create caches at various levels of the memory hierarchy.

Types of cache memory

Cache memory is fast and expensive. Traditionally, it is categorized as "levels" that describe its
closeness and accessibility to the microprocessor. There are three general cache levels:

L1 cache, or primary cache, is extremely fast but relatively small, and is usually embedded in
the processor chip as CPU cache.

L2 cache, or secondary cache, is often more capacious than L1. L2 cache may be embedded on
the CPU, or it can be on a separate chip or coprocessor and have a high-speed alternative system
bus connecting the cache and CPU. That way it doesn't get slowed by traffic on the main system
bus.

Level 3 (L3) cache is specialized memory developed to improve the performance of L1 and L2.
L1 or L2 can be significantly faster than L3, though L3 is usually double the speed of DRAM.
With multicore processors, each core can have dedicated L1 and L2 cache, but they can share an
L3 cache. If an L3 cache references an instruction, it is usually elevated to a higher level of
cache.

In the past, L1, L2 and L3 caches have been created using combined processor and motherboard
components. Recently, the trend has been toward consolidating all three levels of memory
caching on the CPU itself. That's why the primary means for increasing cache size has begun to
shift from the acquisition of a specific motherboard with different chipsets and bus architectures
to buying a CPU with the right amount of integrated L1, L2 and L3 cache.

Contrary to popular belief, implementing flash or more dynamic RAM (DRAM) on a system
won't increase cache memory. This can be confusing since the terms memory caching (hard disk
buffering) and cache memory are often used interchangeably. Memory caching, using DRAM or
flash to buffer disk reads, is meant to improve storage I/O by caching data that is frequently
referenced in a buffer ahead of slower magnetic disk or tape. Cache memory, on the other hand,
provides read buffering for the CPU.

A diagram of the architecture and data flow of a typical cache memory unit.

Cache memory mapping

Caching configurations continue to evolve, but cache memory traditionally works under three
different configurations:

• Direct mapped cache has each block mapped to exactly one cache memory location.
Conceptually, a direct mapped cache is like rows in a table with three columns: the cache
block that contains the actual data fetched and stored, a tag with all or part of the address of
the data that was fetched, and a flag bit (the valid bit) that shows whether the row entry
currently holds valid data.

• Fully associative cache mapping is similar to direct mapping in structure but allows a
memory block to be mapped to any cache location rather than to a prespecified cache
memory location as is the case with direct mapping.

• Set associative cache mapping can be viewed as a compromise between direct mapping and
fully associative mapping in which each block is mapped to a subset of cache locations. It is
sometimes called N-way set associative mapping, which provides for a location in main
memory to be cached to any of "N" locations in the cache.
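
The three configurations above can be sketched with one toy model. This is an illustration under stated assumptions, not a description of any real processor: the class name SetAssociativeCache, its parameters and the oldest-first replacement choice are all hypothetical. Setting ways=1 gives a direct mapped cache, and setting num_sets=1 gives a fully associative one.

```python
from dataclasses import dataclass, field


@dataclass
class SetAssociativeCache:
    """Toy cache model; ways=1 is direct mapped, num_sets=1 is fully associative."""
    num_sets: int
    ways: int
    block_size: int  # addresses covered by one cache block
    sets: list = field(init=False)

    def __post_init__(self):
        # Each set holds up to `ways` tags. A tag that is absent plays the
        # role of a cleared valid bit.
        self.sets = [[] for _ in range(self.num_sets)]

    def split(self, address):
        """Split an address into (tag, set index, block offset)."""
        offset = address % self.block_size
        block_number = address // self.block_size
        index = block_number % self.num_sets
        tag = block_number // self.num_sets
        return tag, index, offset

    def access(self, address):
        """Return True on a hit, False on a miss (the block is then loaded)."""
        tag, index, _ = self.split(address)
        lines = self.sets[index]
        if tag in lines:
            return True
        if len(lines) == self.ways:
            lines.pop(0)  # assumption: evict the oldest block in the set
        lines.append(tag)
        return False


# Addresses 0 and 64 both map to set 0 when there are 4 sets of 16-address blocks.
direct = SetAssociativeCache(num_sets=4, ways=1, block_size=16)
two_way = SetAssociativeCache(num_sets=4, ways=2, block_size=16)
pattern = [0, 64, 0, 64]
direct_hits = [direct.access(a) for a in pattern]
two_way_hits = [two_way.access(a) for a in pattern]
print(direct_hits)   # [False, False, False, False]
print(two_way_hits)  # [False, False, True, True]
```

In the direct mapped case the two blocks keep evicting each other, while the two-way cache can hold both in the same set, which is the compromise the set associative design is meant to offer.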

Data writing policies

Data can be written to memory using a variety of techniques, but the two main ones involving
cache memory are:

• Write-through. Data is written to both the cache and main memory at the same time.

• Write-back. Data is written only to the cache at first. The modified data is copied to main
memory later, typically when the block is evicted from the cache, so the processor does not
have to wait for the slower memory write.

The way data is written to the cache impacts data consistency and efficiency. For example, when
using write-through, more writing needs to happen, which causes latency upfront. When using
write-back, operations may be more efficient, but data may not be consistent between the main
and cache memories.

One way a computer tracks data consistency is by examining the dirty bit. The dirty bit is an
extra bit kept with each cache block that indicates whether the block has been modified since it
was loaded. A set dirty bit means the copy in main memory is out of date and the cached version
is the most recent one, so the block must be written back to main memory before it is replaced.
This situation arises in a write-back scenario, because the data is written to the two storage
areas asynchronously.
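
The difference between the two policies, and the role of the dirty bit, can be illustrated with a small sketch. It assumes a hypothetical WriteCache class in which a Python dict stands in for main memory; real cache controllers work on fixed-size blocks in hardware, so this only shows the bookkeeping.

```python
class WriteCache:
    """Toy single-level cache in front of a dict standing in for main memory."""

    def __init__(self, memory, write_back):
        self.memory = memory          # address -> value
        self.write_back = write_back  # False selects write-through
        self.lines = {}               # address -> [value, dirty_bit]
        self.memory_writes = 0

    def write(self, address, value):
        if self.write_back:
            # Update only the cache and mark the line dirty.
            self.lines[address] = [value, True]
        else:
            # Write-through: update the cache and main memory together.
            self.lines[address] = [value, False]
            self.memory[address] = value
            self.memory_writes += 1

    def flush(self):
        """Copy every dirty line to main memory and clear its dirty bit."""
        for address, line in self.lines.items():
            if line[1]:
                self.memory[address] = line[0]
                self.memory_writes += 1
                line[1] = False


through_memory, back_memory = {}, {}
through = WriteCache(through_memory, write_back=False)
back = WriteCache(back_memory, write_back=True)
for value in (1, 2, 3):
    through.write(0x10, value)
    back.write(0x10, value)

stale_before_flush = back_memory.get(0x10)  # main memory has not seen the writes yet
back.flush()
print(through.memory_writes, back.memory_writes)  # 3 1
```

Three writes to the same address cost three memory writes under write-through but only one under write-back, at the price of main memory being out of date until the dirty line is flushed.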

Specialization and functionality

In addition to instruction and data caches, other caches are designed to provide specialized
system functions. According to some definitions, the L3 cache's shared design makes it a
specialized cache. Other definitions keep the instruction cache and the data cache separate and
refer to each as a specialized cache.

Translation lookaside buffers (TLBs) are also specialized memory caches whose function is to
record virtual address to physical address translations.

Still other caches are not, technically speaking, memory caches at all. Disk caches, for instance,
can use DRAM or flash memory to provide data caching similar to what memory caches do with
CPU instructions. If data is frequently accessed from the disk, it is cached into DRAM or flash-
based silicon storage technology for faster access time and response.

Specialized caches are also available for applications such as web browsers, databases, network
address binding and client-side Network File System protocol support. These types of caches
might be distributed across multiple networked hosts to provide greater scalability or
performance to an application that uses them.

A depiction of the memory hierarchy and how it functions

Locality

The ability of cache memory to improve a computer's performance relies on the concept of
locality of reference. Locality describes various situations that make a system more predictable.
Cache memory takes advantage of these situations to create a pattern of memory access that it
can rely upon.

There are several types of locality. Two key ones for cache are:

• Temporal locality. This is when the same resources are accessed repeatedly in a short
amount of time.

• Spatial locality. This is when data or resources stored near each other are accessed close
together in time.
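
A rough way to see both kinds of locality is to run different access patterns through a very small simulated cache. The sketch below assumes a hypothetical direct mapped cache of 8 lines with 8 addresses per block; the function name hit_ratio and the array dimensions are made up for the illustration.

```python
def hit_ratio(addresses, num_lines=8, block_size=8):
    """Hit ratio of a small direct mapped cache for a sequence of addresses."""
    lines = [None] * num_lines  # block number currently held by each line
    hits = 0
    for address in addresses:
        block = address // block_size
        slot = block % num_lines
        if lines[slot] == block:
            hits += 1
        else:
            lines[slot] = block  # miss: load the block, replacing the old one
    return hits / len(addresses)


ROWS, COLS = 64, 64  # a 64x64 array of one-address elements stored row by row

# Spatial locality: neighbors in memory are visited one after another.
row_order = [r * COLS + c for r in range(ROWS) for c in range(COLS)]
# Poor spatial locality: consecutive accesses are COLS addresses apart.
column_order = [r * COLS + c for c in range(COLS) for r in range(ROWS)]
# Temporal locality: the same few addresses are reused again and again.
repeated = [0, 1, 2, 3] * 1024

print(hit_ratio(row_order))     # 0.875
print(hit_ratio(column_order))  # 0.0
print(hit_ratio(repeated))
```

Walking the array in storage order misses once per block and then hits on the rest of it, while striding down the columns never reuses a loaded block in this toy configuration.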

Performance

Cache memory is important because it improves the efficiency of data retrieval. It stores program
instructions and data that are used repeatedly in the operation of programs or information that the
CPU is likely to need next. The computer processor can access this information more quickly
from the cache than from the main memory. Fast access to these instructions increases the
overall speed of the program.

Aside from its main function of improving performance, cache memory is a valuable resource
for evaluating a computer's overall performance. Users can do this by looking at the cache's
hit-to-miss ratio. Cache hits are instances in which the system successfully retrieves data from
the cache. A cache miss is when the system looks for the data in the cache, can't find it, and looks
somewhere else instead. In some cases, users can improve the hit-to-miss ratio by adjusting the
cache memory block size -- the size of data units stored.
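
As a worked example of how hits and misses translate into speed, the sketch below computes the hit ratio (hits divided by all accesses, a common companion to the hit-to-miss ratio) and the standard average memory access time estimate: hit time plus miss rate times miss penalty. The function names and all of the numbers are illustrative assumptions, not measurements.

```python
def hit_ratio(hits, misses):
    """Fraction of accesses served from the cache."""
    return hits / (hits + misses)


def average_access_time(hit_time_ns, miss_penalty_ns, ratio):
    """Average memory access time: hit time plus miss rate times miss penalty."""
    return hit_time_ns + (1.0 - ratio) * miss_penalty_ns


# Illustrative numbers only, not measurements of any real processor.
ratio = hit_ratio(hits=950, misses=50)
print(ratio)  # 0.95
print(average_access_time(hit_time_ns=1.0, miss_penalty_ns=100.0, ratio=ratio))
```

With these assumed figures, a 5% miss rate raises the average access time from 1 ns to roughly 6 ns, which is why small changes in the hit ratio matter so much.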

Improved performance and ability to monitor performance are not just about improving general
convenience for the user. As technology advances and is increasingly relied upon in mission-
critical scenarios, having speed and reliability becomes crucial. Even a few milliseconds of
latency could potentially lead to enormous expenses, depending on the situation.

A chart comparing cache memory to other memory types

Cache vs. main memory

DRAM serves as a computer's main memory, holding the programs and data retrieved from
storage while the CPU works on them. Both DRAM and cache memory are volatile memories that
lose their contents when the power is turned off. DRAM is installed on the motherboard, and the
CPU accesses it through a bus connection.

DRAM is usually about half as fast as L3 cache memory, slower still compared with L1 and L2,
and much less expensive. It provides faster data access than flash storage, hard disk drives
(HDD) and tape storage. It came into use in the last few decades to provide a place to store
frequently accessed disk data to improve I/O performance.

DRAM must be refreshed every few milliseconds. Cache memory, which also is a type of
random access memory, does not need to be refreshed. It is built directly into the CPU to give the
processor the fastest possible access to memory locations and provides nanosecond speed access
time to frequently referenced instructions and data. SRAM is faster than DRAM, but because it's
a more complex chip, it's also more expensive to make.

An example of dynamic RAM.

Cache vs. virtual memory

A computer has a limited amount of DRAM and even less cache memory. When a large program
or multiple programs are running, it's possible for memory to be fully used. To compensate for a
shortage of physical memory, the computer's operating system (OS) can create virtual memory.

To do this, the OS temporarily transfers inactive data from DRAM to disk storage. This approach
increases virtual address space by using active memory in DRAM and inactive memory in HDDs
to form contiguous addresses that hold both an application and its data. Virtual memory lets a
computer run larger programs or multiple programs simultaneously, and each program operates
as though it has unlimited memory.

In order to move data between virtual memory and physical memory, the OS divides memory into
pages that contain a fixed number of addresses. Pages that are not in use are stored on a disk in
a page file or swap file, and when they're needed, the OS copies them from the disk to main
memory and translates the virtual memory address into a physical one. These translations are
handled by a memory management unit (MMU).
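
The translation step can be sketched as follows. This is a simplified model under stated assumptions: the Mmu class, the 4,096-byte page size, the two-entry translation lookaside buffer (TLB) and the oldest-first TLB replacement are all hypothetical choices made for the illustration, and a dict stands in for the page table.

```python
PAGE_SIZE = 4096  # assumed page size in bytes


class Mmu:
    """Toy MMU: a page table in a dict, fronted by a small TLB."""

    def __init__(self, page_table, tlb_entries=2):
        self.page_table = page_table  # virtual page number -> physical frame number
        self.tlb = {}                 # recently used translations
        self.tlb_entries = tlb_entries
        self.tlb_hits = 0

    def translate(self, virtual_address):
        page, offset = divmod(virtual_address, PAGE_SIZE)
        if page in self.tlb:
            self.tlb_hits += 1
            frame = self.tlb[page]
        else:
            if page not in self.page_table:
                # A real OS would handle this page fault by loading the page from disk.
                raise LookupError(f"page fault on virtual page {page}")
            frame = self.page_table[page]
            if len(self.tlb) >= self.tlb_entries:
                # Assumption: drop the oldest entry (dicts keep insertion order).
                self.tlb.pop(next(iter(self.tlb)))
            self.tlb[page] = frame
        return frame * PAGE_SIZE + offset


mmu = Mmu({0: 5, 1: 9})
first = mmu.translate(4100)   # page 1, offset 4 -> frame 9
second = mmu.translate(4200)  # same page, served from the TLB
print(first, second, mmu.tlb_hits)  # 36868 36968 1
```

The second lookup never touches the page table, which is the saving a TLB provides for repeated accesses to the same page.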

Implementation and history

Mainframes used an early version of cache memory, but the technology as it is known today
began to be developed with the advent of microcomputers. With early PCs, processor
performance increased much faster than memory performance, and memory became a
bottleneck, slowing systems.

In the 1980s, the idea took hold that a small amount of more expensive, faster SRAM could be
used to improve the performance of the less expensive, slower main memory. Initially, the
memory cache was separate from the system processor and not always included in the chipset.
Early PCs typically had from 16 KB to 128 KB of cache memory.

With 486 processors, Intel added 8 KB of memory to the CPU as Level 1 (L1) memory. As much
as 256 KB of external Level 2 (L2) cache memory was used in these systems. Pentium
processors saw the external cache memory double again to 512 KB on the high end. They also
split the internal cache memory into two caches: one for instructions and the other for data.

Processors based on Intel's P6 microarchitecture, introduced in 1995, were the first to incorporate
L2 cache memory into the CPU and enable all of a system's cache memory to run at the
same clock speed as the processor. Prior to the P6, L2 memory external to the CPU was accessed
at a much slower clock speed than the rate at which the processor ran and slowed system
performance considerably.

Early memory cache controllers used a write-through cache architecture, where data written into
cache was also immediately updated in RAM. This approach minimized data loss, but also
slowed operations. With later 486-based PCs, the write-back cache architecture was developed,
where RAM isn't updated immediately. Instead, data is stored on cache and RAM is updated
only at specific intervals or under certain circumstances where data is missing or old.
