0% found this document useful (0 votes)

6 views20 pages

HTTP Caching - Roadmap - SH

This document provides a comprehensive guide on HTTP caching, explaining its importance in reducing latency and server load. It covers various caching locations such as browser, proxy, and reverse proxy caches, as well as caching headers like Expires and Cache-Control. Additionally, it discusses validation methods using ETags and Last-Modified headers, and offers recommendations for implementing effective caching strategies.

Uploaded by

ENGINEERING zone

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views20 pages

HTTP Caching - Roadmap - SH

Uploaded by

ENGINEERING zone

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 20

Best

Roadmaps Guides Videos ⌘K Account

Practices

Kamran Ahmed · Textual Guide · Improve this Guide

HTTP Caching
Everything you need to know about web caching

As users, we easily get frustrated by the buffering of videos, the

images that take seconds to load, and pages that got stuck because
the content is being loaded. Loading the resources from some cache is
much faster than fetching the same from the originating server. It
reduces latency, speeds up the loading of resources, decreases the
load on the server, cuts down the bandwidth costs etc.

Introduction
What is a web cache? It is something that sits somewhere between the
client and the server, continuously looking at the requests and their
responses, looking for any responses that can be cached. So that there
is less time consumed when the same request is made again.
:
Note that this image is just to give you an idea. Depending upon the
type of cache, the place where it is implemented could vary. More
on this later.

Before we get into further details, let me give you an overview of the
terms that will be used, further in the article

Client could be your browser or any application requesting the

server for some resource

Origin Server, the source of truth, houses all the content required
by the client and is responsible for fulfilling the client’s requests.

Stale Content is cached but expired content

Fresh Content is the content available in the cache that hasn’t

expired yet

Cache Validation is the process of contacting the server to check

:
the validity of the cached content and get it updated for when it is
going to expire

Cache Invalidation is the process of removing any stale content

available in the cache

Caching Locations
Web cache can be shared or private depending upon the location
where it exists. Here is the list of different caching locations

Browser Cache

Proxy Cache

Reverse Proxy Cache

Browser Cache

You might have noticed that when you click the back button in your
:
browser it takes less time to load the page than the time that it took
during the first load; this is the browser cache in play. Browser cache is
the most common location for caching and browsers usually reserve
some space for it.

A browser cache is limited to just one user and unlike other caches, it
can store the “private” responses. More on it later.

Proxy Cache

Unlike browser cache which serves a single user, proxy caches may
serve hundreds of different users accessing the same content. They
are usually implemented on a broader level by ISPs or any other
independent entities for example.
:
Reverse Proxy Cache

A Reverse proxy cache or surrogate cache is implemented close to the

origin servers in order to reduce the load on the server. Unlike proxy
caches which are implemented by ISPs etc to reduce the bandwidth
usage in a network, surrogates or reverse proxy caches are
implemented near the origin servers by the server administrators to
reduce the load on the server.
:
Although you can control the reverse proxy caches (since it is
implemented by you on your server) you can not avoid or control
browser and proxy caches. And if your website is not configured to use
these caches properly, it will still be cached using whatever defaults are
set on these caches.

Caching Headers
So, how do we control the web cache? Whenever the server emits
some response, it is accompanied by some HTTP headers to guide the
caches on whether and how to cache this response. The content
provider is the one that has to make sure to return proper HTTP
headers to force the caches on how to cache the content.

Introduction

Caching Locations
:
Browser Cache

Proxy Cache

Reverse Proxy Cache

Caching Headers

Expires

Pragma

Cache-Control

private

public

no-store

no-cache

max-age: seconds

s-maxage: seconds

must-revalidate

proxy-revalidate

Mixing Values

Validators

ETag
:
Last-Modified

Where do I start?

Utilizing Server

Caching Recommendations

Expires

Before HTTP/1.1 and the introduction of Cache-Control, there was an

Expires header which is simply a timestamp telling the caches how

long should some content be considered fresh. A possible value to this

header is the absolute expiry date; where a date has to be in GMT.
Below is the sample header

Expires: Mon, 13 Mar 2017 12:22:00 GMT

It should be noted that the date cannot be more than a year and if the
date format is wrong, the content will be considered stale. Also, the
clock on the cache has to be in sync with the clock on the server,
otherwise, the desired results might not be achieved.

Although the Expires header is still valid and is supported widely by

the caches, preference should be given to HTTP/1.1 successor of it i.e.
Cache-Control.
:
Pragma

Another one from the old, pre HTTP/1.1 days, is Pragma. Everything that
it could do is now possible using the cache-control header given below.
However, one thing I would like to point out about it is, that you might
see Pragma: no-cache being used here and there in hopes of
stopping the response from being cached. It might not necessarily
work; as HTTP specification discusses it in the request headers and
there is no mention of it in the response headers. Rather Cache-
Control header should be used to control the caching.

Cache-Control

Cache-Control specifies how long and in what manner should the

content be cached. This family of headers was introduced in HTTP/1.1
to overcome the limitations of the Expires header.

Value for the Cache-Control header is composite i.e. it can have

multiple directive/values. Let’s look at the possible values that this
header may contain.

private

Setting the cache to private means that the content will not be
cached in any of the proxies and it will only be cached by the client (i.e.
browser)
:
Cache-Control: private

Having said that, don’t let it fool you into thinking that setting this
header will make your data any secure; you still have to use SSL for that
purpose.

public

If set to public, apart from being cached by the client, it can also be
cached by the proxies; serving many other users

Cache-Control: public

no-store

`no-store` specifies that the content is not to be cached by any of

the caches

Cache-Control: no-store

no-cache

`no-cache` indicates that the cache can be maintained but the

:
cached content is to be re-validated (using ETag for example) from the
server before being served. That is, there is still a request to server but
for validation and not to download the cached content.

Cache-Control: max-age=3600, no-cache, public

max-age: seconds

`max-age` specifies the number of seconds for which the content will

be cached. For example, if the cache-control looks like below:

Cache-Control: max-age=3600, public

it would mean that the content is publicly cacheable and will be

considered stale after 60 minutes

s-maxage: seconds

`s-maxage` here s- prefix stands for shared. This directive specifically

targets the shared caches. Like max-age it also gets the number of
seconds for which something is to be cached. If present, it will override
max-age and expires headers for shared caching.
:
Cache-Control: s-maxage=3600, public

must-revalidate

`must-revalidate` it might happen sometimes that if you have

network problems and the content cannot be retrieved from the server,
the browser may serve stale content without validation. must-
revalidate avoids that. If this directive is present, it means that stale

content cannot be served in any case and the data must be re-
validated from the server before serving.

Cache-Control: max-age=3600, public, must-revalidate

proxy-revalidate

`proxy-revalidate` is similar to must-revalidate but it specifies

the same for shared or proxy caches. In other words proxy-
revalidate is to must-revalidate as s-maxage is to max-age. But

why did they not call it s-revalidate?. I have no idea why, if you have
any clue please leave a comment below.

Mixing Values

You can combine these directives in different ways to achieve different

:
caching behaviors, however no-cache/no-store and
public/private are mutually exclusive.

If you specify both no-store and no-cache, no-store will be given

precedence over no-cache.

; If specified both Cache-Control: no-store, no-cache ; Below will

Cache-Control: no-store

For private/public, for any unauthenticated requests cache is

considered public and for any authenticated ones cache is
considered private.

Validators
Up until now we only discussed how the content is cached and how
long the cached content is to be considered fresh but we did not
discuss how the client does the validation from the server. Below we
discuss the headers used for this purpose.

ETag

Etag or “entity tag” was introduced in HTTP/1.1 specs. Etag is just a

unique identifier that the server attaches with some resource. This
ETag is later on used by the client to make conditional HTTP requests
stating "give me this resource if ETag is not same as the
:
ETag that I have" and the content is downloaded only if the etags
do not match.

Method by which ETag is generated is not specified in the HTTP docs

and usually some collision-resistant hash function is used to assign
etags to each version of a resource. There could be two types of etags
i.e. strong and weak

ETag: "j82j8232ha7sdh0q2882" - Strong Etag ETag: W/"j82j8232ha7sdh

Etag (prefixed with `W/`)

A strong validating ETag means that two resources are exactly same
and there is no difference between them at all. While a weak ETag
means that two resources although not strictly the same but could be
considered the same. Weak etags might be useful for dynamic content,
for example.

Now you know what etags are but how does the browser make this
request? by making a request to server while sending the available Etag
in If-None-Match header.

Consider the scenario, you opened a web page which loaded a logo
image with caching period of 60 seconds and ETag of abc123xyz.
After about 30 minutes you reload the page, browser will notice that
the logo which was fresh for 60 seconds is now stale; it will trigger a
:
request to server, sending the ETag of the stale logo image in if-
none-match header

If-None-Match: "abc123xyz"

Server will then compare this ETag with the ETag of the current version
of resource. If both etags are matched, server will send back the
response of 304 Not Modified which will tell the client that the copy
that it has is still good and it will be considered fresh for another 60
seconds. If both the etags do not match i.e. the logo has likely changed
and client will be sent the new logo which it will use to replace the stale
logo that it has.

Last-Modified

Server might include the Last-Modified header indicating the date

and time at which some content was last modified on.

Last-Modified: Wed, 15 Mar 2017 12:30:26 GMT

When the content gets stale, client will make a conditional request
including the last modified date that it has inside the header called If-
Modified-Since to server to get the updated Last-Modified date; if

it matches the date that the client has, Last-Modified date for the
:
content is updated to be considered fresh for another n seconds. If the
received Last-Modified date does not match the one that the client
has, content is reloaded from the server and replaced with the content
that client has.

If-Modified-Since: Wed, 15 Mar 2017 12:30:26 GMT

You might be questioning now, what if the cached content has both the
Last-Modified and ETag assigned to it? Well, in that case both are to
be used i.e. there will not be any re-downloading of the resource if and
only if ETag matches the newly retrieved one and so does the Last-
Modified date. If either the ETag does not match or the Last-
Modified is greater than the one from the server, content has to be

downloaded again.

Where do I start?
Now that we have got everything covered, let us put everything in
perspective and see how you can use this information.

Utilizing Server

Before we get into the possible caching strategies , let me add the fact
that most of the servers including Apache and Nginx allow you to
implement your caching policy through the server so that you don’t
:
have to juggle with headers in your code.

For example, if you are using Apache and you have your static content
placed at /static, you can put below .htaccess file in the directory
to make all the content in it be cached for an year using below

# Cache everything for an year Header set Cache-Control "max-age=3

public"

You can further use filesMatch directive to add conditionals and use
different caching strategy for different kinds of files e.g.

# Cache any images for one year

<filesMatch ".(png|jpg|jpeg|gif)$">
Header set Cache-Control "max-age=31536000, public"
</filesMatch>

# Cache any CSS and JS files for a month

<filesMatch ".(css|js)$">
Header set Cache-Control "max-age=2628000, public"
</filesMatch>

Or if you don’t want to use the .htaccess file you can modify Apache’s
configuration file http.conf. Same goes for Nginx, you can add the
:
caching information in the location or server block.

Caching Recommendations

There is no golden rule or set standards about how your caching policy
should look like, each of the application is different and you have to
look and find what suits your application the best. However, just to give
you a rough idea

You can have aggressive caching (e.g. cache for an year) on any
static content and use fingerprinted filenames (e.g.
style.ju2i90.css) so that the cache is automatically rejected
whenever the files are updated. Also it should be noted that you
should not cross the upper limit of one year as it might not be
honored

Look and decide do you even need caching for any dynamic
content, if yes how long it should be. For example, in case of some
RSS feed of a blog there could be the caching of a few hours but
there couldn’t be any caching for inventory items in an ERP.

Always add the validators (preferably ETags) in your response.

Pay attention while choosing the visibility (private or public) of the

cached content. Make sure that you do not accidentally cache any
user-specific or sensitive content in any public proxies. When in
doubt, do not use cache at all.

Separate the content that changes often from the content that
:
doesn’t change that often (e.g. in javascript bundles) so that when
it is updated it doesn’t need to make the whole cached content
stale.

Test and monitor the caching headers being served by your site.
You can use the browser console or curl -I http://some-url.com for
that purpose.

And that about wraps it up. Stay tuned for more!

Community
roadmap.sh is the 6th most starred project on GitHub and is visited by
hundreds of thousands of developers every month.

241k GitHub Stars Join on Discord

Roadmaps Best Practices Guides Videos Store YouTube

:
roadmap.sh by @kamrify

Community created roadmaps, articles, The leading DevOps resource for

resources and journeys to help you Kubernetes, cloud-native computing,
choose your path and grow in your and the latest in at-scale development,
career. deployment, and management.

Application Admin
No ratings yet
Application Admin
136 pages
Caching Tutorial
No ratings yet
Caching Tutorial
111 pages
Performance Comparison of Graph Database and Relational Database
No ratings yet
Performance Comparison of Graph Database and Relational Database
14 pages
Malachia Ormanian - The Church of Armenia - Her History, Doctrine, Rule, Discipline PDF
No ratings yet
Malachia Ormanian - The Church of Armenia - Her History, Doctrine, Rule, Discipline PDF
316 pages
Mcma 2433
No ratings yet
Mcma 2433
53 pages
Ab Initio - Intro
100% (1)
Ab Initio - Intro
43 pages
CSE-200 Accredited Services Architect Day 3 - Performance Slide
No ratings yet
CSE-200 Accredited Services Architect Day 3 - Performance Slide
63 pages
Muet Yearly Scheme of Work 2024
100% (5)
Muet Yearly Scheme of Work 2024
19 pages
Collarity, Inc. v. Google, Inc., C.A. No. 11-1103-MPT (D. Del. May 6, 2013)
No ratings yet
Collarity, Inc. v. Google, Inc., C.A. No. 11-1103-MPT (D. Del. May 6, 2013)
20 pages
The Themes of Quine S Philosophy Meaning Reference and Knowledge 1st Edition Edward Becker
No ratings yet
The Themes of Quine S Philosophy Meaning Reference and Knowledge 1st Edition Edward Becker
44 pages
TỪ VỰNG CHỦ ĐỀ TRAVEL AND TRANSPORT
No ratings yet
TỪ VỰNG CHỦ ĐỀ TRAVEL AND TRANSPORT
11 pages
Automata Theory Lec-02
No ratings yet
Automata Theory Lec-02
31 pages
Research Defense Template by Rome
No ratings yet
Research Defense Template by Rome
28 pages
Flexible Instruction Delivery Plan Template
100% (6)
Flexible Instruction Delivery Plan Template
4 pages
One Powerful Shiva Mantra Each Rashi Needs This Sawan
No ratings yet
One Powerful Shiva Mantra Each Rashi Needs This Sawan
16 pages
Accent NeutralizationV2.0
100% (1)
Accent NeutralizationV2.0
57 pages
Signals and Daemon Processes: UNIX Programming
No ratings yet
Signals and Daemon Processes: UNIX Programming
17 pages
RFQ Process
No ratings yet
RFQ Process
19 pages
Detailed Lesson Plan For Multigrade Classes in Grade 2 and 3
100% (1)
Detailed Lesson Plan For Multigrade Classes in Grade 2 and 3
4 pages
Conti Rossini, Turajev. Vitae Sanctorum Indigenarum. 1904. Volume 1 - Textus.
100% (1)
Conti Rossini, Turajev. Vitae Sanctorum Indigenarum. 1904. Volume 1 - Textus.
278 pages
Chapter - 06 - Positive - and - Neutral - Messages Without Answer
No ratings yet
Chapter - 06 - Positive - and - Neutral - Messages Without Answer
15 pages
NEP - A Path To Paradigm Shift
No ratings yet
NEP - A Path To Paradigm Shift
4 pages
WWW - AD-POWER - CN: Class-D Amplifier Module
No ratings yet
WWW - AD-POWER - CN: Class-D Amplifier Module
6 pages
Pembahasan SMP Bahasa Inggris - FSN 2024
No ratings yet
Pembahasan SMP Bahasa Inggris - FSN 2024
16 pages
HTTP 1 1
No ratings yet
HTTP 1 1
37 pages
When To Tell Your Kids About Client Caching
No ratings yet
When To Tell Your Kids About Client Caching
166 pages
Querying The Linked Data Graph Using Owl:Sameas Provenance
No ratings yet
Querying The Linked Data Graph Using Owl:Sameas Provenance
13 pages
MATHS 205 Final (2014-2015) (1st)
No ratings yet
MATHS 205 Final (2014-2015) (1st)
11 pages
Course Syllabus Structure in English
No ratings yet
Course Syllabus Structure in English
5 pages
Ioqm Practice Test-01: Instructions
No ratings yet
Ioqm Practice Test-01: Instructions
3 pages
O'Reilly - Web Caching
No ratings yet
O'Reilly - Web Caching
331 pages
In This Issue: September 1999 Volume 2, Number 3
No ratings yet
In This Issue: September 1999 Volume 2, Number 3
40 pages
LKPD Verb 2
No ratings yet
LKPD Verb 2
3 pages
Ivan M. Linforth Soul and Sieve in Plato's Gorgias. University of California Publications in Classical Philology Tate, J
No ratings yet
Ivan M. Linforth Soul and Sieve in Plato's Gorgias. University of California Publications in Classical Philology Tate, J
2 pages
5.6 Making Things Faster Returning Visits
No ratings yet
5.6 Making Things Faster Returning Visits
35 pages
Quiz 8
No ratings yet
Quiz 8
3 pages
CCN Lecture 5
No ratings yet
CCN Lecture 5
26 pages
Websec 1
No ratings yet
Websec 1
25 pages
How To Secure Your Web App With HTTP Headers - Smashing Magazine
No ratings yet
How To Secure Your Web App With HTTP Headers - Smashing Magazine
14 pages
TOC Caching
No ratings yet
TOC Caching
16 pages
Web Cache Entanglement
No ratings yet
Web Cache Entanglement
18 pages
Integratedcacheonnetscaler 120705075853 Phpapp02
No ratings yet
Integratedcacheonnetscaler 120705075853 Phpapp02
49 pages
ComputerNetworks mod4HTTP2smtp Q1 Etext2
No ratings yet
ComputerNetworks mod4HTTP2smtp Q1 Etext2
11 pages
HTTP - Header Fields
No ratings yet
HTTP - Header Fields
17 pages
API Documentation
No ratings yet
API Documentation
10 pages
Lecture 4
No ratings yet
Lecture 4
8 pages
Controlling Items in Client Caches: Elton Stoneman
No ratings yet
Controlling Items in Client Caches: Elton Stoneman
16 pages
A Beginners Guide To Caching in Drupal 8 - Valuebound
No ratings yet
A Beginners Guide To Caching in Drupal 8 - Valuebound
5 pages
F5 Ram Cache
No ratings yet
F5 Ram Cache
9 pages
HTTP - Web Caching and Conditional Request
No ratings yet
HTTP - Web Caching and Conditional Request
12 pages
High-Performance Web Sites
No ratings yet
High-Performance Web Sites
7 pages
HTTP Cache Control Headers
No ratings yet
HTTP Cache Control Headers
6 pages
General Headers Cache-Control: Cache-Control: Cache-Request-Directive - Cache-Response-Directive
No ratings yet
General Headers Cache-Control: Cache-Control: Cache-Request-Directive - Cache-Response-Directive
14 pages
Web Distribution Systems: Caching and Replication: Chandhok@cse - Wustl.edu
No ratings yet
Web Distribution Systems: Caching and Replication: Chandhok@cse - Wustl.edu
12 pages
Caching Methodologies
No ratings yet
Caching Methodologies
16 pages
Lab Exercise - HTTP: Objective
No ratings yet
Lab Exercise - HTTP: Objective
10 pages
Web Caching: Presented by
No ratings yet
Web Caching: Presented by
12 pages
HTTP Basics3
No ratings yet
HTTP Basics3
11 pages
Unit 2
No ratings yet
Unit 2
22 pages
Web Caching Archit
No ratings yet
Web Caching Archit
8 pages
The Fundamentals of HTTP: F5 White Paper
No ratings yet
The Fundamentals of HTTP: F5 White Paper
8 pages
Just A Name Following Some Syntax. URL & URN Are Subsets of URI
No ratings yet
Just A Name Following Some Syntax. URL & URN Are Subsets of URI
30 pages
Cookies Overview and HTTP Proxies
No ratings yet
Cookies Overview and HTTP Proxies
20 pages
Advance Programming Week 7-8
No ratings yet
Advance Programming Week 7-8
4 pages
HTTP
No ratings yet
HTTP
5 pages
Web Cache Poisoning PDF
No ratings yet
Web Cache Poisoning PDF
19 pages
Wcaching PDF
No ratings yet
Wcaching PDF
34 pages
HTML 5
No ratings yet
HTML 5
64 pages
Chapter 6
No ratings yet
Chapter 6
16 pages
Caching Behavior of Web Browsers: Browser Settings
No ratings yet
Caching Behavior of Web Browsers: Browser Settings
10 pages
HTTP Caching - HttpWatch
No ratings yet
HTTP Caching - HttpWatch
4 pages
Caching From URLs With A Query String
No ratings yet
Caching From URLs With A Query String
7 pages
For Content Publishers: Michael J. Radwin O'Reilly Open Source Convention July 28, 2004
No ratings yet
For Content Publishers: Michael J. Radwin O'Reilly Open Source Convention July 28, 2004
39 pages
Tugas Networking Baru
No ratings yet
Tugas Networking Baru
3 pages
Web Technology
No ratings yet
Web Technology
11 pages
Request Headers: Response Headers:: Know Your HTTP Headers!
No ratings yet
Request Headers: Response Headers:: Know Your HTTP Headers!
1 page
Web Cache
No ratings yet
Web Cache
3 pages
Varnish, Memcached, Redis, and HTTP Caching For Increased Web App Performance
No ratings yet
Varnish, Memcached, Redis, and HTTP Caching For Increased Web App Performance
4 pages
Theo Faith
No ratings yet
Theo Faith
14 pages
Working With The JavaScript Cache API
No ratings yet
Working With The JavaScript Cache API
7 pages
Speed Up Your Internet Access Using Squid's Refresh Patterns
No ratings yet
Speed Up Your Internet Access Using Squid's Refresh Patterns
5 pages
Us 17 Gil Web Cache Deception Attack WP
No ratings yet
Us 17 Gil Web Cache Deception Attack WP
16 pages
Caching
No ratings yet
Caching
12 pages
.2 Response Headers: 5. Caching For More Information
No ratings yet
.2 Response Headers: 5. Caching For More Information
8 pages
Web Cache: BY Sudhir Dama 08GE1A1251
No ratings yet
Web Cache: BY Sudhir Dama 08GE1A1251
17 pages
MICROSOFT AZURE ADMINISTRATOR EXAM PREP(AZ-104) Part-3: AZ 104 EXAM STUDY GUIDE
From Everand
MICROSOFT AZURE ADMINISTRATOR EXAM PREP(AZ-104) Part-3: AZ 104 EXAM STUDY GUIDE
Devi Prasad
No ratings yet
Hashicorp Certified Vault Associate Certification Case Based Practice Questions - Latest Edition
From Everand
Hashicorp Certified Vault Associate Certification Case Based Practice Questions - Latest Edition
Exam OG
No ratings yet
Distributed Caching & Data Management: Mastering Redis, Memcached, And Apache Ignite Caching
From Everand
Distributed Caching & Data Management: Mastering Redis, Memcached, And Apache Ignite Caching
Rob Botwright
No ratings yet
AWS Certified Solutions Architect - Professional
From Everand
AWS Certified Solutions Architect - Professional
VB Dev
No ratings yet
JSP-Servlet Interview Questions You'll Most Likely Be Asked
From Everand
JSP-Servlet Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet