Chapter 3 - Servers
Chapter 3 - Servers
Chapter 3
Servers
Chapter 3 Objectives
Determine strategies for providing
server resources.
Determine server hardware features:
enterprise and cluster.
Discuss server hardware specifications
when purchasing an individual server.
ITT 420
Chapter 3 Outline
Server Hardware Strategies
Server Hardware Features
Server Hardware Specifications
Server Hardware Strategies
Beautiful Snowflakes
A better strategy is to use a separate machine
for each service.
Each machine is sized for the desired
application: RAM, disk, number and speeds of
NICs, and enough extra capacity, or
expansion slots, for projected growth during
the expected life of the machine.
The benefit of this strategy is that the
machine is the best possible choice that
meets the requirements.
The downside is that the result is a fleet of
unique machines causing variation.
Each new system adds administrative
overhead proportionally
Server Hardware Strategies
Beautiful Snowflakes
Asset Tracking
When managing many unique machines it becomes
increasingly important to maintain an inventory of
machines.
The inventory should document technical information such
as the operating system and hardware parameters such as
amount of RAM, type of CPU. Also owner, running service
and contact person
Reducing Variations
Always be on the lookout for opportunities to reduce the
number of variations in platforms or technologies being
supported.
Select a default hardware vendor, model, and operating
system. Automated the configuration
Global Optimization
While it sounds efficient to customize each machine to the
exact needs of the service it provides, the result tends to be
an unmanageable mess.
Avoid unnecessary local machine customization
Server Hardware Strategies
Buy in Bulk, Allocate
Fractions
The next strategy is to buy computing resources
in bulk and allocate fractions of it as needed.
One way to do this is through virtualization.
organization purchases large physical servers
and divides them up for use by customers by
creating individual virtual machines (VMs).
A virtualization cluster can grow by adding more
physical hardware as more capacity is needed.
VMs can also be resized. You can add RAM,
vCPUs, and disk space to a VM via an API call
instead of a visit to the datacenter.
Virtualization improves computing efficiency.
Server Hardware Strategies
Buy in Bulk, Allocate
Fractions
virtualizationprovides better isolation than
simple multitasking. The benefits of
isolation include:
Independence: Each VM can run a different
operating system.
Resource isolation: The disk and RAM allocated
to a VM are committed to that VM and not
shared.
Granular security: A person with root access on
one VM does not automatically have privileged
access on another VM.
Reduced dependency hell: Each machine has
its own operating system and system libraries, so
they can be upgraded independently.
Server Hardware Strategies
Buy in Bulk, Allocate
Fractions
VM Management
keeping a good inventory of VMs is important.
you need to maintain an inventory of who owns
each VM and its purpose.
Some clusters are tightly controlled, only
permitting the IT team to create VMs.
Other clusters are general-purpose compute
farms providing the ability for customers to
request new machines on demand.
There should be limits in place so that customers
can’t overload the system by creating too many
VMs.
As in the other strategies, it is important to limit
the amount of variation.
Server Hardware Strategies
Buy in Bulk, Allocate
Fractions
Live Migration
VM can be moved from one physical host to
another while it is running.
Live migration makes management easier.
It can be used to rebalance a cluster, moving
VMs off overloaded physical machines to others
that are less loaded.
If a physical machine is having a hardware
problem, its VMs can be evacuated to another
physical machine.
The architecture of a typical virtualization cluster
includes many physical machines that share a
SAN for storage of the VM’s disks.
Server Hardware Strategies
Buy in Bulk, Allocate
Fractions
Live Migration
Server Hardware Strategies
Buy in Bulk, Allocate
Fractions
VM Packing
VMs cannot span physical machines.
As a consequence, we often get into situations where
the remaining RAM on a physical machine is not
enough for a new VM.
The best way to avoid this is to create VMs that are
standard sizes that pack nicely.
Spare Capacity for Maintenance
If a physical machine needs to be taken down for
repairs, there has to be a place where the VMs can be
migrated if you are to avoid downtime.
One strategy is to keep one physical machine entirely
idle so that, when needed, the VMs from the machine
to be repaired can all be migrated to this machine.
Another strategy is to distribute the spare capacity
around the cluster so that the individual VMs can
share the extra I/O bandwidth.
Server Hardware Strategies
Buy in Bulk, Allocate
Fractions
Unified VM/Non-VM Management
Most sites end up with two entirely different ways to
request, allocate, and track VMs and non-VMs.
It can be beneficial to have one system that
manages both.
Containers
Containers are another virtualization technique.
provide isolation at the process level instead of the
machine level.
All of the containers run under the same operating
system, but each container is self-contained as far
as the files it uses.
Containers are much lighter weight and permit more
services to be packed on fewer machines.
Docker, Mesos, and Kubernetes are popular
systems for managing large numbers of containers.
Server Hardware Strategies
Buy in Bulk, Allocate
Fractions
Containers vs. Virtualization
Server Hardware Strategies
Grid Computing
Grid computing takes many similar machines and manages
them as a single unit.
Each one is configured exactly alike—same hardware and
software.
To use the grid, a customer specifies how many machines
are needed and which software package to run.
The grid management system allocates the right number of
machines, installs the software on them, and runs the
software.
A big part of grid management software is the scheduling
algorithm.
Server Hardware Strategies
Grid Computing
Grid computing has other constraints. Often there is
more bandwidth between machines on the same
rack, and less bandwidth between racks.
Grid computing is more efficient than virtualization
because it eliminates the virtualization overhead,
which is typically a 5 to 10 percent reduction in
performance.
Grids are easier to manage because what is done for
one machine is done for all machines.
Server Hardware Strategies
Blade Servers
Each machine needs to be racked, connected to
power, networked, and so on.
Blade servers reduce this overhead by providing
many servers in one chassis.
A blade server has many individual slots that take
motherboards, called blades, that contain either a
computer or storage.
Each blade can be installed quickly and easily
because you simply slide a card into a slot.
Another benefit of blade systems is that they are
software configurable.
Blade systems are most cost-effective when the
chassis lasts many years, enabling you to upgrade
the blades for the duration of its lifetime.
Server Hardware Strategies
Blade Servers
Server Hardware Strategies
Server Appliances
An appliance is a device designed specifically for a
particular task.
The computer world also has appliances:
file server appliances,
web server appliances,
email appliances,
DNS/DHCP appliances
Although a senior SA can engineer a system dedicated to
file service or email out of a general-purpose server.
Purchasing an appliance can free the SA to focus on other
tasks.
The other benefit of appliances is that they often have
features that can’t be found elsewhere.
Server Hardware Strategies
Hybrid Strategies
Most organizations actually employ a number of
different strategies.
Small organization has a few snowflakes and
possibly a few eggs in one basket.
Medium-size organizations usually have a
virtualization cluster plus a few snowflakes for
situations where virtualization would not work.
Large organization have a little of everything.
The only strategy we recommend against is an
organization trying to deploy all of these strategies
simultaneously.
Server Hardware Features
Server Reliability
Because servers need to be more reliable, and have
higher uptime than workstations.
one of the ways we prepare for equipment failure is to
buy server hardware with additional features for
reliability and data integrity.
servers should be housed in a restricted-access,
climate controlled environment, with protected power
—in other words, a computer room or data center.
Levels of Redundancy
N+1 redundancy is used when we wish to indicate that
there is enough spare capacity for one failure.
N+2 redundancy would mean there is enough spare
capacity for two failed power supplies.
For example, there may be two power supplies in the
system. Either can fail and the system keeps running.
Server Hardware Features
Server Reliability
Data Integrity
Hard disks and SSDs eventually wear out and die. we must
have a plan to deal with this eventuality.
RAID
By using RAID (Redundant Array of Independent Disks) levels
1 and higher. When a disk fails, the system keeps running and
we have time to replace the disk.
RAID 1 stores all data twice, once on each disk of a two-disk
mirror. If one disk fails, the system can simply rely on the
remaining disk. If the second disk fails before restoration is
complete, we lose all data and must restore data from
backups.
RAID levels 2 and higher are similar but create redundancy
in ways that are more efficient, have better performance, or
can survive two disk failures.
Server Hardware Features
Server Reliability
Non-RAID Approaches
The alternative to RAID is to assume data integrity is
handled elsewhere.
For example, suppose a group of redundant web servers
all contain the same data, which they receive from a
master.
The web servers themselves do not need RAID because if
a disk fails, the web server is simply shut down for
repairs and the traffic is divided among the remaining
web servers.
Another example of handling data integrity elsewhere is
a distributed storage system such as Google’s GFS,
Hadoop’s HDFS, Cassandra, and others.
These systems store copies of data on many hosts,
essentially creating RAID-like storage that is distributed
among many machines, not just many disks.
Server Hardware Features
Server Reliability
Hot-Swap Components
Hot-swap refers to the ability to add, remove, and
replace a component while the system is running.
Hot-swappable components increase the cost of a
system.
This additional cost is justified when it eliminates
downtimes for expansion or repairs.
The more quickly a failed component can be replaced,
the better.
RAID systems usually run more slowly until a failed
component has been replaced and the RAID set has been
rebuilt.
Server Hardware Features
Server Reliability
Servers Should Be in Computer Rooms
Servers should be installed in an environment with
proper power, fire protection, networking, temperature
and humidity control, and physical security.
The two most important points are power and cooling.
For reliable operations, servers need power that is
reliable and clean.
Servers are designed to run at a particular operating
temperature, usually around 10 to 35◦C (50 to 95◦F).
it is best to mount it in the rack immediately before
installing the OS and other software.
A telecom closet with good cooling and a door that can
be locked is better than having your company’s payroll
installed on a server sitting under someone’s desk.
Server Hardware Features
Network Interfaces
Servers often have multiple network interface cards (NICs)
because they are connected to different networks,
Or because they are ganged or grouped together for
increased bandwidth or resiliency.
A server with many clients needs more bandwidth than one
with fewer clients, assuming all clients generate the same
amount of bandwidth.
Sometimes multiple NICs are grouped together to provide
more bandwidth or more resiliency.
For example, two 10 Gbps NICs might connect to the same
switch, providing 20 Gbps aggregate bandwidth. This
practice is often called bonding, or teaming, the available
bandwidth.
Server Hardware Specifications
Power Supplies
The second-most failure-prone component in a server is the
power supply
It is very common to have N+1 redundant power supplies
so that if one fails, the system can keep running.
Each power supply should also have a separate power cord.
Each power supply should draw power from a different
source: a separate circuit or uninterruptible power supply
(UPS).
Generally each power distribution unit (PDU) in a data
centre is its own circuit, so plugging each power cord into a
different PDU assures two power sources.
Server Hardware Specifications