
2011 IEEE International Symposium on Workload Characterization

Austin, Texas
6 November 2011

Energy-Efficient Data Centers and Systems

Charles Lefurgy, Malcolm Allen-Ware, John Carter, Wael El-Essawy, Wes


Felter, Alexandre Ferreira, Wei Huang, Anthony Hylick, Tom Keller, Karthick
Rajamani, Freeman Rawson and Juan Rubio

© 2011 IBM Corporation



Data center energy matters

 In 2005, data centers accounted for
   – 1.0% of world-wide energy consumption
   – 1.2% of US energy consumption
   – Consumption doubled between 2000 and 2005 (16% annual growth rate)
 Unsustainable

 Drivers of the DC crisis


– IT demand outpacing energy-efficiency
improvements
– Cloud services
– Escalating CMOS power density
– IT refresh is 5x faster than facilities
– Increasing energy costs

Sources: 1. Koomey, “Worldwide Electricity Used in Data Centers”, Environmental Research Letters, 2008
2. Report to Congress on Server and Data Center Energy Efficiency, U.S. Environmental Protection Agency, 2007
© 2011 IBM Corporation

Why listen to us?

 Technologies
   – 1st power measurement in a server product
   – 1st power capping in a server product
   – Power shifting
   – AMESTER
 Contributions to IBM products
   – POWER6 and POWER7 EnergyScale firmware
   – System x Active Energy Manager firmware
   – IBM Systems Director Active Energy Manager
 Energy optimization of customer data centers
 Patents and publications on power management (2005 – present)
   – 38 US patents awarded
   – 34 peer-reviewed publications

 [Photos: Malcolm Allen-Ware, Anthony Hylick, John Carter, Tom Keller, Wael El-Essawy,
 Charles Lefurgy, Wes Felter, Karthick Rajamani, Alex Ferreira, Freeman Rawson, Wei Huang,
 Juan Rubio]

3 © 2011 IBM Corporation



Schedule

Time Subject Presenter

8:00 AM Fundamentals Charles Lefurgy

10:00 AM BREAK

10:15 AM Storage Wes Felter


11:00 AM Networking Wes Felter

12:00 PM LUNCH

1:00 PM Cloud Karthick Rajamani


1:45 PM Energy-efficient software Karthick Rajamani
2:15 PM Modeling Juan Rubio

3:15 PM BREAK

3:30 PM Emerging technology Karthick Rajamani

4:30 PM END

4 © 2011 IBM Corporation


What is the problem?

5 © 2011 IBM Corporation


There are many dimensions to the problem

1. Very little of the delivered power is converted to useful work


2. Poor allocation of provisioned resources
– Over cooling – too many air conditioning units are on
– “Stranded power” – available power is fragmented across circuit breakers
3. Fixed capacity
– Reaching power and cooling limits of facility
– Peak capacity cannot always be increased with business growth
4. Power is a first-class design constraint for server design
– Peak power consumption requirements vary by market
– Component-level and system-level
– CMOS technology scaling no longer providing historic trends in energy-efficiency
5. Total cost of ownership
– Capital expenses (Building data center, buying equipment, etc.)
– Operational expenses (Energy costs, staffing, etc.)

6 © 2011 IBM Corporation


1. Very little of the delivered power is converted to useful work

 [Flow diagram, for 100 units of power delivered to the data center:]
   – Data center: HVAC, UPS, and other infrastructure consume ~45%; ~55% reaches the server
     hardware
   – Server hardware: ~70% goes to the power supply, fans, memory, and other components;
     ~30% goes to the processor
   – Processor: up to 95% of capacity/Watt is idle
   – Server loads: resources are used at 5–20% on average for the application load
     (typical usage rates: Mainframe 80–100%, UNIX 10–40%, Wintel 5–12%)

 *Data source: U.S. Department of Energy, May 18, 2007

 End-to-end initiatives at each stage:
   – Data center: reduce cooling and electrical distribution/conditioning costs vs. energy
     going to servers; capture heat at the source; more efficient cooling and energy supply
   – Server hardware: higher-efficiency infrastructure and management; better server
     hardware design and energy management
   – Processor: reduce energy consumption at the chip level; advanced processor design and
     process
   – Server loads: reduce idle/unused capacity that still consumes energy; enhance resource
     usage rate (consolidation/virtualization)
7 © 2011 IBM Corporation
2. Wasting provisioned resources -- Overcooling
 [Photo: Rackable CloudRack C2 (2009)]
 Spending more than required to operate
servers
 ASHRAE recommended inlet temperature
– 2004: 25 C max (35 C allowed)
– 2011: 27 C max (45 C allowed)
 Vendors test at higher temperatures
 Microsoft Dublin chillerless DC at 35 C

 [Chart: recommended humidity/temperature envelope, with allowable limits at 27ºC, 35ºC,
 40ºC, and 45ºC]
Source: 2011 Thermal Guidelines for Data Processing Environments – Expanded Data Center Classes and Usage Guidance, American Society of Heating, Refrigerating and Air-Conditioning Engineers
8 © 2011 IBM Corporation
Wasting provisioned resources
-- Stranded power
 Using nameplate power (worst-case) to allocate power on circuit breakers
– However, real workloads do not use that much power
– Result: available power is stranded and cannot be used
 Stranded power is a problem at all levels of the data center
 Example: IBM HS20 blade server – nameplate power is 56 W above real workloads.
 [Chart: measured server power for SPEC CPU2000 (two copies of each benchmark), SPECjbb,
 and LINPACK workloads, plus idle, compared to the nameplate rating. Nameplate power:
 308 W; stranded power: 56 W above the real workload maximum. Source: Lefurgy, IBM]
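As a hedged illustration of why this matters for provisioning, the sketch below compares how many of these blades fit on a hypothetical branch circuit when power is allocated by nameplate rating versus by the measured worst-case workload plus a safety margin. Only the 308 W nameplate and 56 W stranded-power figures come from the slide; the circuit capacity and 10% margin are assumptions for the example.

```python
# Hypothetical provisioning comparison (circuit size and margin are assumptions).
NAMEPLATE_W = 308.0          # from the slide
STRANDED_W = 56.0            # nameplate minus measured real-workload maximum (slide)
MEASURED_MAX_W = NAMEPLATE_W - STRANDED_W

CIRCUIT_CAPACITY_W = 5500.0  # assumed usable capacity of one branch circuit
SAFETY_MARGIN = 0.10         # assumed 10% headroom on top of the measured maximum

servers_by_nameplate = int(CIRCUIT_CAPACITY_W // NAMEPLATE_W)
servers_by_measurement = int(CIRCUIT_CAPACITY_W // (MEASURED_MAX_W * (1 + SAFETY_MARGIN)))

print(f"Allocated by nameplate:      {servers_by_nameplate} servers")
print(f"Allocated by measured power: {servers_by_measurement} servers")
```

With these assumed numbers, measurement-based allocation fits two more servers on the same circuit, which is exactly the stranded capacity the slide describes.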

9 Source: Charles Lefurgy, IBM © 2011 IBM Corporation


What are the consequences of wasted capacity?

 Run out of data center capacity


– Data center is considered “virtually full”
– Unnecessary expansion of DC results in large capital expense
– Disruption to business operations

 Maintaining under-utilized data centers is expensive


– Electric losses are higher at lower utilizations
– Inefficiency of power and cooling equipment at low loads

10 © 2011 IBM Corporation


3. Fixed capacity

 Peak Rack power


– Air-cooled rack peak power is roughly 30 kW
– Older DC cooling infrastructure may not support such high load
• Common to see empty slots in rack
 Peak DC power is a problem in some geographies
– Example: New York City
– Too expensive to expand power delivery
– Limits business growth

Power supply still a vexation for the NSA
“The spy agency has delayed the deployment of some new data-processing equipment because it
is short on power and space. Outages have shut down some offices in NSA headquarters for up
to half a day… Some of the rooms that house the NSA's enormous computer systems were not
designed to handle newer computers that generate considerably more heat and draw far more
electricity than their predecessors.”
-- Baltimore Sun, June 2007

Snafus forced Twitter datacenter move
“A new, custom-built facility in Utah meant to house computers that power the popular
messaging service by the end of 2010 has been plagued with everything from leaky roofs to
insufficient power capacity, people familiar with the plans told Reuters.”
-- Reuters, April 1, 2011
11 © 2011 IBM Corporation
4. Power is a first-class design constraint for servers

 Power and performance are top design parameters for microprocessors and servers
 Power constraints exist across all classes of computing equipment
– Laptop (30 – 90 W)
– Desktop (100s W)
– Server (200W - 5kW)
– Data center (1-20 MW)

 Peak power constraint means high performance requires high energy-efficiency

 Server components
   – CPU socket power
   – Memory power
   – Electrical cord limits
   – Power supply limits (physical size, reuse of standard designs)
   – Air cooling limits and fan noise considerations

12 © 2011 IBM Corporation


5. Total cost of ownership

 Tier-3 HPC data center for financial analytics in 2007


– 4.4 MW capacity (IT + cooling)
– $100M USD installed cost
– 1 W is spent on cooling and UPS for every 1 W for IT equipment
 Capital costs dominate operating cost
 Opportunity: better energy-efficiency can reduce facility capital costs
– Pack more revenue producing servers into existing facility
– Delay building new data center

 [Chart: annualized cost by component as a fraction of the total – roughly ¾ capital costs,
 ¼ operating costs]


Source: Koomey et al., A Simple Model for Determining True Total Cost of Ownership for Data Centers,
Version 2.1, Whitepaper, Uptime Institute, 2008
13 © 2011 IBM Corporation
Metrics

“If you can’t measure it, you can’t manage it”

14 © 2011 IBM Corporation


Metrics and benchmark overview

 Data center metrics


– PUE
– Green500
 Server benchmarks that require power measurement
– SPECPower_ssj2008 (2007)
– SPECweb2009 (2009)
– SAP server power benchmark (2009)
– ENERGY STAR Computer Server (2009)
 Server benchmarks with optional power measurement
– SPECvirt_sc2010
– TPC-C
– TPC-E
– TPC-H
 Storage benchmarks (in storage section)
– Storage Performance Council: SPC-1/E; SPC-1C/E (2009)
– SNIA (in development)
– ENERGY STAR Storage (in development)
 System benchmark
– SAP system power benchmark (in development)

15 © 2011 IBM Corporation


PUE: Power Usage Effectiveness

 Indicates the energy efficiency of the entire facility:

     PUE = Total facility power / IT equipment power
 Only data center metric widely recognized across industry
 Metric created by The Green Grid

 [Chart: Power Usage Effectiveness (PUE) measured for ~20 data centers. Typical existing
 DC: 1.7; best new DCs in 2010: < 1.2; best possible PUE: 1.0]

Source: Tschudi et al., "Measuring and Managing Energy Use in Data Centers." HPAC Engineering, LBNL/PUB-945, 2005.
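A minimal sketch of the PUE calculation from the formula above; the wattage figures in the usage line are made-up inputs, not measurements from the slide.

```python
def pue(total_facility_kw: float, it_equipment_kw: float) -> float:
    """Power Usage Effectiveness = total facility power / IT equipment power."""
    if it_equipment_kw <= 0:
        raise ValueError("IT equipment power must be positive")
    return total_facility_kw / it_equipment_kw

# Example with assumed numbers: 1,700 kW drawn by the whole facility,
# 1,000 kW of it reaching the IT equipment -> PUE = 1.7 (a "typical existing DC").
print(pue(total_facility_kw=1700.0, it_equipment_kw=1000.0))
```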

16 © 2011 IBM Corporation


Problems with PUE

 Appropriate metering is often not in place


 Score is dependent on weather, location, application, and tier level

 [Charts: measured PUE varies across fall, winter, and spring. Source: Hendrik Hamann, IBM]

 Rewards inefficiency in the server (e.g. poor AC/DC conversion, fan power is included)
 PUE is insufficient for “proving” and managing energy efficiency

© 2011 IBM Corporation


The Green500 list

 Energy-efficiency for High Performance Computing
   – Large clusters are costly to operate (ASC Purple @ 4.5 MW and $0.12/kWh ≈ $4.7M/year)
   – Site must be designed to supply power
 Green500 list reorders the Supercomputing TOP500 list by energy-efficiency
   – Metric: LINPACK performance / power for the computer
   – Does not include computer room cooling

 [Chart: efficiency (MFLOPS/Watt) vs. Green500 rank; the most efficient systems use
 accelerators or GPUs]

Green500 as of June 2011 (source: http://www.green500.org)


 Rank  MFLOPS/W  Site                                        Computer                                                  Power (kW)  TOP500 rank
 1     2097      IBM – Watson Research Center                NNSA/SC Blue Gene/Q Prototype 2                           41.0        109
 2     1684      IBM – Watson Research Center                NNSA/SC Blue Gene/Q Prototype 1                           38.8        165
 3     1375      Nagasaki U. – self-made                     Intel i5, ATI Radeon GPU, Infiniband QDR                  34.24       430
 4     958       GSIC Center, Tokyo Institute of Technology  HP ProLiant SL390s G7, Xeon 6C X5670, Nvidia GPU          1243.8      5
 5     891       CINECA/SCS – SuperComputing Solution        IBM iDataPlex DX360M3, Xeon 2.4, nVidia GPU, Infiniband   160         54

18 © 2011 IBM Corporation


Other proposed metrics from The Green Grid

 DCeP (Data Center energy Productivity)
     DCeP = Useful Work Produced / Total Data Center Energy Consumed
   – Hard to define a workload that can be run well across data centers

 ERE (Energy Reuse Effectiveness)
     ERE = (Cooling + Power Distribution + Lighting + IT − Reuse) / IT
   – Fixes PUE to show the benefit of reusing energy
   – Example: using hot water from the DC to heat a nearby building

 CUE (Carbon Usage Effectiveness)
     CUE = Total CO2 emissions caused by the total data center energy / IT equipment energy
   – Measures sustainability
   – Used in addition to PUE

 WUE (Water Usage Effectiveness)
     WUE = Annual site water usage / IT equipment energy
   – Measures water use
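A small sketch that evaluates the four metrics above from annual totals; the example values passed in at the bottom are illustrative assumptions, not data from the tutorial.

```python
def ere(cooling_kwh, power_dist_kwh, lighting_kwh, it_kwh, reused_kwh):
    """Energy Reuse Effectiveness: like PUE, but credits energy reused outside the DC."""
    return (cooling_kwh + power_dist_kwh + lighting_kwh + it_kwh - reused_kwh) / it_kwh

def cue(total_co2_kg, it_kwh):
    """Carbon Usage Effectiveness: kg CO2 per kWh of IT energy."""
    return total_co2_kg / it_kwh

def wue(site_water_liters, it_kwh):
    """Water Usage Effectiveness: liters of water per kWh of IT energy."""
    return site_water_liters / it_kwh

def dcep(useful_work_units, total_dc_kwh):
    """Data Center energy Productivity: useful work per unit of energy (workload-defined)."""
    return useful_work_units / total_dc_kwh

# Illustrative annual numbers (assumptions only):
print("ERE :", ere(cooling_kwh=4.0e6, power_dist_kwh=0.5e6, lighting_kwh=0.1e6,
                   it_kwh=8.0e6, reused_kwh=1.0e6))
print("CUE :", cue(total_co2_kg=6.0e6, it_kwh=8.0e6))
print("WUE :", wue(site_water_liters=1.2e7, it_kwh=8.0e6))
print("DCeP:", dcep(useful_work_units=5.0e9, total_dc_kwh=12.6e6))
```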

19 © 2011 IBM Corporation


SPECPower_ssj_2008

 Measures transaction-oriented servers
 Based on SPECjbb, a Java performance benchmark
 Exercises a range of load levels – not just peak
   – Self-calibration phases determine peak throughput on the system under test
   – Benchmark consists of 11 load levels: 100% of peak throughput down to idle, in 10% steps
   – Fixed time interval per load level
   – Random arrival times for transactions mimic realistic variations within each load level
 Primary benchmark metric: the sum of throughput over all load levels divided by the sum of
 power over all load levels (100% down to idle)

     overall metric = ( Σ throughput per level ) / ( Σ power per level )

 [Chart: normalized power, average utilization, and load level over time across the
 calibration phases and the 100%-to-idle measurement intervals. Source: Heather Hanson, IBM]
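A minimal sketch of the overall metric computation described above, using made-up per-level measurements (the numbers are assumptions, not results from any published run).

```python
# Hypothetical (throughput, average power in watts) pairs for the 11 load levels,
# ordered 100% down to 10%, plus active idle at the end.
levels = [
    (1_000_000, 310), (900_000, 295), (800_000, 280), (700_000, 265),
    (600_000, 250), (500_000, 235), (400_000, 220), (300_000, 205),
    (200_000, 190), (100_000, 175), (0, 160),  # active idle: zero throughput, nonzero power
]

total_throughput = sum(ops for ops, _ in levels)
total_power = sum(watts for _, watts in levels)

overall_ops_per_watt = total_throughput / total_power
print(f"overall throughput/watt = {overall_ops_per_watt:.1f}")
```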

20 © 2011 IBM Corporation


Energy Star

• United States Environmental Protection Agency


• Not a benchmark, but a government specification for compliance allowing
a manufacturer to use the Energy Star mark to help customers identify
energy-efficient products.
• Version 1 for Servers (2009)
• Power supply efficiency requirements under loading of 10%, 20%, 50%, and 100%
  (varies by power supply capacity)
• Idle power limits, depending on configuration
• Allowances made for extra components
• 8 W per additional hard drive
• Version 2 for Servers (in development)
• Expected to report workload energy efficiency over a range of
utilization levels

• Expanding to storage, UPS, and data center (in development)

21 © 2011 IBM Corporation


Issues with server benchmarks

 Lack of realism
– Do not include network and remote storage loads
• SAP System Power benchmark will include network and storage
– No task switching
– Very strong affinity

 Coverage of server classes


– Best SPECPower score likely on 1- and 2-socket servers with limited memory
– Robust (redundant) configurations are penalized
• Example: Today, dual power supplies reduce conversion efficiency

22 © 2011 IBM Corporation


Data center facilities

23 © 2011 IBM Corporation


A Typical Data Center Raised Floor
 [Diagram: racks (computers, storage, tape), networking equipment (switches), secured
 vault, network operating center, fiber connectivity on frame relay, terminating switch]

24 © 2011 IBM Corporation


Power delivery infrastructure for a typical large data center

25 © 2011 IBM Corporation


Data center power conversion efficiencies

 Typical conversion efficiency at each stage:
   – UPS (1):                88 – 92%
   – Power distribution (2): 98 – 99%
   – Power supply (3,4):     55 – 90%
   – DC/DC (5):              78 – 93%

The heat generated from the losses at each step of power


conversion requires additional cooling power

(1) http://hightech.lbl.gov/DCTraining/graphics/ups-efficiency.html
(2) N. Rasmussen. “Electrical Efficiency Modeling for Data Centers”, APC White Paper, 2007
(3) http://hightech.lbl.gov/documents/PS/Sample_Server_PSTest.pdf
(4) “ENERGY STAR® Server Specification Discussion Document”, October 31, 2007.
(5) IBM internal sources
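To see how these stages compound, here is a small sketch that multiplies per-stage efficiencies into an end-to-end figure; the specific values chosen below are illustrative picks from within the ranges above, not measurements.

```python
# Illustrative per-stage efficiencies taken from within the ranges on this slide.
stages = {
    "UPS": 0.90,
    "Power distribution": 0.985,
    "Power supply (AC/DC)": 0.80,
    "DC/DC regulators": 0.85,
}

overall = 1.0
for efficiency in stages.values():
    overall *= efficiency

# With these assumptions only ~60% of the power entering the UPS reaches the chip rails,
# and every lost watt must also be removed by the cooling plant.
print(f"End-to-end delivery efficiency: {overall:.1%}")
```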
26 © 2011 IBM Corporation
Cooling infrastructure for a typical large data center

 Sample chilled water circuit: two water loops

 Chilled water (CW) loop
   – Chiller(s) cool water, which is used by the CRAC(s) to cool down the air
   – Chilled water usually arrives at the CRACs near 10°C (50°F)

 Condensation water loop
   – Usually ends in a cooling tower
   – Needed to remove heat out of the facility

 [Diagram: CRAC, CW pump, chiller, condensation water pump, and cooling tower connected by
 the two loops]

27 © 2011 IBM Corporation


Raised floor cooling

 Racks
– Arranged in a hot-aisle cold-aisle configuration
 Computer room air conditioning (CRAC) units
– Located in raised-floor room or right outside of raised-floor room
– Blower moves air across the raised floor and across cooling element
– Most common type in large data centers uses chilled water (CW) from facilities plant
– Adjusts water flow to maintain a constant return temperature
– Often a subset of the CRACs on the raised floor also controls humidity

 [Diagram: rows of servers on the raised floor with CRAC units at either end]

28 © 2011 IBM Corporation


Raised floor cooling
 Hot-cold intermixing (conventional data center)
   – Cold air at inlet of air conditioner
   – Hot air at inlet of IT equipment

 Best practice is to separate hot and cold air, e.g. with hot-aisle containment

 [Thermal images: conventional data center vs. hot-aisle containment. Source: Hendrik
 Hamann, IBM]


29 © 2011 IBM Corporation
Commercial liquid cooling solutions for racks

 Purpose
– Reduce localized hotspots
– Allow higher power density in
older facilities
– Optimize cooling by rack
– No raised floor required
 Implementation
– Self-contained air cooling solution
(water or glycol for taking heat
from the air)
– Air movement
 Types
   – Enclosures – create a cool microclimate for selected 'problem' equipment
     (e.g. Liebert XDF™ Enclosure (1))
   – Sidecar heat exchanger – addresses rack-level hotspots without increasing the HVAC load
     (e.g. APC InfraStruXure InRow RP (2))

 (1) "Liebert XDF™ High Heat-Density Enclosure with Integrated Cooling",
     http://www.liebert.com/product_pages/ProductDocumentation.aspx?id=40
 (2) "APC InfraStruXure InRow RP Chilled Water",
     http://www.apcc.com/resource/include/techspec_index.cfm?base_sku=ACRP501

30 © 2011 IBM Corporation


Container data centers
Microsoft’s Chicago data center (2009)
Calculated annual PUE of 1.22
 Modular data center design
– IT equipment container: servers, storage, network
switch
– Physical infrastructure container: chiller,
UPS/batteries, etc.
– Site provides power, chilled water, and network
– Pre-assembled with multi-vendor equipment
 Benefits
– Pay as you grow
– No raised floor required, just a concrete slab
– Cheaper and quicker than retrofitting a data center
• Rapid delivery (2-3 months from order)
• Support high power density (34 kW/rack)

Photo source: CNET.com


31 © 2011 IBM Corporation
Outside air cooling

 Many geographies can use outside air for cooling


– Reduce or eliminate mechanical chillers
– Moderate filtration recommended
 Yahoo Compute Coop data center (2010)
– PUE 1.08 (with evaporative cooling)
– Oriented for prevailing winds
– 100% outside air cooling (no chillers)
– Server inlet air typically 23 C
– Use evaporative cooling above 26 C
• Servers reach 26 – 30 C for 34 hours/year
Yahoo Compute Coop in Lockport, NY

Source: Chris Page, "Air & Water Economization & Alternative Cooling Solutions – Customer
Presented Case Studies", Data Center Efficiency Summit, 2010
32 © 2011 IBM Corporation
Open Compute Project

 Facebook released specifications and mechanical


designs for data center and server (April 2011)

– Data center
• Electrical
• Mechanical
• Racks
• Battery cabinet

– Server
• Chassis
• Motherboard
• Power supply

– PUE 1.07 (Oregon, December 2010)

 http://opencompute.org

33 © 2011 IBM Corporation


IT equipment

34 © 2011 IBM Corporation


IT equipment

 Servers
 Storage
 Network

 [Pie chart: CPUs 33%, DRAM 30%, other 22%, disks 10%, networking 5%]

Server peak power by hardware component from a Google data center (2007)
Source: Luiz André Barroso and Urs Hölzle, The Datacenter as a Computer: An Introduction to the Design of
Warehouse-Scale Machines, Morgan & Claypool, 2009.

35 © 2011 IBM Corporation


Server components

 [Annotated photo: IBM POWER 755 server – expansion card slots, 4 processor cards,
 4 redundant removable fans, 2 redundant power supplies, tape drive bay, 8 removable disks,
 airflow direction]

 [Annotated photo: processor card – POWER7 processor, DDR3 DIMMs, DIMM buffers, processor
 card slots, connector]
36 © 2011 IBM Corporation
Address variability in hardware and operating environment

 Complex environment
– Installed component count, ambient temperature, component variability, etc.
– How to guarantee power management constraints across all possibilities?
 Feedback-driven control
– Capability to adapt to environment, workload, varying user requirements
– Regulate to desired constraints even with imperfect information

 [Control-loop diagram]
 Models
   – Estimate unmeasured quantities
   – Predict impact of actuators
 Sense
   – Real-time monitoring: power, temperature, performance, …
 Power management controller
   – Guarantee constraints
   – Find energy-efficient settings
 Actuate
   – Set performance state (e.g. frequency)
   – Set low-power modes (e.g. DRAM power-down)
   – Set fan speeds
37 © 2011 IBM Corporation
Sensors: temperature

 Thermal sensor key characteristics


– Accuracy and precision - lower values require higher tolerance margins for thermal
control solutions.
– Accessibility and speed - Impact placement of control and rate of response.

 Ambient measurement sensors


– Located on-board, inlet temperature, outlet temperature, at the fan e.g. National
Semiconductor LM73 on-board sensor with +/-1 deg C accuracy.
– Relatively slower response time – observing larger thermal constant effects.
– Standard interfaces for accessing include PECI, I2C, SMBus, and 1-wire

 On-chip/-component sensors
– Measure temperatures at specific locations on the processor or in specific units
– Need more rapid response time, feeding faster actuations e.g. clock throttling.
– Proprietary interfaces with on-chip control and standard interfaces for off-chip control.
– Example: POWER7 processor has 44 digital thermal sensors per chip
– Example: Nehalem EX has 9 digital thermal sensors per chip
– Example: DDR3 specification has thermal sensor on each DIMM

38 © 2011 IBM Corporation


Sensors: power

 AC power
– External components – Intelligent PDU, SmartWatt
– Intelligent power supplies – PMBus standard
– Instrumented power supplies
  (Example: IBM DPI PDU+)
 DC power
– Most laptops – battery discharge rate
– IBM Active Energy Manager – system power
– Measure at VRM
 Within a chip (core-level)
– Power proxy (model using performance counters)
 Sensor must suit the application:
– Access rate (second, ms, us)
– Accuracy
– Precision
– Accessibility (I2C, Ethernet)

39 © 2011 IBM Corporation


Sensors: activity and performance

 ‘Performance’ Counters
– Traditionally part of processor performance monitoring unit
– Can track microarchitecture and system activity of all kinds
– A fast feedback for activity, have also been shown to serve as potential proxies for
power and even thermals
– Example: Instructions fetched per cycle
– Example: Non-halted cycles

 Resource utilization metrics in the operating system


– Serve as useful input to resource state scheduling solutions for power reduction

 Application performance metrics


– Best feedback for assessing power-performance trade-offs
– Example: Transactions per Watt.

40 © 2011 IBM Corporation


Actuators: microprocessor active states

 Dynamic voltage and frequency scaling (DVFS) in modern server processors.


– Called “P-states” (performance states)
 Today, voltage typically shared across cores on a chip. Cores have independent clocking
and may use different frequencies.

 [Chart: POWER7 DVFS measured on 4 early samples – normalized socket power vs. normalized
 socket frequency]

Source: Karthick Rajamani, IBM
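A hedged sketch of why DVFS saves more than proportionally to frequency: it assumes the textbook split of socket power into a static part and a dynamic part that scales as V²·f, with voltage reduced linearly alongside frequency. The coefficients below are illustrative assumptions, not values fitted to the POWER7 data in the chart.

```python
def normalized_socket_power(f_norm: float,
                            static_fraction: float = 0.3,
                            v_min_norm: float = 0.7) -> float:
    """Estimate socket power, normalized to the maximum-frequency operating point.

    Assumptions: dynamic power ~ V^2 * f, static (leakage) power ~ V,
    and voltage scales linearly from v_min_norm at f=0 up to 1.0 at f=1.0.
    """
    v_norm = v_min_norm + (1.0 - v_min_norm) * f_norm
    dynamic = (1.0 - static_fraction) * (v_norm ** 2) * f_norm
    static = static_fraction * v_norm
    return dynamic + static

for f in (1.0, 0.8, 0.6, 0.4):
    print(f"f = {f:.1f} -> power ≈ {normalized_socket_power(f):.2f} (normalized)")
```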


41 © 2011 IBM Corporation
Actuators: turbo mode

 Turbo frequencies are available on Intel and IBM microprocessors


– Opportunistic performance boosting beyond nominal (guaranteed level)
– When power and thermal headroom is available

– Example: Intel Turbo Boost when there is power headroom

Source: Intel Developer Forum, 2010


42 © 2011 IBM Corporation
Actuators: turbo mode

 Example of turbo boost when there is thermal headroom

Source: Intel Developer Forum, 2010


43 © 2011 IBM Corporation
Actuators: microprocessor idle states

 Use energy-efficient idle states


when OS cannot schedule work
– Waiting on IO
– Waiting for server transactions
– Request batching
 Trade-off power reduction with
latency to invoke and wakeup
 Core-level and chip-level states
– Once all cores are in a low state,
chip can go to next lowest state
– Example: Chip can reduce
voltage to retention level, only
when all cores are idle.

Intel Nehalem idle states


Source: Intel Developer Forum, 2008

44 © 2011 IBM Corporation


Race-to-idle vs. Just-in-time

 Two strategies to save energy. Both are useful.


 Race-to-idle
– Complete work as fast as possible and go into lowest power idle state (or off)
– Concern: wake-up time can be long (minutes to boot server and reload cache)
– Opportunity: more granular idle modes with different wake-up times
– Example: OS idle loop using idle states
 Just-in-time
– Complete work slowly to just meet deadlines (or service level agreement)
• Useful if running CPU faster does not complete work faster (memory-bound task)
– Example: Use DVFS to match CPU speed to memory bandwidth
– Useful when idle modes are not available or wake-up time is too long

 [Diagram: power vs. time for the two strategies – race-to-idle runs Task A and Task B at
 full power and idles between them; just-in-time runs each task at lower power so it
 finishes right at its deadline]
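A small sketch comparing the energy of the two strategies for one task under a toy power model; all the numbers (idle power, active power at each speed, work size) are assumptions chosen only to make the trade-off concrete.

```python
# Toy model: a task needs WORK cycles before its deadline; the CPU can run at full or
# half speed. Energy = active power * busy time + idle power * remaining time.
WORK = 1.0e9          # cycles (assumed)
DEADLINE_S = 2.0      # seconds (assumed)
P_IDLE = 20.0         # W in the idle state (assumed)
P_FULL = 100.0        # W at full speed, 1.0 GHz (assumed)
P_HALF = 55.0         # W at half speed, 0.5 GHz (assumed; > P_FULL/2 due to static power)

# Race-to-idle: run at full speed, then drop to the idle state.
t_busy_full = WORK / 1.0e9
e_race = P_FULL * t_busy_full + P_IDLE * (DEADLINE_S - t_busy_full)

# Just-in-time: run at half speed so the task finishes exactly at the deadline.
t_busy_half = WORK / 0.5e9
e_jit = P_HALF * t_busy_half

print(f"race-to-idle: {e_race:.0f} J, just-in-time: {e_jit:.0f} J")
# With these assumptions just-in-time wins (110 J vs. 120 J); with a deeper idle state
# (lower P_IDLE) or a less efficient low-speed point, race-to-idle can win instead.
```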
45 © 2011 IBM Corporation
Actuators: DRAM

 Memory consumes power as soon as plugged in


– Idle power no longer negligible
– Idle power increases with DIMM size
DDR3 Background Power is Large!
 Power increases with access rate to memory
 Power modes
– Powerdown
– Self-refresh
 Power down
– Power is ~10-20% less than active standby
– Wake-up penalty (7-50ns)
– Power down groups of ranks within a DIMM
– DRAM Idle, IO Circuits off, Internal Clock Off,
DLL (Delay-Locked Loop) Frozen
– Needs Refresh
 Self Refresh
– Power ~60%-70% less than active standby
– Wake-up penalty: 600 – 1300 ns
– Put whole DIMM into self-timed refresh * Fig. from Micron TN-41-01
• Hub chip also in lower-power state
– DRAM Idle, IO Circuits off, Internal Clock Off,
DLL Off
– Needs No Refresh

Source: Kenneth Wright (IBM) et al., “Emerging Challenges in Memory System Design”, tutorial, 17th IEEE
46
International Symposium on High Performance Computer Architecture, 2011. © 2011 IBM Corporation
Thermal constraints

 JEDEC defines max allowable temperature for DRAM


– Source: Micron, TN-00-08, Thermal Applications
• Functional limit: 95 C (industrial applications)
• Reliability limit: 110 C (prevent permanent damage)
 Inputs:
– System thermals (air flow, inlet air, fan ramp, preheat)
– DRAM current / power (self heat)
 Constraints / Worst case:
– Fan fail (20-25 C increase)
– DIMM position (depending on air flow, etc.)
– Inlet air (Processor speed, processor heat sink)
 Solutions:
– Use low power modes
– Throttling (limiting max bandwidth limits active current)
– Double refresh (≥ 85C)

Source: Kenneth Wright (IBM) et al., “Emerging Challenges in Memory System Design”, tutorial, 17th IEEE
47
International Symposium on High Performance Computer Architecture, 2011. © 2011 IBM Corporation
Power variability
Source: Karthick Rajamani, IBM

 Power can vary due to manufacturing of same part (processor leakage power)
 Power consumption is different across vendors
 Memory power specifications for DDR2 parts with identical performance specifications
 [Bar chart: normalized current/power for five vendors – a 2X difference in maximum active
 current (Idd7) and a 1.5X difference in maximum idle current (Idd3N)]
48 © 2011 IBM Corporation
Important concepts
for energy management

49 © 2011 IBM Corporation


Energy proportional computing

 Popularized by Google (2007)

 Definition: Energy consumed is proportional with work completed


– Ideal: When no work is performed, consume zero power
– Reality: When servers are idle, their power consumption is significant

 Today, it is still not uncommon for idle servers to consume 50% of their peak power.
– Many components (memory, disks) do not have a wide range for active states.
– Power supplies are not highly efficient at every utilization level.

 Can apply to data centers, servers, and components

50 © 2011 IBM Corporation



Energy proportional computing

“The Case for


Energy-Proportional
Computing,”
Luiz André Barroso,
Urs Hölzle,
IEEE Computer
December 2007

Energy Efficiency =
Utilization/Power

Figure 2. Server power usage and energy efficiency at varying utilization levels, from
idle to peak performance. Even an energy-efficient server still consumes about half its
full power when doing virtually no work.

© 2011 IBM Corporation



Energy proportional computing

 “The Case for Energy-Proportional Computing,” Luiz André Barroso, Urs Hölzle, IEEE
 Computer, December 2007

 “Even well-managed servers have a hard time achieving decent levels of utilization.”

 [Figure 1: Average CPU utilization of more than 5,000 servers during a six-month period.
 Servers are rarely completely idle and seldom operate near their maximum utilization,
 instead operating most of the time at between 10 and 50 percent of their maximum
 utilization levels. These averages are found outside the cloud.]
© 2011 IBM Corporation
Virtualization – opportunities for power reduction

 Virtualization enables more effective resource utilization by consolidating multiple low-


utilization OS images onto a single physical server.
 Multi-core processors with virtualization support and large SMP systems provide a growing
infrastructure which facilitates virtualization-based consolidation.
 The common expectation is that multiple, less energy-efficient, under-utilized systems can
be replaced with fewer, more energy-efficient, higher performance systems for
– A net reduction in energy costs.
– Lower infrastructure costs for power delivery and cooling.
 More in cloud section

 [Diagram: several servers, each 10% utilized, consolidated onto a server at 80% utilization]

53 © 2011 IBM Corporation


Workload-optimized systems

 Specialize hardware/software to run a workload efficiently


– Remove components required for general purpose computing
– Size components (memory/processor/network) to workload
– ASIC
– FPGA
– Custom instruction sets,
– Accelerators (hardware + SW libraries)

 Examples in emerging technologies section

54 © 2011 IBM Corporation


Power capping

 Power capping is a method to control peak power consumption


– High power use → slow down; low power use → speed up
– Can be applied at many levels: components, servers, racks, data center
– Control-theoretic approach (see the sketch after this list)

 [Block diagram: the desired power consumption and the measured power feed a PI controller,
 which sets the throttle level of the component; the component's power measurement closes
 the loop]

 Requirements
   – Precision measurement of power
• Measurement error translates to lost performance
– Components with multiple power-performance states
• Example: microprocessor voltage and frequency scaling
 Impact
– Power capping provides safety @ worst-case power consumption
– Allows IT equipment to oversubscribe available power
(better performance @ typical-case power)
– Stranded power is reduced (lowering cost)
– Power delivery is designed for typical-case power (lowering cost)
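A minimal sketch of the control loop described above: a PI controller that compares measured power to the cap and adjusts a normalized throttle level (e.g. a DVFS setting) each control interval. The gains, interval, and the toy measure_power model are assumptions for illustration, not values from any IBM product.

```python
class PowerCapController:
    """Toy PI controller: drives measured power toward a power cap by throttling."""

    def __init__(self, cap_watts, kp=0.0005, ki=0.0002):
        self.cap = cap_watts          # desired power consumption (W)
        self.kp, self.ki = kp, ki     # assumed PI gains, tuned only for this toy model
        self.integral = 0.0
        self.throttle = 1.0           # 1.0 = full speed, 0.1 = maximum throttling

    def step(self, measured_watts):
        error = self.cap - measured_watts        # negative when over the cap
        self.integral += error
        self.throttle += self.kp * error + self.ki * self.integral
        self.throttle = min(1.0, max(0.1, self.throttle))   # clamp to the actuator range
        return self.throttle

def measure_power(throttle):
    """Stand-in for a real power sensor: idle power plus a throttle-dependent part."""
    return 300.0 + 900.0 * throttle              # assumed server power model (W)

ctrl = PowerCapController(cap_watts=1000.0)
power = measure_power(ctrl.throttle)
for interval in range(10):                       # e.g. one step per 8 ms control interval
    throttle = ctrl.step(power)
    power = measure_power(throttle)
    print(f"interval {interval}: throttle={throttle:.2f}  power={power:.0f} W")
```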

55 © 2011 IBM Corporation


Power capping – Example

 IBM Power 750 Express server


 Caps when a redundant power supply fails or the customer sets a power cap target
 Every 8 ms, measure system power and adjust processor voltage and frequency to meet the
  power cap
 Drop the power cap from 2 kW to 1 kW
 Settles to 2% of target in 16 ms (2 control intervals)

 [Charts: system power after the cap is set to 1 kW, settling in 16 ms, and the
 corresponding processor frequency, which shows a small undershoot. Source: Lefurgy et al.,
 ICAC 2007]

56 © 2011 IBM Corporation


Power shifting

 Set a power budget on every component to control aggregate power


 Shifting = dynamically adjusting power budgets to improve aggregate performance
– Requires performance measurement from components

 Example: power shifting across CPU and memory for a PPC 970 computer
   – Points show execution intervals of many workloads with no limit on power budget
   – "Static" encloses the unthrottled intervals for a 40 W budget: 27 W CPU, 13 W memory
   – "Dynamic" encloses the unthrottled intervals for a 40 W budget with power shifting
     • Better performance than the static design for 40 W
     • Lower cost than a conventional 78 W power supply design

 [Chart: CPU power vs. memory power per interval, showing the maximum expected memory
 power, maximum expected CPU power, the conventional CPU budget (78 Watts), and the static
 and dynamic regions for the 40 W budget]

57 © 2011 IBM Corporation


Shift power to where it is consumed efficiently

 Opportunities
   – Intra-chip: shift between function units, cores, cores ↔ caches
   – Intra-node: shift between processors, processors ↔ DRAM, leakage/fans
   – Intra-rack: shift between nodes, storage ↔ compute, disaggregated DRAM
   – Intra-data center: cross-node optimization (placement, migration, consolidation)
   – Across data centers: time shifting, power arbitrage, enhanced reliability

58 © 2011 IBM Corporation


Take away

 Data center and server power management addresses many problems


– Huge costs for cooling and power delivery
– Constraints on achievable performance
– Constraints on data center capacities limiting IT growth
 Governments, data center operators, and IT vendors are all engaged
– New benchmarks and metrics (PUE, SPECPower)
– New standards under development
 In the last 10 years, we have seen considerable innovation in data center design
– Containers, outside air, etc.
 Data centers and servers are becoming more instrumented
– Many sensors and actuators allow for adaptive, flexible behavior
– Power capping, shifting
– Virtualization and dynamic consolidation
 Many techniques for managing power and cooling
– Consolidation, workload-optimized systems, capping, shifting, etc.

59 © 2011 IBM Corporation


Selected Reading

 What is the problem?


– US EPA, Report to Congress on Server and Data Center Energy Efficiency, August 2007.
http://www.energystar.gov/ia/partners/prod_development/downloads/EPA_Datacenter_Report_Congress_Final1.pdf
– American Society of Heating, Refrigerating and Air-Conditioning Engineers http://www.ashrae.org
– James Hamilton’s blog http://perspectives.mvdirona.com/
 Metrics
– The Green Grid http://www.thegreengrid.org
– W. Feng and K. Cameron, “The Green500 List: Encouraging Sustainable Supercomputing”, IEEE
Computer, December 2007. http://www.computer.org/portal/web/csdl/doi/10.1109/MC.2007.445
 Data center and servers
– Luiz André Barroso and Urs Hölzle, The Datacenter as a Computer: An Introduction to the Design of
Warehouse-Scale Machines, Morgan & Claypool, 2009.
– Michael Floyd et al., “Introducing the Adaptive Energy Management Features of the POWER7 Chip”,
IEEE Micro, March/April, 2011.
 Energy proportional computing
– Luiz André Barroso, and Urs Hölzle, “The Case for Energy-Proportional Computing,” IEEE Computer,
December 2007.
 Power capping and shifting
– Charles Lefurgy, Xiaorui Wang, and Malcolm Ware, "Power capping: a prelude to power shifting",
Cluster Computing, Springer Netherlands, November 2007.
– W. Felter, K. Rajamani, T. Keller, C. Rusu, “A Performance Conserving Approach for Reducing Peak
Power in Server Systems”, ICS 2005.

60 © 2011 IBM Corporation


Storage

61 © 2011 IBM Corporation


Storage power problem looming right behind memory power
(and it’s already dominant in some environments)

 Problem: Many enterprise data centers spend upwards of 40% of their IT power on storage
– SAS (15K, FC, …) drives are fastest,
but highest power and cost
– Optimizing performance can drive low
resource utilization (spread data across many spindles)
– Other parts of the data center are becoming
more energy proportional but storage is not
– Optimizing for better energy efficiency can potentially
reduce performance

 Standards bodies are including power and energy metrics along with performance
– Storage Performance Council (SPC)
– SNIA
– EPA

More customer attention to power and energy saving features at


purchase time
62 © 2011 IBM Corporation
Storage power problem looming right behind memory power
(and it’s already dominant in some environments)

 Opportunities:
– Move away from high-cost, high-power enterprise SAS/FC drives
– Consolidation (fewer spinning disks = less energy, but less throughput)
– Hybrid Configurations (Tiering/Caching): Replace power-hungry SAS with SATA (for capacity) and
flash/PCM (for IOPS)
•SATA consumes ~60% lower energy per byte
•Flash can deliver over 10X SAS performance for random accesses
•Flash has lower active energy and enables replacing SAS with SATA and spindown
(by absorbing I/O activity)
•Issue: But what data should be placed in what storage technology and when?
– Opportunistic spindown
– Write offloading
– Deduplication/Compression

 Challenge: IO-intensive applications have strict


latency and bandwidth requirements

63 © 2011 IBM Corporation


Storage Background – What Customers Want

 Availability, reliability, redundancy, accessibility, fault-tolerance
 Low latency, throughput, capacity
 Feature-rich (extensible management options via simple software)

 All at low cost, low power, and low energy!!!


64 © 2011 IBM Corporation
We’re moving in the right direction

Major trend: High-speed enterprise drives  high-capacity SATA drives and


SSDs
- SATA drives have significantly lower price per GB, and also Watt per GB

 Yesterday
   - Excessive power: no spin-down; fans and controllers
   - Costly: 15K RPM SAS; wasted capacity
   - Wasted capacity: RAID configuration; short-stroking

 Today
   - Reduced power: aggressive spindown; system power management; flash to absorb I/O
   - Reduced cost: SATA + SSD; storage virtualization
   - Increased capacity: dense SATA drives; no short-stroking; deduplication
65 © 2011 IBM Corporation


Where does the power go?
 Data center: IT equipment 41%, power distribution 28%, cooling 31%
 IT equipment: 1U/2U+ compute servers 42%, storage 34%, comms 23%
 Components: CPUs 31%, CPU VR 8%, memory 6%, HDD/RMSD 13%, PSU loss 20%, misc 22%

 Source: Dell measurements, presented at VMworld 2007


66 © 2011 IBM Corporation
Storage Growth

“…Total disk storage systems capacity shipped reach 3,645


petabytes, growing 54.6% year over year.”
– International Data Corporation (IDC) about 2Q10
source (press release):
http://www.idc.com/about/viewpressrelease.jsp?containerId=prUS22481410&sectionId=null&elementId=null&pageType=SYNOPSIS

http://www.emc.com/collateral/analyst-reports/diverse-exploding-digital-universe.pdf
67 © 2011 IBM Corporation
Storage Power Capping

 Goal: Add more storage for the same power capacity


 Massive Arrays of Idle Disks (MAID)
– Increase the number of spindles for better throughput but manage for power and energy
– Much academic work and included in some products
 Disk Acoustic Modes (i.e., slowing down the seek arm)
– Seek power is the result of current consumption through the voice coil motor during
positioning of the read/write heads
– Reducing the current through the voice coil motor reduces the actuator arm speed
– Increases seek times (lower performance)
 Throttling of I/O
– Reducing the amount of work sent to the
storage system to keep drives in idle
states
 Lower Power States during Idle periods
– Placing drives in standby mode when possible

68 © 2011 IBM Corporation


Storage Energy Saving

 Goal: Reduce energy consumption and save money on energy costs


 Massive Arrays of Idle Disks (MAID)
– Much academic work and included in products
 Replace Storage Media and/or hybrid storage configurations
– Removing mechanical disks altogether or placing them in lower power states

 all-SAS: 100% energy; all-SATA: 45% energy; all-SATA + SSD: 16% energy
 (*normalized to the all-SAS configuration)

69 © 2011 IBM Corporation


Tiering vs. Caching
 Tiering
– Tiers represent the trade-off between performance, capacity, and power
– Data lives in either one tier or the other, but never simultaneously occupying both
• Capacity increases slightly
– Some are proposing more than 2 tiers
• e.g., flash → 15k → 10k, or flash → disk with no spindown → disk with spindown
– Decisions about where data lives are made at time intervals from 30 minutes to hours
typically
• Due to this timescale, tiering can’t keep up with fast changes in workloads
 Caching
– Cache’s purpose is to increase performance
– Data in the cache is a copy of data in the other tier or (in the case of a write) is about to
be copied back to the other tier
– Decisions about where data lives are made at each access (read or write)
• Responsive to immediate changes in workload
 [Diagrams: incoming I/O requests that miss in any buffer caches and controller DRAM go
 either to the SSD tier or to the disk (tiering), or to the SSD cache in front of the disk
 (caching)]

70 © 2011 IBM Corporation


Performance Need Not Suffer

Empirical experiments 8-disk RAID-6 SAS array vs. 8-disk RAID-6 SATA array + SSD
cache

 For the medium-duty workloads we used, flash…

 offers more spindown


opportunity
 matches or exceeds the
performance of SAS

71 © 2011 IBM Corporation


To Spin Down or Not to Spin Down…(workloads matter)

 Some environments/workloads are already tuned with the right spindown approach
– e.g., backup/archive (MAID)
 Some environments/workloads cannot tolerate any multi-second latency from a spinup delay
– e.g., OLTP
 Spindown is appropriate for medium-duty workload environments
– e.g., email, virtualization, filers

 Caution: do not exceed the rated spindown cycles
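As a rough way to reason about when spindown pays off, the sketch below computes a break-even idle time from a simple energy model: spinning down is only worthwhile if the expected idle period is longer than the energy cost of stopping and restarting the spindle divided by the power saved while stopped. All drive parameters below are illustrative assumptions, not vendor specifications.

```python
# Assumed drive parameters for illustration only.
P_IDLE_W = 8.0          # power while spinning but idle
P_STANDBY_W = 1.0       # power while spun down
E_SPINDOWN_J = 10.0     # energy to park/stop the spindle
E_SPINUP_J = 120.0      # energy to restart the spindle
SPINUP_DELAY_S = 10.0   # latency penalty the workload must tolerate

# Break-even idle time: idle period beyond which spinning down saves energy.
break_even_s = (E_SPINDOWN_J + E_SPINUP_J) / (P_IDLE_W - P_STANDBY_W)
print(f"Spin down only for idle gaps longer than ~{break_even_s:.0f} s "
      f"(and only if a {SPINUP_DELAY_S:.0f} s spinup delay is tolerable)")
```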

72 © 2011 IBM Corporation


Software technologies

 Thin-provisioning
– Software tools to report and advise on data management/usage for energy- and capital-
conserving provisioning
 Deduplication
– Either at the file or block level
– Ensures only one copy of data is stored on disk (e.g., duplicate copies are turned into
pointers to the original)
 Storage virtualization
– Allows for more storage systems to be hidden behind and controlled by a central
controller that can more efficiently manage the different storage systems
– Abstract physical devices to allow for more functionality
– Enables powerful volume management

73 © 2011 IBM Corporation


Emerging Technologies

 Phase-change memory (PCM)


 STTRAM, MRAM
 And other Storage Class Memories (SCM)

74 © 2011 IBM Corporation


Metrics and Benchmarks

 Storage Performance Council (SPC)


– SPC-1C/E (2009)
• For smaller storage component configurations (no larger than 4U with 48 drives)
http://www.storageperformance.org/home
– SPC Benchmark 1/Energy
• Idle Test
• IOPS/Watt
• Extends SPC-1C/E to include more complex storage configurations
 SNIA Emerald (in development)
– Idle power
– Maximum response time
– Performance (IOPS) per Watt
– http://www.snia.org/home/
 EPA (EnergyStar Data Center Storage Product Specification) (in development)
– Power Supply Efficiency
– Active and Idle State
– Power Management Requirements
– Details here may not be up to date, due to confidentiality until the specification is released
– http://www.energystar.gov/index.cfm?c=new_specs.enterprise_storage

75 © 2011 IBM Corporation


Emerging Industry Solutions

 IBM Storwize V7000


– SAN Volume Controller Software (SVC)
• Virtualizes SANs and eases storage and volume management
– EasyTier Software
• For hybrid storage systems (flash-based SSDs and mechanical disks), dynamically
moves data between SSD tier and disk tier for best performance
 IBM DS8800
– SVC
– EasyTier
 EMC VNX
– tiering + spin-down support
 Sun Oracle Exadata Database Storage Machine
– Flash caching
– Volume management for easy scalability

76 © 2011 IBM Corporation


Take-Aways and References

 Data is growing and the storage systems to satisfy the capacity demand are not energy-
proportional
 More of the data center is becoming more energy-efficient, and storage energy consumption
is becoming dominant
 Storage Power Capping and Energy Saving Techniques
– Hybrid storage architectures
 Metrics and benchmarks adopted by the industry are beginning to drive this issue and the
importance of focus in this area
 References:
– Dennis Colarelli and Dirk Grunwald. Massive arrays of idle disks for storage archives. pages 1–11. In
Proceedings of the 2002 ACM/IEEE International Conference on Supercomputing, 2002.
– Charles Weddle, Mathew Oldham, Jin Qian, An-I Andy Wang, Peter Reiher, and Geoff Kuenning.
PARAID: a gear-shifting power-aware raid. ACM Transactions on Storage, 3(3):33, October 2007.
– D. Chen, G. Goldberg, R. Kahn, R. I. Kat, K. Meth, and D. Sotnikov, Leveraging disk drive acoustic
modes for power management. In Proceedings of the 26th IEEE Conference on Mass Storage
Systems and Technologies (MSST), 2010.
– Wes Felter and Anthony Hylick and John Carter. Reliability-aware energy management for hybrid
storage systems. To Appear in the Proceedings of the 27th IEEE Symposium on Massive Storage
Systems and Technologies (MSST), 2011.

77 © 2011 IBM Corporation


Networking

78 © 2011 IBM Corporation


What’s in a network?

Source: HP

79 © 2011 IBM Corporation


Network power in the data center — Overview

 Includes Ethernet LANs and Fibre Channel SANs


– Virtually no published work on SAN power modeling/management!
 Most power is in switches
– Network interface cards (NICs) & appliances (firewalls, etc.) are counted as servers
– Router power is usually small due to lower number of routers than switches
 Small (10-20% of data center IT power) but growing
– Replacement of direct-attached storage with networked storage (often driven by
virtualization)
– More dynamic environments (e.g. VM migration) demand more bandwidth
– Emerging bandwidth-intensive analytics workloads
 Currently not energy-proportional
– Switch power is proportional to number of active ports (if you’re lucky)
– Reduces the overall proportionality of the data center
 Energy is a small fraction of total network cost
– Few modifications can be justified based on energy savings

80 © 2011 IBM Corporation


Switch Power Model

 switch_power = base_power + active_ports*(MAC_power + PHY_power)


 Base_power includes crossbar, packet buffers, control plane, etc.
 MAC_power = MAC_base + MAC_activity_factor*utilization
– Media Access Control (MAC) layer deals with protocol processing
– MAC_activity_factor ~= 0 in many switches (depends on degree of clock gating)
 PHY_power depends on speed, media (copper or optical), and distance
– Physical layer (PHY) performs data coding/decoding
 For chassis switches, use fixed power for the chassis and power model for each line card

 Optimization: Minimize total length of cabling to minimize power


– Use top-of-rack switches instead of home-run cabling

Source: Priya Mahadevan, Sujata Banerjee, Puneet Sharma: Energy Proportionality of an Enterprise Network. Green Networking
Workshop 2010
81 © 2011 IBM Corporation
Switch Power Example

 48-port 1000BASE-T switch


 base_power = 151 W
 MAC_power = 0.68 W/port
 PHY_power = 0.22 W/port
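A small sketch that plugs the example values above into the switch power model from the previous slide; the active port count, utilization, and MAC activity factor in the usage line are assumptions.

```python
def switch_power(base_w, active_ports, mac_base_w, phy_w,
                 mac_activity_factor_w=0.0, utilization=0.0):
    """switch_power = base + active_ports * (MAC_power + PHY_power),
    with MAC_power = MAC_base + MAC_activity_factor * utilization."""
    mac_w = mac_base_w + mac_activity_factor_w * utilization
    return base_w + active_ports * (mac_w + phy_w)

# 48-port 1000BASE-T example from this slide, assuming 36 active ports at 30% utilization
# and a MAC activity factor of ~0 (a heavily clock-gated MAC).
p = switch_power(base_w=151.0, active_ports=36, mac_base_w=0.68, phy_w=0.22,
                 mac_activity_factor_w=0.0, utilization=0.3)
print(f"Estimated switch power: {p:.1f} W")   # ≈ 183 W
```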

82 © 2011 IBM Corporation


Port Power Example

 Intel 1000Base-T copper gigabit NIC


 MAC_power varies from 53-270 mW
depending on utilization
– Slightly energy proportional
 PHY_power varies from 150-1000 mW
depending on link speed
– 1000 Mb/s is 3x the power of 100 Mb/s,
5x the power of 10 Mb/s
– Does your laptop really need 1000 Mb/s?
 Unplugging the cable saves significant power

 NICs and switches use the same PHYs


 Switches may have higher MAC power per
port

83 © 2011 IBM Corporation


IEEE 802.3az Energy Efficient Ethernet

 Final standard ratified in October 2010


 Products just now appearing
 Free power savings – should be no performance loss and no additional cost

 Low-power idle (LPI) allows PHY Tx side to shut off when no packet is being transmitted
– Saves ~400mW per 1 Gbps port
 Rx side remains on continuously
 Makes Ethernet more energy-proportional

84 © 2011 IBM Corporation


Existing Actuators and Near-Term Power Reduction Policies

 What’s possible with the equipment you have today?

 Enable 802.3az where possible


 Find unplugged ports and turn them off
– Requires administrative action to turn port back on if needed
– Can save ~1W/port
– Mostly benefits older switches; newer switches power off unplugged ports automatically
 Consolidate onto fewer switches and turn off unused switches
 Manual port rate adaptation (only for copper)
– Gather utilization data on switch ports
– Run 1Gbps ports at 100Mbps when possible
– Can save ~0.5W/port (out of ~2-4W/port)
– Future: Run 10Gbps at 1Gbps or slower (save 4W?)
– Requires administrative action; could be automated via SNMP

85 © 2011 IBM Corporation


Rate Adaptation vs. Low Power Idle

86 © 2011 IBM Corporation


Architectural Considerations for Future Data Centers

 Tradeoffs between server, storage, and network power


– Shared storage may reduce storage power and increase network power
– Virtualization may reduce server power and increase network power (due to higher
bandwidth)
 Converge LAN and storage networks
– Reduce number of switches
– Reduce number of active ports
 Replace high-end chassis switches with scale-out topology of small switches
– 2X the ports -> over 2X the power
– Feature-rich switches are higher power

87 © 2011 IBM Corporation


10GbE Copper Dilemma

 10GBASE-T (Cat6 cable, RJ-45 connector)


– High power (>5 W/port)
– More expensive
– Short range (100 m)
– Power proportional to range, but always higher than -CR
– Backwards compatible with gigabit (RJ-45 connector)
• Can reduce speed and power
– Compatible with existing data center cable plant
– Extra latency (+2 us) due to error correction

 10GBASE-CR (aka SFP+ direct attach twinax)


– Low power (<1 W/port)
– Lower cost (despite costlier cables)
– Very short range (<10 m)
– Not backwards compatible with gigabit
– Fixed 10 Gb/s speed
– Very low latency (~0.1 us)

88 © 2011 IBM Corporation


References

 Priya Mahadevan, Sujata Banerjee, Puneet Sharma: Energy Proportionality of an Enterprise


Network. Green Networking Workshop 2010
– Empirical study from HP Labs
– Switch power model, rate adaptation, and throwing away equipment
 Sergiu Nedevschi, Lucian Popa, Gianluca Iannaccone, Sylvia Ratnasamy, David Wetherall:
Reducing Network Energy Consumption via Sleeping and Rate-Adaptation, NSDI 2008
– Switch power model, buffer-and-burst, rate adaptation
 Brandon Heller, Srinivasan Seetharaman, Priya Mahadevan, Yiannis Yiakoumis, Puneet
Sharma, Sujata Banerjee, Nick McKeown: ElasticTree: Saving Energy in Data Center
Networks. NSDI 2010
 Dennis Abts, Mike Marty, Philip Wells, Peter Klausler, Hong Liu: Energy Proportional
Datacenter Networks. ISCA 2010
– Shut off some links and switches during low network load
– Requires multipathed topology (not yet common)
 Nathan Farrington, Erik Rubow, and Amin Vahdat: Data Center Switch Architecture in the
Age of Merchant Silicon. Hot Interconnects 2009
– Considers cost and power tradeoffs of different switch designs
89 © 2011 IBM Corporation
Cloud Computing: Virtualized Resources
and Energy Management

90 © 2011 IBM Corporation


Cloud Computing
User’s view of a cloud
– Pay-as-you-use cost model
– Rapidly grow (shrink) compute capacity to match need
– Ubiquitous access

Cloud: computing infrastructure designed for dynamic provisioning of resources for
computing tasks.

The growth of cloud-based computing means an increased use of data centers for computing
needs.
Provider’s view of a cloud
– Shared resources, consolidation driving up utilization, efficiency.
– Leverage economies of scale for optimizations.
– Increased flexibility in sourcing equipment, components, pricing.
91 © 2011 IBM Corporation
Energy-efficiency, Virtualization and Cloud Computing
 “..the energy expense associated with powering and cooling the worldwide server installed base increased
31.2% over the past five years … In 2009, the server energy expense represented $32.6 billion, while the
server market generated $43.2 billion.”*

 “..customers have avoided $23.5 billion in


server energy expense over the past six
years from virtualizing servers.” *

*Datacenter Energy Management: How Rising Costs,


High Density, and Virtualization Are Making Energy
Management a Requirement for IT Availability (IDC
Insight Doc # 223004), Apr 2010

 "Cloud will grow from a $3.8 billion opportunity in 2010, representing over 600,000 units, to a $6.4 billion
market in 2014, with over 1.3 million units.”. In Worldwide Enterprise Server Cloud Computing 2010–
2014 Forecast Abstract (IDC Market Analysis Doc # 223118), Apr 2010.
 “cloud computing to reduce data centers energy consumption from 201.8TWh of electricity in 2010 to
139.8 TWh in 2020, a reduction of 31%”, in Pike Research Report on “Cloud Computing Energy
Efficiency”, as reported in Clean Technology Business Review, December, 2010.
92 © 2011 IBM Corporation
Efficiency from Cloud Model for Computing
 Better utilization of systems drives increased efficiency
– Increased sharing of resources – lower instance of unused resources.
– Less variability in aggregate load for larger population of workloads – better sizing of
infrastructure to total load.
 Computing on a large-scale saves materials, energy
– Study shows savings through less materials for larger cooling and UPS units.
– Similar savings also possible in IT equipment
 Economies of scale fund newer technologies
– Favor exploitation of newer (riskier), cheaper cooling technologies because of scaled up
benefits.
– Favor re-design of IT equipment with greater modularity, homogeneity with efficiency as
a driving concern.

93 © 2011 IBM Corporation


Virtualization as Cloud Computing Enabler
 [Diagram: four workloads (W1–W4), each on its own powered-on server, are consolidated as
 virtual machines (VMs) onto two powered-on servers connected by the network]
94 © 2011 IBM Corporation
Dynamic Consolidation with Live Migration

 [Diagram: VMs are live-migrated from Servers 3 and 4 onto Servers 1 and 2, allowing
 Servers 3 and 4 to be turned from On to Off]

 Need support in platform and VMM/hypervisor for live migration/partition mobility.


 Network connectivity between hosts.
 Server power-on/-off capability (typically managed through service processor connections).

95 © 2011 IBM Corporation


Case Study: Consolidation Benefit Analysis in an Enterprise Data Center*

 • Explore opportunities for power savings through consolidation of servers in data centers
   under realistic constraints
   – 70 zones, over 2000 servers, 5 days of performance data
 • Clusters are formed based on physical proximity (zone), ownership by different lines of
   business (LOB), and instruction set architecture (ISA) family

 Example consolidation – 6-server cluster
 [Diagram: Apps 0–5 consolidated onto Server 0 (utilization pie chart shown); Servers 1–5
 are turned off]

 *Study conducted by Wael El-Essawy, Karthick Rajamani, Juan Rubio, Tom Keller
96 © 2011 IBM Corporation


Case Study: Consolidation Methodology

 [Flow diagram of the methodology: app/server placement and utilization logs, server
 performance data, server capacity metrics (SPEC, rPerf), a server inventory DB (model,
 manufacturer, zone, LOB, CPUs, frequency, etc.), and derated server power data feed the
 data filtering and modeling logic (server power model, server cluster formation), which
 produces cluster-level and DC-level consolidation analysis and a summary (Emerald)]
97 © 2011 IBM Corporation


Case Study: Performance and Power Models
 Performance logs sample average CPU utilization every 15/60 minutes
 Take a common performance metric to compare servers with different architectures
 Capacity:
   – CINT Rate × Freq_scale × #CPUs_scale
   – Other performance metrics can be used (SPECweb, rPerf, etc.)
 Utilization logs for a server are relative to its capacity
 Start with non-virtualized servers

 Power models
   – Assume a linear power model for each server between:
     • Pmax (power at 100% utilization)
     • Pidle (idle power)
   – Pmax
     • Determined by the server derated power
     • Represents how much power is allocated
     • Usually less than nameplate power
     • Maximum configuration
   – Pidle
     • Determined by the server age and model
     • Assumes no DVFS

 [Chart: relative power vs. utilization for the linear models of several server classes
 (old/new; large/medium/small/blade)]
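A minimal sketch of the linear server power model and capacity scaling described above; the Pidle/Pmax values and scaling factors in the example are assumptions, not figures from the study.

```python
def server_power(utilization, p_idle_w, p_max_w):
    """Linear power model: interpolate between idle power and power at 100% utilization."""
    u = min(max(utilization, 0.0), 1.0)
    return p_idle_w + (p_max_w - p_idle_w) * u

def server_capacity(cint_rate, freq_scale=1.0, cpus_scale=1.0):
    """Capacity metric: CINT rate scaled for frequency and CPU-count differences."""
    return cint_rate * freq_scale * cpus_scale

# Assumed example: an older server vs. a newer one.
old = {"cap": server_capacity(cint_rate=80), "p_idle": 350.0, "p_max": 500.0}
new = {"cap": server_capacity(cint_rate=200), "p_idle": 120.0, "p_max": 300.0}

# Power to run the same absolute load (say 40 capacity units) on each server:
load = 40
for name, s in (("old", old), ("new", new)):
    u = load / s["cap"]
    print(f"{name}: utilization {u:.0%}, power {server_power(u, s['p_idle'], s['p_max']):.0f} W")
```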
98 © 2011 IBM Corporation
Case Study: Consolidation Results
 2,292 servers out of 2,977 total servers are idled
   – 23% of servers remain active; the rest can be turned off
   – Per cluster, on average: 1.25 active servers, 4.2 idle servers
   – Average jobs per server after consolidation: 4.3

 Data center servers are mostly underutilized
   – 8% overall average CPU utilization
   – 76% of the servers are less than 10% utilized
   – Only 2% of the servers are more than 50% utilized

 Cluster-level consolidation significantly raised server utilization
   – 35% average consolidated utilization

 Cluster-level consolidation significantly lowered aggregate server power
   – 74% reduction in total server power

 [Histograms: input utilization (average 8.24%) and consolidated utilization (average
 34.6%) across servers]

99 © 2011 IBM Corporation


Case Study: Energy Savings

 74% aggregate energy savings across all clusters

[Figure: Power savings (%) per cluster across all ~500 clusters, and cluster power consumption (group power in kVA) by cluster ID before and after consolidation.]

 Average cluster server power:
– Before consolidation = 2.6 kVA
– After consolidation = 0.66 kVA

100 © 2011 IBM Corporation


Dynamic Provisioning and Consolidation

 Virtual Machine (VM) consolidation as a bin-packing problem (see the sketch after this list).


– Bin size: Server capacity
– Object size: Historical VM CPU utilization summary
 Extensions for practical solutions
– Limit packing to a fixed percentage of server capacity, avoid resource saturation.
– Accommodate requirements for other resources such as VM memory needs.
– Provision VM resources using prior characterization with expected workload.
– Factor in SLAs and/or adopt runtime performance monitoring.
 Possible additional optimizations/considerations
– Techniques for better prediction of future load characteristics.
– Factor in multiple optimization concerns with utility-function based frameworks.
– Factor in server/cluster power limits and power consumption for placement.
– Adopt energy-aware placement strategies in heterogeneous server-workload
environments.
– Factor in VM migration characteristics and cost.
– Factor in server on/off temporal characteristics.
– Understand and address impact of other shared resources such as processor caches,
networks, I/O.
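A minimal first-fit-decreasing sketch of the basic bin-packing formulation above, before any of the listed extensions; the headroom factor, capacities and VM demands are illustrative assumptions.

```python
def consolidate(vm_demands, server_capacity, headroom=0.8):
    """Pack VM CPU demands onto the fewest servers, first-fit decreasing.

    vm_demands      -- list of (vm_name, cpu_demand) pairs
    server_capacity -- usable CPU capacity of one server
    headroom        -- cap packing at a fraction of capacity to avoid saturation
    """
    limit = server_capacity * headroom
    servers = []  # each entry: {"load": float, "vms": [names]}
    for name, demand in sorted(vm_demands, key=lambda x: -x[1]):
        target = next((s for s in servers if s["load"] + demand <= limit), None)
        if target is None:
            target = {"load": 0.0, "vms": []}
            servers.append(target)
        target["load"] += demand
        target["vms"].append(name)
    return servers

# Hypothetical example: 6 VMs onto servers with 4.0 cores usable, 80% headroom cap
demands = [("w1", 1.2), ("w2", 0.4), ("w3", 2.0), ("w4", 0.6), ("w5", 0.3), ("w6", 1.0)]
for s in consolidate(demands, server_capacity=4.0):
    print(s["vms"], round(s["load"], 2))
```

First-fit decreasing is only a starting point; the extensions listed above (memory constraints, SLAs, migration cost) change both the packing rule and when it is worth re-running.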

101 © 2011 IBM Corporation


Consolidation and Other Techniques

 Dynamic Voltage and Frequency Scaling


– Often evaluated as a competing solution.
– Provides better responsiveness to load changes, with potentially lower energy savings.
– Applicable to non-virtualized and virtualized environments without VM migration support.
– Can be transparently leveraged as complementary solution.
– Should be explicitly leveraged in conjunction for superior optimization.

 Thermal Management
– Consolidation can increase the diversity in data center thermal distribution
– Thermal-aware consolidation/task placement strategies to mitigate thermal impact of
consolidation.
– Modular cooling infrastructure controls are an important complement to consolidation
solutions to reduce overall datacenter energy consumption.
– Integration of energy-aware task placement/consolidation with thermal management solutions can be a successful approach to full data center energy optimization.

102 © 2011 IBM Corporation


Considerations When Evaluating/Optimizing Cloud for Energy Efficiency

 Energy cost of transport (network) to and from Cloud


– The volume of data transported between the Cloud and the local network impacts which applications are more energy-efficient in a Cloud
– Network topology and components can have a big impact on overall efficiency.
– However, most of the networking energy cost is often hidden/transparent to a user.

 Performance impact of shared resource usage


– Higher response times for tasks can render a Cloud solution infeasible; a hybrid
computing solution is a likely compromise.
– Lowered performance can imply lowered energy-efficiency

 Modularity, responsiveness of the infrastructure supporting the Cloud


– Higher consolidation can exacerbate cooling issues in a non-modular, cooling-
constrained facility.
– Cooling infrastructure not tunable to changes can be a source of inefficiency in
consolidated environments.
– Servers with slow on/off times can limit exploitation of dynamic consolidation.
– Networking within the cloud can be a factor for dynamic consolidation benefits.

103 © 2011 IBM Corporation


Energy Proportionality versus Dynamic Consolidation
 Energy proportional components
– Consume power/energy in proportion to their utilization
– Ideally, no energy is consumed if no load and energy consumption scales in proportion
to load.
 Server consolidation provides energy proportionality with non-ideal system components
– Just enough servers are kept active to service the consolidated load allowing the rest
to be powered off.
– The granularity for scaling energy to load is that of entire servers.

Can increasing energy proportionality in server component designs render server


consolidation solutions obsolete?

 Ideal energy proportionality is still far from reality, so continue with server
consolidation.
 Clusters of servers heterogeneous in their efficiencies would continue to benefit from
energy-aware task placement/consolidation.
 Cooling solutions without good tuning options can interact sub-optimally with energy
proportional hardware requiring intelligent task consolidation/placement to improve overall
datacenter efficiency.

104 © 2011 IBM Corporation


Energy Accounting

 Basic Motivation
– Charging, incentivizing customers to allow better infrastructure utilization.
– Identify inefficiencies and unanticipated consumption.
– Adapt resource provisioning and allocation with energy-usage information for more
efficient operation.
– Energy profiling of software to guide more efficient execution.

 Different approaches and challenges


– Activity-based approaches (Modeling; see the sketch after this list)
• Accuracy of models
• Inability to capture power variation with environment and manufacturing.
– Power-measurement based approach (Measurement)
• Synchronizing measurement with resource ownership changes.
• Granularity of measurements versus resource ownership/usage.
– Common Challenges
• State changes with power management
• Fairness considerations
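As an illustration of the activity-based (modeling) approach, here is a minimal sketch that attributes a metered server energy interval to VMs in proportion to their CPU-time activity; the attribution rule, idle-energy split and numbers are illustrative assumptions, not a product algorithm.

```python
def attribute_energy(interval_energy_j, idle_fraction, vm_cpu_seconds):
    """Split one measurement interval's server energy across VMs.

    interval_energy_j -- metered server energy for the interval (joules)
    idle_fraction     -- share of energy treated as static/idle overhead
    vm_cpu_seconds    -- dict of VM name -> CPU seconds consumed in the interval
    """
    idle_energy = interval_energy_j * idle_fraction
    active_energy = interval_energy_j - idle_energy
    total_activity = sum(vm_cpu_seconds.values()) or 1.0
    n_vms = len(vm_cpu_seconds) or 1
    shares = {}
    for vm, cpu_s in vm_cpu_seconds.items():
        # idle overhead split evenly; active energy split by activity
        shares[vm] = idle_energy / n_vms + active_energy * cpu_s / total_activity
    return shares

# Hypothetical 60 s interval: 9,000 J metered, 60% treated as idle overhead
print(attribute_energy(9000.0, 0.6, {"vmA": 30.0, "vmB": 10.0, "vmC": 20.0}))
```

The even split of the idle share here is exactly the kind of choice the fairness consideration above refers to.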

105 © 2011 IBM Corporation


Reducing power of Networked PCs
 Problem: Networked PCs always on to provide remote access capability even when
mostly idle, wastes power.
 Solutions:
─ Using special NIC hardware to keep limited networking active even while allowing the
PC to sleep.
─ Set up a Sleep Proxy. Sleep proxies maintain the network presence for the sleeping PC and wake it up when needed; sleep proxies could themselves be special virtual machines1.

─ Virtualize the PC. Migrate the PC Virtual


Machine to a designated holding server
for such VMs, power down the PC till its
resources are needed2.

1. SleepServer: A Software-Only Approach for Reducing the Energy Consumption of PCs within Enterprise Environments, Yuvraj Agarwal, Stefan Savage, Rajesh Gupta, USENIX 2010.
2. LiteGreen: Saving Energy in Networked Desktops Using Virtualization, Tathagata Das, Pradeep Padala, Venkata N. Padmanabhan, Ramachandran Ramjee, Kang G. Shin, USENIX 2010.

106 © 2011 IBM Corporation


Benchmarks: SPECvirt_sc2010

 SPEC’s first virtualized environment benchmark


 Reporting in performance-only and performance-per-watt categories
– Two efficiency categories:
1. Full system (server + storage) performance-per-watt
2. Server performance-per-watt
 Workload organized in sets of VMs, called tiles.
– Six VMs per tile running three applications; the applications are modified versions of SPECweb2005, SPECjAppServer2004, and SPECmail2008.
– Acceptable performance criteria set for each application within a tile.
 Measures
– Performance measure is arithmetic mean of normalized performance measure for each of the three
applications expressed as <Performance>@<number of VMs>.
– Allows for fractional tiles.
– Peak performance/Peak power is the measure for performance-per-watt categories
 Still in early adoption stage
– Released 2010
– Eighteen results in performance category and one in each performance-per-watt category.

107 © 2011 IBM Corporation


Takeaway

 Cloud is an attractive computing infrastructure model with rapid growth because of its on-
demand resource provisioning feature.
– Growth of cloud computing would lead to growth of large data centers. Large-scale
computing in turn enables increased energy efficiency and overall cost efficiency.
 The business models around clouds (cloud providers) incentivize energy-efficiency optimizations, creating a big consumer for energy-efficiency research.
– The cloud’s transparent physical resource usage model facilitates sharing and
efficiency improvements through virtualization and consolidation.
– Energy-proportionality and consolidation need to co-exist to drive Cloud energy-
efficiency
– End-to-end (total DC optimization) design and operations’ optimization for efficiency
will also find a ready customer in Cloud Computing.
 Efficiency optimization while guaranteeing SLAs will continue to drive research directions
in the Cloud.

108 © 2011 IBM Corporation


References
1. Using Virtualization to Improve Datacenter Efficiency, Version 1, Richard Talaber, Tom Brey, Larry Lamers, Green Grid White Paper #19, January,
2009.
2. Quantifying the Environmental Advantages of Large-Scale Computing, Vlasia Anagnostopoulou, Heba Saadeldeen, Frederic T. Chong,
International Conference on Green Computing, August, 2010. (material and operational cost reduction).
3. Green Cloud Computing: Balancing Energy in Processing, Storage, and Transport, Jayant Baliga, Robert W A Ayre, Kerry Hinton, and Rodney S Tucker, Proceedings of the IEEE 99(1), January, 2011.
4. pMapper: power and migration cost aware application placement in virtualized systems, A. Verma, P. Ahuja, and A. Neogi, in Proceedings of the
9th ACM/IFIP/USENIX International Conference on Middleware, 2008.
5. Energy Aware Consolidation for Cloud Computing, Shekhar Srikantaiah, Aman Kansal, Feng Zhao, HotPower 2008.
6. Performance and Power Management for Cloud Infrastructures, Hien Nguyen Van, Fr´ed´eric Dang Tran and Jean-Marc Menaudy, 3rd IEEE
International Conference on Cloud Computing, 2010.
7. Mistral: Dynamically Managing Power, Performance, and Adaptation Cost in Cloud Infrastructures, Gueyoung Jung, Matti A. Hiltunen, Kaustubh R.
Joshi, Richard D. Schlichting, Calton Pu, ICDCS 2010.
8. vGreen: A System for Energy Efficient Computing in Virtualized Environments, Gaurav Dhiman, Giacomo Marchetti, Tajana Rosing, ISLPED 2009.
9. Temperature-Aware Dynamic Resource Provisioning in a Power-Optimized Datacenter, Ehsan Pakbaznia, Mohammad Ghasemazar, and
Massoud Pedram, DATE 2010.
10. Trends and Effects of Energy Proportionality on Server Provisioning in Data Centers, Georgios Varsamopoulos, Zahra Abbasi, and Sankeep K. S.
Gupta, International Conference on High Performance Computing (HiPC), December, 2010.
11. Virtual Machine Power Metering and Provisioning, Aman Kansal, Feng Zhao, Jie Liu, Nupur Kothari, Arka A. Bhattacharya, ACM SOCC 2010.
12. VMeter: Power Modelling for Virtualized Clouds, Ata E Husain Bohra and Vipin Chaudhary, IPDPS 2010.
13. VM Power Metering: Feasibility and Challenges, Bhavani Krishnan, Hrishikesh Amur, Ada Gavrilovska, Karsten Schwan, GreenMetrics 2010, in
conjunction with SIGMETRICS'10), New York, June 2010 (Best Student Paper).
14. LiteGreen: Saving Energy in Networked Desktops Using Virtualization, Tathagata Das, Pradeep Padala, Venkata N. Padmanabhan,
Ramachandran Ramjee, Kang G. Shin, USENIX 2010.
15. SleepServer: A Software-Only Approach for Reducing the Energy Consumption of PCs within Enterprise Environments, Yuvraj Agarwal Stefan
Savage Rajesh Gupta, USENIX 2010.
16. Somniloquy: Augmenting Network Interfaces to Reduce PC Energy Usage, Y. Agarwal, S. Hodges, R. Chandra, J. Scott, P. Bahl, and R. Gupta,
NSDI’09, Berkeley, CA, USA, 2009

109 © 2011 IBM Corporation


Energy-efficient Software

110 © 2011 IBM Corporation


Software Components and Compute Resources
[Figure: The software stack and compute resources — workload managers, applications and libraries, runtime/data management systems, operating systems, hypervisor/virtual machine monitor, and service firmware/BIOS — which collectively manage resource states, provision and schedule resources, and consume/utilize resources.]

111 © 2011 IBM Corporation
The Many Roles of Software in Energy-efficient Computing
 Exploiting lower energy states and lower power operating modes
– Support all hardware modes e.g. S3/S4, P-states in virtualized environments
– Detect and/or create idleness to exploit modes.
– Software stack optimizations to reduce mode entry/exit/transition overheads.
 Energy-aware resource management
– Understand and exploit energy vs performance trade-offs e.g. Just-in-time vs Race-to-idle
– Avoid resource waste (bloat) that leads to wasted energy
– Adopt energy-conscious resource management methods e.g. polling vs interrupt, synchronizations.
 Energy-aware data management
– Understand and exploit energy vs performance trade-offs e.g. usage of compression
– Energy-aware optimizations for data layout and access methods e.g. spread data vs consolidate
disks, inner tracks vs outer tracks
– Energy-aware processing methods e.g. database query plan optimization
 Energy-aware software productivity
– Understand and limit energy costs of modularity and flexibility
– Target/eliminate resource bloat in all forms
– Develop resource-conscious modular software architectures
 Enabling hardware with lower energy consumption
– Parallelization to support lower power multi-core designs
– Compiler and Runtime system enhancements to help accelerator-based designs

112 © 2011 IBM Corporation


Processor and System State Management
 ACPI states for OSPM, Intel (Enhanced) SpeedStep, AMD PowerNow
– Encounter incomplete Chipset/BIOS support and/or lack of enablement by user.

Source: Mondira (Mandy) Pant, Intel, presentation at GLSVLSI, May 2010

 Linux governors for user-level power management (see the sketch after this list).


 Folding on IBM POWER platforms.
 Managing states in virtualized environments via service firmware, aggregate utilization,
hypervisor with OS hints.
 Increasing idle exploitation opportunities – tickless kernels and timer/interrupt-service
migrations.
 Coordinating voltage-frequency scaling and idle state management.
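As a small illustration of the Linux governor interface mentioned above, the sketch below reads and sets the cpufreq scaling governor through sysfs; the paths follow the standard Linux cpufreq sysfs layout, the chosen governor name is an assumption, and root privileges are required to write.

```python
import glob

CPUFREQ_GLOB = "/sys/devices/system/cpu/cpu[0-9]*/cpufreq"

def current_governors():
    """Return {cpufreq_dir: governor} for all CPUs exposing cpufreq."""
    result = {}
    for d in glob.glob(CPUFREQ_GLOB):
        with open(d + "/scaling_governor") as f:
            result[d] = f.read().strip()
    return result

def set_governor(name="ondemand"):
    """Write the requested governor to every CPU (needs root)."""
    for d in glob.glob(CPUFREQ_GLOB):
        with open(d + "/scaling_governor", "w") as f:
            f.write(name)

if __name__ == "__main__":
    print(current_governors())
    # set_governor("powersave")   # uncomment to switch the policy
```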
113 © 2011 IBM Corporation
Exploiting Dynamic Voltage and Frequency Scaling: A New Approach
 Core-pool algorithm* for slack detection in multi-core, multi-threaded environments

[Figure: Power reduction for SPECpower_ssj2008 — normalized power versus load level (100% down to 0%) for DVFS driven by average utilization versus DVFS driven by the core-pool algorithm; the core-pool approach detects slack in utilization (subject to a minimum utilization threshold) and achieves lower power/energy.]

[Figure: Improvement is greater for increased SMT — SPECpower_ssj2008 score normalized to nominal mode with fan control is roughly 1.27–1.29 for the core-pool approach versus roughly 1.03–1.19 for the average-utilization approach across SMT4, SMT2 and SMT1.]

*Power-performance Management on an IBM POWER7 Server, Rajamani et al., ISLPED 2010

114 © 2011 IBM Corporation


Memory Sub-system Power Management
 Idle memory power
– Large memory systems can have a greater fraction of memory power in idle devices.
– Exploiting DRAM idle power modes critical to energy-efficiency.
– Power-aware virtual memory, coordinated processor scheduling and memory-state
management.
 Active memory power
– DRAM device active power is not reducing fast enough to keep pace with bandwidth
growth demands.
– Providing adequate power for DRAM accesses can be critical to system performance.
– Power shifting between processor and memory - regulating power consumption for
maximizing performance.
 Support in today’s servers
– Transparent to systems software and applications
– System-state driven e.g. S3 state entry can place DRAM in self-refresh mode.
– Idle-detect driven - DRAM power-down (e.g. Nehalem EX, POWER6) and self-refresh
(e.g. POWER7) triggered when memory controller detects adequate idleness.

115 © 2011 IBM Corporation


Dynamic Memory Idle State Management
[Figure: A memory controller with per-channel power management state machines (channels ch0–ch3, buffers buf0–buf3) issuing DRAM commands/signals and idle-mode changes; rank-level idleness must be detected across DIMMs of 2 GB to 16 GB before a low-power mode can be entered.]

 Large regions of memory need to be idle before a lower power mode can be used.
 The higher-savings, higher-latency mode (O(µs) exit) needs even larger regions to be idle, which is often infeasible.
 Granularity needs worsen with larger capacity devices/DIMMs, i.e., the situation can be worse than shown.
 Hypervisor and system software involvement to consolidate data into the fewest power domains can maximize idle opportunities.

116 © 2011 IBM Corporation


Ideas we are exploring for software assisted memory power management

 Energy-aware virtual memory re-sizing by hypervisor/operating systems


– Improve system memory utilization and lower physical memory occupancy
by idle data to reduce associated energy.

 Affinity-aware placement and memory allocation limiting device


occupancy
– Lowering memory access cost (active power) and memory occupancy
cost (standby power) .

 Software assisted tiered main memory architecture


– Facilitate incorporation of new memory technologies and/or aggressive
exploitation of low power (but higher latency) modes for energy-efficient
capacity expansion.

117 © 2011 IBM Corporation


Software Bloat
 Modularity and flexibility for software development can have performance and energy-efficiency
overheads.
– Temporary object bloat scenarios for SPECpower_ssj2008, data measured on POWER750.
– Shows equi-performance power (and consequently energy) for different levels of bloat.
– Primary source of inefficiency is lower performance due to cache pollution and memory
bandwidth impact from higher incidence of temporary objects.

[Figure: Equi-performance power for three bloat levels — Nominal (original, unmodified code), Higher Bloat (explicit object reuse disabled at one code site), and Less Bloat (object reuse introduced at another code site).]

 Use of DVFS enabled big power reductions when bloat is reduced

Source: The Interplay of Software Bloat, Hardware Energy Proportionality and System Bottlenecks, HotPower 2011.

118 © 2011 IBM Corporation


Energy-Performance Trade-offs

Optimizing for performance need not always optimize energy-efficiency. Examples


 Race-to-idle optimizes performance, but is inefficient if the workload is memory bound (see the worked example below).
 Usage of compression to improve performance under limited storage access bandwidth.

 Usage of disk parallelism to address limited storage bandwidth.
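A back-of-the-envelope sketch of the race-to-idle trade-off above; all power and timing numbers are illustrative assumptions, chosen only to show how a memory-bound workload can favor running slower.

```python
def job_energy(active_power_w, runtime_s, idle_power_w, window_s):
    """Energy (J) to finish one job within a fixed time window, then sit idle."""
    return active_power_w * runtime_s + idle_power_w * (window_s - runtime_s)

window = 10.0            # seconds available before the next job arrives
# Race-to-idle: full frequency, high power, short runtime
race = job_energy(active_power_w=200.0, runtime_s=4.0, idle_power_w=80.0, window_s=window)
# Slowed down 30% in frequency; memory-bound, so runtime grows only ~10%
slow = job_energy(active_power_w=140.0, runtime_s=4.4, idle_power_w=80.0, window_s=window)
print(race, slow)        # 1280.0 J vs 1064.0 J under these assumptions
```

Whether race-to-idle or just-in-time wins depends on how much power the slower operating point actually saves and how deep the idle state is, which is why the trade-off has to be evaluated per workload.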

Source for figures: Energy Efficiency: The New Holy Grail of Data
Management Systems Research, S Harizoupoulos, M A Shah, J
Meza, P Ranganathan, CIDR Perspectives 2009.

119 © 2011 IBM Corporation


Cluster, Parallel and High-performance Computing Applications
 Energy-aware server pool sizing for multi-server applications.
– Incorporate energy-aware optimizers in workload management systems to
choose the number/type of servers for multi-tier workloads based on real time
site traffic.
 Cluster resource sizing and power-mode usage based on load.
– Load-balancer for cluster can utilize energy considerations to shut down
additional servers not required for SLA compliance.
 Coordinating processor/system state management and job scheduling
– Job schedulers for Supercomputer clusters can adapt performance states of
servers to nature of workload launched on specific servers.
 Energy-aware parallel application runtimes/libraries
– Exploiting processor idle states at synchronization points
– Exploiting network link states based on communication patterns

120 © 2011 IBM Corporation


Potential Optimizations in Data Management

 Query optimizers in database systems


– Factor in performance implications of accessing disk-resident/memory-resident
data in formulating a query plan, incorporate energy considerations.
 Group/batch processing of queries
– Both throughput and energy-efficiency can be improved by (delayed) batch processing of related queries, trading off higher latencies for individual (early) queries.
 Enabling adoption of energy-efficient media
– Optimize software stack for usage of newer media like Flash with better
energy-efficiency for random I/O, enable tiered storage.
 Data layout and energy optimizations
– Coordinate data accesses and disk idle-mode change commands based on
knowledge of data layout on disks to lower disk energy.
 Energy-efficient data node management
– Adopt energy-aware data replication/placement strategies in multi-node/multi-
replica environments.

121 © 2011 IBM Corporation


Downloadable tools for energy-awareness
 PowerTOP
– http://www.linuxpowertop.org/

 JouleMeter
– http://research.microsoft.com/en-us/projects/joulemeter/default.aspx

122 © 2011 IBM Corporation


Take Away

 Energy-aware software is integral to energy-efficient computing


 Intelligent resource provisioning and management is key at all levels of resource
management.
 Appropriately managing component low-power modes requires architecting
software for dynamic power-performance trade-off management, idle opportunity
detection and creation.
 It is important to realize flexibility in software development without exacerbating
resource waste leading to lowered performance and inefficiency.
 Data placement and access have important implications on resource usage and
consequently energy-efficiency.

123 © 2011 IBM Corporation


References
1. Power-performance Management on an IBM POWER7 Server, Karthick Rajamani, Malcolm Ware, Freeman Rawson, Heather Hanson, John Carter, Todd Rosedahl, Andrew Geissler, Guillermo Silva, Hong Hua, 2010 IEEE/ACM International Symposium on Low-power Electronics and Design (ISLPED 2010).
2. Energy Reduction in Consolidated Servers through Memory-Aware Virtual Machine Scheduling, Jae-Wan Jang,
Myeongjae Jeon, Hyo-Sil Kim, Heeseung Jo, Jin-Soo Kim, Member, and Seungryoul Maeng, IEEE Transactions on
Computers 60(4), April 2011.
3. Energy Efficiency: The New Holy Grail of Data Management Systems Research, S Harizoupoulos, M A Shah, J Meza, P Ranganathan, CIDR Perspectives 2009
4. The Thrifty Barrier: Energy-efficient Synchronization in Shared-memory Multiprocessors, J. Li, J.F. Martínez, and M.C.
Huang, In International Symposium on High Performance Computer Architecture (HPCA), February 2004.
5. On Evaluating Request-Distribution Schemes for Saving Energy in Server Clusters, Karthick Rajamani and Charles
Lefurgy, ISPASS 2003.
6. Towards Eco-friendly Database Management Systems, Willis Lang and Jignesh M Patel, 4th Biennial Conference on
Innovative Data Systems Research, Jan 2009.
7. Exploring Power-performance Trade-offs in Database Systems, Zichen Xu, Yi-Cheng Tu, Xiaorui Wang, 26th IEEE
International Conference on Data Engineering, March, 2010.
8. Robust and Flexible Power-Proportional Storage, Hrishikesh Amur, James Cipar, Varun Gupta, Gregory R. Ganger,
Michael A. Kozuch, Karsten Schwan, ACM Symposium on Cloud Computing (SoCC 2010), Indianapolis, June 2010.
9. Evaluation and Analysis of GreenHDFS: A Self-Adaptive, Energy-Conserving Variant of the Hadoop Distributed File
System, Rini T. Kaushik, Milind Bhandarkar, Klara Nahrstedt, 2nd IEEE International Conference on Cloud Computing
Technology and Science, 2010.
10. Compiler-directed Energy Optimization for Parallel Disk Based Systems, S. W. Son, G. Chen, O. Ozturk, M. Kandemir,
A. Choudhary, IEEE Transactions on Parallel and Distributed Systems (TPDS) 18(9), September 2007.

124 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

Energy Modeling

125 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

Section Outline

 Modeling of energy-efficient data centers


– Principles used in data center modeling tools
– State of the art in data center modeling
– Future research topics (model integration, off-line vs. real-time modeling)

126 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

Modeling Goals and Process

 Goal can be:


– Estimate variables that are hard to measure (e.g., total energy spent in power conversion)
– Understand impact of changes to the scenario (e.g., introduce new server, change
temperature set point, failure of cooling unit)
– Optimize a scenario (e.g., determine best location for a new server, reduce number of
applications that fail after a cooling unit shuts down)

 To evaluate the energy efficiency of computer systems, it is necessary to model both


workloads and their physical environment

 Researchers have produced several tools to model parts of the system:


– Workloads in computer systems (e.g., SimpleScalar, SimOS, Simics) and large scale
systems (e.g., MDSim)
– Physical properties such as current drawn, power dissipation and heat transport

127 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

Modeling Workflow

 A full system simulation requires:
– Modeling of the application workloads
– Use those to drive the power load models
– Use power loads to drive the electrical network
– Use power loads as heat loads to drive the thermal models
– Use the thermal transfers to evaluate the facility cooling system

 Feedback from later stages is needed to improve accuracy:
– Ambient temperature affects cooling within the server, which impacts power consumed by fans and leakage power of the processor
– Power management of the server can impact performance of the workload
– Failure in one domain can propagate to other domains

[Figure: Initial pass flows from application models to electric models to thermal models (time steps ranging roughly from µs–ms for workloads through ms–s for electrical to s–min for thermal), with feedback from the later stages back to the earlier ones.]

128 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

Assessing Modeling Solutions

 Multiple tools exist, each modeling different aspects of the problem


– Selecting the right ones is key!

 Modeling domain focus:


– Application: performance, utilization
– Electric: server power, data center current distribution, energy consumption
– Thermal: room air temperature, heat transport, mechanical plant
– Reliability: thermal cycles, electric quality
– Cost: operational expenses, capital expenses, return-on-the-investment (ROI)

 Data:
– Measurement-based: use real workloads or systems, and sensors
– Analytical: use models of system to estimate state variables

 Execution:
– Real-time
– Off-line

129 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

Electrical/Power Modeling

 The first stage in developing a comprehensive data center power model is to estimate power for the individual components
 Power (in the AC domain) is the complex product of the current and voltage:

P = V × I
P_VA = V_RMS × I_RMS (in volt-amperes)
P_REAL = pf × V_RMS × I_RMS (in watts)

 Power factor (pf) is the portion of the apparent power that is actually consumed
– Most, but not all, translates into heat
 Most systems' power supplies have power factor correction
 So, a power model can estimate power as the sum of those products (see the sketch at the end of this slide)
 Caveat: data center power distribution is usually done in 3 phases
– The current is not in phase with the voltage
– Furthermore, unbalanced phases (unequal loads on each branch) result in changes to the angle of the current
– Requires a calculation of the resulting power factors in situations with unbalanced phases
– Usually, the power network is AC, which requires a complex-number solver

[Figure: Three-phase phasor diagram with voltage phasors at 0°, 120° and 240° and current phasors at 60°, 180° and 300°.]

 Tools usually have models to represent the efficiency of power delivery components
– Transformer or cable power losses, etc.
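A minimal sketch of the per-branch power arithmetic described above, using complex phasors for a possibly unbalanced three-phase feed; the voltages, currents and phase angles are illustrative assumptions.

```python
import cmath

def phasor(rms, angle_deg):
    """Complex RMS phasor from magnitude and angle in degrees."""
    return cmath.rect(rms, cmath.pi * angle_deg / 180.0)

# Hypothetical unbalanced 3-phase branch circuit: voltages at 0/120/240 degrees,
# currents shifted by a different amount on each phase.
voltages = [phasor(230.0, a) for a in (0.0, 120.0, 240.0)]
currents = [phasor(16.0, -10.0), phasor(12.0, 95.0), phasor(20.0, 235.0)]

per_phase = [v * i.conjugate() for v, i in zip(voltages, currents)]  # complex power S = V x I*
real_w = sum(s.real for s in per_phase)         # watts actually consumed (turn into heat)
apparent_va = sum(abs(s) for s in per_phase)    # volt-amperes the distribution must carry
print(round(real_w, 1), "W /", round(apparent_va, 1), "VA, power factor =",
      round(real_w / apparent_va, 3))
```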

130 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

Thermal Modeling

 There are multiple methods, which use varying


degrees of basic principles and system
characterization

 Computational Fluid Dynamics (CFD)


– Determine heat loads (power of system),
transports (air or liquid flows), topology and
boundary conditions
– Apply finite element (FE) mathematics on
system
– Use transport and thermal equations (Navier-Stokes)

 System characterization:
– Perform experiments on system
– Build polynomial models for components
– Obtain “steady-state” by solving system of
equations

 Energy balance equations


– Arithmetic tabulation of power loads and heat
removal capabilities of equipment

131 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

Data Center Power Model

 Source:
– University of Michigan
– “Understanding and
Abstracting Total Data Center Power”,
Workshop on Energy-Efficient Design
(WEED), held in conjunction with ISCA 2009.

 Focus:
– Electric power of data centers

 Approach:
– Power is a function of equipment utilization
and ambient outside temperature

 Good for fast exploration of high-level what-if


scenarios, with simplified models.

132 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

Stochastic Queuing Simulation

 Source:
– University of Michigan
– “Stochastic Queuing Simulation for Data Center Workloads”, Workshop on Exascale Evaluation
and Research Techniques (EXERT), 2010.
 Focus:
– Integrate workload characteristics in a data center power model
 Approach:
– Characterize equipment
– Characterize workloads and build distributions
 Suited for data center design, and “what-if” modeling, not for runtime management

133 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

Data Center Power Delivery Reliability Model

 Source:
– Frank Bodi, “Super Models in Mission Critical Facilities”, INTELEC 2010
 Focus:
– Electrical and mechanical modeling of power distribution and its use to detect failures
 Approach:
– Develop an electrical model for each component
– Models are connected according to topology of data center
 Virtual stress test
– Monte-Carlo simulation of failures: loss of main power, loss of redundant power,
switching sequences
– Determine mean-time-between-failure (MTBF), useful to identify equipment deficiencies

134 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

Data Center Power Delivery Reliability Model

 Components modeled:
– Incoming AC grid, standby power generator, transfer switches, UPS, transformers, power panels, breakers, computer and air-conditioning loads
 Data center is represented as a “net-list” of components

135 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

HotSpot: a Compact Thermal Model at the Processor-Architecture Level

 Source:
– Univ. Virginia
– “HotSpot: A Compact Thermal Modeling
Methodology for Early-Stage VLSI
Design”, Transactions of VLSI, 2006
– http://lava.cs.virginia.edu/HotSpot/

 Focus:
– Thermal modeling of microprocessors
– Integration with power simulation

 Approach:
– All thermal interfaces (heat sink, heat
spreader, silicon) are represented as
resistors or capacitors
– Values are obtained from "basic principles"
– RC-network for the microprocessor and
package is iteratively solved

136 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

Thermal Faults Modeling Using an RC model

 Source:
– Univ. of Pittsburgh
– “Thermal Faults Modeling Using a RC
Model with an Application to Web
Farms”, ECRTS, 2007

 Focus:
– Thermal modeling of servers

 Approach:
– Abstracts properties of the system and develops a network inspired by electrical components
– Current sources represent heat producers
– Voltage represents temperature
– Easy to develop hierarchical models: components, then servers, then clusters (see the sketch after this list)
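A minimal sketch of the RC-network idea above for a single lumped node (one "component"); extending it to servers and clusters just adds nodes and thermal resistances between them. The R, C and power values are illustrative assumptions.

```python
def simulate_rc_node(power_trace_w, t_ambient=25.0, r_th=0.5, c_th=200.0, dt=1.0):
    """Lumped RC thermal model: one node heated by a power (heat) source.

    power_trace_w -- heat produced each time step (the 'current source'), watts
    t_ambient     -- ambient temperature, degrees C
    r_th          -- thermal resistance to ambient, C per watt
    c_th          -- thermal capacitance, joules per degree C
    dt            -- time step, seconds
    """
    t = t_ambient                      # node temperature (the 'voltage')
    trace = []
    for p in power_trace_w:
        # dT/dt = P/C - (T - Tamb) / (R*C)
        t += dt * (p / c_th - (t - t_ambient) / (r_th * c_th))
        trace.append(t)
    return trace

# Hypothetical step load: 100 W for 10 minutes, then idle at 20 W
load = [100.0] * 600 + [20.0] * 600
temps = simulate_rc_node(load)
print(round(max(temps), 1), "C peak,", round(temps[-1], 1), "C at end")
```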

137 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

Mercury Suite: Server-level Thermal Model

 Source:
– Rutgers University
– “Mercury and Freon: Temperature Emulation and Management for Server Systems”, ASPLOS,
2006.
 Focus:
– Thermal modeling of servers
 Approach:
– Build graphs to represent heat transfer paths and air flow paths
– Based on conservation of energy laws
 Provides a good link between data center level and chip level thermal models

138 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

Mercury Suite: Server-level Thermal Model

 Basic principles:
– Conservation of energy (see the equations after this list)
– Newton's law of heat transfer: heat flows from the region of highest to lowest temperature
– 'k' is a function of the heat capacity of the material, the area exposed to airflow, the speed of the air, its moisture content and pressure
– In this work, 'k' is assumed constant and computed empirically.
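The equations referenced above appeared as figures in the original slides; a standard lumped form consistent with the description (an assumption about the exact notation used by Mercury) is:

```latex
% Conservation of energy for a region with heat capacity C:
%   stored energy changes by (heat generated) minus (heat carried away)
\[
  C \,\frac{dT}{dt} \;=\; P_{\text{generated}} \;-\; Q_{\text{out}}
\]
% Newton's law of heat transfer between a hotter and a cooler region:
\[
  Q \;=\; k \,\bigl(T_{\text{hot}} - T_{\text{cold}}\bigr)
\]
```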

139 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

Other Commercial Thermal Tools

 General purpose simulation tools


– Accurate and high resolution CFD
(computational fluid dynamics)
– High performance and parallel solvers

 Popular tools:
– ANSYS: Fluent
– Mentor Graphics: FloTherm
– SolidWorks: FloWorks

 Challenges:
– Not particularly addressing data centers
– Need accurate input data (dimensions of
equipment, thermal properties, etc.)
– Steep learning curves
– Can be expensive

 New breed of tools suited for data center


modeling
– ANSYS: CoolSim
– Future Facilities: 6Sigma
– Innovative Research: TileFlow
– Mentor Graphics: FloVent

140 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

BCS (British Computer Society) Data Centre Simulator

 Source:
– http://dcsg.bcs.org/welcome-dcsg-simulator
– http://www.romonet.com/content/prognose

 Focus:
– High-level data center cost and energy
simulator

 Approach:
– Use component efficiency curves from
manufacturers
– Build electrical and thermal “topology” of data
center
– Energy balance of components

 Reports:
– Detailed information and classification of
energy consumption
– Simulation across seasons
– Cost

141 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

BCS (British Computer Society) Data Centre Simulator

 Tool has great topology input


capabilities that allow modeling of
complex systems

 Reports:
– Detailed information and
classification of energy consumption
– Simulation across seasons
– Cost

142 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

GreenGrid Data Center Modeling Work

 Source:
– The Green Grid consortium
– http://thegreengrid.org/library-and-tools.aspx?category=All&type=Tool

 Focus:
– Address multiple aspects of data center energy efficiency

 Approach:
– High level tools, useful for planning or rough estimation

 Power Usage Effectiveness Estimator


– http://estimator.thegreengrid.org/puee
– http://estimator.thegreengrid.org/pcee

 Free-cooling Estimated Savings


– For US http://cooling.thegreengrid.org/namerica/WEB_APP/calc_index.html
– For Europe http://cooling.thegreengrid.org/europe/WEB_APP/calc_index_EU.html
– For Japan http://cooling.thegreengrid.org/japan/WEB_APP/calc_index_jp.html

143 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

144 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

145 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

146 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

APC Models to Estimate Cost of Ownership


 Source:
– APC (American Power Conversion), manufacturer of data center infrastructure
equipment
– http://tools.apc.com/
– http://www.apc.com/tools/isx/tco/
 Focus:
– Model energy efficiency of APC’s InfraStruxure™ components to show their TCO (total
cost of ownership)
 Approach:
– Use energy efficiency curves for components
– “Arithmetic” tabulation of energy consumption and cost

147 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

APC Models – UPS Efficiency Comparison Calculator

148 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

APC Models – PUE Calculator

149 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

APC Models – Cost of Ownership

150 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

APC Models – Cost of Ownership

151 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

IBM Measurement and Management Technology (MMT)

 Source:
– IBM Research
– “Measurement-based modeling for data
centers”, ITHERM, 2010

 Focus:
– Thermal modeling of data centers for
improving energy efficiency

 Approach:
– Thermal scanning of the data center is used to build the thermal model
– Thermal sensors are strategically placed, which allows using a simpler version of the thermodynamic equations

152 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

IBM Measurement and Management Technology (MMT)

 Coupling measurements with


models
– Point measurements
(temperature, power,
humidity, air flow, pressure,
etc)
– Model for details not caught
by sensors
 On average, this service demonstrated roughly 10% overall cooling power savings for existing data centers (with only a moving cart of sensors and moving floor tiles)

153 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

Research Opportunities

 Integration of all models:


– Different purposes (performance, IT and cooling power, cost, reliability)
• Is there a need for a tightly-coupled “supermodel”?
• What is the right interface between multiple simulation domains?
– Time scales in different models (ns, us, ms, sec, min, hour)
• What is the right time granularity to keep for each domain?
– Spatial scales (chips to servers to racks to data centers)
• How to scale the simulation to large data centers?
– Advantage:
• What do we gain by integrating those models?

 Hybrid models:
– Off-line models are for planning and design
– Real-time models requires sensor data

154 © 2011 IBM Corporation


Energy Efficient Data Centers and Systems

Thermal-Aware Power Optimization (TAPO) – Optimizing Total Power


 Tradeoff between data center cooling power and IT/server fan power
– Higher IT/server inlet temperature means less CRAH power but higher server fan power
– Server fans are limited in form factor, so large, power-efficient fans cannot be used
– Fan power is quadratic/cubic in cooling capability
 Total power has a strong relationship with IT utilization per cooling zone
– Low utilization favors warm inlet temperature
– High utilization favors cool inlet temperature
 Binary control of the CRAH setpoint is close to optimal
• >10% total data center power reduction
 Wei Huang, et al., IGCC 2011, best paper

[Figure: Total power (IT + cooling) versus chiller thermal setpoint for high and low server utilization; the optimal setpoint shifts because a warmer setpoint trades more IT fan power for less chiller power.]


155 © 2011 IBM Corporation
Reliability

156 © 2011 IBM Corporation


Reliability-Aware Power Performance Optimization
 Reliability has a power cost
 Power management can affect reliability
 Reliability considerations
–Data center tier classifications
–Redundant branch circuits increase power delivery
infrastructure costs
–Redundant power delivery components in servers increase
power delivery infrastructure costs
–Chip aging leads to higher operational voltages, reducing
energy efficiency
 Power management considerations
–Thermal cycling of chips and early failures of packages
–Disk drive failure rates due to spin downs to save power
–Fan failure rates due to power cycling fans
157 © 2011 IBM Corporation
History of Reliability in Data Centers and the Concept of Tiers
 IT customers expect availability of “Five Nines” or 99.999%
 Although hardware and software platforms may meet Five Nines, the
complementary site infrastructure can’t support these availability goals
 Uptime Institute’s Tier Performance Standards established in 1995 has
become the default standard for the uninterruptible uptime industry
–See “Tier Classifications Define Site Infrastructure Performance” white
paper
 Actual measured site availability ranging from 99.67% to 99.99%
 Substantially less than Five Nines: conclusion is site availability limits
overall IT availability
 Highest tier, Tier IV, first appeared in 1995 with UPS Windward data
center project working with IBM and other vendors
–Requires at least two completely independent electrical systems
connected to two redundant power supplies in all IT equipment
–Last point of electrical redundancy is between UPS and actual IT
equipment
–Human factors are important because 70% or more of all site failures
involve people
–4 hours for IT to recover from a failure leads to 1 failure every 5 years for 99.995% availability
158 © 2011 IBM Corporation
Data Center Tier Descriptions Based on Power, Cost, Reliability
Category | Tier I | Tier II | Tier III | Tier IV
Utility Voltage (typical) | 208, 480 | 208, 480 | 12-15kV | 12-15kV
Single Points-of-Failure | Many + human error | Many + human error | Some + human error | None + Fire and EPO
Annual Site Caused IT Downtime (actual field data) | 28.8 hours | 22.0 hours | 1.6 hours | 0.8 hours
Representative Site Availability | 99.67% | 99.75% | 99.98% | 99.99%
Typical Months to Implement | 3 | 3-6 | 15-20 | 15-20
Year first deployed | 1965 | 1970 | 1985 | 1995
Construction Cost: Raised Floor | $220/sq ft | $220/sq ft | $220/sq ft | $220/sq ft
Usable UPS Output | $10,000/kW | $11,000/kW | $20,000/kW | $22,000/kW

Costs based on 2005 estimates

Source: Uptime Institute white paper: "Tier Classifications Define Site Infrastructure Performance"
159 © 2011 IBM Corporation
Data Center Up-Time Institute Tier 4 Reliability Support

Source: Wiboonrat, ICOQM’07


160 © 2011 IBM Corporation
Oversubscription of Branch Circuits to Increase Equipment Density in a DC

 The cost of power delivery infrastructure is high, especially for Tier IV


data centers
–Motivation here to use that expensive infrastructure investment and
pack more than 50% more equipment than is used today without
impacting reliability, but some compromise in performance
 Example: branch circuits and racks
–Consider a typical data center rack has two branch circuits feeding it
for redundancy, say BC0 and BC1, with 30 AMPs per phase on each
branch circuit
• Call this current capacity per branch circuit BCC
–Within a rack, all the servers are fed from two independent power
strips, each power strip fed by 0.8*BCC AMPs of current
–Fuse limit is 1.25*BCC AMPs, this means each branch circuit could
handle this for a short number of seconds
–This leads to a potential density improvement with oversubscription of (1.25/0.8) = 1.5625, i.e., 56.25% denser IT equipment with the same Tier IV uptime and redundancy support (see the sketch below)
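A small sketch of the branch-circuit arithmetic above; the 30 A, 80% and 125% figures match the slide's example, while the per-server current is an illustrative assumption.

```python
def rack_headroom(bcc_amps=30.0, nec_safe_factor=0.8, fuse_factor=1.25):
    """Branch-circuit capacities for the slide's oversubscription example."""
    safe_amps = bcc_amps * nec_safe_factor       # continuous-load limit (NEC 80%)
    fuse_amps = bcc_amps * fuse_factor           # short-duration trip threshold
    oversub = fuse_factor / nec_safe_factor      # 1.5625 -> 56.25% denser
    return safe_amps, fuse_amps, oversub

safe, fuse, oversub = rack_headroom()
print(safe, "A safe,", fuse, "A trip,", round((oversub - 1) * 100, 2), "% extra density")
# With hypothetical 4 A servers: 6 fit under the safe limit versus 9 under the trip
# limit, provided power capping keeps sustained draw below the safe limit.
print(int(safe // 4), "servers at the safe limit vs", int(fuse // 4), "at the trip limit")
```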

161 © 2011 IBM Corporation


Branch Circuit Power (example: 30 AMPs)

[Figure: Branch-circuit current levels for a 30 A example. The breaker trip point is typically 120%-125% of nominal (e.g., 36 A = 125% BCC), nominal branch-circuit capacity (BCC) is 100% (e.g., 30 A), and the safe limit per the National Electric Code is 80% (e.g., 24 A). Two redundant branch circuits (BC0, BC1) feed the rack, with measured values shown for 6 servers at 6 A plus 6 servers at 4 A; when BC1 experiences a failure, its load shifts to BC0.]
162 © 2011 IBM Corporation


Component Redundancy and Reducing Guardbands

 Component redundancy reductions are being pursued to


reduce power delivery infrastructure costs through
oversubscription
 Guardbands are used for reliable operation of chips
–Definition of guardband for chips: amount of additional
margin in a key parameter, e.g. voltage, to assure chip
timing never fails under all worst case scenarios including
aging and workloads
 Energy efficiency gains are being pursued from guardband
reductions

163 © 2011 IBM Corporation


Component Redundancy in Servers Impacts Cost of Power Delivery
 Redundancy in components adds cost to server design
 Voltage Regulators with additional phases for current
delivery, fail in place
 Two power supplies, each capable of handling the full load of
the server
–With power capping, new means to “oversubscribe” the
supplies so that one power supply can’t handle the full load
of the server, but server still continues operation if one of
the two supplies fails
–Oversubscription of power supplies is analogous to the
branch circuit oversubscription described earlier

164 © 2011 IBM Corporation


Guardband Reduction at the Chip Level
 Chip aging, workload variability leading to higher voltages
over chip lifetimes increasing power consumption
 Josep Torrellas, from the University of Illinois, has a
representative research project on this topic
 Variation-Tolerant Architectures
 Describes active voltage management techniques to
manage aging of chips
 Reducing voltage guardbands over the lifetime of the chip can increase power efficiency and enable more energy-efficient operation for chips with high levels of circuit variability
165 © 2011 IBM Corporation
Aging-Induced Degradation

Source: Josep Torrellas, “Variation-Tolerant Architecture”

166 © 2011 IBM Corporation


Managing Aging

Source: Josep Torrellas, “Variation-Tolerant Architecture”

167 © 2011 IBM Corporation


DVSAM-Pow: Power Efficiency

Source: Josep Torrellas, “Variation-Tolerant Architecture”

168 © 2011 IBM Corporation


Could Thermal Cycling of Chips Lead to Packaging Failures?

 IBM study of actual customer environments


– System operation is unique (based on power management policies)
– Customer applications and workloads on the system
– Unique data center environment
 IBM Developed a Figure of Merit (FOM)
– The purpose of the FOM is to have a metric that is related to the frequency
and the depth of the thermal cycling going on inside chips
– Reads and averages a couple of on-chip thermal sensors which are
spatially separated and segregated from high power dissipation areas on
chip
– Parse temperature data into discrete elements, feed it through an algorithm which normalizes this data to a defined thermal cycle condition, and keep a running tab of the thermal cycles a given processor experiences in the field
– FOM is saved on all modules in the field, can be retrieved from returned
modules
– Possible to read FOM values off of machines in the field
– The larger the FOM, the more likely failures could occur with the packaging
– A FOM approaching 10,000 over a 7 year lifetime is seen as a problem

169 © 2011 IBM Corporation


7 Year Field Projections for FOM (larger is more harmful on package)
 Projections shown below were developed from actual field data collected from customer
installed systems, after operation for from 1 to 6 months
 The x axis below is the projected 7 year FOM value
 The y axis is a count of the number of IBM chips that are at a given FOM projection
 Each color is a different IBM system design running in the customer’s environment

[Figure: Histogram of the number of processors at each projected 7-year FOM value (x-axis 0 to 1200); the worry point is annotated at roughly 4000, and the worst-case projection observed is FOM = 1058.]
170 © 2011 IBM Corporation
Improvements to Reduce Thermal Cycling

Adjust voltages to control chip temperatures, while leaving performance (frequency) alone

[Figure: Control loop — accumulate the temperatures of all cores, calculate the average temperature, and compare T_current with T_prev: if the temperature rose, step to the lower voltage V(i-1); if unchanged, keep the same voltage V(i); if it fell, step to the higher voltage V(i+1). A minimal sketch of this loop follows.]
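A minimal sketch of that control loop, assuming hypothetical platform interfaces read_core_temps and set_voltage_step (stand-ins for service-firmware calls) and a discrete table of allowed voltage steps.

```python
def thermal_cycle_controller(read_core_temps, set_voltage_step, n_steps, start_step):
    """Damp chip temperature swings by stepping voltage while frequency stays fixed.

    read_core_temps  -- callable returning per-core temperatures (hypothetical interface)
    set_voltage_step -- callable applying a voltage step index (hypothetical interface)
    n_steps          -- number of available voltage steps (index 0 = lowest voltage)
    start_step       -- initial voltage step index
    """
    step = start_step
    t_prev = None
    while True:
        temps = read_core_temps()
        t_curr = sum(temps) / len(temps)          # average of accumulated core temps
        if t_prev is not None:
            if t_curr > t_prev:                   # getting hotter: move to lower voltage
                step = max(step - 1, 0)
            elif t_curr < t_prev:                 # cooling off: move to higher voltage
                step = min(step + 1, n_steps - 1)
            set_voltage_step(step)                # if unchanged, the same step is re-applied
        t_prev = t_curr
        yield step                                # yield so the caller can pace or stop the loop
```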

171 © 2011 IBM Corporation


Results from Technique to Mitigate Thermal Cycling
 Average chip temperature shown, with controlled maintaining +/-1C temperature swing, but
at higher energy cost
 Technique resulted in 7% reduction in projected FOM growth

[Figure: Average chip temperature over time, controlled versus uncontrolled, with the power overhead of the controlled case indicated.]

172 © 2011 IBM Corporation


Disk Reliability Study by Google
 Many data centers are moving to higher ambient temperatures: is there
a risk for disk drives?
 “Failure Trends in a Large Disk Drive Population” FAST’07 paper from
Google
–Over 100,000 hard disk drives studied
–Examined SMART (Self-Monitoring Analysis and Reporting
Technology) parameters from within drives as well as temperatures
–Found that only at disk temperatures above 40 deg C was there a
noticeable correlation to drive failures
–Some SMART parameters with higher correlation to failures
included first scan errors, reallocations, offline reallocations, and
probational counts (suspect sectors on probation)
–Key missing piece from the study is extensive power cycling, other than reporting that after 3 years, higher power cycle counts can increase the failure rate to over 2%
–Assumes server class drives are running continuously as the normal
mode of operation (little change in power)
173 © 2011 IBM Corporation
Reliability Aware Disk Power Management

Storage consumes a large fraction (up to 40%) of


total IT equipment power
–High cost of operation (energy cost)
–Limits capacity of facility
Disks are not power- or energy-proportional:
–Consume roughly the same power idle (but spinning) or
servicing a request (e.g., 7W idle vs 10W active)
–Only by spinning disks down (turning off spindle motor) can
you achieve very low power state (e.g., sub 1W)
Spindown or power cycling introduces problems (our focus):
–Reliability: Disks are mechanical devices and can only spin up / spin down a limited number of times over their lifetime
–Latency: It takes 5-10 seconds to spin up a disk and access a block (vs ~10 ms when spinning)
174 © 2011 IBM Corporation
Reliability Aware Approach to Spindown
 Manage idle timeout periods dynamically, rather than using a
fixed timeout period
–If disk has been spun down less frequently than
conservative rate (e.g., every 15 minutes) in past, can spin
down more often in the future
–Need to limit lifetime spindowns to the manufacturer specification, i.e., maintain the lifetime spindown rate
 One approach to controlling spindowns: token bucket (sketched below)
–Every N minutes (e.g., every 15 minutes), add a token to
the “spindown bucket”
–When energy management policy wishes to spin down
disk, must remove token (or defer spindown)
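A minimal token-bucket sketch of the approach above; the refill period and bucket cap are illustrative assumptions tied to the 15-minute example.

```python
import time

class SpindownBudget:
    """Token bucket that rations disk spindowns to a lifetime-safe rate."""

    def __init__(self, refill_period_s=15 * 60, max_tokens=4):
        self.refill_period_s = refill_period_s   # one token per period (e.g., 15 min)
        self.max_tokens = max_tokens             # cap on saved-up spindowns
        self.tokens = max_tokens
        self.last_refill = time.monotonic()

    def _refill(self):
        now = time.monotonic()
        earned = int((now - self.last_refill) // self.refill_period_s)
        if earned:
            self.tokens = min(self.tokens + earned, self.max_tokens)
            self.last_refill += earned * self.refill_period_s

    def try_spindown(self):
        """Return True if a spindown may proceed now; otherwise defer it."""
        self._refill()
        if self.tokens > 0:
            self.tokens -= 1
            return True
        return False

# Usage: the energy-management policy asks before spinning a disk down
budget = SpindownBudget()
if budget.try_spindown():
    pass  # issue the spindown command to the drive; otherwise defer
```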

175 © 2011 IBM Corporation


Fan Failure Mechanisms and Impacts on Reliability
 Dynamic fan management for higher ambient conditions for
data center PUE: impacts on reliability?
 HP paper: “Cooling Fan Reliability: Failure Criteria,
Accelerated Life Testing, Modeling and Qualification”
–Most common failure mechanism is mechanical due to
bearings wearing out
–Bearings wear out due to loss of lubricant
–Higher temperatures accelerate MTTF for fans
–Vendors prefer to quote 25 deg C ambient which is cooler
than many of the more efficient Data Center designs with
higher ambient temperatures
–Power cycling may increase failure rates, but powering
down fans can maximize energy savings for idle servers

176 © 2011 IBM Corporation


Improvements in Server Cooling Efficiency
[Figure: Two server configurations, each with a power supply splitting compute power and on-demand cooling power across system components; redundant series fan pairs (Fan 1/Fan 2 and Fan 3/Fan 4) cool Components 1 and 2.]

• Redundant series fan pairs, for normal mode, only one fan in a set is on (Fan 1
and Fan 3)
• Assign additional cooling (Fan2 or Fan4) on demand
• When one fan fails, the other fan is switched on just-in-time before thermal
emergency (a few seconds observed in real system). From then on, use normal
mode.
• When a failed fan is replaced, higher performance can be resumed when the
utilization requires it.
177 © 2011 IBM Corporation
Research in Emerging Technologies and
Solutions

178 © 2011 IBM Corporation


Outline

 Storage Class Memories


 Power delivery, cooling and packaging technologies
 Workload optimized systems

179 © 2011 IBM Corporation


Storage Class Memory – Motivation
 Memory power is an increasing fraction of server power

[Figure: High-end consolidation server power budgets (normalized) — POWER6: 53% processor+cache, 28% memory, 19% rest of processor-memory cards; POWER7 (estimate): 41% processor+cache, 46% memory, 13% rest of processor-memory cards. Source: Architecting for Power Management: The IBM POWER7 Approach, Ware, Rajamani, Floyd, Brock, Rubio, Rawson, Carter, HPCA 2010.]

 Growing demand for capacity
– Virtualization
– Data-intensive applications
– In-memory databases

 Ramcloud*
– Latency is a problem, so keep everything in memory
– Large number of diskless computing nodes

 Scaling DRAM is increasingly difficult (like CMOS scaling of logic chips)

*Ramcloud: http://fiz.stanford.edu:8081/display/ramcloud/Home

180 © 2011 IBM Corporation


SCM Technologies – Flash

[Figure: Floating-gate flash cell structure — a control gate above a floating gate, between the source and drain.]

Source: Winfried Wilcke, Flash and Storage Class Memories: Technology Overview and Systems Impact, Panel at HEC FSIO 2008
181 © 2011 IBM Corporation
SCM Technologies - PCM

182 © 2011 IBM Corporation


Near term use of SCM technologies

Need higher capacity, lower power

[Figure: Memory system versus high-performance disk — typical access latency in processor cycles (at 4 GHz) on a log scale from 2^1 to 2^23: L1 (SRAM), EDRAM, DRAM, PCM, Flash, HDD in increasing order of latency.]

Source: Scalable High Performance Main Memory System Using Phase-Change Memory Technology, Moinuddin K. Qureshi, Vijayalakshmi Srinivasan, Jude A. Rivers, ISCA 2009.

183 © 2011 IBM Corporation


PCM, Flash and DRAM: A Quantitative View

Attribute | DRAM | PCM | NAND Flash
Non-Volatile | No | Yes | Yes
Idle Power | 100 mW/GByte | 1 mW/GByte | 10 mW/GByte
Erase / Page Size | No / 64 Bytes | No / 64 Bytes | Yes / 256 KB
Write Bandwidth per die | 1-6 GBytes/s | 50-100 MB/s | 5-40 MB/s
Page Write Latency | 20-50 ns | 1 µs | 500 µs
Page Read Latency | 20-50 ns | 50 ns | 25 µs
Endurance | 10^16 | 10^7 | 10^5
Maximum Density | 4 Gbits | 4 Gbits | 64 Gbits

184 © 2011 IBM Corporation


Emerging technology trends – Main Memory
 Alternatives
– PCM - Already commercially available
– MRAM - Commercially available but limited to 4Mbits
– STT-RAM - Early prototypes
– Memristors and dual-gate - single cell or very small prototypes
 All alternatives are persistent
– Additional power implications (suspend/resume).
– OS and applications can also use this property.
– Security
• In memory data is persistent and can be physically accessed
– Reliability
• How to trust information in memory?
 Endurance
 New system architectures

185 © 2011 IBM Corporation


Storage Class Memory as Secondary Storage Alternatives to Magnetic
Disks (HDD)
 Flash
– Lower power
• 0.5W-2W versus 2W-10W for HDD
– Lower latency for random I/O
• Larger number of IOPs: 20K-35K vs 100-300 for HDD
– Similar sequential access bandwidth
– Flash has comparable density, but suffers from scalability problem
• Endurance decreasing – 3K erases
• Cells more unreliable – More bits dedicated to error-correction
 PCM
– Less dense than Flash
– Hybrid designs with Flash
• Metadata on PCM, data on Flash
• Reduce write amplification
• Update in place – PCM re-writable and byte-addressable

186 © 2011 IBM Corporation


Emerging System Architectures with PCM

 PCM + DRAM
– PCM as main memory, DRAM as large cache
– Virtual memory managed – IBM Watson
– Hardware managed – University of Pittsburgh
 3D architectures
– 3D chip containing processors and DRAM/PCM
• IBM, University of Pittsburgh
– Used to reduce power consumption
– Diskless (HP Nanostore)

187 © 2011 IBM Corporation


References: Storage Class Memory
1. The Basics of Phase Change Memory Technology: http://www.numonyx.com/Documents/WhitePapers/PCM_Basics_WP.pdf
2. S. Raoux et al. Phase-change random access memory: A scalable technology. IBM Journal of Research and Development, 52(4/5):465-479, 2008.
3. International Technology Roadmap for Semiconductors - 2010 http://www.itrs.net/Links/2010ITRS/Home2010.htm
4. Ranganathan, P., From Microprocessors to Nanostores: Rethinking Data-Centric Systems, IEEE Computer, vol. 44, no. 1, pp. 39-48, Jan. 2011.
5. B. C. Lee et al, Phase Change Technology and the Future of Main Memory, IEEE Micro, Special Issue: Micro's Top Picks
from 2009 Computer Architecture Conferences (MICRO TOP PICKS), Vol. 30(1), 2010.
6. Jian-Gang Zhu; , "Magnetoresistive Random Access Memory: The Path to Competitiveness and Scalability," Proceedings of
the IEEE , vol.96, no.11, pp.1786-1798, Nov. 2008.
7. M. Qureshi et al. Scalable high performance main memory system using phase-change memory technology. In ISCA ’09:
Proceedings of the 36th annual international symposium on Computer architecture, pages 24–33, New York, NY, USA,
2009. ACM.
8. B. C. Lee et al. Architecting phase change memory as a scalable DRAM alternative. In ISCA ’09: Proceedings of the 36th
annual international symposium on Computer architecture, pages 2–13, New York, NY, USA, 2009.ACM.
9. Winfried Wilcke, IBM. Flash and Storage Class Memories: Technology Overview & Systems Impact., HEC FSIO 2008
Conference. http://institute.lanl.gov/hec-fsio/workshops/2008/presentations/day3/Wilcke-PanelTalkFlashSCM_fD.pdf

188 © 2011 IBM Corporation


Power Delivery, Cooling and Packaging Technologies

189 © 2011 IBM Corporation


Voltage Regulator Phase Shedding for Increased Efficiency
 Modern processor VRMs are multiphase
designs, with the total load split among
the phases.
 The efficiency of a multiphase regulator
varies with load, with efficiency falling at
lower loads.
 When the load is small instead of using
all phases, each providing a small
current, shut-off some of them and
increase the load for the others.
 Nehalem-EX
– extends the VR phase shut-off to
the cache supply,
– obtains about 2W power reduction
per socket in idle mode.

Figures and Data: A 45 nm 8-Core Enterprise


Xeon® Processor, S. Rusu et al., IEEE Journal
of Solid-state Circuits, 45(1), January 2010.

190 © 2011 IBM Corporation


On-chip Voltage Regulation

 Benefits
– Lower distribution losses: power is distributed across the board at a higher voltage
and lower current, cutting resistive (I²R) losses (see the worked example below).
– Lower energy spent on droop control, as regulation sits closer to the load.
– Enables fine-grained voltage control (spatial and temporal), leading to better
load-matching and improved energy-efficiency.
– Reduced board/system cost, since less voltage regulation is needed on the board.
 Challenges
– Space overheads on processor chip
– Difficulty realizing good discrete components (e.g., inductors) in the same technology
as the digital circuits.
 Opportunities
– 3D packaging can help address both challenges above.
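
A back-of-envelope illustration of the first benefit above, with assumed (not measured) numbers: delivering the same power at a higher distribution voltage means lower current, and resistive loss falls with the square of the current.

    # I^2 * R loss comparison for the same delivered power.
    # Path resistance and load power are illustrative assumptions.
    P_LOAD_W   = 100.0      # power delivered to the load
    R_PATH_OHM = 0.001      # assumed resistance of the distribution path

    for v_dist in (12.0, 1.0):   # board-level 12 V feed vs. ~1 V point-of-load rail
        i = P_LOAD_W / v_dist
        loss = i * i * R_PATH_OHM
        print(f"{v_dist:4.1f} V feed: {i:6.1f} A, {loss:6.2f} W lost in the path")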

191 © 2011 IBM Corporation


3D Chip Stacks
[Figure: evolution from Multi-Chip Design to System on Chip to 3D Integration]
3D Benefits:
 High core-cache bandwidth
 Integration of disparate technologies
 Reduction in wire length
 Reduced interconnect and I/O cost – eliminating off-chip drivers lowers power
overheads and is faster, for higher energy-efficiency

192 © 2011 IBM Corporation


3D
Currently being embraced for DRAM devices, e.g. the Samsung 8Gb 3D DRAM
Power reduction
 Standby (IDD2N): 50%
 Active (IDD1): 25%
Faster speeds
 Less loading on the channel.

Source: U. Kang et al. (Samsung), "8Gb 3D DDR3 DRAM Using Through-Silicon-Via
Technology," IEEE International Solid-State Circuits Conference, 2009

Challenges
 New technology: initial development costs, tool costs.
 Cooling

193 © 2011 IBM Corporation


CMOSAIC Project

 Ongoing collaborative project between IBM, École Polytechnique Fédérale de Lausanne
(EPFL) and the Swiss Federal Institute of Technology Zurich (ETH).
 Evaluate chip cooling techniques to support a 3D chip architecture.
 3D stack architecture of multiple cores with an interconnect density from 100 to 10,000
connections per sq. mm.
 Liquid cooling microchannels ~50um in diameter between the active chips.
 Single-phase liquid and two-phase cooling systems using nano-surfaces that pipe
coolants—including water and environmentally-friendly refrigerants—within a few millimeters
of the chip.
 Two-phase cooling
– Once the coolant leaves the circuit as vapor, a condenser returns it to a liquid
state, and it is then pumped back into the processor, completing the cycle.

Source: Bruno Michel, IBM

194 © 2011 IBM Corporation


Limits of Traditional Back-side Heat Removal

Microchannel back-side heat removal

Heat removal limit constrains performance

Source: Bruno Michel, IBM


195 © 2011 IBM Corporation
Scalable Heat Removal by Interlayer Cooling

 3D integration requires (scalable) interlayer liquid cooling
 Challenge: isolate electrical interconnects from liquid

[Figures: cross-section through fluid port and cavities; microchannel and
pin-fin cavity structures; through-silicon-via electrical bonding and water
insulation scheme; test vehicle with fluid manifold and connection]

Source: Bruno Michel, IBM
196 © 2011 IBM Corporation
SuperMUC Super Computer (2012)
 SuperMUC is based on the hot-water-cooling technology pioneered in QPACE, Aquasar and
iDataCool (prototype at the University of Regensburg).

 In operation at the Leibniz Supercomputing Centre (LRZ) in Munich, Germany, by 2012.


 Energy-efficiency:
– PUE of 1.1 – Green IT (see the worked example below)
– 40% less energy consumption compared to air-cooled systems
– 90% of waste heat will be reused

 Based on an IBM System X iDataPlex ®:


– Peak performance of 3 PF/s
– 9531 Nodes with total 19476 Intel Xeon CPUs / 157464 Cores, 324 TB Memory
– InfiniBand FDR10 Interconnect with ~ 11900 (optical) IB cables
– 10 PetaByte File Space based on IBM GPFS and 2 PetaByte NAS Storage

Source: Bruno Michel, IBM
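
For reference, PUE is total facility power divided by IT power. A back-of-envelope view of what a PUE of 1.1 implies; the IT power figure below is an assumption for illustration only, not a SuperMUC specification.

    # PUE = total facility power / IT equipment power.
    it_power_mw = 3.0                       # assumed IT load, for illustration
    pue = 1.1
    facility_power_mw = it_power_mw * pue
    overhead_mw = facility_power_mw - it_power_mw
    print(f"Facility draw: {facility_power_mw:.2f} MW, "
          f"overhead (cooling, power delivery): {overhead_mw:.2f} MW "
          f"({overhead_mw / it_power_mw:.0%} of the IT load)")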

197 © 2011 IBM Corporation


Hot-water-cooled datacenters – towards zero emission

[Figure: hot-water cooling loop – micro-channel liquid coolers keep the CMOS at
~80 ºC, a water loop at ~60 ºC feeds a heat exchanger, and the "waste" heat is
used directly, e.g. for building heating]

198 © 2011 IBM Corporation


Iceotope: Two-stage modular Liquid Cooling

Images from web-site of Iceotope Limited, Sheffield, UK


 Heat is captured by individually sealed liquid coolant in a primary circuit.
 It is transferred to water through a secondary circuit, which is in turn cooled by the building
water in a final circuit.
 End-to-end liquid cooling, with no coolant handling during maintenance.
199 © 2011 IBM Corporation
Total Immersion-in-Oil Cooling

Photos courtesy Green Revolution Cooling, Austin Texas


 High heat-capacity coolant (~1,300x that of air by volume; see the worked example below)
– Direct contact to CPU reduces its temperature (10-15 deg C reduction reported)
– Lower power for cooling
• less coolant volume to circulate (95% cooling power reduction claimed)
• 10-20% less server power due to elimination of internal server fans
– Improved reliability
• Fan failures are eliminated by removing fans
• Disk drive reliability improved: drives run at the coolant temperature, with less of the
vibration associated with fans and pressurized air.
 Advertised power densities of up to 100 kW per rack.
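
A rough sanity check of the ~1,300x figure above, using typical textbook property values (assumptions, not vendor data): volumetric heat capacity is density times specific heat.

    # Volumetric heat capacity = density * specific heat.
    # Property values are typical textbook figures, so the ratio is approximate.
    rho_air, cp_air = 1.2, 1005.0     # kg/m^3, J/(kg*K), air at ~20 C
    rho_oil, cp_oil = 850.0, 1900.0   # typical mineral oil (assumed)

    c_air = rho_air * cp_air          # ~1.2e3 J/(m^3*K)
    c_oil = rho_oil * cp_oil          # ~1.6e6 J/(m^3*K)
    print(f"air: {c_air:9.0f} J/(m^3*K)")
    print(f"oil: {c_oil:9.0f} J/(m^3*K)")
    print(f"ratio ~ {c_oil / c_air:.0f}x")   # lands in the ~1,300x ballpark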

200 © 2011 IBM Corporation


Intelligent Management of Power Distribution in a Data Center

 Problem
– Overprovisioning of power distribution components in data centers for availability and to
handle workload spikes
 Solutions
– Provision for average load rather than peak, reducing stranded power, and use power capping to handle spikes (see the allocation sketch below).
– Oversubscribe with redundancy and power cap upon failure of one of the supplies/PDUs.
– Employ power distribution topologies with overhead power busses to spread secondary
power feeds over larger number of PDUs, reducing the reserve PDU capacity at each
PDU.
– Use power-distribution-aware workload scheduling strategies to match load more evenly
with power availability.
 Challenges
– Separated IT and facilities operations, not enough instrumentation – no integrated,
complete view of power consumption versus availability for optimizations.
– Existing methods for increased availability of the power delivery infrastructure have high
energy/power costs.

*Power Routing: Dynamic Power Provisioning in the Data Center, Steven Pelley, David Meisner, Pooya Zandevakili,
Thomas F Wenisch, Jack Underwood, ASPLOS 2010
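
A minimal sketch of the "provision for average load" idea above (not the Power Routing algorithm from the cited paper): give each server a cap based on its measured average demand plus headroom, then scale the caps down if their sum would exceed the branch/PDU budget. All wattages are invented for illustration.

    # Provision-for-average with power capping: caps track average demand plus
    # headroom, scaled so their sum never exceeds the PDU budget. Numbers assumed.
    def assign_caps(avg_demand_w, pdu_budget_w, headroom=1.2):
        want = [d * headroom for d in avg_demand_w]    # desired cap per server
        scale = min(1.0, pdu_budget_w / sum(want))     # shrink if oversubscribed
        return [w * scale for w in want]

    avg_demand = [220.0, 180.0, 300.0, 260.0]          # measured averages (W)
    caps = assign_caps(avg_demand, pdu_budget_w=1000.0)
    for srv, (d, c) in enumerate(zip(avg_demand, caps)):
        print(f"server {srv}: avg {d:5.1f} W -> cap {c:5.1f} W")
    print("sum of caps:", round(sum(caps), 1), "W (budget 1000 W)")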

201 © 2011 IBM Corporation


New Technologies for Data Center Power Distribution Management

 Datacenter Power Management – Vision for Projects at IBM Research Austin


– Develop new technologies which enable impact of integrated management without
actual merger of IT and Facilities operations, and
– Develop optimization and management techniques which demonstrate enhanced
benefits where integrated control of IT and facilities infrastructure is possible

 Intelligent Power Distribution and Control


– Low-cost power sensors
– Power monitoring and management infrastructure prototype
– Branch Circuit Identification

202 © 2011 IBM Corporation


Branch Circuit Power Sensors in the Data Center

Power Sensors

203 © 2011 IBM Corporation


Power Monitoring Infrastructure (1/3)

[Photos: Power Distribution Units in a data center; a Power Distribution Unit
showing 2 panels of circuit breakers]

204 © 2011 IBM Corporation


Power Monitoring Infrastructure (3/3)
[Diagram: an instrumented 42-branch-circuit power panel with 6 sensor boards,
connected by ribbon cable to a processing card in the panel computer (Ethernet
and power connections), which publishes the readings as web pages]
205 © 2011 IBM Corporation


Branch Circuit Identification (BCID)
 BCID – determining which branch circuit of a power panel a piece of equipment is
connected to, using novel power-signal generation and detection technology (see the
correlation sketch below)

 Applications of BCID information


– Installation of equipment on desired branch
circuits
– Load balancing among phases/circuits and
consumption-aware placement
– Power distribution based on load
– Load-aware failure mitigation

Three methods for generating signals for BCID


• IBM systems with EnergyScale – custom power-signaling technology.
• Systems with USB interface – USB-clicker
• Available power outlet in a rack PDU – AC clicker
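
A hedged sketch of the detection side of BCID: the signaling device toggles a known load pattern, and panel-side software correlates that pattern against the current samples of every branch circuit; the branch with the strongest correlation is the one the equipment is plugged into. The signature, currents and noise below are synthetic assumptions, not the actual signaling protocol.

    # Toy branch-circuit identification: correlate a known on/off power signature
    # against per-branch current traces. All traces are synthetic assumptions.
    import random

    signature = [1, 0, 1, 1, 0, 0, 1, 0]    # pattern the signaling device toggles

    def correlation(sig, trace):
        mean = sum(trace) / len(trace)
        return sum(s * (t - mean) for s, t in zip(sig, trace))

    random.seed(1)
    branches = {name: [10 + random.uniform(-0.3, 0.3) for _ in signature]
                for name in ("A1", "A2", "A3")}
    # Branch A2 actually carries the signaling device: add ~2 A when "on".
    branches["A2"] = [i + 2.0 * s for i, s in zip(branches["A2"], signature)]

    best = max(branches, key=lambda b: correlation(signature, branches[b]))
    print("Device detected on branch", best)   # expected: A2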

206 © 2011 IBM Corporation


BCID Demo
Displays branch circuit identification in real time using power
measurement infrastructure

Power Monitoring Demo


Displays live monitoring of currents on branch circuits in a power-panel,
showing load over time.

Power Capping Demo

A real-time demonstration of gracefully managing overcurrent on a branch
circuit by power capping the connected servers (a control-loop sketch follows).
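
A hedged sketch of the kind of control loop behind such a demo; the breaker limit, thresholds, step size and server model are assumptions, not the actual demo code or the EnergyScale algorithm.

    # Toy overcurrent mitigation: when a branch circuit nears its breaker limit,
    # lower the power caps of the attached servers until the current recovers.
    BREAKER_LIMIT_A = 20.0
    SAFE_FRACTION   = 0.8      # start shedding above 80% of the limit (assumed)
    STEP_W          = 25.0     # cap reduction per control interval (assumed)

    def control_step(branch_current_a, caps_w, min_cap_w=150.0):
        if branch_current_a <= SAFE_FRACTION * BREAKER_LIMIT_A:
            return caps_w                           # nothing to do
        return [max(min_cap_w, c - STEP_W) for c in caps_w]

    caps = [350.0, 350.0, 350.0]
    for measured in (14.0, 17.5, 18.2, 16.9, 15.0): # simulated branch current (A)
        caps = control_step(measured, caps)
        print(f"branch {measured:4.1f} A -> caps {caps}")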

207 © 2011 IBM Corporation


Workload-optimized Systems

208 © 2011 IBM Corporation


Workload-optimized Systems for Energy-efficiency

 Custom designs/accelerators for specific functions at greater efficiency.

 Lower-power processors, driven by a focus on throughput rather than costlier single-thread
performance.
– Fewer active components (functions) not pertinent to the current execution.

 Application-specific integrated systems (ASIS, by analogy with ASICs)
– Fewer components, improving cooling and lowering power.

209 © 2011 IBM Corporation


Quantum Chromodynamics Parallel Computing on the Cell Broadband
Engine

 Computer optimized for lattice quantum chromodynamics


– QPACE system @ Forschungszentrum Jülich, 2010
– Commodity PowerXCell 8i processor
– Custom FPGA-based network chip
– Custom communication protocol for LQCD torus network
– Custom voltage tuning
– Custom liquid cooling
– LQCD performance
• 544-681 MFLOPS/W (QPACE)
• 492 MFLOPS/W (Intel + Nvidia GPU-based Dawning
Nebulae @ National Supercomputing Centre in Shenzhen)
– #7 on Green500 June 2011 with 773.4 MFLOPS/W

H. Baier et al., "QPACE: Power-efficient parallel architecture based on IBM PowerXCell 8i",
First Intl. Conf. on Energy-Aware High-Performance Computing, 2010.
http://www.ena-hpc.org/2010/talks/EnA-HPC2010-Pleiter-QPACE_Power-efficient_parallel_architecture_based_on_IBM_PowerXCell_8i.pdf
210 © 2011 IBM Corporation
Blue Gene/Q

[Figure: packaging hierarchy]
1. Chip: 16 application cores
2. Single Chip Module
3. Compute card: one chip module, 16 GB DDR3 memory
4. Node Card: 32 compute cards, optical modules, link chips, torus
5a. Midplane: 16 node cards
5b. IO drawer: 8 IO cards w/ 16 GB, 8 PCIe Gen2 x8 slots
6. Rack: 2 midplanes
7. System: 20 PF/s

MFLOPS/Watt: 2 GF/W, #1 on Green500 (~6x BG/P, ~10x BG/L)
211 © 2011 IBM Corporation
SeaMicro SM10000-64
 Goal is to improve compute/power and compute/space metrics (see the
back-of-envelope figures below)
 10U server
– 512 ATOM 64 bit cores (256 sockets)
– 4GB per socket
– 2.5KW
– Operates as 256 node cluster
– High-speed internal network – 1.2Tbit/s
– External 64 x 1Gb ethernet ports
– Virtualized I/O
• All I/O is shared between sockets
• Improves efficiency – very low
overhead per socket.
 Designed for high volume of modest
computational workloads
– Web servers
– Hadoop
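
Back-of-envelope density figures implied by the quoted specs (simple arithmetic; the 42U rack fit is an assumption):

    # Density arithmetic from the quoted SM10000-64 specs: 10U, 512 cores,
    # 256 sockets, 2.5 kW.
    cores, sockets, power_w, height_u = 512, 256, 2500.0, 10
    print(f"{power_w / cores:.1f} W per core, {power_w / sockets:.1f} W per socket")
    print(f"{cores / height_u:.0f} cores per U; ~{4 * cores} cores in 40U of a "
          f"42U rack (assumption)")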

212 © 2011 IBM Corporation


ARM servers on the horizon

 ARM is still 32-bit (limited to 4 GB per socket)
– 64-bit version announced (ARMv8); the Cortex-A15 already extends physical
addressing beyond 4 GB via LPAE
 Marvell, Calxeda, STM announcements
– Dual-core (STM) or quad-core (Marvell, Calxeda) Cortex-A9 at 1 to 2 GHz
– Calxeda: 5 W for CPU + 4 GB DDR3 DRAM
[Photo: Calxeda ECX-1000]
 ZT Systems R1801e (product)
– 8 modules, each with:
• dual-core STM processor, 1 GB 1333 MHz DDR3 DRAM, 1 GB Flash, 1 Gbit Ethernet,
USB and SATA
• 80 GB SSD
– 80 W in a 1U system.

[Photo: ZT Systems R1801e]
213 © 2011 IBM Corporation
Maybe the future will look like this…

 Cluster of ‘low-power’ nodes


 Each node has a processor-memory socket
– 3D stack with processor and DRAM
– Stacks of PCM memory and PCM storage, connected to the processor-memory stack
via a silicon interposer.
– Integrated switch routers for inter-node connectivity
 Inter-cluster connectivity with optics.
 Powered from renewable resources
 Cooled for free.

But surely we cannot predict the future!

214 © 2011 IBM Corporation
