2019 ASPLOS PARTIES Slides

This document discusses a resource partitioning technique called PARTIES for colocating multiple latency-critical applications on the same server. It presents a characterization of interactive applications and their sensitivity to different resource interference. PARTIES dynamically partitions resources like CPU, memory and cache to provide quality of service guarantees for multiple latency-critical applications without requiring prior knowledge about the applications.

PARTIES:

QOS-AWARE RESOURCE PARTITIONING


FOR MULTIPLE INTERACTIVE SERVICES

Shuang Chen, Christina Delimitrou, José F. Martínez

Cornell University
COLOCATION OF APPLICATIONS

[Figure: a multicore server colocating best-effort and latency-critical applications. Each core (P) has private caches; all cores share the last-level cache.]
Motivation• Characterization• PARTIES• Evaluation • Conclusions


Page 1 of 15
PRIOR WORK
§ Interference during colocation
§ Scheduling [Nathuji’10, Mars’13, Delimitrou’14]
• Avoid co-scheduling of apps that may interfere
- May require offline knowledge
- Limit colocation options
§ Resource partitioning [Sanchez’11, Lo’15]
• Partition shared resources
- At most 1 LC app + multiple best-effort jobs

TRENDS IN DATACENTERS
§ Monolith: 1 LC + many BE
§ Microservices: many LC + many BE
§ More LC jobs, and all have QoS targets
MAIN CONTRIBUTIONS
§ Workload characterization
• The impact of resource sharing
• The effectiveness of resource isolation
• Relationship between different resources

§ PARTIES: first QoS-aware resource manager for colocation of many LC services
• Dynamic partitioning of 9 shared resources
• No a priori application knowledge
• 61% higher throughput under QoS constraints
• Adapts to varying load patterns

INTERACTIVE LC APPLICATIONS
Table 1: Latency-Critical Applications

| Application | Memcached | Xapian | NGINX | Moses | MongoDB | Sphinx |
|---|---|---|---|---|---|---|
| Domain | Key-value store | Web search | Web server | Real-time translation | Persistent database | Speech recognition |
| Target QoS | 600us | 5ms | 10ms | 15ms | 300ms | 2.5s |
| Max Load | 1,280,000 | 8,000 | 560,000 | 2,800 | 240 | 14 |
| User / Sys / IO CPU% | 13 / 78 / 0 | 42 / 23 / 0 | 20 / 50 / 0 | 50 / 14 / 0 | 0.3 / 0.2 / 57 | 85 / 0.6 / 0 |
| LLC MPKI | 0.55 | 0.03 | 0.06 | 10.48 | 0.01 | 6.28 |
| Memory Capacity | 9.3 GB | 0.02 GB | 1.9 GB | 2.5 GB | 18 GB | 1.4 GB |
| Memory Bandwidth | 0.6 GB/s | 0.01 GB/s | 0.6 GB/s | 26 GB/s | 0.03 GB/s | 3.1 GB/s |
| Disk Bandwidth | 0 MB/s | 0 MB/s | 0 MB/s | 0 MB/s | 5 MB/s | 0 MB/s |
| Network Bandwidth | 3.0 Gbps | 0.07 Gbps | 6.2 Gbps | 0.001 Gbps | 0.01 Gbps | 0.001 Gbps |

Max load: max RPS under QoS target when running alone
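The QoS targets above are tail-latency bounds, so "max RPS under QoS target" is found by raising the offered load until the measured tail latency exceeds the target. A minimal sketch of that check, using the nearest-rank percentile (the helper names are ours, not from the talk):

```python
import math

def tail_latency(samples_ms, pct=99.0):
    """Nearest-rank percentile of a batch of per-request latencies."""
    ordered = sorted(samples_ms)
    k = math.ceil(pct / 100.0 * len(ordered)) - 1  # 0-based rank
    return ordered[max(k, 0)]

def meets_qos(samples_ms, target_ms, pct=99.0):
    """True if the pct-th percentile latency stays within the QoS target."""
    return tail_latency(samples_ms, pct) <= target_ms
```

For example, with Xapian's 5ms target, a load level counts toward max load only if `meets_qos(samples, 5.0)` holds over the measurement window.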

INTERFERENCE STUDY

[Heatmap: impact of resource interference. Each row corresponds to a type of resource (hyperthread, CPU, power, LLC capacity, LLC bandwidth, memory bandwidth, memory capacity, disk bandwidth, network bandwidth); each column to an application (Memcached, Xapian, NGINX, Moses, MongoDB, Sphinx). Cell values are the maximum percentage of max load for which the server can satisfy QoS while the LC application runs under that type of interference; smaller values and darker colors mean the application is more sensitive (0% = extremely sensitive, 100% = not sensitive at all).]

• Applications are sensitive to interference on resources they use heavily
• Applications with strict QoS targets are more sensitive

ISOLATION MECHANISMS
• Core mapping (hyperthreads, core counts): cgroup
• Memory capacity: cgroup
• Disk bandwidth: cgroup
• Core frequency (power): ACPI frequency driver
• LLC capacity (cache capacity, cache bandwidth, memory bandwidth): Intel CAT
• Network bandwidth: qdisc
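Of these mechanisms, Intel CAT partitions the LLC by assigning each class of service a contiguous bitmask of cache ways, written through the Linux resctrl interface or the `pqos` tool. A minimal sketch of building such a mask; the helper name is our own:

```python
def cat_waymask(num_ways, first_way=0):
    """Contiguous way bitmask, as Intel CAT requires (e.g. 4 ways -> 0xF)."""
    if num_ways <= 0:
        raise ValueError("need at least one cache way")
    # Set num_ways consecutive bits, then shift them to the starting way.
    return ((1 << num_ways) - 1) << first_way
```

For instance, giving an app ways 2 through 7 of a 20-way LLC yields `cat_waymask(6, first_way=2)` == `0xFC`.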
RESOURCE FUNGIBILITY
[Figure: allocations meeting Xapian's QoS target, stand-alone (left) vs. with memory interference (right). Axes: number of cores (1-13) against cache ways (1-19); each X marks a (cores, cache ways) allocation that satisfies QoS. Many distinct combinations work, and under interference the feasible region shrinks.]

§ Resources are fungible


• More flexibility in resource allocation
• Simplifies resource manager

PARTIES: DESIGN PRINCIPLES
§ PARTIES
• PARTitioning for multiple InteractivE Services

§ Design principles
• LC apps are equally important
• Allocation should be dynamic and fine-grained
• No a priori application knowledge or offline profiling
• Recover quickly from incorrect decisions
• Migration is used as a last resort

PARTIES
§ Main loop (server side)
  • Poll tail latency from the client-side latency monitor every 100ms
  • QoS violations? Upsize the current resource
  • Excess resources (latency slack)? Downsize and return resources to the unallocated pool
§ 5 knobs organized into 2 wheels
  • Compute wheel: cores (C), core frequency (F), LLC capacity ($)
  • Storage wheel: memory capacity (M), disk bandwidth (D)
§ Start from a random resource; if adjusting it shows no benefit, follow the wheel to visit the next resource

[Figure: per-app upsizing and downsizing wheels, with "no benefit" transitions moving between the compute and storage resources.]
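The loop above can be sketched as a small per-app controller. This is our toy reconstruction of the slide's description; the class name, resource names, and slack threshold are assumptions, not values from the talk:

```python
from collections import deque

class WheelController:
    """Toy sketch of PARTIES-style per-app control (names/thresholds assumed).

    Each interval, measured tail latency is compared against the QoS
    target: a violation upsizes the current resource, large slack
    downsizes it, and when an adjustment brings no benefit the
    controller rotates to the next resource on the wheel.
    """
    SLACK = 0.8  # below 80% of the target -> give resources back

    def __init__(self, qos_target, wheel):
        self.qos_target = qos_target
        self.wheel = deque(wheel)  # resources visited in wheel order

    def decide(self, tail_latency):
        ratio = tail_latency / self.qos_target
        if ratio > 1.0:
            return ("upsize", self.wheel[0])
        if ratio < self.SLACK:
            return ("downsize", self.wheel[0])
        return ("hold", None)

    def no_benefit(self):
        """Adjusting the current resource did not help: try the next one."""
        self.wheel.rotate(-1)
```

With a 10ms target and wheel `["cores", "llc_ways", "frequency"]`, a 12ms reading yields `("upsize", "cores")`; after a no-benefit rotation the same reading yields `("upsize", "llc_ways")`.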
METHODOLOGY
§ Platform: Intel E5-2699 v4
• Single socket with 22 cores (8 IRQ cores)
§ Virtualization
• LXC 2.0.7
§ Load generators
• Open loop
• Request inter-arrival distribution: exponential
• Request popularity: Zipfian
§ Testing strategy
• Constant load: 30s warmup, 1m measurement (x5)
• Varying load simulates diurnal load patterns
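The load-generator setup above (open loop, exponential inter-arrivals, Zipfian popularity) can be sketched as follows; the function names are ours:

```python
import random

def open_loop_arrivals(rate_rps, duration_s, seed=42):
    """Open loop: arrival times are drawn up front, independent of how
    quickly the server responds, so queueing delay is not hidden."""
    rng = random.Random(seed)
    t, arrivals = 0.0, []
    while True:
        t += rng.expovariate(rate_rps)  # exponential inter-arrival times
        if t >= duration_s:
            return arrivals
        arrivals.append(t)

def zipfian_keys(num_requests, num_keys, s=0.99, seed=42):
    """Sample keys with Zipfian popularity: the rank-r key has weight 1/r^s."""
    rng = random.Random(seed)
    weights = [1.0 / (rank ** s) for rank in range(1, num_keys + 1)]
    return rng.choices(range(num_keys), weights=weights, k=num_requests)
```

An open-loop client matters here because requests keep arriving while the server is overloaded, so queueing delay shows up in the measured tail latency instead of being masked by a closed-loop client that waits for responses.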
CONSTANT LOADS: MEMCACHED, XAPIAN & NGINX

[Figure: heatmaps comparing Unmanaged, Oracle, Heracles, and PARTIES when colocating Memcached, Xapian, and NGINX. Each heatmap plots max load of Memcached or NGINX (y-axis, 10-100%) against max load of Xapian (x-axis, 10-80%); cell values give the highest load of the remaining application for which all colocated services still meet QoS.]

§ Oracle
  • Offline profiling
  • Always finds the global optimum
§ Heracles
  • No partitioning between BE jobs
  • Suspends BE upon QoS violation
  • No interaction between resources
MORE EVALUATION
Constant loads
§ All 2- and 3-app mixes under PARTIES
§ Comparison with Heracles for 2- to 6-app mixes

Diurnal load pattern


§ Colocation of Memcached, Xapian and Moses

PARTIES overhead
§ Convergence time for 2- to 6-app mixes

CONCLUSIONS
§ Need to manage multiple LC apps
§ Insights
• Resource partitioning
• Resource fungibility
§ PARTIES
• Partition 9 shared resources
• No offline knowledge required
• 61% higher throughput under QoS targets
• Adapts to varying load patterns

PARTIES:
QOS-AWARE RESOURCE PARTITIONING
FOR MULTIPLE INTERACTIVE SERVICES
http://tiny.cc/parties

Shuang Chen, Christina Delimitrou, José F. Martínez


Cornell University
