CheR: Cheating Resilience in the Cloud via Smart Resource Allocation

This document describes CheR, a system for detecting and mitigating cheating in cloud computing resource allocation. CheR models the problem as a linear programming problem to minimize costs while ensuring results are correct above a given confidence threshold and within a maximum time. It takes as input parameters about nodes, tasks, costs and times to generate an optimal assignment matrix distributing tasks to nodes. CheR has been implemented in Java using GLPK to solve the linear program and return an assignment that meets reliability and cost objectives.


CheR: Cheating Resilience in the Cloud via Smart Resource Allocation

Roberto Di Pietro, Flavio Lombardi, Fabio Martinelli, Daniele Sgandurra

Dept. of Mathematics and Physics, University Roma Tre, Italy
Institute for Informatics and Telematics, CNR, Italy
[email protected]

FPS 2013
22/10/2013
Outline

• Background
  Cloud Computing
  Use Case
• CheR
  Modeling the Problem
  Current Prototype
• Results
  Tests
• Conclusion and Future Work
Part I

Background
Computational outsourcing
• is needed when:
  fast, complex computations are required;
  very large data sets must be processed.
• is successful because cloud computing offers:
  cheap computing/storage resources;
  scalable computing resources.

Motivation
However, computational outsourcing does NOT guarantee:
• correctness of computations;
• availability of computations;
• timeliness of results.
In particular, cheating:
• allows computing nodes to save resources;
• if undetected, leads to erroneous results.

Use Case

• A sysadmin needs to rent some cloud resources to perform a parallel task over a large data set.
• Two main scenarios exist:
1. the computation needs to end reliably within a given timeframe, and the sysadmin is willing to spend as little as possible;
2. the sysadmin has a fixed maximum budget and has to reliably compute the function while minimizing the required time.
Use Case

Figure: Use Case, where node N2 is a cheater.

Possible Solution?

• Use only “trusted” nodes:
  no parallelization;
  not cost-effective.
• The main goal here is to guarantee timeliness, cost-effectiveness, and correctness of the computed results.
Part II

CheR

Generalizing the Problem (1)

• There are n cloud nodes that host the computation.
• At most k nodes are rational adversaries.
• The remaining (n − k) cloud nodes are non-malicious, well-behaved nodes.
• We model the adversary as a static cheater:
  it fakes its computations with a fixed probability.

Generalizing the Problem (2)

• Cloud nodes are required to compute a parallel function f over an input vector X of length m.
• The output is a vector of length m whose j-th element is f(x_j).
• Each of the n cloud nodes n_i can compute a subset (possibly overlapping) of the input vector.
• Each node n_i has an associated unit cost per operation c_i, and the time it takes to perform a unit operation is t_i.

Generalizing the Problem (3)

• The fraction of cheaters (CheaterRate) is k/n.
• The system administrator has to send multiple (possibly overlapping) subsets of X to the nodes to be confident that the output is correct.
• The confidence threshold is DetConf.
• The goal of the system administrator is to:
  minimize the total cost of the operations,
  given a maximum computation time Tmax,
  with all results correct with an error probability below DetConf.

Assignment Matrix

• To model the scenario we use an n × m assignment matrix M:
  M_{i,j} = 1 means that node n_i receives element x_j, on which it computes the function f.
• Row indexes correspond to nodes, with 1 ≤ i ≤ n;
• Column indexes correspond to elements of the input vector, with 1 ≤ j ≤ m.

An Example

       VM1  VM2  VM3  VM4  VM5
cost   c1   c2   c3   c4   c5
time   t1   t2   t3   t4   t5

Table: Cost and Time Vectors

       x1  x2  x3  x4  x5  x6  x7
VM1     1   0   1   0   0   1   0
VM2     0   1   0   1   1   0   1
VM3     1   0   0   1   0   0   1
VM4     0   1   0   1   1   0   1
VM5     0   1   1   0   1   1   0

Table: Assignment Matrix of the Example
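To make the example concrete, here is a minimal Java sketch that encodes the assignment matrix above. The costs and times stay symbolic in the slides, so the sketch only derives the two structural quantities the model reasons about: each node's workload (row sum, which drives C_i and T_i) and each element's replication count (column sum, which must satisfy Repl(j)):

```java
// Minimal sketch of the example assignment matrix above.
public class AssignmentExample {
    // Rows = VM1..VM5, columns = x1..x7 (1 = element assigned to node).
    static final int[][] M = {
        {1, 0, 1, 0, 0, 1, 0},  // VM1
        {0, 1, 0, 1, 1, 0, 1},  // VM2
        {1, 0, 0, 1, 0, 0, 1},  // VM3
        {0, 1, 0, 1, 1, 0, 1},  // VM4
        {0, 1, 1, 0, 1, 1, 0},  // VM5
    };

    // Number of elements assigned to node i (drives its cost C_i and time T_i).
    static int rowSum(int i) {
        int s = 0;
        for (int v : M[i]) s += v;
        return s;
    }

    // Number of replicas of element j (must be at least Repl(j)).
    static int colSum(int j) {
        int s = 0;
        for (int[] row : M) s += row[j];
        return s;
    }

    public static void main(String[] args) {
        for (int i = 0; i < M.length; i++)
            System.out.println("VM" + (i + 1) + " processes " + rowSum(i) + " elements");
        for (int j = 0; j < M[0].length; j++)
            System.out.println("x" + (j + 1) + " is replicated " + colSum(j) + " times");
    }
}
```

In this example every element is replicated either 2 or 3 times, so any Repl(j) ≤ 2 would already be satisfied.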


Modeling the Problem

The total cost C_i of the operations performed by the i-th node is:

    C_i = \sum_{j=1}^{m} c_i \cdot M_{i,j} = c_i \cdot \sum_{j=1}^{m} M_{i,j}    (1)

The total time T_i of the i-th node is:

    T_i = \sum_{j=1}^{m} t_i \cdot M_{i,j} = t_i \cdot \sum_{j=1}^{m} M_{i,j}    (2)

We can formulate the problem as a Linear Programming model:

    Minimize \sum_{i=1}^{n} \sum_{j=1}^{m} c_i \cdot M_{i,j}    (3)

subject to:

    \sum_{j=1}^{m} t_i \cdot M_{i,j} \le T_{max} \quad \forall i    (4)
Modeling the Problem

• Each input element can also be processed by a cheater node.
• To this end, we introduce the following constraint:

    \sum_{i=1}^{n} M_{i,j} \ge Repl(j) \quad \forall j    (5)

  where Repl(j) is the number of replicas required for input element j according to the confidence level DetConf.
• Replication lowers the chance of wrong results due to cheater nodes.
• The number of requested replicas is computed using the hypergeometric distribution, by bounding the probability that at least half of the replicas of an element are given to, and processed by, cheater nodes.
Number of Replicas

• The hypergeometric distribution describes the probability of k successes in n draws, without replacement, from a finite population of size N containing exactly K successes.
• Consider the hypergeometric distribution H(n, h, r), where n is the number of nodes, h is the number of cheaters, and r is the number of replicas drawn.
• Find the smallest r, i.e. the number of replicas for each input element (i.e., Repl(j)), for which the following holds:

    \sum_{i=\lceil r/2 \rceil}^{\min(r,h)} P(i) = \sum_{i=\lceil r/2 \rceil}^{\min(r,h)} \frac{\binom{h}{i}\,\binom{n-h}{r-i}}{\binom{n}{r}} < DetConfLocal    (6)
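The search for the smallest r in Eq. (6) can be sketched in a few lines of Java. This is an illustrative reimplementation, not the CheR prototype's code: it assumes the "at least half on cheaters" reading of the lower summation bound (i starts at ⌈r/2⌉) and caps the sum at min(r, h), where the remaining hypergeometric terms are zero anyway:

```java
// Sketch of the replica computation from Eq. (6): find the smallest r such
// that the probability that at least half of the r replicas of an element
// land on cheater nodes stays below the confidence threshold. Assumes the
// hypergeometric model H(n, h, r): n nodes, h cheaters, r draws.
public class Replicas {
    // Binomial coefficient as a double (n is small here, e.g. 100).
    static double choose(int n, int k) {
        if (k < 0 || k > n) return 0.0;
        double c = 1.0;
        for (int i = 1; i <= k; i++) c = c * (n - k + i) / i;
        return c;
    }

    // P(exactly i cheaters among r draws from n nodes containing h cheaters).
    static double hyper(int n, int h, int r, int i) {
        return choose(h, i) * choose(n - h, r - i) / choose(n, r);
    }

    // Smallest r with sum_{i=ceil(r/2)}^{min(r,h)} P(i) < detConf.
    static int minReplicas(int n, int h, double detConf) {
        for (int r = 1; r <= n; r++) {
            double p = 0.0;
            for (int i = (r + 1) / 2; i <= Math.min(r, h); i++)
                p += hyper(n, h, r, i);
            if (p < detConf) return r;
        }
        return -1; // no feasible replication factor
    }

    public static void main(String[] args) {
        // Testbed values from the slides: 100 nodes, 5 cheaters.
        System.out.println(Replicas.minReplicas(100, 5, 0.01));
    }
}
```

With n = 100, h = 5 and a threshold of 0.01, the first r that works is 3: for r = 3 the probability that at least 2 of the 3 replicas hit cheaters is about 0.0059.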

Modeling the Problem

• Finally, we declare the binary decision variables that determine which input elements are assigned to which nodes:

    M_{i,j} \in \{0, 1\} \quad \forall i, j    (7)

• Once this LP model is solved, if a solution exists, an optimal assignment of input elements to cloud nodes is returned that satisfies all the requirements imposed by the system administrator.

Modeling the Problem

The complete model can be written as:

    Minimize    \sum_{i=1}^{n} \sum_{j=1}^{m} c_i \cdot M_{i,j}
    subject to  \sum_{j=1}^{m} M_{i,j} \le Max(i),    1 \le i \le n
                \sum_{i=1}^{n} M_{i,j} \ge Repl(j),   1 \le j \le m
                M_{i,j} \in \{0, 1\},                 1 \le i \le n, 1 \le j \le m

Table: LP Model
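The model in the table can be emitted in a solver-readable form. The sketch below is illustrative, not the prototype's actual GLPK-binding code: it writes the model in the CPLEX LP text format, which GLPK's glpsol accepts via --lp, using hypothetical variable names x_i_j for M_{i,j}:

```java
// Illustrative generator for the LP model above, in CPLEX LP text format
// (usable as: glpsol --lp model.lp). Variable x_i_j stands for M_{i,j}.
public class LpModelWriter {
    static String build(double[] c, int[] max, int[] repl) {
        int n = c.length, m = repl.length;
        StringBuilder lp = new StringBuilder("Minimize\n obj:");
        for (int i = 0; i < n; i++)                 // objective: sum c_i * M_ij
            for (int j = 0; j < m; j++)
                lp.append(" + ").append(c[i]).append(" x_").append(i).append("_").append(j);
        lp.append("\nSubject To\n");
        for (int i = 0; i < n; i++) {               // capacity: sum_j M_ij <= Max(i)
            lp.append(" cap_").append(i).append(":");
            for (int j = 0; j < m; j++)
                lp.append(" + x_").append(i).append("_").append(j);
            lp.append(" <= ").append(max[i]).append("\n");
        }
        for (int j = 0; j < m; j++) {               // replication: sum_i M_ij >= Repl(j)
            lp.append(" rep_").append(j).append(":");
            for (int i = 0; i < n; i++)
                lp.append(" + x_").append(i).append("_").append(j);
            lp.append(" >= ").append(repl[j]).append("\n");
        }
        lp.append("Binary\n");                      // M_ij in {0, 1}
        for (int i = 0; i < n; i++)
            for (int j = 0; j < m; j++)
                lp.append(" x_").append(i).append("_").append(j);
        lp.append("\nEnd\n");
        return lp.toString();
    }

    public static void main(String[] args) {
        // Toy instance: 2 nodes, 3 elements, every element replicated twice.
        System.out.println(build(new double[]{1.0, 2.0}, new int[]{3, 3}, new int[]{2, 2, 2}));
    }
}
```

The binary assignment variables technically make this an integer (0-1) program; GLPK handles both the LP relaxation and the integer model.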

CheR Implementation

• CheR receives the cloud scenario as input, creates a linear programming problem, and returns an optimal solution.
• Implemented in Java.
• GLPK is used as the LP solver.
• It also includes a simulator/validator.

CheR Implementation

The CheR meta-program takes the following parameters as input:
• the number of nodes n;
• the number of input elements m;
• the confidence level DetConf;
• the features of the available nodes:
  the time t_i required to process a single element,
  the cost per unit c_i;
• the time constraint Tmax on the termination of the reliable distributed computation:
  this means that each node n_i can process at most Max(i) elements.

CheR first computes the number of replicas and then outputs the LP model, which is then fed to the LP solver.
Part III

Results

Testbed

• To model an Amazon-like cloud, 5 node typologies were tested:
  ranging from Medium to XXXLarge, according to real (normalized) costs and performance.
• The number of nodes n is set to 100:
  20 nodes for each of the 5 typologies.
• There are 5 static cheaters, i.e. 5% of the total nodes.
• m = 1,000 input elements are considered for each round.

Tested Scenarios

• We devised four scenarios, where Tmax is set so that nodes can process a different (decreasing) number of elements.
• These four scenarios depict four different alternatives:
1. extreme cost savings;
2. moderate balance requirements;
3. tight time constraints and cost containment;
4. extremely tight time constraints.
Matrix Assignment 1

Figure: Bitmap representing an actual assignment of elements (x-axis) to nodes (y-axis) for scenario E1, targeted at cost savings.

Matrix Assignment 2

Figure: Bitmap representing an actual assignment of elements (x-axis) to nodes (y-axis) for scenario E2, targeted at moderate cost savings.

Matrix Assignment 3

Figure: Bitmap representing an actual assignment of elements (x-axis) to nodes (y-axis) for scenario E3, targeted at moderate time constraints.

Matrix Assignment 4

Figure: Bitmap representing an actual assignment of elements (x-axis) to nodes (y-axis) for scenario E4, targeted at extremely tight time constraints.
Performance

Both the execution time and the memory requirements are O(n × m).

Figure: Execution Time of the Prototype

Validation Tests

• We ran several simulation tests that exploit the assignment matrix of E4:
  to discover whether any wrong result would be erroneously accepted as correct by the collector.
• For each test, each experiment was repeated several times, randomly choosing the set of k cheaters.
• In the end, for each test, we counted:
1. the number of wrong results;
2. the number of tests containing at least one wrong result.

Validation Tests

• Select a cheating probability P for each static cheater:
  i.e. how often a cheater returns a fake result.
• Consequently, a cheater that returns wrong results more often is easier to detect.
• Conversely, a cheater that returns fewer wrong results is harder to detect.
• The central collector can discover a cheater by computing a divergence index for each node:
  keeping track of how many times the node has given responses that differ from the others' on the same input (the percentage of results in which the cheater was in the minority).
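The slides do not give the collector's exact formula, so the following is a hedged Java sketch of one plausible reading: for every input element, take the majority answer over its replicas, and define a node's divergence index as the fraction of its answers that ended up in the minority:

```java
// Sketch of the collector's divergence index (assumed majority-vote reading):
// a node's index is the fraction of its answers that disagreed with the
// per-element majority answer, i.e. where the node was in the minority.
import java.util.*;

public class DivergenceIndex {
    // answers.get(j) maps nodeId -> answer returned for input element j.
    static Map<Integer, Double> compute(List<Map<Integer, Long>> answers) {
        Map<Integer, Integer> total = new HashMap<>(), minority = new HashMap<>();
        for (Map<Integer, Long> perElement : answers) {
            // Majority answer for this element across its replicas.
            Map<Long, Integer> counts = new HashMap<>();
            for (long a : perElement.values()) counts.merge(a, 1, Integer::sum);
            long majority = Collections.max(counts.entrySet(),
                    Map.Entry.comparingByValue()).getKey();
            // Charge each node that answered; flag those in the minority.
            for (Map.Entry<Integer, Long> e : perElement.entrySet()) {
                total.merge(e.getKey(), 1, Integer::sum);
                if (e.getValue() != majority) minority.merge(e.getKey(), 1, Integer::sum);
            }
        }
        Map<Integer, Double> index = new HashMap<>();
        for (int node : total.keySet())
            index.put(node, minority.getOrDefault(node, 0) / (double) total.get(node));
        return index;
    }

    public static void main(String[] args) {
        List<Map<Integer, Long>> answers = new ArrayList<>();
        answers.add(Map.of(1, 42L, 2, 42L, 3, 7L)); // node 3 disagrees on element 0
        answers.add(Map.of(1, 10L, 3, 10L));        // all replicas agree on element 1
        System.out.println(compute(answers));
    }
}
```

In the toy run above, node 3 disagrees on one of its two assigned elements, so its index is 0.5, while the honest nodes score 0.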

Validation Tests

P      FNT      FNE (Avg)           FNTC       FNEC (Avg)
1      0.236%   0.006%    (6.1)     0.00375%   0.0025%    (2.445)
0.9    0.223%   0.0045%   (4.5)     0.022%     0.0012%    (1.17)
0.8    0.228%   0.0038%   (3.8)     0.0075%    3.393E-4%  (0.34)
0.75   0.226%   0.0036%   (3.45)    0.0047%    2.144E-4%  (0.21)
0.66   0.208%   0.0025%   (2.5)     0.0011%    5.14E-5%   (0.05)
0.6    0.2%     0.0021%   (2.1)     6.0E-4%    1.267E-5%  (0.0127)
0.5    0.19%    0.0015%   (1.5)     0%         0%         (0)
0.4    0.17%    9.436E-4% (0.94)    0%         0%         (0)
0.33   0.16%    6.573E-4% (0.66)    0%         0%         (0)
0.3    0.16%    5.358E-4% (0.54)    0%         0%         (0)
0.25   0.14%    3.77E-4%  (0.38)    0%         0%         (0)
0.2    0.12%    2.5E-4%   (0.25)    0%         0%         (0)
0.1    0.05%    6.14E-5%  (0.06)    0%         0%         (0)

Table: Results of the Validation Tests

P: cheating probability of static cheaters, i.e. how often a cheater cheats.
FNT: % of false negative tests, i.e. ratio of failed tests without centralized control (at least one wrong result in an experiment).
FNE: % of false negative elements, i.e. ratio of wrong results without centralized control (over all the experiments).
FNTC: % of false negative tests in the centralized scenario, i.e. ratio of failed tests with centralized control (at least one wrong result in an experiment).
FNEC: % of false negative elements in the centralized scenario, i.e. ratio of wrong results with centralized control (over all the experiments).

Discussion

• As the maximum execution time is compressed, the workload, including the replicated computations, seamlessly shifts toward the bottom rows, which represent the more costly but faster nodes.
• For large problem sizes this matchmaking is nontrivial, so an automated approach is needed.
• The assignment matrix strives to pair elements with less costly nodes whenever the time constraints are still satisfied.

Conclusion

• CheR is a model for the reliable execution of a workload over a large number of heterogeneous computing nodes:
  some nodes can cheat according to identified adversary models.
• CheR helps sysadmins who are assessing how to distribute a computation over a cloud to reason about the possible scenarios, the available approaches, and their convenience and feasibility.
• CheR provides probabilistic assurance that the result of the distributed computation is not affected by misbehaving nodes.
• Experimental evidence based on a real-world cloud provider such as Amazon shows the viability of CheR.

Future Work

• Develop resilience approaches against smart cheaters.
• Consider the trust of cheaters:
  past behaviors.
• Dynamically change the assignment matrix at each round.
• Further improvements: paper accepted at the 10th IEEE International Conference on Autonomic and Trusted Computing:
  “AntiCheetah: an Autonomic Multi-round Approach for Reliable Computing”.

Questions?
E-mail:
[email protected]
[email protected]
