Cloud Complete
Cloud Complete
• Part I: Introduction
• Part II: Security and Privacy Issues in Cloud Computing
• Part III: Possible Solutions
3
Part I. Introduction
4
Cloud Computing Background
• Features
– Use of internet-based services to support business process
– Rent IT-services on a utility-like basis
• Attributes
– Rapid deployment
– Low startup costs/ capital investments
– Costs based on usage or subscription
– Multi-tenant sharing of services/ resources
• Essential characteristics
– On demand self-service
– Ubiquitous network access
– Location independent resource pooling
– Rapid elasticity
– Measured service
• “Cloud computing is a compilation of existing techniques and
technologies, packaged within a new infrastructure paradigm that offers
improved scalability, elasticity, business agility, faster startup time,
reduced management costs, and just-in-time availability of resources”
• Delivery Models
– SaaS
– PaaS
– IaaS
• Deployment Models
– Private cloud
– Community cloud
– Public cloud
– Hybrid cloud
• We propose one more Model: Management Models (trust and tenancy
issues)
– Self-managed
– 3rd party managed (e.g. public clouds and VPC)
10
From [6] Cloud Security and Privacy by Mather and Kumaraswamy
If cloud computing is so great,
why isn’t everyone doing it?
• The cloud acts as a big black box, nothing inside the cloud is visible
to the clients
• Clients have no idea or control over what happens inside a cloud
• Even if the cloud provider is honest, it can have malicious system
admins who can tamper with the VMs and violate confidentiality and
integrity
• Clouds are still subject to traditional data confidentiality, integrity,
availability, and privacy issues, plus some additional attacks
11
Companies are still afraid to use clouds
[Chow09ccs
w] 12
Causes of Problems Associated
with Cloud Computing
• Confidentiality
– Fear of loss of control over data
• Will the sensitive data stored on a cloud remain confidential?
• Will cloud compromises leak confidential client data
– Will the cloud provider itself be honest and won’t peek
into the data?
• Integrity
– How do I know that the cloud provider is doing the
computations correctly?
– How do I ensure that the cloud provider really stored
my data without tampering with it?
17
Taxonomy of Fear (cont.)
• Availability
– Will critical systems go down at the client, if the
provider is attacked in a Denial of Service attack?
– What happens if cloud provider goes out of business?
– Would cloud scale well-enough?
– Often-voiced concern
• Although cloud providers argue their downtime compares well with
cloud user’s own data centers
18
Taxonomy of Fear (cont.)
19
Taxonomy of Fear (cont.)
20
Taxonomy of Fear (cont.)
(http://www.exforsys.com/tutorials/cloud-computing/cloud-computing-security.html)
21
Threat Model
22
Threat Model
• Basic components
– Attacker modeling
• Choose what attacker to consider
– insider vs. outsider?
– single vs. collaborator?
• Attacker motivation and capabilities
– Attacker goals
– Vulnerabilities / threats
23
What is the issue?
24
Attacker Capability: Malicious Insiders
• At client
– Learn passwords/authentication information
– Gain control of the VMs
• At cloud provider
– Log client communication
– Can read unencrypted data
– Can possibly peek into VMs, or make copies of VMs
– Can monitor network communication, application
patterns
– Why?
• Gain information about client data
• Gain information on client behavior
• Sell the information or use itself
25
Attacker Capability: Outside attacker
• What?
– Listen to network traffic (passive)
– Insert malicious traffic (active)
– Probe cloud structure (active)
– Launch DoS
• Goal?
– Intrusion
– Network analysis
– Man in the middle
– Cartography
26
Challenges for the attacker
27
Part II: Security and Privacy Issues
in Cloud Computing - Big Picture
• Infrastructure Security
• Data Security and Storage
• Identity and Access Management (IAM)
• Privacy
• And more…
28
Infrastructure Security
• Network Level
• Host Level
• Application Level
29
The Network Level
30
The Network Level - Mitigation
31
The Host Level
• SaaS/PaaS
– Both the PaaS and SaaS platforms abstract and hide
the host OS from end users
– Host security responsibilities are transferred to the
CSP (Cloud Service Provider)
• You do not have to worry about protecting hosts
– However, as a customer, you still own the risk of
managing information hosted in the cloud services.
32
The Host Level (cont.)
33
From [6] Cloud Security and Privacy by Mather and Kumaraswamy
Case study: Amazon's EC2 infrastructure
34
Local Host Security
36
The Application Level
• DoS
• EDoS(Economic Denial of Sustainability)
– An attack against the billing model that underlies the
cost of providing a service with the goal of bankrupting
the service itself.
• End user security
• Who is responsible for Web application security in the cloud?
• SaaS/PaaS/IaaS application security
• Customer-deployed application security
37
Data Security and Storage
• Data remanence
– Inadvertent disclosure of sensitive information is possible
• Data security mitigation?
– Do not place any sensitive data in a public cloud
– Encrypted data is placed into the cloud?
• Provider data and its security: storage
– To the extent that quantities of data from many
companies are centralized, this collection can become an
attractive target for criminals
– Moreover, the physical security of the data center and the
trustworthiness of system administrators take on new
importance.
42
What is Privacy?
e.g., Suppose a hacker breaks into Cloud Provider A and steals data from
• Data breaches
Company X.have a cascading effect
• Full reliance
Assume that on the
a third party toserver
compromised protectalso personal data?
contained data from Companies Y
and Z.
• In-depth understanding of responsible data stewardship
• • Who investigates
Organizations can transfer liability, but not accountability
this crime?
• Is it the Cloud Provider, even though Company X may fear that
• Risk assessment and
the provider will trymitigation throughout
to absolve itself the data life cycle is
from responsibility?
critical.
• Is it Company X and, if so, does it have the right to see other data on that
• Many server,
new risks and unknowns
including logs that may show access to the data of Companies Y and Z?
– The overall complexity of privacy protection in the
cloud represents a bigger challenge.
52
Security Issues in the Cloud
• Confidentiality issues
• Malicious behavior by cloud provider
• Known risks exist in any industry practicing outsourcing
• Provider and its infrastructure needs to be trusted
New Vulnerabilities & Attacks
• Collaborative attacks
• Mapping of internal cloud infrastructure
• Identifying likely residence of a target VM
• Instantiating new VMs until one gets co-
resident with the target
• Cross-VM side-channel attacks
• Extract information from target VM on the
same machine
More on attacks…
59
Minimize Lack of Trust:
Policy Language
• Consumers have specific security needs but don’t have a say-so in
how they are handled
– What the heck is the provider doing for me?
– Currently consumers cannot dictate their requirements
to the provider (SLAs are one-sided)
• Standard language to convey one’s policies and expectations
– Agreed upon and upheld by both parties
– Standard language for representing SLAs
– Can be used in a intra-cloud environment to realize
overarching security posture
Minimize Lack of Trust:
Policy Language (Cont.)
• Create policy language with the following characteristics:
– Machine-understandable (or at least processable),
– Easy to combine/merge and compare
– Examples of policy statements are, “requires isolation
between VMs”, “requires geographical isolation between
VMs”, “requires physical separation between other
communities/tenants that are in the same industry,”
etc.
– Need a validation tool to check that the policy created
in the standard language correctly reflects the policy
creator’s intentions (i.e. that the policy language is
semantically equivalent to the user’s intentions).
61
Minimize Lack of Trust: Certification
• Certification
– Some form of reputable, independent, comparable
assessment and description of security features and
assurance
– Sarbanes-Oxley, DIACAP, DISTCAP, etc (are they
sufficient for a cloud environment?)
• Risk assessment
– Performed by certified third parties
– Provides consumers with additional assurance
Minimize Loss of Control
- MONITORING
- UTILIZING DIFFERENT CLOUDS
- ACCESS CONTROL MANAGEMENT
- IDENTITY MANAGEMENT (IDM)
63
Minimize Loss of Control:
Monitoring
• Cloud consumer needs situational awareness for critical applications
– When underlying components fail, what is the effect of
the failure to the mission logic
– What recovery measures can be taken (by provider and
consumer)
• Requires an application-specific run-time monitoring and management
tool for the consumer
– The cloud consumer and cloud provider have different
views of the system
– Enable both the provider and tenants to monitor the
components in the cloud that are under their control
Minimize Loss of Control:
Monitoring (Cont.)
– Provide mechanisms that enable the provider to
act on attacks he can handle.
• infrastructure remapping (create new or move
existing fault domains)
• shutting down offending components or targets
(and assisting tenants with porting if necessary
• Repairs
– Provide mechanisms that enable the consumer to
act on attacks that he can handle (application-level
monitoring).
• RAdAC (Risk-adaptable Access Control)
• VM porting with remote attestation of target
physical host
• Provide ability to move the user’s application to
another cloud
65
Minimize Loss of Control:
Utilize Different Clouds
• The concept of ‘Don’t put all your eggs in one basket’
– Consumer may use services from different clouds through an
intra-cloud or multi-cloud architecture
– Propose a multi-cloud or intra-cloud architecture in which
consumers
• Spread the risk
• Increase redundancy (per-task or per-application)
• Increase chance of mission completion for critical applications
– Possible issues to consider:
• Policy incompatibility (combined, what is the overarching policy?)
• Data dependency between clouds
• Differing data semantics across clouds
• Knowing when to utilize the redundancy feature (monitoring
technology)
• Is it worth it to spread your sensitive data across multiple clouds?
– Redundancy could increase risk of exposure
Minimize Loss of Control:
Access Control
• Many possible layers of access control
– E.g. access to the cloud, access to servers, access to
services, access to databases (direct and queries via web
services), access to VMs, and access to objects within a VM
– Depending on the deployment model used, some of these will
be controlled by the provider and others by the consumer
• Regardless of deployment model, provider needs to manage the user
authentication and access control procedures (to the cloud)
– Federated Identity Management: access control management
burden still lies with the provider
– Requires user to place a large amount of trust on the
provider in terms of security, management, and maintenance
of access control policies. This can be burdensome when
numerous users from different organizations with different
access control policies, are involved
Minimize Loss of Control:
Access Control (Cont.)
• Consumer-managed access control
– Consumer retains decision-making process to retain
some control, requiring less trust of the provider
(i.e. PDP is in consumer’s domain)
– Requires the client and provider to have a pre-
existing trust relationship, as well as a pre-
negotiated standard way of describing resources,
users, and access decisions between the cloud
provider and consumer. It also needs to be able to
guarantee that the provider will uphold the
consumer-side’s access decisions.
– Should be at least as secure as the traditional
access control model.
– Facebook and Google Apps do this to some degree,
but not enough control
– Applicability to privacy of patient health records 68
Minimize Loss of Control:
Access Control
1. Authn request
3. Resource request (XACML Request) + SAML assertion IDP
2. SAML Assertion
. access requests
from all client 7. Send signed and encrypted ticket
for cloud
resource
ACM
. domains) on Domain A
(XACML
policies)
User on Amazon
Cloud
1. Name
2. E-mail
3. Password
4. Billing Address 1. Name
5. Shipping Address 2. Billing Address
6. Credit Card 3. Credit Card
1. Name
2. E-mail
3. Password
4. Billing Address
5. Shipping Address
6. Credit Card
1. Name
2. E-mail
3. Shipping Address
1. Name
2. E-mail
3. Shipping Address
Minimize Loss of Control: IDM
Identity in the Cloud
User on Amazon
Cloud
1. Name
2. Billing Address
3. Credit Card
1. Name
2. E-mail
3. Password
4. Billing Address
5. Shipping Address
6. Credit Card
Minimize Loss of Control: IDM
Present IDMs
• IDM in traditional application-centric IDM model
– Each application keeps track of identifying information of its
users.
• Existing IDM Systems
– Microsoft Windows CardSpace [W. A. Alrodhan]
– OpenID [http://openid.net]
– PRIME [S. F. Hubner]
Anonymous Identification
(Shamir's approach for Credit Cards)
• IdP provides Encrypted Identity Information to the user and SP.
• SP and User interact
• Both run IdP's public function on the certain bits of the Encrypted data.
• Both exchange results and agree if it matches.
Minimize Loss of Control: IDM
Usage Scenario (Approach – 1)
Minimize Loss of Control: IDM
Approach - 2
• Active Bundle scheme to protect PII from untrusted hosts
• Predicates over encrypted data to authenticate without disclosing
unencrypted identity data.
• Multi-party computing to be independent of a trusted third party
Minimize Loss of Control: IDM
Usage Scenario (Approach – 2)
• Owner O encrypts Identity Data(PII) using algorithm Encrypt and O’s
public key PK. Encrypt outputs CT—the encrypted PII.
• SP transforms his request for PII to a predicate represented by function
p.
• SP sends shares of p to the n parties who hold the shares of MSK.
• n parties execute together KeyGen using PK, MSK, and p, and return TKp
to SP.
• SP calls the algorithm Query that takes as input PK, CT, TKp and
produces p(PII) which is the evaluation of the predicate.
• The owner O is allowed to use the service only when the predicate
evaluates to “true”.
Minimize Loss of Control: IDM
Representation of identity information
for negotiation
Token/Pseudonym
Identity Information in clear plain text
Active Bundle
Minimize Loss of Control: IDM
Motivation-Authentication Process using PII
• Active Bundle
• Anonymous Identification
• Computing Predicates with encrypted data
• Multi-Party Computing
• Selective Disclosure
Proposed IDM:
Active Bundle
85
Proposed IDM:
Active Bundle Scheme
– Metadata:
• Access control policies
• Data integrity checks
• Dissemination policies
• Life duration
• ID of a trust server
• ID of a security server
• E(Name)
• E(E-mail) • App-dependent information
• E(Password) • …
• E(Shipping Address)
E(Billing Address)
– Sensitive Data:
•
• E(Credit Card)
• …
• Identity
Information
• ...
– Virtual Machine
(algorithm):
• Interprets metadata
• Checks active bundle integrity
• Enforces access and
dissemination control policies
* E( ) - Encrypted Information • …
Proposed IDM:
Anonymous Identification
• Use of Zero-knowledge proofing for user authentication
without disclosing its identifier.
User on Amazon
Cloud ZKP Interactive Protocol
User Application
Active Bundle
Active Bundle Active
Creator Bundle (AB)
Directory
Facilitator
Trust Evaluation
Active Bundle Coordinator Agent (TEA)
Predicate Request*
• E-mail
• E(Name)
• Password
• E(Billing
• E(Name)
Address)
• E(Shipping Address)
• E(Credit Card)
• E(Billing Address)
• E(Credit Card)
Predicate Request
• E(Name)
• E(Billing
Address)
• E(Credit Card)
Predicate Reply*
• Name
• Billing Address
• Credit Card
*Age Verified
*Credit Card Verified
Proposed IDM:
Selective Disclosure
• User Policies in the Active Bundle dictate dissemination
Selective disclosure*
• E-mail • E(E-mail)
• Password • E(Name)
• E(Name) • E(Shipping
• E(Shipping Address) Address)
• E(Billing Address)
• E(Credit Card)
Selective disclosure*
• E-mail • E-mail
• Password • E(Name)
• E(Name) • E(Shipping
• E(Shipping Address) Address)
• E(Billing Address)
• E(Credit Card)
Selective disclosure*
• E-mail • E(Name)
• E(Name) • E(Shipping
• E(Shipping Address) Address)
*e-bay seller shares the encrypted information based on the user policy
Proposed IDM:
Selective Disclosure
Selective disclosure
• E-mail • Name
• E(Name) • Shipping Address
• E(Shipping Address)
Selective disclosure
• E-mail • Name
• E(Name) • Shipping Address
• E(Shipping Address)
User on Amazon
Cloud
1. E-mail
2. Password
1. Name
2. Billing Address
3. Credit Card
1. Name
2. E-mail
3. Password
4. Billing Address
5. Shipping Address
6. Credit Card
1. E-mail
1. Name
2. Shipping Address
Proposed IDM:
Characteristics and Advantages
• Ability to use Identity data on untrusted hosts
• Self Integrity Check
100
Minimize Multi-tenancy
wireless bandwidth
Various studies have identified longer battery
lifetime as the most desired feature of such
systems.
the energy cost per second when the mobile phone is idle.
pi
the energy cost per second when the mobile is transmission the data.
ptr
Suppose the server is F times faster—that is, S
= F × M. We can rewrite the formula as
• Set up
o 19 EC2 small instances (US East)
o 342 links between VMs
o Ubuntu 10.04 server version
• ***Make a difference***
Bring mobile technology in the daily lives of blind and visually
impaired people to help achieve a higher standard of life
• Take a major step in context-aware navigation of the blind and
visually impaired
• Bridge the gap between the needs and available technology
• Guide users in a non-overwhelming way
• Protect user privacy
Challenges
• Real-time guidance
• Portability
• Power limitations
• Appropriate interface
• Privacy preservation
• Continuous availability
• No dependence on infrastructure
• Low-cost solution
• Minimal training
Discussions
• RADAR/LIDAR
• Kay’s Sonic glasses (audio for 3D representation of environment)
(www.batforblind.co.nz)
• Sonic Pathfinder (www.sonicpathfinder.org) (notes of musical
scale to warn of obstacles)
• MiniGuide (www.gdp-research.com.au/) (vibration to indicate
object distance)
• VOICE (www.seeingwithsound.com) (images into sounds heard
from 3D auditory display)
• Tactile tongue display [6]
• …
Putting all together…
Services:
• Google Maps (outdoor navigation, pedestrian mode)
• Micello (indoor location-based service for mobile devices)
• Object recognition (Selectin software etc)
• Traffic assistance
• Obstacle avoidance (Time-of-flight camera technology)
• Speech interface (Android text-to-speech + speech recognition
servers)
• Remote vision
• Obstacle minimized route planning
Use of the Android Platform
Advantages of a Mobile-Cloud Collaborative
Approach
• Open architecture
• Extensibility
• Computational power
• Battery life
• Light weight
• Wealth of context-relevant information resources
• Interface options
• Minimal reliance on infrastructural requirements
Traffic Lights Status Detection Problem
LabelMe Dataset (
http://labelme.csail.mit.edu)
Relational Learning with Multiple Boosted
Detectors for Object Categorization
CloudCom ‘09
What did they do?
Proposed identity-based authentication for cloud computing, based
on the identity-based hierarchical model for cloud computing
(IBHMCC) and corresponding encryption and signature schemes
Being certificate-free, the authentication protocol aligned well with
demands of cloud computing
Identity-Based Hierarchical Model for Cloud
Computing (IBHMCC)
Define the identity of node is the DN
string from the root node to the current
node itself.
The identity of entity N is
ID_N = DN_0 || DN_M || DN_N
Deployment of IBHMCC
Root PKG setup and Low-level setup
Deployment of IBHMCC (cont.)
After that, all nodes in the level-1 get and securely keep their secret keys and
the secret points.
The public key and the Q-value are publicized.
Then, Each node in the level-1 similarly repeats the above steps (2-5).
Identity-Based Encryption
Identity-Based Encryption (cont.)
Identity-Based Signature
Identity-Based Authentication for Cloud
Computing
CloudCom ’09
What did they do?
Simple technique implemented with Open Source software solves the
confidentiality of data stored on Cloud Computing Infrastructure by
using public key encryption to render stored data at rest unreadable
by unauthorized personnel, including system administrators of the
cloud computing service on which the data is stored
Validated their approach on a network measurement system
implemented on PlanetLab
Used it on a service where confidentiality is critical – a scanning
application that validates external firewall implementations
Problem Scope
Goal is to ensure the confidentiality of data at rest
“Data at rest” means that the data that is stored in a readable form on
a Cloud Computing service, whether in a storage product like S3 or in
a virtual machine instance as in EC2
Problem Scope (cont.)
To protect data at rest, they want to prevent other users in the cloud
infrastructure who might have access to the same storage from
reading the data our process has stored
They also want to prevent system administrators who run the cloud
computing service from reading the data.
They assume that it is unlikely for an adversary to snoop on the
contents of memory.
If the adversary had that capability, it is unlikely that
we could trust the confidentiality of any of the data
that we generated there.
Problem Scope (cont.)
While the administrative staff of the cloud computing service could
theoretically monitor the data moving in memory before it is stored in
disk, we believe that administrative and legal controls should prevent
this from happening.
They also do not guard against the modification of the data at rest,
although we are likely to be able to detect this.
Solution Design
Solution Design (cont.)
On a trusted host, collect the encrypted data, as shown in Figure 3,
and decrypt it with the collection agent’s private key which stays on
that host. Note that in this case, we are in exclusive control of the
private key, which the cloud service provider has no view or control
over.
They will discuss this feature of our solution later.
Implementation Experiences
Implementation Experiences (cont.)
Privacy in a Semantic Cloud:
What’s Trust Got to Do with It?
• M. Armbrust, et al., "Above the Clouds: A Berkeley View of Cloud Computing," UC Berkeley
Reliable Adaptive Distributed Systems LaboratoryFebruary 10 2009.
• Cloud Security Alliance, "Security Guidance for Critical Areas of Focus in Cloud
Computing, ver. 2.1," 2009.
• M. Jensen, et al., "On Technical Security Issues in Cloud Computing," presented at the
2009 IEEE International Conference on Cloud Computing, Bangalore, India 2009.
• P. Mell and T. Grance, "Effectively and Securely Using the Cloud Computing Paradigm," ed:
National Institute of Standards and Technology, Information Technology Laboratory, 2009.
• N. Santos, et al., "Towards Trusted Cloud Computing," in Usenix 09 Hot Cloud Workshop ,
San Diego, CA, 2009.
• R. G. Lennon, et al., "Best practices in cloud computing: designing for the cloud,"
presented at the Proceeding of the 24th ACM SIGPLAN conference companion on Object
oriented programming systems languages and applications, Orlando, Florida, USA, 2009.
• P. Mell and T. Grance, "The NIST Definition of Cloud Computing (ver. 15)," National Institute
of Standards and Technology, Information Technology LaboratoryOctober 7 2009.
• C. Cachin, et al., "Trusting the cloud," SIGACT News, vol. 40, pp. 81-86, 2009.
• J. Heiser and M. Nicolett, "Assessing the Security Risks of Cloud Computing," Gartner
2008.
• A. Joch. (2009, June 18) Cloud Computing: Is It Secure Enough? Federal Computer Week .
• AWS Amazon EC2: http://aws.amazon.com/ec2/
• Amazon CloudWatch: http://aws.amazon.com/cloudwatch/
• Iperf: http://iperf.sourceforge.net/
Cloud Blind References
• L. Ran, A. Helal, and S. Moore, “Drishti: An Integrated Indoor/Outdoor Blind Navigation System
and Service,” 2nd IEEE Pervasive Computing Conference (PerCom 04).
• S.Willis, and A. Helal, “RFID Information Grid and Wearable Computing Solution to the Problem
of Wayfinding for the Blind User in a Campus Environment,” IEEE International Symposium on
Wearable Computers (ISWC 05).
• Y. Sonnenblick. “An Indoor Navigation System for Blind Individuals,” Proceedings of the 13th
Annual Conference on Technology and Persons with Disabilities, 1998.
• J. Wilson, B. N. Walker, J. Lindsay, C. Cambias, F. Dellaert. “SWAN: System for Wearable Audio
Navigation,” 11th IEEE International Symposium on Wearable Computers, 2007.
• J. Nicholson,
Nicholson, V. Kulyukin, D. Coster, “ShopTalk: Independent Blind Shopping Through Verbal
Route Directions and Barcode Scans,” The Open Rehabilitation Journal, vol. 2, 2009, pp. 11-
23.
• Bach-y-Rita, P., M.E. Tyler and K.A. Kaczmarek. “Seeing with the Brain,” International Journal
of Human-Computer Interaction, vol 15, issue 2, 2003, pp 285-295.
• Y.K. Kim, K.W. Kim, and X.Yang, “Real Time Traffic Light Recognition System for Color Vision
Deficiencies,” IEEE International Conference on Mechatronics and Automation (ICMA 07).
• R. Charette, and F. Nashashibi, “Real Time Visual Traffic Lights Recognition Based on Spot
Light Detection and Adaptive Traffic Lights Templates,” World Congress and Exhibition on
Intelligent Transport Systems and Services (ITS 09).
• A.Ess, B. Leibe, K. Schindler, and L. van Gool, “Moving Obstacle Detection in Highly Dynamic
Scenes,” IEEE International Conference on Robotics and Automation (ICRA 09).
• P. Angin, B. Bhargava, R. Ranchal, N. Singh, L. Lilien, L. B. Othmane, “A User-centric Approach
for Privacy and Identity Management in Cloud Computing,” submitted to SRDS 2010.