AWS S3 Quizet
AWS S3 Quizet
object storage
Amazon S3 is what type of storage?
NO - Amazon S3 objects are automatically replicated on multiple device in multiple
facilities within
a region.
With S3 do you have to worry about data durability or replication across availability
zones?
NO - if your request rate grows steadily, Amazon S3 automatically partitions buckets
to support very high request rates and simultaneous access by many clients.
With S3 do you have to worry about scalability?
Yes - not just within your own account
Are S3 bucket names global?
100
What's the default bucket amount per account?
Place your buckets in different regions. (Just because the names are global doesn't
mean you don't have to place it in a region)
What can you do with your buckets to minimize latency, satisfy data locality
concerns, or satisfy disaster recover and compliance needs.
0 bytes up to 5TB
What is the size range for Objects?
Unlimited
How many objects can a single bucket store?
Object = data (the file itself) + metadata (data about the file)
Every object in a bucket consists of what two parts?
1) Bucket 2) Key 3) Optional Version ID
What 3 pieces uniquely identity an Amazon S3 object.
1) Web services endpoint 2) Bucket Name 3) Object Key
What 3 pieces form the the S3 url?
"Will my dad still be there in the future?"
What does durability address?
"Can I access my data right now?"
What does availability address?
Amazon S3 is an eventually consistent system.
What type of consistency system is S3?
No. PUTS to new objects are classified as read-after-write consistency.
Is there a concern for PUTS to new objects?
YES. PUTS to existing objects as well as DELETES are actions classified as
eventual consistency.
Is there a concern for PUTS to existing objects or DELETES?
No. Updates to a single key are atomic. You will never retrieve an inconsistent mix of
data.
For reads to a single key will I ever get some old data and some new data?
Yes
Is S3 secure by default?
1) Through course-grained access controls (ACLS
2) Through fine-grained access controls (Bucket policies,IAM policies, and query
string authentication)
What two ways can you secure an S3 bucket?
READ,WRITE,FULL-CONTROL at the object or bucket level. ACLS are a legacy
access control mechanism created before IAM existed.
What type of coarse-grained permissions can you define in an S3 ACL
S3 policies are:
1) They are associated with the bucket resource instead of an IAM principle
2) They include an explicit reference to the IAM principal in the policy.
What are the differences between IAM policies and S3 bucket policies?
An Amazon S3 bucket policy
What type of control would allow me to specify what CIDR block or IP and during
what time of day can an entity access an S3 object.
Prefixes and delimiters
What allows you to organize,browse and retrieve objects objects within a bucket
hierarchically.
1) Amazon S3 Standard
2) Amazon S3 Standard - Infrequent Access
3) Amazon S3 Reduces Redundancy
Name the three Amazon S3 storage classes.
Short-term or long-term storage of frequently accessed data.
Amazon S3 Standard is best suited for?
Love-lived, less frequently accessed data. Data that is stored for longer of 30 days.
Amazon S3 Standard - Infrequent Access (Standard-IA) is best suited for?
S3
Simple Storage Service
What is S3?
Storage for the internet. Secure, durable, highly-scalable object storage. Can upload files, but
cannot install OS or software.
What can I do with S3?
Store and retrieve any amount of data, at any time, from anywhere on the web.
What size objects can be uploaded to S3?
1 byte - 5 TB. Largest object in a single PUT = 5GB.
What kind of data can be stored in S3?
Virtually any kind of data in any format.
How much storage is potentially available?
Unlimited (however much you can pay for)
How are files stored in S3?
In buckets (conceptually like folders)
True or False. Filenames in S3 do NOT have to be unique across regions.
False. S3 is a universal namespace, so names must be unique globally.
True or False. You can read immediately after adding a new object to S3.
True. Read after write consistency for PUTS of new Objects.
True or False. Updates and deletes in S3 will be visible immediately.
False. Eventual consistency for overwrite PUTS and DELETES (Updating or deleting objects
can take some time to propagate)
What are the S3 storage classes/tiers?
1. S3,
2. S3-IA,
3. S3-RRS,
4. Glacier
What is S3 standard tier?
For frequently accessed data. Low latency and high throughput. Availability = 99.99%.
Durability = 99.999999999% (11x9's). Use cases including cloud applications, dynamic
websites, content distribution, mobile and gaming applications, and big data analytics
What is S3-IA tier?
S3 Standard - Infrequent Access. Availability = 99.99%. Durability = 99.999999999%
(11x9's). Data is accessed less frequently, but requires rapid access when needed. Low per
GB storage price and per GB retrieval fee. Ideal for long-term storage, backups, and as a data
store for disaster recovery.
True or False. Data deleted from S3-IA tier within 30 days will be charged for a full 30 days.
True.
What is Glacier tier?
Secure, durable, and extremely low-cost ($0.01/GB/mo) storage service for data archiving.
Optimized for data that is rarely accessed and a retrieval time of several hours is suitable.
Charged for amount of storage, # requests, data transfer pricing
What does it cost to recover from Glacier?
Archive and Restore requests are priced from $0.05 per 1,000 requests. For large restores,
there is also a restore fee starting at $0.01 per gigabyte. Objects are restored to RRS, so you
are charged for RRS and Glacier until restored object is moved.
True or False. Data deleted from Glacier w/in 90 days are charged a fee.
True. There is a pro-rated charge of $0.03 per GB.
How much data can be restored from Glacier for free?
You can restore up to 5% of the data stored in Glacier for free each month.
What is S3-RRS tier?
Reduced Redundancy Storage. Store non-critical, reproducible data (storing thumbnails,
transcoded media, etc.) at lower levels of redundancy than S3's standard storage. Availability
= 99.99%, Durability = 99.99%. Data is replicated fewer times, so the cost is less
What are the identifying parts of an S3 object?
1. Key - The name,
2. Value - The data,
3. Version ID,
4. Metadata (system and/or user-defined),
5. ACLs
True or False. Bucket names don't have to be globally unique.
False. They must be globally unique AND lower case letters.
True or False. Uploaded objects are private by default.
True.
How many buckets can each account have?
100 by default.
True or False. All objects in a bucket are replicated when replication is enabled.
False. Pre-existing objects are not replicated, but future uploads are replicated.
True or False. Versioning is not a requirement for cross-region replication.
False. Cross-region replication requires versioning.
True or False. Versioning can only be disabled, not turned off.
True. To stop versioning completely, you must delete and recreate the bucket.
True or False. You have to pay for each version of a file.
True. Each file has its own version ID, which means it's taking up space, which has to be paid
for.
True or False. Lifecycle rules do NOT require versioning.
True, but you CAN use Lifecycle rules with versioning.
True or False. You can use multi-factor authentication with versioning.
True. Multi-factor authentication is used to enforce second authentication so objects are less
likely to be deleted accidentally.
What are lifecycle rules?
Rules you can set up to automatically transition items from one type of storage to another.
What are the types of lifecycle rules?
1. Transition to Standard - Infrequent Access Storage,
2. Archive to Glacier storage,
3. Permanently delete
Explain the 'Transition to Standard - Infrequent Access Storage Class' rule.
Must wait minimum 30 days and min 128kb file size. Fee for retrieval, but almost
instantaneous.
Explain the 'Archive to Glacier storage class' rule.
Can do 1 day after uploading (30 days after infrequently accessed). Fee for retrieval, takes 3-
5 hrs for retrieval.
Explain the 'Permanently Delete' rule.
Can do 1 day after uploading. If versioning enabled, must expire, then permanently delete.
True or False. You can use lifecycle rules for versions of objects.
True.
What is a CDN?
Content Delivery Network. Network of distributed servers that deliver webpages and content
users based on their geographic location (and other factors).
What is CloudFront used for?
Deliver your entire website, including dynamic, static, streaming, and interactive content
using a global network of edge locations.
How are request handled with CloudFront?
Requests are automatically routed to the nearest edge location, so content is delivered with
the best possible performance.
True or False. CloudFront can only be used with other AWS services.
False. CloudFront works with non-AWS origin servers, but is optimized to work with other
AWS services like S3, EC2, Elastic Load Balancing, and Route 53.
In CloudFront, what is an origin location?
Location of original, uncached, files. S3 bucket, EC2 instance, Elastic Load Balancer,
Route53, or custom
In CloudFront, what is an edge location?
Location where content will be cached. Different from Regions and Availability Zones.
True or False. Edge locations are read-only.
False. Can send the PUT messages, which will be forwarded to the Origin.
How are objects removed from edge locations?
Objects are removed for free after the TTL expires, but you can manually clear objects for a
fee.
What is a Distribution?
A collection of Edge Servers.
What are the types of distributions?
Web distribution and RTMP
What is a Web Distribution?
A distribution specifically for websites and static files (html, css, xml, etc)
What is RTMP?
Adobe's Real-Time Message Protocol. For media streaming (flash, etc). Allows an end user
to begin playing a media file before the file has finished downloading
Why would you want to restrict user access to a distribution?
If users access your objects directly in S3, they bypass the controls provided by CloudFront
signed URLs or signed cookies.
http://docs.aws.amazon.com/AmazonCloudFront/latest/DeveloperGuide/private-content-
restricting-access-to-s3.html
True or False. When a bucket is created it is private by default.
True.
How can you control access to a bucket.
Using bucket policies and ACLs.
True or False. There is no way to track who accesses a bucket.
False. You can configure buckets to store logs w/in the bucket or in another bucket.
What are the 3 methods of encryption?
1. SSL/TLS (in transit),
2. Server Side Encryption (SSE),
3. Client Side Encryption (CSE)
What is SSE-S3?
Server Side Encryption that is fully Amazon managed. S3 encrypts each object with a unique
key and it encrypts the key itself with a master key that it regularly rotates. Uses 256-bit
Advanced Encryption Standard (AES-256).
What is SSE-KMS?
Server Side Encryption Key Management Services. Combines secure, highly available
hardware and software to provide a key management system scaled for the cloud. AWS KMS
uses customer master keys (CMKs) to encrypt your S3 objects. Provides audit trail.
What is SSE-C?
Server Side Encryption with customer provided keys. You manage the keys, AWS manages
encryption/decryption when you write or read.
What is Client Side Encryption?
Refers to encrypting data before sending it to S3. Two options for using data encryption keys:
- You are NOT charged for data transfer between EC2 and S3 of same region
- You are CHARGED for data transfer between buckets of different regions via
COPY request.
- You are CHARGED for data transfer between EC2 and S3 belonging to different
regions
Default Access in S3
By default, only the bucket owners and objects owners have access to the S3
resources
Access Control Mechanism in S3
You can use Access Control Lists (ACLs) or Bucket Policies to grant permissions to
users/groups
Secure Upload/Download in S3
You can securely upload/download objects in S3 via SSL endpoints using HTTPS
ServerSide Encryption for additional security at rest
For additional security, you can use SSE (ServerSide Encryption) or SSE with
customer-provided keys (SSE-C) to encrypt data at rest
Amazon S3 Access Control
Four mechanisms for controlling access:
1. IAM Policies: They can be used to grant IAM users fine-grained control to S3
bucket or objects
2. Bucket Policies: Defining rules which apply broadly across all requests to S3.
Access can be restricted to an aspect of the request, such as HTTP referrer and IP
address.
3. Access Control Lists: Can be used to grant specific permissions (READ, WRITE
etc) to specific users for an individual bucket or object.
4. Query String Authentication: Customers can create a URL to an S3 object which is
only valid for a limited time.
Does S3 support Data Access Auditing
Yes, S3 buckets can be configured to create access log records for all requests
made against it. These log records contain details about the request, such as
request type, resource specified in the request, date & time.
SSE Encryption Key Protection
With SSE, every protected object is encrypted with a unique key. This unique object
key is itself encrypted by a separate master key. A new master key is issued at least
monthly. Encrypted data, keys and master keys are stored and secured on separate
hosts for multiple layer of protection
VPC endpoint for S3
An Amazon VPC Endpoint for Amazon S3 is a logical entity within a VPC that allows
connectivity only to S3. The VPC Endpoint routes requests to S3 and routes
responses back to the VPC
Allowing ONLY specific VPC endpoints access to an S3 bucket
Access to a bucket can be limited to a specific VPC Endpoint or a set of endpoints
using S3 bucket policies. S3 bucket policies now support a condition,
aws:sourceVpce, that you can use to restrict access.
Checksums which S3 uses for checking Data Corruption
S3 uses a combination of Content-MD5 checksums and cyclic redundancy checks
(CRC) to detect data corruption. S3 performs these checksums on data at rest and
repairs the corruption using redundant data It also calculates the checksum on all
network traffic to detect corruption of data packets when storing or retrieving data.
Advantages of Versioning in S3
When a user performs DELETE operation on an object, subsequent simple (un-
versioned) requests will no longer retrieve the object. However, all versions of the
object are retained in the S3 bucket. Only the owner of the S3 bucket can
permanently delete a version.
Setting up a rollback window, recycle bin on Amazon S3 to recover from deletes and
overwrites.
By using Versioning with Lifecycle rules, you can create rollback window
MFA for additional protection of versions of an S3 object
By enabling versioning with MFA Delete on S3 bucket, two forms of authentication is
required to permanently delete the version of an object.
Minimum duration for S3-IA
Data deleted within 30 days will be charged for a full 30-day period
Minimum object size for S3-IA
Minimum object size is 128 KB. It means even a smaller object which is less than
128 KB will be charged as if the object was 128 KB.
Types of Data Retrieval for Glacier
There are three ways to retrieve data:
1. Expedited Retrieval (Most expensive) (Retrieval Time: 1-5 minutes)
2. Standard Retrieval (Retrieval Time: 3-5 hours)
3. Bulk Retrieval (Least expensive) (Retrieval Time: 5-12 hours)
Where does the archived object reside?
When an archived object is retrieved, it resides in both RRS and Glacier.
S3 Event Notifications
S3 event notifications can be sent in response to actions in S3 like PUT, POST,
COPY, DELETE. Notification can be sent through SNS, SQS or directly to Lambda.
Static website redirects in S3
S3 provides multiple ways to enable redirection of web content for static websites.
You can achieve this by setting rules on your bucket to enable automatic redirection.
You can also configure a redirect on an individual S3 object
Any additional charge for hosting static websites on S3?
No additional charge for hosting static websites on S3. Same charges of storage,
requests, and data transfer apply to your website objects.
Applying Object Tags to S3 Objects
You can add up to 10 object tags for an S3 object using either the Console, REST
API, CLI or the SDK.
Updating Object Tags on S3 Objects
Can be updated through Console, CLI, REST API or SDK. Other than the AWS
console, for all other mediums, you have to specify the full tag set while updating.
Replication of Object Tags in Cross-Region Replication
Object tags can be replicated using cross-region replication. If cross-region
replication is already enabled, new permissions are required in order for tags to
replicate.
S3 Storage Class Analytics
Storage Class Analytics can be used to analyze storage access patterns which can
then be used to transition data to appropriate storage class. You can setup a storage
class analysis policy to monitor an S3 bucket, prefix or object tag.
Configuring Storage Class Analysis Policy
Storage Class Analysis Policy can be configured either using Management Console
or through S3 PUT Bucket Analytics API
S3 Inventory
S3 Inventory is a scheduled alternative to synchronous LIST API call which simplifies
& speeds up business workflows and big data applications
Setup S3 Inventory
Using S3 PUT Bucket inventory API call, you can configure a daily or weekly
inventory for all objects in a bucket or shared prefix. You can specify a destination
S3 bucket for your inventory, the output file output format (CSV), and specific object
metadata necessary for your business application, such as: object name, size, last
modified date, storage class, version id, delete marker, noncurrent version flag,
multipart upload flag, or replication status.
S3 Inventory Use-case
S3 inventory can be used as ready-made input to big data job or workflow
application instead of the synchronous S3 LIST API, saving the time and compute
resources it takes to call and process the LIST API response.
Setup CloudWatch metrics for S3
You can use console or S3 PUT Bucket Metrics API to enable or configure filters for
metrics. Metrics are available 15 minutes after enabling.
Enabling CRR (Cross-Region Replication)
CRR is configured at bucket level. You must first turn on Versioning on both source
and destination buckets.
CRR Replications to Target bucket
CRR will also replicate meta-data and acls of the objects in source bucket. Also,
changes to meta-data , acl will also be replicated to target bucket. CRR will not
replicate the existing objects in source bucket when it is enabled.
CRR with Life Cycle rules
You can setup SEPARATE Lifecycle rules for both source and target buckets.
Does transfer acceleration support multipart uploads?
Yes. It supports multipart uploads
Is Transfer Acceleration HIPAA compliant?
Yes
Minimum Duration for Glacier
Objects that are archived to Glacier have a minimum of 90 days of storage, and
objects deleted before 90 days will be charged for a full 90-day period
or
https://<bucket name>.s3-<Region>.amazonaws.com/
Example:
https://s3-us-west-2.amazonaws.com/seramrincodi1
What is the structure (format) of the address of the static website in an S3 bucket?
http://<bucket name>.s3-website-<Region>.amazonaws.com
Example:
mys3website-mrincodi.s3-website-us-west-2.amazonaws.com
https://goo.gl/B1PYiV
https://goo.gl/hJRK7S
What are you charged for when using S3?
Storage, requests, Storage Management (tags), data transfer and transfer
acceleration.
Regarding data transfer: Putting data inside S3 is free, but moving data inside S3
(like when you do replication from one region to another) is charged.
(from https://goo.gl/a7MaoD):
Amazon S3 offers three pricing options. Storage (per GB per month), data transfer in
or out (per GB per month), and requests (per x thousand requests per month).
Can you remove versioning after you have activated it to a bucket?
No, but you can suspend it (disable it).
Do I need to give permissions again to a file if I re-upload it?
Only if versioning is active for the bucket where the file is put.
In lifecycle management, what are the storage classes involved?
S3, Infrequent Access, Glacier and Deletion.
Can I give lifecycle management to a specific file?
No. You can give it to the whole bucket or to subfolders of it.
In lifecycle management, what's the minimum amount of days that an object can be
in S3 before being moved to infrequent access, and what is the minimum size for
moving it?
A minimum of 30 days is required before transitioning to the Standard - Infrequent
Access Storage Class
Lifecycle policy will not transition objects that are less than 128KB to the Standard -
Infrequent Access Storage Class.
In lifecycle management, what's the minimum amount of days that an object can be
in S3 - Infrequent Access Storage Class before transitioning to the Glacier Storage
Class?
30 days.
In lifecycle management, what's the minimum amount of days that an object can be
in S3 before being moved to Glacier?
If it has not been in Standard - Infrequent Access, any amount of days is OK.
If it has not been in Standard - Infrequent Access: an object must remain in the
Standard - Infrequent Access Storage Class for a minimum of 30 days before
transitioning to the Glacier Storage Class, so 30 days in S3 + 30 days in Infrequent
Access = 60 days.
In lifecycle management, what's the minimum amount of days that an object can be
in S3 before being automatically deleted?
Any amount, as long as it is more than the amount of days it will stay in Glacier (if
active) and/or Standard - Infrequent Access Storage (if active).
What's an Edge Location?
The end points for CloudFront, the CDN of AWS. This is where the content will be
cached.
What kind of AWS resources can be Origin Servers for CloudFront?
The origin of all the file can be:
S3 Bucket, EC2 instance, ELB or Route53.
You can also have your custom origin server outside of AWS
What is a distribution in CloudFront?
A set of Edge Locations. You create a distribution to use CloudFront.
What types of distributions can be created in CloudFront?
Web distributions for HTTP/HTTPS and RTMP Distributions for RTMP (media
streaming) and its variants.
Are edge locations read-only?
No, you can write to them.
For how long are objects cached in an Edge Location?
For the TTL of the objects there.
What's the default TTL of an object in an edge location in CloudFront?
24 hours.
Can I clear cached objects in my edge locations?
Yes, but you will be charged.
What is Transfer Acceleration?
A technology that let users upload or download files in S3 buckets, by using Edge
locations from CloudFront. It gives you a new URL location (Endpoint).
Can I use transfer acceleration for a subfolder only?
No. You can use it for the whole bucket only.
What is the URL format of a bucket using transfer acceleration?
<bucket name>.s3-accelerate.amazonaws.com
https://<bucket name>.s3-<region>.amazonaws.com
What is the format of the URL given by using Transfer Acceleration in an S3 bucket?
<Bucket name>.s3-accelerate.amazonaws.com
Example:
my-seoul-cloudfront-dist.s3-accelerate.amazonaws.com
What does CORS mean?
Cross-Origin Resource Sharing
What is CORS for?
Using CORS (Cross-Origin Resource Sharing) you can selectively allow web
applications running on other domains to access content in your Amazon S3 bucket.
Where do I change the CORS configuration for an S3 bucket?
In Properties -> Permissions -> Edit CORS configuration. Enter the URL of the
website that can access the ST resources, inside the AllowedOrigin XML tags.
Example:
<AllowedOrigin>http://myindexwebsitemrincodi.s3-website-us-west-
2.amazonaws.com</AllowedOrigin>
How do I upload another page from a different S3 resource, in HTML? (Ignore)
<script>
$("#get-html-from-other-s3").load("http://mycorstestbucketmrincodi.s3-website-us-
west-2.amazonaws.com/loadpage.html")
</script>
What does Cross-Region Replication do?
It replicates every future upload of every object to another bucket in another region.
What does Cross-Region Replication needs in order to be available?
That versioning is active in both the source and the destination buckets.
How can you setup access control to your buckets?
By using bucket policies (bucket level) or Access Control Lists (object level).
What are the types of encryption available in S3?
In transition: SSL/TSL
At rest:
- Server side encryption.
- S3 managed keys (SSE-S3)
- AWS Key-management service (SSE-KMS)
- Server-side encryption with customer-provided keys (SSE-C)
- Client-side encryption.
Types of storage gateway? (Ignore)
Should not be in the exam, but see slideshow 37 (S3 tips), after the 13th minute.
Types of snowball (Ignore)?
Should not be in the exam, but see slideshow 37 (S3 tips), after the 14:30 mark.
What is multipart upload for S3?
Multipart upload allows you to upload a single object as a set of parts. It makes
uploading to S3 much faster.
If you encrypt a bucket on S3 what encryption does AWS use?
Advanced Encryption Standard (AES) 256
What is the largest size file you can transfer to S3 using a PUT operation?
5Gb. After that you must use a multipart upload.
What can I do If I want to enable a user to download my private data directly from
S3?
If you want to enable a user to download your private data directly from S3, you can
insert a pre-signed URL into a web page before giving it to your user. (? See Exam
4: S3 quiz).
AWS command to create a new S3 bucket.
aws s3 mb s3://newbucket
What is another name for AWS Snowball?
AWS Import/export
You can also modify/add a file to the bucket, which will begin the process. This does
not work when it comes to deleting, for security purposes as you wouldn't want
someone who has access to the other bucket to be able to be deleting objects off of
your bucket.
What are delete markers in S3 Versioning?
When deleting a file, a file isn't truly deleted, it's just marked as delete but can still be
recovered via versioning. If you delete the delete marker, you undo the deletion, but
if you delete all versions of the file, you will completely delete the file.
AWS CLI
AWS Command Line Interface
What does AWS CLI require in order for you to start using it?
AWS Access and Secret Access Key
Does Lifecycle Management require versioning to function?
No it's optional
What are the minimum life-cycle transition rules for IA, OZIA, and Glacier?
IA - At least 30 days after creation
OZIA - At least 30 days after creation or last transition
Glacier - At least 30 days after last transition, or 1 day after creation
Expiration time for a current version of a file is:
the summation of all transition times +1.
CDN
Content Delivery Network
Edge Location
Location where content is cached, separate to AWS region/AZ
Origin
Origin of all files
Distribution
Name given to CDN which consists of collection of edge locations
Once you setup a brand new Cloudfront property for a brand new site, will
performance be improved for all users?
Everyone except the 1 first that hits the site first, because initially the edge servers
will not contain any cached content.
Web Distribution vs RTMP
Web Distribution is for websites while RTMP is for media streaming
T/F - Edge locations can be written to
True, Edge locations can be read/written from. If written to, then the file will be
replicated back to origin
TTL
Time to live
Origin Access Identity
Creates a new user that can access the bucket directly. Should be used if you
decide to restrict access to bucket so users can't bypass cloudfront and hit bucket
directly
Why is manual cache clearing not always advised?
You get charged for it
Is it possible to restrict who sees your side via cloudfront? Via countries
Yes to both - pre-signed cookies/Geo restrictions
What is invalidation in Cloudfront?
Clearing cache on edge server
T/F - By default, all newly created buckets are private
True
Bucket Polices
Applies to bucket entirely
Access Control Lists
More specific control rules within the bucket
What are 2 ways you can security your S3 bucket?
Bucket policies and ACL
T/F - S3 Buckets supports logging
True
4 Methods of Encryption for S3
- In Transit
- At Rest
Encryption In Transit
SSL/TLS - TLS is the replacement to SSL
Storage Gateway
Connects on-premise DC to replicate to S3 or Glacier. Download software as VM
image.
Encryption At Rest
Server Side Encryption - S3
- Managed Keys (SS3-S3) [Each object encrypted with unique key. Encrypts key w/
master key, master key commonly rotated. AES 256 bit encryption]
- AWS Key Management Service (Managed Keys - SSE-KMS) [Similar to SS3-S3.
Uses envelope key which protects data encryption key from unauthorized access.
Provides audit trail of when key was used and by who. Can create/manage key.
Cheaper than SS3-S3.
- Server Side Encryption w/ Customer Provided Keys (SSE-C). User manages
encryption keys. AWS manages encryption; encrypts as it writes to disk, and
decrypts as it reads.
https://s3-acloudguru1234.amazonaws.com/
https://s3.acloudguru1234.amazonaws.com/eu-west-1
https://s3-eu-west-1.amazonaws.com/acloudguru1234
https://s3-us-east-1.amazonaws.com/acloudguru1234
3
The difference between S3 and EBS is that EBS is object based where as S3 is
block based.
true
false
False
What does S3 stand for?
1KB
1MB
0 bytes
1 byte
3
S3 has eventual consistency for which HTTP Methods?
S3 - RRS
Glacier
S3
S3 - IA
2
One of your users is trying to upload a 7.5GB file to S3. However, they keep getting
the following error message: "Your proposed upload exceeds the maximum allowed
object size.". What solution to this problem does AWS recommend?
Log in to the S3 console, click on the bucket and then click properties. You can then
increase your maximum object size to 1TB.
Design your application to use the Multipart Upload API for all objects.
Design your application to use large object upload API for this object.
10
20
100
50
3
What is the availability on S3 Standard?
100%
99.99%
99.90%
99%
2
You work for a busy digital marketing company who currently store their data on
premise. They are looking to migrate to AWS S3 and to store their data in buckets.
Each bucket will be named after their individual customers, followed by a random
series of letters and numbers. Once written to S3 the data is rarely changed, as it
has already been sent to the end customer for them to use as they see fit. However
on some occasions, customers may need certain files updated quickly, and this may
be for work that has been done months or even years ago. You would need to be
able to access this data immediately to make changes in that case, but you must
also keep your storage costs extremely low. The data is not easily reproducible if
lost. Which S3 storage class should you choose to minimise costs and to maximize
retrieval times?
S3 - IA
Glacier
S3 - RRS
S3
1
You work for a major news network in Europe. They have just released a new mobile
app that allows users to post their photos of newsworthy events in real time. Your
organization expects this app to grow very quickly, essentially doubling its user base
each month. The app uses S3 to store the images, and you are expecting sudden
and sizable increases in traffic to S3 when a major news event takes place (as users
will be uploading large amounts of content.) You need to keep your storage costs to
a minimum, and it does not matter if some objects are lost. With these factors in
mind, which storage media should you use to keep costs as low as possible?
Glacier
S3 - Provisioned IOPS
100%
99.99%
99.90%
99%
2
What is AWS Storage Gateway?
It allows large scale import/exports in to the AWS cloud without the use of an internet
connection.
3
You run a meme creation website that stores the original images in S3 and each
meme's meta data in DynamoDB. You need to decide upon a low-cost storage
option for the memes, themselves. If a meme object is lost, a Lambda function will
automatically recreate it using the original file from S3 and the metadata from
DynamoDB. Which storage solution should you use to store the non-critical, easily
reproducible memes in the most cost effective way?
S3
S3 - RRS
S3 - 1Zone-IA
Glacier
S3 - IA
3
You need to use an Object based storage solution to store your critical, non
replaceable data in a cost effective way. This data will be frequently updated and will
need some form of version control enabled on it. Which S3 storage solution should
you use?
S3
Glacier
S3 - RRS
S3 - IA
1
S3 has what consistency model for PUTS of new objects
Eventual Consistency
Use security groups to blacklist the IP addresses of the sites that link directly to your
S3 bucket.
Remove the ability for images to be served publicly to the site and then use signed
URLs with expiry dates.
4
What is the durability on RRS?
100%
99.99%
99.90%
99%
2
You are a solutions architect who works with a large digital media company. The
company has decided that they want to operate within the Japanese region and they
need a bucket called "testbucket" set up immediately to test their web application on.
You log in to the AWS console and try to create this bucket in the Japanese region
however you are told that the bucket name is already taken. What should you do to
resolve this?
Raise a ticket with AWS and ask them to release the name "testbucket" to you.
Change your region to Korea and then create the bucket "testbucket".
Bucketnames are global, not regional. This is a popular bucket name and is already
taken. You should choose another bucket name.
Run a WHO IS request on the bucket name and get the registered owners email
address. Contact the owner and ask if you can purchase the rights to the bucket.
3
Which of the following options allows users to have secure access to private files
located in S3? (Choose 3)
Public S3 buckets
Refine the key hashing to randomise the name Key to achieve the potential of 300
PUTs per second.
Change the trigger level to around 3000 as S3 can now accommodate much higher
PUT and GET levels.
Set up multiple accounts so that the per account hard limit on S3 buckets is avoided.
3
Regional Edge Cache
Used by Cloudfront to offload your origin by caching content that has been ejected
from an Edge location. Ejected when content is not really being accessed anymore
by users. If users begin accessing the content again, instead of having the edge
server reach the origin again, it can retrieve it from the regional cache for quicker
times.
S3 Object Lock
Write once, read many [WORM]. Can set a certain amount of time, or indefinitely, for
a file to not be deletable.
Cloudfront's TTL for a certain object is set to 0 seconds for a certain object, however,
when the user requests the object, it turns out the TTL is actually a day. Why?
Cloudfront TTL rule is overridden if object has cache control header
OAI
Origin Access Identity. Used on bucket policy.
Signed URL/Cookie
Specify expiration, IP, and trust signers (CloudFront Key Pairs)
http://docs.aws.amazon.com/AmazonCloudFront/latest/DeveloperGuide/private-
content-restricting-access-to-s3.html
True or False. When a bucket is created it is private by default.
True.
How can you control access to a bucket.
Using bucket policies and ACLs.
True or False. There is no way to track who accesses a bucket.
False. You can configure buckets to store logs w/in the bucket or in another bucket.
What are the 3 methods of encryption?
1. SSL/TLS (in transit),
2. Server Side Encryption (SSE),
3. Client Side Encryption (CSE)
What is SSE-S3?
Server Side Encryption that is fully Amazon managed. S3 encrypts each object with
a unique key and it encrypts the key itself with a master key that it regularly rotates.
Uses 256-bit Advanced Encryption Standard (AES-256).
What is SSE-KMS?
Server Side Encryption Key Management Services. Combines secure, highly
available hardware and software to provide a key management system scaled for the
cloud. AWS KMS uses customer master keys (CMKs) to encrypt your S3 objects.
Provides audit trail.
What is SSE-C?
Server Side Encryption with customer provided keys. You manage the keys, AWS
manages encryption/decryption when you write or read.
What is Client Side Encryption?
Refers to encrypting data before sending it to S3. Two options for using data
encryption keys:
S3
Provides secure, durable, highly scalable object storage
- object based, not operating system
- data is spread across multiple devices and facilities
- read after write consistency for PUTS of new objects
- universal namespace therefore must be unique
Consistency
Immediate read after write for PUTS however overwrites or DELETES will take some
time to propagate
Key value storage
Key (name of object)
Value (data)
Version ID
Metadata
Basic Characteristics
99.99% availability
11X9s durability
Lifecycle management
Versioning
Encryption
Secure data using access control lists and bucket policies
Storage Tiers
- S3 - designed to sustain the loss of two facilities concurrently
- IA - infrequently accessed, requires rapid access when needed
- Reduced redundancy storage - way to store objects that is okay being lost
- Glacier - cheap but archival
Charges
Storage, requests, data transfer pricing
Upgrade to remove adverts
Only ₹83.25/month
Versioning
Allows you to retrieve and preserve every version, cannot be disable once can only
be suspended
- only way to get rid of is to delete the bucket
Cross Region Replication
Files will be replicated to another region
- needs versioning enabled on both source and destination region buckets
Lifecycle rule
helps manage cost by applying rules to buckets and moving to different storage
- permanently delete after a certain time
- object has to wait a min of 30 dats to move to IA storage however waits only one to
move to glacier
EX: social media site
Cloud Front
content delivery network - system of distributed servers that deliver webpages and
other web content to a user based on geographic locations of the user, origin of the
web page, and a content delivery server
Edge location
Location where content can be cached
- separate to an AWS region
- Objects are cached for the TTL (time to live)
- Clear cached objects but will be charged
Distribution
name given to the CDN which consists of a collection of edge locations
Cloud Front Characteristics
You can have multiple origins to a distribution
- can restrict bucket access
- default root object
- HTTP methods - get, head, options, put, post, patch, delete
-- Selecting this would mean that when users upload file to cloud distribution it goes
to edge before server
TTL
time to live
- default is 24 hours
- Always in seconds
S3 encryption
By default all newly created buckets are private
- Encryption:
- In transit: when information is sent to and from buckets
- At rest - service side encryption
-- S3 managed keys - each object is encrypted with a unique key
-- SSE KMS - key management service - uses an envelope key that protects the
data encryption key and provides an audit trail
- client side
Storage Gateway
service that connects on premise software appliance with cloud based storage to
provide seamless integration between on premise IT environment
Gateway stored volumes
Keep entire data set on site, storage gateway backs up to Amazon S3
- stored locally
Gateway cached Volumes
only most frequently accessed data is stored locally
Gateway Virtual tape library
- each virtual tape can be stored in a virtual tape library backed by AWS
Import / export Disk
accelerates moving large amount of data into and out of the AWS cloud using
portable storage devices for transport
Import / export Snowball
petabyte scale data transport solution that uses secure appliances to transfer large
amounts of data into and out of AWS
- multiple layers of security designed to protect your data
- always use instead of disk
Transfer acceleration
utilized the cloud front edge network to accelerate uploads to S3 - instead of using a
bucket you can use a distinct url to upload directly to edge location