Module 3: AWS Database Services
• AWS Lambda
• Amazon DynamoDB
• Amazon ECS (Elastic Container Service) & Amazon S3 Glacier
• Amazon Kinesis, Amazon Redshift
• Amazon EMR (Elastic MapReduce), AWS Disaster Recovery and Backup
Amazon EMR (Elastic MapReduce)
• Amazon EMR (previously called Amazon Elastic MapReduce) is a managed cluster platform that simplifies running big data frameworks, such as Apache Hadoop and Apache Spark, on AWS to process and analyze vast amounts of data.
• Using these frameworks and related open-source projects,
you can process data for analytics purposes and business
intelligence workloads.
• Amazon EMR also lets you transform and move large
amounts of data into and out of other AWS data stores and
databases, such as Amazon Simple Storage Service
(Amazon S3) and Amazon DynamoDB.
Amazon EMR Use Cases
• Machine learning. EMR's built-in ML tools use the Hadoop framework to create a variety of
algorithms to support decision-making, including decision trees, random forests, support-vector
machines and logistic regression.
• Extract, transform and load. ETL is the process of moving data from one or more data stores to
another. Data transformations -- such as sorting, aggregating and joining -- can be done using EMR.
• Clickstream analysis. Clickstream data from Amazon S3 can be analyzed with Apache Spark and Apache Hive. Apache Spark is an open-source data processing tool that can help make data easy to manage and analyze. Spark uses a framework that enables jobs to run across large clusters of computers and can process data in parallel (a minimal PySpark sketch follows this list).
• Real-time streaming. Users can analyze events using streaming data sources in real time with
Apache Spark Streaming and Apache Flink. This enables streaming data pipelines to be created on
EMR.
• Interactive analytics. EMR Notebooks provide a managed, secure, scalable, and reliable environment for data analytics. Using Jupyter Notebook -- an open-source web application data scientists can use to create and share live code and equations -- data can be prepared and visualized for interactive analytics.
• Genomics. Organizations can use EMR to process genomic data, making data processing and analysis scalable for industries such as medicine and the life sciences.
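The ETL and clickstream bullets above amount to a read-transform-write pipeline on EMR. The sketch below is a minimal PySpark illustration of that flow; the bucket names, paths, and column names (user_id, page, ts) are hypothetical placeholders, and on EMR the s3:// paths are handled by EMRFS.

```python
# Minimal PySpark sketch of the ETL/clickstream use case above. Bucket names,
# paths, and column names (user_id, page, ts) are hypothetical placeholders.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("clickstream-etl").getOrCreate()

# Extract: read raw clickstream events stored as JSON in Amazon S3.
clicks = spark.read.json("s3://example-raw-bucket/clickstream/2024/")

# Transform: aggregate page views per user per day (sorting/aggregating).
daily_views = (
    clicks
    .withColumn("day", F.to_date("ts"))
    .groupBy("user_id", "day", "page")
    .agg(F.count("*").alias("views"))
)

# Load: write the transformed data back to S3 in a columnar format.
daily_views.write.mode("overwrite").partitionBy("day").parquet(
    "s3://example-curated-bucket/clickstream_daily/"
)

spark.stop()
```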
Amazon EMR deployment options
• Amazon EMR on Amazon EC2. Amazon EMR can quickly process large amounts of data using Amazon EC2. Users can configure Amazon EMR to take advantage of On-Demand, Reserved, and Spot Instances (a cluster-launch sketch follows this list).
• Amazon EMR on Amazon Elastic Kubernetes Service (EKS). The
Amazon EMR console enables users to run Apache Spark applications
with other applications on the same EKS cluster. Organizations can
share compute and memory resources across all applications and use a
Kubernetes tool to monitor and manage the infrastructure.
• Amazon EMR on AWS Outposts. AWS Outposts enables
organizations to run EMR in their own data centers. This makes it easier
to set up, deploy, manage and scale EMR in on-premises environments.
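For the EMR-on-EC2 option, a cluster can also be launched programmatically with the AWS SDK. The sketch below uses boto3's run_job_flow; the cluster name, release label, instance types, Spot usage, and S3 URIs are illustrative assumptions, and the default EMR IAM roles are assumed to already exist in the account.

```python
# Minimal boto3 sketch for the "Amazon EMR on Amazon EC2" option above.
# Cluster name, release label, instance types, and S3 URIs are assumptions.
import boto3

emr = boto3.client("emr", region_name="us-east-1")

response = emr.run_job_flow(
    Name="example-spark-cluster",
    ReleaseLabel="emr-6.15.0",              # pick a current EMR release
    Applications=[{"Name": "Spark"}, {"Name": "Hive"}],
    Instances={
        "InstanceGroups": [
            {"InstanceRole": "MASTER", "InstanceType": "m5.xlarge", "InstanceCount": 1},
            {"InstanceRole": "CORE", "InstanceType": "m5.xlarge", "InstanceCount": 2,
             "Market": "SPOT"},             # Spot Instances for the core nodes
        ],
        "KeepJobFlowAliveWhenNoSteps": False,
        "TerminationProtected": False,
    },
    Steps=[{
        "Name": "clickstream-etl",
        "ActionOnFailure": "TERMINATE_CLUSTER",
        "HadoopJarStep": {
            "Jar": "command-runner.jar",
            "Args": ["spark-submit", "s3://example-bucket/jobs/clickstream_etl.py"],
        },
    }],
    LogUri="s3://example-bucket/emr-logs/",
    JobFlowRole="EMR_EC2_DefaultRole",      # default EC2 instance profile
    ServiceRole="EMR_DefaultRole",          # default EMR service role
)

print("Started cluster:", response["JobFlowId"])
```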
Advantages of Amazon EMR
1. Scalability: EMR allows users to easily scale up or down the number of instances in a cluster to handle varying amounts of data processing and analysis tasks.
2. Cost Effectiveness: EMR allows users to pay for the resources they need, when they need them, making it a cost-effective solution for big data processing.
3. Integration With Other AWS Services: EMR can be easily integrated with other AWS services such as Amazon S3, Amazon DynamoDB, and Amazon Redshift for data storage and analysis.
4. Flexibility: EMR supports a wide range of open-source big data frameworks, including Hadoop, Spark, and Hive, giving users the flexibility to choose the tools that best fit their needs.
5. Easy To Use: EMR provides an easy-to-use web interface that allows users to launch and manage clusters, as well as monitor and troubleshoot performance issues.
Disadvantages of Amazon EMR
1. Limited Customization: EMR is pre-configured with popular big data frameworks such as Hadoop and Spark, so users may have limited options for customizing their cluster.
2. Latency: The latency of data processing tasks may increase as the size of the data set increases.
3. Cost: EMR can be expensive for users with large amounts of data or high-performance requirements, as costs are based on the number of instances and the amount of storage used.
4. Limited Control Over the Infrastructure: EMR is a managed service, which means that users have limited control over the underlying infrastructure. This can be a disadvantage for users who need more control over their big data environments.
5. Limited Support for Certain Framework Versions: Some open-source frameworks, or specific versions of them, are only available in particular EMR release labels, which may be a deal breaker for some organizations.
6. Limited Support for Certain Applications: EMR is not suitable for all types of applications; it mainly supports big data processing and analytics workloads.
AWS Disaster Recovery and Backup
Disaster recovery is the process of restoring and recovering an organization's critical systems, infrastructure, and data after a disruptive event. It aims to minimize downtime, recover lost data, and resume normal operations as quickly as possible.
AWS (Amazon Web Services) offers a full range of tools and services to assist organizations in establishing effective disaster recovery strategies. Services like Amazon S3 for secure and reliable object storage, Amazon EC2 for adaptable and scalable computing instances, and AWS Backup for automated backup and recovery procedures are all provided by AWS. AWS also offers options like cross-region replication, which lets companies duplicate data across AWS Regions.
AWS Services for Disaster Recovery
Amazon S3 (Simple Storage Service): S3 is well suited for safely storing backup data and enables quick retrieval during the recovery stage (a cross-region replication sketch follows this list).
Amazon EC2 (Elastic Compute Cloud): Organizations can build a failover
architecture by replicating their on-premises virtual machine (VM) environments.
High availability and seamless failover are made possible by EC2’s Auto Scaling
and Load Balancing features.
AWS Database Services: Database services from AWS include
Amazon RDS (Relational Database Service), Amazon DynamoDB, and Amazon
Aurora. The automated backups, point-in-time recovery, and cross-region
replication provided by these services ensure data durability and availability.
AWS Storage Gateway: AWS Storage Gateway acts as a bridge between on-
premises environments and AWS storage services. It allows for seamless
integration of existing infrastructure with AWS, enabling hybrid cloud
architectures for disaster recovery purposes.
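The overview above mentions cross-region replication as a way to duplicate data for recovery. The sketch below shows one way to enable S3 replication with boto3, assuming hypothetical bucket names and an existing IAM replication role; both buckets must already have versioning enabled.

```python
# Minimal sketch of enabling S3 cross-region replication for DR backups.
# Bucket names and the replication role ARN are hypothetical; both buckets
# must already exist with versioning enabled for replication to work.
import boto3

s3 = boto3.client("s3")

s3.put_bucket_replication(
    Bucket="example-dr-source-bucket",
    ReplicationConfiguration={
        "Role": "arn:aws:iam::123456789012:role/example-s3-replication-role",
        "Rules": [{
            "ID": "replicate-backups",
            "Priority": 1,
            "Status": "Enabled",
            "Filter": {"Prefix": "backups/"},
            "DeleteMarkerReplication": {"Status": "Disabled"},
            "Destination": {
                # Destination bucket lives in the recovery Region.
                "Bucket": "arn:aws:s3:::example-dr-destination-bucket",
            },
        }],
    },
)
```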
Steps to Set Up AWS Disaster Recovery
1. Define Recovery Objectives: establish the recovery time objective (RTO) and recovery point objective (RPO) for each workload.
2. Identify DR Architecture: choose a strategy such as backup and restore, pilot light, warm standby, or multi-site active-active.
3. Replicate Data and Applications: copy data and machine images to a recovery Region, for example with S3 cross-region replication or database replicas.
4. Set Up Automation: automate backup, failover, and recovery procedures, for example with AWS Backup plans (see the sketch after this list).
5. Establish Monitoring and Alerting: detect failures early with monitoring alarms and notifications.
6. Test and Validate: run regular DR drills to confirm that the recovery objectives can actually be met.
7. Document the DR Plan: record roles, runbooks, and escalation paths, and keep the documentation up to date.
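As a follow-up to step 4 (Set Up Automation), backups can be scheduled automatically with AWS Backup. The boto3 sketch below creates a daily backup plan and assigns resources by tag; the plan name, vault name, IAM role ARN, and tag key/value are hypothetical, and the vault and role are assumed to already exist.

```python
# Minimal sketch of automating backups with AWS Backup (step 4 above).
# Vault name, plan name, IAM role ARN, and the tag used for selection are
# hypothetical; the vault and role must already exist.
import boto3

backup = boto3.client("backup")

plan = backup.create_backup_plan(
    BackupPlan={
        "BackupPlanName": "example-daily-dr-plan",
        "Rules": [{
            "RuleName": "daily-backups",
            "TargetBackupVaultName": "example-backup-vault",
            "ScheduleExpression": "cron(0 5 * * ? *)",   # every day at 05:00 UTC
            "Lifecycle": {"DeleteAfterDays": 35},        # retain for 35 days
        }],
    },
)

# Assign resources to the plan by tag (e.g. every resource tagged dr=true).
backup.create_backup_selection(
    BackupPlanId=plan["BackupPlanId"],
    BackupSelection={
        "SelectionName": "tagged-resources",
        "IamRoleArn": "arn:aws:iam::123456789012:role/example-backup-role",
        "ListOfTags": [{
            "ConditionType": "STRINGEQUALS",
            "ConditionKey": "dr",
            "ConditionValue": "true",
        }],
    },
)
```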