0% found this document useful (0 votes)
2K views14 pages

Tech Leap-AWS-Data-Engineer-TeachLeap-School-Final PDF

Uploaded by

Partha Ghosh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
2K views14 pages

Tech Leap-AWS-Data-Engineer-TeachLeap-School-Final PDF

Uploaded by

Partha Ghosh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 14

TECHLEAP

AWS Data Engineer School


Course
Objective
• AWS Data Engineer School will help
you to gain overall understanding of
AWS Data related services that fit all
your data analytics needs. You will get
into the details of data movement, data
storage, data lakes, big data analytics,
log analytics, streaming analytics
which will enable you to build data
pipelines and enhance your knowledge
in AWS Data space.
• This school will help you to sharpen
your skills with wide variety of hands-
on labs.

Copyright © 2020 Accenture All rights reserved. 2


AWS Data Engineer School - Training Plan (25 hours)
AWS Data Engineer Learning
Journey
Mod- Mod- Mod- Mod-
1 2 3 4

Collecting Data Data Storage in Processing Data Data Analysis on Building Data
into AWS AWS with AWS AWS Pipeline with
Serverless
services on AWS
Self-paced Self-paced Self-paced Self-paced Self-paced
(ACoudGuru)
6 Hours (ACoudGuru) 6 Hours (ACoudGuru) 3 Hours (ACoudGuru) 5 Hours 5 Hours
(ACoudGuru)

Mod- Mod- Mod-


5 6 7

Building Data Pipeline Log Data Analysis AWS Security


with AWS ASSESSMENT
with Serverless Services
services on AWS OpenSearch
Assessment Platform
Self-paced (METTL/Pearson
Self-paced Self-paced
(ACoudGuru) 2 Hours 1 Hours
(ACoudGuru) 2 Hours (ACoudGuru) 1 Hours

Copyright © 2020 Accenture All rights reserved. 3


Module 1 : Collecting Streaming Data
Collecting Data
into AWS Introduction to Collecting Streaming Data
The Kinesis Family
Kinesis Data Stream (Part 1)
This course will introduce you to Kinesis Data Stream (Part 2)
collection of streaming data and Kinesis Data Fire Hose (Part 1)
persisted data into AWS. Now Kinesis Data Fire Hose (Part 2)
streaming data makes up mass Kinesis Videos Streams
amounts of data that is collected, and Kinesis Data Analytics (Part 1)
it's important to understand what it is Kinesis Data Analytics (Part 1)
and the different ways that we can Amazon Managed Service for Kafka (MSK)
collect it, process it, and store it within
AWS. Streaming data is fresh data
and it plays a big role in actionable
Getting Data into AWS
decisions that can be made with that
data.
Introduction to Data Collection and Getting Data into AWS
Direct Connect, Snowball Family
Database Migration Service
Data Pipeline
Lambda , API Gateway and CloudFront (Part 1)
Duration Platform Lambda , API Gateway and CloudFront (Part 2)
6 hours ACloudGuru Comparing our Options

Copyright © 2020 Accenture All rights reserved. 4


Module 2 : Data AWS Simple Storage Service
Storage in AWS
Introduction to S3
Getting Data into S3 (Part 1)
Getting Data into S3 - Boto 3 (Part 2)
S3 Multi Part Upload (Part 1)
S3 Multi Part Upload (Part 2)
This course will take you through S3
S3 Storage Classes
and databases in AWS. So what do
S3 Lifecycle Policies
we use databases for in the analytics
S2 Security and Encryption
process? Databases for the most part
are going to sit in our data
preparation area. They're going to be
a place to start aggregating data or a
source of data that we're going to
Databases in AWS
feed into our pipeline,

Introduction to Databases in AWS

Database Engine Types

Relational Database Service (RDS)

Neptune

Duration Platform Document DB


6 hours ACloudGuru
Serverless Options

Copyright © 2020 Accenture All rights reserved. 5


Module 3 :
Processing
Data with AWS Amazon Elastic Map Reduce

Introduction to Amazon EMR


This Course is going to cover Elastic
MapReduce or EMR. Now this Apache Hadoop and EMR Software Collection
service plays a huge role in data
analytics and data processing and big EMR Architecture
data frameworks. So you need to
have a good understanding on the EMR Operations - Transient and Long Running
various services that you can use, the
various scenarios. EMR Operations - Choosing an Instance Type

EMR Operations - Choosing the right number of Instances

EMR Operations - On-Demand and Spot Instances

EMR Operations - Monitoring and Resizing Clusters

EMR File Storage and Compression


Duration Platform
3 hours ACloudGuru

Copyright © 2020 Accenture All rights reserved. 6


Module 4 : Data AWS Redshift
Analysis on
AWS Introduction to Using Redshift

Reshift Architecture
This course will take you through
Redshift architecture. As you move Redshift in the AWS Service Ecosystem
through the lesson you are going to
learn about what a Redshift cluster is, Redshift UseCases
what a node is, what a Slice is, you
will look at the Redshift query process
Reshift Table Design
and how that works in the Redshift
architecture and then you'll quickly
Redshift Spectrum
summarize what we've talked about
in this lesson. When we look at a
Launching a Redshift Cluster
Redshift cluster.

Resizing a Redshift Cluster

Utilizing Vaccum and Deep Copy

Back and Restore

Duration Platform
5 hours ACloudGuru Monitoring

Copyright © 2020 Accenture All rights reserved. 7


Module 5 :
Building Data
Pipeline with AWS Glue. Athena and Quicksight
Serverless
services on AWS Introduction to AWS Glue , Athena and QuickSight

Glue Data Catalog (Part 1)


This course will take you through
AWS Glue, Athena, and QuickSight. Glue Data Catalog (Part 2)
These services combined can help us
solve a plethora of data analytics Glue Jobs (Part 1)
problems. Throughout this section
you'll cover each of these services, Glue Jobs Demo (Part 2)
how they fit together, and how you
Glue Jobs (Part 3)
can use them for our data analytics
pipelines. Job Bookmarks

Getting Started with Athena

Athena Demo

When to Use Athena

QuickSight Visualization and Dashboard


Duration Platform
2 hours ACloudGuru QuickSight Security and Authentication

Copyright © 2020 Accenture All rights reserved. 8


Module 6 : Log
Data Analysis OpenSearch

with AWS
OpenSearch Introduction to ElasticSearch

This course will introduce Using ElasticSearch


ElasticSearch service on AWS. So in
our data analytics steps for success,
Visualizing ElasticSearch Data
you are going to cover a whole lot
with this one service. you are in data
analysis, data interpretation, and
discovery. That's because of that
search and its associated tools cover
a lot of ground.

Duration Platform
2 hours ACloudGuru

Copyright © 2020 Accenture All rights reserved. 9


Module 7 : AWS
Security AWS Security Services

Services
Introduction to AWS Security Services
This course will take you through
AWS security services. In our data
analytics steps for success, security IAM
services don't really fit in any of our
steps, but we need to secure our KMS
data. Hence this course will help you
learn services like Identity and
Secrets Manager
Access Management or IAM, VPC
security features, the Key
Management Service, and Secrets VPC Network Security Features
Manager. All of these have some
integration or functionality that allows
us to secure the other services you
learnt about in this course.

Duration Platform
1 hours ACloudGuru

Copyright © 2020 Accenture All rights reserved. 10


Training Environment

ACloudGuru

Copyright © 2020 Accenture All rights reserved. 11


Tech Leap Stream Readiness Status

Activity Status
Course Contents Completed. The course contents have been
identified and reviewed.
Training Environment Access to ACloudGuru will be provided.
Quiz In-Progress
Final assessment In-Progress

Copyright © 2020 Accenture All rights reserved. 12


Q&A

Copyright © 2020 Accenture All rights reserved. 13


Thank You

Copyright © 2020 Accenture All rights reserved. 14

You might also like