0% found this document useful (0 votes)
82 views21 pages

AWS Big Data Crash Course

Download as pdf or txt
Download as pdf or txt
Download as pdf or txt
You are on page 1/ 21

Noah

Gift, UC Davis &


Northwestern Lecturer
AWS Big Data Crash Course (Cloud, ML,AI), Founder
@Pragmatic AI Labs, Author:
Pragmatic AI
• Part 1: AWS Machine Learning-Specialty (ML-S)
Certification Overview, Collection, and Storage
(90 min)
• QA (15 min)
• Break (15 min)
Day 1 • Part 2: Processing for Big Data on AWS (45 min)
Schedule • QA (10 min)
• Break (5 min)
• Part 3: Analysis for Big Data on AWS (45 min)
• QA (15 min)
Survey: Experience with AWS
• Novice (No experience)
• Beginner (< 1 Year)
• Intermediate (1-3 Years)
• Advanced (3+ Years)
Survey: Experience with Big Data
• Novice (No experience)
• Beginner (< 1 Year)
• Intermediate (1-3 Years)
• Advanced (3+ Years)
Part 1: AWS Machine
Learning-Specialty (ML-S)
Certification Overview,
Collection, and Storage
(90 min)
• Get an overview of the certification
• Use exam study resources
• Review the exam guide
• Learn the exam strategy
• Learn the best practices of Big Data on
AWS
• Learn the techniques to accelerate
hands-on practice
• Understand important Big Data related
services
• Determine the operational characteristics
of the collection system
• Determine and optimize the operational
characteristics of the storage solution
QA (15 min)
Break (15 min) QA & Break Part 1
Part 2: Processing for
Big Data on AWS (45
min)

• Identify the appropriate data


processing technology for a given
scenario
• Determine how to design and architect
the data processing solution
• Determine the operational
characteristics of the solution
implemented
• Understand Overview of AWS
Processing
• Understand Elastic MapReduce (EMR)
• Learn about Apache Hadoop - Intro
• Apply EMR - Architecture
QA (10 min)
Break (5 min) QA and Break Part 2
• Determine the tools and techniques required
for analysis
• Determine how to design and architect the
analytical solution
• Determine and optimize the operational
Part 3: Analysis characteristics of the Analysis
• Understand Redshift Overview
for Big Data on • Learn Redshift Design

AWS (45 min) • Use Redshift Data Ingestion


• Apply Redshift Operations
• Use AWS Elasticsearch - operational analytics
• Implement Machine Learning - Clustering &
Regression
• Use AWS Athena - interactive analytics
QA (10 min) QA and Day 1 Wrap up
Related Safari Properties
• Pragmatic AI (Book)

• Essential Machine Learning and AI (Video)

• AWS Certified Machine Learning-Specialty (Video)

• Essential Machine Learning and Pragmatic AI (Learning Path)


• Python for Data Science (Video)

• AWS Certified Big Data-Speciality ( Video)


• Part 4: Visualization & Data Security for Big Data on
AWS (90 min)
• QA (15 min)
• Break (15 min)
• Part 5: Case Studies Part 1(45 min)
Day 2 • QA (10 min)
Schedule • Break (5 min)
• Part 6: Case Studies Part 2 and Exam Sample
Questions Review (45 min)
• QA (15 min)
Survey: Experience with Visualization
• Novice (No experience)
• Beginner (< 1 Year)
• Intermediate (1-3 Years)
• Advanced (3+ Years)
Survey: Experience with Containers
• Novice (No experience)
• Beginner (< 1 Year)
• Intermediate (1-3 Years)
• Advanced (3+ Years)
• Determine the appropriate techniques for
delivering the results/output
• Determine how to design and create the
Visualization platform
• Determine and optimize the operational
characteristics of the Visualization system
• Understand AWS Visualization - Overview
• Use AWS Quicksight - dashboards &
visualizations
• Determine encryption requirements and/or
implementation technologies
• Choose the appropriate technology to
enforce data governance
• Identify how to ensure data integrity
• Evaluate regulatory requirements
• Implement AWS IAM
Part 4: Visualization & Data Security for Big Data on AWS • Implement EMR Security
Length • Implement Redshift Security
(90 min)
QA (15 min)
Break (15 min) QA & Break Part 1
• Understand Big Data for
Sagemaker
• Learn Sagemaker and EMR
Integration
• Learn Serverless Production
Big Data Application
Development

Part 2: Processing for Big Data on AWS (45 min)


QA (10 min)
Break (5 min) QA and Break Part 2
Implement Containerization for
Part 3: Case Big Data
Studies Part 2
and Exam
Sample Implement Spot Instances for
Questions Big Data Pipeline

Review (45
min)
Exam Review
QA (10 min) QA and Day 2 Wrap up
Related Safari Properties
• Pragmatic AI (Book)

• Essential Machine Learning and AI (Video)

• AWS Certified Machine Learning-Specialty (Video)

• Essential Machine Learning and Pragmatic AI (Learning Path)


• Python for Data Science (Video)

• AWS Certified Big Data-Speciality ( Video)

You might also like