Amazon Capstone Project

Uploaded by

famell qawiem


To meet the company's objectives for ingesting and transforming data into its data lake, as well as providing dashboards for visual representation, the following options can be investigated:

Data Ingestion via IoT Sensors:

Use Amazon Kinesis Data Streams or AWS IoT Core to capture real-time data from the IoT sensors.

Configure Amazon Kinesis Data Firehose to convert the data and load it into the data lake in near real time.

Create an AWS Glue crawler to catalogue the data and make it available for analysis.

Use Apache Hadoop ecosystem frameworks, such as Apache Spark on Amazon EMR, to process and
analyse the IoT data.
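As a minimal sketch of the first step, the snippet below shapes one sensor reading into the arguments for the Kinesis Data Streams PutRecord call. The stream name, the field names in the reading, and the helper names are assumptions made for illustration; none of them come from the project brief.

```python
import json

# Hypothetical stream name, used only for illustration.
STREAM_NAME = "iot-sensor-stream"


def build_put_record_args(reading, stream_name=STREAM_NAME):
    """Build keyword arguments for the Kinesis Data Streams PutRecord call.

    The device id is used as the partition key so that readings from the
    same sensor land on the same shard and keep their order.
    """
    return {
        "StreamName": stream_name,
        "Data": json.dumps(reading, sort_keys=True).encode("utf-8"),
        "PartitionKey": str(reading["device_id"]),
    }


def send_reading(kinesis_client, reading):
    """Send one reading using a client that exposes put_record()."""
    return kinesis_client.put_record(**build_put_record_args(reading))
```

In a real deployment the client would typically be created with `boto3.client("kinesis")`. Passing the client in as a parameter keeps the helper easy to exercise without AWS credentials.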

Database Data Ingestion:

Use AWS Database Migration Service (DMS) to migrate data from the on-premises database to
Amazon S3.

Use AWS Glue ETL jobs or Apache Spark on Amazon EMR to prepare the data for analysis.

Keep the transformed data in the data lake for subsequent use.
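A DMS migration is driven by a replication task whose table mappings select what to copy. The sketch below builds such a mapping document and the arguments for creating a full-load task. The schema, table names, task identifier, and ARNs are placeholders assumed for illustration, and the source and target endpoints are assumed to exist already.

```python
import json


def build_table_mappings(schema, tables):
    """Return DMS table-mapping JSON with one selection rule per table."""
    rules = []
    for index, table in enumerate(tables, start=1):
        rules.append({
            "rule-type": "selection",
            "rule-id": str(index),
            "rule-name": "include-" + table,
            "object-locator": {"schema-name": schema, "table-name": table},
            "rule-action": "include",
        })
    return json.dumps({"rules": rules})


def build_replication_task_args(task_id, source_arn, target_arn,
                                instance_arn, table_mappings):
    """Build keyword arguments for the DMS CreateReplicationTask call."""
    return {
        "ReplicationTaskIdentifier": task_id,
        "SourceEndpointArn": source_arn,
        "TargetEndpointArn": target_arn,  # endpoint that points at Amazon S3
        "ReplicationInstanceArn": instance_arn,
        "MigrationType": "full-load",     # one-off copy, no ongoing replication
        "TableMappings": table_mappings,
    }
```

With real ARNs, these arguments could be passed to `boto3.client("dms").create_replication_task(**args)`. Switching `MigrationType` to `full-load-and-cdc` would also replicate ongoing changes, if the source database supports it.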

Third-Party Data Ingestion:

Obtain additional data from third-party organisations through their APIs or data transfer methods.

Before placing the data in the data lake, use AWS Lambda or Amazon EC2 instances to process and enrich it.

Use AWS Glue or Apache Spark to perform transformations as needed.
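The enrichment step could look roughly like the following, whether it runs in a Lambda function or on an EC2 instance. It tags each third-party record with its source and ingestion time, then writes the batch to S3 as JSON Lines under a date-partitioned key. The key layout, the metadata field names, and the helper names are assumptions for illustration only.

```python
import json
from datetime import datetime, timezone


def enrich(record, source, ingested_at):
    """Return a copy of the record with provenance metadata added."""
    enriched = dict(record)
    enriched["_source"] = source
    enriched["_ingested_at"] = ingested_at.isoformat()
    return enriched


def object_key(source, ingested_at):
    """Build a date-partitioned S3 key (hypothetical layout)."""
    return (
        f"raw/third_party/source={source}/"
        f"dt={ingested_at:%Y-%m-%d}/{ingested_at:%H%M%S}.json"
    )


def store_batch(s3_client, bucket, source, records, ingested_at=None):
    """Enrich the records and write them to S3 as one JSON Lines object."""
    ingested_at = ingested_at or datetime.now(timezone.utc)
    lines = [
        json.dumps(enrich(record, source, ingested_at), sort_keys=True)
        for record in records
    ]
    key = object_key(source, ingested_at)
    body = ("\n".join(lines) + "\n").encode("utf-8")
    s3_client.put_object(Bucket=bucket, Key=key, Body=body)
    return key
```

The `s3_client` argument would normally be `boto3.client("s3")`. Partitioning the key by source and date is one common convention that lets a Glue crawler register the partitions, but other layouts work equally well.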

Cleaning and Transformation of Data:

Use AWS Glue ETL jobs or Apache Spark on Amazon EMR to clean, convert, and enrich the
imported data.

To reuse the team's existing Apache Hadoop skills, run Apache Spark on Amazon EMR, which
provides a comparable environment and capabilities.
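To make "clean, convert, and enrich" concrete, here is a sketch of typical cleaning rules written in plain Python rather than Spark, so it runs anywhere. In a Glue or EMR job the same rules would be expressed as DataFrame operations. The field names (`device_id`, `temperature_c`, `epoch_seconds`) are assumed for illustration.

```python
from datetime import datetime, timezone


def clean_record(raw):
    """Return a cleaned copy of one raw record, or None if it is unusable."""
    device_id = str(raw.get("device_id") or "").strip()
    if not device_id:
        return None  # drop records with no device identifier
    try:
        temperature = float(raw["temperature_c"])
        observed = datetime.fromtimestamp(int(raw["epoch_seconds"]), tz=timezone.utc)
    except (KeyError, TypeError, ValueError, OverflowError, OSError):
        return None  # drop records with missing or malformed values
    return {
        "device_id": device_id,
        "temperature_c": round(temperature, 2),
        "observed_at": observed.isoformat(),
    }


def clean_batch(rows):
    """Clean every row and drop duplicates of (device_id, observed_at)."""
    seen = set()
    cleaned = []
    for raw in rows:
        record = clean_record(raw)
        if record is None:
            continue
        key = (record["device_id"], record["observed_at"])
        if key in seen:
            continue
        seen.add(key)
        cleaned.append(record)
    return cleaned
```

The three rules shown (drop incomplete rows, convert types, remove duplicates) are only examples. The actual rules would depend on the quality of the incoming data.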

Dashboard Design:

Create dynamic dashboards and visualisations using Amazon QuickSight, a cloud-scale business
intelligence (BI) tool.

Connect QuickSight to the data lake and use the transformed data to build visualisations.

For insights and analysis, share the dashboards with the analytics team and other stakeholders.

By leveraging AWS services such as Amazon S3, Amazon Kinesis, AWS Glue, and Amazon EMR, the
organisation can ingest and transform data from numerous sources into its data lake using
technologies comparable to its present Apache Hadoop-based setup. Additionally, users can use
Amazon QuickSight to create interactive dashboards that visualise the data insights.
