Class Activity

This document outlines building a climate data analysis pipeline using AWS services including S3, Athena, and Glue to upload, query, transform, and analyze a climate dataset to generate insights on global temperature trends over time.

Uploaded by

samreen

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

53 views

Class Activity

Uploaded by

samreen

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 2

Activity Overview: Climate Data Analysis Pipeline

Objective: Build a data analysis pipeline that uploads, queries, and transforms a climate
dataset to generate insights on global temperature trends.
Duration: Approximately 3-4 hours
Prerequisites:
 Basic understanding of AWS services (S3, Athena, and Glue)
 AWS Account and AWS CLI installed
 Basic knowledge of SQL
Step 1: Setup and Data Preparation
1. Create an S3 Bucket: Follow Lab 7 to create an S3 bucket named climate-data-
bucket. Ensure the bucket has encryption enabled and public access blocked.
2. Upload Dataset: Obtain a climate dataset, preferably in CSV format, that contains
daily temperature records. The dataset should have columns for Date, Temperature,
City, and Country. Upload this dataset to climate-data-bucket.
Step 2: Data Querying with Athena
1. Query Data with Athena: Follow the guidance from Lab 8 to setup Athena. Use
Athena to create a database climate_analysis and a table temperature_records that
references the CSV file in your S3 bucket.
2. Perform Initial Analysis: Run SQL queries to answer the following:
 Average temperature per country.
 Top 10 hottest cities.
Step 3: Data Transformation with AWS Glue
1. Create a Glue Crawler: Using instructions from Lab 9, setup a Glue crawler to
populate the AWS Glue Data Catalog with the climate_analysis database schema.
This database should now appear in the AWS Glue Data Catalog.
2. Transform Data: Create an ETL job in AWS Glue that transforms the temperature
from Celsius to Fahrenheit (if applicable) and filters records to only include data from
the last decade.
3. Store Transformed Data: Save the transformed data back into climate-data-bucket
in a new folder named transformed.
Step 4: Advanced Analysis and Visualization
1. Advanced SQL Queries: Using Athena, perform more complex queries on the
transformed dataset to uncover insights, such as:
 Yearly temperature trends.
 Comparison of temperature changes by country.
2. Visualization (Optional): Utilize Amazon QuickSight or a tool of your choice to
visualize the query results, showcasing temperature trends over time.
Deliverables:
 A document outlining:
 The SQL queries used and their outputs.
 A brief analysis of the findings from the temperature data.
 (Optional) Visualizations of the temperature trends.
Reflection:
After completing the activity, reflect on how each AWS service contributed to the data
pipeline and how this approach can be scaled or modified for different datasets or analytical
needs.
This activity provides a hands-on experience with AWS services for handling data at scale,
from storage and querying to transformation, and leverages the power of the cloud to analyze
and visualize climate data trends.

AWS Certified Solutions Architect Study Guide with 900 Practice Test Questions: Associate (SAA-C03) Exam
From Everand
AWS Certified Solutions Architect Study Guide with 900 Practice Test Questions: Associate (SAA-C03) Exam
David Clinton
No ratings yet
AWS Certified Solutions Architect Study Guide: Associate SAA-C02 Exam
From Everand
AWS Certified Solutions Architect Study Guide: Associate SAA-C02 Exam
David Clinton
No ratings yet
MICROSOFT AZURE ADMINISTRATOR EXAM PREP(AZ-104) Part-3: AZ 104 EXAM STUDY GUIDE
From Everand
MICROSOFT AZURE ADMINISTRATOR EXAM PREP(AZ-104) Part-3: AZ 104 EXAM STUDY GUIDE
Devi Prasad
No ratings yet
AWS Certified Solutions Architect - Professional
From Everand
AWS Certified Solutions Architect - Professional
VB Dev
No ratings yet
EMR-Technical Specification Document Draft 0
No ratings yet
EMR-Technical Specification Document Draft 0
118 pages
Google Cloud Platform for Data Engineering: From Beginner to Data Engineer using Google Cloud Platform
From Everand
Google Cloud Platform for Data Engineering: From Beginner to Data Engineer using Google Cloud Platform
alasdair gilchrist
5/5 (1)
Administering Microsoft Azure SQL Solutions DP 300
From Everand
Administering Microsoft Azure SQL Solutions DP 300
Manish Soni
No ratings yet
AWS Certified Data Analytics Study Guide: Specialty (DAS-C01) Exam
From Everand
AWS Certified Data Analytics Study Guide: Specialty (DAS-C01) Exam
Asif Abbasi
No ratings yet
Aditya Technical Seminar
No ratings yet
Aditya Technical Seminar
10 pages
AWS Capstone Project
No ratings yet
AWS Capstone Project
4 pages
Knight's Microsoft SQL Server 2012 Integration Services 24-Hour Trainer
From Everand
Knight's Microsoft SQL Server 2012 Integration Services 24-Hour Trainer
Brian Knight
No ratings yet
AWS Cloud Practitioner Study Guide & Practice Tests
From Everand
AWS Cloud Practitioner Study Guide & Practice Tests
SUJAN
No ratings yet
Report Document
No ratings yet
Report Document
5 pages
60 Day Data Lake Plan v2
No ratings yet
60 Day Data Lake Plan v2
4 pages
Implementing Travel & Hospitality Data Mesh: AWS Reference Architecture
No ratings yet
Implementing Travel & Hospitality Data Mesh: AWS Reference Architecture
2 pages
Amazon Web Services: Migrating your .NET Enterprise Application
From Everand
Amazon Web Services: Migrating your .NET Enterprise Application
Rob Linton
No ratings yet
Fast Data Processing Systems with SMACK Stack
From Everand
Fast Data Processing Systems with SMACK Stack
Raúl Estrada
No ratings yet
AWS Data Analytics - Technical - Student
No ratings yet
AWS Data Analytics - Technical - Student
160 pages
AWS Solutions Architect Certification Case Based Practice Questions Latest Edition 2023
From Everand
AWS Solutions Architect Certification Case Based Practice Questions Latest Edition 2023
Exam OG
No ratings yet
Collect Process Analyze
No ratings yet
Collect Process Analyze
13 pages
AC52010
No ratings yet
AC52010
4 pages
Climate Data Management System Specifications: WMO-No. 1131
No ratings yet
Climate Data Management System Specifications: WMO-No. 1131
170 pages
AWS Certified Database Study Guide: Specialty (DBS-C01) Exam
From Everand
AWS Certified Database Study Guide: Specialty (DBS-C01) Exam
Matheus Arrais
No ratings yet
Oracle Essbase 9 Implementation Guide
From Everand
Oracle Essbase 9 Implementation Guide
Joseph Sydney Gomez
No ratings yet
Implementing Splunk: Big Data Reporting and Development for Operational Intelligence
From Everand
Implementing Splunk: Big Data Reporting and Development for Operational Intelligence
Vincent Bumgarner
4/5 (2)
AWS Certified Solutions Architect - Associate Exam Prep kit
From Everand
AWS Certified Solutions Architect - Associate Exam Prep kit
SUJAN
No ratings yet
Microsoft Dynamics GP 2010 Reporting
From Everand
Microsoft Dynamics GP 2010 Reporting
Christopher Liley
5/5 (2)
Real-time_Environmental_Data_Analysis_Synopsis
No ratings yet
Real-time_Environmental_Data_Analysis_Synopsis
2 pages
AWS Glue for Data Engineers: Serverless ETL Made Easy
From Everand
AWS Glue for Data Engineers: Serverless ETL Made Easy
Robert Johnson
No ratings yet
Building Modern Data Applications Using Databricks Lakehouse: Develop, optimize, and monitor data pipelines on Databricks
From Everand
Building Modern Data Applications Using Databricks Lakehouse: Develop, optimize, and monitor data pipelines on Databricks
Will Girten
No ratings yet
Google Cloud Data Engineer 100+ Practice Exam Questions With Well Explained Answers
From Everand
Google Cloud Data Engineer 100+ Practice Exam Questions With Well Explained Answers
vivian njoroge
No ratings yet
Hallo Microsoft Excel: Mastering Data Analytics
From Everand
Hallo Microsoft Excel: Mastering Data Analytics
Agus Kurniawan
No ratings yet
Data Engineering Nanodegree Program Syllabus
No ratings yet
Data Engineering Nanodegree Program Syllabus
16 pages
Webapplication Cloud
No ratings yet
Webapplication Cloud
21 pages
The Informed Company: How to Build Modern Agile Data Stacks that Drive Winning Insights
From Everand
The Informed Company: How to Build Modern Agile Data Stacks that Drive Winning Insights
Dave Fowler
No ratings yet
M-R 1
No ratings yet
M-R 1
12 pages
Real-Time Analytics: Techniques to Analyze and Visualize Streaming Data
From Everand
Real-Time Analytics: Techniques to Analyze and Visualize Streaming Data
Byron Ellis
No ratings yet
AWS SysOps Administrator Associate: From basic to advanced
From Everand
AWS SysOps Administrator Associate: From basic to advanced
Alex Carvalho
No ratings yet
Aws Data Service Notes
No ratings yet
Aws Data Service Notes
9 pages
AWS Cloud Practitioner Exam Success Kit
From Everand
AWS Cloud Practitioner Exam Success Kit
SUJAN
No ratings yet
Data Analytics in the AWS Cloud: Building a Data Platform for BI and Predictive Analytics on AWS
From Everand
Data Analytics in the AWS Cloud: Building a Data Platform for BI and Predictive Analytics on AWS
Joe Minichino
No ratings yet
Project Proposal CS661
No ratings yet
Project Proposal CS661
6 pages
MICROSOFT AZURE ADMINISTRATOR EXAM PREP(AZ-104) Part-4: AZ 104 EXAM STUDY GUIDE
From Everand
MICROSOFT AZURE ADMINISTRATOR EXAM PREP(AZ-104) Part-4: AZ 104 EXAM STUDY GUIDE
Devi Prasad
No ratings yet
Statistical Analysis of Climate Change Data
No ratings yet
Statistical Analysis of Climate Change Data
3 pages
AWS Certified Cloud Practitioner - Practice Paper 2: AWS Certified Cloud Practitioner, #2
From Everand
AWS Certified Cloud Practitioner - Practice Paper 2: AWS Certified Cloud Practitioner, #2
Tech Interviews
5/5 (2)
Thesis Access
No ratings yet
Thesis Access
195 pages
AWS Certified Solutions Architect Associate Exam Insights : Q&A with Explanations
From Everand
AWS Certified Solutions Architect Associate Exam Insights : Q&A with Explanations
SUJAN
No ratings yet
Amazon Capstone Project
No ratings yet
Amazon Capstone Project
2 pages
MC Microsoft Certified Azure Data Fundamentals Study Guide: Exam DP-900
From Everand
MC Microsoft Certified Azure Data Fundamentals Study Guide: Exam DP-900
Jake Switzer
No ratings yet
Climate_Change_Analysis_Project (1)
No ratings yet
Climate_Change_Analysis_Project (1)
9 pages
How to Hack Like a Ghost: Breaching the Cloud
From Everand
How to Hack Like a Ghost: Breaching the Cloud
Sparc Flow
No ratings yet
Microsoft Dynamics GP 2013 Reporting, Second Edition
From Everand
Microsoft Dynamics GP 2013 Reporting, Second Edition
David Duncan
5/5 (2)
Oracle SQL Developer 2.1
From Everand
Oracle SQL Developer 2.1
Sue Harper
No ratings yet
Architecture For Data Ingestion Clean Processing and Visulizationyounesse
No ratings yet
Architecture For Data Ingestion Clean Processing and Visulizationyounesse
2 pages
devops lead
No ratings yet
devops lead
10 pages
Microsoft SQL Azure Enterprise Application Development
From Everand
Microsoft SQL Azure Enterprise Application Development
Jayaram Krishnaswamy
No ratings yet
AWS Certified Advanced Networking - Specialty ANS-C01 Exam Preparation
From Everand
AWS Certified Advanced Networking - Specialty ANS-C01 Exam Preparation
Georgio Daccache
No ratings yet
Lab_ Performing ETL on a Dataset by Using AWS Glue
100% (1)
Lab_ Performing ETL on a Dataset by Using AWS Glue
26 pages
Research - IBM DataStage to AWS Glue Migration
No ratings yet
Research - IBM DataStage to AWS Glue Migration
7 pages
Bhardwaj_Eshta_2022_Masters
No ratings yet
Bhardwaj_Eshta_2022_Masters
206 pages
Mastering QlikView
From Everand
Mastering QlikView
Stephen Redmond
5/5 (1)
Data Capture Format - Report: The Unified District Information System For Education (UDISE+)
No ratings yet
Data Capture Format - Report: The Unified District Information System For Education (UDISE+)
13 pages
Main Characteristics:: A. Formal Invitations
No ratings yet
Main Characteristics:: A. Formal Invitations
25 pages
Chapter 11 and 12 Little Women Questions
No ratings yet
Chapter 11 and 12 Little Women Questions
7 pages
An Investigation of Synthetic Resins For Water Softening
No ratings yet
An Investigation of Synthetic Resins For Water Softening
1 page
Course - FinTech
No ratings yet
Course - FinTech
3 pages
Serie GJN DGBB Booklet 16434 - 1 EN PDF
No ratings yet
Serie GJN DGBB Booklet 16434 - 1 EN PDF
28 pages
Sai Group 18 Final Report
No ratings yet
Sai Group 18 Final Report
13 pages
Sai Sreelatha
No ratings yet
Sai Sreelatha
3 pages
SM10 E0 CA 1109 03 Relay Coordination
No ratings yet
SM10 E0 CA 1109 03 Relay Coordination
536 pages
JDN 1 23 ISR Web
No ratings yet
JDN 1 23 ISR Web
114 pages
Worksheet 8 Memorandum Algebraic Expressions Term 2
No ratings yet
Worksheet 8 Memorandum Algebraic Expressions Term 2
4 pages
(Ebook) Who's been sleeping in your head? : the secret world of sexual fantasies by Kahr, Brett ISBN 9780465037667, 9780465037674, 9782692893122, 0465037666, 0465037674, 2692893123 - Get the ebook in PDF format for a complete experience
No ratings yet
(Ebook) Who's been sleeping in your head? : the secret world of sexual fantasies by Kahr, Brett ISBN 9780465037667, 9780465037674, 9782692893122, 0465037666, 0465037674, 2692893123 - Get the ebook in PDF format for a complete experience
48 pages
Suctioning
No ratings yet
Suctioning
12 pages
"Foreign Aid and The Tyranny of Experts": Professor William Easterly
No ratings yet
"Foreign Aid and The Tyranny of Experts": Professor William Easterly
5 pages
Leadin G: Group 6 Concon, Alyssa Anne P. Al-Ghazali, Ahmed Qasem Omar Melbert Mabingnay
No ratings yet
Leadin G: Group 6 Concon, Alyssa Anne P. Al-Ghazali, Ahmed Qasem Omar Melbert Mabingnay
31 pages
Formato Brose 8-D-Problem Solving Schemexlsx
No ratings yet
Formato Brose 8-D-Problem Solving Schemexlsx
17 pages
GOST 60601-2-2019 Equipamentos de estimulação
No ratings yet
GOST 60601-2-2019 Equipamentos de estimulação
16 pages
Global Competencies
No ratings yet
Global Competencies
33 pages
CDG 117 Inter Standard Roaming White Paper Ver2.0
No ratings yet
CDG 117 Inter Standard Roaming White Paper Ver2.0
49 pages
B&K 1212 Instruction Manual
No ratings yet
B&K 1212 Instruction Manual
37 pages
CHP 1 C.S Number System and Conversion
No ratings yet
CHP 1 C.S Number System and Conversion
14 pages
Etr 560
No ratings yet
Etr 560
17 pages
Gravity Feeding Drip Rate Chart: Home Care Services
No ratings yet
Gravity Feeding Drip Rate Chart: Home Care Services
2 pages
Travelling in Time: Using Interpretative Phenomenological Analysis (IPA) To Examine Temporal Process in Personal Experience
No ratings yet
Travelling in Time: Using Interpretative Phenomenological Analysis (IPA) To Examine Temporal Process in Personal Experience
33 pages
Traditional_and_medicinal_uses_of_Morinda_lucida (1)
No ratings yet
Traditional_and_medicinal_uses_of_Morinda_lucida (1)
7 pages
1 - Ad-R Series - User Manual
No ratings yet
1 - Ad-R Series - User Manual
76 pages
Sept 25-26-2021 The Desert Sun
No ratings yet
Sept 25-26-2021 The Desert Sun
128 pages
Figure of Speech
0% (2)
Figure of Speech
16 pages
Pile Foundation
No ratings yet
Pile Foundation
44 pages

Class Activity

Uploaded by

Class Activity

Uploaded by

Activity Overview: Climate Data Analysis Pipeline

You might also like