0% found this document useful (0 votes)
206 views2 pages

CI7320 Data Analysis Assignment 2 2025-1

This assignment focuses on data analysis using punctuality statistics from selected UK airports, requiring the design of a data warehouse with a star schema and the use of Tableau for visualizations. Part A involves creating tables, discussing data preparation steps, and producing four visualizations with key findings. Part B offers two options: a reflective report on AWS Glue for automated batch data ingestion or completing a lab activity related to ETL using AWS Glue.

Uploaded by

meshan milinda
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
206 views2 pages

CI7320 Data Analysis Assignment 2 2025-1

This assignment focuses on data analysis using punctuality statistics from selected UK airports, requiring the design of a data warehouse with a star schema and the use of Tableau for visualizations. Part A involves creating tables, discussing data preparation steps, and producing four visualizations with key findings. Part B offers two options: a reflective report on AWS Glue for automated batch data ingestion or completing a lab activity related to ETL using AWS Glue.

Uploaded by

meshan milinda
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

CI7320 Beryl Jones

CI7320: Databases & Data Management – Assignment 2


Data Analysis

Assessment weight: 60%

Part A [75 marks]


The data set provided for this assignment contains punctuality statistics for selected UK airports. It
includes 24 files, each representing a month’s data, covering the years 2023 and 2024.
Airport IATA codes can be found at iata.org.

For this assignment, you will be reporting on how the data could be used for research using a data
warehouse and Tableau.

The report should include the following:

1. Design a data warehouse using a star schema. You must justify your design decisions.

2. Write the CREATE table statements for the tables in your star schema (include all primary
and foreign keys).

3. Discuss the steps you took in creating and populating the database. This should include the
steps you took in preparing the data and the transformation tasks performed. Include
screenshots of your populated tables.

4. Create 4 visualizations using Tableau. For each visualization, you should include the
following:
Aim of the visualisation
Bullet points covering the data preparation and steps you followed in Tableau to
produce the graph
The effectiveness and presentation of the graph
Key findings from the visualisation

Part B [25 marks]

There are two options to select from;

Option 1: Compose a brief reflective report (500-1000 words) that demonstrates your understanding
of how AWS Glue facilitates automated batch data ingestion. Specifically, address the following:
1. Core Glue Components:
o Briefly describe the key components of AWS Glue (Crawlers, Data Catalog, ETL Jobs,
Triggers).
o Explain the role each component plays in a typical batch ingestion workflow.
2. Automating Batch Ingestion:
o Illustrate how these components work together to automate the process of
extracting, transforming, and loading batch data.
o Focus on a simple, illustrative scenario: for example, ingesting daily CSV files from an
S3 bucket.

Page 1
CI7320 Beryl Jones

3. Reflective Insights:
o Provide a short reflection on the benefits and potential limitations of using AWS
Glue for batch ingestion.
o Consider aspects such as ease of use, scalability, and cost-effectiveness.
o Include a simple diagram or flowchart to visualise the glue workflow.

Option 2:
Complete the AWS Academy Data Engineering [AWS-KU course ref’ 114897] course’s Module 7 –
lab activity – “Lab: Performing ETL on a Dataset by Using AWS Glue”. As you perform each step of
the lab, capture a screenshot. Submit a Word document containing all screenshots as evidence of
your completion.

Please note: Completion will be verified using lab activity statistics from the AWS educator admin
console. Do not share your screenshots with others.

Page 2

You might also like