Assignment Question
This assignment will help you to explore and analyse a set of data and reconstruct it into
meaningful representations for decision making.
You are required to explore an hourly weather data set and categorize it using different
techniques so that it yields the information needed to support decision making. Your
analysis should be deep and detailed, and it must go further than what has already been
covered in this course.
You must import the data, carry out the necessary pre-processing on the dataset, and use
the appropriate commands to convert it into the desired format. You must apply data
visualization, exploration, and manipulation techniques in your project. It is very important
to explain and justify the techniques that you have chosen. Outline the findings, analyse
them, and justify them correctly with appropriate graphs. A supporting document is also
required to present the graphs and code, written using R programming concepts. Additional
features must explore further concepts that can improve the retrieval of information.
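The import, pre-processing, and exploration steps described above could be sketched in base R roughly as follows. The column names (origin, time_hour, temp) and the inline sample rows are assumptions made purely for illustration, since the actual columns are defined by the assignment's own table. With the supplied file, the inline data frame would be replaced by an import call such as read.csv().

```r
# Hypothetical sample rows standing in for the supplied data file.
# Column names and values are illustrative assumptions only.
weather <- data.frame(
  origin    = c("JFK", "JFK", "LGA", "LGA", "LGA"),
  time_hour = c("2020-01-01 01:00:00", "2020-01-01 02:00:00",
                "2020-01-01 01:00:00", "2020-01-01 02:00:00",
                "2020-01-01 03:00:00"),
  temp      = c(39.0, NA, 39.9, 41.0, 42.1),
  stringsAsFactors = FALSE
)

# Pre-processing: convert columns into the desired types.
weather$origin    <- factor(weather$origin)
weather$time_hour <- as.POSIXct(weather$time_hour, tz = "UTC",
                                format = "%Y-%m-%d %H:%M:%S")

# Count missing values per column before deciding how to handle them.
missing_per_column <- colSums(is.na(weather))
print(missing_per_column)

# One simple option (an assumption, not a requirement of the brief):
# drop the rows whose temperature reading is missing.
clean <- weather[!is.na(weather$temp), ]

# Exploration: mean temperature per airport.
mean_temp <- aggregate(temp ~ origin, data = clean, FUN = mean)
print(mean_temp)
```

A graph such as boxplot(temp ~ origin, data = clean) would be one way to present the same comparison visually. Whichever handling of missing values is chosen should be justified in the report.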
The dataset provided for this assignment contains hourly meteorological data for
LaGuardia Airport (LGA) and John F. Kennedy International Airport (JFK) in the United
States. It contains 15 columns and 17,412 rows. The columns and their descriptions are
given in the table below.
4.0 DELIVERABLES:
The complete code and report must be submitted to the APU Learning Management
System (Webspace).
Program coded in R.
o Name the file under your name and TP number.
o Start the first two lines in your program by typing your name and TP number.
For example:
# NAME
#TP123456
o For each analysis example, give an id and explain the analysis that you did in a
comment. For example:
# Analysis Example 1
# In this example, an analysis between X and Y is given to …….
o For each extra feature example, give an id and explain the analysis that you did
in a comment. For example:
# Extra feature 1
# comments about the extra feature
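Taken together, the conventions above suggest a script laid out roughly like the following sketch. The name, TP number, data values, column names, and the two analyses shown are placeholders chosen for illustration, not prescribed content. Whether a given technique counts as an extra feature depends on what the course has already covered.

```r
# NAME
#TP123456

# Hypothetical stand-in for the imported data set (illustrative values).
weather <- data.frame(
  origin = c("JFK", "JFK", "LGA", "LGA"),
  temp   = c(39.0, 41.0, 40.0, 44.0)
)

# Analysis Example 1
# In this example, temperature readings are averaged per airport to
# compare typical conditions at the two sites.
analysis_1 <- tapply(weather$temp, weather$origin, mean)
print(analysis_1)

# Extra feature 1
# cut() bins the temperatures into labelled bands so that readings can
# be counted per airport and band. The threshold of 40 is arbitrary.
weather$temp_band <- cut(weather$temp,
                         breaks = c(-Inf, 40, Inf),
                         labels = c("cooler", "warmer"))
print(table(weather$origin, weather$temp_band))
```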
i. Coding 50%
Running the submitted code to show the conducted analysis; application of data
analysis techniques covered in the course; good programming practices such as
comments and indentation; the quality of the created graphs; and additional features
and techniques, beyond those covered in the course, that improve the result.