Spatial Data Analytics_coursework
Completed work must be submitted (uploaded) as a report in the form of a single MS Word document and
a PDF document, with a file size not exceeding 20MB.
The report should include your maps, calculations and results from your analysis, as well as discussion
and interpretation of your results. The report should be no more than 2,000 words long.
The word count for this assignment primarily refers to the length of the main body of the text and
excludes references, table/figure captions, and any numerical information in the tables. Equations in the
main text will count as one word each. Please ensure that figure captions provide information about the
respective figure only and that they do not contain additional information that should be explained in the
main text.
----------------------------------
BRIEFING
This assignment gives you an opportunity to consolidate your knowledge and skills to investigate spatial
association among geographical data. The objective of this type of analysis is to understand
the mechanism of how spatial phenomena/events occur and predict what the outcome will be under
certain scenarios. In this assignment, you will be asked to collect data on geographical events or
phenomena whose attributes are assumed to have a causal relationship with each other. You will then
describe your assumption/hypothesis about these possible associations, and carry out non-spatial and/or
spatial regression analyses. Finally, you are expected to interpret the findings about such
associations, and present them in a report of up to 2,000 words.
TASKS
1. Collect geographical data in which you hypothesise the presence of a causal relationship between the
attribute variables attached to the geographical features. In your report,
• briefly describe the data you collected;
• specify the dependent and the independent variables to run the regression analysis, and
describe your hypothesis about the causal relationship between these variables.
• Present the distribution of the dependent and the independent variables using maps and/or basic
statistics and provide brief comments on possible relationships between these variables (perhaps
through visual observation of the maps).
2. Using GeoDa (or ArcGIS or R), carry out the ordinary least squares (OLS) regression analysis (i.e. a
standard, non-spatial regression analysis). Interpret and discuss the results (e.g. is there any
statistically significant relationship between the dependent and the independent variables? Is the
produced regression model a good model, and why do you think that is the case?).
3. Check the presence/absence of spatial autocorrelation in the regression residuals using a global
spatial autocorrelation method. Using GeoDa (or ArcGIS or R), map the spatial distribution of the
residuals, and then carry out a statistical test for spatial autocorrelation on the regression residuals
using Moran’s I index. Interpret the outcome (e.g. does the result indicate the presence of
significant spatial autocorrelation? Does the model require further re-specification and why?).
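To make the logic of Tasks 2 and 3 concrete, the following is a minimal sketch in Python, not a substitute for GeoDa, ArcGIS or R. It fits an OLS model with one independent variable, then computes the global Moran's I of the residuals and a one-sided permutation pseudo p-value. Everything in it is an assumption made for illustration: the toy data, the layout of six zones in a row, the binary (not row-standardised) contiguity weights, and the function names `ols_simple`, `chain_weights`, `morans_i` and `permutation_p_value`. Dedicated software may also apply an adjustment specific to regression residuals when testing Moran's I, which this sketch does not do.

```python
import random

def ols_simple(x, y):
    """Fit y = intercept + slope * x by ordinary least squares (one predictor)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    residuals = [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]
    sse = sum(e ** 2 for e in residuals)          # unexplained variation
    sst = sum((yi - mean_y) ** 2 for yi in y)     # total variation
    return intercept, slope, residuals, 1 - sse / sst

def chain_weights(n):
    """Binary contiguity weights for n zones in a row (hypothetical layout)."""
    return [[1.0 if abs(i - j) == 1 else 0.0 for j in range(n)] for i in range(n)]

def morans_i(values, weights):
    """Global Moran's I: (n / S0) * sum_ij w_ij d_i d_j / sum_i d_i^2."""
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    s0 = sum(sum(row) for row in weights)         # sum of all weights
    num = sum(weights[i][j] * dev[i] * dev[j] for i in range(n) for j in range(n))
    den = sum(d * d for d in dev)
    return (n / s0) * (num / den)

def permutation_p_value(values, weights, n_perm=999, seed=0):
    """One-sided pseudo p-value: share of random relabellings with I >= observed."""
    rng = random.Random(seed)
    observed = morans_i(values, weights)
    shuffled = list(values)
    at_least_as_large = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if morans_i(shuffled, weights) >= observed:
            at_least_as_large += 1
    return (at_least_as_large + 1) / (n_perm + 1)

# Hypothetical toy data: six zones in a row, one independent variable.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
y = [2.5, 4.6, 6.4, 7.6, 9.5, 11.4]

intercept, slope, residuals, r_squared = ols_simple(x, y)
w = chain_weights(len(x))
print("intercept:", intercept, "slope:", slope, "R-squared:", r_squared)
print("Moran's I of residuals:", morans_i(residuals, w))
print("pseudo p-value:", permutation_p_value(residuals, w))
```

A Moran's I close to zero with a large pseudo p-value would suggest no detectable spatial autocorrelation in the residuals. A clearly positive and significant value would suggest that the OLS model leaves spatial structure unexplained and may need re-specification.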
ASSESSMENT GUIDANCE
Good marks will be awarded to submissions that successfully demonstrate the following:
1. The data set collected is appropriate for this type of analysis, and a clear and succinct description
is provided for the distribution of the variables used in the analysis.
2. The hypothesis on the relationship between the dependent and the independent variables is
reasonable.
3. The OLS regression analysis is correctly carried out, and the result of the analysis is properly
assessed.
4. The statistical test for spatial autocorrelation in the regression residuals is correctly carried out.
5. The re-specification of the regression model and comparison of the results from the different
models are carried out in a statistically correct manner.
6. Overall results and findings on the relationship between the dependent and the independent
variables are clearly and logically presented.