BI Written Assignment

This document outlines an assignment for students to analyze datasets related to the COVID-19 pandemic in Sri Lanka using business analytics tools. The tasks include analyzing relationships between infected cases and other factors, developing statistical and geospatial models, creating maps visualizing case data and population data by district, developing a geospatial database, and identifying suitable land for a new research center. The analysis requires using tools like R, RStudio, QGIS, PostgreSQL, and Google Earth to complete the assigned data visualization and modeling tasks.

Uploaded by

Jimmy Cyrus

Purpose:

This assignment assesses the student's ability to perform business analysis using statistical and
Geographic Information Systems (GIS) tools, techniques and methodologies, in order to derive
applicable and useful intelligence for informed decision making in private and government sector
institutions on the island and in other countries around the world. The relevant senior
administrative officials of those institutions can use GIS to maximise efficiency and benefits
in informed business decision making while eliminating discrimination, ambiguity and uncertainty.

Tasks Introduction
Understand the given tasks, which are based on Sri Lanka and draw on the data science domain
associated with non-geospatial and geospatial data models. Apply the tools, techniques and
methodologies of business analytics that fall within the module scope, and analyse the different
subject matters using the source data provided as shapefiles (.shp), raster files (.asc),
comma-separated values (.csv), text and Excel files. The data analysis and visualization must be
done using the standard software tools recommended for the module (R, RStudio, R Commander, QGIS,
PostgreSQL, Google Earth, etc.).

Task 1 – Report (100 Marks)

The student is required to carry out the following data analyses and visualizations, based on the
datasets provided with this assignment, using R, RStudio, R Commander, QGIS, PostgreSQL, Google
Earth and other related supporting tools. All required datasets are included in the “Data Sets”
folder, with a separate subfolder for each question.
a) The Epidemiology Unit of the Ministry of Health, Sri Lanka is currently involved in various
research studies to uncover new findings on the acceleration of the COVID-19 pandemic within the
country. A dataset named COVID19_STAT.CSV has been provided for you to uncover possible
associations between the number of COVID-19 infected people and the other factors in the
dataset, with the aid of suitable business analytics tools such as R, RStudio and R Commander.
The analysis should be followed by a critical discussion of the findings.
(15 Marks)
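
For part a), one common way to test for an association between two numeric variables is a Pearson
correlation together with its t statistic. The sketch below is only an illustration in plain
Python; the assignment itself expects R, RStudio or R Commander. The variable names and values are
hypothetical placeholders and are not taken from COVID19_STAT.CSV.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length samples."""
    n = len(xs)
    if n != len(ys) or n < 3:
        raise ValueError("need two samples of equal length with n >= 3")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def t_statistic(r, n):
    """t statistic for H0: no linear association, with n - 2 degrees of freedom."""
    return r * math.sqrt((n - 2) / (1 - r * r))


# Hypothetical placeholder values, NOT taken from COVID19_STAT.CSV.
infected = [12, 25, 31, 47, 58, 66]
density = [110, 240, 300, 420, 610, 640]

r = pearson_r(infected, density)
t = t_statistic(r, len(infected))
print(f"r = {r:.3f}, t = {t:.2f}")
```

In R, the same kind of test is available through cor.test(), which also reports the p-value.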

b) Based on the findings of question a), you are required to develop precise statistical,
numerical and graphical models that support informed decision making. Develop a precise
statistical model of the associations identified between infected people and the other factors
in the provided dataset (COVID19_STAT.CSV) for Sri Lanka, with the aid of suitable business
analytics tools such as R, RStudio and R Commander. The analysis should be followed by a
critical discussion of the findings.
(15 Marks)
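
For part b), the marking criteria refer to regression analysis. As a rough illustration of what
such a model involves, the sketch below fits a simple linear regression by ordinary least squares
in plain Python and reports R squared. It is not the required method or tool (the assignment
expects R, RStudio or R Commander), and the data values are hypothetical placeholders rather than
values from COVID19_STAT.CSV.

```python
def fit_line(xs, ys):
    """Ordinary least squares fit of y = intercept + slope * x."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two samples of equal length with n >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("x values must not all be identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def r_squared(xs, ys, intercept, slope):
    """Share of the variance in y explained by the fitted line."""
    mean_y = sum(ys) / len(ys)
    ss_res = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1 - ss_res / ss_tot


# Hypothetical placeholder values, NOT taken from COVID19_STAT.CSV.
predictor = [110, 240, 300, 420, 610, 640]
infected = [12, 25, 31, 47, 58, 66]

intercept, slope = fit_line(predictor, infected)
fit_quality = r_squared(predictor, infected, intercept, slope)
print(f"infected = {intercept:.2f} + {slope:.4f} * predictor (R^2 = {fit_quality:.3f})")
```

In R, the usual route to the same kind of model is lm() followed by summary().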

c) Develop a map of Sri Lanka that visualizes the information in the table “District distribution
of COVID-19 confirmed patients” by the Epidemiology Unit of the Ministry of Health, Sri Lanka
(provided as Covid-19Situation-Report.pdf). The District Name, Total Patient Count and
respective percentage for each district should be brought into the map from a newly created
SLCOVID19-2020.csv file. All the information in the .csv file should be shown on the map, which
should be classified by Total Patient Count by district. The map should be produced using the
provided vector dataset, and the visualized information should be well described.
(10 Marks)
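
For part c), the CSV file needs a percentage per district, and the map needs the counts grouped
into classes. The sketch below shows one possible way to do both in plain Python, using
equal-interval classes. The counts, the column names and the choice of equal intervals are
assumptions made for illustration; the real figures come from the situation report, and QGIS
offers several classification modes of its own.

```python
import csv
import io

# Hypothetical counts for illustration only; not figures from the situation report.
counts = {"Colombo": 500, "Gampaha": 300, "Kandy": 150, "Galle": 50}


def with_percentages(district_counts):
    """Return (district, count, percentage of the overall total) rows."""
    total = sum(district_counts.values())
    return [(name, n, round(100 * n / total, 2)) for name, n in district_counts.items()]


def equal_interval_class(value, lowest, highest, classes):
    """Return a 1-based class index using equal-width breaks."""
    if highest == lowest:
        return 1
    width = (highest - lowest) / classes
    index = int((value - lowest) / width) + 1
    return min(index, classes)  # the maximum value belongs to the top class


def to_csv(rows):
    """Serialize the rows as CSV text with a header line (hypothetical column names)."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["district", "total_patients", "percentage"])
    writer.writerows(rows)
    return buffer.getvalue()


rows = with_percentages(counts)
print(to_csv(rows))
for name, n, _ in rows:
    print(name, equal_interval_class(n, min(counts.values()), max(counts.values()), 3))
```
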
d) Using Table 3.3: Growth in population by districts, 1981-2012 in SLCensesPop2011.pdf,
published by the Department of Census and Statistics, Sri Lanka, develop an informative map
using the provided shapefiles and classify it by district total population in 2012. The map
should contain District Name, District Total Population 2012 and Average Annual Growth Rate (%)
2001-2012. Give precise and critical justifications identifying the districts at potential risk
of spreading the novel coronavirus (COVID-19), so that the government authorities can take
suitable preventive measures based on informed decision making.
(10 Marks)

e) Develop a digitized, informative area map with suitable information. The map should contain
separate new vector layers, created by digitization, for the natural and man-made land cover
(buildings, roads, forests, etc.) of the Infectious Diseases Hospital (IDH) and its suburbs.
The provided Google Earth aerial image should be used to support digitization with the QGIS
OpenLayers plugin, Google Earth or Google Maps. Every vector layer attribute table should
contain id, name and type fields with their associated data. By analysing the map, describe the
facility development that could be carried out to increase COVID-19 patient treatment capacity
in the IDH complex, given the rising requirements caused by the second wave of COVID-19 and a
predicted third wave in the country. (For map development, use the coordinate reference system
WGS 84, EPSG:4326.)
(10 Marks)

f) Develop a PostgreSQL geospatial database named “SLCOVID19-2020” to hold the data provided in
the table “District distribution of COVID-19 confirmed patients” by the Epidemiology Unit of
the Ministry of Health, Sri Lanka (provided as Covid-19-Situation-Report.pdf). The database
should contain District Name, Total Patient Count, Total Patient Count Percentage and the
shapefiles provided. Using the SLCOVID19-2020 geospatial database, develop a thematic map
classified by Total Patient Count by district that visualizes the following information:

i) District Name
ii) Total Patient Count
iii) Total Patient Count Percentage
(10 Marks)

g) Develop a map of Sri Lanka showing the COVID-19 Testing Centers and Quarantine Centers
declared by the Ministry of Health, Sri Lanka (as at 11/30/2020) in response to the COVID-19
outbreak on the island. The map should visualize Center Name, Center Type (Testing /
Quarantine), District Name, Latitude and Longitude. The exact GPS locations should be retrieved
via Google Earth with the support of a KML/KMZ file. A comprehensive description of all
identified centers and their importance in the national mission against the novel coronavirus
pandemic should be included. (The official information can be accessed via Sri Lanka's response
to COVID-19: http://www.covid19.gov.lk)
(10 Marks)
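
For part g), the locations saved from Google Earth end up as placemarks in a KML file, where each
point stores its coordinates as longitude,latitude[,altitude]. The sketch below shows one way to
read the name, latitude and longitude of each placemark with the Python standard library. The
sample KML, including the center name and coordinates, is a hypothetical placeholder and not an
official center record; a KMZ file would first need to be unzipped to obtain the KML inside it.

```python
import xml.etree.ElementTree as ET

KML_URI = "http://www.opengis.net/kml/2.2"
KML_NS = {"kml": KML_URI}

# Hypothetical sample; the name and coordinates are placeholders.
SAMPLE_KML = """<kml xmlns="http://www.opengis.net/kml/2.2">
  <Document>
    <Placemark>
      <name>Example Testing Center</name>
      <Point><coordinates>79.8612,6.9271,0</coordinates></Point>
    </Placemark>
  </Document>
</kml>"""


def read_placemarks(kml_text):
    """Return a list of dicts with name, latitude and longitude for point placemarks."""
    root = ET.fromstring(kml_text)
    centers = []
    for placemark in root.iter(f"{{{KML_URI}}}Placemark"):
        name = placemark.findtext("kml:name", default="", namespaces=KML_NS)
        coords = placemark.findtext(
            "kml:Point/kml:coordinates", default="", namespaces=KML_NS
        ).strip()
        if not coords:
            continue  # skip placemarks that are not points
        # KML stores longitude first, then latitude.
        longitude, latitude = (float(part) for part in coords.split(",")[:2])
        centers.append({"name": name.strip(), "latitude": latitude, "longitude": longitude})
    return centers


centers = read_placemarks(SAMPLE_KML)
for center in centers:
    print(center["name"], center["latitude"], center["longitude"])
```
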

h) Using the provided dataset, develop a map to identify suitable land for establishing a new
state-of-the-art Infectious Diseases Research Center in Peradeniya. The suitable area should be
located 600 m away from the Pushpadana Vidyalaya and 800 m away from the Power House. The land
should not be scrub, plantation, paddy or barren land. The map should be followed by a critical
discussion of the suitability of establishing the Research Center in the identified area. The
discussion should also provide the following supporting information:

i) Total number of buildings currently situated within the suitable area.
ii) Total land area occupied by the buildings within the suitable area.
iii) Total suitable land area.
(20 Marks)
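
For part h), the marking criteria mention geoprocessing tools such as buffering, clipping and
intersection, which would normally be run in QGIS. The sketch below only illustrates the
underlying logic for a single candidate location in plain Python. It reads “600 m away” and
“800 m away” as minimum distances, which is an assumption about the brief, and it assumes planar
coordinates in metres. The coordinates and land-use labels are hypothetical.

```python
import math

# Hypothetical labels for the land-use classes excluded by the brief.
EXCLUDED_LAND_USE = {"scrub", "plantation", "paddy", "barren"}


def is_suitable(point, land_use, school, power_house,
                school_buffer_m=600.0, power_buffer_m=800.0):
    """Check one candidate location against the distance and land-use rules.

    All coordinates are (x, y) pairs in metres in a projected coordinate system.
    The buffers are treated as minimum distances (an assumption).
    """
    if land_use.lower() in EXCLUDED_LAND_USE:
        return False
    far_from_school = math.dist(point, school) >= school_buffer_m
    far_from_power_house = math.dist(point, power_house) >= power_buffer_m
    return far_from_school and far_from_power_house


# Hypothetical projected coordinates in metres, for illustration only.
school = (0.0, 0.0)
power_house = (2000.0, 0.0)

print(is_suitable((1000.0, 0.0), "forest", school, power_house))
print(is_suitable((500.0, 0.0), "forest", school, power_house))
print(is_suitable((1000.0, 0.0), "paddy", school, power_house))
```

In QGIS, the same idea scales to whole layers by buffering the two features, removing the buffers
and the excluded land-use polygons from the study area, and intersecting the result with the
buildings layer to answer sub-questions i) to iii).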

Guidelines for the report format

• Paper A4
• Margins 1.5” left, 1” right, top and bottom
• Page numbers – bottom, right
• Line spacing 1.5
• Font
  o Headings: 14pt, bold
  o Normal text: 12pt
  o Font face: Times New Roman
• Referencing and in-text citation should strictly follow the Harvard Referencing System.

Recommended Reading
• Tableau certification (2019) Desktop Specialist. https://www.tableau.com/support/certification ;
  https://www.tableau.com/learn/classroom/desktop-one
• Business Intelligence and Analytics with Tableau (2019)
  https://www.tableau.com/learn/whitepapers/modern-approach-business-intelligence
• Business Intelligence and Analytics with SAS (2019)
  https://www.sas.com/en_us/solutions/businessintelligence.html
• Business Intelligence and Analytics with IBM (2018) https://www.ibm.com/business-intelligence
• MIT Media Lab (2017) Social Computing Research.
  https://www.media.mit.edu/research/groups/socialcomputing
• Mitchell, R. (2016) Web Scraping with Python: Collecting Data from the Modern Web. CA: O’Reilly
  Media Inc.
• Castrillo-Fernández, O. (2015) Web Scraping: Applications and Tools. European Public Sector
  Information (PSI) Platform.
• ITIL certification (2017) https://www.axelos.com/certifications/itil-certifications
• Intl. J. of Business Intelligence and Data Mining, http://www.inderscience.com/ijbidm,
  ISSN (Online): 1743-8195, ISSN (Print): 1743-8187
• Bocij, P., Greasley, A. and Hickie, S. (2014) Business Information Systems: Technology,
  Development and Management for E Business, 5th edn., Harlow, UK: Pearson. (Kindle version
  available)
• Cragg, P., Mills, A. and Suraweera, T. (2010) Understanding IT management in SMEs. Electronic
  Journal of Information Systems Evaluation, 13(1), pp.27-34.
• Chen, D.Q., Mocker, M., Preston, D.S. and Teubner, A. (2010) Information Systems strategy:
  Reconceptualization, measurement, and implications. MIS Quarterly, 34(2), pp.233-259.
• Luftman, J. and Kempaiah, R. (2007) An update on business-IT alignment: “A line” has been drawn.
  MIS Quarterly Executive, 6(3), pp.165-177.

Marking Scheme

Learning Outcomes:

1. Demonstrate understanding of the leading technologies relating to business intelligence, data
   analysis, predictive and other analytical technologies (e.g. geospatial, social), and be able
   to apply them appropriately in real-world scenarios.
2. Demonstrate understanding and application of specialist technologies used to harvest, analyse
   and visualize business data in an intelligent way.
3. Critically evaluate, design, prototype and implement business intelligence, from data
   harvesting and processing through visualization to business analysis and storytelling.
4. Explore the latest visualization techniques, business-IT project governance and related
   industry certifications.

Learning Outcomes covered by the coursework: LO2, LO3, LO4

a) LO2, LO3
b) LO2, LO3
c) LO2, LO3, LO4
d) LO2, LO3, LO4
e) LO2, LO3, LO4
f) LO2, LO3, LO4
g) LO2, LO3, LO4
h) LO2, LO3, LO4

Marking criteria for Task 1 (Coursework)

Marks   Description of the criteria

Part a) (15 Marks)
0-1     No reporting or very poor reporting and statistical testing on the subject matter.
1-5     Basic reporting, with hypothesis-based correlation testing carried out on the subject
        matter using the most suitable variables.
5-8     Very good reporting, with hypothesis-based correlation testing carried out on the subject
        matter using the most suitable variables. The findings are supported by box plots.
8-15    Excellent reporting, with a full range of hypothesis-based correlation tests carried out
        on the subject matter. The findings are supported by scatter plots and are clearly
        justified.

Part b) (15 Marks)
0-1     No reporting or very poor reporting and statistical testing on the subject matter.
1-4     Basic reporting, with a full regression analysis carried out on the subject matter.
4-7     Very good reporting, with a full regression analysis carried out on the subject matter.
        The findings are supported by scatter plots. A precise statistical model has been
        developed.
7-15    Excellent reporting, with a full regression analysis carried out on the subject matter.
        The findings are supported by scatter plots. A precise statistical model has been
        developed. The findings are clearly justified.
Part c) (10 Marks)
0-2     No map or a very poor, ordinary map has been included.
3-5     Basic map with some of the required information. No vector data layer is used to
        visualize the information clearly. The CSV file SLCOVID19-2020.csv has been created.
5-7     Very good map with all required information. All standard map elements (north arrow,
        graphic map scale, numeric map scale, map title, map legend) are included so that the
        map is easy to read. A vector data layer is used to visualize the information clearly.
        The CSV file SLCOVID19-2020.csv has been created.
7-10    Excellent map with all required information. All standard map elements (north arrow,
        graphic map scale, numeric map scale, map title, map legend) are included so that the
        map is easy to read. A suitable base map is included. The map is properly captioned and
        well explained. The CSV file SLCOVID19-2020.csv has been created.
Part d) (10 Marks)
0-3     No map or a very poor, ordinary map has been included.
3-6     Basic map with some of the required information.
6-8     Very good map with all required information. All standard map elements (north arrow,
        graphic map scale, numeric map scale, map title, map legend) are included so that the
        map is easy to read. A suitable base map is included and the area identified.
8-10    Excellent map with all required information. All standard map elements (north arrow,
        graphic map scale, numeric map scale, map title, map legend) are included so that the
        map is easy to read. A suitable base map is included and the area identified. The map is
        properly captioned and well explained, with precise and critical justifications.

Part e) (10 Marks)
0-3     No map or a very poor, ordinary map has been included.
3-6     Basic map with some of the required digitized information.
6-8     Very good map with all required information. All standard map elements (north arrow,
        graphic map scale, numeric map scale, map title, map legend) are included so that the
        map is easy to read. A suitable base map is included.
8-10    Excellent map with all required information. All standard map elements (north arrow,
        graphic map scale, numeric map scale, map title, map legend) are included so that the
        map is easy to read. A suitable base map is included. The map is properly captioned and
        well explained.

Part f) (10 Marks)
0-1     No map or a very poor, ordinary map has been included.
1-3     Basic map with some of the required information.
3-5     Very good map with all required information. All standard map elements (north arrow,
        graphic map scale, numeric map scale, map title, map legend) are included so that the
        map is easy to read. A suitable base map is included. The geodatabase has been developed
        using the PostGIS spatial DBMS. Suitable screenshots of the work are included in the
        appendix.
5-10    Excellent map with all required information. All standard map elements (north arrow,
        graphic map scale, numeric map scale, map title, map legend) are included so that the
        map is easy to read. A suitable base map is included. The map is properly captioned and
        well explained. The geodatabase has been developed using the PostGIS spatial DBMS.
        Suitable screenshots of the work are included in the appendix.

Part g) (10 Marks)
0-1     No map or a very poor, ordinary map has been included.
1-3     Basic map with some of the required information.
3-5     Very good map with all required information. All standard map elements (north arrow,
        graphic map scale, numeric map scale, map title, map legend) are included so that the
        map is easy to read. A suitable base map is included. KML/KMZ file(s) created with Google
        Earth are included. Suitable screenshots of the work are included in the appendix.
5-10    Excellent map with all required information. All standard map elements (north arrow,
        graphic map scale, numeric map scale, map title, map legend) are included so that the
        map is easy to read. A suitable base map is included. The map is properly captioned and
        well explained. KML/KMZ file(s) created with Google Earth are included. Suitable
        screenshots of the work are included in the appendix.
Part h) (20 Marks)
0-1     No map or a very poor, ordinary map has been included.
1-5     Basic map with some of the required information.
5-12    Very good map with all required information. All standard map elements (north arrow,
        graphic map scale, numeric map scale, map title, map legend) are included so that the
        map is easy to read. Suitable geoprocessing tools such as buffering, clipping and
        intersection have been used. Suitable screenshots of the work are included in the
        appendix.
12-20   Excellent map with all required information. All standard map elements (north arrow,
        graphic map scale, numeric map scale, map title, map legend) are included so that the
        map is easy to read. Suitable geoprocessing tools such as buffering, clipping and
        intersection have been used. The sub-questions have been answered. The map is well
        explained. Suitable screenshots of the work are included in the appendix.

Final Grading criteria for the coursework

Marks    Final Grade
>=70     1
60-69    2:1
50-59    2:2
40-49    3
<40      Fail
