Group Assignment
Group Assignment
Group Assignment
Page 1 of 4
For the assignment, you are asked to explore the application of big data analytics techniques to
the data problem of your choice. You can choose to study one particular data problem, giving
special consideration to the unique properties of the problem domain, and testing one or more
methods on it.
Learning Outcomes
Course Learning Outcome 3: Demonstrate the analytical and visualization methods for
effective storytelling in a given business case.
On conclusion students should be able to: Explain and implement the concept along with
methods for knowledge discovery. Select, analyse and evaluate the most suitable data mining
methods for solving specific problems. Demonstrate the analytical and visualization methods for
effective storytelling in each business case selected by team or assigned by lecturer.
Assessment
The total assessment mark of this group case study is 40%, with 50% of the total contributed by
an individual component and remaining a group marks. Marking criteria is attached on this
assignment.
Groups
Your class will be divided into groups. Each group will contain maximum of 4 members only.
Dataset Preparation
To go through data selection, cleaning, formatting and exploring. The goal of exploring is to
identify the most important fields in predicting an outcome, and determine which derived values
may be useful.
Getting datasets
Every project must involve at least one dataset. The data set should be unbiased and the minimal
size of data is required to fulfil your assignment objective. There are many interesting and freely
available datasets that you can find in the internet especially on social networking datasets,
airline data, weather forecasting and much more.
1. https://www.kdnuggets.com/datasets/government-local-public.html
2. https://github.com/awesomedata/awesome-public-datasets
5. http://www.rdatamining.com/resources/data
You can implement your project using one of the following data mining software packages:
a) Rapid Miner
b) WEKA http://www.cs.waikato.ac.nz/ml/weka/.
c) R – rattle http://rattle.togaware.com/.
f) Microsoft
h) SPOTFIRE
Deliverables
Source code should be submitted together with documentation digital submission. All links
for submission will be created by class lecturer using Moodle.
Each student is required to present their assignment model (scope – data pre processing –
model developed- analysis – interpretation) during group presentation. Presentation schedule
would be announced by lecturer in class.
Note: If unable to form a group due to insufficient student numbers or other approved reasons by module
lecturer, marking criteria above will be considered 100% as Individual component (all criteria marked as
individual component)
PERFORMANCE CRITERIA
This grade will be assigned to work which is considered to be of very high standard and which
meets above 75% of the basic requirements listed above. The mapping between methodology
steps should be excellent. All deliverables should be coherent with detailed descriptions. Overall
documentation standards should be of excellent quality. Accurate, relevant and up-to-date
referencing is visible. In order to obtain a grade at this level, the group should be able to address
all issues with regards to the module.
This grade will be assigned to work which is considered to be of high standard and which meets
at least 65% of the basic requirements listed above. The mapping between methodology steps
should be good. To obtain this grade, the assignment should show all techniques applied but may
contain some errors. All deliverables should be coherent with detailed descriptions. Overall
documentation should be of excellent quality. In order to obtain a grade at this level, the group
should be able to address most issues with regards to the module. Accurate, relevant and up-to-
date referencing is visible.
This grade will be assigned to work which is considered to be of average standard and which
meets at least 50% of the basic requirements listed above. The mapping between methodology
steps should be good. The documentation should be of adequate standard in terms of language,
layout and flow. Some accurate, relevant and up-to-date referencing is visible. The group has an
adequate level of professionalism and project knowledge.