COM7039M MachineLearning Assignment Brief-Level 7-1
COM7039M MachineLearning Assignment Brief-Level 7-1
Module Details
Assignment Description
b. How would you evaluate the performance of the classification algorithm with
appropriate metrics? (10 Marks)
Assignment Description
Dataset Description:
The "Hate Speech and Offensive Language" dataset is collected from Twitter.
It is primarily designed to support research and development in detecting and
analyzing hate speech and offensive language on social media, distinguishing
them from ordinary slang and neutral content.
Dataset Link:
https://www.kaggle.com/datasets/mrmorj/hate-speech-and-offensive-
language-dataset
Alternative Source:
The datasets are available to download from the Machine Learning (ML) module
in the Moodle platform.
Assignment Description
Learning Outcomes
PLOs 7.1-7.7
7.1Evaluate computer science concepts and principles and their application to the
effective
7.2 Apply the findings of advanced scholarship and/or contemporary research and
practice to
7.3 Critically evaluate computer science problems, including those at the forefront of
the field.
7.4 Demonstrate operation within applicable professional, legal, social, and ethical
frameworks.
7.5 Demonstrate originality and creativity in the solution of computer science problems.
7.7 Apply standards, quality processes, and engineering principles to the solution of
computer science problems.
School of London
Submission Guidelines:
Your work will be marked according to the assessment instructions provided within this
document and the selected Learning Outcomes’ (LOs) (see above).
Furthermore, this assessment is marked using the assessment marking criteria or a similar
rubric that aligns with the University’s Generic Assessment Descriptors (see below).1 This
is to ensure all assessment decisions are comparable regardless of the discipline or mode
of assessment.
Please note that you must meet the required baseline standards (50 – 59%) which will
include the LOs and minimum expectations of the assessment. Further still, you must
ensure you meet the requirements of each grade boundary to progress to the next, i.e., you
should demonstrate your learning through the standards of the Pass, Merit and Distinction
to reach a Distinction (70 – 84%). These standards are designed to scaffold and build your
learning to achieve your fullest potential in each criterion being assessed.
School of London
Deliverables for Task 1 and Task 2
a An extraordinary conceptual 10
understanding of K-means Marks
clustering algorithm, advantages,
Task and disadvantages with real-world
01- 20 applications. If any examples are
provided.
Marks b An appropriate description of all 10
the metrics like accuracy, Marks
precision, recall, F1-score, area
under the receiver operating
characteristic curve (AUC-ROC),
Confusion matrix, and Logarithmic
loss (Log Loss.)
Data Exploration Correct handling of missing values, 10
and Pre-processing outliers, and data normalization. Marks
Effective exploratory data analysis
(EDA) to gain insights into the
dataset. Provide a well-documented
Jupyter Notebook or script
Task
containing the code for data
02- preprocessing steps. Ensure that
each step is properly commented on
80
to explain its purpose and
Marks functionality.
Feature Present code segments that 15
Engineering generate new features based on Marks
domain knowledge or creative
insights. Discuss the encoding
methods chosen and their suitability
for the problem. Applying
mathematical transformations to
numerical features. Scaling
numerical features to ensure they
are on similar scales. Discuss the
feature selection and its importance.
Model Selection Correct implementation of selected 10
and Training machine learning algorithms Marks
should be presented. Splitting the
dataset into training and testing
(validation) sets using techniques
School of London
like the train-test split or k-fold
cross-validation. Adequate use of
libraries and tools to streamline the
implementation process.
Explanation of the method you used 10
Hyperparameter for hyperparameter tuning reasons Marks
Tuning for selecting this method and how it
suits your specific problem. Data
Splitting Strategy: How you divided
your data into training, validation,
and test sets. Model Training and
Evaluation Protocol: Description of
how you trained and evaluated
models for different hyperparameter
configurations. Explanation of the
performance metric(s) you used to
assess model performance.
Model Evaluation Accurate evaluation of model 20
performance using relevant metrics Marks
(e.g., accuracy, precision, recall, F1-
score).
Comprehensive comparison of
multiple models and their
strengths/limitations. Insightful
interpretation of results and trends
observed.
Model Deployment Testing the developed model using 10
real-world data or unseen data that Marks
it performs as expected and
provides accurate predictions and
validates the model performance.
Conclusion and The clear and organized structure of 5
Recommendations the report with proper sections Marks
(Introduction, Methodology, Results,
Discussion, Conclusion). Coherent
explanations of the implemented
algorithms and techniques. Effective
visualization of results through
graphs, charts, and tables. Cohesive
and well-written analysis of findings
and conclusions.
Total Marks 100
Marks
School of London
School of London
Marking Criteria
Pass Grade Bands (100 – 50) (Learning Outcomes must be met)
Fail Grade Bands (49 – 0) (Learning Outcomes are not met)
Assessment Criteria Pass Merit Distinction Distinction Borderline Fail Fail Fail
(50 – 59) (60 – 69) (70 – 84) (85 – 100) (45 - 49) (30 - 44) (0 - 29)
(Credits may be (Credits may not be (Credits may
compensated) compensated) not be
compensated)
Research An Demonstrates a deep and Shows a strong Provides a good Demonstrates adequate Shows limited Demonstrates a poor Fails to
Skills extraordina insightful understanding of the K- understanding with understanding with understanding with basic understanding with understanding with demonstrate
ry means clustering algorithm with relevant examples and some examples and examples and some insufficient examples or little to no relevant understanding
conceptual detailed examples and critical solid analysis. analysis. discussion. analysis. examples. of the K-
understandi analysis. means
ng of the K- clustering
means algorithm.
clustering
algorithm,
including
its
advantages
,
disadvanta
ges, and
real-world
application
s. 10%
Thinking Skills An Exhibits exceptional creativity and Shows strong creativity Demonstrates good Provides adequate Limited creativity with Shows minimal Fails to
and Creativity innovative originality in problem-solving. with effective problem- creativity with some creativity with basic few innovative solutions. creativity with demonstrate
approach to solving approaches. innovative solutions. problem-solving ineffective problem- creativity or
problem- approaches. solving. effective
solving problem-
with solving.
creative
insights
and
solutions.
10%
School of London
Assessment Criteria Pass Merit Distinction Distinction Borderline Fail Fail Fail
(50 – 59) (60 – 69) (70 – 84) (85 – 100) (45 - 49) (30 - 44) (0 - 29)
(Credits may be (Credits may not be (Credits may
compensated) compensated) not be
compensated)
Practical Skills Data Excellent handling of missing Strong handling of data Good handling of data Adequate handling of data Limited handling of data Poor handling of data Fails to handle
and Exploration values, outliers, and data issues and effective issues and EDA. issues with basic EDA. issues or EDA. issues with insufficient data issues
normalization. Effective and EDA. Well-documented Adequate Documentation and Incomplete EDA. Inadequate effectively.
Professional and Pre-
insightful EDA with a well- with minor gaps in documentation and explanations are present documentation or documentation and Lacks proper
Learning Skills processing- documented Jupyter Notebook or explanation. explanation. but may lack depth. explanation. explanations. EDA and
10% script. documentation
Thinking skills Feature Innovative and effective feature Strong feature Good feature Adequate feature Limited feature Poor feature Fails to
&Practical Engineering engineering with detailed engineering with good engineering with some engineering with basic engineering with minimal engineering with demonstrate
explanation of encoding methods, explanation of methods explanation of methods explanation of methods explanation. insufficient effective
Skills and Process-15%
mathematical transformations, and transformations. and transformations. and transformations. explanation. feature
Professional scaling, and feature selection. engineering.
Learning Skills
Practical Skills Model Excellent implementation of Strong implementation Good implementation Adequate implementation Limited implementation Poor implementation Fails to
and Selection algorithms with thoughtful data with appropriate data with correct data splitting with basic data splitting with inappropriate data with ineffective data implement
and splitting and optimal use of tools splitting and effective and adequate tool and tool usage. splitting or tool usage. splitting or minimal models
Professional
Training- and libraries. use of tools. usage. tool usage. correctly or
Learning Skills 10% use tools
effectively.
Practical Skills Hyperpara Comprehensive evaluation using Accurate evaluation with Good evaluation with Adequate evaluation with Limited evaluation with Poor evaluation with Fails to
and meter relevant metrics with insightful good comparison and appropriate metrics and basic metrics and limited minimal use of metrics inadequate metrics evaluate
Tuning-10% comparison and interpretation of interpretation of multiple some comparison of comparison of models. and comparison. and no comparison of models
Professional
results. models. models. models. effectively or
Learning Skills provide
meaningful
insights.
Practical Skills Model Comprehensive evaluation using Accurate evaluation with Good evaluation with Adequate evaluation with Limited evaluation with Poor evaluation with Fails to
and Evaluation- relevant metrics with insightful good comparison and appropriate metrics and basic metrics and limited minimal use of metrics inadequate metrics evaluate
20% comparison and interpretation of interpretation of multiple some comparison of comparison of models. and comparison. and no comparison of models
Professional
results. models. models. models. effectively or
Learning Skills provide
meaningful
insights.
Practical Skills Model Thorough testing of the model with Effective testing with real- Good testing with some Adequate testing with basic Limited testing with Poor testing with Fails to test or
and Deploymen real-world or unseen data, world or unseen data and validation of performance validation of performance. insufficient validation of minimal or ineffective validate model
Professional t-10% demonstrating accurate predictions validation of performance. using unseen data. performance. validation of performance
and validation of performance. performance. effectively.
Learning Skills
School of London
Assessment Criteria Pass Merit Distinction Distinction Borderline Fail Fail Fail
(50 – 59) (60 – 69) (70 – 84) (85 – 100) (45 - 49) (30 - 44) (0 - 29)
(Credits may be (Credits may not be (Credits may
compensated) compensated) not be
compensated)
Communicati Report Clear, organized report with detailed Well-structured report Clear report with adequate Adequate report with basic Limited report with Poor report with Fails to provide
on Structure sections, insightful explanations, and with good explanations structure, explanations, structure and analysis, unclear structure and minimal organization, a coherent
and Clarity- effective visualizations. Cohesive and visualizations. and some visualizations. though visualizations may be insufficient analysis or analysis, and report or
analysis and conclusions. lacking. visualizations. visualizations. meaningful
5%
analysis and
visualizations.