Unit 7 - AI (Evaluation)
Unit 7 - AI (Evaluation)
Evaluation
Evaluation
• Method used for understanding the reliability of AI model on basis of
output provided after test data is given.
• It is measurement of model’s reliability & performance as per
requirement.
• This is done by comparing the output generated(Predictions) with the
actual outputs (Reality).
• For evaluation testing data is very crucial.
Characteristics of Testing Data
• It is completely different, unique and new entity when compared to
training data.
• Data is prepared carefully by trained professionals after exploring big
data.
• It’s recommended that the testing data be different from entire
training data set to evaluate model without any biases.
Scenario
• Scenario is the problem area for which the model has been deployed.
• It is the source of real data which is fed into the model for processing.
YES / TRUE/
YES / TRUE/ POSITIVE TRUE POSITIVE
POSITIVE
NO/ FALSE/
NO/ FALSE/ NEGATIVE TRUE NEGATIVE
NEGATIVE
YES / TRUE/
NO/ FALSE/ NEGATIVE FALSE POSITIVE
POSITIVE
NO/ FALSE/
YES / TRUE/ POSITIVE FALSE NEGATIVE
NEGATIVE
Confusion Matrix / Error Matrix
• Outcome of the comparison between the prediction & reality,
can be recorded in a tabular form called Confusion Matrix or Error Matrix.
• It helps in visualizing the performance of algorithm & the model
• Mostly used to supervise the learning of the models.
• It is not an evaluation matrix but a
• TRUE POSITIVE
record which can help in evaluation. • Prediction and Reality Matches (TRUE)
TP • Prediction is True (POSITIVE)
• TRUE NEGATIVE
• Prediction and Reality Matches (TRUE)
TN • Prediction is False (NEGATIVE)
• FALSE POSITIVE
• Prediction and Reality DO NOT Match (FLASE)
FP • Prediction is True (POSITIVE)
• FLASE NEGATIVE
• Prediction and Reality DO NOT Match (FLASE)
FN • Prediction is False (NEGATIVE)
REALITY
The Confusion
Matrix
YES NO
• Depending on the situation the model has been deployed, choosing between
Precision and Recall is necessary.
• A False Negative can cost us a lot of money and put us in danger in a situation
like a forest fire.
• Viral Outbreak is another situation in which a False Negative might be harmful.
• Consider a scenario in which a fatal virus has begun to spread but is not being
detected by the model used to forecast viral outbreaks. The virus may infect
numerous people and spread widely.