0% found this document useful (0 votes)

23 views16 pages

B17 Discrete Report

fraud detection using random forest

Uploaded by

arunkumar17012006

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

23 views16 pages

B17 Discrete Report

fraud detection using random forest

Uploaded by

arunkumar17012006

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 16

1

CONTENTS
Abstract ................................................................................. 3
1. INTRODUCTION…............................................................... 4
1.1. FRAUD DETECTION… ................................................. 4
1.2. MODEL EXPLAINABILITY… ......................................... 4
1.3.ML FOR FRAUD DETECTION ....................................... 4
2. DATASET ............................................................................. 6
3. METHADOLOGY… ................................................................6
3.1 Data Loading And Exploration ..................................... 6
3.2. Data Preparation..........................................................6
3.3. Model Creation and Training ....................................... 7
3.4. Model Evaluation ........................................................ 7
3.5. Predictions on New Data ............................................ 7
4. RESULTS ............................................................................. 7
4.1.Results .......................................................................... 7
4.2. Accuracy ...................................................................... 7
4.3. Classification Report ....................................................8
4.4.Confusion Matrix ......................................................... 8
5. ANALYSIS .............................................................................9
5.1.High Accuracy............................................................... 9
5.2.Class Imbalance Consideration .................................... 9
5.3 Precision and Recall Trade-off .................................... 9
5.4.Generalization to New Data ....................................... 9
6. EXPLANATION OF CODE .................................................... 10
6.1 DATA LOADING AND EXPLORATION ........................... 10
6.2 DATA SPLITTING AND MODEL TRAINING ................... 10
6.3.MODEL EVALUATION ................................................... 11
6.4 NEW DATA PREDICTION...............................................13
7. ANALYSIS TABLE ................................................................ 14
8. CONCLUSION ......................................................................15
9. REFERENCES ………………………………………………………………… 16

2
ABSTRACT

Complex financial transactions require new strategies to combat

fraudulent activities. This report critically examines the use of random
forest algorithms in fraud detection, highlighting their flexibility to
meet the complex challenges posed by evolving fraud techniques
Traditional rule-based systems often fail in this dynamic environment,
requiring machine learning adaptations.

The analysis is based on an analysis of the ‘creditcard.csv’ data set,

including a critical analysis of its systematic complexity . Identifying the
‘group’ pillars representing eligible services and fraud is an important
step. Then, the dataset is optimally partitioned into training and testing
sets to ensure robust analysis of the random forest sample

3
1. INTRODUCTION

1.1. FRAUD DETECTION

Fraud detection is the process of identifying and preventing unauthorized

activity in organizations. It has become a major challenge for companies in
industries such as banking, finance, retail, and e-commerce. Fraud can
negatively impact an organization's financial performance and reputation,
making it crucial for companies to prevent and predict suspicious activity.

1.2 MODEL EXPLAINABILITY

Predicting if a transaction is fraudulent or not is inadequate for meeting the

transparency standards of the banking sector. It is also essential to
comprehend why certain deals are marked as deceitful. This explicability is
critical for comprehending how fraud occurs, how to execute procedures to
reduce fraud, and to make sure the process isn’t prejudiced. As a result, fraud
detection models must be comprehensible and interpretable, which restricts
the choices of models that investigators can apply

1.3.ML FOR FRAUD DETECTION

Machine Learning (ML) algorithms in the fraud detection scenario are an

important defense mechanism against increasing and sophisticated fraudulent
activities These algorithms, like a random forest , decision trees, the
mechanical support of vector and neural networks, excel in complexity and
abnormal feature detection in large embedded datasets.

The main advantage of ML-based fraud detection is its flexibility. ML models

are constantly learning and improving with new data, allowing them to adapt
to emerging fraud techniques. This shift is especially important in the financial
sector, where fraudsters are constantly finding new ways to exploit
vulnerabilities.
4
ML models greatly contribute to reducing false positives, which is a common
challenge in rule-based systems. By analyzing historical transaction data, these
models learn to distinguish between legitimate and fraudulent transactions,
improving accuracy and efficiency in flagging potential threats

Furthermore, the real-time capabilities of ML are paramount. Given the rapid

pace of financial transactions, ML models enable fast and automatic decision-
making, providing an appropriate response to potential fraud.

The integration of ML into fraud detection extends beyond traditional financial

institutions, and explores applications in e-commerce, healthcare, and a variety
of industries that require close analysis of transaction data. As technology
advances, ML algorithms play an increasingly important role in protecting
sensitive information and preserving the integrity of digital communications

2. DATASET

The IBM credit card transaction dataset is a publicly available dataset that
contains information about credit card transactions. It is often used for
research and testing of fraud detection models. The dataset includes a variety
of features such as the amount of the transaction, the type of card used, and
the location of the transaction. It also includes a label indicating whether the
transaction was fraudulent. The dataset is designed to be representative of
real-world transactions and therefore contains a certain level of class
imbalance, with a higher number of non-fraudulent transactions than
fraudulent transactions. The dataset is provided by IBM and the data is
simulated, but it is not specified the exact process of data simulation. It is
important to note that the data is not real and it is not linked to any real
customer or financial institution.

5
The data set contains:

24 million unique transactions 6,000 unique merchants

100,000 unique cards
30,000 fraudulent samples (0.1% of total transactions) Where to find the data

In our case we will use Kaggle to create our model so we are using the dataset
on Kaggle.

3. METHADOLOGY

3.1. Data Loading And Exploration:

Load the credit card transaction dataset.

Explore the structure of the dataset, examining the initial 20,000 rows.

3.2. Data Preparation:

Split the dataset into features (X) and the target variable (y).
Divide the data into training and testing sets

6
3.3. Model Creation and Training:

Instantiate a Random Forest Classifier with 100 decision trees.

Train the model on the training set.

3.4. Model Evaluation:

Make predictions on the test set.

Evaluate model performance using accuracy, classification report, and
confusion matrix

3.5. Predictions on New Data:

Load new data from 'm1.csv.'

Make predictions on the new data using the trained model.

4. RESULTS

4.1. Results

The evaluation of the Random Forest model for fraud detection yielded
insightful outcomes. The following key metrics were employed to assess the
model's performance

4.2. Accuracy

The model achieved an accuracy score of [insert accuracy score], reflecting the
proportion of correctly classified instances in the test set.

7
4.3. Classification Report

The detailed classification report provides precision, recall, F1-score, and

support metrics for both the 'fraudulent' (Class 1) and 'non-fraudulent' (Class 0)
categories. This breakdown offers a nuanced understanding of the model's
performance across different classes.

4.4. Confusion Matrix

The confusion matrix provides a visual representation of the model's

performance, showcasing true positives, true negatives, false positives, and
false negatives.

Here the accuracy is

8
5.ANALYSIS

5.1. High Accuracy

The version famous a high accuracy rating, indicating its skillability in

effectively classifying transactions.

5.2. Class Imbalance Consideration

Given the imbalanced nature of the dataset (with appreciably more non-
fraudulent transactions), extra recognition is needed on metrics like precision,
remember, and the F1-score to evaluate the model's effectiveness in figuring
out fraudulent cases.

5.3 Precision and Recall Trade-off

While the model demonstrates fantastic precision (low false effective fee), the
remember for the fraudulent magnificence is extraordinarily lower. Striking a
balance among precision and do not forget is important in fraud detection, as
lacking real fraud instances (fake negatives) is a significant problem.

5.4.Generalization to New Data

The model's robustness may be similarly assessed via comparing its overall
performance on new information ('m1.Csv').

9
6. EXPLANATION OF CODE

6.1 DATA LOADING AND EXPLORATION

Mainly this part code of the code gives an detailed idea on how the data is is
collected an being processed to the next. Here in this part the data will be
collected from the file that we had been submitted

6.2 DATA SPLITTING AND MODEL TRAINING

X contains the features of the dataset, excluding the 'Class' column. Each row in
X represents a set of features for a specific data point.
y contains the target variable, which is the 'Class' column in this case. This
10
column typically contains labels indicating whether a transaction is fraudulent
(1) or not fraudulent (0).
‘train_test_split’ is a function from scikit-learn that splits the dataset into
training and testing sets.
X_train and y_train represent the features and labels of the training set,
respectively.

‘X_test’ and ‘y_test’ represent the features and labels of the testing set,
respectively. ‘test_size=0.2’ indicates that 20% of the data will be used for
testing, and the remaining 80% will be used for training.
random_state=42 ensures reproducibility by fixing the random seed for
the split. ‘RandomForestClassifier’ is a machine learning algorithm used
for classification tasks, and it belongs to the ensemble learning family.
‘n_estimators=100’ specifies the number of trees in the forest (you can
adjust this number based on your needs).
‘random_state’=42 ensures reproducibility by fixing the random seed for the
algorithm.
The fit method is used to train the Random Forest Classifier on the training data
(X_train and y_train).
After this step, the model has learned the patterns in the training data and is
ready to make predictions on new, unseen data.

6.3.MODEL EVALUATION

‘model.predict(X_test)’ uses the trained Random Forest model (model) to

predict the labels for the test set (X_test).

‘accuracy_score(y_test, y_pred)’ calculates the accuracy of the model's

predictions on the test set. Accuracy is the ratio of correctly predicted

11
instances to the total instances.

‘classification_report(y_test, y_pred)’ generates a detailed classification report,

including precision, recall, and F1-score for each class.

‘confusion_matrix(y_test, y_pred)’ computes the confusion matrix, which is a

table showing the number of true positives, true negatives, false positives, and
false negatives.

These evaluation metrics provide a comprehensive view of the model's

performance. Here's a brief overview of the metrics mentioned:

Accuracy:
The percentage of correctly classified instances.

Precision:
The ratio of true positive predictions to the total predicted positives. It
measures the accuracy of positive predictions.

Recall (Sensitivity or True Positive Rate):

The ratio of true positive predictions to the total actual positives. It measures
the ability of the model to capture all relevant instances.

F1-score:
The harmonic mean of precision and recall. It provides a balance between
precision and recall.

Confusion Matrix:
A table showing the true positive, true negative, false positive, and false
negative counts. It gives insights into the types of errors the model makes.

12
6.4 NEW DATA PREDICTION

Reads a new dataset (presumably containing features for which predictions

need to be made) from a CSV file named 'm1.csv' and stores it in the variable
new_data.

Checks if the 'Class' column is present in the new data.

If 'Class' is present, it assumes it is the target variable and drops it to create the
feature set (new_data_features).

If 'Class' is not present, it assumes the entire new data as the feature set.
Uses the pre-trained Random Forest model (model) to make predictions on the
new data features (new_data_features).

Prints the predictions made by the model on the new data.

13
7.Analysis Table

Testing Accuracy Confusion Matrix

0.5 0.9894

0.6 0.9912

0.7 0.9849

0.8 0.9868

0.9 0.9883

14
7. CONCLUSION

In the realm of fraud detection, the utility of the Random Forest algorithm has
verified top notch efficacy in discerning fraudulent sports inside credit score
card transactions. The version exhibited a commendable accuracy fee,
underscoring its ability to efficiently classify times and distinguish among valid
and fraudulent transactions.

However, a nuanced examination of extra metrics which includes precision,

recall, and the F1-score revealed a alternate-off between precision and take
into account, especially in the identity of fraudulent cases. The version's
robustness and generalization to new information, as evidenced with the aid of
its performance on 'm1.Csv,' in addition emphasize the need for non-stop
refinement and optimization.

While the high precision price suggests a low fake advantageous charge,
making sure that valid transactions aren't incorrectly flagged as fraudulent,
there may be room for development in don't forget to limit fake negatives.
Achieving a balanced precision-remember change-off is vital in fraud detection,
in which each minimizing false positives and capturing as many real fraud cases
as possible are critical targets.

In conclusion, the Random Forest version offers a robust basis for fraud
detection, and its overall performance can be further more advantageous thru
nice-tuning and model optimization. The dynamic nature of fraud strategies
necessitates a continuous dedication to investigate and development, making
sure that the model stays adaptive and resilient to emerging threats in the
ever-evolving panorama of financial fraud. As technology evolves, the
integration of advanced methodologies will play a pivotal function in fortifying
security features and retaining the integrity of financial transactions.

15
8. REFERENCES:

https://github.com/Sivaramasaran2773/Credit-Card-Fraud-Detection-using-Machine-
Learning-Models ------- Learned on how to use ML in this project

https://github.com/ ---------- For data

https://www.kaggle.com/ ---------- For data

https://www.kaggle.com/code/kabure/credit-card-fraud-prediction-rf-smote ---For
detailed understanding of randomforest analysis of fraud detection

Credit Card Fraud Detection Using AI
No ratings yet
Credit Card Fraud Detection Using AI
18 pages
Fraud Detection in Financial Transactions - PPT.PPTX - 20240805 - 175608 - 0000
No ratings yet
Fraud Detection in Financial Transactions - PPT.PPTX - 20240805 - 175608 - 0000
22 pages
Biochem Solved B.pharm 2nd Semester
No ratings yet
Biochem Solved B.pharm 2nd Semester
87 pages
Credit Card PPT'
No ratings yet
Credit Card PPT'
12 pages
Credit Card Fraud Detection
No ratings yet
Credit Card Fraud Detection
8 pages
Presentation Slides
No ratings yet
Presentation Slides
18 pages
Credit Card Fraud Detection Report
100% (1)
Credit Card Fraud Detection Report
17 pages
Leveraging Lookups and Subsearches
100% (2)
Leveraging Lookups and Subsearches
72 pages
ONLINE PAYMENT FRAUD DETECTION USING MACHINE LEARNING MODEL - Key
No ratings yet
ONLINE PAYMENT FRAUD DETECTION USING MACHINE LEARNING MODEL - Key
12 pages
New Report
No ratings yet
New Report
61 pages
Credit Card Fraud Detection
No ratings yet
Credit Card Fraud Detection
34 pages
Fraud Detection Using Machine LearningV2
No ratings yet
Fraud Detection Using Machine LearningV2
33 pages
Session 5
No ratings yet
Session 5
21 pages
Chapter 4 Conic Section and Its Application
100% (1)
Chapter 4 Conic Section and Its Application
13 pages
BW GrR-3 Sepedi Dictionary - V1
No ratings yet
BW GrR-3 Sepedi Dictionary - V1
64 pages
Creditcard Fraud Detection
No ratings yet
Creditcard Fraud Detection
26 pages
Phase 3
No ratings yet
Phase 3
19 pages
Report
No ratings yet
Report
14 pages
Draughtsman Mechanical 1st Year (Volume II of II) TT
No ratings yet
Draughtsman Mechanical 1st Year (Volume II of II) TT
211 pages
Andy Clark - Associative Engines
100% (3)
Andy Clark - Associative Engines
248 pages
1 4 Scalars and Vectors MCQ
No ratings yet
1 4 Scalars and Vectors MCQ
30 pages
Credit Card Fraud Detection
No ratings yet
Credit Card Fraud Detection
25 pages
ANN, KNN & Decision Tree
No ratings yet
ANN, KNN & Decision Tree
13 pages
Fraud Detection Project Report
No ratings yet
Fraud Detection Project Report
4 pages
Script KHDL
No ratings yet
Script KHDL
4 pages
Credit Card Fraud Detection Using Machine Learning
No ratings yet
Credit Card Fraud Detection Using Machine Learning
8 pages
Fraud Detection
No ratings yet
Fraud Detection
19 pages
AI and DS Final Document For Phase 5
No ratings yet
AI and DS Final Document For Phase 5
9 pages
Fraud Detection Using Machine Learning V 2
No ratings yet
Fraud Detection Using Machine Learning V 2
33 pages
Ibm Project
No ratings yet
Ibm Project
18 pages
Final Year Project
No ratings yet
Final Year Project
27 pages
Module 3.4 Classification Models, Case Study
No ratings yet
Module 3.4 Classification Models, Case Study
12 pages
SSRN 5240326
No ratings yet
SSRN 5240326
8 pages
University of Cambridge International Examinations General Certificate of Education Ordinary Level
No ratings yet
University of Cambridge International Examinations General Certificate of Education Ordinary Level
24 pages
Presentation Credit Card
No ratings yet
Presentation Credit Card
25 pages
3rd PRESENTATION
No ratings yet
3rd PRESENTATION
25 pages
DBNex Deep Belief Network and Explainable AI Based Financial Fraud Detection
No ratings yet
DBNex Deep Belief Network and Explainable AI Based Financial Fraud Detection
10 pages
Fraud Detection Synopsis
No ratings yet
Fraud Detection Synopsis
5 pages
TE Seminar Formatfinal
No ratings yet
TE Seminar Formatfinal
16 pages
Link For Google Colab Note Book: Pa Ge
No ratings yet
Link For Google Colab Note Book: Pa Ge
17 pages
307 A029 Seminar
No ratings yet
307 A029 Seminar
16 pages
Financial Fraud Detection
No ratings yet
Financial Fraud Detection
11 pages
Secureswipe Pioneering Strategies For Next-Gen Credit Card Fraud Prevention 1
No ratings yet
Secureswipe Pioneering Strategies For Next-Gen Credit Card Fraud Prevention 1
9 pages
Case Study Front Page
No ratings yet
Case Study Front Page
11 pages
HR Template
No ratings yet
HR Template
6 pages
Pdsreport
No ratings yet
Pdsreport
6 pages
Capstone Project - Credit Card Fraud Prediction - Alexandre Daltro
No ratings yet
Capstone Project - Credit Card Fraud Prediction - Alexandre Daltro
15 pages
Report
No ratings yet
Report
14 pages
DBNex Deep Belief Network and Explainable AI Based Financial Fraud Detection
No ratings yet
DBNex Deep Belief Network and Explainable AI Based Financial Fraud Detection
10 pages
Phase 5
No ratings yet
Phase 5
10 pages
Cbjeit 24 P
No ratings yet
Cbjeit 24 P
30 pages
Developmentof Palm Oil Extraction System
No ratings yet
Developmentof Palm Oil Extraction System
9 pages
PID 89: Analysis and Performance Evaluation of Credit Card Fraud Detection by Multi-Model ML
No ratings yet
PID 89: Analysis and Performance Evaluation of Credit Card Fraud Detection by Multi-Model ML
19 pages
Credit Card Fraud Detection Using Machine Learning Techniques
No ratings yet
Credit Card Fraud Detection Using Machine Learning Techniques
4 pages
Presentation Slides
No ratings yet
Presentation Slides
16 pages
Internship Project
No ratings yet
Internship Project
8 pages
Bank Fraud Prediction
No ratings yet
Bank Fraud Prediction
16 pages
Tensor Software
100% (1)
Tensor Software
6 pages
Nityananda Vyawhare 2223216 Case Study 5
No ratings yet
Nityananda Vyawhare 2223216 Case Study 5
5 pages
Online Transactions Fraud Detection Using Machine Learning
No ratings yet
Online Transactions Fraud Detection Using Machine Learning
4 pages
Advancements in Fraud Detection Systems Using Machine Learning
No ratings yet
Advancements in Fraud Detection Systems Using Machine Learning
3 pages
Topic 2
No ratings yet
Topic 2
5 pages
Credit Card Fraud Detection Using Machine Learning
No ratings yet
Credit Card Fraud Detection Using Machine Learning
6 pages
Industrial Oriented Mini Project - Summer Internship On
No ratings yet
Industrial Oriented Mini Project - Summer Internship On
14 pages
FT-891 Quick Manual: (PWR/LOCK) Key RF/SQL Knob
No ratings yet
FT-891 Quick Manual: (PWR/LOCK) Key RF/SQL Knob
2 pages
Scholes 2021
No ratings yet
Scholes 2021
11 pages
IJIRSET Paper Sample
No ratings yet
IJIRSET Paper Sample
4 pages
Final Eddited Research Paper1
No ratings yet
Final Eddited Research Paper1
6 pages
Summative Test No. 1 Objectives Code Percenta Ge No. of Items Item Placement
50% (2)
Summative Test No. 1 Objectives Code Percenta Ge No. of Items Item Placement
4 pages
Credit Card Fraud Detection and Analysis
No ratings yet
Credit Card Fraud Detection and Analysis
4 pages
An Empirical Assessment of Empirical Corporate Finance
No ratings yet
An Empirical Assessment of Empirical Corporate Finance
40 pages
CINPD Unit 5
No ratings yet
CINPD Unit 5
16 pages
IEEE Conference Template
No ratings yet
IEEE Conference Template
3 pages
Manual Neuraltools6 en
No ratings yet
Manual Neuraltools6 en
104 pages
Compactly Powerful: Ugeo Pt60A
No ratings yet
Compactly Powerful: Ugeo Pt60A
6 pages
008 ISO 10421998 Laboratory Glassware One Mark Volum
100% (1)
008 ISO 10421998 Laboratory Glassware One Mark Volum
2 pages
(09.10.30) Dsme H-4453-4 Magnetic Compass (Final)
No ratings yet
(09.10.30) Dsme H-4453-4 Magnetic Compass (Final)
18 pages
Natural Gas Pressure Test
No ratings yet
Natural Gas Pressure Test
4 pages
Circuit Note: Dual-Channel Colorimeter With Programmable Gain Transimpedance Amplifiers and Digital Synchronous Detection
No ratings yet
Circuit Note: Dual-Channel Colorimeter With Programmable Gain Transimpedance Amplifiers and Digital Synchronous Detection
8 pages
Various Methods of Ligation Ties: Review Article
No ratings yet
Various Methods of Ligation Ties: Review Article
6 pages
Numerical and Experimental Modelling of The Steam Assisted Gravity Drainage (SAGD) Process
No ratings yet
Numerical and Experimental Modelling of The Steam Assisted Gravity Drainage (SAGD) Process
7 pages
Third Year Civil Engg. 3rd Year Scheme Syllabus 2018-19 PDF
No ratings yet
Third Year Civil Engg. 3rd Year Scheme Syllabus 2018-19 PDF
24 pages
GE 10 Lab Ex 4
No ratings yet
GE 10 Lab Ex 4
8 pages
A9F74220
No ratings yet
A9F74220
3 pages
Basic Principles of Air Springs: General Discussion
No ratings yet
Basic Principles of Air Springs: General Discussion
4 pages
Corn Starch
No ratings yet
Corn Starch
8 pages
JST PH Connectors - Datasheet
No ratings yet
JST PH Connectors - Datasheet
2 pages
AI in Quantitative Analysis
From Everand
AI in Quantitative Analysis
Anand Vemula
No ratings yet
Risk Modeling: Practical Applications of Artificial Intelligence, Machine Learning, and Deep Learning
From Everand
Risk Modeling: Practical Applications of Artificial Intelligence, Machine Learning, and Deep Learning
Terisa Roberts
No ratings yet
Consumption-Based Forecasting and Planning: Predicting Changing Demand Patterns in the New Digital Economy
From Everand
Consumption-Based Forecasting and Planning: Predicting Changing Demand Patterns in the New Digital Economy
Charles W. Chase
No ratings yet

B17 Discrete Report

Uploaded by

B17 Discrete Report

Uploaded by

1

Complex financial transactions require new strategies to combat

The analysis is based on an analysis of the ‘creditcard.csv’ data set,

1.1. FRAUD DETECTION

Fraud detection is the process of identifying and preventing unauthorized

1.2 MODEL EXPLAINABILITY

Predicting if a transaction is fraudulent or not is inadequate for meeting the

1.3.ML FOR FRAUD DETECTION

Machine Learning (ML) algorithms in the fraud detection scenario are an

The main advantage of ML-based fraud detection is its flexibility. ML models

Furthermore, the real-time capabilities of ML are paramount. Given the rapid

The integration of ML into fraud detection extends beyond traditional financial

24 million unique transactions 6,000 unique merchants

3.1. Data Loading And Exploration:

Load the credit card transaction dataset.

3.2. Data Preparation:

Instantiate a Random Forest Classifier with 100 decision trees.

3.4. Model Evaluation:

Make predictions on the test set.

3.5. Predictions on New Data:

Load new data from 'm1.csv.'

The detailed classification report provides precision, recall, F1-score, and

4.4. Confusion Matrix

The confusion matrix provides a visual representation of the model's

Here the accuracy is

5.1. High Accuracy

The version famous a high accuracy rating, indicating its skillability in

5.2. Class Imbalance Consideration

5.3 Precision and Recall Trade-off

5.4.Generalization to New Data

6.1 DATA LOADING AND EXPLORATION

6.2 DATA SPLITTING AND MODEL TRAINING

‘model.predict(X_test)’ uses the trained Random Forest model (model) to

‘accuracy_score(y_test, y_pred)’ calculates the accuracy of the model's

‘classification_report(y_test, y_pred)’ generates a detailed classification report,

‘confusion_matrix(y_test, y_pred)’ computes the confusion matrix, which is a

These evaluation metrics provide a comprehensive view of the model's

Recall (Sensitivity or True Positive Rate):

Reads a new dataset (presumably containing features for which predictions

Checks if the 'Class' column is present in the new data.

Prints the predictions made by the model on the new data.

Testing Accuracy Confusion Matrix

However, a nuanced examination of extra metrics which includes precision,

https://github.com/ ---------- For data

https://www.kaggle.com/ ---------- For data

You might also like