
Experiment-2.2

Aim: To understand supervised learning and use it to train and develop classifier models. (CO2, CO4)

Tools/Platforms Used:

Google Colaboratory

Theory:

Supervised Learning
Supervised learning is a type of machine learning where the model learns from labeled data. In this approach, the dataset
provided to the model contains input features (independent variables) and corresponding target labels (dependent variable). The
model learns the relationship between the inputs and the outputs to make predictions on new, unseen data.
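
To make this concrete, here is a minimal sketch of the idea; it uses scikit-learn with made-up feature values and labels purely for illustration (the experiment itself uses PyCaret):

# Minimal supervised-learning sketch (illustrative only; scikit-learn assumed installed)
from sklearn.linear_model import LogisticRegression

X = [[2, 55], [9, 80], [7, 71], [1, 40]]   # features, e.g. hours studied and previous score (made up)
y = [0, 1, 1, 0]                           # labels, e.g. 0 = fail, 1 = pass (made up)

model = LogisticRegression()
model.fit(X, y)                       # learn the relationship between inputs and outputs
print(model.predict([[5, 65]]))       # predict the label for a new, unseen example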

Key Concepts in Supervised Learning:

1. Features and Labels:
Features (X): Independent variables that act as input to the model.
Labels (Y): Dependent variables, i.e. the output the model needs to predict.

2. Training and Testing:
Training Data: The subset of the dataset used to train the model.
Testing Data: The subset used to evaluate the model's performance.
The dataset is typically split into 70–80% training data and 20–30% testing data (a minimal splitting sketch follows this list).

3. Objective:
The goal is to minimize the error between the predicted and actual outputs and to generalize well to unseen data.
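
For point 2 above, here is a minimal splitting sketch; it assumes scikit-learn's train_test_split and its bundled iris dataset, and uses an 80/20 split purely as an example:

# Minimal sketch of an 80/20 train/test split (scikit-learn assumed installed)
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)           # X: features, y: labels
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)   # 80% training data, 20% testing data
print(X_train.shape, X_test.shape)          # (120, 4) (30, 4)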

Classification in Supervised Learning:
Classification is a supervised learning task where the output variable is categorical. Examples include:

Binary Classification: Predicting one of two categories (e.g., spam or not spam).
Multi-class Classification: Predicting one of multiple categories (e.g., types of fruits).
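
A short illustrative sketch of the difference (the spam and fruit labels below are made up; NumPy assumed available): the task type is determined by how many distinct labels the target contains.

# Binary vs. multi-class targets (illustrative, made-up labels)
import numpy as np

binary_target = np.array(["spam", "not spam", "spam", "not spam"])
multiclass_target = np.array(["apple", "banana", "cherry", "apple"])

print(np.unique(binary_target))       # 2 distinct labels  -> binary classification
print(np.unique(multiclass_target))   # 3 distinct labels  -> multi-class classification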

Steps to Train and Develop Classifier Models:

1. Data Preprocessing:
Clean the data (handle missing values and outliers).
Encode categorical variables.
Normalize or standardize numerical features.

2. Feature Selection and Engineering:
Select relevant features to improve model performance.
Create new features from existing ones if necessary.

3. Model Selection:
Choose an appropriate classification algorithm, such as:
Logistic Regression
Decision Trees
Random Forest
Support Vector Machines (SVM)
Neural Networks

4. Training:
Fit the selected algorithm to the training dataset using fit().

5. Evaluation:
Use metrics like accuracy, precision, recall, F1-score, and ROC-AUC to assess the model's performance.

6. Hyperparameter Tuning:
Optimize the model's performance by adjusting hyperparameters using techniques like Grid Search or Random Search (an end-to-end sketch covering these steps follows this list).
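
The following sketch ties the six steps together using scikit-learn (used here purely for illustration; the experiment itself uses PyCaret, which automates most of these steps inside setup() and create_model()):

# End-to-end classifier sketch covering the steps above (scikit-learn assumed installed)
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split, GridSearchCV
from sklearn.preprocessing import StandardScaler
from sklearn.pipeline import Pipeline
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import classification_report

# Steps 1-2: load data; this toy dataset needs no missing-value handling or encoding
X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Steps 3-4: model selection and training; standardize features, then fit a Random Forest
pipe = Pipeline([("scale", StandardScaler()),
                 ("clf", RandomForestClassifier(random_state=42))])

# Step 6: hyperparameter tuning with Grid Search (wrapped around training and cross-validation)
param_grid = {"clf__n_estimators": [100, 200], "clf__max_depth": [None, 5]}
grid = GridSearchCV(pipe, param_grid, cv=5)
grid.fit(X_train, y_train)

# Step 5: evaluation on the held-out test data
print(grid.best_params_)
print(classification_report(y_test, grid.predict(X_test)))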

Common Use Cases of Classification Models:

Email spam detection.
Fraud detection in financial systems.
Disease diagnosis in healthcare.
Sentiment analysis of customer reviews.

The coding example will help you understand how to implement these concepts practically.
# Import required libraries
from pycaret.datasets import get_data      # To load example datasets
from pycaret.classification import *       # To perform classification tasks using PyCaret

# Load the list of available datasets
dataSets = get_data('index')
# Fetches the index of all available datasets in PyCaret;
# use this to explore and select an appropriate dataset for analysis

# Load the diabetes dataset
diabetesDataSet = get_data("diabetes")
# Loads the "diabetes" dataset, which is a binary classification problem;
# the target column, "Class variable", has two classes (binary values)
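
As an optional sanity check before training (not part of the original walkthrough), the class balance of the target column can be inspected; this assumes the diabetesDataSet DataFrame loaded above:

# Optional: inspect the dataset size and the distribution of the target classes
print(diabetesDataSet.shape)                              # number of rows and columns
print(diabetesDataSet["Class variable"].value_counts())   # counts of the two target classes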

# Set up the classification environment
s = setup(data=diabetesDataSet, target='Class variable')

# Initializes the PyCaret classification environment

# Specifies the dataset and the target column to be used for training

# Create a Random Forest model
rfModel = create_model('rf')
# Trains a Random Forest classifier model using the default hyperparameters
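
A hedged aside, not part of the original walkthrough: rather than fixing the algorithm in advance, PyCaret's compare_models can cross-validate several classifiers and return the best-performing one; a minimal sketch, assuming the setup() cell above has already been run:

# Optional: cross-validate the available classifiers and keep the best-performing one
# (assumes setup() has already been executed in this session)
bestModel = compare_models()
print(bestModel)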

# Plot the confusion matrix
plot_model(rfModel, plot='confusion_matrix')
# Visualizes the confusion matrix for the Random Forest model to evaluate its performance

# Plot the default visualization
plot_model(rfModel)
# With no plot argument, plot_model shows the ROC (AUC) curve by default;
# other plots, such as the Precision-Recall curve, can be requested via the plot parameter (e.g. plot='pr')
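
As a side note (not part of the original walkthrough), PyCaret also provides evaluate_model, which in a notebook renders an interactive widget for browsing the available plots; a minimal sketch, assuming the cells above have been run:

# Optional: browse all available evaluation plots interactively (notebook only)
# Assumes setup() and create_model() have already been executed in this session
evaluate_model(rfModel)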

# Save the trained Random Forest model to a file
sm = save_model(rfModel, 'rfModelFile')
# Saves the trained model to a file named 'rfModelFile.pkl' for future use

# Plot feature importance
plot_model(rfModel, plot='feature')
# Visualizes the importance of features in making predictions with the Random Forest model

# Prepare a new dataset for predictions

newDataSet = get_data("diabetes").iloc[:10]
# Loads a fresh copy of the diabetes dataset and selects the first 10 rows for testing

# Make predictions on the new dataset
newPredictions = predict_model(rfModel, data=newDataSet)
# Uses the trained Random Forest model to predict the class labels for the new data

# Display the predictions
newPredictions
# Outputs the predictions, including the class labels and probabilities for the new dataset
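
To complete the workflow, the model saved earlier can be reloaded in a later session with load_model; a minimal sketch (the file name matches the save_model call above, and PyCaret adds the .pkl extension automatically):

# Reload the saved model in a new session and reuse it for predictions
from pycaret.classification import load_model, predict_model
from pycaret.datasets import get_data

loadedModel = load_model('rfModelFile')    # reads rfModelFile.pkl from the working directory
reloadedPredictions = predict_model(loadedModel, data=get_data("diabetes").iloc[:10])
print(reloadedPredictions.head())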

Additional Resources

1. OpenClassrooms Tutorial: https://openclassrooms.com/en/courses/6389626-train-a-supervised-machine-learning-model/6405911-build-and-evaluate-a-classification-model
2. DataCamp Tutorial: https://www.datacamp.com/blog/classification-machine-learning
3. GeeksforGeeks: https://www.geeksforgeeks.org/basic-concept-classification-data-mining/

Video Links

1. Machine Learning in Python: Building a Classification Model

2. Random Forest Algorithm Explained with Python

3. Machine Learning Algorithms

4. PyCaret Tutorial: Splitting Data into Training and Testing Sets

TEXT BOOKS/REFERENCE BOOKS

TEXT BOOKS

T1: Data Science from Scratch, Joel Grus, Shroff Publishers/O'Reilly Media, 2019.
https://drive.google.com/file/d/1qv89LVaEshX9hcmSS9KDMsvBP-UYC78h/view?usp=sharing

T2: Artificial Intelligence: A Modern Approach, 3rd Edition, Stuart Russell and Peter Norvig, Pearson, 2010.
https://drive.google.com/file/d/1G-s5fsBh5rLMdWmIYvyeI2zclcDCAA_D/view?usp=sharing

T3: Machine Learning, Tom Mitchell, McGraw Hill, 2017.
https://drive.google.com/file/d/1IBgLq2GvyEXURAPfSDm-Eep94X0vYXDb/view?usp=sharing

REFERENCE BOOKS

RB1: Data Analysis with Open Source Tools, Philipp Janert, Shroff Publishers/O'Reilly Media.
https://drive.google.com/file/d/1SVtjE5XEih7_aU433_cAJKiDF41-KuzU/view?usp=sharing

RB2: Introduction to Machine Learning with Python, Andreas C. Müller & Sarah Guido, O'Reilly Media.
https://www.nrigroupindia.com/e-book/Introduction%20to%20Machine%20Learning%20with%20Python%20(%20PDFDrive.com%20)-min.pdf

RB3: Artificial Intelligence & Machine Learning, Lecture Notes, Ms. Anitha Patibandla, Dr. B. Jyothi, Ms. K. Bhavana.
https://mrcet.com/downloads/digital_notes/ECE/III%20Year/AI%20&%20ML%20DIGITAL%20NOTES.pdf
