Open navigation menu

Scribd

0% found this document useful (0 votes)

10 views

Scikit Learn

Tutorial

Uploaded by

Basker PalaniSwamy

Copyright

© © All Rights Reserved

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views

Scikit Learn

Tutorial

Uploaded by

Basker PalaniSwamy

Copyright

© © All Rights Reserved

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 28

Features Vs

Target
TYPES OF
MACHINE
LEARNING
1 . SUPERVISED
LEARNING
1 . SUPERVISED
LEARNING
2 . UNSUPERVISED
LEARNING
3 . REINFORCEMENT
LEARNING
Preprocessing
tools

Feauture
Scikit-
learn selection train,

test, split
Model
Algorithms
evaluation
S C I K I T-
LEARN
Scikit-learn, also known as sklearn, is a popular open-source machine learning library in
Python

that provides a wide range of tools for data analysis, modeling, and evaluation.

Sklearn is built on top of NumPy, SciPy, and Matplotlib, and supports integration with
Pandas,

which makes it easy to use in data science workflows.

Sklearn is widely used in the data science community for various applications such as
predictive

modeling, natural language processing, computer vision, and time series forecasting,

among others.
I NS TA L L AT I O
N
IMPOR
T
Feature
Scaling

PREPROCESSING Encoding
Imputing null values

Outlier - detection & Handling

F E AT U R E
SCALING
F E AT U R E
SCALING
Feature scaling is a method used to normalize the range of
features

of data.

Fea ture S c a ling involves m odifying va lues by m ethods like

Normalization or Standardization.

It helps to avoid bias in machine learning

model.
WHY
SCALING?
When dataset has
numerical

fea tures a nd ea c h of them a re in

different scale.

ML m odel ca n put weight

on features with larger scale.

S c a ling helps to c ontribute

a ll features equally.
NORMALIZATI
ON

It is the method of scaling

the

d a t a by fitting the d a t a

points between a range of 0

to 1.
MIN-MAX
SCALER

MinMaxScaler from sklearn perform

normalization
S TA N DA R D I Z AT I
ON

This converts all the d a t a

points

to h a v e a m e a n value of 0

and standard deviation of 1

S TA N D A R D
SCALER

StandardScaler from sklearn perform

standardization
ROBUST
SCALER

This uses interquartile range

so

that it is robust to outliers

ROBUST
SCALER
ROBUST
SCALER
WHICH I S
BETTER?
Normalization:

Useful when the d a t a doesn't follow ga us sia n(norma l)

distrubution Useful in algorithms like KNN, and Neural networks

like CNN, ANN

Standardization:

W hen your d a t a follows gaussian distribution

R ob u st Scaler:

W hen your d a t a has outliers

E N C OD IN
G
ENCODIN
G
Machine learning models c a n only work with numerical
values.

For this reason, it is necessary to transform the categorical values

of

the relevant features into numerical ones.

This process is called feature

encoding.
T Y P E S OF
ENCODING
1. Nominal encoding :

Represent d a t a without an y order or

hierarchy It c a n be done with

OneHotEncoder

2. Ordinal Encoding :

Assigning unique integer based on

rank/order It c a n be done with LabelEncoder

ONEHOT
ENCODER
LABEL
ENCODER

You might also like

3.electricity and Magnetism EOT 2021 Test
No ratings yet
3.electricity and Magnetism EOT 2021 Test
9 pages
Trane VRF Catalogue
67% (3)
Trane VRF Catalogue
124 pages
ML - WEEK 04
No ratings yet
ML - WEEK 04
33 pages
Djghuh
No ratings yet
Djghuh
2 pages
Unit 3-2
No ratings yet
Unit 3-2
15 pages
3_AML _Lecture 3_Feature Engg
No ratings yet
3_AML _Lecture 3_Feature Engg
39 pages
Machine Learning (2) : Inteligência Artificial E Cibersegurança (Inacs)
No ratings yet
Machine Learning (2) : Inteligência Artificial E Cibersegurança (Inacs)
45 pages
Feature Engineering: Getting The Most Out of Data For Predictive Models
No ratings yet
Feature Engineering: Getting The Most Out of Data For Predictive Models
75 pages
Feature Engineering PDF
100% (1)
Feature Engineering PDF
75 pages
Feature Scaling Techniques: Machine Learning
No ratings yet
Feature Scaling Techniques: Machine Learning
27 pages
Standar Ization
No ratings yet
Standar Ization
7 pages
Feature Engineering
No ratings yet
Feature Engineering
23 pages
Week 10
No ratings yet
Week 10
50 pages
FeatureEngineering (1)
No ratings yet
FeatureEngineering (1)
50 pages
6 - Machine Learning 2
No ratings yet
6 - Machine Learning 2
14 pages
Python Scikit-Learn Cheat Sheet For Machine Learning
No ratings yet
Python Scikit-Learn Cheat Sheet For Machine Learning
3 pages
Towards Data Science All About Feature Scaling
No ratings yet
Towards Data Science All About Feature Scaling
16 pages
Feature Scaling in Machine Learning
No ratings yet
Feature Scaling in Machine Learning
4 pages
Feature Engineering For Machine Learning
No ratings yet
Feature Engineering For Machine Learning
41 pages
ML Unit 2
No ratings yet
ML Unit 2
90 pages
Exp2 - Data Visualization and Cleaning and Feature Selection
No ratings yet
Exp2 - Data Visualization and Cleaning and Feature Selection
13 pages
Featureengineering 171206213206
No ratings yet
Featureengineering 171206213206
45 pages
Session 7 Feature Selection & Dimensionality Reduction
No ratings yet
Session 7 Feature Selection & Dimensionality Reduction
20 pages
1737527078055
No ratings yet
1737527078055
111 pages
Machine Learning
No ratings yet
Machine Learning
17 pages
Scikit Learn
No ratings yet
Scikit Learn
17 pages
Assignment 121
No ratings yet
Assignment 121
9 pages
Feature Scaling (Standardization & Normalization)
No ratings yet
Feature Scaling (Standardization & Normalization)
35 pages
MODELS (AutoRecovered)
No ratings yet
MODELS (AutoRecovered)
9 pages
Lecture Material 3
No ratings yet
Lecture Material 3
7 pages
100 Days of Machine Learning
No ratings yet
100 Days of Machine Learning
14 pages
Summery of Feature Eng
No ratings yet
Summery of Feature Eng
4 pages
Machine Learning: by Team 2
No ratings yet
Machine Learning: by Team 2
41 pages
Unit 2 ML 2019
No ratings yet
Unit 2 ML 2019
91 pages
Preprocessing ch.2
No ratings yet
Preprocessing ch.2
19 pages
mini4
No ratings yet
mini4
9 pages
1.3.2. Feature Engineering and Variable - Transformation
No ratings yet
1.3.2. Feature Engineering and Variable - Transformation
29 pages
Data Pre-Processing with Sklearn using Standard and Minmax
No ratings yet
Data Pre-Processing with Sklearn using Standard and Minmax
21 pages
Unit-II
No ratings yet
Unit-II
119 pages
06 - Data Preprocessing
No ratings yet
06 - Data Preprocessing
68 pages
Scikit Hca
No ratings yet
Scikit Hca
8 pages
ML_DA
No ratings yet
ML_DA
55 pages
Summary Chap 1 & 2
No ratings yet
Summary Chap 1 & 2
5 pages
Feature Engineering: Short Study: Indian Institute of Space Science and Technology, Department of Mathematics
No ratings yet
Feature Engineering: Short Study: Indian Institute of Space Science and Technology, Department of Mathematics
6 pages
UNIT 2 PART 2
No ratings yet
UNIT 2 PART 2
6 pages
ML1
No ratings yet
ML1
69 pages
ML Lectures Summary 2
No ratings yet
ML Lectures Summary 2
52 pages
Lecture5
No ratings yet
Lecture5
26 pages
CH1
No ratings yet
CH1
64 pages
Data Preprocessing
No ratings yet
Data Preprocessing
65 pages
Well Posed Learning Problem
100% (1)
Well Posed Learning Problem
4 pages
Lecture-2-20022025-092902am
No ratings yet
Lecture-2-20022025-092902am
87 pages
Final ML
No ratings yet
Final ML
2 pages
Presentation
No ratings yet
Presentation
10 pages
Scikit-Learn: Scikit-Learn Is An Open Source Python Library That
100% (1)
Scikit-Learn: Scikit-Learn Is An Open Source Python Library That
1 page
Scikit Learn Cheat Sheet Python
No ratings yet
Scikit Learn Cheat Sheet Python
1 page
Data Structures and Algorithms with Python
From Everand
Data Structures and Algorithms with Python
Aadinath Pothuvaal
No ratings yet
Python Programming: General-Purpose Libraries; NumPy,Pandas,Matplotlib,Seaborn,Requests,os & sys: Python, #2
From Everand
Python Programming: General-Purpose Libraries; NumPy,Pandas,Matplotlib,Seaborn,Requests,os & sys: Python, #2
e3
No ratings yet
DATA MINING and MACHINE LEARNING: CLUSTER ANALYSIS and kNN CLASSIFIERS. Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING: CLUSTER ANALYSIS and kNN CLASSIFIERS. Examples with MATLAB
César Pérez López
No ratings yet
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
DEEP LEARNING TECHNIQUES: CLUSTER ANALYSIS and PATTERN RECOGNITION with NEURAL NETWORKS. Examples with MATLAB
From Everand
DEEP LEARNING TECHNIQUES: CLUSTER ANALYSIS and PATTERN RECOGNITION with NEURAL NETWORKS. Examples with MATLAB
César Pérez López
No ratings yet
The Numpy Pocketbook: Essentials on the Go
From Everand
The Numpy Pocketbook: Essentials on the Go
Silas Meadowlark
No ratings yet
Block Diagram of PNA
No ratings yet
Block Diagram of PNA
20 pages
PRASHANT BHAIYA COMPLETE SHORT NOTES
No ratings yet
PRASHANT BHAIYA COMPLETE SHORT NOTES
26 pages
Annex 1: Bibliography: DBA Design by Analysis
No ratings yet
Annex 1: Bibliography: DBA Design by Analysis
12 pages
Kwikstage: Bringing Structures To Life
No ratings yet
Kwikstage: Bringing Structures To Life
28 pages
Atlas Plan Technical Specifications v4
No ratings yet
Atlas Plan Technical Specifications v4
1 page
C++ Practical
100% (1)
C++ Practical
107 pages
Kimo Debimo Airflow Blade Data Sheet
No ratings yet
Kimo Debimo Airflow Blade Data Sheet
4 pages
iWRAP4 User Guide-1
No ratings yet
iWRAP4 User Guide-1
241 pages
Chemistry Gemini..
No ratings yet
Chemistry Gemini..
2 pages
Ultimate Electrical Technical Office Course Agenda
No ratings yet
Ultimate Electrical Technical Office Course Agenda
11 pages
Asignment 1
No ratings yet
Asignment 1
2 pages
Unit 3 Test Review Answers
100% (1)
Unit 3 Test Review Answers
2 pages
NFC Par I 03 15 2021 1454
No ratings yet
NFC Par I 03 15 2021 1454
6 pages
Condution Heat Transfer-1
No ratings yet
Condution Heat Transfer-1
35 pages
4 PG
No ratings yet
4 PG
3 pages
The Embodied Mind
100% (4)
The Embodied Mind
29 pages
Course3 1100
No ratings yet
Course3 1100
69 pages
Computer Studies Notes Form 1
No ratings yet
Computer Studies Notes Form 1
7 pages
Rain Gauge: Non-Recording Type
No ratings yet
Rain Gauge: Non-Recording Type
3 pages
Type C Travel +
No ratings yet
Type C Travel +
2 pages
Sparbs Luxuat Iixit: Slev Gearb0X Ttpe: Htls.385: Octobxr 1989
No ratings yet
Sparbs Luxuat Iixit: Slev Gearb0X Ttpe: Htls.385: Octobxr 1989
11 pages
What Is A Case Study and What Is It Good
No ratings yet
What Is A Case Study and What Is It Good
14 pages
References For Alge&Trig 2020
No ratings yet
References For Alge&Trig 2020
4 pages
CNC Retrofit Kit
No ratings yet
CNC Retrofit Kit
4 pages
Aerodinamika
No ratings yet
Aerodinamika
15 pages
Riko DN10003 PN10 SZ40 22may
No ratings yet
Riko DN10003 PN10 SZ40 22may
5 pages
Lesson 10 VC.08 Triple Integrals 1
No ratings yet
Lesson 10 VC.08 Triple Integrals 1
16 pages
Canon LBP-1760 Parts Manual
No ratings yet
Canon LBP-1760 Parts Manual
84 pages