0% found this document useful (0 votes)

13 views18 pages

Lecture-12 Machine Learning With Python

The document provides an overview of the Random Forest algorithm, a popular supervised machine learning technique used for classification and regression tasks. It explains the concepts of ensemble learning, bagging, and boosting, highlighting how Random Forest combines multiple decision trees to improve predictive accuracy while preventing overfitting. Additionally, it discusses the advantages, disadvantages, and applications of Random Forest in various sectors such as banking, medicine, land use, and marketing.

Uploaded by

dhruvjaisinghanioberoi484

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views18 pages

Lecture-12 Machine Learning With Python

Uploaded by

dhruvjaisinghanioberoi484

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 18

Lecture-12

Machine Learning with

Python
Random Forest Algorithm
❖Random Forest is one of the most popular and commonly used algorithms by
Data Scientists. Random forest is a Supervised Machine Learning
Algorithm that is used widely in Classification and Regression problems.
❖It is based on the concept of ensemble learning,
❖Ensemble simply means combining multiple models. Thus a collection of models
is used to make predictions rather than an individual model.
Bagging
❖Bagging, also known as Bootstrap Aggregation, is the ensemble technique used by
random forest.
❖Bagging chooses a random sample/random subset from the entire data set. Hence
each model is generated from the samples (Bootstrap Samples) provided by the
Original Data with replacement known as row sampling. This step of row sampling
with replacement is called bootstrap.
❖Now each model is trained independently, which generates results. The final output
is based on majority voting after combining the results of all models. This step which
involves combining all the results and generating output based on majority voting, is
known as aggregation.
Bagging
Boosting
❖Boosting is one of the techniques that use the concept of ensemble learning. A
boosting algorithm combines multiple simple models (also known as weak
learners or base estimators) to generate the final output. It is done by building a
model by using weak models in series.
❖There are several boosting algorithms; AdaBoost was the first really successful
boosting algorithm that was developed for the purpose of binary classification.
AdaBoost is an abbreviation for Adaptive Boosting and is a prevalent boosting
technique that combines multiple “weak classifiers” into a single “strong
classifier.”
ADA-Boost
Random forest Algorithm
❖ "Random Forest is a classifier that contains a number of decision trees on
various subsets of the given dataset and takes the average to improve the
predictive accuracy of that dataset."
❖Instead of relying on one decision tree, the random forest takes the prediction
from each tree and based on the majority votes of predictions, and it predicts
the final output.
❖The greater number of trees in the forest leads to higher accuracy and
prevents the problem of overfitting.
❖Random forest works on the Bagging principle
Assumptions for Random Forest
Since the random forest combines multiple trees to predict the class of the
dataset, it is possible that some decision trees may predict the correct output,
while others may not. But together, all the trees predict the correct output.
Therefore, below are two assumptions for a better Random forest classifier:
❖There should be some actual values in the feature variable of the dataset so that
the classifier can predict accurate results rather than a guessed result.
❖The predictions from each tree must have very low correlations.
Steps Involved in Random Forest Algorithm
Random Forest works in two-phase first is to create the random forest by
combining N decision tree, and second is to make predictions for each tree
created in the first phase.
The Working process can be explained in the below steps and diagram:
Step-1: Select random K data points from the training set.
Step-2: Build the decision trees associated with the selected data points
(Subsets)
Step-3: Choose the number N for decision trees that you want to build.
Step-4: Repeat Step 1 & 2.
Step-5: For new data points, find the predictions of each decision tree, and
assign the new data points to the category that wins the majority votes.
Example: Suppose there is a dataset
that contains multiple fruit images. So,
this dataset is given to the Random
forest classifier. The dataset is divided
into subsets and given to each decision
tree. During the training phase, each
decision tree produces a prediction
result, and when a new data point
occurs, then based on the majority of
results, the Random Forest classifier
predicts the final decision
Why use Random Forest?
❖It takes less training time as compared to other algorithms.
❖It predicts output with high accuracy, even for the large dataset it runs
efficiently.
❖It can also maintain accuracy when a large proportion of data is missing.
Applications of Random Forest
There are mainly four sectors where Random forest mostly used:
❖Banking: Banking sector mostly uses this algorithm for the identification of loan
risk.
❖Medicine: With the help of this algorithm, disease trends and risks of the
disease can be identified.
❖Land Use: We can identify the areas of similar land use by this algorithm.
❖Marketing: Marketing trends can be identified using this algorithm.
Advantages of Random Forest
❖Random Forest is capable of performing both Classification and Regression
tasks.
❖It is capable of handling large datasets with high dimensionality.
❖It enhances the accuracy of the model and prevents the overfitting issue.
Disadvantages of Random Forest
❖Although random forest can be used for both classification and regression tasks,
it is not more suitable for Regression tasks.
Difference Between Decision Tree and
Random Forest
Decision trees Random Forest
1. Random forests are created from
1. Decision trees normally suffer from subsets of data, and the final output is
the problem of overfitting if it’s allowed based on average or majority ranking;
to grow without any control. hence the problem of overfitting is
taken care of.
2. A single decision tree is faster in
2. It is comparatively slower.
computation.
3. When a data set with features is 3. Random forest randomly selects
taken as input by a decision tree, it observations, builds a decision tree,
will formulate some rules to make and takes the average result. It
predictions. doesn’t use any set of formulas.
Thank you!!

CNS 3-1 Lab Manual
100% (2)
CNS 3-1 Lab Manual
34 pages
Random Forest
No ratings yet
Random Forest
18 pages
2CPSC531 Simulation
No ratings yet
2CPSC531 Simulation
28 pages
Presentation of Master's Thesis: Gait Analysis: Is It Possible To Learn To Walk Like Someone Else?
No ratings yet
Presentation of Master's Thesis: Gait Analysis: Is It Possible To Learn To Walk Like Someone Else?
27 pages
Bagging and Random Forest Presentation1
100% (3)
Bagging and Random Forest Presentation1
23 pages
Machine Learning Random Forest Algorithm - Javatpoint
No ratings yet
Machine Learning Random Forest Algorithm - Javatpoint
14 pages
Random Forest (RF) : Decision Trees
No ratings yet
Random Forest (RF) : Decision Trees
3 pages
Multiple Choice Questions
No ratings yet
Multiple Choice Questions
8 pages
Module 4 - Synchronization Tools
No ratings yet
Module 4 - Synchronization Tools
28 pages
Exercises Tarea 1
No ratings yet
Exercises Tarea 1
6 pages
03 A Polynomial Linear Regression
No ratings yet
03 A Polynomial Linear Regression
6 pages
Random Forests
No ratings yet
Random Forests
1 page
Lecture+Notes+-+Random Forests
No ratings yet
Lecture+Notes+-+Random Forests
10 pages
Random Forest Classifier
No ratings yet
Random Forest Classifier
9 pages
Machine Learning: Practical Tutorial On Random Forest and Parameter Tuning in R
No ratings yet
Machine Learning: Practical Tutorial On Random Forest and Parameter Tuning in R
11 pages
Megaprojetos 2008 - Apresentação - Rob Smith - Megaprojects Need New Tools: Why Current PM Tools Don't Deliver The Goods
No ratings yet
Megaprojetos 2008 - Apresentação - Rob Smith - Megaprojects Need New Tools: Why Current PM Tools Don't Deliver The Goods
30 pages
RSA Cryptosystem Using Python
No ratings yet
RSA Cryptosystem Using Python
3 pages
ERM Study Schedule
No ratings yet
ERM Study Schedule
32 pages
Random Forest
No ratings yet
Random Forest
25 pages
Random Forest Summary
No ratings yet
Random Forest Summary
6 pages
ALGO Practice Session-I
No ratings yet
ALGO Practice Session-I
2 pages
Random FOrest
No ratings yet
Random FOrest
19 pages
Random Forest Algorithms - Comprehensive Guide With Examples
No ratings yet
Random Forest Algorithms - Comprehensive Guide With Examples
13 pages
CSL0777 L26
No ratings yet
CSL0777 L26
33 pages
Exploring The Mysteries of Quantum Computing
No ratings yet
Exploring The Mysteries of Quantum Computing
3 pages
Random Forest
No ratings yet
Random Forest
10 pages
CS3381 Oop Manual Cse
No ratings yet
CS3381 Oop Manual Cse
52 pages
Random Forest
No ratings yet
Random Forest
29 pages
Numerical Solutions of The Integral Equations of The First Kind
100% (1)
Numerical Solutions of The Integral Equations of The First Kind
8 pages
Week 6 - Random Forest
No ratings yet
Week 6 - Random Forest
12 pages
Random Forest Summary - Rashmi
No ratings yet
Random Forest Summary - Rashmi
2 pages
Random Forest Algorithm
No ratings yet
Random Forest Algorithm
4 pages
Random Forest Algorithm
No ratings yet
Random Forest Algorithm
3 pages
Optimization For Machine Learning: Lecture 12: Coordinate Descent, BCD, Altmin 6.881: MIT
No ratings yet
Optimization For Machine Learning: Lecture 12: Coordinate Descent, BCD, Altmin 6.881: MIT
124 pages
Unit 3 Queue
No ratings yet
Unit 3 Queue
52 pages
ML Unit 3
No ratings yet
ML Unit 3
22 pages
Computer Graphics Using OpenGL - 4
No ratings yet
Computer Graphics Using OpenGL - 4
80 pages
Lecture 6
No ratings yet
Lecture 6
24 pages
Random Forest
No ratings yet
Random Forest
8 pages
Random Forests
No ratings yet
Random Forests
35 pages
Mtes1104 Coursework
100% (1)
Mtes1104 Coursework
3 pages
Decision Tree Classification Algorithm
No ratings yet
Decision Tree Classification Algorithm
4 pages
RandomForest ML
No ratings yet
RandomForest ML
5 pages
Class 6 AI Excite - Period 1
No ratings yet
Class 6 AI Excite - Period 1
13 pages
AMME
No ratings yet
AMME
192 pages
Module 5 - Clustering - Afterclassb
No ratings yet
Module 5 - Clustering - Afterclassb
49 pages
Random Forest
No ratings yet
Random Forest
27 pages
Random Forest Algorithm
No ratings yet
Random Forest Algorithm
9 pages
Unit 4 (Ensemble Methods)
No ratings yet
Unit 4 (Ensemble Methods)
24 pages
Random Forest Algorithm
No ratings yet
Random Forest Algorithm
39 pages
Classification Algorithms
No ratings yet
Classification Algorithms
68 pages
Polynomials
No ratings yet
Polynomials
2 pages
UNIT-3 Material
No ratings yet
UNIT-3 Material
19 pages
Data Mining Notes
No ratings yet
Data Mining Notes
5 pages
Stats - Mock Set 2
No ratings yet
Stats - Mock Set 2
20 pages
Bagging and Boosting
No ratings yet
Bagging and Boosting
32 pages
Random Forest
No ratings yet
Random Forest
4 pages
Random Forest Algorithm Unit 3
No ratings yet
Random Forest Algorithm Unit 3
2 pages
An Introduction To Random Forest Algorithm For Beginners
No ratings yet
An Introduction To Random Forest Algorithm For Beginners
16 pages
Machine Learning
No ratings yet
Machine Learning
5 pages
Deep Learning and Neural Networks
No ratings yet
Deep Learning and Neural Networks
21 pages
Random Forest
No ratings yet
Random Forest
13 pages
Random Forest Algorithm
No ratings yet
Random Forest Algorithm
2 pages
Random Forest
No ratings yet
Random Forest
6 pages
Lecture Note On Control Engineering 1
No ratings yet
Lecture Note On Control Engineering 1
29 pages
015 - Random Forest
No ratings yet
015 - Random Forest
15 pages
Macroscopic and Large Scale Phenomena Coarse Graining Mean Field Limits and Ergodicity 1st Edition Adrian Muntean
No ratings yet
Macroscopic and Large Scale Phenomena Coarse Graining Mean Field Limits and Ergodicity 1st Edition Adrian Muntean
65 pages
Random Forest-Supervised ML
No ratings yet
Random Forest-Supervised ML
45 pages
Random Forest
No ratings yet
Random Forest
2 pages
Project File - AI (Final Term Class X) 2024
No ratings yet
Project File - AI (Final Term Class X) 2024
7 pages
Random Forest Algorithm 1
No ratings yet
Random Forest Algorithm 1
14 pages
Random Forest Summary
No ratings yet
Random Forest Summary
6 pages
Technical Report of HCB Team For Multiview Egocentric Hand Tracking Challenge On HANDS 2024 Challenge
No ratings yet
Technical Report of HCB Team For Multiview Egocentric Hand Tracking Challenge On HANDS 2024 Challenge
3 pages
Classification Analysis Report PDF
No ratings yet
Classification Analysis Report PDF
9 pages
Aditri Chaudhuri - DM
No ratings yet
Aditri Chaudhuri - DM
10 pages
Assingment Ai
No ratings yet
Assingment Ai
7 pages
Random Forest Algorithm Updated
No ratings yet
Random Forest Algorithm Updated
11 pages
Random Forest
No ratings yet
Random Forest
14 pages
Randon Forest
No ratings yet
Randon Forest
34 pages
D3 IT Random Forest Apr 2023
No ratings yet
D3 IT Random Forest Apr 2023
32 pages
Random Forest
No ratings yet
Random Forest
9 pages
Random Forest
No ratings yet
Random Forest
10 pages
DCS Unit - 1
No ratings yet
DCS Unit - 1
7 pages
Random Forest Classic Style
No ratings yet
Random Forest Classic Style
9 pages
Random Forest, CNN and Different Algorithm
No ratings yet
Random Forest, CNN and Different Algorithm
14 pages
Eda - M4
No ratings yet
Eda - M4
7 pages
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
Alternating Decision Tree: Fundamentals and Applications
From Everand
Alternating Decision Tree: Fundamentals and Applications
Fouad Sabry
No ratings yet
Differential Evolution: Fundamentals and Applications
From Everand
Differential Evolution: Fundamentals and Applications
Fouad Sabry
No ratings yet
Decision Tree Pruning: Fundamentals and Applications
From Everand
Decision Tree Pruning: Fundamentals and Applications
Fouad Sabry
No ratings yet

Lecture-12 Machine Learning With Python

Uploaded by

Lecture-12 Machine Learning With Python

Uploaded by

Lecture-12

Machine Learning with

You might also like