0% found this document useful (0 votes)

31 views6 pages

Machine Learning - Random Forest

Uploaded by

A Sekar CSE KIOT

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

31 views6 pages

Machine Learning - Random Forest

Uploaded by

A Sekar CSE KIOT

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

7/1/24, 10:56 AM Machine Learning - Random Forest

Machine Learning - Random Forest

Random Forest is a machine learning algorithm that uses an ensemble of decision trees to
make predictions. The algorithm was first introduced by Leo Breiman in 2001. The key
idea behind the algorithm is to create a large number of decision trees, each of which is
trained on a different subset of the data. The predictions of these individual trees are then
combined to produce a final prediction.

Working of Random Forest Algorithm

We can understand the working of Random Forest algorithm with the help of following
steps −

Step 1 − First, start with the selection of random samples from a given dataset.
Step 2 − Next, this algorithm will construct a decision tree for every sample. Then
it will get the prediction result from every decision tree.

Step 3 − In this step, voting will be performed for every predicted result.

Step 4 − At last, select the most voted prediction result as the final prediction
result.

The following diagram illustrates how the Random Forest Algorithm works −

https://www.tutorialspoint.com/machine_learning/machine_learning_random_forest_classification.htm 1/6
7/1/24, 10:56 AM Machine Learning - Random Forest

Random Forest is a flexible algorithm that can be used for both classification and
regression tasks. In classification tasks, the algorithm uses the mode of the predictions of
the individual trees to make the final prediction. In regression tasks, the algorithm uses
the mean of the predictions of the individual trees.

Advantages of Random Forest Algorithm

Random Forest algorithm has several advantages over other machine learning algorithms.
Some of the key advantages are −

Robustness to Overfitting − Random Forest algorithm is known for its

robustness to overfitting. This is because the algorithm uses an ensemble of
decision trees, which helps to reduce the impact of outliers and noise in the data.
High Accuracy − Random Forest algorithm is known for its high accuracy. This is
because the algorithm combines the predictions of multiple decision trees, which
helps to reduce the impact of individual decision trees that may be biased or
inaccurate.

Handles Missing Data − Random Forest algorithm can handle missing data
without the need for imputation. This is because the algorithm only considers the
features that are available for each data point and does not require all features to
be present for all data points.

https://www.tutorialspoint.com/machine_learning/machine_learning_random_forest_classification.htm 2/6
7/1/24, 10:56 AM Machine Learning - Random Forest

Non-Linear Relationships − Random Forest algorithm can handle non-linear

relationships between the features and the target variable. This is because the
algorithm uses decision trees, which can model non-linear relationships.

Feature Importance − Random Forest algorithm can provide information about

the importance of each feature in the model. This information can be used to
identify the most important features in the data and can be used for feature
selection and feature engineering.

Implementation of Random Forest Algorithm in Python

Let's take a look at the implementation of Random Forest Algorithm in Python. We will be
using the scikit-learn library to implement the algorithm. The scikit-learn library is a
popular machine learning library that provides a wide range of algorithms and tools for
machine learning.

Step 1 − Importing the Libraries

We will begin by importing the necessary libraries. We will be using the pandas library for
data manipulation, and the scikit-learn library for implementing the Random Forest
algorithm.

import pandas as pd
from sklearn.ensemble import RandomForestClassifier

Step 2 − Loading the Data

Next, we will load the data into a pandas dataframe. For this tutorial, we will be using the
famous Iris dataset, which is a classic dataset for classification tasks.

# Loading the iris dataset

iris = pd.read_csv('https://archive.ics.uci.edu/ml/machine-learningdatabases/iris/

iris.columns = ['sepal_length', 'sepal_width', 'petal_length','petal_width', 'spec

Step 3 − Data Preprocessing

Before we can use the data to train our model, we need to preprocess it. This involves
separating the features and the target variable and splitting the data into training and
testing sets.

https://www.tutorialspoint.com/machine_learning/machine_learning_random_forest_classification.htm 3/6
7/1/24, 10:56 AM Machine Learning - Random Forest

# Separating the features and target variable

X = iris.iloc[:, :-1]
y = iris.iloc[:, -1]

# Splitting the data into training and testing sets

from sklearn.model_selection import train_test_split

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.35, random_s

Step 4 − Training the Model

Next, we will train our Random Forest classifier on the training data.

# Creating the Random Forest classifier object

rfc = RandomForestClassifier(n_estimators=100)

# Training the model on the training data

rfc.fit(X_train, y_train)

Step 5 − Making Predictions

Once we have trained our model, we can use it to make predictions on the test data.

# Making predictions on the test data

y_pred = rfc.predict(X_test)

Step 6 − Evaluating the Model

Finally, we will evaluate the performance of our model using various metrics such as
accuracy, precision, recall, and F1-score.

# Importing the metrics library

from sklearn.metrics import accuracy_score, precision_score,
recall_score, f1_score

# Calculating the accuracy, precision, recall, and F1-score

accuracy = accuracy_score(y_test, y_pred)
precision = precision_score(y_test, y_pred, average='weighted')
recall = recall_score(y_test, y_pred, average='weighted')
f1 = f1_score(y_test, y_pred, average='weighted')

https://www.tutorialspoint.com/machine_learning/machine_learning_random_forest_classification.htm 4/6
7/1/24, 10:56 AM Machine Learning - Random Forest

print("Accuracy:", accuracy)
print("Precision:", precision)
print("Recall:", recall)
print("F1-score:", f1)

Complete Implementation Example

Below is the complete implementation example of Random Forest Algorithm in python
using the iris dataset −

import pandas as pd
from sklearn.ensemble import RandomForestClassifier

# Loading the iris dataset

iris = pd.read_csv('https://archive.ics.uci.edu/ml/machine-learningdatabases/iris/

iris.columns = ['sepal_length', 'sepal_width', 'petal_length', 'petal_width', 'spe

# Separating the features and target variable

X = iris.iloc[:, :-1]
y = iris.iloc[:, -1]

# Splitting the data into training and testing sets

from sklearn.model_selection import train_test_split

X_train, X_test, y_train, y_test = train_test_split(X, y,

test_size=0.35, random_state=42)

# Creating the Random Forest classifier object

rfc = RandomForestClassifier(n_estimators=100)

# Training the model on the training data

rfc.fit(X_train, y_train)
# Making predictions on the test data
y_pred = rfc.predict(X_test)
# Importing the metrics library
from sklearn.metrics import accuracy_score, precision_score,
recall_score, f1_score

# Calculating the accuracy, precision, recall, and F1-score

accuracy = accuracy_score(y_test, y_pred)
precision = precision_score(y_test, y_pred, average='weighted')

https://www.tutorialspoint.com/machine_learning/machine_learning_random_forest_classification.htm 5/6
7/1/24, 10:56 AM Machine Learning - Random Forest

recall = recall_score(y_test, y_pred, average='weighted')

f1 = f1_score(y_test, y_pred, average='weighted')

print("Accuracy:", accuracy)
print("Precision:", precision)
print("Recall:", recall)
print("F1-score:", f1)

Output

This will give us the performance metrics of our Random Forest classifier as follows −

Accuracy: 0.9811320754716981
Precision: 0.9821802935010483
Recall: 0.9811320754716981
F1-score: 0.9811157396063056

https://www.tutorialspoint.com/machine_learning/machine_learning_random_forest_classification.htm 6/6

Random Forest Algorithm Updated
No ratings yet
Random Forest Algorithm Updated
11 pages
Random Forest
No ratings yet
Random Forest
9 pages
Random Forest 1737667979
No ratings yet
Random Forest 1737667979
11 pages
Hartshorn, Scott 2016 - Machin Learning With Random Forests and Decision Trees - A Visual Guide For Beginners
No ratings yet
Hartshorn, Scott 2016 - Machin Learning With Random Forests and Decision Trees - A Visual Guide For Beginners
98 pages
Class 8 - Linear Regression
No ratings yet
Class 8 - Linear Regression
56 pages
Machine Learning With Random Forests and Decision Trees - A Visual Guide For Beginners by Scott Hartshorn
No ratings yet
Machine Learning With Random Forests and Decision Trees - A Visual Guide For Beginners by Scott Hartshorn
73 pages
Python Implementation of Random Forest Algorithm
No ratings yet
Python Implementation of Random Forest Algorithm
10 pages
17 Random Vectors 2 Lecture
No ratings yet
17 Random Vectors 2 Lecture
49 pages
Machine Learning Random Forest Algorithm - Javatpoint
No ratings yet
Machine Learning Random Forest Algorithm - Javatpoint
14 pages
Machine Learning With Random Forests and Decision Trees - A Visual Guide For Beginners (Naren) PDF
No ratings yet
Machine Learning With Random Forests and Decision Trees - A Visual Guide For Beginners (Naren) PDF
68 pages
Binary Logistic Regression Using Stata 17 Drop-Down Menus
No ratings yet
Binary Logistic Regression Using Stata 17 Drop-Down Menus
53 pages
Unit V Fds Notes
No ratings yet
Unit V Fds Notes
35 pages
Data Analysis Coca Cola
No ratings yet
Data Analysis Coca Cola
7 pages
Random Forest Classification
No ratings yet
Random Forest Classification
8 pages
Day93 94 Diabetes Prediction Model
No ratings yet
Day93 94 Diabetes Prediction Model
27 pages
Assessment of The Random Forest Algorithm 1
No ratings yet
Assessment of The Random Forest Algorithm 1
4 pages
Stastyy
No ratings yet
Stastyy
2 pages
Random - Forest - Classification - Ipynb - Colab
No ratings yet
Random - Forest - Classification - Ipynb - Colab
3 pages
ChatGPT Randomforest
No ratings yet
ChatGPT Randomforest
4 pages
Data Mining A Tutorial-Based Primer, Second Edition PDF
100% (1)
Data Mining A Tutorial-Based Primer, Second Edition PDF
530 pages
Python Programming for Beginners_ From Basics to AI Integrations. 5-Minute Illustrated Tutorials, Coding Hacks, Hands-On Exercises & Case Studies to Master Python in 7 Days and Get Paid More by Prince
100% (10)
Python Programming for Beginners_ From Basics to AI Integrations. 5-Minute Illustrated Tutorials, Coding Hacks, Hands-On Exercises & Case Studies to Master Python in 7 Days and Get Paid More by Prince
244 pages
BES220 - Theme 5 Linear Regression - Lecture 2 Line Fitting and Correlation - Slides
No ratings yet
BES220 - Theme 5 Linear Regression - Lecture 2 Line Fitting and Correlation - Slides
20 pages
Regression On Real Estate
No ratings yet
Regression On Real Estate
54 pages
PROPOSAL
No ratings yet
PROPOSAL
3 pages
Lecture 19 Different Classification Models
No ratings yet
Lecture 19 Different Classification Models
22 pages
Important Questions
No ratings yet
Important Questions
3 pages
RandomForest ML
No ratings yet
RandomForest ML
5 pages
Introduction To Econometrics ECO 356 Course Guide and Course Material
No ratings yet
Introduction To Econometrics ECO 356 Course Guide and Course Material
139 pages
Unleashing The Power of Random Forest - A Journey Through Algorithmic Canopies
No ratings yet
Unleashing The Power of Random Forest - A Journey Through Algorithmic Canopies
14 pages
Machine Learning
No ratings yet
Machine Learning
23 pages
Random Forest
No ratings yet
Random Forest
10 pages
10 Random - Forest - Algo
No ratings yet
10 Random - Forest - Algo
6 pages
Machine Learning (VR20) III B.Tech - II Semester: Random Forest Algorithm
No ratings yet
Machine Learning (VR20) III B.Tech - II Semester: Random Forest Algorithm
14 pages
Random Forest
No ratings yet
Random Forest
2 pages
Worksheet
100% (1)
Worksheet
2 pages
AAM 6th Prac
No ratings yet
AAM 6th Prac
3 pages
Random Forest
No ratings yet
Random Forest
14 pages
Analysis of Variance-Two Way Classification
No ratings yet
Analysis of Variance-Two Way Classification
4 pages
Tutorial Chapter 3-STA
No ratings yet
Tutorial Chapter 3-STA
2 pages
Randon Forest
No ratings yet
Randon Forest
34 pages
015 - Random Forest
No ratings yet
015 - Random Forest
15 pages
Regression Anaysis Explaination Lecture Notes by Dr. Wahid Sherani
No ratings yet
Regression Anaysis Explaination Lecture Notes by Dr. Wahid Sherani
7 pages
Random Forest
No ratings yet
Random Forest
11 pages
DS 7
No ratings yet
DS 7
5 pages
Module 5
No ratings yet
Module 5
28 pages
Data Science Theory, Analysis and Applications - Memon - Ahmed
100% (12)
Data Science Theory, Analysis and Applications - Memon - Ahmed
345 pages
2023AIB1008 Lab08
No ratings yet
2023AIB1008 Lab08
8 pages
Chapter 12 Heteroskedasticity PDF
No ratings yet
Chapter 12 Heteroskedasticity PDF
20 pages
Random Forest
No ratings yet
Random Forest
21 pages
ARIMA and Sugar Cane Juice
No ratings yet
ARIMA and Sugar Cane Juice
4 pages
Unit - 2 ML Notes
No ratings yet
Unit - 2 ML Notes
14 pages
ML Asst.-01
No ratings yet
ML Asst.-01
21 pages
Random Forest Algorithm 1
No ratings yet
Random Forest Algorithm 1
14 pages
Econ 582 Forecasting: Eric Zivot
No ratings yet
Econ 582 Forecasting: Eric Zivot
20 pages
Random Forest in ML
No ratings yet
Random Forest in ML
13 pages
Random Forest
No ratings yet
Random Forest
2 pages
Learn Excel Data Analysis
100% (15)
Learn Excel Data Analysis
721 pages
Random Forest
No ratings yet
Random Forest
6 pages
Hamza Samad 3
No ratings yet
Hamza Samad 3
2 pages
Machine Learning With Python
100% (14)
Machine Learning With Python
692 pages
Forest
No ratings yet
Forest
2 pages
Random Forest Classic Style
No ratings yet
Random Forest Classic Style
9 pages
Machine Learning With Decision Trees and Random Forest ?
No ratings yet
Machine Learning With Decision Trees and Random Forest ?
31 pages
Excel Assignment Opre 3360 Nateb
No ratings yet
Excel Assignment Opre 3360 Nateb
70 pages
Lecture-12 Machine Learning With Python
No ratings yet
Lecture-12 Machine Learning With Python
18 pages
Random Forest
No ratings yet
Random Forest
13 pages
03 - Random Forest
No ratings yet
03 - Random Forest
24 pages
Hang Li - Machine Learning Methods-Springer (2023) (Z-Lib - Io)
100% (9)
Hang Li - Machine Learning Methods-Springer (2023) (Z-Lib - Io)
530 pages
4 Curve Fitting Least Square Regression and Interpolation
No ratings yet
4 Curve Fitting Least Square Regression and Interpolation
59 pages
Data Analysis From Scratch With Python - Beginner Guide Using Python, Pandas, NumPy, Scikit-Learn, IPython, TensorFlow and
100% (10)
Data Analysis From Scratch With Python - Beginner Guide Using Python, Pandas, NumPy, Scikit-Learn, IPython, TensorFlow and
104 pages
Deep Learning and Neural Networks
No ratings yet
Deep Learning and Neural Networks
21 pages
Student Performance Analysis Using Machine Learning: Yamnampet, Hyderabad.
No ratings yet
Student Performance Analysis Using Machine Learning: Yamnampet, Hyderabad.
8 pages
DATA ANALYTICS - A Comprehensive Beginner's Guide To Learn About The Realms of Data Analytics From A-Z
88% (17)
DATA ANALYTICS - A Comprehensive Beginner's Guide To Learn About The Realms of Data Analytics From A-Z
102 pages
Multiple Linear Regression: Application
No ratings yet
Multiple Linear Regression: Application
22 pages
Random Forest Medical Diagnosis 1684665707
No ratings yet
Random Forest Medical Diagnosis 1684665707
10 pages
Data Visualization With Python PDF
93% (14)
Data Visualization With Python PDF
662 pages
Machine Learning With Random Forests - by Knoldus Inc. - Knoldus - Technical Insights - Medium
No ratings yet
Machine Learning With Random Forests - by Knoldus Inc. - Knoldus - Technical Insights - Medium
12 pages
Random Forest Algorithms - Comprehensive Guide With Examples
No ratings yet
Random Forest Algorithms - Comprehensive Guide With Examples
13 pages
Cia 4 ML
No ratings yet
Cia 4 ML
60 pages
The Python Bible
97% (31)
The Python Bible
506 pages
CSL0777 L26
No ratings yet
CSL0777 L26
33 pages
Random Forest Algorithm
No ratings yet
Random Forest Algorithm
2 pages
Random FOrest
No ratings yet
Random FOrest
19 pages
Random Forest Algorithm Unit 3
No ratings yet
Random Forest Algorithm Unit 3
2 pages
Step-By-Step-Diabetes-Classification-Knn-Detailed-Copy1 - Jupyter Notebook
No ratings yet
Step-By-Step-Diabetes-Classification-Knn-Detailed-Copy1 - Jupyter Notebook
12 pages
2019 Book DataScienceAndBigDataAnalytics
100% (15)
2019 Book DataScienceAndBigDataAnalytics
418 pages
Random Forest
No ratings yet
Random Forest
4 pages
Random Forest Algorithm
No ratings yet
Random Forest Algorithm
9 pages
Introduction To HTML & CSS
94% (35)
Introduction To HTML & CSS
155 pages
Random Forest - Basics
No ratings yet
Random Forest - Basics
9 pages
Machine Learning
100% (11)
Machine Learning
135 pages
EBOOK - Python Crash Course For Data Analysis
100% (12)
EBOOK - Python Crash Course For Data Analysis
168 pages
Random Forest Algorithm
No ratings yet
Random Forest Algorithm
4 pages
MS - Excel - Linear - & - Multiple - Regression Office 2007
No ratings yet
MS - Excel - Linear - & - Multiple - Regression Office 2007
7 pages
Random Forest
No ratings yet
Random Forest
18 pages
Understanding Machine Learning
100% (69)
Understanding Machine Learning
416 pages
Random Forest Algorithm
No ratings yet
Random Forest Algorithm
3 pages
Lecture+Notes+-+Random Forests
No ratings yet
Lecture+Notes+-+Random Forests
10 pages
A "Short" Introduction To Model Selection
No ratings yet
A "Short" Introduction To Model Selection
25 pages
Recommendation System in Python
No ratings yet
Recommendation System in Python
13 pages
Full Course of Machine Learning
100% (16)
Full Course of Machine Learning
660 pages
Hands On Machine Learning With Python Concepts and Applications For Beginners - John Anderson 2018
91% (11)
Hands On Machine Learning With Python Concepts and Applications For Beginners - John Anderson 2018
166 pages
Statistical Data Analysis Explained
93% (27)
Statistical Data Analysis Explained
359 pages
Learning The Pandas Library Python Tools For Data Munging Analysis and Visual PDF
100% (18)
Learning The Pandas Library Python Tools For Data Munging Analysis and Visual PDF
208 pages
Algorithms For Data Science 1st Brian Steele (WWW - Ebook DL - Com)
94% (16)
Algorithms For Data Science 1st Brian Steele (WWW - Ebook DL - Com)
438 pages
Midterm 2008s Solution
No ratings yet
Midterm 2008s Solution
12 pages
STEP SPSS ANALYSIS COHEN KAPPA and ICC
No ratings yet
STEP SPSS ANALYSIS COHEN KAPPA and ICC
5 pages
Hackers Guide To Machine Learning With Python PDF
100% (15)
Hackers Guide To Machine Learning With Python PDF
272 pages
9781838826321-Managing Data Science
100% (7)
9781838826321-Managing Data Science
276 pages
Cluster Analysis: Concepts and Techniques - Chapter 7
100% (1)
Cluster Analysis: Concepts and Techniques - Chapter 7
60 pages
Machine Learning Projects in Python
100% (16)
Machine Learning Projects in Python
135 pages
AI Publishing. Python Scikit-Learn For Beginners... For Data Scientist 2021
100% (8)
AI Publishing. Python Scikit-Learn For Beginners... For Data Scientist 2021
339 pages
R Book PDF
100% (4)
R Book PDF
291 pages
Machine Learning Paradigms
100% (10)
Machine Learning Paradigms
336 pages
Top 20 MS Excel VBA Simulations, VBA to Model Risk, Investments, Growth, Gambling, and Monte Carlo Analysis
From Everand
Top 20 MS Excel VBA Simulations, VBA to Model Risk, Investments, Growth, Gambling, and Monte Carlo Analysis
Andrei Besedin
2.5/5 (2)
Artificial Intelligence Algorithms
From Everand
Artificial Intelligence Algorithms
akosnemeth
No ratings yet
DEEP LEARNING TECHNIQUES: CLUSTER ANALYSIS and PATTERN RECOGNITION with NEURAL NETWORKS. Examples with MATLAB
From Everand
DEEP LEARNING TECHNIQUES: CLUSTER ANALYSIS and PATTERN RECOGNITION with NEURAL NETWORKS. Examples with MATLAB
César Pérez López
No ratings yet
Image Classification: Step-by-step Classifying Images with Python and Techniques of Computer Vision and Machine Learning
From Everand
Image Classification: Step-by-step Classifying Images with Python and Techniques of Computer Vision and Machine Learning
Mark Magic
No ratings yet
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
From Everand
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
César Pérez López
No ratings yet
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet