0% found this document useful (0 votes)

17 views7 pages

ML Assignment-01

Uploaded by

Aakash

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views7 pages

ML Assignment-01

Uploaded by

Aakash

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

ML ASSIGNMENT -01

PART –A

Question-01:Define random forest algorithm?

ANS: The Random Forest algorithm is an ensemble learning method primarily used for classification
and regression tasks. It operates by constructing multiple decision trees during training and outputs the
mode of the classes (classification) or the mean prediction (regression) of the individual trees.

Question-02:Define logistic Regression?

ANS: Logistic Regression is a statistical method used for binary classification problems, where
the goal is to predict one of two possible outcomes (e.g., yes/no, true/false, 0/1). It estimates the
probability that a given input belongs to a particular class based on a set of features

Question-03:Define Support vector machine?

ANS: A Support Vector Machine (SVM) is a supervised learning algorithm used for classification and
regression tasks. It is particularly effective for binary classification problems. The key idea behind SVM is
to find the optimal boundary, known as a hyperplane, that best separates the data into different classes

PART-B

Question-01:Decission Tree with numwrical example?

ANS: A Decision Tree is a supervised learning algorithm used for both classification and
regression tasks. It splits the dataset into subsets based on the most significant feature to predict
the target variable, using a tree-like structure.

Key Concepts:

1. Root Node: Represents the entire dataset and splits into subsets.
2. Decision Nodes: Intermediate nodes where the data gets further split.
3. Leaf Nodes: The final nodes representing the output (class label in classification or value
in regression).
4. Splitting Criteria: Measures like Gini Index, Information Gain, or Variance Reduction
(for regression) are used to split the data at each node.
Example: Classification using a Decision Tree

Let's use a simple dataset to predict whether a person will buy a product based on age and
income:

Step-by-Step Construction of the Decision Tree

1. Choose the Best Splitting Attribute: To determine the best split, we use Information
Gain or the Gini Index. Let's assume we use Gini Index here.

The Gini Index for a split is calculated as:

where pi is the proportion of data points belonging to class iii.

2. Calculate Gini for Each Split: Let's consider splitting the data based on Age and
Income. We start with Age and find the best threshold for splitting.
o Age ≤ 35:
 Group 1 (Age ≤ 35): {Person 1, Person 2, Person 3}
 Group 2 (Age > 35): {Person 4, Person 5, Person 6, Person 7}
 For Group 1:
 2 people don’t buy (No), 1 person buys (Yes).
 Gini Index for Group 1:

 For Group 2:
 1 person doesn’t buy (No), 3 people buy (Yes).
 Gini Index for Group 2:

 Weighted Gini for this split :

o Similarly, we calculate for other possible splits (like Income) and find the split
that minimizes the Gini Index.
3. Split the Data: Assume that splitting by Age ≤ 35 gives the best Gini index. We now
create two branches:
o For Age ≤ 35, we check further conditions like Income.
o For Age > 35, we check the next best feature.
4. Repeat Until Stopping Criteria: The process repeats for each subset until either:
o All data points in a node belong to a single class.
o The maximum depth of the tree is reached.
o There are no further significant splits.

Final Decision Tree:

[Age <= 35]
/ \
No (Most) [Income]
/ \
Low (Yes) High (Yes)

This is a simplified example where:

 If the Age ≤ 35, the prediction is No.

 If the Age > 35, we look at Income:
o If the income is Low, the prediction is Yes.
o If the income is High, the prediction is Yes.

Advantages:

 Easy to understand and interpret.

 Handles both categorical and numerical data.
 Can model non-linear relationships.
Disadvantages:

 Prone to overfitting, especially with deep trees.

 Unstable: A small change in data can significantly change the tree structure.

Question-02:Linear Regression with example?

ANS: Linear Regression is a supervised learning algorithm used for predicting a continuous
target variable based on one or more independent (input) variables. It assumes a linear
relationship between the input variables (features) and the output variable (target). The goal is to
find the line (in the case of one feature) or the hyperplane (in the case of multiple features) that
best fits the data.

Key Concepts:

Introduction To Big Data and Data Mining
No ratings yet
Introduction To Big Data and Data Mining
130 pages
Fundamentals of Data Science Unit 4
100% (1)
Fundamentals of Data Science Unit 4
31 pages
Unit-4 (1) .Docx ML
No ratings yet
Unit-4 (1) .Docx ML
42 pages
Unit-5 Decision Trees and Ensemble Learning
100% (1)
Unit-5 Decision Trees and Ensemble Learning
162 pages
Decision Trees
100% (2)
Decision Trees
16 pages
Mod 3 Part1 - Merged
No ratings yet
Mod 3 Part1 - Merged
101 pages
ML Classifiers
No ratings yet
ML Classifiers
48 pages
Classification, Prediction
100% (1)
Classification, Prediction
67 pages
EDA Cat2
No ratings yet
EDA Cat2
54 pages
Decision Trees
67% (3)
Decision Trees
14 pages
ML Unit 2
No ratings yet
ML Unit 2
84 pages
Module 5 Machine Learning
No ratings yet
Module 5 Machine Learning
36 pages
M2 - Supervised Machine Learning
No ratings yet
M2 - Supervised Machine Learning
79 pages
21CS54 Module 1
100% (2)
21CS54 Module 1
35 pages
Refer For KNNDecison Tree SVM
No ratings yet
Refer For KNNDecison Tree SVM
90 pages
DM Unit-4
No ratings yet
DM Unit-4
75 pages
CSE445 NSU Week - 4
No ratings yet
CSE445 NSU Week - 4
48 pages
06-Classification Part1
No ratings yet
06-Classification Part1
44 pages
Unit 1 Classification & Prediction DM
No ratings yet
Unit 1 Classification & Prediction DM
71 pages
Pa Unit-Iii
No ratings yet
Pa Unit-Iii
75 pages
AI Chapter 3 Part 2
No ratings yet
AI Chapter 3 Part 2
51 pages
Data Analytics Unit IV
No ratings yet
Data Analytics Unit IV
36 pages
DT-0 (3 Files Merged)
No ratings yet
DT-0 (3 Files Merged)
143 pages
Classification and Prediction
No ratings yet
Classification and Prediction
40 pages
Module 5 - Supervised Learning Algorithms
No ratings yet
Module 5 - Supervised Learning Algorithms
38 pages
Unit 3 (MLT)
No ratings yet
Unit 3 (MLT)
42 pages
ML Lecture 8 9 Classification
No ratings yet
ML Lecture 8 9 Classification
35 pages
1.decision Trees Concepts
No ratings yet
1.decision Trees Concepts
70 pages
Business Analytics: Data Classification
No ratings yet
Business Analytics: Data Classification
36 pages
Machine Learning: Classification & Decision Trees
No ratings yet
Machine Learning: Classification & Decision Trees
24 pages
SemVII MachineLearning
No ratings yet
SemVII MachineLearning
22 pages
Decision Tree Learning
No ratings yet
Decision Tree Learning
22 pages
Machine - Learning - Lecture - 08 - Decision Tree Learning
No ratings yet
Machine - Learning - Lecture - 08 - Decision Tree Learning
67 pages
Chapter 2 Types of Machine Learning and Their Learning Strategies
No ratings yet
Chapter 2 Types of Machine Learning and Their Learning Strategies
45 pages
Unit 2
No ratings yet
Unit 2
11 pages
Types of Kernels in Support Vector Machines
No ratings yet
Types of Kernels in Support Vector Machines
14 pages
ML Unit 2 Final - III Yr
No ratings yet
ML Unit 2 Final - III Yr
72 pages
Chapter 03
No ratings yet
Chapter 03
30 pages
Unit-IV New
No ratings yet
Unit-IV New
18 pages
Project Report
No ratings yet
Project Report
24 pages
Machine Learning Notes ?
No ratings yet
Machine Learning Notes ?
14 pages
S&ML Unit 6 - Q & A
No ratings yet
S&ML Unit 6 - Q & A
12 pages
Decision Tree
No ratings yet
Decision Tree
12 pages
11) Elaborate On The Types of Machine Learning With Appropriate Examples
No ratings yet
11) Elaborate On The Types of Machine Learning With Appropriate Examples
9 pages
Decision Tree Introduction
No ratings yet
Decision Tree Introduction
14 pages
Authors' Profiles
No ratings yet
Authors' Profiles
378 pages
Data Science Interview Preparation (30 Days of Interview Preparation)
No ratings yet
Data Science Interview Preparation (30 Days of Interview Preparation)
22 pages
Updated Masterclass Curriculum-2
No ratings yet
Updated Masterclass Curriculum-2
35 pages
ML Unit 03
No ratings yet
ML Unit 03
23 pages
End Users Weigh in On Which Companies Provide Best Hardware, Software and Systems
100% (1)
End Users Weigh in On Which Companies Provide Best Hardware, Software and Systems
60 pages
ML Unit3 QB Solutions
No ratings yet
ML Unit3 QB Solutions
11 pages
Module 04
No ratings yet
Module 04
75 pages
DM Unit 4
No ratings yet
DM Unit 4
24 pages
FMLanswerkey-IT 2
No ratings yet
FMLanswerkey-IT 2
11 pages
Unit 5 - Data Mining - WWW - Rgpvnotes.in
No ratings yet
Unit 5 - Data Mining - WWW - Rgpvnotes.in
15 pages
Decision Tree
No ratings yet
Decision Tree
6 pages
Data Minning Unit 2-1
No ratings yet
Data Minning Unit 2-1
10 pages
MISY 631 Final Review Calculators Will Be Provided For The Exam
No ratings yet
MISY 631 Final Review Calculators Will Be Provided For The Exam
9 pages
ML - Machine Learning PDF
No ratings yet
ML - Machine Learning PDF
13 pages
Conformation Meeting PPT Presentation
No ratings yet
Conformation Meeting PPT Presentation
20 pages
Unit-4 FDS
No ratings yet
Unit-4 FDS
19 pages
ML Unit-2
No ratings yet
ML Unit-2
16 pages
System Analysis Assignment
No ratings yet
System Analysis Assignment
62 pages
Machine Learning
No ratings yet
Machine Learning
9 pages
Machine Learning QNA
No ratings yet
Machine Learning QNA
1 page
UPSC Daily Current Affairs 01 January 2025
No ratings yet
UPSC Daily Current Affairs 01 January 2025
9 pages
Artificial Intelligence Unit 1
No ratings yet
Artificial Intelligence Unit 1
15 pages
Hemler Presentation
No ratings yet
Hemler Presentation
25 pages
Module 1 Introduction To AI
No ratings yet
Module 1 Introduction To AI
40 pages
AI ML 5day Learning Plan
No ratings yet
AI ML 5day Learning Plan
3 pages
Salute To Nurses 2020
100% (1)
Salute To Nurses 2020
12 pages
4.introduction To Learning - Unit 2
No ratings yet
4.introduction To Learning - Unit 2
8 pages
Project File
No ratings yet
Project File
69 pages
Nca Aiio
No ratings yet
Nca Aiio
11 pages
Aimcq
No ratings yet
Aimcq
11 pages
NLP Unit 5
No ratings yet
NLP Unit 5
12 pages
PLA's Intelligentized Warfare: The Politics On China's Military Strategy
No ratings yet
PLA's Intelligentized Warfare: The Politics On China's Military Strategy
20 pages
STDIX Unit4 IntroductiontoGenerativeAIExercise (2024 25)
No ratings yet
STDIX Unit4 IntroductiontoGenerativeAIExercise (2024 25)
6 pages
Industrial IoT Application Architectures and Use Cases 1st Edition A. Suresh 2024 Scribd Download
No ratings yet
Industrial IoT Application Architectures and Use Cases 1st Edition A. Suresh 2024 Scribd Download
49 pages
Chatgpt in Arabic-English Translation
No ratings yet
Chatgpt in Arabic-English Translation
20 pages
Prediction Guard Case Study
No ratings yet
Prediction Guard Case Study
3 pages
Self-Organizing Map (SOM) : Categorization Method, Neural Network Technique, Unsupervised Learning
No ratings yet
Self-Organizing Map (SOM) : Categorization Method, Neural Network Technique, Unsupervised Learning
8 pages
Nsf-Gov researchExperienceSites
No ratings yet
Nsf-Gov researchExperienceSites
12 pages
AI Policy - Taliaferro County School District
No ratings yet
AI Policy - Taliaferro County School District
5 pages
Human Computer Interaction Presentation
No ratings yet
Human Computer Interaction Presentation
14 pages
ASurveyon Face Detectionand Recognition Techniquesfor Applicationin Educational Institutions
No ratings yet
ASurveyon Face Detectionand Recognition Techniquesfor Applicationin Educational Institutions
8 pages
SL Classification For Data Science..
No ratings yet
SL Classification For Data Science..
4 pages
Tuto 6 Optimisation ENSIA
No ratings yet
Tuto 6 Optimisation ENSIA
3 pages
Information Technology
No ratings yet
Information Technology
2 pages
Alternating Decision Tree: Fundamentals and Applications
From Everand
Alternating Decision Tree: Fundamentals and Applications
Fouad Sabry
No ratings yet

ML Assignment-01

Uploaded by

ML Assignment-01

Uploaded by

ML ASSIGNMENT -01

Question-01:Define random forest algorithm?

Question-02:Define logistic Regression?

Question-03:Define Support vector machine?

Question-01:Decission Tree with numwrical example?

Step-by-Step Construction of the Decision Tree

The Gini Index for a split is calculated as:

where pi is the proportion of data points belonging to class iii.

 Weighted Gini for this split :

Final Decision Tree:

This is a simplified example where:

 If the Age ≤ 35, the prediction is No.

 Easy to understand and interpret.

 Prone to overfitting, especially with deep trees.

Question-02:Linear Regression with example?

You might also like