ML Intw - Points To Remember
Random forest and gradient boosting are both tree-based algorithms. Random forest uses bagging: the data is divided into random samples, a model is built on each, and the results are combined by voting or averaging. Gradient boosting uses boosting: misclassified predictions are weighted more heavily so they can be corrected in subsequent rounds, until a stopping criterion is reached. Random forest mainly reduces variance, while gradient boosting reduces both bias and variance. A Type I error rejects a null hypothesis that is true, while a Type II error accepts a null hypothesis that is false. Stratified sampling, unlike random sampling, maintains class proportions across samples in classification problems.
1 - In stratified cross-validation, the split preserves the ratio of the target categories in both the training and validation datasets.
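A minimal sketch of this idea using scikit-learn's StratifiedKFold; the imbalanced toy dataset and fold count are illustrative, not part of the original notes.

```python
# Stratified cross-validation: every fold keeps roughly the same class ratio
# as the full dataset (toy data for illustration).
from sklearn.datasets import make_classification
from sklearn.model_selection import StratifiedKFold

X, y = make_classification(n_samples=100, weights=[0.8, 0.2], random_state=0)

skf = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
for fold, (train_idx, val_idx) in enumerate(skf.split(X, y)):
    # Both the training and validation parts preserve the positive-class ratio.
    print(f"fold {fold}: train positives={y[train_idx].mean():.2f}, "
          f"val positives={y[val_idx].mean():.2f}")
```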
2 - Q21. Both being tree-based algorithms, how is random forest different from the gradient boosting algorithm (GBM)? Answer: The fundamental difference is that random forest uses the bagging technique to make predictions, while GBM uses the boosting technique.
In the bagging technique, a dataset is divided into n samples using randomized sampling. Then, using a single learning algorithm, a model is built on each sample. Finally, the resulting predictions are combined using voting or averaging. Bagging is done in parallel. In boosting, after the first round of predictions, the algorithm weights misclassified predictions higher, so that they can be corrected in the succeeding round. This sequential process of giving higher weights to misclassified predictions continues until a stopping criterion is reached.
Random forest improves model accuracy mainly by reducing variance. The trees grown are uncorrelated, which maximizes the decrease in variance. On the other hand, GBM improves accuracy by reducing both bias and variance in a model. A short sketch contrasting the two is shown below.
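A minimal sketch contrasting bagging (random forest) and boosting (GBM) with scikit-learn; the dataset, hyperparameters, and evaluation setup are illustrative assumptions, not prescriptions.

```python
# Random forest (bagging) vs. gradient boosting (boosting) on toy data.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, GradientBoostingClassifier
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=500, n_features=20, random_state=0)

# Bagging: trees are grown independently (in parallel) on bootstrap samples,
# and their predictions are combined by voting/averaging.
rf = RandomForestClassifier(n_estimators=200, random_state=0)

# Boosting: trees are grown sequentially, each one correcting the errors
# left by the previous rounds.
gbm = GradientBoostingClassifier(n_estimators=200, learning_rate=0.1, random_state=0)

for name, model in [("random forest", rf), ("gradient boosting", gbm)]:
    scores = cross_val_score(model, X, y, cv=5)
    print(f"{name}: mean accuracy = {scores.mean():.3f}")
```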
3 - Q30. What do you understand by Type I vs Type II error? Answer: A Type I error is committed when the null hypothesis is true and we reject it; it is also known as a 'False Positive'. A Type II error is committed when the null hypothesis is false and we accept it; it is also known as a 'False Negative'.
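A minimal sketch mapping these two errors onto a confusion matrix, where label 1 plays the role of "reject the null hypothesis"; the labels below are made-up illustrative data.

```python
# Type I error = false positive, Type II error = false negative.
from sklearn.metrics import confusion_matrix

y_true = [0, 0, 0, 0, 1, 1, 1, 1]   # ground truth (0 = null true, 1 = null false)
y_pred = [0, 1, 0, 0, 1, 0, 1, 1]   # decisions made by the test/classifier

tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
print(f"Type I errors (false positives): {fp}")   # null was true, we rejected it
print(f"Type II errors (false negatives): {fn}")  # null was false, we accepted it
```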
4 - In the case of a classification problem, we should always use stratified sampling instead of random sampling. Random sampling does not take the proportion of target classes into consideration. In contrast, stratified sampling maintains the distribution of the target variable in the resulting samples as well.
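A minimal sketch of the difference using scikit-learn's train_test_split; the imbalanced toy dataset and split sizes are assumptions chosen to make the contrast visible.

```python
# Random vs. stratified train/test split on an imbalanced toy dataset.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=200, weights=[0.9, 0.1], random_state=0)

# Plain random split: the minority-class proportion can drift between splits.
_, _, _, y_rand = train_test_split(X, y, test_size=0.25, random_state=1)

# Stratified split: class proportions are preserved in both splits.
_, _, _, y_strat = train_test_split(X, y, test_size=0.25, random_state=1, stratify=y)

print(f"overall positive rate:  {y.mean():.2f}")
print(f"random test split:      {y_rand.mean():.2f}")
print(f"stratified test split:  {y_strat.mean():.2f}")
```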