Statistics Project

The AdaBoost algorithm builds models sequentially and focuses more on misclassified examples by adjusting example weights. It uses decision stumps as weak learners and combines them into a single model. The SVM algorithm finds the optimal hyperplane that separates classes with the maximum margin and can perform linear and non-linear classification using kernels.


AdaBoost Algorithm:

AdaBoost, also called Adaptive Boosting, is a Machine Learning technique used as an Ensemble Method. The most common weak learner used with AdaBoost is a decision tree with one level, that is, a decision tree with only one split. These trees are also called Decision Stumps.

What this algorithm does is build a model that gives equal weight to all the data points, and then assign higher weights to the points that were wrongly classified. All the points with higher weights are given more importance in the next model. It keeps training models until the error becomes sufficiently low.
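
For concreteness, here is a minimal sketch of this setup using scikit-learn's AdaBoostClassifier with one-level decision trees as the weak learners. The synthetic dataset is only a stand-in for real data, and the parameter names assume a recent scikit-learn version.

    # Minimal AdaBoost-with-decision-stumps sketch (scikit-learn)
    from sklearn.datasets import make_classification
    from sklearn.ensemble import AdaBoostClassifier
    from sklearn.model_selection import train_test_split
    from sklearn.tree import DecisionTreeClassifier

    X, y = make_classification(n_samples=500, n_features=10, random_state=0)
    X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

    # max_depth=1 makes each weak learner a decision stump (a single split);
    # in scikit-learn versions before 1.2 the argument is named base_estimator.
    ada = AdaBoostClassifier(
        estimator=DecisionTreeClassifier(max_depth=1),
        n_estimators=50,
        random_state=0,
    )
    ada.fit(X_train, y_train)
    print("test accuracy:", ada.score(X_test, y_test))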

How AdaBoost Works:

Step 1: A weak classifier (e.g. a decision stump) is made on top of the training data based on the weighted samples. Here, the weight of each sample indicates how important it is for it to be correctly classified. Initially, for the first stump, we give all the samples equal weights.

Step 2: We create a decision stump for each variable and see how well each stump classifies samples to their target classes. For example, we might check Age, Eating Junk Food, and Exercise, and look at how many samples each individual stump classifies correctly or incorrectly as Fit or Unfit.

Step 3: More weight is assigned to the incorrectly classified samples so that they're classified
correctly in the next decision stump. Weight is also assigned to each classifier based on the accuracy
of the classifier, which means high accuracy = high weight!

Step 4: Reiterate from Step 2 until all the data points have been correctly classified, or the maximum
iteration level has been reached.
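
These steps can also be written out as a short training loop. The sketch below is only illustrative (it assumes labels coded as -1/+1 and uses scikit-learn's DecisionTreeClassifier for the weighted stump fit), but it follows Steps 1-4 directly.

    import numpy as np
    from sklearn.tree import DecisionTreeClassifier

    def adaboost_train(X, y, n_rounds=10):
        y = np.asarray(y)                            # expected to hold -1/+1 labels
        n = len(y)
        w = np.full(n, 1.0 / n)                      # Step 1: equal sample weights
        stumps, alphas = [], []
        for _ in range(n_rounds):
            # Step 2: fit a one-split stump on the weighted samples
            stump = DecisionTreeClassifier(max_depth=1).fit(X, y, sample_weight=w)
            pred = stump.predict(X)
            err = w[pred != y].sum()                 # weighted error of this stump
            alpha = 0.5 * np.log((1.0 - err) / (err + 1e-10))   # classifier influence
            # Step 3: raise the weights of misclassified points, lower the rest
            w = w * np.exp(-alpha * y * pred)
            w = w / w.sum()                          # keep the weights summing to 1
            stumps.append(stump)
            alphas.append(alpha)
        return stumps, alphas                        # Step 4: repeat for n_rounds

    def adaboost_predict(X, stumps, alphas):
        # weighted vote of all stumps, each weighted by its influence alpha
        scores = sum(a * s.predict(X) for a, s in zip(alphas, stumps))
        return np.sign(scores)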

AdaBoost Formula:

Here comes the hair-tugging part. Let's break AdaBoost down, step-by-step and equation-by-
equation so that it's easier to comprehend.

Let's start by considering a dataset with N points, or rows.

In this case,

n is the number of attributes in our dataset, so each data point x lies in the n-dimensional space of real numbers

x is the set of data points

y is the target variable, which is either -1 or 1 since this is a binary classification problem, denoting the first or the second class (e.g. Fit vs Not Fit)

We calculate the sample weight for each data point. AdaBoost assigns a weight to each training example to determine its significance in the training dataset. When the assigned weight is high, that training data point has a larger say in training the model. Similarly, when the assigned weight is low, it has minimal influence on the training.

Initially, all the data points will have the same sample weight w:

w = 1 / N

where N is the total number of data points.

The sample weights always sum to 1, so the value of each individual weight will always lie between 0 and 1. After this, we calculate the actual influence of a classifier in classifying the data points using the formula:

alpha = (1/2) * ln((1 - Total Error) / Total Error)

Alpha is how much influence this stump will have in the final classification. Total Error is the sum of the weights of the misclassified samples for that stump (for the first stump, with equal weights, this is simply the number of misclassifications divided by the training set size). We can plot a graph for Alpha by plugging in various values of Total Error ranging from 0 to 1.
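
A quick way to see the shape of this relationship is to evaluate the formula for a few Total Error values; the snippet below just plugs them in.

    import numpy as np

    # Alpha is large and positive for small error, 0 at error = 0.5
    # (no better than guessing), and negative when error exceeds 0.5.
    for err in (0.1, 0.3, 0.5, 0.7, 0.9):
        alpha = 0.5 * np.log((1 - err) / err)
        print(f"Total Error = {err:.1f}  ->  alpha = {alpha:+.3f}")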

After plugging in the actual values of Total Error for each stump, it's time for us to update the sample weights, which we had initially taken as 1/N for every data point. We'll do this using the following formula:

new sample weight = old sample weight * e^(± alpha)

In other words, the new sample weight is equal to the old sample weight multiplied by Euler's number raised to plus or minus alpha (which we just calculated in the previous step).

The two cases for the exponent (plus or minus alpha) are:

The exponent is minus alpha when the predicted and the actual output agree (the sample was classified correctly). In this case we decrease the sample weight from what it was before, since this point is already being handled well.
The exponent is plus alpha when the predicted output does not agree with the actual class (i.e. the sample is misclassified). In this case we increase the sample weight so that the same misclassification is penalised more heavily in the next stump. This is how the stumps are dependent on their predecessors.
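
A tiny numeric example of one such update, with made-up labels and predictions for five samples, makes the two cases concrete.

    import numpy as np

    y    = np.array([ 1, -1,  1,  1, -1])    # actual classes (made-up data)
    pred = np.array([ 1, -1, -1,  1, -1])    # stump predictions: the third sample is wrong
    w    = np.full(5, 1 / 5)                 # current sample weights (1/N)
    alpha = 0.5 * np.log((1 - 0.2) / 0.2)    # Total Error = 1/5 for this stump

    # e^(-alpha) for correctly classified samples, e^(+alpha) for the misclassified one
    w_new = w * np.exp(-alpha * y * pred)
    w_new = w_new / w_new.sum()              # renormalise so the weights sum to 1
    print(w_new)                             # the misclassified sample's weight has grown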

Support Vector Machine (SVM):


“Support Vector Machine” (SVM) is a supervised machine learning algorithm that can be used for both classification and regression challenges. However, it is mostly used in classification problems. In the SVM algorithm, we plot each data item as a point in n-dimensional space (where n is the number of features you have), with the value of each feature being the value of a particular coordinate. Then, we perform classification by finding the hyper-plane that differentiates the two classes very well.

Support Vectors are simply the coordinates of the individual observations closest to the boundary. The SVM classifier is the frontier that best segregates the two classes (a hyper-plane, or a line in two dimensions).
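
As a minimal sketch, this is what fitting such a maximum-margin classifier looks like with scikit-learn's SVC; the blob data is synthetic and stands in for any two-class problem.

    from sklearn.datasets import make_blobs
    from sklearn.svm import SVC

    X, y = make_blobs(n_samples=200, centers=2, random_state=0)

    clf = SVC(kernel="linear")       # find the maximum-margin separating hyper-plane
    clf.fit(X, y)

    print("number of support vectors:", len(clf.support_vectors_))
    print("training accuracy:", clf.score(X, y))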

Let’s understand:

Identify the right hyper-plane (Scenario-1): Here, we have three hyper-planes (A, B, and C). Now,
identify the right hyper-plane to classify stars and circles.

We need to remember a rule of thumb to identify the right hyper-plane: “Select the hyper-plane which segregates the two classes better”. In this scenario, hyper-plane “B” has performed this job excellently.

Identify the right hyper-plane (Scenario-2): Here, we have three hyper-planes (A, B, and C) and all
are segregating the classes well.

Here, maximizing the distance between the nearest data point (of either class) and the hyper-plane will help us decide the right hyper-plane. This distance is called the Margin.

The margin for hyper-plane C is high as compared to both A and B. Hence, we name C as the right hyper-plane. Another compelling reason for selecting the hyper-plane with the higher margin is robustness: if we select a hyper-plane with a low margin, there is a high chance of misclassification.
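
For a linear SVM this margin can also be read off the fitted model: the width of the band between the two classes is 2 divided by the norm of the learned weight vector w. A small sketch with synthetic data and scikit-learn's SVC:

    import numpy as np
    from sklearn.datasets import make_blobs
    from sklearn.svm import SVC

    X, y = make_blobs(n_samples=200, centers=2, random_state=0)
    clf = SVC(kernel="linear", C=1000.0).fit(X, y)   # large C approximates a hard margin

    w = clf.coef_[0]
    print("margin width:", 2 / np.linalg.norm(w))    # the distance SVM maximizes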

Identify the right hyper-plane (Scenario-3): Use the rules discussed in the previous scenarios to identify the right hyper-plane.

Some of us may have selected hyper-plane B, as it has a higher margin compared to A. But here is the catch: SVM selects the hyper-plane which classifies the classes accurately prior to maximizing the margin. Here, hyper-plane B has a classification error while A has classified everything correctly. Therefore, the right hyper-plane is A.

Can we classify two classes (Scenario-4)?: Below, I am unable to segregate the two classes using a straight line, as one of the stars lies in the territory of the other (circle) class as an outlier.

As I have already mentioned, the one star at the other end is like an outlier for the star class. The SVM algorithm has a feature to ignore outliers and find the hyper-plane that has the maximum margin. Hence, we can say that SVM classification is robust to outliers.
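
In practice this tolerance is controlled by a regularization parameter; in scikit-learn's SVC it is called C, and a smaller C lets the model ignore a stray point in exchange for a wider margin. A small sketch with made-up points:

    from sklearn.svm import SVC

    # Two clusters plus one stray "star" sitting inside the circle cluster
    X = [[0, 0], [1, 0], [0, 1], [1, 1], [0.5, 0.5],
         [5, 5], [6, 5], [5, 6], [6, 6]]
    y = [0, 0, 0, 0, 1,     # the fifth point is the outlying star
         1, 1, 1, 1]

    # A small C gives a soft margin: the outlier is tolerated and the separating
    # line stays between the two main clusters.
    clf = SVC(kernel="linear", C=0.1).fit(X, y)
    print(clf.predict([[0.5, 0.5]]))   # likely still labelled 0, i.e. the outlier is ignored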

Find the hyper-plane to segregate two classes (Scenario-5): In the scenario below, we can't have a linear hyper-plane between the two classes, so how does SVM classify these two classes? Till now, we have only looked at linear hyper-planes.
SVM can solve this problem easily! It solves it by introducing an additional feature. Here, we will add a new feature z = x^2 + y^2. Now, let's plot the data points on the x and z axes (a small code sketch of this step follows the list below):

In the above plot, the points to consider are:

 All values for z will always be positive, because z is the squared sum of both x and y
 In the original plot, the red circles appear close to the origin of the x and y axes, leading to a lower value of z, while the stars are relatively far from the origin, resulting in a higher value of z.
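
Here is a small sketch of that lifting step, with synthetic rings standing in for the circles and stars: after adding z = x^2 + y^2, a plain linear SVM separates the two classes.

    import numpy as np
    from sklearn.svm import SVC

    # Synthetic stand-in data: circles on an inner ring, stars on an outer ring
    rng = np.random.default_rng(0)
    angles = rng.uniform(0, 2 * np.pi, 100)
    inner = np.c_[0.5 * np.cos(angles[:50]), 0.5 * np.sin(angles[:50])]   # circles
    outer = np.c_[2.0 * np.cos(angles[50:]), 2.0 * np.sin(angles[50:])]   # stars
    X = np.vstack([inner, outer])
    y = np.array([0] * 50 + [1] * 50)

    # Add the new feature z = x^2 + y^2: small for the circles, large for the stars
    z = (X ** 2).sum(axis=1, keepdims=True)
    X_lifted = np.hstack([X, z])

    clf = SVC(kernel="linear").fit(X_lifted, y)
    print("accuracy with the extra z feature:", clf.score(X_lifted, y))   # now separable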

In the SVM classifier, it is now easy to have a linear hyper-plane between these two classes. But another question arises: do we need to add this feature manually to obtain such a hyper-plane? No, the SVM algorithm has a technique called the kernel trick. An SVM kernel is a function that takes a low-dimensional input space and transforms it into a higher-dimensional space, i.e. it converts a non-separable problem into a separable one. It is mostly useful in non-linear separation problems. Simply put, it does some extremely complex data transformations and then finds the way to separate the data based on the labels or outputs you’ve defined.
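
As a minimal sketch of the same idea with scikit-learn, using make_circles as stand-in data: a linear kernel fails on the two rings, while an RBF kernel (one common form of the kernel trick) separates them in the original x, y space.

    from sklearn.datasets import make_circles
    from sklearn.svm import SVC

    X, y = make_circles(n_samples=200, factor=0.3, noise=0.05, random_state=0)

    linear = SVC(kernel="linear").fit(X, y)
    rbf    = SVC(kernel="rbf").fit(X, y)
    print("linear kernel accuracy:", linear.score(X, y))   # poor: no straight line separates the rings
    print("RBF kernel accuracy:", rbf.score(X, y))         # close to 1.0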

When we look at the hyper-plane in the original input space, it looks like a circle.
