Hypothesis in ML

The document discusses the concept of hypotheses in machine learning, explaining how they represent the model's assumptions about input-output relationships and the importance of hypothesis testing in validating research findings. It also covers inductive bias, its significance in generalizing from training data, and the challenges of underfitting and overfitting in model performance. Best practices for achieving a generalized model, including data handling, feature engineering, and regularization techniques, are also outlined.


HYPOTHESIS IN MACHINE LEARNING

A hypothesis in machine learning is the model’s presumption about the connection between the input features and the output. It is a representation of the mapping function that the algorithm is attempting to discover from the training set. The learning process involves adjusting the weights that parameterize the hypothesis to minimize the discrepancy between the predicted and actual outputs. The objective is to optimize the model’s parameters to achieve the best predictive performance on new, unseen data, and a cost function is used to assess the accuracy of the hypothesis.
What is Hypothesis Testing?
Before interpreting their findings, researchers must consider the possibility that the results could have occurred by chance. Hypothesis testing is the systematic process of determining whether the findings of a study support a specific theory that pertains to a population. A hypothesis about a population is assessed using sample data. A hypothesis test evaluates how unusual the result is, determines whether it is a reasonable chance variation, or determines whether the result is too extreme to be attributed to chance.
How does a Hypothesis work?
In most supervised machine learning algorithms, our main goal is to find a possible
hypothesis from the hypothesis space that maps the inputs to the proper outputs.
The following figure shows the common method for finding a possible hypothesis from
the hypothesis space:

Hypothesis Space (H)


Hypothesis space is the set of all possible legal hypotheses. This is the set from which
the machine learning algorithm selects the single hypothesis that best describes the
target function or the outputs.
Hypothesis (h)
A hypothesis is a function that best describes the target in supervised machine learning.
The hypothesis that an algorithm comes up with depends on the data and also on the
restrictions and bias that we have imposed on the data.
The hypothesis can be calculated as:
y = mx + b
Where,
 y = output (range)
 m = slope of the line
 x = input (domain)
 b = intercept
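
As a minimal sketch (the values of m and b below are purely illustrative; in practice they are learned by minimizing a cost function), the linear hypothesis above can be written directly in Python:

import numpy as np

def hypothesis(x, m, b):
    # Linear hypothesis h(x) = m*x + b mapping inputs (domain) to outputs (range)
    return m * x + b

# Illustrative parameters only
m, b = 2.0, 1.0
x = np.array([0.0, 1.0, 2.0, 3.0])
print(hypothesis(x, m, b))  # [1. 3. 5. 7.]
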
To better understand the hypothesis space and hypothesis, consider the following
coordinate plane that shows the distribution of some data:
Suppose we have test data for which we have to determine the outputs or results.
The test data is as shown below:

We can predict the outcomes by dividing the coordinate plane as shown below:


So the test data would yield the following result:

But note here that we could have divided the coordinate plane as:

The way in which the coordinate plane is divided depends on the data, the algorithm and
the constraints.
 All these legal possible ways in which we can divide the coordinate plane to
predict the outcome of the test data compose the hypothesis space.
 Each individual possible way is known as a hypothesis.
Hence, in this example the hypothesis space would be like:

Hypothesis in Statistics
In statistics, a hypothesis refers to a statement or assumption about a population
parameter. It is a proposition or educated guess that helps guide statistical analyses.
There are two types of hypotheses: the null hypothesis (H0) and the alternative
hypothesis (H1 or Ha).
 Null Hypothesis (H0): This hypothesis suggests that there is no significant
difference or effect, and any observed results are due to chance. It often represents
the status quo or a baseline assumption.
 Alternative Hypothesis (H1 or Ha): This hypothesis contradicts the null
hypothesis, proposing that there is a significant difference or effect in the
population. It is what researchers aim to support with evidence.
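
As an illustrative sketch (the sample values below are made up), a one-sample t-test with SciPy compares a sample against a hypothesized population mean and returns a p-value used to decide between H0 and H1:

from scipy import stats

# Hypothetical sample; H0: population mean = 50, H1: population mean != 50
sample = [51.2, 49.8, 52.5, 50.9, 48.7, 53.1, 51.6, 50.2]

t_stat, p_value = stats.ttest_1samp(sample, popmean=50)
print(f"t = {t_stat:.3f}, p = {p_value:.3f}")

# Reject H0 at the 5% significance level if p < 0.05
if p_value < 0.05:
    print("Reject the null hypothesis")
else:
    print("Fail to reject the null hypothesis")
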
INDUCTIVE BIAS
Definition
At its core, inductive bias refers to the set of assumptions that a learning algorithm makes
to predict outputs for inputs it has never seen. It’s the bias or inclination of a model
towards making a particular kind of assumption in order to generalize from its training
data to unseen situations.
Why is Inductive Bias Important?
Learning from Limited Data: In real-world scenarios, it’s practically impossible to have
training data for every possible input. Inductive bias helps models generalize to unseen
data based on the assumptions they carry.
Guiding Learning: Given a dataset, there can be countless hypotheses that fit the data.
Inductive bias helps the algorithm choose one plausible hypothesis over another.
Preventing Overfitting: A model with no bias or assumptions might fit the training data
perfectly, capturing every minute detail, including noise. This is known as overfitting. An
inductive bias can prevent a model from overfitting by making it favour simpler
hypotheses.
Types of Inductive Bias
Preference Bias: It expresses a preference for some hypotheses over others. For
example, in decision tree algorithms like ID3, the preference is for shorter trees over
longer trees.
Restriction Bias: It restricts the set of hypotheses considered by the algorithm. For
instance, a linear regression algorithm restricts its hypothesis to linear relationships
between variables.
Examples of Inductive Bias in Common Algorithms
Decision Trees: Decision tree algorithms, like ID3 or C4.5, have a bias towards shorter
trees and splits that categorize the data most distinctly at each level.
k-Nearest Neighbors (k-NN): The algorithm assumes that instances that are close to
each other in the feature space have similar outputs.
Neural Networks: They have a bias towards smooth functions. The architecture itself
(number of layers, number of neurons) can also impose bias.
Linear Regression: Assumes a linear relationship between the input features and the
output.
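
To make the contrast concrete, the short sketch below (using synthetic data) fits three scikit-learn models on the same nonlinear dataset; each model’s predictions reflect its own inductive bias:

import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.neighbors import KNeighborsRegressor
from sklearn.tree import DecisionTreeRegressor

# Synthetic 1-D data with a nonlinear (sine) target
rng = np.random.RandomState(0)
X = np.sort(rng.uniform(0, 6, 80)).reshape(-1, 1)
y = np.sin(X).ravel() + rng.normal(0, 0.1, 80)

models = {
    "LinearRegression (assumes a linear relationship)": LinearRegression(),
    "k-NN (assumes nearby points have similar outputs)": KNeighborsRegressor(n_neighbors=5),
    "DecisionTree (prefers shallow, axis-aligned splits)": DecisionTreeRegressor(max_depth=3),
}

X_test = np.linspace(0, 6, 5).reshape(-1, 1)
for name, model in models.items():
    model.fit(X, y)
    print(name, "->", np.round(model.predict(X_test), 2))
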
Trade-offs
While inductive bias helps models generalize from training data, there’s a trade-off. A
strong inductive bias means the model might not be flexible enough to capture all
patterns in the data. On the other hand, too weak a bias could lead the model to overfit
the training data.

Generalization in Machine Learning


Have you ever noticed that your model makes false predictions on your testing data? Even
though you have trained your model with enough data, you still get false negatives or false
positives for your test data. Why is that?
Either your model is underfitting or overfitting the training data. Generalization is a
measure of how well your model performs when predicting unseen data. So, it is important to
come up with the best generalized model to give better performance on future data.
Let us first understand what underfitting and overfitting are, and then see the
best practices to train a generalized model.
A: Underfitting, B: Generalized, C: Overfitting
What is Underfitting?
Underfitting is a state where the model cannot capture the patterns in the training data and
is also not able to generalize to new data. You can notice it with the help of the loss function
during training. A simple rule of thumb: if both the training loss and the cross-validation loss
are high, then your model is underfitting.
Lack of data, too few features, lack of variance in the training data or a high regularization
rate can cause underfitting. A simple solution is to add more shuffled data to your
training set. Depending on what causes the underfitting, you can try introducing
more meaningful features, feature crosses, or higher-order polynomial features (see the sketch
below), or reducing the regularization rate if you are using regularization. In some cases
trying a different training algorithm will work fine.
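
As a minimal sketch of one of these remedies (higher-order polynomial features, on synthetic data), a scikit-learn pipeline can lift a linear model out of underfitting:

import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

# Synthetic quadratic data that a plain linear model underfits
rng = np.random.RandomState(0)
X = rng.uniform(-3, 3, 100).reshape(-1, 1)
y = 0.5 * X.ravel() ** 2 + rng.normal(0, 0.2, 100)

linear = LinearRegression().fit(X, y)
poly = make_pipeline(PolynomialFeatures(degree=2), LinearRegression()).fit(X, y)

print("Linear R^2:", round(linear.score(X, y), 3))    # low: underfitting
print("Polynomial R^2:", round(poly.score(X, y), 3))  # close to 1
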
What is Overfitting?
Overfitting is a situation where your model learns the entire variance in the training data;
in other words, the model starts to memorize the noise instead of learning the underlying
pattern. A simple rule of thumb to identify overfitting: if your training loss is low and your
cross-validation loss is high, then your model is overfitting.
Unclean data, too little training data, and higher complexity of the model (too many
parameters or large weights) can cause overfitting. It is always recommended to preprocess
the data and create a good data pipeline. Select only necessary and meaningful features with
good variance. Reduce the complexity of the model using a good regularization algorithm (L1
norm or L2 norm), as sketched below.
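
As a brief sketch (on synthetic data), Ridge (L2) and Lasso (L1) in scikit-learn shrink the model’s weights to reduce its complexity:

import numpy as np
from sklearn.linear_model import LinearRegression, Ridge, Lasso

# Synthetic data with many noisy features; only the first two matter
rng = np.random.RandomState(0)
X = rng.normal(size=(60, 20))
y = X[:, 0] * 3.0 + X[:, 1] * -2.0 + rng.normal(0, 0.5, 60)

for name, model in [("OLS", LinearRegression()),
                    ("Ridge (L2)", Ridge(alpha=1.0)),
                    ("Lasso (L1)", Lasso(alpha=0.1))]:
    model.fit(X, y)
    # Smaller (or zeroed) coefficients indicate a simpler, more regularized model
    print(name, "sum of |coefficients| =", round(np.abs(model.coef_).sum(), 2))
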

Comparison
What are the best practices to get a Generalized model?
It is important to have a training dataset with good variance (i.e. a shuffled dataset). The
best way to do this is to compute a hash of an appropriate feature and split the data into
training, evaluation and test sets based on the computed hash value (a sketch follows below).
Here the evaluation set is used to cross-validate the trained model. It is always good to ensure
that the distributions of all the datasets are the same (stationary).
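
A minimal sketch of such a hash-based split (assuming each record has a stable id field to hash on; the field name and split ratios are illustrative):

import hashlib

def split_bucket(record_id, train=0.8, evaluation=0.1):
    # Deterministically assign a record to train/eval/test by hashing its id
    digest = hashlib.md5(str(record_id).encode("utf-8")).hexdigest()
    fraction = int(digest, 16) % 1000 / 1000.0
    if fraction < train:
        return "train"
    elif fraction < train + evaluation:
        return "eval"
    return "test"

# Example usage: the same id always lands in the same split
records = [{"id": i, "x": i * 0.1} for i in range(10)]
for r in records:
    print(r["id"], "->", split_bucket(r["id"]))
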
Handling outliers is also important, and it depends on the task you are working on.
If you are training a model to detect anomalies you should keep the outliers; in that
case, the anomalies may be the very labels you need to identify, so you cannot classify or
detect them without the outliers. On the other hand, if you are modeling a regression-based
classification it is good to remove outliers.
Use resampling during training. Resampling enables you to reconstruct your
sample dataset in different ways for each iteration. One of the most popular resampling
techniques is k-fold cross-validation. It trains and tests the model k times
with different subsets of your training data, as sketched below.
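
As a short sketch with scikit-learn (on synthetic data), cross_val_score runs k-fold cross-validation and reports one score per fold:

import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import KFold, cross_val_score

rng = np.random.RandomState(0)
X = rng.normal(size=(100, 5))
y = X @ np.array([1.0, -2.0, 0.5, 0.0, 3.0]) + rng.normal(0, 0.3, 100)

# 5-fold cross-validation: train on 4 folds, validate on the held-out fold, repeat 5 times
cv = KFold(n_splits=5, shuffle=True, random_state=0)
scores = cross_val_score(Ridge(alpha=1.0), X, y, cv=cv, scoring="r2")
print("Per-fold R^2:", np.round(scores, 3), "mean:", round(scores.mean(), 3))
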
It is always good to know when to stop training; judging the right stopping point is often
left to human insight. When you reach both a good training loss and a good
validation loss, stop training. A simple early-stopping rule is sketched below.
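
A minimal, self-contained sketch of the early-stopping idea (the validation-loss values below are made up): stop once the validation loss has not improved for a few epochs.

def early_stopping(val_losses, patience=5):
    # Return the epoch at which training should stop, given validation losses per epoch
    best_loss = float("inf")
    epochs_without_improvement = 0
    for epoch, val_loss in enumerate(val_losses):
        if val_loss < best_loss:
            best_loss = val_loss
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                return epoch, best_loss
    return len(val_losses) - 1, best_loss

# Hypothetical validation-loss curve: improves, then starts to overfit
losses = [0.90, 0.70, 0.55, 0.48, 0.45, 0.46, 0.47, 0.49, 0.52, 0.55, 0.60]
stop_epoch, best = early_stopping(losses, patience=3)
print(f"Stop at epoch {stop_epoch}, best validation loss {best}")
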

Learn to do some feature engineering when needed. In some cases, your model may not be
able to converge because there is no meaningful relation in the raw features you
have. Doing feature crosses and introducing new features with a meaningful relation helps
the model to converge (see the sketch below).
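
As a small sketch (the column names are made up for illustration), a feature cross is simply a new feature built from a combination of existing ones:

import pandas as pd

# Hypothetical raw features
df = pd.DataFrame({
    "width": [2.0, 3.0, 4.0, 5.0],
    "height": [1.0, 2.0, 2.5, 3.0],
})

# Feature cross: the product of two raw features captures their interaction,
# e.g. an "area"-like feature that a linear model could not express from width and height alone
df["width_x_height"] = df["width"] * df["height"]
print(df)
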

In addition to these, hyperparameter tuning and regularization algorithms also help the
model generalize for better performance.
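
As a brief sketch (on synthetic data), GridSearchCV in scikit-learn tunes a regularization hyperparameter by cross-validation:

import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import GridSearchCV

rng = np.random.RandomState(0)
X = rng.normal(size=(100, 10))
y = X[:, 0] * 2.0 - X[:, 1] + rng.normal(0, 0.3, 100)

# Search over the regularization strength alpha using 5-fold cross-validation
search = GridSearchCV(Ridge(), param_grid={"alpha": [0.01, 0.1, 1.0, 10.0]}, cv=5)
search.fit(X, y)
print("Best alpha:", search.best_params_["alpha"],
      "best CV score:", round(search.best_score_, 3))
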

Hope you got a basic idea of generalization, underfitting, and overfitting. Use this as a
base and keep exploring the subtopics for a deeper understanding.

Don’t forget to applaud if you find this article useful. Your doubts and feedback are
always welcome.
