Unit Vi: TO Artificial Neural Network
Unit Vi: TO Artificial Neural Network
INTRODUCTION
TO
ARTIFICIAL NEURAL NETWORK
CONTENTS:
Perceptron Learning– Biological Neuron, Introduction to
ANN, McCulloch Pitts Neuron, Perceptron and its Learning
Algorithm, Sigmoid Neuron, Activation Functions: Tanh,
ReLu
Multi-layer Perceptron Model – Introduction, Learning
Parts of Neuron
Following are the different parts of a neuron:
Dendrites
These are branch-like structures that receive messages from other
neurons and allow the transmission of messages to the cell body.
Cell Body
Each neuron has a cell body with a nucleus, Golgi body, endoplasmic
reticulum, mitochondria and other components.
Axon
Axon is a tube-like structure that carries electrical impulse from the
cell body to the axon terminals that pass the impulse to another
neuron.
Synapse
It is the chemical junction between the terminal of one neuron and the
dendrites of another neuron.
INTRODUCTION TO ANN:
Artificial Neural Networks (ANN) are algorithms based on brain function
and are used to model complicated patterns and forecast issues.
The Artificial Neural Network (ANN) is a deep learning method that arose
from the concept of the human brain Biological Neural Networks.
The development of ANN was the result of an attempt to replicate the
workings of the human brain.
The workings of ANN are extremely similar to those of biological neural
networks, although they are not identical.
ANN algorithm accepts only numeric and structured data.
WHAT IS ARTIFICIAL NEURAL NETWORK(ANN)?
So, the value of both scenarios can be either 0 or 1. We can use the
value of both weights X1 and X2 as 1 and a threshold function as 1.
So, the neural network model will look like:
MCCULLOCH PITTS NEURON
Truth Table for this case will be:
1 0 0 0 0
2 0 1 1 1
3 1 0 1 1
4 1 1 2 1
So, I can say that,
input.
In this case, as we just sum up all the inputs for g and since all
Independent Variable: The factors which affect the dependent variables or which are
used to predict the values of the dependent variables are called independent variable,
also called as a predictor.
Outliers: Outlier is an observation which contains either very low value or very high
value in comparison to other observed values. An outlier may hamper the result, so it
should be avoided.
Outliers are defined as abnormal values in a dataset that don't go with the regular
distribution and have the potential to significantly distort any regression model.
Multicollinearity: If the independent variables are highly correlated with each other
than other variables, then such condition is called Multicollinearity. It should not be
present in the dataset, because it creates problem while ranking the most affecting
variable.
WHY DO WE USE REGRESSION ANALYSIS?
Regression estimates the relationship between the target and the independent
variable.
It is used to find the trends in data.
It helps to predict real/continuous values.
By performing the regression, we can confidently determine the most important
factor, the least important factor, and how each factor is affecting the other
factors.
Types of Regression
Linear Regression
Logistic Regression
Polynomial Regression
Support Vector Regression
Decision Tree Regression
Random Forest Regression
Ridge Regression
Lasso Regression:
LINEAR REGRESSION
R- Sqaured
MEAN ERROR (ME):
In this cost function, the error for each training data is
calculated and then the mean value of all these errors is
derived.
The errors can be both negative and positive. So they can
cancel each other out during summation giving zero mean
error for the model.
MEAN SQUARED ERROR (MSE) :
This improves the drawback we encountered in Mean Error above. Here a
square of the difference between the actual and predicted value is calculated
to avoid any possibility of negative error.
It is measured as the average of the sum of squared differences between
predictions and actual observations.
Where,
N=Total number of observation
Yi = Actual value
(a1xi+a0)= Predicted value.
Residuals: The distance between the actual value and
predicted values is called residual. If the observed points are
far from the regression line, then the residual will be high, and
so cost function will high. If the scatter points are close to the
regression line, then the residual will be small and hence the
cost function.
GRADIENT DESCENT:
Gradient Descent:
Gradient descent is used to minimize the MSE by
Model Performance:
The Goodness of fit determines how the line of
The equation for a model with two input variables can be written as:
y = β0 + β1.x1 + β2.x2
The equation for a model with three input variables can be written as:
y = β0 + β1.x1 + β2.x2 + β3.x3
variance.
INTRODUCTION TO DEEP LEARNING
The definition of Deep learning is that it is the branch of
machine learning that is based on artificial neural network
architecture. An artificial neural network or ANN uses layers of
interconnected nodes called neurons that work together to
process and learn from the input data.
In a fully connected Deep neural network, there is an input layer
and one or more hidden layers connected one after the other.
Each neuron receives input from the previous layer neurons or
the input layer. The output of one neuron becomes the input to
other neurons in the next layer of the network, and this process
continues until the final layer produces the output of the network.
The layers of the neural network transform the input data
through a series of nonlinear transformations, allowing the
network to learn complex representations of the input data.
INTRODUCTION TO DEEP LEARNING
• Today Deep learning AI has become one of the most popular and visible areas of
machine learning, due to its success in a variety of applications, such as computer
vision, natural language processing, and Reinforcement learning.