
Concepts

Neural Network
Neuron
Synapse
Weights
Bias
Layers
Weighted Input
Activation Functions
Loss Functions
Optimization Algorithms
Gradient Accumulation

Neural Network
Neural networks are a class of machine learning algorithms used to model complex patterns in datasets
using multiple hidden layers and non-linear activation functions. A neural network takes an input, passes it
through multiple layers of hidden neurons (mini-functions with unique coefficients that must be learned),
and outputs a prediction that combines the contributions of all the neurons.

Neural networks are trained iteratively using optimization techniques like gradient descent. After each
cycle of training, an error metric is calculated based on the difference between prediction and target. The
derivatives of this error metric are calculated and propagated back through the network using a technique
called backpropagation. Each neuron's coefficients (weights) are then adjusted relative to how much they
contributed to the total error. This process repeats until the network error drops below an acceptable
threshold.
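As a rough illustration of this training loop, the sketch below fits a single linear neuron with gradient descent on a tiny made-up dataset; the data, learning rate, and stopping threshold are arbitrary choices for illustration, not values from the text.

import numpy as np

# Made-up data following y = 2x + 1
X = np.array([0.0, 1.0, 2.0, 3.0, 4.0])
y = 2.0 * X + 1.0

w, b = 0.0, 0.0  # coefficients to be learned
lr = 0.01        # learning rate

for epoch in range(10000):
    pred = w * X + b            # forward pass
    error = pred - y
    loss = np.mean(error ** 2)  # error metric (mean squared error)
    if loss < 1e-6:             # acceptable threshold
        break
    # Derivatives of the error metric with respect to each coefficient
    dw = 2.0 * np.mean(error * X)
    db = 2.0 * np.mean(error)
    # Adjust each coefficient in proportion to its contribution to the error
    w -= lr * dw
    b -= lr * db

print(w, b)  # should approach 2.0 and 1.0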

Neuron
A neuron takes a group of weighted inputs, applies an activation function, and returns an output.

Inputs to a neuron can either be features from a training set or outputs from a previous layer’s neurons.
Weights are applied to the inputs as they travel along synapses to reach the neuron. The neuron then
applies an activation function to the sum of the weighted inputs arriving over its incoming synapses and
passes the result on to all the neurons in the next layer.
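A single neuron can be sketched in a few lines of Python; the sigmoid activation and the specific numbers below are illustrative assumptions, not the only possible choices.

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def neuron(inputs, weights, bias):
    # Sum of weighted inputs plus the bias, passed through the activation function
    z = np.dot(inputs, weights) + bias
    return sigmoid(z)

print(neuron(np.array([0.5, -1.0]), np.array([0.8, 0.2]), bias=0.1))  # ~0.574
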
Synapse
Synapses are like roads in a neural network. They connect inputs to neurons, neurons to neurons, and
neurons to outputs. In order to get from one neuron to another, you have to travel along the synapse
paying the “toll” (weight) along the way. Each connection between two neurons has a unique synapse with
a unique weight attached to it. When we talk about updating weights in a network, we’re really talking
about adjusting the weights on these synapses.

Weights
Weights are values that control the strength of the connection between two neurons. That is, inputs are
typically multiplied by weights, and that defines how much influence the input will have on the output. In
other words: when the inputs are transmitted between neurons, the weights are applied to the inputs along
with an additional value (the bias).

Bias
Bias terms are additional constants attached to neurons and added to the weighted input before the
activation function is applied. Bias terms help models represent patterns that do not necessarily pass
through the origin. For example, if all your features were 0, would your output also be zero? Is it possible
there is some base value upon which your features have an effect? Bias terms typically accompany
weights and must also be learned by your model.
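A short sketch of the point above: with all features at zero the weighted sum contributes nothing, so the bias alone supplies the base value (the weights and bias below are made-up numbers).

import numpy as np

x = np.zeros(3)                 # all features are 0
w = np.array([0.4, -0.7, 1.2])  # learned weights (arbitrary example values)
b = 0.5                         # learned bias

z = np.dot(x, w) + b            # weighted input plus bias
print(z)                        # 0.5 -- the base value comes entirely from the bias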

Layers

Input Layer
Holds the data your model will train on. Each neuron in the input layer represents a unique attribute in your
dataset (e.g. height, hair color, etc.).
Hidden Layer
Sits between the input and output layers and applies an activation function before passing on the results.
There are often multiple hidden layers in a network. In traditional networks, hidden layers are typically
fully-connected layers — each neuron receives input from all the previous layer’s neurons and sends its
output to every neuron in the next layer. This contrasts with convolutional layers, where each neuron
sends its output to only some of the neurons in the next layer.
Output Layer
The final layer in a network. It receives input from the previous hidden layer, optionally applies an
activation function, and returns an output representing your model’s prediction.
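Putting the three layer types together, here is a minimal sketch of one forward pass through a fully-connected hidden layer and an output layer; the layer sizes, the ReLU hidden activation, and the random weights are illustrative assumptions.

import numpy as np

rng = np.random.default_rng(0)

# Input layer: 4 attributes for one example (e.g. height, hair color, ...)
x = rng.normal(size=4)

# Hidden layer: fully connected, 5 neurons, ReLU activation
W1 = rng.normal(size=(4, 5))
b1 = np.zeros(5)
hidden = np.maximum(0, x @ W1 + b1)

# Output layer: a single neuron returning the model's prediction
W2 = rng.normal(size=(5, 1))
b2 = np.zeros(1)
prediction = hidden @ W2 + b2

print(prediction)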

Weighted Input
A neuron’s input equals the sum of weighted outputs from all neurons in the previous layer. Each input is
multiplied by the weight associated with the synapse connecting the input to the current neuron. If there
are 3 inputs or neurons in the previous layer, each neuron in the current layer will have 3 distinct weights
— one for each synapse.
Single Input
Z = Input \cdot Weight = XW
Multiple Inputs
Z = \sum_{i=1}^{n} x_i w_i = x_1 w_1 + x_2 w_2 + \dots + x_n w_n
Notice, it’s exactly the same equation we use with linear regression! In fact, a neural network with a single
neuron is the same as linear regression! The only difference is the neural network post-processes the
weighted input with an activation function.
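As a quick check of the two forms above, the explicit sum and the dot product give the same weighted input; the input and weight values are made up.

import numpy as np

x = np.array([1.0, 2.0, 3.0])   # 3 inputs from the previous layer
w = np.array([0.2, -0.5, 0.1])  # the neuron's 3 distinct weights, one per synapse

z_sum = x[0] * w[0] + x[1] * w[1] + x[2] * w[2]  # x1*w1 + x2*w2 + x3*w3
z_dot = np.dot(x, w)                             # the same computation as X · W

print(z_sum, z_dot)  # both -0.5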

Activation Functions
Activation functions live inside neural network layers and modify the data they receive before passing it to
the next layer. They give neural networks their power: by transforming inputs with non-linear functions, the
network can model highly complex relationships between features. Popular activation functions include
:ref:`relu <activation_relu>` and :ref:`sigmoid <activation_sigmoid>`.
Activation functions typically have the following properties:

• Non-linear - In linear regression we’re limited to a prediction equation that looks like a straight
line. This is nice for simple datasets with a one-to-one relationship between inputs and outputs,
but what if the patterns in our dataset were non-linear (e.g. x², sin, log)? To model these
relationships we need a non-linear prediction equation [1]. Activation functions provide this
non-linearity.
• Continuously differentiable — To improve our model with gradient descent, we need our
output to have a nice slope so we can compute error derivatives with respect to weights. If our
neuron instead output only 0 or 1 (as in a perceptron), we wouldn’t know in which direction to update our
weights to reduce our error.
• Fixed Range — Activation functions typically squash the input data into a narrow range that
makes training the model more stable and efficient.
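For concreteness, here are minimal NumPy versions of the two activations mentioned above; note that sigmoid squashes its input into the fixed range (0, 1), while ReLU is non-linear but unbounded above.

import numpy as np

def relu(z):
    # Non-linear: zero for negative inputs, identity for positive inputs
    return np.maximum(0, z)

def sigmoid(z):
    # Smooth, differentiable, and squashed into the range (0, 1)
    return 1.0 / (1.0 + np.exp(-z))

z = np.array([-2.0, 0.0, 2.0])
print(relu(z))     # [0. 0. 2.]
print(sigmoid(z))  # approximately [0.119 0.5 0.881]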

Loss Functions
A loss function, or cost function, is a wrapper around our model's predict function that tells us "how good"
the model is at making predictions for a given set of parameters. The loss function has its own curve and
its own derivatives. The slope of this curve tells us how to change our parameters to make the model more
accurate! We use the model to make predictions. We use the cost function to update our parameters. Our
cost function can take a variety of forms as there are many different cost functions available. Popular loss
functions include: :ref:`mse` and :ref:`Cross-entropy Loss <loss_cross_entropy>`.
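As a rough sketch of those two losses, the functions below compute mean squared error and binary cross-entropy for made-up predictions and targets; the epsilon clipping is just a guard against log(0).

import numpy as np

def mse(predictions, targets):
    # Mean squared error: average squared difference between prediction and target
    return np.mean((predictions - targets) ** 2)

def binary_cross_entropy(predictions, targets, eps=1e-12):
    # Cross-entropy for binary targets, with clipping to avoid log(0)
    p = np.clip(predictions, eps, 1 - eps)
    return -np.mean(targets * np.log(p) + (1 - targets) * np.log(1 - p))

y_hat = np.array([0.9, 0.2, 0.6])  # made-up predicted probabilities
y = np.array([1.0, 0.0, 1.0])      # made-up targets
print(mse(y_hat, y))
print(binary_cross_entropy(y_hat, y))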

Optimization Algorithms

Gradient Accumulation
Gradient accumulation is a mechanism to split the batch of samples used for training a neural network
into several mini-batches that are run sequentially.
It makes it possible to train with batch sizes that would require more GPU memory than is available,
because each individual mini-batch does fit in the available memory.
In practice, gradient accumulation means running all the mini-batches sequentially (generally on the same
GPU) while accumulating their calculated gradients and not yet updating the model variables, i.e. the
weights and biases of the model. The variables must not be updated during accumulation, so that every
mini-batch computes its gradients against the same variable values. Only after the gradients of all the
mini-batches have been accumulated do we generate and apply a single update to the model variables.
This results in the same update to the model parameters as if we had used the whole global batch at once.
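A minimal PyTorch-style sketch of this idea is given below; the tiny linear model, the toy data, and the split into 4 mini-batches are assumptions for illustration and are not the implementation described in [2].

import torch
from torch import nn

# Illustrative setup (assumed): a tiny linear model, an optimizer, and toy data
model = nn.Linear(10, 1)
loss_fn = nn.MSELoss()
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

global_x = torch.randn(32, 10)
global_y = torch.randn(32, 1)

accumulation_steps = 4  # split the global batch into 4 mini-batches

optimizer.zero_grad()
for mini_x, mini_y in zip(global_x.chunk(accumulation_steps),
                          global_y.chunk(accumulation_steps)):
    loss = loss_fn(model(mini_x), mini_y)
    # Gradients accumulate in .grad; the weights and biases are not touched yet
    (loss / accumulation_steps).backward()

# Only now, after all mini-batches, apply a single update to the model variables
optimizer.step()

Scaling each mini-batch loss by the number of accumulation steps keeps the accumulated gradient equal to the gradient of the average loss over the whole batch.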
More details, a technical and algorithmic deep dive, how-to tutorials, and examples can be found at [2].
References

[1] http://sebastianruder.com/optimizing-gradient-descent/
[2] https://github.com/run-ai/runai/tree/master/runai/ga/
