
CSC 485/585 DA 515

Introduction to Machine Learning

Lecture 7
MLP or ANNs
Chapter 10-2

Fall 2024
ANNs: artificial neural networks

Outline:
 Perceptron
 MLPs: Multilayer Perceptrons == ANNs
 Classification
 Regression

-----------
 Playground
https://playground.tensorflow.org

 Keras or PyTorch
 TensorBoard
Short History
 1958: Perceptron (a linear model)
 1969: Perceptrons shown to have limitations
 1980s: Multilayer perceptrons
   not significantly different from today's DNNs
 1986: Backpropagation
 1989: One hidden layer is "good enough", so why go deep?
   At the time, more than 3 hidden layers was usually not helpful
Toward Deep Learning

After 2000: more data and more computing power turned ANNs into DNNs.

 2009: GPUs adopted for training
 2011: Became popular in speech recognition
 2012: Won the ILSVRC image competition
 2014-2015: AlphaGo
 2022: ChatGPT
 Now: Deep Learning =
   lots of training data + parallel computation + scalable, smart algorithms (transformers)
ANN: Why Deep

 Recent publications show that deep networks are more efficient:
 With the same number of neurons, a deep network gets better performance.
 For the same performance, a deep network uses fewer neurons.
Shallow vs Deep NNs
 Shallow vs deep NNs? Deep is better.

 Deep Learning is built around ANNs, which form the core of its architecture.
 ANNs are versatile, powerful, and scalable.

 Applications: ideal for tackling large and highly complex Machine Learning tasks such as
 image classification (e.g., Google Images),
 speech recognition services (e.g., Apple's Siri),
 recommending the best videos to watch to hundreds of millions of users every day (e.g., YouTube), or
 learning to beat the world champion at the game of Go (DeepMind's AlphaGo),
 ChatGPT (a large-scale language model, or LLM).
Deep Models of Recent Years
1. Sk-learn ANN
 Classifier:
https://scikit-learn.org/stable/modules/generated/sklearn.neural_network.MLPClassifier.html

 For three layers (one hidden layer):

hidden_layer_sizes: array-like of shape (n_layers - 2,), default=(100,)
MLPClassifier(hidden_layer_sizes=(100,))
The ith element represents the number of neurons in the ith hidden layer.

 For four layers (two hidden layers): for example, if you specify

model = MLPClassifier(hidden_layer_sizes=(50, 25))

it means there are two hidden layers with 50 and 25 neurons respectively.

 Similar for regression: MLPRegressor
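To make this concrete, here is a minimal sketch of training an MLPClassifier; the make_moons toy dataset and the chosen layer sizes are illustrative assumptions, not from the slides:

from sklearn.datasets import make_moons
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier
from sklearn.preprocessing import StandardScaler
from sklearn.pipeline import make_pipeline

# illustrative toy data (not from the lecture)
X, y = make_moons(n_samples=1000, noise=0.2, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# feature scaling matters for MLPs; two hidden layers with 50 and 25 neurons
clf = make_pipeline(
    StandardScaler(),
    MLPClassifier(hidden_layer_sizes=(50, 25), max_iter=1000, random_state=42))
clf.fit(X_train, y_train)
print("Test accuracy:", clf.score(X_test, y_test))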


2. Implementing MLPs with Keras

Keras is a high-level Deep Learning API that allows you to easily build,
train, evaluate, and execute all sorts of neural networks.

APIs:
1. Sequential API: easy but limited
2. Functional API:
 for more complex topologies, or
 for models with multiple inputs or outputs.
2.1 Sequential API:
 Sequential API code:
from tensorflow import keras

model = keras.models.Sequential([
    keras.layers.Flatten(input_shape=[28, 28]),   # e.g., 28x28 grayscale images
    keras.layers.Dense(300, activation="relu"),
    keras.layers.Dense(100, activation="relu"),
    keras.layers.Dense(10, activation="softmax")  # 10 output classes
])
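After defining the model, it still has to be compiled and trained; a minimal sketch, assuming a 10-class image dataset such as Fashion MNIST with X_train, y_train, X_valid, y_valid already prepared:

model.compile(loss="sparse_categorical_crossentropy",
              optimizer="sgd",
              metrics=["accuracy"])

history = model.fit(X_train, y_train, epochs=30,
                    validation_data=(X_valid, y_valid))
model.evaluate(X_test, y_test)   # X_test, y_test assumed as well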

2.2 Building Complex Models Using the Functional API
 Layers are called like functions
 Wide & Deep model
 the input features are used twice: through the deep path and directly at the output

input_ = keras.layers.Input(shape=X_train.shape[1:])
hidden1 = keras.layers.Dense(30, activation="relu")(input_)
hidden2 = keras.layers.Dense(30, activation="relu")(hidden1)
concat = keras.layers.Concatenate()([input_, hidden2])
output = keras.layers.Dense(1)(concat)
model = keras.Model(inputs=[input_], outputs=[output])

Handling multiple inputs: split features

Handling multiple inputs

input_A = keras.layers.Input(shape=[5], name="wide_input")
input_B = keras.layers.Input(shape=[6], name="deep_input")
hidden1 = keras.layers.Dense(30, activation="relu")(input_B)
hidden2 = keras.layers.Dense(30, activation="relu")(hidden1)
concat = keras.layers.concatenate([input_A, hidden2])
output = keras.layers.Dense(1, name="output")(concat)
model = keras.Model(inputs=[input_A, input_B], outputs=[output])

See the figure on the previous slide.
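With two inputs, fit() expects one array per input; a minimal sketch, assuming X_train has 8 features split so the column counts match the declared input shapes [5] and [6] (the split is illustrative):

# wide path gets the first 5 columns, deep path gets the last 6
X_train_A, X_train_B = X_train[:, :5], X_train[:, 2:]
X_valid_A, X_valid_B = X_valid[:, :5], X_valid[:, 2:]

model.compile(loss="mse", optimizer="sgd")
history = model.fit((X_train_A, X_train_B), y_train, epochs=20,
                    validation_data=((X_valid_A, X_valid_B), y_valid))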

Multi-output

output = keras.layers.Dense(1, name="main_output")(concat)
aux_output = keras.layers.Dense(1, name="aux_output")(hidden2)
model = keras.Model(inputs=[input_A, input_B],
                    outputs=[output, aux_output])
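Each output needs its own loss when compiling; a minimal sketch assuming we care mostly about the main output (the 0.9/0.1 loss weights and the shared labels are illustrative):

model.compile(loss=["mse", "mse"], loss_weights=[0.9, 0.1], optimizer="sgd")

# fit() then takes one set of labels per output; here both outputs predict y
history = model.fit((X_train_A, X_train_B), (y_train, y_train), epochs=20,
                    validation_data=((X_valid_A, X_valid_B), (y_valid, y_valid)))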
5-Step Life-Cycle for Neural Network Models in Keras
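The slide's diagram is not reproduced here; the five steps are usually listed as define, compile, fit, evaluate, and predict. A minimal sketch with illustrative data and model sizes:

from tensorflow import keras
import numpy as np

# illustrative data: 200 samples, 10 features, binary labels
X = np.random.rand(200, 10).astype("float32")
y = (X.sum(axis=1) > 5).astype("int32")

# 1. define the model
model = keras.models.Sequential([
    keras.layers.Dense(16, activation="relu", input_shape=(10,)),
    keras.layers.Dense(1, activation="sigmoid")])
# 2. compile the model
model.compile(loss="binary_crossentropy", optimizer="adam", metrics=["accuracy"])
# 3. fit the model
model.fit(X, y, epochs=5, batch_size=32, verbose=0)
# 4. evaluate the model
loss, acc = model.evaluate(X, y, verbose=0)
# 5. make predictions
probs = model.predict(X[:3])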
Saving and Restoring a Model

Saving a trained Keras model:

model.save("my_keras_model.h5")

The HDF5 format saves:

 the model's architecture (including every layer's hyperparameters),
 the values of all the model parameters for every layer (e.g., connection weights and biases),
 the optimizer (including its hyperparameters and any state it may have).
Restoring a saved model

Loading the model is just as easy:

model = keras.models.load_model("my_keras_model.h5")

Reuse the model:

 for prediction
 for deployment
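A minimal usage sketch after restoring (X_new is an assumed batch of new samples):

y_proba = model.predict(X_new)       # class probabilities from the softmax layer
y_pred = y_proba.argmax(axis=-1)     # predicted class indices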
Using Callbacks

The fit() method accepts a callbacks argument: a list of objects that Keras will call
 at the start and end of training,
 at the start and end of each epoch,
 even before and after processing each batch.

For example, the ModelCheckpoint callback saves checkpoints of your model at regular intervals during training, by default at the end of each epoch:

[...]  # build and compile the model
checkpoint_cb = keras.callbacks.ModelCheckpoint("my_keras_model.h5")
history = model.fit(X_train, y_train, epochs=10, callbacks=[checkpoint_cb])
Callback example

checkpoint_cb = keras.callbacks.ModelCheckpoint("my_keras_model.h5",
                                                save_best_only=True)

history = model.fit(X_train, y_train, epochs=10,
                    validation_data=(X_valid, y_valid),
                    callbacks=[checkpoint_cb])

# roll back to the best model:
model = keras.models.load_model("my_keras_model.h5")

To keep only the best model, use a validation set during training and set save_best_only=True when creating the ModelCheckpoint.
Early Stopping: over epochs
 Train the model for many epochs and track the validation error after each epoch:
 if val_error < minimum_val_error: save the model (and the new minimum) and keep training
 otherwise, stop once the validation error has not improved for a while and roll back to the saved model
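A minimal sketch of this loop in Python (train_one_epoch and evaluate are hypothetical helpers; patience and max_epochs are assumed to be set, e.g. 10 and 100):

minimum_val_error = float("inf")
best_weights, epochs_without_progress = None, 0

for epoch in range(max_epochs):
    train_one_epoch(model)                          # hypothetical helper
    val_error = evaluate(model, X_valid, y_valid)   # hypothetical helper
    if val_error < minimum_val_error:
        minimum_val_error = val_error
        best_weights = model.get_weights()
        epochs_without_progress = 0
    else:
        epochs_without_progress += 1
        if epochs_without_progress >= patience:
            break

model.set_weights(best_weights)   # roll back to the best model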
Early stopping

To implement early stopping:

 use the EarlyStopping callback.

early_stopping_cb = keras.callbacks.EarlyStopping(patience=10,
                                                  restore_best_weights=True)

history = model.fit(X_train, y_train, epochs=100,
                    validation_data=(X_valid, y_valid),
                    callbacks=[checkpoint_cb, early_stopping_cb])

Training is interrupted when there is no progress on the validation set for a number of epochs (defined by patience), and restore_best_weights=True rolls back to the best weights found.
Demo
 Install TensorFlow using pip in the Jupyter notebook:
pip install tensorflow

1. Classification
2. Regression
3. Complex model
4. Save and restore

As you can see, you can build almost any architecture quite easily with the Functional API.

Parameter Tuning
 Grid Search (p. 321; might take hours); see the sketch after this list
 AutoML (searches for both the best structure and the best hyperparameters)

https://machinelearningmastery.com/automl-libraries-for-python/

 Wide or Deep:
 An MLP with just one hidden layer can theoretically model even the most
complex functions, provided it has enough neurons.
 But for complex problems, deep networks have a much higher parameter
efficiency than shallow ones: they can model complex functions using
exponentially fewer neurons than shallow nets, allowing them to reach
much better performance with the same amount of training data.

 Number of Hidden Layers / Number of Neurons per Hidden Layer
 the more complex the problem, the more neurons are needed; too many causes overfitting
 Learning Rate, Batch Size, and Other Hyperparameters
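A minimal grid-search sketch over a few MLPClassifier hyperparameters; the parameter grid, data names, and cv setting are illustrative assumptions, and the search can be very slow:

from sklearn.model_selection import GridSearchCV
from sklearn.neural_network import MLPClassifier

param_grid = {
    "hidden_layer_sizes": [(50,), (100,), (50, 25)],   # illustrative choices
    "learning_rate_init": [0.001, 0.01],
    "alpha": [0.0001, 0.001],                          # L2 regularization strength
}
grid = GridSearchCV(MLPClassifier(max_iter=1000, random_state=42),
                    param_grid, cv=3, n_jobs=-1)
grid.fit(X_train, y_train)            # X_train, y_train assumed from earlier
print(grid.best_params_, grid.best_score_)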
Summary

 Single Perceptron
 MLP: ANN for classification/regression

 Training: backpropagation

 Keras:
 Sequential/Functional API
 saving and restoring models
 callbacks for early stopping and keeping the best model
-- END --

 Next Week:
 Oct. 16: Naïve Bayes

 Homework
 HW4 Part A: due by this Friday
 HW4 Part B: due in two weeks (same day as the Midterm)
Mid-term: Oct. 23

 Mid-term: Oct. 23
 Lecture time
 2.5 hrs (all lectures)
 100 points
 paper-and-pencil, closed-book
 Materials: Lectures 1-7
