
NEURAL NETWORKS, DATA CLASSIFICATION AND MNIST

JEFFREY L. POPYACK

THE PROBLEM: RECOGNITION / CLASSIFICATION

• Classification (the simple version):
  • Given a collection of data {x0, x1, …, xn-1}, where each datum can be classified as one of a set of possible values {y0, y1, …, yM-1},
  • Create an algorithm that will
    • classify each sample item correctly
    • learn features from the samples that can be applied correctly to new items
LEARNING FROM EXAMPLES

• General Learning:
  • Given a collection of sample data {x0, x1, …, xn-1}, where each datum can be classified as one of a set of possible values {y0, y1, …, yM-1}:
  • Create an algorithm that will
    • classify each sample item correctly
    • learn features from the samples that can be applied correctly to new items
• Examples:
  • Given a set of emails, which we have designated as spam or not spam, learn to classify future items correctly as spam or not spam.
  • Given a collection of handwritten digits, learn to classify new handwritten digits.
LEARNING FROM EXAMPLES

• General approach:
  • “Ground truth”: data that has already been classified (correctly, we hope). Split into training data + testing data.
  • “Training data”: used to train your algorithm.
    • Need to know the answers in order to train the algorithm correctly.
    • Hope/expect that nearly all will be categorized correctly by the algorithm.
  • “Testing data”: used to test how well your algorithm works on data that was not used in the training.
    • Pretend not to know the answers; use the answers to determine whether the algorithm classified correctly.
• Examples:
  • Spam detection: use last month’s emails to train your algorithm; use this week’s to test it.
  • Handwritten digits: with a set of 500 samples, use 400 to train and 100 to test (a sketch of this split follows).
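As a concrete illustration of the 400/100 split above, here is a minimal Python sketch; the samples and labels names are placeholders for whatever labeled data is on hand, not something defined in these slides.

import random

def train_test_split(samples, labels, n_train, seed=0):
    """Shuffle the labeled data, then split it into training and testing sets."""
    indices = list(range(len(samples)))
    random.Random(seed).shuffle(indices)          # reproducible shuffle
    train_idx, test_idx = indices[:n_train], indices[n_train:]
    training_data = [(samples[i], labels[i]) for i in train_idx]
    testing_data = [(samples[i], labels[i]) for i in test_idx]
    return training_data, testing_data

# With 500 labeled handwritten-digit samples: 400 to train, 100 to test.
# training_data, testing_data = train_test_split(samples, labels, n_train=400)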
LEARNING FROM EXAMPLES

• General approach, continued:
  • Use the results of testing to tweak your algorithm
  • Try again
  • Repeat…
  • When satisfied with the results, you are ready to try new, unclassified data
• Issues:
  • How will your algorithm differ with different choices of training and testing data?
  • What to do with examples on which the algorithm fails?
  • Can results from several variations of the basic algorithm be used together? (One possible approach, majority voting across variants, is sketched below.)
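A minimal sketch of that voting idea: train several variants of the basic algorithm and combine their predictions by majority vote. The classifier objects here are hypothetical stand-ins, not code from the slides.

from collections import Counter

def ensemble_predict(classifiers, sample):
    """Majority vote: ask each trained variant for a label, return the most common one."""
    votes = [classify(sample) for classify in classifiers]
    return Counter(votes).most_common(1)[0][0]

# Example with three hypothetical variants trained on different training sets:
# label = ensemble_predict([variant_a, variant_b, variant_c], some_digit_image)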
THE MNIST DATABASE OF HANDWRITTEN DIGITS

NIST (National Institute of Standards and Technology):
• Idea: Train a learning algorithm to recognize handwritten digits, using samples from Census Bureau employees (adult professionals who spent a lot of time writing numbers, so the samples were expected to be legible). Test the algorithm on data collected from less reliable sources.
• Ground truth: images have been labeled with their correct values
• Collected binary images of handwritten digits
• Training data:
  • Special Database 3 (SD-3): collected from Census Bureau employees.
• Testing data:
  • Special Database 1 (SD-1): collected from high-school students.
  • 58,527 digit images written by 500 different writers.
  (What could possibly go wrong?)
THE MNIST DATABASE OF HANDWRITTEN DIGITS

MNIST (Modified NIST database):
• Yann LeCun, Courant Institute, NYU
  Corinna Cortes, Google Labs, New York
  Christopher J.C. Burges, Microsoft Research, Redmond
• Modified: images size-normalized to 28x28 pixels
• Mixture of the original training/testing data:
  • Training Set: 30,000 patterns each from SD-3 and SD-1 (from ~250 writers)
  • Testing Set: 60,000 patterns from SD-3 and SD-1 (~250 writers, disjoint from the Training Set writers)
    • The typical benchmark Testing Set is 5,000 patterns each from SD-3 and SD-1
• The website provides a 60,000-element Training Set and a 10,000-element Test Set.
  http://yann.lecun.com/exdb/mnist/
THE MNIST DATABASE OF HANDWRITTEN DIGITS

• Training set: 60,000 examples
  Test set: 10,000 examples
• Digits have been size-normalized and centered in a fixed-size image.
• “It is a good database for people who want to try learning techniques and pattern recognition methods on real-world data while spending minimal efforts on preprocessing and formatting.”
  http://yann.lecun.com/exdb/mnist/

Image: http://neuralnetworksanddeeplearning.com/chap1.html
NEURAL NETWORKS AND DEEP LEARNING

• Based on Neural Networks and Deep Learning
  • by Michael Nielsen
  • http://neuralnetworksanddeeplearning.com/ (a sketch of loading the MNIST data with the book’s loader follows)
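A minimal sketch of loading MNIST with the mnist_loader module that accompanies Nielsen’s book. Assumptions here: the book’s code and its pickled data file are available locally, and we use its load_data_wrapper convention (which splits the 60,000 training images into 50,000 training + 10,000 validation examples).

# Sketch: load MNIST using the mnist_loader module from Nielsen's book code
# (assumes that code and its mnist.pkl.gz data file are on the local path).
import mnist_loader

training_data, validation_data, test_data = mnist_loader.load_data_wrapper()
training_data = list(training_data)   # some Python 3 ports return iterators
test_data = list(test_data)

# Each training example is a pair (x, y): x is a 784x1 vector of greyscale
# pixel values, y is a 10x1 one-hot vector marking the correct digit.
print(len(training_data), "training examples,", len(test_data), "test examples")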
SAMPLE NEURAL NETWORK

Three-Layer Neural Network:
• Input Layer:
  28x28 pixels = 784 input values
  0.0 = white, 1.0 = black, values in between = shades of grey
• Hidden Layer:
  15 neurons
• Output Layer:
  10 neurons, one for each digit 0-9.
  Whichever neuron has the highest activation is the selected output (see the training sketch below).

Image: http://neuralnetworksanddeeplearning.com/chap1.html
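A sketch of building and training this three-layer network with the book’s network.py. The 30 epochs and learning rate 3.0 come from the next slides; the mini-batch size of 10 is the book’s usual choice and is an assumption here.

import mnist_loader
import network          # network.py from Nielsen's book code

training_data, _, test_data = mnist_loader.load_data_wrapper()
training_data, test_data = list(training_data), list(test_data)

# 784 inputs (one per pixel), 15 hidden sigmoid neurons, 10 outputs (digits 0-9)
net = network.Network([784, 15, 10])

# Stochastic gradient descent: 30 epochs, mini-batch size 10, learning rate 3.0.
# With test_data supplied, it prints "Epoch k: correct / 10000" after each epoch,
# which is the format of the results shown on the following slides.
net.SGD(training_data, 30, 10, 3.0, test_data=test_data)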
SAMPLE NEURAL NETWORK

Sample Results:
Training for 30 epochs, learning rate 3.0
>>> net = network.Network([784, 15, 10])

Epoch 0: 8870 / 10000
Epoch 1: 9094 / 10000
Epoch 2: 9112 / 10000
. . .
Epoch 27: 9275 / 10000
Epoch 28: 9283 / 10000
Epoch 29: 9257 / 10000
92.6% accuracy

Can we do better with more hidden units?

Image: http://neuralnetworksanddeeplearning.com/chap1.html
SAMPLE NEURAL NETWORK

Sample Results:
Training for 30 epochs, learning rate 3.0
>>> net = network.Network([784, 30, 10])

Epoch 0: 9057 / 10000
Epoch 1: 9222 / 10000
Epoch 2: 9259 / 10000
. . .
Epoch 27: 9462 / 10000
Epoch 28: 9482 / 10000
Epoch 29: 9482 / 10000
94.8% accuracy

Can we do better with more hidden layers?

Image: http://neuralnetworksanddeeplearning.com/chap1.html
SAMPLE NEURAL NETWORK

Sample Results (one hidden layer):
Training for 30 epochs, learning rate 3.0
>>> net = network.Network([784, 30, 10])
Epoch 0: 9057 / 10000
Epoch 1: 9222 / 10000
Epoch 2: 9259 / 10000
. . .
Epoch 27: 9462 / 10000
Epoch 28: 9482 / 10000
Epoch 29: 9482 / 10000
94.8% accuracy

Sample Results (two hidden layers):
Training for 30 epochs, learning rate 3.0
>>> net = network.Network([784, 196, 36, 10])
Epoch 0: 9098 / 10000
Epoch 1: 9165 / 10000
Epoch 2: 9368 / 10000
. . .
Epoch 26: 9628 / 10000
Epoch 27: 9675 / 10000
Epoch 28: 9650 / 10000
96.5% accuracy

Can we do better with more hidden layers?

Image: http://neuralnetworksanddeeplearning.com/chap1.html
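To reproduce the comparison above, one might loop over the architectures discussed so far. A sketch under the same assumptions as before (Nielsen’s network.py and mnist_loader, mini-batch size 10):

import mnist_loader
import network

training_data, _, test_data = mnist_loader.load_data_wrapper()
training_data, test_data = list(training_data), list(test_data)

# The three layer configurations tried on the preceding slides.
for sizes in ([784, 15, 10], [784, 30, 10], [784, 196, 36, 10]):
    print("Training network with layer sizes", sizes)
    net = network.Network(sizes)
    net.SGD(training_data, 30, 10, 3.0, test_data=test_data)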
SAMPLE NEURAL NETWORK

Why not 100% accuracy?


• The model is not detailed enough to handle the intricacies of the data set.
• Some sample digits are too hard to recognize, even for humans.
DEEP LEARNING
CONVOLUTIONAL NEURAL NETWORKS

Some basic concepts:
• It is not beneficial/meaningful to have every input unit connected to every hidden unit.
• Instead, let a hidden unit be assigned to a subsection of the original image, extracting features of that subsection (convolutional layer, feature map).
• Follow each convolutional layer with a pooling layer that simplifies/summarizes the features in the convolutional layer (e.g., “was a particular feature found somewhere?”).
• This means the exact position of the object in the frame is not as important (see the sketch below).

[Figure: a convolutional layer feeding into a pooling layer]
http://neuralnetworksanddeeplearning.com/chap6.html
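Chapter 6 of the book builds these layers with Theano; as a library-free illustration of the two ideas above, here is a NumPy sketch of a single convolutional feature map followed by 2x2 max-pooling. The 28x28 input and 5x5 filter mirror the MNIST setting; the random weights are placeholders, not trained values.

import numpy as np

def conv2d_valid(image, kernel):
    """Slide a small kernel over the image ('valid' convolution): each output
    value summarizes one local patch, giving a feature map."""
    H, W = image.shape
    kh, kw = kernel.shape
    out = np.zeros((H - kh + 1, W - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out

def max_pool(feature_map, size=2):
    """2x2 max-pooling: keep only the strongest response in each block,
    so the exact position of a feature matters less."""
    H, W = feature_map.shape
    H, W = H - H % size, W - W % size
    fm = feature_map[:H, :W]
    return fm.reshape(H // size, size, W // size, size).max(axis=(1, 3))

# Toy example on a 28x28 "image" with a random 5x5 filter (placeholder weights)
rng = np.random.default_rng(0)
image = rng.random((28, 28))
kernel = rng.standard_normal((5, 5))
fmap = conv2d_valid(image, kernel)   # 24x24 feature map
pooled = max_pool(fmap)              # 12x12 after pooling
print(fmap.shape, pooled.shape)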
DEEP LEARNING EXAMPLE
An interactive visualization of a convolutional network classifying handwritten input; here it recognizes the drawn digit as “5”.
https://www.cs.ryerson.ca/~aharley/vis/conv/flat.html
