COMPARISON OF LEARNING ALGORITHMS FOR
HANDWRITTEN DIGIT RECOGNITION

Y. LeCun, L. Jackel, L. Bottou, A. Brunot, C. Cortes,
J. Denker, H. Drucker, I. Guyon, U. Muller,
E. Sackinger, P. Simard, and V. Vapnik

Bell Laboratories, Holmdel, NJ 07733, USA
Email: [email protected]

Abstract

This paper compares the performance of several classifier algorithms on a
standard database of handwritten digits. We consider not only raw accuracy,
but also rejection, training time, recognition time, and memory requirements.
1 Introduction
The simultaneous availability of inexpensive powerful computers, powerful
learning algorithms, and large databases has caused rapid progress in
handwriting recognition in the last few years. This paper compares the
relative merits of several classification algorithms developed at Bell
Laboratories and elsewhere for the purpose of recognizing handwritten digits.
While recognizing individual digits is only one of many problems involved in
designing a practical recognition system, it is an excellent benchmark for
comparing shape recognition methods. Though many existing methods combine a
handcrafted feature extractor and a trainable classifier, this study
concentrates on adaptive methods that operate directly on size-normalized
images.
2 Database
The database used to train and test the systems described in this paper was
constructed from NIST's Special Database 3 and Special Database 1, which
contain binary images of handwritten digits. Our training set was composed
of 30,000 patterns from SD-3 and 30,000 patterns from SD-1. Our test set
was composed of 5,000 patterns from SD-3 and 5,000 patterns from SD-1. The
60,000-pattern training set contained examples from approximately 250
writers. We made sure that the sets of writers of the training set and test
set were disjoint. All the images were size-normalized to fit in a 20x20
pixel box (while preserving the aspect ratio). For some experiments, the
20x20 images were deslanted using moments of inertia before being presented.
For other experiments they were only centered in a larger input field using
the center of mass. Grayscale pixel values were used to reduce the effects of
aliasing. Two methods (LeNet 1 and Tangent Distance) used versions of the
images subsampled to 16 by 16 pixels.
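The preprocessing steps above (centering by center of mass in a larger input
field, and deslanting from moments of inertia) can be sketched as follows.
This is only an illustrative reading of the description, not the authors'
code; it assumes NumPy/SciPy, a 20x20 grayscale array img with ink stored as
positive values, and a 28x28 output field.

    import numpy as np
    from scipy.ndimage import affine_transform

    def center_by_mass(img, out_size=28):
        """Paste img into an out_size x out_size field so that its center of
        mass lands (approximately) on the center of the field."""
        h, w = img.shape
        ys, xs = np.nonzero(img)
        m = img[ys, xs].astype(float)
        cy = np.average(ys, weights=m)
        cx = np.average(xs, weights=m)
        out = np.zeros((out_size, out_size), dtype=img.dtype)
        top = int(round(out_size / 2 - cy))
        left = int(round(out_size / 2 - cx))
        # clamp so the 20x20 patch always stays inside the field
        top = max(0, min(out_size - h, top))
        left = max(0, min(out_size - w, left))
        out[top:top + h, left:left + w] = img
        return out

    def deslant(img):
        """Remove the dominant slant with a horizontal shear chosen from the
        image's second moments (one reading of 'moments of inertia')."""
        ys, xs = np.nonzero(img)
        m = img[ys, xs].astype(float)
        cy = np.average(ys, weights=m)
        cx = np.average(xs, weights=m)
        cov_xy = np.average((xs - cx) * (ys - cy), weights=m)
        var_y = np.average((ys - cy) ** 2, weights=m)
        shear = cov_xy / var_y                 # slope of the principal axis
        # affine_transform maps output (row, col) to input coordinates:
        # input_col = output_col + shear * (row - cy), i.e. a row-dependent shift
        matrix = np.array([[1.0, 0.0], [shear, 1.0]])
        offset = np.array([0.0, -shear * cy])
        return affine_transform(img, matrix, offset=offset, order=1)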
[Figure: a convolutional network architecture; the layer labels in the
original figure read Convolution, Subsampling, Convolution, Subsampling.]
Gradient descent learning in multi-layer nets has a "self-regularization"
effect. Because the origin of weight space is a saddle point that is
attractive in almost every direction, the weights invariably shrink during
the first few epochs (recent theoretical analysis seems to confirm this
(Sara Solla, personal communication)). Small weights cause the sigmoids to
operate in the quasi-linear region, making the network essentially
equivalent to a low-capacity, single-layer network. As the learning
proceeds, the weights grow, which progressively increases the effective
capacity of the network. A better theoretical understanding of these
phenomena, and more empirical evidence, are definitely needed.
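A brief illustration of the quasi-linear regime (a sketch, with tanh standing
in for the sigmoid): for small weights every unit operates near the origin,
where

    tanh(u) ≈ u        for |u| << 1,

so a two-layer composition such as f(x) = W_2 tanh(W_1 x) is approximately
W_2 W_1 x, a single linear map. The network therefore behaves like a
low-capacity, single-layer classifier until the weights, and with them the
effective nonlinearity, grow.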
LeNet 1: To resolve the dilemma between small networks that cannot learn
the training set and large networks that seem overparameterized, one can
design specialized network architectures that are specifically designed to
recognize two-dimensional shapes such as digits, while eliminating irrelevant
distortions and variability. These considerations led us to the idea of
convolutional networks (LeCun et al. 90). In a convolutional net, each unit
takes its input from a local "receptive field" on the layer below, forcing it
to extract a local feature. Furthermore, units located at different places on
the image are grouped in planes, called feature maps, within which units are
constrained to share a single set of weights. This makes the operation
performed by a feature map shift-invariant, and equivalent to a convolution
followed by a squashing function. This weight-sharing technique greatly
reduces the number of free parameters. A single layer is formed of multiple
feature maps, extracting different types of features.
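As a concrete illustration of a single feature map (a sketch: the 5x5 kernel
size and the tanh squashing function are assumptions, not taken from the
text):

    import numpy as np

    def feature_map(image, kernel, bias=0.0):
        """One feature map: every unit applies the same kernel to its local
        receptive field (valid-mode cross-correlation, i.e. a convolution up
        to a kernel flip), followed by tanh squashing."""
        kh, kw = kernel.shape
        H, W = image.shape
        out = np.empty((H - kh + 1, W - kw + 1))
        for i in range(out.shape[0]):
            for j in range(out.shape[1]):
                receptive_field = image[i:i + kh, j:j + kw]
                out[i, j] = np.sum(receptive_field * kernel) + bias
        # shifting the input shifts the map by the same amount (shift invariance)
        return np.tanh(out)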
Complete networks are formed of multiple convolutional layers, extracting
features of increasing complexity and abstraction. Sensitivity to shifts and
distortions can be reduced by using lower-resolution feature maps in the
higher layers. This is achieved by inserting subsampling layers between the
convolution layers. It is important to stress that all the weights in such a
network are trained by gradient descent. Computing the gradient can be done
with a slightly modified version of the classical backpropagation procedure.
The training process causes convolutional networks to automatically
synthesize their own features. One of our first convolutional network
architectures, LeNet 1, shown in Figure 3, was trained on the database.
Because of LeNet 1's small input field, the images were down-sampled to
16x16 pixels and centered in the 28x28 input layer. Although about 100,000
multiply/add steps are required to evaluate LeNet 1, its convolutional
nature keeps the number of free parameters to only about 3000. The LeNet 1
architecture was developed using our own version of the USPS database, and
its size was tuned to match the available data. LeNet 1 achieved a 1.7%
test error.
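A LeNet-1-style network can be sketched in a modern framework as follows.
PyTorch is used purely as a convenient stand-in (it obviously postdates this
work), and the map counts are approximate; the original LeNet 1 also used
partial rather than full connections between feature maps.

    import torch.nn as nn

    lenet1_like = nn.Sequential(
        nn.Conv2d(1, 4, kernel_size=5),    # 28x28 input field -> 4 maps of 24x24
        nn.Tanh(),
        nn.AvgPool2d(2),                   # subsampling -> 4 maps of 12x12
        nn.Conv2d(4, 12, kernel_size=5),   # -> 12 maps of 8x8
        nn.Tanh(),
        nn.AvgPool2d(2),                   # subsampling -> 12 maps of 4x4
        nn.Flatten(),
        nn.Linear(12 * 4 * 4, 10),         # one output unit per digit class
    )

    # a few thousand free parameters, in the same range as the roughly 3000
    # quoted above
    print(sum(p.numel() for p in lenet1_like.parameters()))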
LeNet 4: Experiments with LeNet 1 made it clear that a larger convolutional
network was needed to make optimal use of the large size of the training
set. LeNet 4 was designed to address this problem. It is an expanded version
of LeNet 1 that has a 32x32 input layer in which the 20x20 images (not
deslanted) were centered by center of mass. It includes more feature maps and
an additional layer of hidden units that is fully connected to both the last
layer of feature maps and to the output units. LeNet 4 contains about 260,000
connections and has about 17,000 free parameters. Test error was 1.1%. In
previous experiments with ZIP code data, replacing the last layer of LeNet
with a more complex classifier improved the error rate. We replaced the last
layer of LeNet 4 with a Euclidean nearest-neighbor classifier, and with the
"local learning" method of Bottou and Vapnik, in which a local linear
classifier is retrained each time a new test pattern is shown. Neither of
these improved the raw error rate, although they did improve the rejection.
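The "replace the last layer" hybrids can be sketched as follows: a test
pattern is classified by K-NN in the feature space produced by the
penultimate layer of the trained net. The feature_extractor callable and the
value of k are assumptions for illustration only.

    import numpy as np

    def knn_on_features(feature_extractor, train_images, train_labels,
                        test_image, k=3):
        """train_labels: integer array; feature_extractor: trained net minus
        its output layer, returning a 1-D feature vector."""
        f_train = np.stack([feature_extractor(x) for x in train_images])
        f_test = feature_extractor(test_image)
        dists = np.linalg.norm(f_train - f_test, axis=1)    # Euclidean distance
        nearest = train_labels[np.argsort(dists)[:k]]
        return np.bincount(nearest, minlength=10).argmax()  # majority vote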
LeNet 5: LeNet 5 has an architecture similar to LeNet 4, but has more
feature maps, a larger fully-connected layer, and it uses a distributed
representation to encode the categories at the output layer, rather than the
more traditional "1 of N" code. LeNet 5 has a total of about 340,000
connections and 60,000 free parameters, most of them in the last two layers.
Again the non-deslanted 20x20 images centered by center of mass were used,
but the training procedure included a module that distorts the input images
during training using small, randomly picked affine transformations (shift,
scaling, rotation, and skewing). It achieved a 0.9% error rate.
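The distortion module can be sketched as below: each time a training image
is presented, a small random affine map (shift, scaling, rotation, skew) is
applied. The parameter ranges and the SciPy resampling are illustrative
guesses, not the values or code used for LeNet 5.

    import numpy as np
    from scipy.ndimage import affine_transform

    def random_affine(img, rng):
        """Apply a small random shift/scale/rotation/skew to a 2-D image."""
        angle = rng.uniform(-0.15, 0.15)            # radians
        scale = rng.uniform(0.9, 1.1)
        skew = rng.uniform(-0.15, 0.15)
        shift = rng.uniform(-2, 2, size=2)          # pixels
        c, s = np.cos(angle), np.sin(angle)
        # compose rotation, skew, and isotropic scaling (output -> input map)
        A = np.array([[c, -s], [s, c]]) @ np.array([[1.0, skew], [0.0, 1.0]]) / scale
        center = (np.array(img.shape) - 1) / 2.0
        offset = center - A @ center + shift        # keep the distortion centered
        return affine_transform(img, A, offset=offset, order=1)

    rng = np.random.default_rng(0)
    # distorted = random_affine(train_image, rng)   # drawn anew at every presentation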
Boosted LeNet 4: Following theoretical work by R. Schapire, Drucker et al.
(Drucker et al. 93) developed the "boosting" method for combining multiple
classifiers. Three LeNet 4s are combined: the first one is trained the usual
way; the second one is trained on patterns that are filtered by the first net
so that the second machine sees a mix of patterns, 50% of which the first net
got right and 50% of which it got wrong; finally, the third net is trained on
new patterns on which the first and the second nets disagree. During testing,
the outputs of the three nets are simply added. Because the error rate of
LeNet 4 is very low, it was necessary to artificially increase the number of
training samples with random distortions (as with LeNet 5) in order to get
enough samples to train the second and third nets. The test error rate was
0.7%, the best of any of our classifiers. At first glance, boosting appears
to be three times as expensive as a single net. In fact, when the first net
produces a high-confidence answer, the other nets are not called. The cost is
about 1.75 times that of a single net.
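At test time the boosted ensemble can be sketched as follows; net1, net2,
net3 and the confidence threshold are assumptions, but the control flow (add
the three outputs, skip the later nets when the first answers with high
confidence) follows the description above.

    import numpy as np

    def boosted_predict(net1, net2, net3, x, confidence_threshold=0.95):
        """Each net is a callable returning a vector of 10 class scores."""
        s1 = net1(x)
        if s1.max() > confidence_threshold:    # confident: later nets not called
            return int(np.argmax(s1))
        total = s1 + net2(x) + net3(x)         # otherwise simply add the outputs
        return int(np.argmax(total))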
Tangent Distance Classifier (TDC): The Tangent Distance Classifier (TDC) is
a nearest-neighbor method where the distance function is made insensitive to
small distortions and translations of the input image (Simard et al. 93). If
we consider an image as a point in a high-dimensional pixel space (where the
dimensionality equals the number of pixels), then an evolving distortion of a
character traces out a curve in pixel space. Taken together, all these
distortions define a low-dimensional manifold in pixel space. For small
distortions, in the vicinity of the original image, this manifold can be
approximated by a plane, known as the tangent plane. An excellent measure of
"closeness" for character images is the distance between their tangent
planes, where the set of distortions used to generate the planes includes
translations, scaling, skewing,
    400-10              8.4
    pairwise            7.6
    PCA+quadratic       3.3
    1000 RBF            3.6
    400-300-10          1.6
    LeNet 1             1.7
    LeNet 4             1.1
    LeNet 4 / Local     1.1
    LeNet 4 / K-NN      1.1
    LeNet 5             0.9
    Boosted LeNet 4     0.7
    K-NN Euclidean      2.4
    Tangent Distance    1.1
    Soft Margin         1.1

Figure 2: Error rate on the test set (%). The uncertainty in the quoted
error rates is about 0.1%.
squeezing, rotation, and line thickness variations. A test error rate of 1.1%
was achieved using 16x16 pixel images. Prefiltering techniques using a simple
Euclidean distance at multiple resolutions made it possible to reduce the
number of necessary Tangent Distance calculations. The figure for storage
requirements assumes that the patterns are represented at multiple
resolutions at one byte per pixel.
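The geometric idea can be sketched with a simplified, one-sided variant of
tangent distance: approximate the distortion manifold of a stored prototype
by its tangent plane and measure the distance of a test image to that plane.
The full method uses the tangent planes of both images; this sketch, with
tangent vectors assumed to be given (e.g. from finite differences of small
translations, rotations, etc.), only illustrates the projection step.

    import numpy as np

    def one_sided_tangent_distance(x, p, tangents):
        """x, p: flattened images (d,); tangents: (k, d) distortion directions
        at the prototype p. Returns the distance from x to the tangent plane."""
        T = np.asarray(tangents)
        # best distortion coefficients: min_a || (p + T^T a) - x ||
        a, *_ = np.linalg.lstsq(T.T, x - p, rcond=None)
        residual = (x - p) - T.T @ a          # component off the tangent plane
        return np.linalg.norm(residual)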
Optimal Margin Classifier (OMC): Polynomial classifiers are well-studied
methods for generating complex decision surfaces. Unfortunately, they are
impractical for high-dimensional problems, because the number of product
terms is prohibitive. A particularly interesting subset of decision surfaces
consists of those that correspond to hyperplanes at a maximum distance from
the convex hulls of the two classes in the high-dimensional space of product
terms. Boser, Guyon, and Vapnik (Boser et al. 92) realized that any
polynomial of degree k in this "maximum margin" set can be computed by first
computing the dot product of the input image with a subset of the training
samples (called the "support vectors"), raising the results to the k-th
power, and linearly combining the numbers thereby obtained. Finding the
support vectors and the coefficients amounts to solving a high-dimensional
quadratic minimization problem with linear inequality constraints. Using a
version of the procedure known as the Soft Margin Classifier (Cortes &
Vapnik 95), which is well suited to noisy problems, with a 4th-degree
decision surface, a test error of 1.1% was reached. The number of support
vectors obtained was around 25,000.
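The decision function described above can be sketched directly; the support
vectors, coefficients, and bias are assumed to come from solving the
quadratic program and are not computed here. A multi-class digit recognizer
would evaluate one such score per class.

    import numpy as np

    def polynomial_margin_score(x, support_vectors, coefficients, bias, degree=4):
        """x: flattened image (d,); support_vectors: (m, d); coefficients: (m,)."""
        kernel_values = (support_vectors @ x) ** degree   # (x . s_i)^k per support vector
        return kernel_values @ coefficients + bias        # linear combination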
4 Discussion
A summary of the performance of our classifiers is shown in Figures 2 to 5.
Figure 2 shows the raw error rate of the classifiers on the 10,000-example
test set. Boosted LeNet 4 is clearly the best, achieving a score of 0.7%,
closely followed by LeNet 5 at 0.9%. This can be compared to our estimate of
human performance, 0.2%.
Figure 3 shows the number of patterns in the test set that must be rejected
to attain a 0.5% error. In many applications, rejection performance is more
significant than raw error rate. Again, Boosted LeNet 4 has the best score.
The enhanced versions of LeNet 4 did better than the original LeNet 4, even
though their raw accuracies were identical.
    400-300-10          3.2
    LeNet 1             3.7
    LeNet 4             1.8
    LeNet 4 / Local     1.4
    LeNet 4 / K-NN      1.6
    Boosted LeNet 4     0.5
    K-NN Euclidean      8.1
    Tangent Distance    1.9
    Soft Margin         1.8

Figure 3: Percent of test patterns rejected to achieve 0.5% error on the
remaining test examples for some of the systems.
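Rejection curves like the one summarized in Figure 3 can be computed from
the classifier outputs alone. The sketch below is illustrative only: it
assumes a score vector is available for every test pattern and uses the gap
between the two largest outputs as the confidence measure, which is one
common choice rather than necessarily the one used here.

    import numpy as np

    def rejection_rate_for_target_error(scores, labels, target_error=0.005):
        """scores: (n, 10) classifier outputs; labels: (n,) true digits.
        Returns the fraction of least-confident patterns that must be rejected
        so the error on the remaining patterns is at most target_error."""
        top2 = np.sort(scores, axis=1)[:, -2:]
        confidence = top2[:, 1] - top2[:, 0]       # gap between best and runner-up
        correct = scores.argmax(axis=1) == labels
        order = np.argsort(confidence)             # least confident first
        errors = (~correct)[order]
        n = len(labels)
        for rejected in range(n + 1):
            kept = n - rejected
            if kept == 0 or errors[rejected:].sum() / kept <= target_error:
                return rejected / n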
    400-10              0.5
    pairwise            2
    PCA+quadratic       2.5
    1000 RBF            60
    400-300-10          10
    LeNet 1             15
    LeNet 4             30
    LeNet 4 / Local     2000
    LeNet 4 / K-NN      1000
    LeNet 5             40
    Boosted LeNet 4     50
    K-NN Euclidean      1000
    Tangent Distance    2000
    Soft Margin         2000

Figure 5: Memory requirements for classification of test patterns (in
MBytes). Numbers are based on 4 bits per pixel for K-NN, 1 byte per pixel
for Soft Margin and Tangent Distance, and 4 bytes per pixel for the rest.
Figure 5 shows the memory requirements of the methods. Memory requirements
for the neural networks assume 4 bytes per weight (and 4 bytes per prototype
component for the LeNet 4 / memory-based hybrids), but experiments show that
one-byte weights can be used with no significant change in error rate. Of the
high-accuracy classifiers, LeNet 4 requires the least memory.
5 Conclusions
This paper is a snapshot of ongoing work. Although we expect continued
changes in all aspects of recognition technology, there are some conclusions
that are likely to remain valid for some time.
Overall performance depends on many factors, including accuracy, running
time, and memory requirements. As computer technology improves,
larger-capacity recognizers become feasible. Larger recognizers in turn
require larger training sets. LeNet 1 was appropriate to the available
technology five years ago, just as LeNet 5 is appropriate now. Five years
ago, a recognizer as complex as LeNet 5 would have required several months of
training and more data than was available, and was therefore not even
considered.
For quite a long time, LeNet 1 was considered the state of the art. The
local learning classifier, the optimal margin classifier, and the tangent
distance classifier were developed to improve upon LeNet 1, and they
succeeded at that. However, they in turn motivated a search for improved
neural network architectures. This search was guided in part by estimates of
the capacity of various learning machines, derived from measurements of the
training and test error as a function of the number of training examples. We
discovered that more capacity was needed. Through a series of experiments in
architecture, combined with an analysis of the characteristics of
recognition errors, LeNet 4 and LeNet 5 were crafted.
We find that boosting gives a substantial improvement in accuracy, with a
relatively modest penalty in memory and computing expense. Also, distortion
models can be used to increase the effective size of a data set without
actually collecting more data.

The optimal margin classifier has excellent accuracy, which is most
remarkable because, unlike the other high-performance classifiers, it does
not include a priori knowledge about the problem. In fact, this classifier
would do just as well if the image pixels were permuted with a fixed mapping.
It is still much slower and more memory-hungry than the convolutional nets.
However, improvements are expected as the technique is relatively new.
Convolutional networks are particularly well suited for recognizing or re-
jecting shapes with widely varying size, position, and orientation, such as the
ones typically produced by heuristic segmenters in real-world string recognition
systems (see article by L. Jackel in these proceedings).
When plenty of data is available, many methods can attain respectable
accuracy. Although the neural-net methods require considerable training time,
trained networks run much faster and require much less space than memory-
based techniques. The neural nets' advantage will become more striking as
training databases continue to increase in size.
References
B. E. Boser, I. Guyon, and V. N. Vapnik, A Training Algorithm for Optimal
Margin Classifiers, in Proceedings of the Fifth Annual Workshop on
Computational Learning Theory 5, 144-152, Pittsburgh (1992).

L. Bottou and V. Vapnik, Local Learning Algorithms, Neural Computation 4,
888-900 (1992).

C. Cortes and V. Vapnik, The Soft Margin Classifier, Machine Learning, to
appear (1995).

R. O. Duda and P. E. Hart, Pattern Classification and Scene Analysis,
Chapter 4, John Wiley and Sons (1973).

H. Drucker, R. Schapire, and P. Simard, Boosting Performance in Neural
Networks, International Journal of Pattern Recognition and Artificial
Intelligence 7, 705-720 (1993).

I. Guyon, I. Poujaud, L. Personnaz, G. Dreyfus, J. Denker, and Y. LeCun,
Comparing Different Neural Net Architectures for Classifying Handwritten
Digits, in Proc. 1989 IJCNN II, 127-132, Washington DC, IEEE (1989).
Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard,
and L. D. Jackel, Handwritten Digit Recognition with a Back-Propagation
Network, in D. Touretzky (ed), Advances in Neural Information Processing
Systems 2, Morgan Kaufmann (1990).

Yuchun Lee, Handwritten Digit Recognition Using K-Nearest Neighbor,
Radial-Basis Functions, and Backpropagation Neural Networks, Neural
Computation 3, 3 (1991).
E. Sackinger and H.-P. Graf, A System for High-Speed Pattern Recognition and
Image Analysis, Proc. of the Fourth International Conference on
Microelectronics for Neural Networks and Fuzzy Systems, IEEE (1994).

J. Schurmann, A Multi-Font Word Recognition System for Postal Address
Reading, IEEE Trans., G27, 3 (1978).

P. Simard, Y. LeCun, and J. Denker, Efficient Pattern Recognition Using a
New Transformation Distance, Neural Information Processing Systems 5,
50-58, Morgan Kaufmann (1993).