Class 23
Class 23
1
Figure 1: An illustration of a fully connected neural network. In this representation, the input layer is
passive; each node receives an input (e.g., X1i could represent the i’th sample of a signal) and sends
that as output to each subsequent node. The nodes of the hidden layer and output layer are active (see
inset).
error:
N
1 X (i) ′(i)
2
J(w)
⃗ = y −y (1)
N i=0
2
Figure 2: Sketches of deep neural networks (DNNs). The blue lines indicate inputs, and the orange lines
indicate outputs. The length of the blue and orange lines represents the data dimension. The green
lines indicate intermedia connections. (a) In a fully connected neural network (FCNN), the inputs of
one layer are connected to every unit in the next layer. f stands for a nonlinear activation function. In
(b–f), the details of the layers are omitted and only the shape of the network is represented. Note that
(b) is a vanilla Convolutional Neural Network (CNN) that is cascaded by convolutional layers, pooling
layers, nonlinear layer. (c) is a Convolutional Autoencoder (CAE) that uses convolutional layers in the
encoder and deconvolutional layers in the decoder. each network architecture.
always improve the loss on the training data, an independent set of data is
held out (called validation data) to assess the point at which the model is
overfitting. Figure 6 shows example training curves, showing that there is
overfitting after 3 epochs.
10. Examples of neural networks are numerous. We’ll consider two examples.5
(a) An approach to denoise data (e.g., Zhu et al., 2019), that relies on tradi-
tional DSP (STFT) for pre-processing but extends traditional frequency
filtering.
(b) An approach to discriminate between certain types of seismic events
using the STFT and array processing for pre-processing (e.g., Ronac-
Giannone et al., 2024).
5
When reading these papers, it helps me to review Figure 3 and to think through the input X and the predictions and
true targets, Y’ and Y.
3
Figure 3: An illustration of the training process. The goal is to find the value of the weights that
minimize the loss, which is a measure of the difference between the predictions and true targets.
4
Figure 5: Two strategies commonly used for separating data into testing and training sets.
Figure 6: Training and validation loss from an example of training a neural network model.