1 FFT and Spectrogram: 1.1 Fourier Transform For Finite Duration Signals
1 FFT and Spectrogram: 1.1 Fourier Transform For Finite Duration Signals
1.1
In order to analyze the frequency content of a finite duration discrete time signal x with N samples, we use
the Discrete Fourier Transform (DFT):
x
(k) =
N
1
X
x(n)ei
2k
N n
k = 0, . . . , N 1.
n=0
This can be interpreted as the Fourier Transform of the finite duration signal evaluated at the frequencies
f = k/N . Another interpretation is that the DFT is the Fourier Series of the periodic extension of x but is
missing the 1/N scaling factor. This second interpretation gives rise to the Inverse DFT formula.
In order to go from x
back to x, we use the Inverse DFT:
x(n) =
N 1
2k
1 X
x
(k)ei N n ,
N
n = 0, . . . , N 1
k=0
1
Fx
,
N
= F x,
=
1
1
1
..
.
1
i 2
N
i 4
N
e
4
ei N
..
.
ei2
N 1
N
e
8
ei N
..
.
ei2
2(N 1)
N
...
...
...
..
.
1
e
2(N 1)
ei2 N
..
.
...
ei2
1
i2 NN
(N 1)2
N
We will often want to graphically depict the frequency content found in the DFT x
. Remember that x
is
in general a complex valued vector even if x is real valued, so it is usual to plot both the magnitude |
x(k)|
and phase arg(
x(k)) on separate graphs.
The elements in the x
vector are coefficients for the frequencies f = 0, 1/N, 2/N, . . . , (N 1)/N . However,
notice that the high frequencies are equivalent to low negative frequencies due to aliasing. For example,
f = (N 1)/N is equivalent to f = 1/N . It is common to rearrange the vector x
to represent the
frequency components in the range [1/2, 1/2] rather than [0, 1]. Please read about the Matlab function
fftshift which is used for this purpose.
1.2
To calculate the DFT of a function in Matlab, use the function fft. FFT stands for Fast Fourier Transform,
which is a family of algorithms for computing the DFT. A straight computation of the DFT from the
formulas above would take n2 complex multiplications and n(n 1) complex additions. Algorithms have
been developed (for example, the Cooley-Tukey FFT) which exploit symmetries of the roots of unity to
reduce the number of calculations to O(n log2 n). It is fastest when n is a power of 2, followed by composite
numbers of small primes. Read the Matlab help page for more information.
To take the DFT of a vector x of length n, use the command:
hx=fft(x);
1
1.3
Sampled Signals
Suppose you begin with a continuous time signal x(t) defined over a finite time period [0, T ] and then you
construct a discrete-time signal x[n] by taking uniformly spaced samples of x(t). If N uniformly spaced
samples of x are taken, then the sampling frequency is Fs = N/T (the sampling frequency would technically
be (N 1)/T if the first sample is x(0) and the last sample is x(T )).
We can take the DFT of the discrete-time signal x[n] and use this to represent the frequency content of the
continuous-time signal x(t). This may not be valid, but lets go with it anyway. Youll learn more about this
in the lecture on sampling. If we represent the discrete-time frequency spectrum on the range [1/2, 1/2],
then we translate this to continuous-time frequencies by multiplying by Fs . Thus, given a sampled signal
with sampling frequency Fs , we plot its DFT magnitude and phase versus frequencies in Hz in the range
[Fs /2, Fs /2].
1.4
The Spectrogram
Let x be signal of length N . Consider consecutive segments (or clips) of x of length m where m n and
let X Rm(N m+1) be the matrix with the consecutive segments as consecutive columns. In other words,
[x[0], x[1], . . . , x[m 1]]T is the first column, [x[1], x[2], . . . , x[m]]T is the second column, and so forth. Both
the rows and columns of X are indexed by time. We see that X is a highly redundant representation of x.
whose columns are the DFT of the columns
The spectrogram of x with window size m is the matrix X
of X. So
FX
1
FX
m
are indexed by frequency and the columns are indexed by time. Each location on X
by calling the function imagesc. This first scales the image to the full range of the color map and then
displays it as an image. The x-axis is the time axis and the y-axis is the frequency axis. You can change the
color map with the command colormap. Since the signal is real, the first half and the second half of each
you only need
column are redundant by a symmetry property of the DFT. Therefore, when you display X,
to show the first half of each column. Also, by default imagesc plots from the top left corner, and we will
want to see it upside down, so you use the following commands (assuming HX is your spectrogram):
ax=imagesc(t,f,HX); % t is the time vector for the x-axis
set(ax,YDir,normal); % and f is the frequency vector for the y-axis
is already in x what is its value? We are interested in analyzing
Given that all of the information in X
signals whose frequency content changes in time (e.g. music) and we want to analyze how this occurs. Taking
the DFT of the entire signal will show us the frequency content over the entire time period. We would like to
take the DFT over a short period of time because this will give us a local snapshot in time of the frequency
content of the signal during that short time period. We can do this by first taking the DFT of x(1 : m),
then we can take the DFT of x(2 : m + 1), and so on until we hit the end of the signal. This is exactly the
spectrogram.