Programming with Python

Stefan Güttel, guettel.com (http://guettel.com)

Contents:
1. Useful tools for data analysis
2. Case study: How warm was Europe in the past?

Useful tools for data analysis

Jupyter Notebook
Most data analysis needs to be accompanied by some form of reporting that explains which data has been used, where the
data comes from, which analysis method is being used and why, and what the conclusions of the analysis are, followed by a discussion.
Jupyter Notebook is an open-source web application that allows us to create and share documents that contain live code,
equations, visualizations and narrative text. If you have installed the Anaconda distribution on your computer, it should
already be available. We can start Jupyter Notebook from the Anaconda Navigator or the command line.

Jupyter Notebooks contain both Python code and text, which is formatted in Markdown. Here are a few Markdown formats:

Headings with #, ##, ###, ...

Inline code enclosed with backticks like so: `print('hello')` , and code blocks with ```
Emphasis, aka italics, with *asterisks* or _underscores_
Enumerated lists with 1. , 2. , etc., or bullet points with *
Links with [I'm an inline-style link to Google](https://www.google.com)
Images ![alt text](https://url/image.png "Logo Title Text 1")
LaTeX formulas with $ and $$

See Markdown Cheatsheet (https://github.com/adam-p/markdown-here/wiki/Markdown-Cheatsheet) for more.

JupyterLab
JupyterLab is similar to Jupyter Notebook, but with many additional features focused on interactive, exploratory computing. The
JupyterLab interface consists of a main work area containing tabs of documents and activities, a collapsible left sidebar, and
a menu bar. The left sidebar contains a file browser, the list of running kernels and terminals, the command palette, the
notebook cell tools inspector, and the tabs list. It is also good for viewing large CSV files.

Python Data Analysis Library

The Python Data Analysis Library pandas (https://pandas.pydata.org/) provides high-performance, easy-to-use data
structures and data analysis tools. Most importantly, it implements a fast and efficient DataFrame object for data
manipulation with integrated indexing, and tools for reading and writing data between in-memory data structures and
different formats. There is also a 10-minute pandas tutorial
(https://pandas.pydata.org/pandas-docs/stable/user_guide/10min.html).

Here we just show a quick example of pandas, using the pandas-datareader package (which must be installed separately) to
read and plot the daily low and high prices of the Amazon stock.
In [1]: import pandas_datareader.data as web
import matplotlib.pyplot as plt
import pandas as pd

symbol = 'AMZN' # Amazon stock

start = '2019-04-22'
end = '2020-11-29'
df = web.DataReader(name=symbol, data_source='yahoo', start=start, end=end)
df.head()

Out[1]:
                   High          Low         Open        Close   Volume    Adj Close
Date
2019-04-22  1888.420044  1845.640015  1855.400024  1887.310059  3373800  1887.310059
2019-04-23  1929.260010  1889.579956  1891.199951  1923.770020  4640400  1923.770020
2019-04-24  1929.689941  1898.160034  1925.000000  1901.750000  3675800  1901.750000
2019-04-25  1922.449951  1900.310059  1917.000000  1902.250000  6099100  1902.250000
2019-04-26  1951.000000  1898.000000  1929.000000  1950.630005  8432600  1950.630005

In [2]: %matplotlib inline

import matplotlib.pylab as pylab
pylab.rcParams['figure.figsize'] = 10, 7.5
ax = df[['Low','High']].plot();
How do I become a data analyst?
The basic mathematical tools every data analyst needs to know are grounded in Linear Algebra, Optimisation, Statistics, and
Probability Theory. Some of the third and fourth year courses that might be helpful are

MATH36001 - Matrix Analysis
MATH36061 - Convex Optimization
MATH38001 - Statistical Inference
MATH38141 - Regression Analysis
MATH38161 - Multivariate Statistics and Machine Learning
MATH38032 - Time Series Analysis
MATH46101 - Numerical Linear Algebra
MATH48091 - Statistical Computing

Apart from being proficient in Python and the pandas package, a data analyst knows about various data analysis and
machine learning techniques such as

statistical hypothesis testing
gradient descent methods
k-nearest neighbor classification
simple, multiple, and logistic regression
decision trees
neural networks

Many of these techniques are implemented in the Python package scikit-learn (https://scikit-learn.org). Check out the
extensive example collection (https://scikit-learn.org/stable/auto_examples).

How do I become a data scientist?

In addition to knowing data analysis techniques, a data scientist is able to write their own analysis algorithms and judge their
suitability and reliability scientifically. A data scientist not only knows how to use advanced machine learning algorithms, but
understands their internal workings and has read the relevant scientific literature before using them.

A good way to get familiar with the fundamental data science tools and algorithms is to code them from scratch, using only
the basic Python language. I strongly recommend the book Data Science from Scratch by Joel Grus (O'Reilly 2015).

Indeed, a great deal of work can be done without using any of the above-mentioned libraries. We will best demonstrate this
with a concrete data analysis problem.

Case study: How warm was Europe in the past?

Problems. Let us solve the following related problems:

1. What were the extreme average temperatures in the past 500 years in Europe?
2. How did the temperature change?
3. What did it look like at a certain point in time (a date or some approximant of that)?
Loading the data
Obviously, Python itself does not provide the needed data. This is where searching the internet comes in handy, leading us
to the historical paleoclimatological data (http://www.ncdc.noaa.gov/data-access/paleoclimatology-data/datasets/historical)
of the NCDC (National Climatic Data Center) (http://www.ncdc.noaa.gov/). From their FTP site
(ftp://ftp.ncdc.noaa.gov/pub/data/paleo/historical/) we can download various data.

Some of the data, along with other info and the copyright notice, is available in the file europe-seasonal.txt
(10a-temps/eu-data/europe-seasonal.txt) ( ftp://ftp.ncdc.noaa.gov/pub/data/paleo/historical/europe-seasonal.txt ).
The data itself looks like this:

Year         DJF         MAM         JJA         SON      Annual
1500      -0.945       7.157      17.483       8.990       8.166
1501      -0.850       7.435      17.401       8.687       8.163
1502      -1.053       6.872      17.906       9.071       8.194
...
2002       0.207       9.214      18.905       9.301       9.508
2003      -1.101       8.521      19.615       9.838       9.374
2004       0.187       8.297      18.325      10.073       9.235

The column "Year" holds the year of each row's data, "DJF" stands for Winter (December, January, February), "MAM"
stands for Spring (March, April, May), "JJA" stands for Summer (June, July, August), "SON" stands for Autumn
(September, October, November), and "Annual" is the average temperature for the given year.

We can copy just this part into a new file and save it under some name, for example "europe-seasonal.dat" .

Notice that this is not exactly a CSV file like we saw before, as the separators are runs of whitespace of varying length.
The columns of this file are defined by their width (the year holds 4 characters and the rest hold 12 characters each).

Luckily, this is not a problem: the split function that we used earlier uses exactly such runs of whitespace of varying
length as separators if it is not given a different one. So, one line from the above file can be split like this:

year,djf,mam,jja,son,annual = line.strip().split()

The additional strip() call removes leading and trailing whitespace (each line ends with a newline character that we
want to remove). However, don't forget: each of the year , djf , mam , jja , son , and annual is a string now and
needs to be converted to either int or float if we are to use it as such.
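As a small illustration of the split-and-convert step, the sketch below parses a single row of the table above. The sample string and the variable `fields` are chosen here for illustration only; the column widths follow the description given earlier.

```python
# One row of the data, as it would be read from the file (note the trailing newline).
line = "1500      -0.945       7.157      17.483       8.990       8.166\n"

# Split on runs of whitespace; every field is still a string at this point.
fields = line.strip().split()

# Convert the year to int and the five temperatures to float.
year = int(fields[0])
djf, mam, jja, son, annual = (float(x) for x in fields[1:])

print(year, djf, annual)   # prints: 1500 -0.945 8.166
```

Without the conversion, a comparison such as `djf < 0` would raise a `TypeError`, because `djf` would still be the string "-0.945".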

A note on organizing our code. Given that we want to write several programs dealing with the same data, creating a
module with some common functionality is a reasonable way to go.
The first function to write would be the one fetching the data from the above file. There are two things to consider here:

1. What to return? We can write it to return either an iterator or a list of all values. Since this data set is not very big, the
two approaches don't differ much. Still, an iterator is usually the better option and we'll do that here.
2. How to store each year's (row's) data? Obvious choices are a tuple or a dictionary. The latter is a tad more
descriptive, but a tuple is a bit easier to create, so we'll work with tuples. This is really just a matter of personal
choice.

So, what we need to do is read the file line by line, split each line, convert the elements to int / float , put them in a tuple
and yield them.

Since we have created our own input file, we can choose the format. It will be as described above, but minus the header
row, since we have no use for it. In other words, our file "europe-seasonal.dat" looks like this:

1500      -0.945       7.157      17.483       8.990       8.166
1501      -0.850       7.435      17.401       8.687       8.163
1502      -1.053       6.872      17.906       9.071       8.194
...
2002       0.207       9.214      18.905       9.301       9.508
2003      -1.101       8.521      19.615       9.838       9.374
2004       0.187       8.297      18.325      10.073       9.235

We can now create a function that will read the data from this file:

In [3]: import os.path

# This is the directory with the data
data_dir = os.path.join("10a-temps", "eu-data")

def seasonal_data(fname = os.path.join(data_dir, "europe-seasonal.dat")):
    """
    Read the seasonal data from a fixed-width column file with the columns
    `year`,
    `djf` (Winter: December, January, February),
    `mam` (Spring: March, April, May),
    `jja` (Summer: June, July, August),
    `son` (Autumn: September, October, November), and
    `annual` (Annual average).

    The return value is an iterator that returns the tuple
    `(year, djf, mam, jja, son, annual)`
    for each row of data.

    It is assumed that the file has no header.

    If the file does not exist, a `FileNotFoundError` exception is raised.
    """
    with open(fname) as f:
        for line in f:
            fields = line.strip().split()
            # Extract year, convert it to `int`, join it back with
            # the rest of values converted to `float`, and `yield`
            # them all as a tuple.
            # The `try...except` block ensures that the faulty data
            # (for example, a header) is ignored.
            if len(fields) == 6:
                try:
                    yield tuple([int(fields[0])] + [float(x) for x in fields[1:]])
                except ValueError:
                    continue
Notice the use of os.path.join above. While it will generally work to use "10a-temps/eu-data" instead of
os.path.join("10a-temps", "eu-data") , there are situations when it will fail. A good program should always try to
avoid those. More details can be found in this excellent explanation (http://stackoverflow.com/a/24072843/1667018).
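To see how the reading function copes with faulty lines without needing the data file, here is a minimal sketch of the same parsing logic applied to an in-memory text stream. The helper `parse_rows` and the sample text are hypothetical and only mirror the body of `seasonal_data`; they are not part of the original module.

```python
import io

def parse_rows(lines):
    """Yield (year, djf, mam, jja, son, annual) tuples from an iterable of text lines."""
    for line in lines:
        fields = line.strip().split()
        if len(fields) != 6:
            continue  # wrong number of columns (e.g. a blank line): skip it
        try:
            yield tuple([int(fields[0])] + [float(x) for x in fields[1:]])
        except ValueError:
            continue  # non-numeric fields (e.g. a header row): skip it

# A small in-memory "file" with a header row and a blank line.
sample = io.StringIO(
    "Year DJF MAM JJA SON Annual\n"
    "\n"
    "1500 -0.945 7.157 17.483 8.990 8.166\n"
    "1501 -0.850 7.435 17.401 8.687 8.163\n"
)

rows = list(parse_rows(sample))
print(len(rows))   # 2: the header and the blank line are skipped
print(rows[0])
```

Because `parse_rows` is a generator, no line is parsed until the caller asks for it; wrapping it in `list()` forces all rows to be read at once.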

Let us now get some basic info about the temperature in Europe in the past 500 years:

In [4]: # Get the data into a list, to avoid rereading the file several times.
# We can afford this because the file is fairly small.
data = list(seasonal_data())

# Minima and maxima:
print("The lowest average winter temperature: {:+7.3f}C".format(
    min(data, key=lambda t: t[1])[1]))
print(" This happened in the year {}.".format(min(data, key=lambda t: t[1])[0]))
print("The highest average winter temperature: {:+7.3f}C".format(
    max(data, key=lambda t: t[1])[1]))
print(" This happened in the year {}.".format(max(data, key=lambda t: t[1])[0]))
print("The lowest average summer temperature: {:+7.3f}C".format(
    min(data, key=lambda t: t[3])[3]))
print(" This happened in the year {}.".format(min(data, key=lambda t: t[3])[0]))
print("The highest average summer temperature: {:+7.3f}C".format(
    max(data, key=lambda t: t[3])[3]))
print(" This happened in the year {}.".format(max(data, key=lambda t: t[3])[0]))
min_annual = min(data, key=lambda t: t[5])
max_annual = max(data, key=lambda t: t[5])
print("The average annual temperature varied between {:+.3f}C in {} to {:+.3f}C in {}.".format(
    min_annual[5], min_annual[0],
    max_annual[5], max_annual[0]
))

The lowest average winter temperature: -4.152C
 This happened in the year 1709.
The highest average winter temperature: +1.734C
 This happened in the year 1990.
The lowest average summer temperature: +16.477C
 This happened in the year 1902.
The highest average summer temperature: +19.615C
 This happened in the year 2003.
The average annual temperature varied between +7.006C in 1875 to +9.664C in 2000.
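The cell above searches the data twice for every extreme (once for the value, once for the year). A possible variation, sketched here on three sample rows taken from the table shown earlier rather than on the full data set, is to keep the tuple returned by `min` or `max` and read both fields from it. The names `sample`, `coldest_winter` and `warmest_summer` are illustrative.

```python
# Three sample rows in the (year, djf, mam, jja, son, annual) layout used above.
sample = [
    (1500, -0.945, 7.157, 17.483, 8.990, 8.166),
    (1501, -0.850, 7.435, 17.401, 8.687, 8.163),
    (1502, -1.053, 6.872, 17.906, 9.071, 8.194),
]

# `min`/`max` with a key return the whole tuple, so one call gives year and value.
coldest_winter = min(sample, key=lambda t: t[1])
warmest_summer = max(sample, key=lambda t: t[3])

print("Coldest winter: {:+.3f}C in {}".format(coldest_winter[1], coldest_winter[0]))
print("Warmest summer: {:+.3f}C in {}".format(warmest_summer[3], warmest_summer[0]))
```

For these three rows both extremes fall in 1502; the results for the full data set are the ones printed by the notebook cell above.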

Plotting the temperature trend

The above analysis gave us a glimpse of what was going on with our temperature in the second half of the past millennium.
However, to truly see what was going on, we should plot this data.

First some setting up for this document:

In [5]: %matplotlib inline

import matplotlib.pylab as pylab
pylab.rcParams['figure.figsize'] = 10, 7.5
Note that %matplotlib inline is NOT a Python construct. Its purpose is to tell IPython Notebook (in which these notes
are written) to include the plot in the document itself.

As for the remaining two lines, they establish the size of the plot and can be used in Python as well. However, there are
usually better ways to do it and this is used merely to set the default values for all the plots produced by the program.

We are now ready to do some basic plotting:

In [6]: import matplotlib.pyplot as plt

years = list()
djfs = list()
mams = list()
jjas = list()
sons = list()
annuals = list()

# There are more Pythonic ways to do this, for example with
# NumPy's transposing or generator expressions, but this is
# more straightforward.
for year,djf,mam,jja,son,annual in seasonal_data():
    years.append(year)
    djfs.append(djf)
    mams.append(mam)
    jjas.append(jja)
    sons.append(son)
    annuals.append(annual)

plt.plot(
    years, djfs, "blue",
    years, mams, "green",
    years, jjas, "red",
    years, sons, "orange",
    years, annuals, "gray"
)

plt.show()
So, how does this work?

1. import matplotlib.pyplot as plt imports the basic plotting module.
2. The function plot (http://matplotlib.org/api/pyplot_api.html#matplotlib.pyplot.plot) takes each dataset as a pair of two
lists: x-values and y-values. Therefore, we create the lists of all the years and Spring/Summer/Autumn/Winter/annual
temperatures in a very simple for loop.
3. We plot each of the lists of values, using years as the common list of values for the x-axis.
4. Finally, plt.show() tells the system that the plot is ready to be shown.

From the above plot, we can observe several things:

1. Winters are always colder than Springs, which are usually (a bit) colder than Autumns, which are always colder than
Summers. Average, as its name suggests, is in the middle. None of this is really surprising.
2. The average temperature varies more for the Winters than for other seasons.
3. Spring temperatures have varied more since around the beginning of the 19th century.

What we cannot see are trends. For example, is the temperature rising?

The above numbers suggest that Europe is warming up, because the maximum temperatures in Winter, Summer, and on
average have all occurred in recent years. However, these are just extremes that may or may not correlate with the general
behaviour of the temperature. To observe that, we use smoothing (http://en.wikipedia.org/wiki/Smoothing).

There are many different smoothing algorithms. Here, we shall use the moving average
(http://en.wikipedia.org/wiki/Moving_average) in its most simple form. If the temperature for a certain season in year $y$ is
given by the variable $L_y$, we create new variables:

$$L'_y := \frac{1}{2r+1} \sum_{k=y-r}^{y+r} L_k,$$

i.e., $L'_y$ is the average value of the temperatures from the year $y-r$ up to (and including) the year $y+r$, where $r$ (the
radius) is some given number. The bigger the $r$, the smoother the result.
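As a worked example of this definition, take the radius $r = 1$ and the Winter (DJF) values for 1500, 1501 and 1502 from the table shown earlier. The smoothed Winter value for 1501 is then the mean of three numbers:

$$L'_{1501} = \frac{1}{3}\left(L_{1500} + L_{1501} + L_{1502}\right) = \frac{-0.945 - 0.850 - 1.053}{3} = \frac{-2.848}{3} \approx -0.949.$$

Note that no smoothed value can be computed this way for the year 1500 itself, because the year 1499 is not in the data.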

So, how do we smooth a list in Python?

Let us smooth only one element first, the $i$-th one. This means computing the sum $\sum_{k=i-r}^{i+r} L_k$ and dividing it
by $2r+1$. This means we need to:

get a part of the list: L[i-r:i+r+1] (the +1 part is here because the right limit is not included as a part of the new
list),
find its sum: $\sum_{k=i-r}^{i+r} L_k$ = sum(L[i-r:i+r+1]) ,
divide it by 2*r+1 : $\frac{1}{2r+1} \sum_{k=i-r}^{i+r} L_k$ = sum(L[i-r:i+r+1]) / (2*r+1) .

Repeating the above for all viable indices i can be easily done as a list comprehension:

[ sum(L[i-r:i+r+1])/(2*r+1) for i in range(r, len(L)-r) ]

Finally, since we want to do this for a whole list, it is wise to compute 2*r+1 ahead and just store it in some variable.

In [7]: def smooth(L, r):
    """
    Return a new list obtained from a list `L` by smoothing its values
    for a radius `r`. The returned list is shorter than `L` by `2*r`
    elements because the border values are not smoothed.
    """
    tot = 2*r + 1
    return [ sum(L[i-r:i+r+1])/tot for i in range(r, len(L)-r) ]
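A quick way to convince ourselves that the function behaves as described is to run it on a short list whose averages are easy to check by hand. The sketch below repeats the definition of `smooth` so that it can be run on its own; the list `values` is made up for illustration.

```python
def smooth(L, r):
    """Moving average of `L` with radius `r` (same definition as in the cell above)."""
    tot = 2*r + 1
    return [sum(L[i-r:i+r+1])/tot for i in range(r, len(L)-r)]

values = [1, 2, 3, 4, 5, 6]

print(smooth(values, 1))  # [2.0, 3.0, 4.0, 5.0]
print(smooth(values, 2))  # [3.0, 4.0]
print(smooth(values, 0))  # radius 0 changes nothing: [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
```

Each result is `2*r` elements shorter than the input, as stated in the docstring. If `2*r` is at least the length of the list, the result is simply empty.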
Now, the smoothed versions of our temperature lists are easy to obtain:

In [8]: sdjfs = smooth(djfs, 5)

# Display the first 3 and the last 3 elements of this new list
print("Smoothed Winter temperatures: {}, ..., {}".format(
    ", ".join("{:.3}".format(x) for x in sdjfs[:3]),
    ", ".join("{:.3}".format(x) for x in sdjfs[-3:])
))

Smoothed Winter temperatures: -0.917, -1.04, -1.16, ..., 0.338, 0.197, 0.169

Recall that the smoothed lists are shorter than the original ones. This means that the years list is no longer appropriate
for the x-axis and we need to create a new one, with the first and the last r elements removed:

In [9]: r = 5
sdjfs = smooth(djfs, r)
print("len(smooth_djfs) = ", len(sdjfs))
print("len(years) = ", len(years))
syears = years[r:-r]
print("len(smooth_years) =", len(syears))

len(smooth_djfs) = 495
len(years) = 505
len(smooth_years) = 495
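One caveat about the slice `years[r:-r]`: it only does what we want for `r > 0`. The sketch below shows why, using a short made-up list of years; the helper `trim` is hypothetical and just one possible way to write a slice that also works for `r = 0`.

```python
years = list(range(1500, 1510))   # ten sample years, not the real data set

def trim(seq, r):
    """Drop the first and the last `r` elements of `seq` (hypothetical helper)."""
    return seq[r:len(seq) - r]

print(years[2:-2] == trim(years, 2))  # True: both forms agree for r > 0
print(years[0:-0])                    # []: -0 is just 0, so the slice is empty
print(trim(years, 0) == years)        # True: an explicit end index handles r = 0
```

This matters as soon as a radius of 0 ("no smoothing") is allowed, in which case the years list has to be used unchanged.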

As far as smoothing is concerned, this is it.

However, there are various improvements that can be done to our plot.

First, to make some of the improvements easier, we store references to the figure and the subplot in two variables:

fig = plt.figure()
ax = plt.subplot(111)

This allows us to do the customisations that are related to them, and not just the plots themselves. For example:

box = ax.get_position()
ax.set_position([box.x0, box.y0, box.width * 0.8, box.height])

is used to reduce the width of the plotting area by 20% (to 0.8 of its original width), leaving some space on the right side for
the legend.

The legend itself is added by the legend function (http://matplotlib.org/users/legend_guide.html):

plt.legend(bbox_to_anchor=(1.03, 1), loc="upper left", borderaxespad=0)

The description of the arguments used can be found in the function's reference
(http://matplotlib.org/api/pyplot_api.html#matplotlib.pyplot.legend).
So, how does the legend get the names of the plots?

This can be done in several different ways, the easiest one being the plot command itself. To do that, we draw all the
plots one by one:

plt.plot(syears, smooth(djfs, r), color="blue", label="Winter")

plt.plot(syears, smooth(mams, r), color="green", label="Spring")
...

The value of the label argument is used as a description of the plot in the legend.

We can also add titles to the plot and to its axes:

plt.title("Smoothed temperatures through the century")

plt.xlabel("Year")
plt.ylabel("Temp (C)")

A grid is also trivial to add:

plt.grid()

Notice how our plot has a big empty space on the right side. This is because Matplotlib's automatic axis scaling decided that
2100 is a good right limit for the x-axis. However, we might want to use a different value, maybe 2015. We set this by calling the
axis function (http://matplotlib.org/api/pyplot_api.html#matplotlib.pyplot.axis):

plt.axis([1500, 2015, -10, 35])

This sets the x-axis to display the values from 1500 to 2015, and the y-axis to display the values from -10 to 35.

Of course, it would be better to derive these limits from the data. Luckily, we know that all the elements of djfs (the Winter
temperatures) are smaller than all the elements of the remaining lists; also, all the elements of jjas (the Summer
temperatures) are bigger than all the elements of the remaining lists. This simplifies finding the minimum and the maximum, so our
limits can be:

plt.axis( [ years[0], years[-1], min(djfs), max(jjas) ] )

Finally, nothing bad will happen if we go a bit wider with the temperatures, i.e., if instead of the interval [−4.152, 19.615]
we plot [−5, 20]. This can be done by some rounding magic, for example to the next value divisible by 5 (using floor and ceil
from the math module):

plt.axis( [ years[0], years[-1], 5*floor(min(djfs)/5), 5*ceil(max(jjas)/5) ] )

This will add only a minor extra empty space to the top and to the bottom of our plot, but nothing big like the year 2100
added to the right. At the same time, our y-axis labels will turn out nicer.
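The rounding step can be checked in isolation. The sketch below wraps the two expressions in small helper functions (the names `round_down` and `round_up` are made up here) and applies them to the extreme temperatures found earlier.

```python
from math import floor, ceil

def round_down(x, step=5):
    """Largest multiple of `step` that is not greater than `x`."""
    return step * floor(x / step)

def round_up(x, step=5):
    """Smallest multiple of `step` that is not smaller than `x`."""
    return step * ceil(x / step)

print(round_down(-4.152), round_up(19.615))  # -5 20
print(round_down(10), round_up(10))          # 10 10 (multiples of 5 stay as they are)
```

Note that `floor` rounds towards minus infinity, which is exactly what we need for a negative lower limit: `floor(-4.152/5)` is -1, not 0.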

Instead of just showing it on the screen, we can also save the created plot:

plt.savefig("europe-temps-smooth.png", bbox_inches="tight", dpi=200)

The bbox_inches argument controls which part of the figure is saved ("tight" trims the empty space around it), while the dpi
argument stands for "Dots Per Inch". The bigger the dpi value, the bigger the produced image. You can find these and other
parameters in the documentation of the savefig function (http://matplotlib.org/api/pyplot_api.html#matplotlib.pyplot.savefig).

Using what we've seen so far, we can produce the following plot:
In [10]: import matplotlib.pyplot as plt
from math import floor, ceil

r = 17

years = list()
djfs = list()
mams = list()
jjas = list()
sons = list()
annuals = list()

fig = plt.figure()
ax = plt.subplot(111)

# There are more Pythonic ways to do this, for example with
# NumPy's transposing or generator expressions, but this is
# more straightforward.
for year,djf,mam,jja,son,annual in seasonal_data():
    years.append(year)
    djfs.append(djf)
    mams.append(mam)
    jjas.append(jja)
    sons.append(son)
    annuals.append(annual)

# Remove the first and the last `r` years as they cannot be properly smoothed
syears = years[r:-r]
# Compute and plot the smoothed values
plt.plot(syears, smooth(djfs, r), color="blue", label="Winter")
plt.plot(syears, smooth(mams, r), color="green", label="Spring")
plt.plot(syears, smooth(jjas, r), color="red", label="Summer")
plt.plot(syears, smooth(sons, r), color="orange", label="Autumn")
plt.plot(syears, smooth(annuals, r), color="gray", label="Average")

# Shrink the plot
box = ax.get_position()
ax.set_position([box.x0, box.y0, box.width * 0.8, box.height])

# Add the legend
plt.legend(bbox_to_anchor=(1.03, 1), loc="upper left", borderaxespad=0)

# Define the axes limits
plt.axis([years[0], years[-1], 5*floor(min(djfs)/5), 5*ceil(max(jjas)/5)])

# Title and axes labels
plt.title("Smoothed temperatures through the century")
plt.xlabel("Year")
plt.ylabel("Temp (C)")

# Display grid
plt.grid()

# Save the plot as a PNG image
plt.savefig("europe-temps-smooth.png", bbox_inches="tight", dpi=200)

# Show the plot
plt.show()
Now, this is a much better presentation of the general temperature behaviour in Europe in the past 500 years.

It would be nice to have this plot and some form of the previous one together, overlapping. Or, even better, to have several
smoothed versions (for different values of r ), in a way that the less smoothed ones are less visible, yet still present.

We can do this by plotting as we did above, for several different values of r . The only question is how to achieve "less
visibility" of certain plots.

Those familiar with image processing probably know what an alpha channel is. It holds additional per-pixel information, not
unlike color, that defines the transparency of the pixel. The value can be any real number between 0 and 1 , where 0 means
invisible and 1 means completely visible.
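To make the 0-to-1 scale concrete, here is a sketch of one possible mapping from a smoothing radius to an alpha value. It evaluates the quadratic rule that the plotting cell below uses, for the same list of radii; the function name `alpha_for` is made up for this illustration.

```python
rs = [0, 1, 3, 11, 17]   # smoothing radii; 0 means no smoothing

def alpha_for(r, r_max):
    """Map a radius to an alpha between 0.1 and 1 (hypothetical helper)."""
    return 0.1 + 0.9 * (r / r_max) ** 2 if r else 0.1

for r in rs:
    print("r = {:2d}  alpha = {:.3f}".format(r, alpha_for(r, rs[-1])))
```

The unsmoothed curves get the smallest alpha (0.1, barely visible) and the curves with the largest radius get alpha 1 (fully visible). Squaring the ratio keeps the curves for small radii faint, so that they do not clutter the picture.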

We shall define our alpha according to r , with some tweaking to make the final image look better:
In [11]: import matplotlib.pyplot as plt
from math import floor, ceil

# Radii for which to do the smoothing (0 = no smoothing)
rs = [ 0, 1, 3, 11, 17 ]

years = list()
djfs = list()
mams = list()
jjas = list()
sons = list()
annuals = list()

fig = plt.figure()
ax = plt.subplot(111)

# There are more Pythonic ways to do this, for example with
# NumPy's transposing or generator expressions, but this is
# more straightforward.
for year,djf,mam,jja,son,annual in seasonal_data():
    years.append(year)
    djfs.append(djf)
    mams.append(mam)
    jjas.append(jja)
    sons.append(son)
    annuals.append(annual)

# Get the smoothed amounts (for each point take the average of
# `r` values to the left and to the right)
for r in rs:
    # Remove the first and the last `r` years as they cannot be properly smoothed
    syears = years[r:-r] if r else years
    # Compute and plot the smoothed values
    alpha = 0.1 + 0.9*(r/rs[-1])**2 if r else 0.1 # 0 = invisible, 1 = fully visible
    plt.plot(syears, smooth(djfs, r), color="blue", alpha=alpha, label="Winter")
    plt.plot(syears, smooth(mams, r), color="green", alpha=alpha, label="Spring")
    plt.plot(syears, smooth(jjas, r), color="red", alpha=alpha, label="Summer")
    plt.plot(syears, smooth(sons, r), color="orange", alpha=alpha, label="Autumn")
    plt.plot(syears, smooth(annuals, r), color="gray", alpha=alpha, label="Average")

# Shrink the plot
box = ax.get_position()
ax.set_position([box.x0, box.y0, box.width * 0.8, box.height])

# Add the legend
plt.legend(bbox_to_anchor=(1.03, 1), loc="upper left", borderaxespad=0)

# Define the axes limits
plt.axis([years[0], years[-1], 5*floor(min(djfs)/5), 5*ceil(max(jjas)/5)])

# Title and axes labels
plt.title("Smoothed temperatures through the century")
plt.xlabel("Year")
plt.ylabel("Temp (C)")

# Display grid
plt.grid()

# Save the plot as a PNG image
plt.savefig("europe-temps-smooth.png", bbox_inches="tight", dpi=200)

# Show the plot
plt.show()
This looks almost as intended. The only problem is the abundance of items in the legend, which is to be expected, since
each and every one of our 25 plots (5 of them in 5 different "alpha" versions) has its own label, which legend() then
collects and displays.

To avoid this, we can define the label to be None for all but the last r :
In [12]: import matplotlib.pyplot as plt
from math import floor, ceil

# Radii for which to do the smoothing
rs = [ 0, 1, 3, 11, 17 ]

years = list()
djfs = list()
mams = list()
jjas = list()
sons = list()
annuals = list()

fig = plt.figure()
ax = plt.subplot(111)

# There are more Pythonic ways to do this, for example with
# NumPy's transposing or generator expressions, but this is
# more straightforward.
for year,djf,mam,jja,son,annual in seasonal_data():
    years.append(year)
    djfs.append(djf)
    mams.append(mam)
    jjas.append(jja)
    sons.append(son)
    annuals.append(annual)

# Get the smoothed amounts (for each point take the average of
# `r` values to the left and to the right)
for r in rs:
    # Remove the first and the last `r` years as they cannot be properly smoothed
    syears = years[r:-r] if r else years
    # Compute and plot the smoothed values
    alpha = 0.1 + 0.9*(r/rs[-1])**2 if r else 0.1 # 0 = invisible, 1 = fully visible
    plt.plot(syears, smooth(djfs, r), color="blue", alpha=alpha,
             label="Winter" if r == rs[-1] else None)
    plt.plot(syears, smooth(mams, r), color="green", alpha=alpha,
             label="Spring" if r == rs[-1] else None)
    plt.plot(syears, smooth(jjas, r), color="red", alpha=alpha,
             label="Summer" if r == rs[-1] else None)
    plt.plot(syears, smooth(sons, r), color="orange", alpha=alpha,
             label="Autumn" if r == rs[-1] else None)
    plt.plot(syears, smooth(annuals, r), color="gray", alpha=alpha,
             label="Average" if r == rs[-1] else None)

# Shrink the plot
box = ax.get_position()
ax.set_position([box.x0, box.y0, box.width * 0.8, box.height])

# Add the legend
plt.legend(bbox_to_anchor=(1.03, 1), loc="upper left", borderaxespad=0)

# Define the axes limits
plt.axis([years[0], years[-1], 5*floor(min(djfs)/5), 5*ceil(max(jjas)/5)])

# Title and axes labels
plt.title("Smoothed temperatures through the century")
plt.xlabel("Year")
plt.ylabel("Temp (C)")

# Display grid
plt.grid()

# Save the plot as a PNG image
plt.savefig("europe-temps-smooth.png", bbox_inches="tight", dpi=200)

# Show the plot
plt.show()

And here is our (overly large) saved image, loaded by the IPython-specific function Image :
In [13]: from IPython.display import Image
Image("europe-temps-smooth.png")

Out[13]:
Creating a temperature map
What we did above, we did by using just one of the available tables of data (the one from the file
"europe-seasonal.txt" ). Looking at the other files there (and their descriptions), we can extract more data and we can make more
plots reflecting different information.

For this part of the lecture, we focus on the file TT_Europe_1500_2002_New.GDX
( ftp://ftp.ncdc.noaa.gov/pub/data/paleo/historical/europe-seasonal-files/TT_Europe_1500_2002_New.GDX , 143 MB) from
the same FTP site.

Despite its weird file extension .GDX , it is (almost) an ordinary CSV file.

The format is described in the file Readme_TT_1500_2002.txt
( ftp://ftp.ncdc.noaa.gov/pub/data/paleo/historical/europe-seasonal-files/Readme_TT_1500_2002.txt ):

YearSeason followed by 9100 Gridpoints

Seasons are given as 13(Winter, DJF), 14(Spring, MAM), 15 (Summer, JJA), 16 (Autumn, SON)

Year1Season1 Gridpoint1 Gridpoint2 Gridpoint3 Gridpoint4 .....Gridpointy

Year1Season2 Gridpoint1 Gridpoint2 Gridpoint3 Gridpoint4 .....Gridpointy
.....
Year1Season4 Gridpoint1 Gridpoint2 Gridpoint3 Gridpoint4 .....Gridpointy
Year2Season1 Gridpoint1 Gridpoint2 Gridpoint3 Gridpoint4 .....Gridpointy
....
.
Year2Season4 Gridpoint1 Gridpoint2 Gridpoint3 Gridpoint4 .....Gridpointy
.
.
.
YearendSeason4 Gridpoint1 Gridpoint2 Gridpoint3 Gridpoint4 .....Gridpointy

Therefore, each row consist of year Season and a long string with 9100 gridpoints

How can we draw a temperature map from this data?

First, we observe some basic details that are important for designing our program:

1. The file is quite big (143 MB) and stuffing it all into memory is not a good idea.
However, to draw the map for just one year and season, we only need its corresponding line. So, our program shall
read the local copy of the file (downloaded from the internet) line by line until we reach the data that we need. Then we
shall collect that data and be done with the file.
2. What type of plot should we use?
Unless we already have some idea, it is best to check the Matplotlib gallery (http://matplotlib.org/gallery.html). What
would be a good plot type to use? These
(http://matplotlib.org/examples/images_contours_and_fields/interpolation_methods.html) certainly seem nice:

3. Obviously, Matplotlib has no idea what our coloured smudges represent (Europe), so we need an appropriate image
to combine with the plot.
This is somewhat tricky, as our data represents square parts of the map, so we must use a map that was created by
the appropriate projection (called equirectangular projection (http://en.wikipedia.org/wiki/Equirectangular_projection))
and we need to crop the map so that it fits the data (note: the official description of the data is wrong; the covered area
is between 25W-40E and 35N-70N, not 30N-70N).
This part is beyond the scope of this course. We shall use this image:
We are now ready to begin. Let us first define some useful variables and then grab the data:

In [14]: # Data file name
fname = os.path.join("10a-temps", "eu-data", "TT_Europe_1500_2002_New.GDX")
# Image file name
iname = os.path.join("10a-temps", "images", "europe.png")

# We omit the input and give year and season directly.
# This can easily be replaced later.
year = 1700
season = 13

# Filtering string, as per the description in the readme file
fltr = "{:04d}{:02d}".format(year, season)

# Grab the data
with open(fname, mode="rt", encoding="utf8") as f:
    for line in f:
        if line[:6] == fltr:
            break
    else:
        print("No data found for {}.".format(fltr))
        exit(1)

data_list = [float(x) for x in line[6:].strip().split()]

We stop reading as soon as the first field in a line matches our filter (a "yyyyss" string, where "yyyy" is a four-digit year
and "ss" is a two-digit season identifier).

Finally, we split the rest of the line, which contains only temperatures, and immediately convert the pieces to floats.

Now our data list contains all the temperatures for the given year and season.
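The search-and-parse step above can be tried without the data file. The following is a minimal sketch, assuming the "yyyyss value value ..." layout described above; the helper name `find_record` and the sample lines are made up for illustration and are not part of the original program.

```python
import io

def find_record(lines, key):
    """Return the temperatures from the first line starting with `key`.

    `lines` is any iterable of strings; None is returned if nothing matches.
    """
    for line in lines:
        if line[:6] == key:
            break
    else:
        # The loop ran to completion without hitting `break`.
        return None
    return [float(x) for x in line[6:].split()]

# Made-up sample lines (not real data) in the assumed layout.
sample = io.StringIO(
    "169913 -999.99 1.50 2.25\n"
    "170013 -999.99 3.00 -4.50\n"
)
key = "{:04d}{:02d}".format(1700, 13)
print(find_record(sample, key))
```

The `else` branch belongs to the `for` loop, not to the `if`: it runs only when the loop ends without a `break`, which is exactly the "no data found" case.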

Notice how our data is in a list, and our map requires a grid (a table, a matrix,... some rectangular shape).

Instead of carefully creating a list of lists, we can get some help from NumPy, which is -- in essence -- a system for handling
multidimensional arrays. Its basic data structure is an ndarray (which stands for n-dimensional array): it is created by the
array function (http://docs.scipy.org/doc/numpy/reference/generated/numpy.array.html), and it has a neat little function
called reshape (http://docs.scipy.org/doc/numpy/reference/generated/numpy.reshape.html) that does exactly what we
need:
In [15]: import numpy as np

# In the program, we can use the same variable.
# Here, we shall need the original list later on, so we keep it as `data_list`.
data = np.array(data_list).reshape((70, 130))
print("The element with indices (53, 31):", data[53,31])

The element with indices (53, 31): 9.11

Note: NumPy's ndarray allows double indexing, i.e., the element with the indices (53, 31) is referenced as
data[53,31] . If this were an ordinary Python list of lists, we would have to use data[53][31] .
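To see what reshape does on something small enough to check by hand, here is a short sketch with a 12-element list standing in for the temperatures (the numbers are placeholders, not data from the file):

```python
import numpy as np

flat = list(range(12))                  # stand-in for a flat list of readings
grid = np.array(flat).reshape((3, 4))   # 3 rows, 4 columns, filled row by row

# Row-major order: element (r, c) comes from flat[r * 4 + c].
print(grid[2, 1], flat[2 * 4 + 1])

# The equivalent list of lists needs chained indexing instead.
nested = grid.tolist()
print(nested[2][1])
```

The same rule applies to our 70 by 130 grid: the first 130 values of the list form row 0, the next 130 form row 1, and so on.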

Mimicking the example on the previously linked page
(http://matplotlib.org/examples/images_contours_and_fields/interpolation_methods.html), we create the plot:

In [16]: plt.imshow(data, interpolation="bicubic", cmap='jet');

While somewhat interesting, this is far from what we've seen above. What happened?

The colours are assigned to the values automatically, with the lowest ones being blue, the highest ones being red, and those
in between having other colours.

The readme file says that only the continental temperatures are available. But our data needs to be "matrix-like", so what is
there in the locations describing the sea?

Opening the file reveals the secret: those temperatures are given as -999.99. So, our automatic colouring works fine, but all
the "interesting" temperatures (from approx. -25C to approx. 40C) are squeezed at the top of the scale, thus all getting
coloured red.
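The squeezing effect is easy to check with a little arithmetic. The sketch below uses a hand-written linear mapping (the helper `linear_norm` is hypothetical, written only to mimic a linear colour scale) and the approximate temperature range mentioned above:

```python
def linear_norm(value, vmin, vmax):
    """Map value linearly from [vmin, vmax] onto [0, 1]."""
    return (value - vmin) / (vmax - vmin)

# Compare a scale stretched by the "no data" marker with a scale
# limited to the real temperatures.
for temp in (-25.0, 0.0, 40.0):
    squeezed = linear_norm(temp, -999.99, 40.0)
    sensible = linear_norm(temp, -25.0, 40.0)
    print("{:6.1f}C -> {:.3f} (with -999.99) vs {:.3f}".format(
        temp, squeezed, sensible))
```

With -999.99 as the lower end, even the coldest real temperature lands above 0.93 on the 0 to 1 scale, so all land values share the top few percent of the colour map.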

In other words, we need to set the proper scale for colouring.

After a bit of Googling, it is easy to find that this is done by the function matplotlib.colors.Normalize
(http://matplotlib.org/api/colors_api.html#matplotlib.colors.Normalize), which takes minimum and maximum values.

These are easy to find: we take the minimum and the maximum over only those values that lie between −100 and +100, which excludes the −999.99 markers:
In [17]: import matplotlib.colors

norm = matplotlib.colors.Normalize(
    vmin=min(fld for fld in data_list if fld > -100),
    vmax=max(fld for fld in data_list if fld < +100)
)
plt.imshow(data, interpolation="bicubic", norm=norm, cmap='jet');

This is pretty much what we wanted.
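As an aside, the same limits can be computed on the NumPy array itself. This is an alternative to the generator expressions used above, not the method of the original program: a sketch using NumPy's masked arrays, with a tiny made-up array in place of the real grid.

```python
import numpy as np

# Made-up values; -999.99 plays the role of the "no data" marker.
values = np.array([[-999.99, 3.5, -12.0],
                   [18.25, -999.99, 7.0]])

# Mask everything below -100, i.e. the -999.99 entries.
masked = np.ma.masked_less(values, -100)

# min() and max() of a masked array skip the masked entries.
vmin, vmax = masked.min(), masked.max()
print(vmin, vmax)
```

The threshold of -100 is the same assumption as in the cell above: no real temperature in the data is expected to fall below it.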

To combine it with the above image (the map of Europe), we look at the gallery again and find this
(http://matplotlib.org/examples/pylab_examples/layer_images.html):

We do not want a checkerboard as the background but an image; the principle, however, is the same.

So, how do we load an image (instead of creating the checkerboard)?

Back in the gallery, we quickly find the Image demo
(http://matplotlib.org/examples/images_contours_and_fields/image_demo.html) that does exactly that and almost nothing
more. More on dealing with images in Matplotlib can be read in Matplotlib's Image tutorial
(http://matplotlib.org/users/image_tutorial.html).

Luckily, both the checkerboard and the image are dealt with using the function imshow
(http://matplotlib.org/api/pyplot_api.html#matplotlib.pyplot.imshow), so merging these examples is easy:
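In [18]: import matplotlib.image

# Load the map and draw it first, as the background layer
img = matplotlib.image.imread(iname)
im_europe = plt.imshow(img)
# Draw the semi-transparent temperatures on top, stretched to the image size
im_temps = plt.imshow(data,
    interpolation="bicubic",
    norm=norm,
    alpha=0.43,
    extent=(0,img.shape[1],img.shape[0],0),
    cmap='jet'
)
plt.show()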
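The layering idea can be tried without the map file. The sketch below uses two synthetic arrays as stand-ins (a grey rectangle for the map and a coarse grid for the temperatures) and the off-screen Agg backend, so it runs without a display; the array sizes are arbitrary choices for illustration.

```python
import matplotlib
matplotlib.use("Agg")  # off-screen backend, so no window is needed
import matplotlib.pyplot as plt
import numpy as np

# Stand-ins: a 100x200-pixel grey "map" and a coarse 7x13 data grid.
background = np.full((100, 200), 0.5)
overlay = np.arange(7 * 13, dtype=float).reshape((7, 13))

fig, ax = plt.subplots()
ax.imshow(background, cmap="gray", vmin=0.0, vmax=1.0)

# extent=(left, right, bottom, top) in the background's pixel coordinates;
# bottom > top keeps row 0 of the grid at the top, as in the image.
height, width = background.shape[:2]
layer = ax.imshow(overlay, alpha=0.43, cmap="jet",
                  interpolation="bicubic",
                  extent=(0, width, height, 0))

fig.canvas.draw()  # render both layers
```

The extent argument is what stretches the small grid over the whole background; without it, the second layer would occupy only a 13 by 7 corner of the picture.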

We can also add a title and remove the axes labels:

In [19]: import matplotlib.image

seasons = { "13": "Winter", "14": "Spring", "15": "Summer", "16": "Autumn" }

fig = plt.figure()
ax = plt.subplot(111)
img = matplotlib.image.imread(iname)
im_europe = plt.imshow(img)
im_temps = plt.imshow(data,
    interpolation="bicubic",
    norm=norm,
    alpha=0.43,
    extent=(0,img.shape[1],img.shape[0],0),
    cmap='jet'
)
plt.title("European temperatures for the {} of {}.".format(seasons[str(season)], year))
# Empty tick lists remove the axes labels
ax.set_xticks([])
ax.set_yticks([])
plt.show()

Last, but not least, it would be nice to explain what those colours actually mean. Like the legend in the previous example, a
map like this can use a colorbar, with the shrink argument that makes the colorbar a bit smaller than it would be
otherwise. This is trivial to add:
In [20]: import pylab

seasons = { "13": "Winter", "14": "Spring", "15": "Summer", "16": "Autumn" }
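The colorbar call described above can be sketched on synthetic data. The values below are an evenly spaced placeholder range, not real temperatures, and the label text is an illustrative choice; the Agg backend is used so that the sketch runs without a display.

```python
import matplotlib
matplotlib.use("Agg")  # off-screen backend, so no window is needed
import matplotlib.pyplot as plt
import numpy as np

# Synthetic 70x130 grid with values spread between -25 and 40.
values = np.linspace(-25.0, 40.0, 70 * 130).reshape((70, 130))

fig, ax = plt.subplots()
im = ax.imshow(values, cmap="jet")

# shrink=0.85 draws the colorbar at 85% of its default length.
cbar = fig.colorbar(im, ax=ax, shrink=0.85)
cbar.set_label("Temperature (C)")

ax.set_xticks([])
ax.set_yticks([])
```

Passing the image object explicitly tells Matplotlib which colour mapping the bar should describe, which matters once a figure holds more than one image layer.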

So, with the program's docstring and import statements omitted, here is our program, but this time displaying the
Summer of '69 (https://www.youtube.com/watch?v=eFjjO_lhf9c):
In [21]: # Data file name
fname = os.path.join("10a-temps", "eu-data", "TT_Europe_1500_2002_New.GDX")
# Image file name
iname = os.path.join("10a-temps", "images", "europe.png")
# Seasons and their codes
seasons = { "13": "Winter", "14": "Spring", "15": "Summer", "16": "Autumn" }

# We omit the input and give year and season directly.
# This can easily be replaced later.
year = 1969
season = 15

# Filtering string, as per the description in the readme file
fltr = "{:04d}{:02d}".format(year, season)

# Grab the data
with open(fname, mode="rt", encoding="utf8") as f:
    for line in f:
        if line[:6] == fltr:
            break
    else:
        # The loop finished without a match
        print("No data found for {}.".format(fltr))
        exit(1)

# Convert data from strings to float
data = [float(x) for x in line[6:].strip().split()]

# Prepare a plot
fig = plt.figure()
ax = plt.subplot(111)

# Set colour normalization to min and max values
# between -100 and 100 to avoid junk data (for example,
# -999.99 denotes "no data", i.e., the sea).
norm = matplotlib.colors.Normalize(
    vmin=min(fld for fld in data if fld > -100),
    vmax=max(fld for fld in data if fld < +100)
)

# Reshape the data from a list to a 70x130 matrix
data = np.array(data).reshape((70, 130))

# Get the image
img = matplotlib.image.imread(iname)
im_europe = plt.imshow(img)

# Create the semi-transparent temperature plot
# with the same extent (dimensions) as the image
im_temps = plt.imshow(data,
    interpolation="bicubic",
    norm=norm,
    alpha=0.43,
    extent=(0,img.shape[1],img.shape[0],0),
    cmap='jet'
)

# Add the colorbar
plt.colorbar(shrink=0.85)

# Set the title
plt.title("European temperatures for the {} of {}.".format(seasons[str(season)], year))
ax.set_xticks([])
ax.set_yticks([])

# Display the plot
plt.show()
Conclusion
The module, the programs, and the data files (compressed for easier downloading) from this lecture can be downloaded
here (10a-temps.zip) (22MiB). If reusing and/or redistributing, please keep the readme files and the references to the
original sources of data.

While these are just some examples of what can be done with data in Python, there are specialized modules and packages
for dealing with large data and for doing far more advanced data analysis. To learn more, feel free to check Pandas
(http://pandas.pydata.org/), the statistics module (https://docs.python.org/3/library/statistics.html), the Statsmodels
package (http://statsmodels.sourceforge.net/), ...

References

Temperatures data source:

Luterbacher, J., et al. 2006: European Seasonal Temperature Reconstructions.
IGBP PAGES/World Data Center for Paleoclimatology
Data Contribution Series # 2006-060.
NOAA/NCDC Paleoclimatology Program, Boulder CO, USA.

Gridded seasonal absolute surface air temperature for Europe 1500-2002:

Luterbacher, J., Dietrich, D., Xoplaki, E., Grosjean, M., and Wanner, H., 2004:
European seasonal and annual temperature variability, trends, and extremes since 1500,
Science 303, 1499-1503 (DOI:10.1126/science.1093877 (http://doi.org/10.1126/science.1093877)).

Xoplaki, E., Luterbacher, J., Paeth, H., Dietrich, D., Steiner N., Grosjean, M., and Wanner, H., 2005:
European spring and autumn temperature variability and change of extremes over the last half millennium,
Geophys. Res. Lett., 32, L15713 (DOI:10.1029/2005GL023424 (http://doi.org/10.1029/2005GL023424)).
