Haskell Book


Speeding Through Haskell

With Example Code!

Mihai-Radu Popescu

questions@sthaskell.com

To

#haskell,

where all questions are answered in majestic stereo.

Contents

I. Starting Out

1. Introduction
  1.1. About the Book
  1.2. Why Haskell?
    1.2.1. For Programmers
    1.2.2. For Mathematicians
    1.2.3. For Everybody Else
  1.3. Before We Start
    1.3.1. Using GHCi
    1.3.2. Interactive vs. Noninteractive
    1.3.3. Loading Files

2. Basics: Functions and Lists
  2.1. Getting Started
    2.1.1. Simple Arithmetic
    2.1.2. Boolean Algebra
    2.1.3. Calling and Making Functions
    2.1.4. Infix Functions
  2.2. Using Lists
    2.2.1. Intro
    2.2.2. Basic List Functions
    2.2.3. Ranges
    2.2.4. Cycling Lists
  2.3. List Comprehensions
    2.3.1. Basics
    2.3.2. Advanced Uses
    2.3.3. Practical Applications

3. Types, Typeclasses, and Polymorphism
  3.1. Understanding Types
    3.1.1. Knowing Types
    3.1.2. Type Declarations
  3.2. Polymorphism
    3.2.1. Type Variables
    3.2.2. Typeclasses
    3.2.3. Making Polymorphic Functions
    3.2.4. Drawbacks
  3.3. Case Study: Tuples
    3.3.1. Lists Recap
    3.3.2. Understanding Tuples
    3.3.3. Functions on Tuples
    3.3.4. Applications

II. Getting the Hang of It

4. Exploring Syntax
  4.1. Pattern Matching
    4.1.1. Basics
    4.1.2. Applications
    4.1.3. Matching with Cons
    4.1.4. As patterns
    4.1.5. Patterns in Comprehensions
  4.2. Other Constructs and Expressions
    4.2.1. Guards
    4.2.2. Where Bindings
    4.2.3. Let Bindings
    4.2.4. Bonus: Case Expressions

5. Recursion
  5.1. Basic Implementation
    5.1.1. Understanding Recursion
    5.1.2. Practical Examples
    5.1.3. More Parameters
  5.2. Variations
    5.2.1. Using Guards
    5.2.2. Multiple Regular Cases
    5.2.3. Infinite Recursion
  5.3. Further Expansion
    5.3.1. Using Natural Numbers [FIXME-move to adv. types]
    5.3.2. Application: Quicksort
    5.3.3. Discussion

6. Advanced Functions
  6.1. Currying and Partial Application
    6.1.1. Fundamentals
    6.1.2. Problem Z
    6.1.3. When It's Not
  6.2. Higher Order Functions
    6.2.1. Passing Functions as Parameters
    6.2.2. Flipping the Parameters
  6.3. More Useful Functions
    6.3.1. map and zipWith
    6.3.2. Working with Predicates
    6.3.3. Comparison with List Comprehensions
    6.3.4. Anonymous Functions (Lambdas)
  6.4. Folds and Scans
    6.4.1. Eating a List
    6.4.2. Introducing Folds Proper

III. Appendices

A. Miscellaneous
  A.1. Functions
    A.1.1. Fixity
    A.1.2. Laziness Explained
  A.2. Constants (A.K.A. Variables)
    A.2.1. Local Variables

B. Types and Typeclasses
  B.1. Typeclasses in Depth
    B.1.1. Show and Read
    B.1.2. Eq, Ord, Enum
    B.1.3. Numeric Typeclasses
  B.2. Type Errors
    B.2.1. General Type Errors
    B.2.2. Ambiguous Type Variable Errors
    B.2.3. Making Custom Errors

C. Modules
  C.1. Data.List

Part I.

Starting Out

1. Introduction
The Haskell community has an acute shortage of buggy underdocumented programs.

(sorear)

1.1. About the Book


Warning! This book is a work in progress. Read at your own risk!
Hello there! This is a book that will show you around the Haskell programming language. If you're not already familiar (or too familiar) with programming in another language, you might need to put in extra work. Don't be discouraged! While the stuff in the beginning may seem extremely boring, mind-blowing things start happening later on.

This book has a lot of footnotes. You don't have to read them, but sometimes you might gain some insight by doing so. You can click on them (do it here [1]) to jump to them faster (readers from the website might want to download the book for this reason). You can click on the table of contents as well. The writing in this book may not be polished yet, and some things may be missing, but take a look: you might just like it!

1.2. Why Haskell?


Every language (human or computer) is unique. But there exists a special breed of languages: those that challenge and shape the way one thinks. Haskell is one of them, lost innovation in a sea of clichés. Unfortunately, the only people apparently interested in Haskell are academics who blindly push the boundaries and gurus who want to learn just one more language. On a more concrete note, if Haskell were to have a list of prerequisites, it would be very unusual indeed: at least two of the following.

• Extensive programming experience
• A background in mathematics
• An inclination towards the abstract
• Perseverance
• Hard work

1.2.1. For Programmers


I never intended to (and still don't quite) take programming seriously. I wanted something quick, fun and challenging to kill some time, clear my thoughts and, above all, stop performing repetitive tasks. My first language was Python: easy, fun, good with the teachers. After about two weeks, I let it go and tried others: Common Lisp, C, Perl, Java, and finally, I fell in love with Haskell. One might say Haskell is a bit different. For example, in Haskell:

• return doesn't return.
• Classes aren't really classes.
• Variables are actually constants.
• The code might not execute in the order shown on the screen.

Below are some of my favorite snippets of code, each on a separate line. They're classics, and really show how Haskell stands out.

    -- needs: Data.List (nubBy, group, sort), Data.Function (fix), Data.Ratio ((%)),
    --        Control.Monad (filterM), Control.Arrow ((&&&))
    fibonacci = 0 : 1 : zipWith (+) fibonacci (tail fibonacci)
    primes = nubBy (\x y -> gcd x y > 1) [2..]
    rationals = fix ((1:) . (>>= \x -> [x+1, 1%(x+1)])) :: [Rational]
    powerset = filterM (const [True, False])
    histogram = map (head &&& length) . group . sort

[1] If it didn't work, you might want to download the book (google docs link). If it still doesn't work, get Adobe Reader.

1.2.2. For Mathematicians


Every time someone writes i = i + 1, a mathematician dies [2]. The fact is that many mathematicians have cringed at the sight of a computer screen with some random code. They are used to writing stuff like:

Let a function $f : \mathbb{Z} \to \mathbb{Z}$, $f(x) = 2y + 3$, where $y = |x - 4|$. If we consider set $A = \{-5, -3, \ldots, 11\}$, we shall map function $f$ over $A$, naming the result set $B$. We shall also consider set $C = \{ f(x)^2 \mid x \in A,\ x < 10 \}$.

One does not simply code such a thing in C or Python, at least not without mutilating maths. However, in Haskell, the result is pleasing to the eye and easy to understand, too (everything following the -- is a comment).

    f :: Integer -> Integer
    f x = 2*y + 3
      where y = abs (x - 4)

    a = [-5, -3 .. 11] -- we'll see later why a, b, and c are lowercase
    b = map f a
    c = [(f x)^2 | x <- a, x < 10] -- this really works!

The mathematical applications of Haskell are endless. It's even possible to define and work with monoids [XREF]!

1.2.3. For Everybody Else


Intelligent and/or hardworking people will enjoy the challenge provided by Haskell. At the end of the journey, the traveller will look at the world with new eyes, satisfied that he is now better equipped to understand the Universe. This is all because Haskell is riddled with complex, counterintuitive or simply mind-boggling elements. Let's take a look at something interesting.

    compare 2 3   -- works
    compare (2 3) -- doesn't work
    (compare 2) 3 -- works!!

This paradox (let's call it Problem Z even though it's actually a feature), and more, will be presented and explained throughout the book.

[2] Not really, but hey.
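As a small preview of Problem Z, and only a sketch of what later chapters explain properly, the three compare expressions from 1.2.3 can be written in a file. The names ordering1, ordering2 and compareWithTwo are made up for this illustration and do not appear elsewhere in the book.

```haskell
-- Sketch only: a preview of "Problem Z" (all binding names are illustrative).
ordering1 :: Ordering
ordering1 = compare 2 3        -- compare applied to 2, then to 3

ordering2 :: Ordering
ordering2 = (compare 2) 3      -- the same expression, with the grouping made explicit

-- "compare 2" on its own is already a function waiting for the second number.
compareWithTwo :: Integer -> Ordering
compareWithTwo = compare 2

-- compare (2 3) is rejected by the compiler: it would try to call 2 as a function.
```

Loading this file and evaluating ordering1 or ordering2 should give LT in both cases, since 2 is less than 3.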

1.3. Before We Start


This book requires a Haskell interpreter. For most people, the best option is The Haskell Platform, although alternatives like hugs exist. The Haskell Platform uses GHCi as the interpreter (and also has a compiler, GHC), which is what we will use in our examples.

1.3.1. Using GHCi


On Windows, GHCi can be opened using the Start Menu. On Linux, Mac and other UNIX-like systems, ghci can be started using the shell. Below is a typical GHCi session on Linux. We type some expressions, load a file, add a module, and finally change the prompt to something shorter. We added some blank lines to make the output more readable, but in real life the following is a single block of text. There's no need to understand it for now; the example is just to give a rough idea of the GHCi experience.

    ee@bt:~$ ghci
    GHCi, version 7.4.1: http://www.haskell.org/ghc/  :? for help
    Loading package base ... linking ... done.

    Prelude> 2 + 3
    5
    Prelude> max 10 2
    10

    Prelude> :l test.hs -- loading a file
    [1 of 1] Compiling Main             ( test.hs, interpreted )
    Ok, modules loaded: Main.

    *Main> import Control.Monad -- importing a module
    *Main Control.Monad> :set prompt "ghci> "
    ghci> :q -- you can also exit with Ctrl-D
    Leaving GHCi.
    ee@bt:~$

1.3.2. Interactive vs. Noninteractive


GHCi is very narrowly scoped. It's more of a debugger: you can't just copy-paste source files into it, like in Python, because there are key differences between interactive code and code loaded from a file. For example, compare the following (from now on we will use ghci> to indicate an interactive prompt; it's set using :set prompt "ghci> "):

    a = 5
    b = a + 1

    ghci> let a = 5
    ghci> let b = a + 1

We will later (in [XREF]) understand why these differences occur. For now, remember that the second example is working inside a Haskell program (GHCi is, after all, written in Haskell).

1.3.3. Loading Files


Many examples will use functions written in a separate file, which is then loaded into GHCi. Let's go ahead and open up vim (or any other text editor) and write some declarations to get the hang of it.

    -- File: basic.hs
    a = 2
    b = 3
    c = a + b

Now let's load this into GHCi and see if it works (the file needs to be in the directory where GHCi was started, or it won't work [3]).

    ghci> :l basic.hs -- this is how we load
    [1 of 1] Compiling Main             ( basic.hs, interpreted )
    Ok, modules loaded: Main.
    ghci> a + 1
    3
    ghci> c - b == a
    True
    ghci> :r -- this reloads the file if we change it
    [1 of 1] Compiling Main             ( basic.hs, interpreted )
    Ok, modules loaded: Main.
    ghci>

Again, there is no need to dissect the above pieces of code; what's important is knowing how to load a file (:l file.hs) and reload it (:r).

[3] Actually, if the full path is given it will work just fine, but it's cumbersome.

2. Basics: Functions and Lists


I kinda expect functions to return something sensible, but I guess I'm spoiled by exposure to functional programming.

(kzm)

2.1. Getting Started


2.1.1. Simple Arithmetic
It is very easy to use GHCi as a calculator. It supports all the basic operations and some extra functions (min, abs, exp etc.). As an added bonus, Haskell supports arbitrarily large integers.

    ghci> 4 + 5*6
    34
    ghci> exp 2
    7.38905609893065
    ghci> 10 - 4 - (max 5 6)
    0
    ghci> 10^60
    1000000000000000000000000000000000000000000000000000000000000

There still are some problems, especially with the - operator.

    ghci> -3
    -3
    ghci> -3 + 4
    1
    ghci> min -3 4 -- this gives a very long error message.

GHCi treats min -3 4 as min - (3 4), and therefore thinks we want to subtract 3 4 from min. This may look strange, even downright stupid, but GHCi has a very good reason: being able to pass functions as arguments is essential in Haskell. We have no choice but to oblige. A solution is to wrap -3 in parentheses.

    ghci> min (-3) 4
    -3
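Parentheses are not the only way to hand a negative number to a function. The sketch below shows two alternatives using negate and subtract, both standard Prelude functions; the binding names smallest, alsoSmallest and minusThree are invented for this example.

```haskell
-- Sketch: passing negative numbers to functions (binding names are illustrative).
smallest :: Integer
smallest = min (-3) 4            -- parentheses make -3 a single argument

alsoSmallest :: Integer
alsoSmallest = min (negate 3) 4  -- negate is the function form of unary minus

minusThree :: Integer -> Integer
minusThree = subtract 3          -- subtract 3 x computes x - 3
```

Both smallest and alsoSmallest evaluate to -3, and minusThree 10 gives 7.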

2.1.2. Boolean Algebra


In Haskell, working with booleans or testing for equality is as straightforward as can be expected.

    ghci> False || False -- right associative
    False
    ghci> True || False && False -- && has a higher precedence
    True
    ghci> not True
    False
    ghci> not False || not True
    True
    ghci> 5 == 6 -- equality is not associative
    False
    ghci> 5 /= 7 -- programmers beware, it's not !=
    True

A combination of right associativity and something called laziness (we'll get back to it later) means that || stops at the first True statement found (from the left). Likewise, && stops at the first False. Another interesting fact is that || and && are not built into the language, they're functions like all others.
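The stopping behaviour of || and && can be observed directly. This is a rough sketch, not part of the original text: it uses the Prelude value undefined, which crashes the program if it is ever evaluated, as a stand-in for "something we must not look at". The binding names are made up.

```haskell
-- Sketch: || and && never look at their second argument
-- when the first one already decides the result.
shortOr :: Bool
shortOr = True || undefined      -- True; undefined is never evaluated

shortAnd :: Bool
shortAnd = False && undefined    -- False, for the same reason

-- Because they are ordinary functions, they can also be written in prefix form.
prefixOr :: Bool
prefixOr = (||) False True
```

Swapping the arguments (undefined || True) would crash instead, because the left argument is always inspected first.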

2.1.3. Calling and Making Functions


Functions are called with space between the parameters. Some functions accept only one parameter, some more [1]. We have already seen some functions, so here are some more examples, and then we'll move on.

    ghci> succ 3 -- needs to have a logical successor
    4
    ghci> succ 'a'
    'b'
    ghci> pred 'Y' -- same here
    'X'
    ghci> pred "Hello" -- error

There is an important distinction to be made regarding function calls. Parentheses around the arguments only set precedence; they do not separate the function from the arguments. It's essential not to get fooled, especially in the next example.

    ghci> foo (bar 10) -- in C this would be foo(bar(10))
    ghci> (foo bar) 10
    ghci> foo bar 10 -- this is equivalent to the above
    ghci> foo bar (baz 10) 8 -- in C: foo(bar, baz(10), 8)

Also, function application has the highest precedence, so if you write foo 10 + 8, it means (foo 10) + 8 (for more details see A.1.1). We're slightly familiar with defining functions, too (the 1.2.2 example). Let's play a little more with them. Obviously, we can refer to other functions in a definition. Another thing to note is that function names can't begin with uppercase letters.

    -- File: functions.hs
    triple x = 3*x
    strangeAddition x y = x + triple y
    squareTwo x y = (x + y)^2
    c = 4 -- this one takes zero parameters

[1] Technically all functions accept only one parameter, but it's not healthy to think like this, at least for now. Remember Problem Z (introduced in 1.2.3)?

Before we start... calling around, let's talk a little about the last line. This is a very interesting case indeed: c is what we would call in other languages a variable. It's declared the same as a function, but it takes zero parameters so it's a constant [2] (Haskell gives an error if you do c = 4 then c = 5 in the same file). Unlike most languages, in Haskell a zero-parameter function and a constant are really the same. This, strangely enough, has something to do with Problem Z; we'll understand what that means soon enough.

    ghci> :l functions.hs
    [1 of 1] Compiling Main             ( functions.hs, interpreted )
    Ok, modules loaded: Main.
    ghci> triple 2
    6
    ghci> strangeAddition 10 20
    70
    ghci> squareTwo 5 6
    121
    ghci> triple c
    12
    ghci> strangeAddition (triple 2) c
    18

Before we continue, let's look a bit at Haskell's if-else. The first thing we notice is that the else part is mandatory. Why? Every function has to return something. Why? Haskell is more like maths: there are no variables to change, so a function that doesn't return anything wouldn't work [3]. Does f(x) = make sense? Let's add something to functions.hs (the quote is a valid character in function names) and see what happens. Indentation is essential in Haskell because that's how the interpreter identifies blocks of code.

    -- File: functions.hs (CONTINUED)
    strangeAddition' x y = if x > y
                           then x + triple y
                           else y + triple x

    ghci> :r -- we won't be showing load/reload from now on
    [1 of 1] Compiling Main             ( functions.hs, interpreted )
    Ok, modules loaded: Main.
    ghci> strangeAddition 5 3
    14
    ghci> strangeAddition 3 5
    18
    ghci> strangeAddition' 5 3
    14
    ghci> strangeAddition' 3 5
    14
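Because if-else always produces a value, it behaves like any other expression and can be nested inside larger ones. The following is a minimal sketch of that idea; absDiff and describe are made-up names, and the type declarations on the first line of each function are covered later in the book.

```haskell
-- Sketch: if-else is an expression, so it can sit anywhere a value is expected.
-- (absDiff and describe are made-up names for this illustration.)
absDiff :: Integer -> Integer -> Integer
absDiff x y = if x > y then x - y else y - x

describe :: Integer -> String
describe n = "The number is " ++ (if even n then "even" else "odd")
```

For instance, absDiff 3 5 and absDiff 5 3 both give 2, and describe 4 gives "The number is even". Leaving out either else would be a syntax error.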

2.1.4. Infix Functions

Until now we've called functions by putting them before the arguments, like above. But if we surround functions with backquotes, we can make them infix (put them between the parameters), much like + or *.

Warning! Backquotes work only with two-parameter functions.

[2] Mathematicians will understand this right away.
[3] There is also a technical reason, explained in detail in [XREF].

    ghci> 3 `squareTwo` 4
    49
    ghci> 10 `strangeAddition` 20
    70
    ghci> 2 `triple` -- error (and looks stupid, too)

Backquotes are usually adopted to make functions more readable, but they can also be used to create chains. Watch out for associativity (default left) and precedence (default highest); built-in functions don't use the defaults (see A.1.1).

    ghci> 2 `squareTwo` 3 `squareTwo` 4 `squareTwo` 5
    715716
    ghci> ((2 `squareTwo` 3) `squareTwo` 4) `squareTwo` 5
    715716
    ghci> 2 `squareTwo` (3 `squareTwo` (4 `squareTwo` 5))
    49815364

If a function name contains only symbols (like ++, ^, or -.-), it's automatically infix. We can still call infix functions before the arguments, by putting them in parentheses. This really helps with Problem Z.

    ghci> (+) 2 3
    5
    ghci> (*) 4 5
    20
    ghci> (/) 10 4
    2.5
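Symbol-only names can be defined just like named functions. The sketch below invents an operator |+| (not a standard one) that follows the same rule as squareTwo, and shows it used infix and prefix next to a backquoted Prelude function.

```haskell
-- Sketch: a made-up symbolic operator, used infix and prefix.
(|+|) :: Integer -> Integer -> Integer
x |+| y = (x + y) ^ 2            -- same rule as squareTwo

infixForm :: Integer
infixForm = 3 |+| 4              -- symbol-only names are infix by default

prefixForm :: Integer
prefixForm = (|+|) 3 4           -- parentheses turn the operator into a regular function

backquoted :: Integer
backquoted = 10 `div` 3          -- a named function made infix with backquotes
```

Here infixForm and prefixForm are both 49, and backquoted is 3 (div is integer division).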

2.2. Using Lists


2.2.1. Intro
Lists are to Haskell like... well, there's really no comparison. They are the most used data structure. They:

• Are homogeneous: mixing, for example, numbers with characters gives an error.
• Have variable length [4].
• Can be infinitely long [5].
• Are singly linked: lists can only be traversed from left to right [6].

We'll define some lists in a file so we can explore functions that operate on them.

    -- File: lists.hs
    numbers = [1, 3, 7, 5, 6, 6, 8, 10]
    languages = ["lisp", "haskell", "c", "perl", "ruby", "python"]
    hello = "Hello, World!" -- same as ['H', 'e', 'l', 'l', ... and so on]
    listOfLists = [[1, 5, 7, 9], [2, 4, 6], [1]]
    emptyList = []

For starters, ++ concatenates two lists. It's one of the most basic operators. It's associative, so (a ++ b) ++ c is equivalent to a ++ (b ++ c) [7].

    ghci> [1, 2, 3] ++ [5, 4]
    [1,2,3,5,4]
    ghci> "Haskell" ++ " " ++ "is" ++ " " ++ "fun"
    "Haskell is fun"

[4] Well, technically speaking they can't change (nothing can), but for all intents and purposes they are variable in length.
[5] This is because of laziness. Functions in Haskell (like those from 2.1.2) are made to use only as much information as is necessary, and not more. If we combine with && an infinite number of Falses, do we really need to get past the first one?
[6] This means that accessing the last element requires going through the whole list, so watch out!
[7] Without this basic property, lists would be stupid.
The simplest list operator is : [8]. It adds an element to the front of a list. It's so basic, in fact, that [1, 2, 3] is just syntactic sugar [9] for 1:2:3:[]. In 4.1.3 and [XREF] we'll cover the many uses of :, but for now we'll stick to basics.

    ghci> 5 : [4, 6, 8]
    [5,4,6,8]
    ghci> 5 : 4 : 6 : 8 : []
    [5,4,6,8]
    ghci> 'f' : "iretruck"
    "firetruck"
    ghci> [3, 4] : [[5, 6, 7], [8, 9]]
    [[3,4],[5,6,7],[8,9]]

The following throw errors because we're not using : correctly. There are numerous ways to fix them, however.

    ghci> [1] : [2, 3] -- use 1 : [2, 3] or [1] ++ [2, 3] instead.
    ghci> 1 : 2 : 3 -- use 1 : 2 : [3] or 1 : 2 : 3 : []
    ghci> [10, 9, 2] : 4 -- use [10, 9, 2] ++ [4]
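The rule behind these errors is that : always expects a single element on its left and a list of such elements on its right. As a sketch, here are the suggested fixes written out as definitions that load without errors (the names fixed1 to fixed3 are made up):

```haskell
-- Sketch: type-correct versions of the three failing expressions
-- (binding names are illustrative).
fixed1 :: [Integer]
fixed1 = 1 : [2, 3]          -- an element on the left, a list on the right

fixed2 :: [Integer]
fixed2 = 1 : 2 : 3 : []      -- a chain of : must end in a list

fixed3 :: [Integer]
fixed3 = [10, 9, 2] ++ [4]   -- adding to the end needs ++, not :
```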

2.2.2. Basic List Functions


Getting information from lists is done using the following built-in functions (we usually call our lists xs [10]):

• head - first element
• tail - all but the first
• last - last element
• init - all but the last
• !! n - the nth element (numbering starts at 0)
• take n - first n elements
• drop n - all but the first n elements
• length - self-explanatory
• null - check if the list is empty. How not to do it:
    ◦ list == [] - bad
    ◦ length list == 0 - worse
    ◦ unsafeCoerce list :: Bool - worst

    ghci> let xs = [1, 2, 3, 4, 5, 6]
    ghci> head xs
    1
    ghci> tail xs
    [2,3,4,5,6]
    ghci> last xs
    6
    ghci> init xs
    [1,2,3,4,5]
    ghci> xs !! 4
    5
    ghci> take 2 xs
    [1,2]
    ghci> drop 2 xs
    [3,4,5,6]
    ghci> length xs
    6
    ghci> null xs
    False

[8] : is called a list constructor (or cons for short). It's the operator that links the elements of a list (we'll see how this happens a bit later, in [XREF]).
[9] The same thing, but prettier.
[10] As in the plural form of x: exes. Along the same lines: ys, zs, as, bs, cs etc.

One thing worth pointing out is that, due to the nature of lists in Haskell, accessing the last element of a list is considerably slower than accessing the first one. This is because, internally, accessing an element requires going through the ones before it [11]. [FIXME-elaborate with examples]

Warning! Giving out-of-bounds values to head, tail, init, last, and !! throws an exception.

    ghci> head []
    *** Exception: Prelude.head: empty list
    ghci> xs !! 100
    *** Exception: Prelude.(!!): index too large
    ghci> xs !! (-2)
    *** Exception: Prelude.(!!): negative index
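One way to stay clear of these exceptions is to test the list with null before taking it apart. The sketch below combines null with the if-else from 2.1.3; firstOr and lastOr are hypothetical helpers invented here, not standard functions, and each takes a fallback value to return for an empty list.

```haskell
-- Sketch: checking with null before calling head or last
-- (firstOr and lastOr are made-up helpers, not library functions).
firstOr :: a -> [a] -> a
firstOr def xs = if null xs then def else head xs

lastOr :: a -> [a] -> a
lastOr def xs = if null xs then def else last xs
```

With these, firstOr 0 [] gives 0 instead of crashing, while firstOr 0 [7, 8] gives 7 and lastOr 0 [7, 8] gives 8.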
Some more useful functions:

• maximum - the maximum of a list
• minimum - the minimum [12]
• sum - the sum of a list of numbers
• product - likewise, the product
• elem - checks if an element is a member of a list [13] (usually called infix because it's more readable)
• notElem - the opposite of elem (also called infix).

    ghci> let xs = [8, 5, 3, 4, 10, 2]
    ghci> maximum xs
    10
    ghci> minimum xs
    2
    ghci> sum xs
    32
    ghci> product xs
    9600
    ghci> 5 `elem` xs
    True
    ghci> 22 `elem` xs
    False
    ghci> 22 `notElem` xs
    True

[11] This is not entirely accurate, but it will do for now.
[12] To calculate the maximum, the elements need to have some sort of logical order. A list of numbers or a list of characters are fine, but a list of functions is not.
[13] Needs to be able to equate elements. This may seem pretty standard, but not all stuff can equal other stuff (we'll discuss this in-depth in [XREF]).

A special case, concat, operates on lists of lists: it flattens them. It only removes one layer, though.

    ghci> concat [[2,3],[4,5]]
    [2,3,4,5]
    ghci> concat [[5]]
    [5]
    ghci> concat [[[5]]]
    [[5]]

There are some functions that operate on lists of Bools:

• and - returns True if all the elements are True, False otherwise.
• or - returns True if at least one is True, False otherwise.

    ghci> and [True, True, False]
    False
    ghci> and [True, True, True]
    True
    ghci> or [True, False, False]
    True
    ghci> or [False, False, False]
    False

And neither last nor least (see C.1 for more), reverse reverses a list. It's not very efficient, though, so avoid reversing long lists.

    ghci> reverse [1, 2, 3, 4, 5]
    [5,4,3,2,1]

2.2.3. Ranges
Many times we need to construct lists according to certain rules. Probably the simplest way is by using ranges. Let's see some examples and then discuss them.

    ghci> [1, 2 .. 20]
    [1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20]
    ghci> [1 .. 20]
    [1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20]
    ghci> [1, 3 .. 15]
    [1,3,5,7,9,11,13,15]
    ghci> [1, 7 .. 30]
    [1,7,13,19,25]
    ghci> [3, 2 .. -10]
    [3,2,1,0,-1,-2,-3,-4,-5,-6,-7,-8,-9,-10]

The following will not work.

    ghci> [1, 2, 4, 8 .. 128] -- nope
    ghci> [1 .. 39, 40] -- not this, either

It's pretty obvious: these ranges generate sequences where the difference between consecutive terms is constant (arithmetic progressions). They always go like this: [first element, next element .. last element]. If we need to generate consecutive things, [a .. n] is shorthand for [a, a+1 .. n], which is shorter than writing the whole list by hand.

Furthermore, only arithmetic progressions are possible using ranges. You can, however, specify any step, including negative or noninteger [14] ones.

    ghci> [1, 2.1 .. 5]
    [1.0,2.1,3.2,4.300000000000001,5.400000000000001]

Warning! Using nonintegers in ranges yields undesirable results due to rounding errors.
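A common way around the rounding problem is to build the range from whole numbers and only then scale each element. The sketch below does this with a list comprehension (previewed in 1.2.2 and covered in 2.3) and the Prelude function fromIntegral, which converts an integer to another numeric type; the name tenths is made up.

```haskell
-- Sketch: build the range from integers, then scale (tenths is a made-up name).
tenths :: [Double]
tenths = [fromIntegral n / 10 | n <- [10, 21 .. 50 :: Int]]
-- the integer range is [10,21,32,43], so no element overshoots the upper bound
```

Evaluating tenths gives [1.0,2.1,3.2,4.3]: the stray 5.4 from the noninteger range is gone, and each value is computed by a single division rather than by repeated addition.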
Interestingly, if the upper bound is omitted, ranges generate innite lists, as exemplied below this, press

15 . If you do

Ctrl-C

to stop it.

1 2

ghci > [1..] [1 , 2, 3 , 4 , 5, 22 , 23 , 24 , 40 , 41 , 42 , 58 , 59 , 60 , 76 , 77 , 78 , 94 , 95 , 96 ,


How is this useful? with

6, 7 , 8 , 9, 25 , 26 , 27 , 43 , 44 , 45 , 61 , 62 , 63 , 79 , 80 , 81 , 97 , 98 , 99 ,

10 , 11 , 12 , 13 , 14 , 28 , 29 , 30 , 31 , 32 , 46 , 47 , 48 , 49 , 50 , 64 , 65 , 66 , 67 , 68 , 82 , 83 , 84 , 85 , 86 , 100 , 101 , 102 , 103 ,

15 , 16 , 17 , 18 , 19 , 33 , 34 , 35 , 36 , 37 , 51 , 52 , 53 , 54 , 55 , 69 , 70 , 71 , 72 , 73 , 87 , 88 , 89 , 90 , 91 , 104 , ^ CInterrupted .

20 , 38 , 56 , 74 , 92 ,

21 , 39 , 57 , 75 , 93 ,

Well, let's remember that Haskell is

lazy,

so unless we want something unwise, like

printing all the elements of an innite list (see above) we should be in the clear. We are already familiar

take,

so let's use it in conjunction with ranges.

ghci> take 20 [1..]
[1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20]
ghci> take 5 [13, 26 ..]
[13,26,39,52,65]
ghci> take 11 [1, -2 ..]
[1,-2,-5,-8,-11,-14,-17,-20,-23,-26,-29]

We immediately notice that the computations have ended, so clearly Haskell didn't evaluate the entire infinite list. In fact, when we learn more about functions, we'll see exactly how laziness works [16]. Also, take note: ranges aren't limited to numbers.
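To make the last remark concrete, here is a small sketch of ranges over characters. The definitions and their names are invented for illustration; they rely only on Char supporting the same range syntax as numbers.

```haskell
-- File: char-ranges-sketch.hs (hypothetical file name)

-- Ranges work on characters just as they do on numbers.
lowercase :: [Char]
lowercase = ['a' .. 'z']

-- A step is allowed too: every second letter from 'a' to 'i'.
everyOther :: [Char]
everyOther = ['a', 'c' .. 'i']

-- Laziness again: only the first five capitals are ever produced.
firstCapitals :: [Char]
firstCapitals = take 5 ['A' ..]
```

In GHCi, everyOther should print as "acegi" and firstCapitals as "ABCDE".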

2.2.4. Cycling Lists


What if we want a number repeated over and over? We can do [1, 1 ..], and that's perfectly okay. There are three functions we have omitted from 2.2.2, and they will make it more readable. Additionally, they have the advantage of being functions, which will help with Problem Z. Here they are:

[14] With decimals.
[15] Disclaimer: we won't actually print infinitely many numbers.
[16] It's not unlike if-else in other languages: if the statement is true, the else branch won't evaluate, and vice versa.

repeat repeats an element into an infinite list. We'll probably want to take a finite number of elements, though.
cycle repeats an entire list. Again, we'll want to take elements.
replicate repeats an element a specified number of times.

ghci> take 10 (repeat 5)
[5,5,5,5,5,5,5,5,5,5]
ghci> take 10 (cycle [5, 4])
[5,4,5,4,5,4,5,4,5,4]
ghci> replicate 10 4
[4,4,4,4,4,4,4,4,4,4]
Warning! Do not confuse repeat and cycle: they do very different things.

ghci> take 10 (repeat [5, 4])
[[5,4],[5,4],[5,4],[5,4],[5,4],[5,4],[5,4],[5,4],[5,4],[5,4]]
ghci> take 10 (cycle [5, 4])
[5,4,5,4,5,4,5,4,5,4]
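The three functions are closely related, which a short sketch can show. The primed names below are made-up stand-ins, not the library definitions, and concat (a standard Prelude function that flattens a list of lists) is assumed here without having been introduced.

```haskell
-- File: cycling-sketch.hs (hypothetical file name)

-- replicate n x behaves like taking n elements from repeat x.
replicate' :: Int -> a -> [a]
replicate' n x = take n (repeat x)

-- cycle xs behaves like gluing infinitely many copies of xs together.
-- (Unlike the real cycle, this version loops forever on an empty list
-- instead of reporting an error.)
cycle' :: [a] -> [a]
cycle' xs = concat (repeat xs)
```

This also explains the warning above: repeat [5, 4] yields the copies as separate lists, and only flattening them gives what cycle [5, 4] produces.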

2.3. List Comprehensions


2.3.1. Basics
We've seen how to declare, manipulate and, to an extent, generate lists. We will now learn one of the most powerful tools in all of Haskell, list comprehensions. Let's start with basic examples and move on from there.

ghci> [ x | x <- [1..20] ]
[1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20]
ghci> [ x | x <- [1..20], even x ]
[2,4,6,8,10,12,14,16,18,20]
ghci> [ x | x <- [1..20], x > 6 ]
[7,8,9,10,11,12,13,14,15,16,17,18,19,20]
ghci> [ x | x <- [1..20], even x, x > 6 ]
[8,10,12,14,16,18,20]
ghci> [ x | x <- [1..20], even x, x > 6, odd x ]
[]
ghci> [ a ++ b | a <- ["Haskell ", "C "], b <- ["syntax", "types"] ]
["Haskell syntax","Haskell types","C syntax","C types"]
ghci> [ x + 3 | x <- [1, 6 .. 30] ]
[4,9,14,19,24,29]
ghci> [ x + 3 | x <- [1, 6 .. 30], even x ]
[9,19,29]
ghci> [ a ++ " is fun!" | a <- ["Haskell", "Perl", "C", "Lisp"] ]
["Haskell is fun!","Perl is fun!","C is fun!","Lisp is fun!"]
Anyone who's seen and understood mathematical set comprehensions can just skim the rest of the section. 2.3.2 is worth reading carefully, though. List comprehensions have two components (let's take [ 2*x | x <- [1, 3, 4], odd x ] as an example):

The left-hand side contains the expression to be evaluated (in our case, 2*x).
The right-hand side has:
    A base list from which x is extracted: x <- [1, 3, 4]
    A list of predicates (filters) that must be satisfied (in this case, we have only one): odd x

In order to understand better, let's manually calculate the above comprehension, step by step.

1. Find the base list: [1, 3, 4].
2. Take the first element from the base list and call it x.
3. Check the truth value of the predicates (in this case, only one): odd x.
4. If all the predicates are satisfied, evaluate the left-hand side expression for x: 2*x, then add it to the result list.
5. Do the above steps for all elements in the base list.

Voilà: the result is [2, 6].

It's important to note that internally, Haskell does things a little differently. However, the result is the same so it shouldn't bother us.
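The manual steps can be mirrored with two ordinary list functions. This is only a sketch of the idea, not a description of what the compiler does internally; map and filter are standard Prelude functions assumed here, and the names doubledOdds and doubledOdds' are invented.

```haskell
-- File: comprehension-steps.hs (hypothetical file name)

-- The comprehension from the text.
doubledOdds :: [Int]
doubledOdds = [ 2*x | x <- [1, 3, 4], odd x ]

-- The same steps spelled out: keep the elements that satisfy the
-- predicate (filter), then evaluate the left-hand side for each (map).
doubledOdds' :: [Int]
doubledOdds' = map (2 *) (filter odd [1, 3, 4])
```

Both definitions should evaluate to [2,6].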

2.3.2. Advanced Uses


We can also combine two, three or more base lists, more predicates etc. The order of the base lists determines the order of the result list, as we can see from the first example. The predicates are calculated left-to-right so it's recommended that more powerful filters be put first.

ghci> [ 10*a + b | a <- [1..3], b <- [1..3] ]
[11,12,13,21,22,23,31,32,33]
ghci> [ x * y | x <- [2, 4, 6], y <- [10, 100, 1000] ]
[20,200,2000,40,400,4000,60,600,6000]
ghci> [ x * y | x <- [1..4], y <- [1..3], even (x + y) ]
[1,3,4,3,9,8]
ghci> [ x + y | x <- [3..6], y <- [2, 4, 8], x <= y ]
[7,11,8,12,13,14]
Because a list comprehension is an expression, we can put it in the left-hand side of another one: comprehensions inside comprehensions.

ghci> let xss = [[1, 2, 3, 4, 5], [4, 5, 6, 7], [7, 8, 9, 10]]
ghci> [ [ x | x <- xs, x >= 5 ] | xs <- xss ]
[[5],[5,6,7],[7,8,9,10]]
Moreover, instead of specifying an upper bound in a base list, we can take a number of results afterwards.

ghci> take 5 [ a | a <- [1..], b <- [1..a], c <- [1..b], a^2 == b^2 + c^2 ]
[5,10,13,15,17]
There are a few catches, however, some very serious.

ghci> take 20 [ x | x <- [1..], x < 10 ]
[1,2,3,4,5,6,7,8,9^CInterrupted   -- this would never finish
ghci> take 5 [ x | x <- [1..], x < 10 ]
[1,2,3,4,5]   -- this works fine because Haskell is lazy
Warning! Make sure Haskell can find at least as many items as you take.

Some problems are harder to spot without running the code. For instance, Haskell never tries x = 2 in the following example, because it has plenty of ys to choose from.

ghci> take 20 [ x * y | x <- [1..], y <- [1..] ]
[1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20]

To repeat, Haskell tries all the values from the last base list before continuing, so avoid having more than one unbounded base list, because it will either not give us what we want (see above) or run indefinitely [17] (see below). Actually, there is a mountain of theory on this issue, such as this paper (advanced content).

ghci> take 10 [ x * y | x <- [1..], y <- [1..], y <= x ]
[1^CInterrupted.   -- bad idea, runs indefinitely
ghci> take 10 [ x * y | x <- [1..], y <- [1..x] ]   -- do this instead
[1,2,4,3,6,9,4,8,12,16]
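For the first catch (asking for 20 numbers below 10 from an infinite list), one possible way out is to stop scanning as soon as the condition fails. The sketch assumes the standard Prelude function takeWhile, which the text has not introduced; belowTen and twentyAtMost are made-up names.

```haskell
-- File: catches-sketch.hs (hypothetical file name)

-- A filter has to test every element, so on an infinite base list it
-- never learns that no further matches exist.
-- takeWhile stops for good at the first element that fails the test.
belowTen :: [Int]
belowTen = takeWhile (< 10) [1..]

-- Asking for more items than exist is now harmless.
twentyAtMost :: [Int]
twentyAtMost = take 20 belowTen
```

This only helps when the matching elements form a prefix of the list, as they do here; twentyAtMost should evaluate to [1,2,3,4,5,6,7,8,9] and then finish.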

Mastering all the subtleties of list comprehensions takes a lot of time and experience, so let's move on. We'll learn as we go.

2.3.3. Practical Applications


On the up side, list comprehensions have many practical uses. The classical example is determining the length of a list. We'll need to apply our knowledge of list functions here, namely sum.

ghci> sum [ 1 | x <- [3 .. 20] ]
18

It works, but we're not really using x anywhere, so it's a waste of a perfectly good variable name. The solution is to write an underscore whenever a variable name is not needed.

ghci> sum [ 1 | _ <- [3 .. 20] ]
18


If we want to use them repeatedly, we can declare functions with list comprehensions. Some examples:

-- File: comprefunctions.hs
length' xs = sum [ 1 | _ <- xs ]
vowels string = [ c | c <- string, c `elem` "aeiou" ]
removeVowels string = [ c | c <- string, c `notElem` "aeiou" ]
allSums xs ys = [ x + y | x <- xs, y <- ys ]

ghci> length' [2, 4 .. 10]
5
ghci> length' []
0
ghci> vowels "hello world"
"eoo"
ghci> removeVowels "hello world"
"hll wrld"
ghci> allSums [1, 2, 3] [4, 5]
[5,6,6,7,7,8]
Functions and lists have a lot of power. We'll be using them extensively throughout this book (and even outside it) so it's better to take our time and make sure we understand as much as we can at this point. Things are only going to get harder as we advance.

[17] This is called a diverging computation.

3. Types, Typeclasses, and Polymorphism


I should actually think before coding, but the type system is so good :)

(Cale)

3.1. Understanding Types


3.1.1. Knowing Types
In most of the programming world, every variable has a type: an integer, a character, a boolean etc. But more often than not, they're there for cosmetic purposes: most compilers will happily add a number to a character. That doesn't make much sense, does it?

Fortunately, Haskell has a strong type system. That means that however similar their internal representations are, the compiler won't allow us to perform illogical calculations on them, such as multiplying an integer with a boolean. This may seem restrictive (and it sometimes is), but it helps avoid certain types of errors (type errors). Moreover, Haskell features static typing, which means all types are known at compile-time, so if the program has a type error, it won't even compile. As an added bonus, Haskell has type inference, so we don't need to manually specify the type of everything we use. Basically, the compiler can figure out on its own that a numeric literal is a number or "hello" is a string [3].

In GHCi, we can use :t to determine the type of an expression (:: means "has the type of").

ghci> :t 'a'
'a' :: Char
ghci> :t "abcd"   -- same as ['a', 'b', 'c', 'd']
"abcd" :: [Char]
ghci> :t 'a':'b':'c':'d':[]   -- same as "abcd"
'a':'b':'c':'d':[] :: [Char]
ghci> :t False
False :: Bool
ghci> :t "hello" == "world"   -- returns False
"hello" == "world" :: Bool
We know that [] denotes a list, so it's easy to conclude that [Char] means a list of characters. The others are self-explanatory. This is just a very short example; we'll be seeing more in the future. We also immediately notice that all types begin with a capital letter. This is the reason why variable and function names are lowercase [4]. Below is a recap of the most widely used types in Haskell. We'll be running into these all the time.

[1] One might argue that 'z' is 'a' + 25, but Haskell won't let you do that.
[2] Imagine working on a long, difficult physics problem asking for some velocity, but after hours of calculations, the result is in kilograms. That can't be good.
[3] It can even deduce more complex types just as easily.
[4] The capitalization technique used for functions in Haskell is informally named camelCase.

Int is a bounded integer. On 32-bit systems it's between -2^31 and 2^31 - 1.
Integer is an arbitrarily large integer. It's slightly less efficient than Int.
Float is a single-precision floating point.
Double is a double-precision floating point. Due to optimizations, Double can be faster than Float.
Bool is a boolean. It can be either True or False. 1 and 0 won't work.
Char represents (by default) a Unicode character.
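The difference between Int and Integer is easy to see in a small sketch. The names are invented for illustration; minBound, maxBound and product are standard Prelude definitions that the text has not introduced here.

```haskell
-- File: int-bounds-sketch.hs (hypothetical file name)

-- The exact limits of Int depend on the machine; minBound and maxBound
-- report them.
smallestInt, largestInt :: Int
smallestInt = minBound
largestInt  = maxBound

-- Integer has no limits: 25! does not fit in a 64-bit Int,
-- but it is an ordinary Integer.
bigFactorial :: Integer
bigFactorial = product [1 .. 25]
```

In GHCi, bigFactorial should print as 15511210043330985984000000, while largestInt shows whatever upper limit the current system uses.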

If we try to mix wrong types, Haskell throws a type error. It usually looks like this:

ghci> 3 + 'a'
<interactive>:1:1:
    No instance for (Num Char)
      arising from the literal `3'
    Possible fix: add an instance declaration for (Num Char)
    In the first argument of `(+)', namely `3'
    In the expression: 3 + 'a'
    In an equation for `it': it = 3 + 'a'

Basically GHCi tells us that it doesn't know how to add 'a' to 3, because 'a' is not a number. An extremely detailed dissection of type errors in GHCi is presented in B.2.1.

3.1.2. Type Declarations


In Haskell, functions have types too. We mentioned that Haskell can infer the type of an expression on its own. However, it's possible to manually declare the type of a function. This helps us to:

Clarify our thoughts
Make code more readable
Avoid mistakes

The type declarations make functions much more expressive. Although Haskell could have inferred by itself what the types of the functions are (like in the 2.1.3 and 2.3.3 examples), we chose to give explicit type declarations to illustrate the method. In type declarations the parameters (and the return type) are separated by ->, regardless of how many of them there are [5].

-- File: functions2.hs
triple :: Int -> Int
triple x = 3 * x

strangeAddition :: Int -> Int -> Int
strangeAddition x y = x + triple y

squareTwo :: Double -> Double -> Double
squareTwo x y = (x + y)^2

vowels :: [Char] -> [Char]
vowels word = [ c | c <- word, c `elem` "aeiou" ]

sumLists :: [Int] -> [Int] -> [Int]
sumLists xs ys = [ x + y | x <- xs, y <- ys ]

[5] Problem Z is at work here. We'll see why it's not something like Int, Int -> Int.

Warning! The parameters and the return type are not differentiated: all are separated by ->.

In fact, type declarations give us so much information that we can even deduce what a function does simply from its type declaration. Let's take f :: [Char] -> Int as our example. This function takes a list of characters (a string) and returns an integer. We can reasonably infer that the function takes the string and performs some sort of counting (such as finding out the total length or counting all the spaces) or other calculation (such as a hash function). Indeed, f is defined like so: f xs = sum [ 1 | x <- xs, x `elem` "abc" ]. The function counts all occurrences of the letters a, b, and c in a given string, so our assessment was spot-on.

Because of this tremendous advantage, we'll be giving type declarations to (almost) every function we write from now on. Oh, and just so we don't forget: if we have two functions with the same type declarations, we don't need to repeat ourselves. We separate the function names with commas in their type declaration.

-- File: functions2.hs (CONTINUED)
sum1, sum2 :: Int -> Int -> Int -> Int
sum1 x y z = x + y + z
sum2 x y z = x + y - z

3.2. Polymorphism
3.2.1. Type Variables
Until now, we've defined functions of type Int -> Int or [Char] -> Int. But what about functions like head? If we give head a type declaration of [Int] -> Int, for example, it will work only with integers. But head works with basically every type of element. So what is head's type?

ghci> :t head
head :: [a] -> a

In the above snippet of code, a [6] is what we call a type variable. It's some sort of generic type. Because head doesn't require specific behavior out of its parameters (unlike ==, for instance, which requires parameters that can be equated), we can use a to make an extremely general function. Basically [a] -> a tells us that it accepts a list of any type and returns an element of the same type.

This is called polymorphism: whenever we use a type variable, we indicate that the function does not expect a specific behavior, so it basically works as-is for a variety of inputs.
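Type variables work the same way in functions we write ourselves. The two definitions below are invented examples, offered as a sketch rather than anything from the standard library.

```haskell
-- File: typevars-sketch.hs (hypothetical file name)

-- Works for a list of any element type: nothing here inspects the
-- elements. (Like head, it fails on lists that are too short.)
second :: [a] -> a
second xs = head (tail xs)

-- Two type variables: the two lists may hold different element types.
sameLength :: [a] -> [b] -> Bool
sameLength xs ys = length xs == length ys
```

For instance, second [1, 2, 3] should give 2, second "abc" should give 'b', and sameLength [1, 2, 3] "abc" should be True.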

3.2.2. Typeclasses
We've seen some of the most specific type signatures (like Int -> Int or Char -> Int -> Bool) and the most general (for example, [a] -> a, [a] -> [a] -> [a]), but what if we require something in between? For this, we need typeclasses. Typeclasses group types with a common behavior. Each internal definition of a typeclass contains a collection of functions that must work for all members of that typeclass. It's pretty simple really. Typeclasses are presented in depth in B.1 (strongly recommended reading). In the following we'll try to explain how they interact. For this, we'll consider Num and Integral. Num contains all types of numbers, but Integral only integers.

[6] It doesn't need to have only one letter, but for conciseness, we'll use a, b, c etc.

In addition, for something to be an Integral, it must also be a Num. We can logically conclude that Integral is some sort of a subclass [7] of Num: it is more specific. The more specific a typeclass, the more operations are possible within it. For example, Num supplies, among others, +, -, * and abs. Integral offers, in addition, things like div (integer division; / in other languages) and mod (modulo; % in other languages).

If we just write 20 or 30, they're any type [8] of numbers. But as soon as we perform an Integral specific function on them, they (and the result of the operation) can no longer be Floats or Rationals or whatever. We'll get round to => in a few moments.

ghci> :t 20
20 :: Num a => a
ghci> :t 30
30 :: Num a => a
ghci> :t 20 `div` 5
20 `div` 5 :: Integral a => a
ghci> :t 20 `mod` 30
20 `mod` 30 :: Integral a => a
This is the gist of typeclasses and polymorphism: they group common behavior so we can make very general functions. If we make a sort function, we can be certain that it won't only work with lists of numbers, but also with strings or anything else that can be ordered. At this point, it's a good idea to go through the typeclasses described in B.1. They're very useful.
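A short sketch of that idea, using the sort function that ships in the standard Data.List module and the Ord typeclass (the class of things that can be ordered). The function smallest is a made-up example, and the => notation in its type is the one explained in the next section.

```haskell
-- File: ord-sketch.hs (hypothetical file name)
import Data.List (sort)

-- The Ord constraint is all that sort asks of the element type, so
-- this one definition covers numbers, characters, strings and more.
-- (It fails on an empty list, because head does.)
smallest :: Ord a => [a] -> a
smallest xs = head (sort xs)
```

smallest [3, 1, 2] should give 1, smallest "haskell" should give 'a', and smallest ["pear", "apple"] should give "apple".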

3.2.3. Making Polymorphic Functions


Now let's see how we actually use typeclasses: in type declarations, mostly. Here are a few examples:

ghci> :t (+)
(+) :: Num a => a -> a -> a
ghci> :t (^)
(^) :: (Num a, Integral b) => a -> b -> a
ghci> :t pi
pi :: Floating a => a
ghci> :t show
show :: Show a => a -> String
It seems polymorphic functions really do use the => a lot. Basically, everything before the => is a class constraint. In the first example, it tells the compiler (and us) that a is a member of Num [9]. The actual type of the function is right after the =>.

When we read such a definition, we usually do it (somewhat) from right to left. We shall use (^) :: (Num a, Integral b) => a -> b -> a as an example.

(^) is the name of the function. In this case it's surrounded by parentheses because it consists only of symbols.
:: means "has type of". Now we jump to the bit after the =>.
a -> b -> a means the function takes a parameter of a type (a), a parameter of another type (b) and returns a value of the first type (a).
(Num a, Integral b) is the last thing we read: it tells us that a is any type of number but b is an integer [10].

[7] Calling it a subclass is not technically correct, but it is intuitively true.
[8] We've avoided using "kind" to the point of repeating ourselves. This is not due to lack of vocabulary: in Haskell, kind means something different. Kinds are explained in [XREF] (advanced topic).
[9] We can also have multiple class constraints by surrounding them in parentheses and separating them with commas, like in (^).
Now we'll apply our newly-gained knowledge to make our functions more general. We'll recycle examples from 2.1.3, 2.3.3, and 3.1.2.

-- File: polyfunctions.hs
triple :: Num a => a -> a
triple x = 3 * x

strangeAddition :: Num a => a -> a -> a
strangeAddition x y = x + triple y

c :: Num a => a
c = 4

length' :: Num a => [b] -> a
length' xs = sum [ 1 | _ <- xs ]

vowels :: [Char] -> [Char]
vowels word = [ c | c <- word, c `elem` "aeiou" ]

sumLists :: Num a => [a] -> [a] -> [a]
sumLists xs ys = [ x + y | x <- xs, y <- ys ]

A great thing about Haskell is that if our type definitions are wrong (i.e., they are incompatible with the function itself), an error is thrown. Apart from the obvious advantage, this means we can cheat and let Haskell infer the type for us, then copy-paste it in our file.

ghci> let spaces xs = sum [ 1 | x <- xs, x == ' ' ]
ghci> :t spaces
spaces :: Num a => [Char] -> a

-- File: polyfunctions.hs (CONTINUED)
spaces :: Num a => [Char] -> a
spaces xs = sum [ 1 | x <- xs, x == ' ' ]

3.2.4. Drawbacks
We've seen how we can make our programs more readable and reliable by adding type definitions. The good news is that we can't accidentally add centimeters and inches. The bad news is that we can't add an integer and a floating point.

What? Of course we can do stuff like 4 + 5.1, but that's different. Let's see.

ghci> 4 + 5.1
9.1
ghci> (4 :: Int) + (5.1 :: Float)
<interactive>:1:15:
    Couldn't match expected type `Int' with actual type `Float'
    In the second argument of `(+)', namely `(5.1 :: Float)'
    In the expression: (4 :: Int) + (5.1 :: Float)
    In an equation for `it': it = (4 :: Int) + (5.1 :: Float)

[10] It can be any one of the 7 types of integer Haskell has.

It seems that it all blows up if we force the types. The above error tells us, quite clearly, that it expected 5.1 to be an Int rather than a Float. Haskell can't add two different types [11]. The keen reader will remember that we previously mentioned polymorphic constants. We can easily check if this is the case here.

ghci> :t 4
4 :: Num a => a
ghci> :t 5.1
5.1 :: Fractional a => a
ghci> :t (4 + 5.1)
(4 + 5.1) :: Fractional a => a

Aha! So 4 can take any number type (Int, Complex, Rational, Float, Double etc.), but 5.1 is a fractional (Float, Double etc.). Naturally, adding them means that 4 can have only the types 5.1 can have, so anything in Fractional [12].
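When the types really are fixed, the usual way out is an explicit conversion. The sketch below assumes the standard Prelude function fromIntegral, which the text has not covered; addMixed and average are made-up names.

```haskell
-- File: conversion-sketch.hs (hypothetical file name)

-- fromIntegral turns an Int (or any other Integral) into whichever
-- number type the surrounding expression needs.
addMixed :: Int -> Float -> Float
addMixed n x = fromIntegral n + x

-- The same trick makes an average possible: sum and length give Ints,
-- but (/) only works on fractional types.
-- (On an empty list this divides zero by zero.)
average :: [Int] -> Double
average xs = fromIntegral (sum xs) / fromIntegral (length xs)
```

With these, addMixed 4 5.1 type-checks where (4 :: Int) + (5.1 :: Float) did not, and average [1, 2, 3, 4] should give 2.5.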

Right now, things may seem confusing (and rightfully so). The most important thing to remember here is to make type declarations as general as possible, but not more general. In bullet points:

Specific declarations limit a function to a certain type or typeclass: triple :: Int -> Int.
General declarations make a function versatile [13]: triple :: Num a => a -> a.
Too general declarations are incorrect and throw errors: triple :: a -> a.
If we're not sure of a type, we should leave it blank. The compiler always infers types better than the user [14].

Some food for thought: what happens if a typeclass has the same name as a type? So, for example, we have sillyFunction :: Derp a => a -> Derp. How do we distinguish between the first Derp and the second one? Well, they're logically different: one is a typeclass, the other a type. It doesn't matter if both have the same name. Does anyone ever confuse Jack the actor with Jack the movie character [15]? In technical terms, we say that they have different kinds (we'll talk more about them in [XREF]). As a thought experiment this works, but standard Haskell is stricter: type names and typeclass names share one namespace, so a type and a typeclass cannot have the same name in the same scope. The name sharing that is a pretty frequently used technique is between a type and the constructor of its values, which the compiler won't ever confuse: we don't want to... pollute the namespace.

3.3. Case Study: Tuples


3.3.1. Lists Recap
We mentioned lists are homogeneous and have variable length (2.2.1). Before continuing, let's explore this from a new perspective: types.

ghci> :t [1, 2, 3]
[1, 2, 3] :: Num t => [t]
ghci> :t [1, 2, 3, 4]
[1, 2, 3, 4] :: Num t => [t]
ghci> :t (:)
(:) :: a -> [a] -> [a]
ghci> :t (++)
(++) :: [a] -> [a] -> [a]

[11] The addition operator (+) is of the type Num a => a -> a -> a.
[12] Actually, it should look like (4 + 5.1) :: (Num a, Fractional a) => a, but because Fractional is included in Num, it's the same thing.
[13] Sometimes we want to avoid that. For example, maybe we want a function that can only triple integers so we don't accidentally get rounding errors.
[14] Unless, of course, it's released software: type definitions are half the documentation.
[15] Or for physicists, the length with the acceleration.
Even if we don't know anything about lists, from the above piece of code we can draw two very important conclusions:

No matter how long a list is, its type is the same. This makes them essentially variable in length: we have do-it-all functions that can lengthen (:, ++ etc.) or shorten (take, drop etc.) any list, regardless of length.
Both : and ++ take identical types as parameters, so there's no way we can get away with adding a different type of element to a list.

This translates into our current knowledge of lists: variable length and homogeneity. It reinforces the idea that we can learn a great deal simply by analyzing types.

3.3.2. Understanding Tuples


Let's say we heard of a new Haskell feature: we can put stuff in parentheses and separate the items with commas. These structures are called tuples [16]. Unfortunately all the documentation is lost (yeah, right). It may not seem like a lot, but we can extract a wealth of information from the little we know. First, let's see if we got the syntax right and try various things to see if they work.

ghci> (4, 5, 6)
(4,5,6)
ghci> (10, 2, 3, 3)
(10,2,3,3)
ghci> (85, "Hello")
(85,"Hello")
ghci> ('a', "Haskell", 15, "never", "easy")
('a',"Haskell",15,"never","easy")
ghci> ()
()
ghci> ('a')
'a'
ghci> (20)
20
Let's draw some partial conclusions about tuples:

They can be any size.
They are not necessarily homogeneous.
There is such a thing as an empty tuple: ().
Single-element tuples are the same as the elements themselves [17].

Let's see what types they are.

ghci> :t (4, 5, 6)
(4, 5, 6) :: (Num t1, Num t2, Num t) => (t, t1, t2)
ghci> :t (10, 2, 3, 3)
(10, 2, 3, 3) :: (Num t1, Num t3, Num t2, Num t) => (t, t1, t2, t3)
ghci> :t (85, "Hello")
(85, "Hello") :: Num t => (t, [Char])
ghci> :t ('a', "Haskell", 15, "never", "easy")
('a', "Haskell", 15, "never", "easy")
  :: Num t => (Char, [Char], t, [Char], [Char])
ghci> :t ()
() :: ()
ghci> :t ('a')
('a') :: Char
ghci> :t (20)
(20) :: Num a => a

[16] For the record, that's not a new feature.
[17] That's pretty obvious: all we did is surround them with parentheses.
So the type of the tuple contains the types of all the elements inside it. This means:

Tuples have an essentially fixed length [18].
An empty tuple is its own type: () is of type ().

We've also inadvertently learned that type definitions can be split across multiple lines (as long as the next lines are indented slightly to the right).

3.3.3. Functions on Tuples


We now make a horrible typo:

ghci> (,)
<interactive>:1:1:
    No instance for (Show (a0 -> b0 -> (a0, b0)))
      arising from a use of `print'
    Possible fix:
      add an instance declaration for (Show (a0 -> b0 -> (a0, b0)))
    In a stmt of an interactive GHCi command: print it
The error says: the type of (,), which is a0 -> b0 -> (a0, b0) (a function [19]) is not a member of the Show typeclass (which is no surprise seeing we can't print functions). So what does (,) do? It's safe to say that it creates a tuple from its two parameters [20]. By the same logic we have (,,), (,,,) etc.

ghci> (,) 5 6
(5,6)
ghci> (,) 123 "abc"
(123,"abc")
ghci> (,,) 'a' 16 "ddx"
('a',16,"ddx")

It's more readable to just do it normally, like (5, 6). Like all prefix functions, (,) comes in handy for Problem Z.

Another thought experiment: let's imagine that somebody told us about two useful functions, fst and snd, but they didn't mention what they do. As always, we want to check their types first.

[18] We can write functions to add an element to a tuple of a specific size (and type) but never universal ones that work on all of them.
[19] One that takes two values (of any two types) and returns a tuple which contains them.
[20] 2-tuples (those made using (,)) are usually called pairs (or sometimes doubles), 3-tuples are triple(t)s etc.

ghci> :t fst
fst :: (a, b) -> a
ghci> :t snd
snd :: (a, b) -> b

Now it's clear. fst must take the first element of a pair, and snd, the second.

ghci> fst (5, "a")
5
ghci> snd (5, "a")
"a"
ghci> fst (1, 2, 3)   -- whoops, error

Warning! fst and snd only work on pairs. There are no built-in functions for triples or larger.

3.3.4. Applications
Tuples are especially useful in conjunction with functions or list comprehensions, namely when we want to return multiple things. We now go back to some of the 2.3.2 examples, and try to improve them.

ghci> [ (a, b) | a <- [1..3], b <- [1..3] ]
[(1,1),(1,2),(1,3),(2,1),(2,2),(2,3),(3,1),(3,2),(3,3)]
ghci> [ (x, y, x + y) | x <- [1..4], y <- [1..3], even (x + y) ]
[(1,1,2),(1,3,4),(2,2,4),(3,1,4),(3,3,6),(4,2,6)]
ghci> take 5 [ (a, b, c) | a <- [1..], b <- [1..a], c <- [1..b], a^2 == b^2 + c^2 ]
[(5,4,3),(10,8,6),(13,12,5),(15,12,9),(17,15,8)]
So far, so good. Tuples seem to be okay for trivial uses, but where they really work wonders is in larger, more complex programs. A classic example is splitting a list in order to work on both parts simultaneously. We'll look deeper into this in [XREF] and [XREF].

ghci> let splitHead xs = (head xs, tail xs)
ghci> splitHead [1, 5, 3, 2, 6]
(1,[5,3,2,6])
ghci> splitHead []
(*** Exception: Prelude.head: empty list
Of course, we can't perform splitHead on an empty list, because it has no head. A better, built-in function called splitAt solves our problems gracefully.

ghci> :t splitAt
splitAt :: Int -> [a] -> ([a], [a])

It seems that splitAt also takes an Int apart from the list, and returns a pair of lists so it's logical to think that:

1. It will split the list at any point, and
2. It won't give us unexpected errors for out-of-bounds values.

ghci> splitAt 5 [1..10]
([1,2,3,4,5],[6,7,8,9,10])
ghci> splitAt 1 [2, 3, 5, 8]
([2],[3,5,8])
ghci> splitAt 0 [2, 3, 5, 8]
([],[2,3,5,8])
ghci> splitAt (-1) [2, 3, 5, 8]
([],[2,3,5,8])
ghci> splitAt 5 [1, 2]
([1,2],[])
ghci> splitAt 1 []
([],[])
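The graceful handling of out-of-bounds values is easier to believe once splitAt is rebuilt from functions we already know. The definition below is a sketch with a made-up name (splitAt'), not the library's actual source.

```haskell
-- File: splitat-sketch.hs (hypothetical file name)

-- take and drop already cope with out-of-bounds counts,
-- so pairing them up gives the same results as splitAt.
splitAt' :: Int -> [a] -> ([a], [a])
splitAt' n xs = (take n xs, drop n xs)
```

Trying splitAt' with the same arguments as in the session above should give the same pairs, including for 0, -1 and counts past the end of the list.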
That's it for now! We'll return to types later on, but our next big step is mastering functions with advanced syntax and everything.


Part II.

Getting the Hang of It


4. Exploring Syntax
Uninformed people believe that syntax is the hardest part of learning a language.

(kmc)

4.1. Pattern Matching


4.1.1. Basics
We've seen the if-else in action (2.1.3). A serious downside is that it uses so much space. What if we want to create a mini-dictionary?

-- File: useless-dict.hs
engGer :: [Char] -> [Char]
engGer word = if      word == "one"   then "eins"
              else if word == "two"   then "zwei"
              else if word == "three" then "drei"
              else if word == "four"  then "vier"
              else if word == "five"  then "fünf"
              else if word == "six"   then "sechs"
              else "I don't know what " ++ word ++ " means."

That works perfectly, apart from the fact that it looks awful and contains lots of superfluous information, such as the first if or the second if or the third if...

Fortunately, we can do this instead:

-- File: patterns.hs
engGer :: [Char] -> [Char]
engGer "one"   = "eins"
engGer "two"   = "zwei"
engGer "three" = "drei"
engGer "four"  = "vier"
engGer "five"  = "fünf"
engGer "six"   = "sechs"
engGer word    = "I don't know what " ++ word ++ " means."

A few things to note:

It looks much better [2].
We don't need to align the =s but it increases readability.
We have one function body for each use case.

[1] Bear with us: the first examples are really boring.
[2] ...but it's still inefficient to write a dictionary like that.

In the second example we have used something called pattern matching. Essentially, Haskell looks at each of the patterns (from top to bottom) [3], and if one works, it will evaluate the corresponding function body. It's pretty simple if we think about it. To clarify, the syntax looks like:

-- Syntax: pattern matching
function pattern1 = result1
function pattern2 = result2
function pattern3 = result3
function pattern4 = result4
...
If we're not careful, our pattern matching can fail. This happens mostly when we don't cover all our angles: we forget to consider a case.

-- File: patterns-wrong.hs
intToString :: Int -> [Char]
intToString 1 = "one"
intToString 2 = "two"
intToString 3 = "three"

This example is boring, but it illustrates the issue quite well. It's obvious that all cases except 1, 2, and 3 are missing, but in real life things may not be so straightforward. GHCi throws an error when it can't find a corresponding pattern to match the input.

These errors are particularly dangerous because the compiler can't find them right away: it has to be given an incorrect input, and by that time it might be too late. We can use :set -fwarn-incomplete-patterns and GHCi will warn us on non-exhaustive patterns, but this isn't 100% guaranteed, so it's better to check personally.

ghci> intToString 3
"three"
ghci> intToString 20
*** Exception: patterns-wrong.hs:(4,1)-(6,23): Non-exhaustive patterns in function Main.intToString

Warning! Make sure all possible cases are covered in pattern-matching.
The obvious solution is to introduce some sort of catch-all pattern.

    -- File: patterns-wrong.hs (FIXED)
    intToString :: Int -> [Char]
    intToString 1 = "one"
    intToString 2 = "two"
    intToString 3 = "three"
    intToString n = "I don't know about " ++ show n

In this case, everything is well. The program won't crash when we give an unexpected input, but it won't do anything useful either. As we progress, we'll learn how to deal with increasingly complex scenarios.

    ghci> intToString 20
    "I don't know about 20"


For the avid reader, B.2.3 shows a basic method of customizing error messages, useful when we don't really want to fix them.

[3] If we move engGer word = ... to the top, it will always say I don't know ..., because word fits anything (it's just a variable name) and is checked first.
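The effect of pattern order can be sketched as follows. This is only an illustration: the names engGerBad and engGerGood are made up for this example, and only one translation is kept to keep it short. GHC still compiles the first version (it merely warns that the second equation is redundant).

```haskell
-- Hypothetical names, for illustration only.

-- Catch-all first: the variable pattern matches every input,
-- so the second equation is never reached.
engGerBad :: String -> String
engGerBad word  = "I don't know what " ++ word ++ " means."
engGerBad "one" = "eins"

-- Specific pattern first, catch-all last: behaves as intended.
engGerGood :: String -> String
engGerGood "one" = "eins"
engGerGood word  = "I don't know what " ++ word ++ " means."
```

Calling engGerBad "one" yields the "I don't know" message, while engGerGood "one" yields "eins".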


4.1.2. Applications
We don't actually want to use pattern matching just as a glorified if-else. Where it really shines is in matching patterns, not boring numbers (although it can certainly do that as well).


Earlier (3.3.3), we wanted to do fst on a triple. We can't do that, but at this point we know very well that we can make our own function. Let's do it.

    -- File: patterns2.hs
    fst3 :: (a, b, c) -> a
    fst3 (x, _, _) = x

    ghci> fst3 ("Mike", "Adams", 23)
    "Mike"

Now that we know it works, it's a breeze to implement the whole lot.

    -- File: patterns2.hs (CONTINUED)
    snd3 :: (a, b, c) -> b
    snd3 (_, y, _) = y

    trd3 :: (a, b, c) -> c
    trd3 (_, _, z) = z


Let's say we're mathematicians with Haskell knowledge. We have a simple task ahead of us: multiplying two 2D vectors. What does that mean? Basically we are given two pairs like (a, b) and (c, d); the result of the multiplication is (a * c, b * d). Easy as pie[4]. Before learning pattern matching, we might have done something like:

    -- File: vectors.hs
    mulVct :: Num a => (a, a) -> (a, a) -> (a, a)
    mulVct a b = (fst a * fst b, snd a * snd b)

It works perfectly well (we can try it), but it's not quite what we wanted. Let's arm ourselves with patterns and try again.

    -- File: vectors.hs (FIXED)
    mulVct :: Num a => (a, a) -> (a, a) -> (a, a)
    mulVct (a, b) (c, d) = (a * c, b * d)


The end result is equivalent in both cases. The obvious difference is in readability. Even though the computer doesn't care, our human readers will be thankful for our design choices.

    ghci> mulVct (1,2) (3,4)
    (3,8)
    ghci> mulVct (0,1) (5,10)
    (0,10)

A word of warning: Num a => (a, a) -> (a, a) -> (a, a) is not the most general type definition out there. Because we only multiply a with c and b with d, a and c can have different types from b and d. However, in this case it doesn't make much sense: vectors should be homogeneous. So, even though the compiler doesn't care, we do. So here we go:

Warning! Use the most general type definition that actually makes sense.
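To make the point about generality concrete, here is a small sketch. The name mulVctGeneral is made up for this illustration; its signature is the more general type the compiler would accept for the same body.

```haskell
-- Hypothetical name: the same body with the most general signature.
mulVctGeneral :: (Num a, Num b) => (a, b) -> (a, b) -> (a, b)
mulVctGeneral (a, b) (c, d) = (a * c, b * d)

-- The homogeneous version from the text.
mulVct :: Num a => (a, a) -> (a, a) -> (a, a)
mulVct (a, b) (c, d) = (a * c, b * d)

-- Accepted by the general version; mulVct would reject a pair
-- whose components have different types.
mixed :: (Int, Double)
mixed = mulVctGeneral (2, 0.5) (3, 4.0)
```

The general version happily multiplies an Int component and a Double component side by side, which is exactly the kind of "vector" we decided we do not want.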

Another thing: even though, at first, they might seem like a good idea, lists aren't suitable as vectors because they have variable length.

[4] Or at least we hope so.


4.1.3. Matching with Cons


It is time to discover the full power of the cons operator (:). We've seen how [1, 2, 3] is the same as 1:2:3:[] and 1:[2, 3]. All of them are patterns that can be matched.

    -- File: cons-patterns.hs
    -- Show a is needed for show; in newer GHC versions Num no longer implies it.
    match1 :: (Num a, Show a) => [a] -> String
    match1 [x, y, z] = "List of 3 numbers with sum " ++ show (x + y + z)
    match1 _         = "Nope."

    match2 :: (Num a, Show a) => [a] -> String
    match2 (x:y:z:[]) = "List of 3 numbers with sum " ++ show (x + y + z)
    match2 _          = "Nope."

    match3 :: (Num a, Show a) => [a] -> String
    match3 (x:[y, z]) = "List of 3 numbers with sum " ++ show (x + y + z)
    match3 _          = "Nope."

We will say this only once: patterns made of multiple bits must be surrounded by parentheses. (x:y:[]) is necessary, while ([x, y]) is not.

All three functions above do the exact same thing. Although this may be interesting, in our case, their main disadvantage is that they match only lists of length 3. It's not particularly useful, but what it illustrates is the equivalence of certain notations. Before continuing, we must note that pattern matching cannot be done with arbitrary functions. For example, trying it with ++ gives a parse error.

    -- File: cons-patterns-wrong.hs
    match4 :: (Num a, Show a) => [a] -> String
    match4 ([x,y] ++ [z]) = "List of 3 numbers with sum " ++ show (x + y + z)
    match4 _              = "Nope."

Although it certainly looks logical to us, the compiler doesn't think the same.

    ghci> :l cons-patterns-wrong.hs
    [1 of 1] Compiling Main             ( cons-patterns-wrong.hs, interpreted )

    cons-patterns-wrong.hs:3:9: Parse error in pattern: [x, y] ++ [z]
    Failed, modules loaded: none.
The reason it works with : and not with ++ is that : creates (constructs) the list from elements, while ++ is just a function that happens to operate on lists.

We've seen how to create patterns that exactly match the input (engGer "one"). We've also learned that we can use variables (intToString n). We know that we can combine the two (snd3 (_, y, _)). Now we want to be able to match lists of arbitrary length[5].

We can't bind all of the elements, individually, to variables because we don't know how many of them there are. What we can do is name the first element of the list, say, x and the rest of the elements xs.

    -- File: cons-patterns.hs (CONTINUED)
    describe :: (Show a) => [a] -> String
    describe (x:xs) = "A list with the first element " ++ show x ++ " and " ++ show (length xs) ++ " other elements."

[5] After all, if we can't do that, lists are basically useless.


This works because something like [1, 2, 3, 4, 5] is exactly the same as 1:[2, 3, 4, 5], so it fits the pattern x:xs: x is 1 and xs is [2, 3, 4, 5].

    ghci> describe [1..5]
    "A list with the first element 1 and 4 other elements."
    ghci> describe "hello, world"
    "A list with the first element 'h' and 11 other elements."
    ghci> describe []
    *** Exception: cons-patterns.hs:3:1-113: Non-exhaustive patterns in function describe

What seems to be the problem? If we look closely, [] doesn't actually fit the pattern x:xs. There is no first element of [], so x can't be matched to it. Thus the whole pattern fails (half wrong is all wrong). We can solve this right away.

    -- File: cons-patterns.hs (CONTINUED) (FIXED)
    describe :: (Show a) => [a] -> String
    describe []     = "An empty list."
    describe (x:xs) = "A list with the first element " ++ show x ++ " and " ++ show (length xs) ++ " other elements."

    ghci> describe []
    "An empty list."
Incidentally, the head function in Prelude is defined similarly. We can make our own!

    -- File: ourhead.hs
    head' :: [a] -> a
    head' (x:_) = x
    head' []    = undefined

This undefined is exactly what it says on the tin: the head' of an empty list doesn't make sense, or, in other words, it's undefined.

    ghci> head' [4, 4]
    4
    ghci> head' []
    *** Exception: Prelude.undefined

Just a quick reminder: if we want to have custom error messages, we can take a look at error, explained in B.2.3.

4.1.4. As patterns


Observe a simple function. Its disadvantage is that we write x:xs twice. The interpreter essentially splits the string into a head and a tail and then puts it back together again. It's inefficient.

    -- File: as-patterns.hs
    f :: String -> String  -- String is the same as [Char]
    f ""     = "This is an empty string."
    f (x:xs) = "The string " ++ x:xs ++ " has the first character " ++ [x]

Notice the difference (below) when using as patterns: by writing all@(x:xs) instead of simply (x:xs) we can reference the whole pattern by using the name all, without having to write x:xs again. This saves us from unnecessary keystrokes and the interpreter from unnecessary operations.


    -- File: as-patterns.hs (FIXED)
    f :: String -> String  -- String is the same as [Char]
    f ""         = "This is an empty string."
    f all@(x:xs) = "The string " ++ all ++ " has the first character " ++ [x]

Another example:

    -- File: as-patterns2.hs
    split3 :: [a] -> (a, a, [a])
    split3 (x:y:ys) = (x, y, x:y:ys)
    split3 _        = undefined

Last chance to learn error (B.2.3): we won't be using undefined any longer, except in quick and dirty examples.

    -- File: as-patterns2.hs (FIXED)
    split3 :: [a] -> (a, a, [a])
    split3 list@(x:y:ys) = (x, y, list)
    split3 _             = error "split3: list too short"

As[6] we've stated above, writing stuff like name@horriblyLongPattern will bind the entire pattern to name, so we won't have to repeat ourselves. In this case, list@(x:y:ys) spares us the need to write x:y:ys again. We just say list.
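One more as-pattern sketch, in case the idea needs another angle. The function dupHead is a made-up helper for this illustration: it needs both the first element and the whole list, which is exactly the situation where the @ notation pays off.

```haskell
-- Hypothetical helper: prepend a copy of the first element.
dupHead :: [a] -> [a]
dupHead []         = []
dupHead list@(x:_) = x : list   -- x is the head, list is the whole input
```

Without the as-pattern we would have to write (x:xs) on the left and rebuild x:xs on the right.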

4.1.5. Patterns in Comprehensions


Oh, just so we don't forget: we can use pattern matching in list comprehensions, too.

    ghci> let stuff = [(4, 5), (8, 3), (2, 2), (6, 1), (3, 2)]
    ghci> [ a * b | (a, b) <- stuff ]
    [20,24,4,6,6]
    ghci> [ a + b | (a, b) <- stuff, even a, odd b ]
    [9,11,7]
    ghci> [ [a, b] | (a, b) <- stuff ]
    [[4,5],[8,3],[2,2],[6,1],[3,2]]

This time, if a pattern fails, it will just move on to the next element.

    ghci> let newstuff = [[4,5,6], [7,8], [9,10,11]]
    ghci> [ a + b*c | [a,b,c] <- newstuff ]
    [34,119]
    ghci> [ 2*a | [a] <- newstuff ]
    []
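The "skip on failure" behaviour is handy for filtering by shape. As a small sketch (the name heads is made up for this illustration), the comprehension below keeps the first element of every non-empty inner list and silently drops the empty ones:

```haskell
-- Hypothetical helper: first elements of all non-empty inner lists.
heads :: [[a]] -> [a]
heads xss = [ x | (x:_) <- xss ]   -- [] does not match (x:_), so it is skipped
```

For example, heads [[1,2],[],[3]] evaluates to [1,3].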
If a pattern's type fails, however, the result is not as pretty.

    ghci> [ x + y | (x, y) <- [(1, 1, 1), (2, 2, 2)] ]

    <interactive>:1:11:
        Couldn't match expected type `(t0, t1, t2)'
                    with actual type `(t3, t4)'
        In the pattern: (x, y)
        In a stmt of a list comprehension:
          (x, y) <- [(1, 1, 1), (2, 2, 2)]
        In the expression: [x + y | (x, y) <- [(1, 1, 1), (2, 2, 2)]]

[6] Haha.

Warning! While failing patterns can be excused, using the wrong type always results in an error.

4.2. Other Constructs and Expressions


4.2.1. Guards
We were very vehement about the fact that pattern matching is not a glorified if-else. The following is:

    -- File: guards.hs
    numberSize :: (Ord a, Fractional a) => a -> String
    numberSize x
        | x < 0.1   = "Small"
        | x < 1     = "Small-ish"
        | x < 10    = "Okay"
        | x < 100   = "Large"
        | otherwise = "Huge!"
In the above example, we tried to estimate the size of a given number using adjectives like Small-ish and Huge!. This is not terribly mature, but shows what these things (which, by the way, are called guards) look like.

Guards are basically a replacement for if-else trees. They are separated by |[7] and usually neatly aligned on separate lines for readability. They consist of a boolean expression (such as x < 10), followed by =, and then the result ("Okay").

Just like patterns, guards are checked from top to bottom. The first boolean to be True has its result evaluated (and Haskell won't continue with the other patterns). The final guard, otherwise[8], is the same as writing True, but it looks more similar to written English, so it's preferred.

After this huge block of text, we should refresh our eyes by looking at some code. We've implemented our own versions of max, min, abs[9], and compare in a variety of styles.

    -- File: guards.hs (CONTINUED)
    max2 :: Ord a => a -> a -> a
    max2 x y
        | x <= y    = y
        | otherwise = x

    min2 :: Ord a => a -> a -> a
    min2 x y | x <= y = x | otherwise = y

    abs2 :: (Num a, Ord a) => a -> a
    abs2 x | x < 0     = -x
           | otherwise = x

    abs2' :: (Num a, Ord a) => a -> a
    abs2' x | x < 0 = -x
    abs2' x = x

    compare2 :: Ord a => a -> a -> Ordering
    x `compare2` y | x == y    = EQ
                   | x <= y    = LT
                   | otherwise = GT

    compare2' :: Ord a => a -> a -> Ordering
    compare2' x y      | x == y = EQ
     | x <= y = LT
                | otherwise = GT

[7] These things are called pipes. We've seen them in list comprehensions but here they do entirely different things.
[8] It's not mandatory but highly recommended. If Haskell reaches the end of the guards without meeting an otherwise, it checks the next pattern (as in pattern matching). If no corresponding patterns are found, an error is thrown.
[9] A little more restrictive than the official implementation (requires Ord).
All of the above are valid, but some are more readable than others. From top to bottom:

1. max2 has a pretty standard style: we've seen this one above, and it's very readable.
2. min2 is at the other end of the spectrum: putting guards in a single line is not a good idea.
3. abs2 puts the guards immediately to the right of the function and starts them on the same line. Also OK.
4. abs2' uses a combination of guards and pattern matching. It does the same thing as abs2, but uses a totally different layout. Not usually recommended, but in some cases it looks better than the alternatives.
5. compare2 is like abs2. What's different is that it's declared infix (surrounded by backquotes) to increase readability.
6. compare2': this is very bad. It works just fine, but it looks horrendous. We also notice that the guards must be indented at least one character[10] (for the record, the recommended amount is four).

At the end of the day, it's not a big deal which style we choose[11]. It's important to be as consistent as possible, but not if it means sacrificing readability.
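The fall-through rule (when no guard is True and there is no otherwise, Haskell moves on to the next pattern) can be sketched like this. The function classify is made up for this illustration and is not part of guards.hs.

```haskell
-- Hypothetical example of guards falling through to the next equation.
classify :: Int -> String
classify 0 = "zero"
classify n
    | n < 0  = "negative"
    | even n = "positive and even"
classify _ = "positive and odd"   -- reached when neither guard above is True
```

For instance, classify 7 fails both guards of the second equation and therefore lands in the last one. Relying on this is legal, but an explicit otherwise is usually easier to read.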

Let's try some more examples with guards. Say we want to make a drink calculator. It shows us how sober somebody is, given the blood alcohol concentration[12].

    -- File: drink-calc.hs
    drink :: (Ord a, Fractional a) => a -> String
    drink bac  -- Blood Alcohol Concentration
        | bac < 0.03 = "You're as sober as can be expected."
        | bac < 0.08 = "You can drive, but it's a bad idea."
        | bac < 0.10 = "Your reasoning is out the window."
        | otherwise  = "Stop drinking."

This is kinda lengthy, and not very useful, but we'll perfect it as we move along. For now, let's give it a try.

    ghci> drink 0.07
    "You can drive, but it's a bad idea."
    ghci> drink (4/30)
    "Stop drinking."
    ghci> import Data.Ratio  -- let's try rationals, too
    ghci> drink (1 % 5)
    "Stop drinking."
[10] If the | starts at the very beginning of the line, Haskell treats it as a new function definition.
[11] Except the everything-on-a-single-line method (min2) and the one randomly indented (compare2'): we run from them like the plague.
[12] We've found this information on the internet, so it's not the most precise calculator out there.

One does not simply know the blood alcohol concentration: it needs to be calculated. Fortunately, there is a simple formula, where N is the number of drinks.[13]

$$c = \begin{cases} 0.025\,N & \text{if you're male} \\ 0.035\,N & \text{if you're female} \end{cases}$$

[13] Warning! Excessive alcohol consumption can be hazardous to your health. Driving vehicles or operating heavy machinery should not be done under the influence of this dangerous chemical. Drink responsibly. Drive safely. This message brought to you by Haskellers Anonymous.

In Haskell speak, this is bac = n * if sex == "male" then 0.025 else 0.035. Apart from doing what we want it to do, this is yet another reminder that we can jam the if-else anywhere. It's better than saying bac = if sex == "male" then n*0.025 else n*0.035 because we're not repeating ourselves, not to mention that it's clearer. With our current knowledge of Haskell, there are two ways of doing it, neither particularly good.

    -- File: drink-calc.hs
    drink :: (Fractional a, Ord a) => String -> a -> String
    drink sex n  -- Blood Alcohol Concentration
        | (n * if sex == "male" then 0.025 else 0.035) < 0.03 = "You're as sober as can be expected."
        | (n * if sex == "male" then 0.025 else 0.035) < 0.08 = "You can drive, but it's a bad idea."
        | (n * if sex == "male" then 0.025 else 0.035) < 0.10 = "Your reasoning is out the window."
        | otherwise = "Stop drinking."

If we try it out, it works:

    ghci> drink "male" 4
    "Stop drinking."
    ghci> drink "female" 2
    "You can drive, but it's a bad idea."
    ghci> drink "male" 1
    "You're as sober as can be expected."
    ghci> drink "female" 8
    "Stop drinking."
The code is, however, yucky (and that's putting it mildly). The other solution is to use another function to calculate the bac.

    -- File: drink-calc.hs (FIXED)
    bac :: (Fractional a, Ord a) => String -> a -> a
    bac sex n = n * if sex == "male" then 0.025 else 0.035

    drink :: (Fractional a, Ord a) => String -> a -> String
    drink sex n  -- Blood Alcohol Concentration
        | bac sex n < 0.03 = "You're as sober as can be expected."
        | bac sex n < 0.08 = "You can drive, but it's a bad idea."
        | bac sex n < 0.10 = "Your reasoning is out the window."
        | otherwise        = "Stop drinking."

It still works and it's a tad shorter, but that's about it. We're still repeating ourselves and we've just introduced a function that we're not going to use anywhere else. With what we know so far, there's nothing we can do.

4.2.2. Where Bindings


This is where where bindings come into play. We're not going to improve bac right away; let's start with an example.

    -- File: gpa.hs
    gpa :: [Int] -> Int -> Int
    gpa grades final = func grades + final
        where func :: [Int] -> Int
              func xs = sum xs `div` length xs
It is time to take a moment and contemplate this function. Okay, moment's over. So what do we have here? Why, a GPA calculator, of course. This one seems to do something with the grades then add it to the final. If we only read the first line, we don't know what func does. Neither does the compiler.

The where keyword introduces a section that contains definitions. In our case, func is defined just like we learned. It's easy to see what it does. The type definition tells us that it takes a list of integers and returns only one, and the body indicates it averages those numbers[14]. So gpa adds the final to the average of the other grades. Pretty simple.

Another thing: inside where sections we can have the usual gimmicks: type declarations (which are usually omitted[15]), multiple function bodies, pattern matching etc. It's just like our typical function (or name) definition. We can even put a where inside a where!
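Here is a compact sketch of those gimmicks in one place. The function spread and its helper names are made up for this illustration; it assumes a non-empty list, because minimum and maximum fail on an empty one.

```haskell
-- Hypothetical example: describe how far apart the values of a
-- non-empty list are.
spread :: [Int] -> String
spread xs = label (hi - lo)
  where
    (lo, hi) = (minimum xs, maximum xs)   -- pattern matching on a pair
    label 0 = "all equal"                 -- several function bodies
    label d = "spread of " ++ show d ++ unit
      where unit = " units"               -- a where inside a where
```

Note that the inner where belongs only to the second equation of label, so unit is not visible anywhere else.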

In fact, pattern matching inside where sections is so useful and important, it's worth giving a specific example.

    -- File: stutter.hs
    stutter :: String -> String
    stutter word = [w] ++ "-" ++ [w] ++ "-" ++ word
        where (w:_) = word

It's [w], not w, because ++ takes strings, not characters. The keen reader would notice that we can also do things like where w = head word. No matter how we write it, we should be consistent with our choices.

    ghci> stutter "hello"
    "h-h-hello"


These are the basics of where bindings. Now it's time to improve our calculator (in three easy steps). This is the initial code:

    bac :: (Fractional a, Ord a) => String -> a -> a
    bac sex n = n * if sex == "male" then 0.025 else 0.035

    drink :: (Fractional a, Ord a) => String -> a -> String
    drink sex n
        | bac sex n < 0.03 = "You're as sober as can be expected."
        | bac sex n < 0.08 = "You can drive, but it's a bad idea."
        | bac sex n < 0.10 = "Your reasoning is out the window."
        | otherwise        = "Stop drinking."

Problems:

We're repeating ourselves. We have a function that we use nowhere else. The code is slightly confusing.

[14] We should have called it average or avg or something instead of func.
[15] Because functions inside where sections are usually short and simple. If one becomes too long, consider writing it separately.

The obvious thing to do is put bac in a where section (not to worry, the where is visible to all the guards).

    drink :: (Fractional a, Ord a) => String -> a -> String
    drink sex n
        | bac sex n < 0.03 = "You're as sober as can be expected."
        | bac sex n < 0.08 = "You can drive, but it's a bad idea."
        | bac sex n < 0.10 = "Your reasoning is out the window."
        | otherwise        = "Stop drinking."
        where bac :: (Fractional a, Ord a) => String -> a -> a
              bac sex n = n * if sex == "male" then 0.025 else 0.035
Problems left: We're repeating ourselves. The code is slightly confusing. (The helper function is no longer an issue: bac is now local to drink.)

Now we get rid of bac's type declaration: the function is simple enough. We also notice that sex n is redundant (drink already has the parameters sex and n, which can be used in the where section).

    drink :: (Fractional a, Ord a) => String -> a -> String
    drink sex n
        | bac < 0.03 = "You're as sober as can be expected."
        | bac < 0.08 = "You can drive, but it's a bad idea."
        | bac < 0.10 = "Your reasoning is out the window."
        | otherwise  = "Stop drinking."
        where bac = n * if sex == "male" then 0.025 else 0.035
Problems left: The code is slightly confusing. (We no longer repeat sex n in every guard.)

Finally, let's make the function easier to understand and modify by giving names to 0.03, 0.08 and 0.10. This way we can be sure we understand what they mean and also easily modify them (for instance, France has a 0.05 limit for driving).

    drink :: (Fractional a, Ord a) => String -> a -> String
    drink sex n
        | bac < soberLimit    = "You're as sober as can be expected."
        | bac < drivingLimit  = "You can drive, but it's a bad idea."
        | bac < thinkingLimit = "Your reasoning is out the window."
        | otherwise           = "Stop drinking."
        where bac = n * if sex == "male" then 0.025 else 0.035
              soberLimit    = 0.03
              drivingLimit  = 0.08
              thinkingLimit = 0.10
Problems left: none. The thresholds now have names, so the code is no longer confusing.

Now we're ready to move on. Oh, and one more thing. We must align things neatly following the where part, or the code might not compile or function correctly.

Warning! In where sections, not aligning the code can yield undesirable results.

However, placing the where on a separate line is allowed, like in the following example:

    -- File: cone.hs
    coneVolume :: Floating a => a -> a -> a
    coneVolume r h = baseArea * h / 3
        where
            baseArea = pi * r^2

4.2.3. Let Bindings


We'll recycle the above example for our purposes.

    -- File: cone-let.hs
    coneVolume :: Floating a => a -> a -> a
    coneVolume r h =
        let baseArea = pi * r^2
        in  baseArea * h / 3
It seems pretty intuitive. One might say let bindings are just like where bindings, only with the order reversed: let <bindings> in <expression>, as opposed to <expression> where <bindings>. There's much more to them, though. A mountain of examples follows (and not many words).

For a start, let is not unlike the if statement; we can jam it pretty much everywhere, both interactively...

    ghci> let a = 3 in 2 * a
    6
    ghci> 4 + 5 * (let x = 5 in 2 * x)
    54
    ghci> 2 + 3 * (let e = 2.718281828 in e * (e + 1))
    32.32201377330506
    ghci> "hello" ++ (let w = " world" in w ++ w ++ w)
    "hello world world world"
... and loaded from a file (just like where bindings, let bindings must be properly aligned).

    -- File: cone-area.hs
    coneArea :: Floating a => a -> a -> a
    coneArea r h =
        let baseArea = pi * r^2
            sideArea = let l = sqrt (r^2 + h^2) in pi * r * l
        in  baseArea + sideArea
We can perform many neat tricks using let, such as:

Binding several variables inline[16] using semicolons.

    ghci> let x = 4; y = 5; z = 6 in (x + y) * z
    54
    ghci> "Hello " ++ (let x = "world"; y = "wide " in y ++ x) ++ "!"
    "Hello wide world!"

[16] A fancy way of saying in (the middle of) a single line.

Using pattern matching.

    ghci> let (x, y) = (3, 2) in y * x
    6
    ghci> let x:y:_ = "asdf" in y:x:[]
    "sa"
    ghci> 4 + (let a:b:c:_ = [5,10..] in c - b + a)
    14

Putting them inside list comprehensions.

    ghci> [ x | x <- [1..10], let a = 8*x, a < 50]
    [1,2,3,4,5,6]
    ghci> [ x:xs | x <- ['a'..'c'], let xs = "ghj"]
    ["aghj","bghj","cghj"]

Nesting them.

    ghci> let x = 4 in let y = 5 in x + y
    9
    ghci> let a = 'h' in let as = "ello" in a:as
    "hello"
When defining several variables with let, we can use one in the definition of another.

    ghci> let x = 4; y = 2*x in x + y
    12
    ghci> let x = 5; y = 3 + x; z = x * y in x + y - z
    -27

We can also do it in any order.

    ghci> let y = 2*x; x = 4 in x + y
    12
    ghci> let y = 3 + x; z = x * y; x = 5 in x + y - z
    -27


It won't work, however, in separate lets or if we try to use a variable prior to its let binding.

    ghci> [ x | x <- [1..10], y < 2, let y = x - 5]

    <interactive>:1:21: Not in scope: `y'
    ghci> let y = 2 * x in (let x = 4 in y + x)

    <interactive>:2:13: Not in scope: `x'
Additionally, let bindings are not visible across guards. All these drawbacks are the result of a very simple thing: let bindings are very local; they are only visible where we define them (we talk more about local things in A.2.1). For instance:

    Prelude> let a = 3 in 2 * a
    6
    Prelude> a

    <interactive>:2:1: Not in scope: `a'
    ghci> (let b = 5 in 4 * b) + b

    <interactive>:3:24: Not in scope: `b'
    ghci> [ x | x <- [1..10], let c = 2*x, c < 5] ++ [c]

    <interactive>:4:45: Not in scope: `c'
There is only one exception to this rule: we can omit the in part when defining things interactively; this way, the names will be visible during the entire interactive session (but not the next).

    ghci> let a = 5; b = 6
    ghci> "hello world"
    "hello world"
    ghci> a + b
    11
    ghci> :q
    Leaving GHCi.
    ee@bt:~$ ghci
    GHCi, version 7.4.1: http://www.haskell.org/ghc/  :? for help
    Loading package base ... linking ... done.
    Prelude> a + b

    <interactive>:2:1: Not in scope: `a'

    <interactive>:2:5: Not in scope: `b'

It's time for a little discussion and recap. The let <bindings> in <expression> syntax allows let to be put anywhere, especially inside larger expressions. That's the most important difference between let and where.

Interestingly, let bindings are so local that it somehow limits their usefulness[17]. Coincidentally, this is also one of their great advantages. The above reasons, and more, bring us to our final point: let and where are not always interchangeable. where is better with guards; let, inside larger expressions.
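The division of labour between the two can be sketched as follows. Both functions are made up for this illustration (they are not part of the drink calculator): the first shares one where binding across all its guards, the second tucks a let into the middle of a larger expression.

```haskell
-- where: one binding, visible to every guard.
sizeWhere :: Double -> Double -> String
sizeWhere w h
    | area < 1   = "tiny"
    | area < 100 = "medium"
    | otherwise  = "large"
  where area = w * h

-- let: a binding that lives inside a single expression.
sizeLet :: Double -> Double -> String
sizeLet w h = "The area is " ++ (let area = w * h in show area) ++ "."
```

Trying to write sizeWhere with a single let would not work, because a let placed inside one guard's expression is invisible to the other guards.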

4.2.4. Bonus: Case Expressions


Just like [1, 2, 3] is syntactic sugar for 1:2:3:[], pattern matching (in function definitions) is just syntactic sugar for case expressions.

    -- File: case-expr.hs
    tail' :: [a] -> [a]
    tail' []     = error "tail': empty list"
    tail' (_:xs) = xs
We've just implemented our version of tail using pattern matching (in function definitions). Let's see how it looks with case expressions.

    -- File: case-expr.hs (FIXED)
    tail' :: [a] -> [a]
    tail' all = case all of []     -> error "tail': empty list"
                            (_:xs) -> xs

[17] The biggest problem is that they won't work with guards the way we want them to.

The syntax for case expressions is pretty much self-explanatory. A longer example, just to consolidate our knowledge:

    -- File: case-expr2.hs
    f :: Int -> String
    f n = case n of 1 -> "one"
                    2 -> "two"
                    _ -> "many"

Of course, those can be any patterns, not just numbers. If it's not 100% clear yet, this is the syntax:

    -- Syntax: case expressions (in function definitions)
    function argument = case argument of pattern1 -> result1
                                         pattern2 -> result2
                                         pattern3 -> result3
                                         pattern4 -> result4
                                         ...
We've been very careful to mention "in function definitions" repeatedly. That's because, technically, case expressions make use of pattern matching, so it's not really fair to compare the two.

Their main advantage is that case expressions work anywhere, just like let bindings. Basically, they enable pattern matching anywhere we desire. We can put them in the middle of an expression, for example.

    -- File: case-expr3.hs
    f :: (Show a) => [a] -> String
    f []    = "This list is empty. Sorry."
    f [x]   = "This list is a singleton, with the element: " ++ show x
    f (x:_) = "This list is longer. Its head is: " ++ show x

    -- File: case-expr3.hs (FIXED)
    f :: (Show a) => [a] -> String
    f xs = "This list is " ++ case xs of []    -> "empty. Sorry."
                                         [x]   -> "a singleton, with the element: " ++ show x
                                         (x:_) -> "longer. Its head is: " ++ show x

The reason we don't use case expressions all the time is much like the reason we don't abuse let bindings: they are ever-so-slightly less readable[18] than the alternatives. Syntactic sugar in general offers a clearer exposition at the expense of power. In fact, after this chapter on syntax, we've seen many alternative ways of solving a given problem. Which one to use is left to the reader's discretion.

[18] Some people may disagree.

5. Recursion
Primitive recursion is the goto of functional programming.

(anonymous)

5.1. Basic Implementation


5.1.1. Understanding Recursion
Recursion is perhaps one of the most powerful tools in all of Haskell[1]. According to Wikipedia, recursion is the process of repeating items in a self-similar way. In programming, recursion is a method of defining functions in which the function is applied within its own definition. Simply put, a recursive function is a function that calls itself.

To understand the principle, this chapter concerns itself only with explicit (also called primitive) recursion, the easiest and most basic form of recursion. Later (in [XREF]) we will see many cool functions that perform recursion for us.

The simplest example is the factorial[2]. We can write factorial n = product [1..n], but that's not the definition we're looking for. This is:

    -- File: factorial.hs
    factorial :: Integral a => a -> a
    factorial 0 = 1
    factorial n = n * factorial (n - 1)

    ghci> factorial 3
    6
    ghci> factorial 5
    120
It works, but why? Let's see what GHCi does if we try to call factorial 4.

1. factorial 4 is 4 * factorial 3.
2. factorial 3 is 3 * factorial 2, so factorial 4 is 4 * (3 * factorial 2).
3. factorial 2 is 2 * factorial 1, so factorial 4 is 4 * (3 * (2 * factorial 1)).
4. factorial 1 is 1 * factorial 0, so factorial 4 is 4 * (3 * (2 * (1 * factorial 0))).
5. factorial 0 is 1, so factorial 4 is 4 * (3 * (2 * (1 * 1))).
6. factorial 4 is 4 * (3 * (2 * 1)).
7. factorial 4 is 4 * (3 * 2).
8. factorial 4 is 4 * 6.
9. factorial 4 is 24.


[1] Author's note: it took all my willpower not to start with a recursion joke.
[2] The factorial of a (non-negative) integer n is the product of integers from 1 to n. The factorial of 0 is, by convention, 1.

10. Done!

At this point, it's useful to make our line-by-line analysis. Here's the function again, without that pesky first line comment:

    factorial :: Integral a => a -> a
    factorial 0 = 1
    factorial n = n * factorial (n - 1)


1. The type definition is important; the factorial doesn't make sense over non-integers. Actually, it doesn't work on negative numbers either (which we'll discuss in 5.3.1).
2. Without this line, the function would never finish. factorial 1 would be 1 * (0 * (-1 * (-2 .... This is called the base case or edge condition. We'll discuss it in a moment.
3. This one puts an operation on hold (namely multiplication), then brings the evaluation closer to the base case. Eventually it will reach it, the pending operations will be performed, and the computation will end, as seen in the elaboration above.

Sounds complicated? Because it is. The above operations aren't meant to be our concern. The compiler can do them without our help. We should understand recursion intuitively, and to do that, we must think simpler. Here's a little something to break the wall of text, and then we'll move on.

[ASCII-art banner]
The bottom line is, a recursive function has two main elements:

1. The base case: the simplest one, where we already know the answer. The base case is where the calculation ends. Some examples:
   a) The factorial of 0 is 1. We know this because it's convention. Can it get any simpler? Not really.
   b) The length of an empty list is 0. We know that because it's obvious.
   c) The maximum of a single number is that number.
2. All other cases: here we must bring evaluation closer to the base case. We must simplify. Why? Because the base case is the only way our calculation can finish. We must reach it. To reach it, we must get closer. Some examples:
   a) The factorial of n is n times the factorial of n-1; n-1 is closer to 0, so we're on the right track.
   b) The length of a list is one plus the length of the list without the first element; if we repeat this enough times, we'll reach the empty list, as planned.
   c) The maximum of a list is the first element or the maximum of the list without the first element, whichever is larger.

In all three situations, the regular cases bring us closer to the edge condition (base case), thus guaranteeing that the computer will, in fact, finish calculating and provide a result.

[3] Actually it does: it's called the Gamma function.


5.1.2. Practical Examples


It is time to put the above into code. The factorial has already been done. Let's try the length[4] one.

    -- File: length.hs
    length' :: [a] -> Int
    length' [] = 0
    -- what are we supposed to do now?

Obviously, the list with the first element is one longer than the list without it. We should somehow write this down, but to do it, we must separate the list into its first element and the rest. Do we know something that does that? Yes, it's the x:xs pattern. We've already covered some of its uses, but here is a quick refresher:

    -- File: xxs.hs
    super :: String -> String
    super (x:xs) = "First letter: " ++ [x] ++ "; the rest: " ++ xs

    ghci> super "Greetings!"
    "First letter: G; the rest: reetings!"

Now we can state the obvious, clearly and concisely.

    length' (x:xs) = 1 + length' xs


And that's it! If we put it inside our original code, it works like a charm.

    -- File: length.hs (FIXED)
    length' :: Num a => [b] -> a
    length' [] = 0
    length' (x:xs) = 1 + length' xs

    ghci> length' [1,2,3,4]
    4
    ghci> length' "haskell"
    7

We might even notice that we're not using x (from the x:xs), so we can write length' (_:xs).

To determine the maximum of a list, we have to, once again, separate the list into a head and a tail. This time we get to see the completed code directly.

    -- File: maximum.hs
    maximum' :: Ord a => [a] -> a
    maximum' [] = error "maximum' of empty list"
    maximum' [x] = x
    maximum' (x:xs) = max x (maximum' xs)

In a dramatic twist of events, this function has two edge conditions. The first will be reached if (and only if) we supply []; the maximum of an empty list doesn't make sense. The other one is the normal base case we all know and love: the maximum of a single element is itself. The third pattern compares the head with the maximum of the tail to determine which one is bigger. Notice how max operates on 2 elements while maximum' works on an entire list.

    ghci> maximum' [1,3,4,2,5,2]
    5
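Calling error on the empty list works, but the failure only shows up at run time. As a sketch of one alternative, assuming we are willing to change the return type, the function below wraps its answer in Maybe so that the empty case becomes an ordinary value. The name maximumMay is ours, made up for this example.

```haskell
-- Hypothetical total variant of maximum': Nothing for the empty list,
-- Just the largest element otherwise.
maximumMay :: Ord a => [a] -> Maybe a
maximumMay []     = Nothing
maximumMay [x]    = Just x
-- xs is non-empty here, so the recursive call always returns a Just.
maximumMay (x:xs) = fmap (max x) (maximumMay xs)
```

With this version, maximumMay [1,3,4,2,5,2] is Just 5, and the caller decides what an empty list should mean.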


[4] We're using length' because length already exists, and we must have a different name.


5.1.3. More Parameters


A recursive function can take any number of parameters. Knowing that, we'll try to implement our own replicate. replicate repeats an element a specified number of times (so 2 parameters).

    ghci> :t replicate
    replicate :: Int -> a -> [a]
    ghci> replicate 5 2
    [2,2,2,2,2]
    ghci> replicate 6 'a'
    "aaaaaa"

It's easier if we try to implement it for a certain element, say 'A'. Our edge condition is trying to repeat it 0 times. We'll call the function screamer[5].

    -- File: screamer.hs
    -- replicate when applied to the letter 'A'
    screamer :: Int -> String
    screamer 0 = [] -- it's the same as ""
    screamer n = 'A' : screamer (n-1)

Obviously, replicate works with any element; if we pass it as an extra parameter it should work.

    -- File: replicate.hs
    replicate' :: Int -> a -> [a]
    replicate' 0 _ = []
    replicate' n x = x : replicate' (n-1) x

    ghci> replicate' 3 'b'
    "bbb"
    ghci> replicate' 2 "Hi"
    ["Hi","Hi"]
This time, one of the parameters (namely, the second one) always remained unchanged. But it is not always so. We can manipulate several parameters when writing a recursive function. This very dumb implementation of compare, which only works on non-negative integers, is a... good(ish) example.

    -- File: dumb-compare.hs
    cmp :: Integer -> Integer -> Ordering
    cmp 0 0 = EQ
    cmp 0 _ = LT
    cmp _ 0 = GT
    cmp x y = cmp (x-1) (y-1)

This example also illustrates a good rule of thumb[6]: the number of base cases is usually equal to the number of possible outcomes. In this case, it's three: EQ, LT and GT.

Anyway, the principle of this function is very simple. It decrements both parameters, until one reaches zero. The other is larger. Is there an even more inefficient version of compare? I have no idea.

take takes taking elements from a list to a whole new level. Example, then code.

    ghci> take 3 [1, 2, 3, 4]
    [1,2,3]
    ghci> take 5 [1, 2, 3, 4]
    [1,2,3,4]


[5] replicateA might sound tempting, but it's already taken (see [XREF]).
[6] Not to be followed blindly.


    -- File: take.hs
    take' 0 _ = []
    take' _ [] = []
    take' n (x:xs) = x : take' (n-1) xs

Notice how the two outcomes become base cases. We either take 0 elements from a list, or try to take elements from an empty list. In both cases, the result is [].

The general case is very simple, too. Taking n elements from a list is basically taking the first element, then n-1 elements from the rest of the list.

Next up, zip. This function takes two lists and combines them together into a list of pairs. It stops when one of the lists is empty, so zip "abc" [1, 2] is [('a',1),('b',2)].

The two edge conditions correspond to empty lists (the first and the second, respectively). The general case separates both lists in a head and a tail.

    -- File: zip.hs
    zip' :: [a] -> [b] -> [(a, b)]
    zip' [] _ = [] -- First list empty
    zip' _ [] = [] -- Second list empty
    zip' (x:xs) (y:ys) = (x, y) : zip' xs ys
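Because zip' stops as soon as either list runs out, one of the two lists may even be infinite. As a small sketch (the helper name numbered is ours, purely for illustration), pairing a list with [1..] numbers its elements:

```haskell
zip' :: [a] -> [b] -> [(a, b)]
zip' [] _          = []   -- First list empty
zip' _ []          = []   -- Second list empty
zip' (x:xs) (y:ys) = (x, y) : zip' xs ys

-- Hypothetical helper: number the elements of a list, starting from 1.
-- The infinite list [1..] is cut off when xs ends.
numbered :: [a] -> [(Int, a)]
numbered xs = zip' [1..] xs
```

For instance, numbered "abc" is [(1,'a'),(2,'b'),(3,'c')].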

5.2. Variations
5.2.1. Using Guards
If we're not careful, we might as well end up with a function that runs indefinitely, or worse[7]. This usually happens if the edge condition is poorly written, or if the general case does not lead to the edge condition. Half the functions we've written so far have some sort of problem. That's not very encouraging.

Our version of replicate (also, screamer) weirds out when we give it a negative number of repetitions. The predefined function works fine.

    ghci> replicate' (-2) 5
    [5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,^CInterrupted.
    ghci> replicate (-2) 5
    []
Warning! Make sure your function behaves correctly even on unexpected input.
The problem? Our edge condition should also check for negative numbers. The easy way to do it is to use a guard.

    -- File: replicate.hs (FIXED)
    replicate' :: Int -> a -> [a]
    replicate' n _ | n <= 0 = []
    replicate' n x = x : replicate' (n-1) x

This is one of the few acceptable uses of inline guards. Notice the absence of an otherwise clause. This is because if evaluation reaches the end of the guards, it will fall down to the next pattern (which, in our case, catches everything). In this instance, we can also use otherwise and a single function body.

[7] What's worse than an infinitely running program? A wrong result.


    -- File: replicate2.hs
    replicate' :: Int -> a -> [a]
    replicate' n x
        | n <= 0 = []
        | otherwise = x : replicate' (n-1) x
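The same guard trick applies to other counters. Our take' from 5.1.3, for instance, never matches its 0 pattern when n is negative, so it quietly returns the whole list where the built-in take returns []. A possible fix, sketched under the assumption that we want to mirror the built-in behaviour:

```haskell
-- take' with a guard in place of the literal 0 pattern, so that zero and
-- negative counts are both treated as "take nothing".
take' :: Int -> [a] -> [a]
take' n _ | n <= 0 = []
take' _ []         = []
take' n (x:xs)     = x : take' (n-1) xs
```

Now take' (-1) [1,2,3] is [], just like take (-1) [1,2,3].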
Sometimes the function does something unimaginable. Our stupid cmp is flat-out wrong on negative numbers. The relevant parts, then illustration:

    cmp 0 0 = EQ
    cmp 0 _ = LT
    cmp _ 0 = GT
    cmp x y = cmp (x-1) (y-1)

    ghci> cmp 2 3
    LT
    ghci> cmp (-2) 3
    GT
    ghci> cmp (-2) (-3)
    ^CInterrupted.

Why does this happen? The program assumes that the first number to reach 0 is smaller. But if we decrease an already negative number, it will never become 0. So the other one will be 0 first, and will be declared the smallest. If both are negative, then the function will continue to run, and run, and run (until we run out of memory)[8]. Here is the corrected function:

    -- File: dumb-compare.hs (FIXED)
    cmp :: Ord a => a -> a -> Ordering
    cmp x y
        | x == y = EQ
        | x <= y = LT
        | otherwise = GT
The dumb implementation is doomed. There is no way we can get something usable out of it, so we should just trash it.

5.2.2. Multiple Regular Cases


Some recursive functions have different behavior for different types of input, say, even and odd numbers. This means that we have several separate cases. This can be easily achieved by using pattern matching or guards. The classic example is the Collatz sequence. Take a positive integer. If it's even, divide it by two. If it's odd, multiply it by three and add one. It is thought (but not proven) that after a finite number of steps, all numbers will eventually reach 1. By virtue of this fact, we know our edge condition. The two regular cases are for even and odd, respectively.

[8] Experienced programmers out there: Integer is unbounded, so it will never wrap around.


    -- File: collatz.hs
    collatz :: Integral a => a -> [a]
    collatz 1 = [1]
    collatz n
        | even n = n : collatz (n `div` 2)
        | otherwise = n : collatz (3*n + 1)

This function is especially dangerous because we don't actually know if it will finish. Still, let's take it for a spin.

    ghci> collatz 5
    [5,16,8,4,2,1]
    ghci> collatz 20
    [20,10,5,16,8,4,2,1]

Of course, we can simply check the lengths. Some inputs are especially pesky[9].

    ghci> length (collatz 27)
    112
    ghci> length (collatz 6171)
    262
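If only the lengths matter, we can wrap the check in a couple of small helpers. This is just a sketch: collatz is repeated so the block stands on its own, while collatzSteps and peskiest are names made up for this example.

```haskell
collatz :: Integral a => a -> [a]
collatz 1 = [1]
collatz n
    | even n    = n : collatz (n `div` 2)
    | otherwise = n : collatz (3*n + 1)

-- Number of steps needed to reach 1 (one less than the list length).
collatzSteps :: Integer -> Int
collatzSteps n = length (collatz n) - 1

-- The starting value in [1..limit] that takes the most steps.
-- Assumes limit >= 1; an empty range would make maximum fail.
peskiest :: Integer -> Integer
peskiest limit = snd (maximum [ (collatzSteps n, n) | n <- [1..limit] ])
```

Like collatz itself, both helpers only make sense for positive starting values.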

5.2.3. Innite Recursion


It's easier than it looks. Haskell already supports infinite lists, so it should be a breeze to write versions of the following two functions:

- repeat repeats an element an infinite number of times
- cycle repeats an entire list

The easy way to do it is to simply omit the edge condition, like this:

    -- File: inf-recursion.hs
    repeat' :: a -> [a]
    repeat' x = x : repeat' x

    cycle' :: [a] -> [a]
    cycle' xs = xs ++ cycle' xs

Without a base case, the function is all but guaranteed to run indefinitely. That is, unless we take a finite number of elements (because of laziness).

    ghci> take 5 (repeat' 0)
    [0,0,0,0,0]
    ghci> take 10 (cycle' [1, 2, 3])
    [1,2,3,1,2,3,1,2,3,1]
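The same idea works when each element depends on the one before it. A small sketch (countFrom is a made-up name; the built-in [n..] does the same job for integers):

```haskell
-- An infinite list of consecutive integers: no base case, so the
-- recursion only stops when the caller stops asking for elements.
countFrom :: Integer -> [Integer]
countFrom n = n : countFrom (n + 1)
```

As with repeat' and cycle', we have to take a finite prefix to see anything: take 5 (countFrom 10) is [10,11,12,13,14].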

5.3. Further Expansion


5.3.1. Using Natural Numbers [FIXME-move to adv. types]
Every time we used some sort of counter which we decreased until it reached zero, we used some sort of integer. Recall the factorial function:

[9] The On-Line Encyclopedia of Integer Sequences has collected a list specially for the purpose: A006877.


    factorial :: Integral a => a -> a
    factorial 0 = 1
    factorial n = n * factorial (n - 1)

But, as we mentioned, the factorial doesn't make much sense over negative numbers. In 5.2.1 we even pointed out that such functions might run indefinitely on negatives. In that spirit, the solution is:

    factorial :: Integral a => a -> a
    factorial n | n < 0 = error "factorial over negative numbers"
    factorial 0 = 1
    factorial n = n * factorial (n - 1)

That's more of a workaround than a fix, however. Someone casually looking at the type definition might imagine that the function works over all integers. This is obviously not the case. The right way to do it is to use the appropriate type for the function; something like Nat representing natural numbers would be welcome. This is a hypothetical example; no such type exists in the standard libraries. [FIXME]

    factorial :: Nat -> Nat
    factorial 0 = 1
    factorial n = n * factorial (n - 1)
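Newer versions of the base library do ship such a type: Natural, in the module Numeric.Natural. The sketch below assumes GHC 7.10 or later; the type was not available in the GHC 7.4 shown elsewhere in this book.

```haskell
import Numeric.Natural (Natural)

-- The type now says what we mean: natural numbers in, natural numbers out.
factorial :: Natural -> Natural
factorial 0 = 1
factorial n = n * factorial (n - 1)
```

With this signature, factorial 5 is 120 as before, and a negative argument is rejected with an arithmetic underflow exception instead of sending the function into an endless loop.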

5.3.2. Application: Quicksort


We have tried to postpone this moment as long as possible. It's time for the most overused piece of Haskell code in history: quicksort.

What it does: it sorts a list (duh). How it does it: a sorted list is[10] the list with

- the elements less than or equal to the head, sorted, followed by
- the head of the list, followed by
- the elements greater than the head, sorted.

What's interesting for us is that we must call quicksort twice in its definition (once for the smaller elements and once for the larger ones).

So, without further ado:

    -- File: quicksort.hs
    quicksort :: Ord a => [a] -> [a]
    quicksort [] = []
    quicksort (x:xs) = lesserSorted ++ [x] ++ greaterSorted
        where lesserSorted = quicksort [y | y <- xs, y <= x]
              greaterSorted = quicksort [y | y <- xs, y > x]

    ghci> quicksort [4,1,5,3,8,7]
    [1,3,4,5,7,8]
    ghci> quicksort "the five boxing wizards jump quickly"
    "     abcdeefghiiiijklmnopqrstuuvwxyz"
[10] We can't say it does this, then it does that, because it defeats the purpose of functional programming, which emphasizes how things are defined, rather than how they are done.


This implementation of quicksort is surprisingly easy to understand. The function will take the head of the list (4, in the first example above), and then put it between [1,3] and [5,8,7] (after they've been sorted).

Such an algorithm is called divide and conquer because it literally[11] breaks the input into two easier-to-manage halves, each of them broken down even more, until we reach empty lists, which are already sorted. The pieces are then put back together in the correct order.

Unfortunately, if we perform the detailed breakdown on this function, we clearly see that the algorithm performs many useless operations (concatenating all those empty lists), so it might not be terribly efficient. [FIXME-double check]

    -- Evaluation steps
    quicksort [4,1,5,3,8,7] = quicksort [1,3] ++ [4] ++ quicksort [5,8,7]
      quicksort [1,3] = quicksort [] ++ [1] ++ quicksort [3]
        quicksort [] = []
        quicksort [3] = quicksort [] ++ [3] ++ quicksort []
          quicksort [] = []
          quicksort [] = []
      quicksort [5,8,7] = quicksort [] ++ [5] ++ quicksort [8,7]
        quicksort [] = []
        quicksort [8,7] = quicksort [7] ++ [8] ++ quicksort []
          quicksort [7] = quicksort [] ++ [7] ++ quicksort []
            quicksort [] = []
            quicksort [] = []
          quicksort [] = []
    [] ++ [1] ++ [] ++ [3] ++ [] ++ [4] ++ [] ++ [5] ++ [] ++ [7] ++ [] ++ [8] ++ []
    [1,3,4,5,7,8]

Indeed, running quicksort on [100000,99999..1] takes quite some time and maxes out the memory. From now on, we'll just import Data.List, which conveniently contains an efficient sorting function, sort[12]. For more Data.List goodies, see C.1.

    ghci> import Data.List
    ghci> sort [3,5,8,2,1]
    [1,2,3,5,8]
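One modest refinement, offered as a sketch rather than a fix: the two list comprehensions each walk xs, while partition from Data.List splits the list in a single pass. The name quicksort' is ours. It still picks the head as the pivot, so an already sorted (or reversed) input remains the slow case.

```haskell
import Data.List (partition)

-- Same definition as before, but the tail is split only once:
-- partition returns the elements that satisfy the test and those that don't.
quicksort' :: Ord a => [a] -> [a]
quicksort' []     = []
quicksort' (x:xs) = quicksort' lesser ++ [x] ++ quicksort' greater
    where (lesser, greater) = partition (<= x) xs
```

For everyday use, sort from Data.List remains the sensible choice.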

5.3.3. Discussion
All of the functions that we have implemented in this chapter have some common ground. For instance:

- Separating a list into a head and a tail until we reach [].
- Having some number and then decreasing it until it becomes 0.
- Breaking down a list into several smaller parts.

By far the most widely used data structure in this chapter was the list. Somehow lists lend themselves to being recursed upon simply because of the convenient x:xs pattern which, on one hand, extracts an element which can be used and, on the other hand, leaves the rest of the list available for further operations.

One of the main development directions in Haskell is abstraction. Sadly, in this book, this path has been so far left unexplored (because we were busy understanding syntax). Specifically, the primitive (explicit) recursion we have performed so far in this chapter allows us to consider only particular cases. For instance, this is an implementation of sum:

[11] Figuratively.
[12] Based on mergesort.


    sum [] = 0
    sum (x:xs) = x + sum xs

And this is an implementation of product:

    product [] = 1
    product (x:xs) = x * product xs

The function and operates on booleans, and tells us if all of them are True. Here it is:

    and [] = True
    and (x:xs) = x && and xs

Likewise, the function or:

    or [] = False
    or (x:xs) = x || or xs

A pattern emerges. All these concrete examples have the same basic structure, but we do not yet know how to take advantage of it. There must be a function that covers all these use cases. There is.[13]
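To make the shared shape concrete: each of the four functions has a value for the empty list and an operator that combines the head with the result for the tail. The sketch below factors those two pieces out. The name collapse is made up here; the Prelude offers the same behaviour on lists under the name foldr. The primed names avoid clashing with the built-in versions.

```haskell
-- Hypothetical name; on lists it behaves like the Prelude's foldr.
-- z is the answer for the empty list, f combines the head with the
-- result of collapsing the tail.
collapse :: (a -> b -> b) -> b -> [a] -> b
collapse _ z []     = z
collapse f z (x:xs) = f x (collapse f z xs)

sum' :: Num a => [a] -> a
sum' = collapse (+) 0

product' :: Num a => [a] -> a
product' = collapse (*) 1

and' :: [Bool] -> Bool
and' = collapse (&&) True

or' :: [Bool] -> Bool
or' = collapse (||) False
```

Only the operator and the starting value differ between the four definitions; the recursion itself is written once.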

[13] We've barely scratched the surface.

6. Advanced Functions
I've come to see the power of Haskell at last. You have to treat functions like crap.

(nikki93)

6.1. Currying and Partial Application


6.1.1. Fundamentals
Every function in Haskell takes exactly one parameter. Multiple-parameter functions exist because of what is officially called currying; it's very clever. Let's refer to our first Problem Z example (way back, in 1.2.3).

    compare 2 3    -- works
    compare (2 3)  -- doesn't work
    (compare 2) 3  -- works!!

We've learned why the first one works and the second doesn't: spaces are used for function application and parentheses for grouping, not the other way around. To see why the third one works, we must understand what compare 2 3 does. It first takes the parameter 2 and returns a function that takes a parameter and compares 2 with it. Read that again. That function is then applied to 3 and it finally returns LT.

If we take a look at compare's type, it's compare :: Ord a => a -> a -> Ordering. Up until now, we've said that it takes two parameters. But now we realize that a -> a -> Ordering is the same as a -> (a -> Ordering). So the function, in fact, takes only one parameter (an a) and returns an a -> Ordering, which is a function (that takes an a and returns an Ordering).

Let's discuss a clearer example.

    -- File: currying.hs
    addFour :: Int -> Int -> Int -> Int -> Int
    -- we can also write Int -> (Int -> (Int -> (Int -> Int)))
    addFour x y z t = x + y + z + t

Now if we add parameters one at a time:

    ghci> :t addFour
    addFour :: Int -> Int -> Int -> Int -> Int
    ghci> :t addFour 1
    addFour 1 :: Int -> Int -> Int -> Int
    ghci> :t addFour 1 2
    addFour 1 2 :: Int -> Int -> Int
    ghci> :t addFour 1 2 3
    addFour 1 2 3 :: Int -> Int
    ghci> :t addFour 1 2 3 4
    addFour 1 2 3 4 :: Int
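The same chain can be written down as ordinary definitions, one per step. This is only a sketch; the names step1, step2, step3 and result are invented for the example.

```haskell
addFour :: Int -> Int -> Int -> Int -> Int
addFour x y z t = x + y + z + t

-- Each step supplies one more argument and names the function that is left.
step1 :: Int -> Int -> Int -> Int
step1 = addFour 1

step2 :: Int -> Int -> Int
step2 = step1 2

step3 :: Int -> Int
step3 = step2 3

-- Supplying the last argument finally yields a plain Int.
result :: Int
result = step3 4   -- same as addFour 1 2 3 4
```

Each signature matches what :t reported in the session above.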


Every time we add another parameter, the type gets eaten up from the left. That is because if we call a function with too few parameters, we'll get a function that takes the rest of them. This is called partial application. In other words, if f takes n parameters[1]: a1, a2, a3, ..., an, then:

- f a1 takes n-1 parameters: a2, a3, a4, ..., an
- f a1 a2 takes n-2 parameters: a3, a4, ..., an
- etc.

This is also the chief reason why everything is separated by -> in type declarations. If we clearly distinguished the parameters from the return type, we couldn't have partially applied functions and thus, indirectly, we wouldn't be able to do other neat things, like name them.

    ghci> let compare2With = compare 2
    ghci> compare2With 5
    LT
    ghci> compare2With 1
    GT
    ghci> compare2With 2
    EQ

Do we know some other way of defining compare2With? Of course, compare2With x = compare 2 x. We've done things this way many times before. I know we're repeating ourselves, but let's see them again.

    compare2With x = compare 2 x -- the way we've done things
    compare2With = compare 2     -- equivalent to the above

Notice how x was the last thing on both sides of the first equation. Therefore, x is superfluous (it can safely be removed). Watch out, though, because in something like compare2With x = compare x 2 (x on the left), x can't be eliminated without changing the meaning.

Warning! Partial application only occurs from left to right (beginning with the rst parameter).
So there you have it. Currying is often confused with partial application, but they are really quite different:

- Currying is what makes a function take only one parameter and return a function that takes another parameter and so on. We'll discuss it a little later, in [XREF].
- Partial application is the act of supplying a function with too few arguments.

Currying and partial application are two of the most important concepts in all of Haskell, so it's a good idea to be familiar with them.
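The Prelude functions curry and uncurry make the distinction tangible. In the sketch below (all other names are ours), addPair takes its two numbers as a single pair, so there is nothing to partially apply until curry turns it into the one-parameter-at-a-time form.

```haskell
-- A function on a pair: both numbers arrive together.
addPair :: (Int, Int) -> Int
addPair (x, y) = x + y

-- curry converts it to the usual curried form ...
addCurried :: Int -> Int -> Int
addCurried = curry addPair

-- ... which can then be partially applied.
addFive :: Int -> Int
addFive = addCurried 5

-- uncurry goes the other way: compare now expects a pair.
comparePair :: (Int, Int) -> Ordering
comparePair = uncurry compare
```

Here addFive 3 is 8 and comparePair (2, 3) is LT.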

6.1.2. Problem Z
We've put all the cool things that happen because of currying and partial application under the umbrella term Problem Z. Now it's time to revisit them.

In 2.1.3 we said that a constant really is a zero-parameter function. It makes sense if we think about it: there are no parameters for us to change so the result will always be the same. Do we know what else takes zero parameters? A fully-applied function. Take compare 2 3 for instance.

    ghci> :t compare 2 3
    compare 2 3 :: Ordering
    ghci> :t LT
    LT :: Ordering
    ghci> LT == compare 2 3
    True
[1] We're going to say that a function takes n parameters for simplicity, even though we know what's actually going on.


Moving on, when we discussed infix functions (in 2.1.4) we illustrated how infix functions can be called prefix.

    ghci> 2 + 3
    5
    ghci> (+) 2 3
    5

This enables us to partially apply them[2].

    ghci> :t (+) 2
    (+) 2 :: Num a => a -> a

However, there is a simpler, more intuitive way, by using sections. Simply put, we omit one of the sides:

    ghci> :t (2/)
    (2/) :: Fractional a => a -> a
    ghci> :t (/2)
    (/2) :: Fractional a => a -> a

We still have to put them in parentheses because otherwise the compiler will treat them as incomplete expressions. Sections have another advantage. Notice the difference between the following two:

    ghci> (2/) 3
    0.6666666666666666
    ghci> (/2) 3
    1.5

In the second example, we've partially applied the second parameter. Neat, huh?
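One caveat deserves a quick sketch: subtraction is the exception. (-2) is read as the number negative two, not as a section, so the Prelude provides subtract for that case. The names twoMinus and minusTwo are ours.

```haskell
-- (2-) is an ordinary left section: it subtracts its argument from 2.
twoMinus :: Int -> Int
twoMinus = (2-)

-- (-2) would be the literal negative two, so the "right section" of
-- subtraction is spelled with subtract instead.
minusTwo :: Int -> Int
minusTwo = subtract 2
```

So twoMinus 5 is -3, while minusTwo 5 is 3.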

Speaking of sections, we might be tempted to do something like (3,) 2, but the compiler will scream at us.

    ghci> (3,) 2

    <interactive>:1:1: Illegal tuple section: use -XTupleSections

What GHCi means by this is that it recognizes what we're trying to do, but won't allow it. It also mentions that if we open GHCi with the option -XTupleSections, it will work just fine.

    ee@bt:~$ ghci -XTupleSections
    GHCi, version 7.4.1: http://www.haskell.org/ghc/  :? for help
    Loading package base ... linking ... done.
    Prelude> :set prompt "ghci> "
    ghci> (3,) 2
    (3,2)

But why bother when we can just use (,) instead?

    ghci> :t (,) 3
    (,) 3 :: Num a => b -> (a, b)
    ghci> (,) 3 2
    (3,2)

6.1.3. When It's Not


[FIXME-need to have it in appendices and xref to it, possibly earlier]

[2] This is not the main advantage, however. Details in [XREF].


6.2. Higher Order Functions


6.2.1. Passing Functions as Parameters
One very nice thing about functions, and one of the coolest and most powerful things in all of Haskell, is that functions can take functions as parameters. The simplest example (we've intentionally given the following a name that's not revealing) is this:

    f2 :: (a -> a) -> a -> a
    f2 f x = f (f x)

What's with the parentheses in the type declaration? They indicate that the whole (a -> a) thing is a single parameter: a function that takes something of a type and returns something of the same type. We need them because the -> is right-associative; otherwise it would treat the first a and the second a as separate, single parameters[3].

On to the body of the function: f2 takes a function, f, and a value, x. What it does is apply f to x (the f x part), then apply f again to the result. Essentially, f2 applies a function twice. In Mathematics class we would have written something like $f^2(x) = f(f(x))$. Notice the similarity.

    ghci> succ 3 -- successor
    4
    ghci> succ 4
    5
    ghci> succ (succ 3)
    5
    ghci> f2 succ 3
    5
    ghci> f2 pred 3 -- predecessor
    1
    ghci> f2 sqrt 16
    2.0
    ghci> f2 tail "abcd"
    "cd"
    ghci> f2 head "abcd" -- whoops, we need the function to return the same type

    <interactive>:22:4:
        Couldn't match type `Char' with `[Char]'
        Expected type: [Char] -> [Char]
          Actual type: [Char] -> Char
        In the first argument of `f2', namely `head'
        In the expression: f2 head "abcd"

We now understand better how f2 works and we know why the function we pass has to have the type (a -> a). If our function takes an Int and returns a Bool, there's no way we can call it again on the resulting Bool; it's the wrong type. While we can call head twice on something like [[2,3],[4,5]] (it returns 2), using f2 will give an error. Moreover, there's no easy way to modify it so it can work. We'll discuss this in [XREF], as well as provide an adequate solution.

Very few functions take a single parameter and return something of the same type. We can, however, partially apply functions to the point of accepting only one parameter, and then pass them to f2. It's obvious how useful partial application becomes in this case.

[3] We know that functions really only take a single parameter at a time. But it would save us some time and effort to think of them as taking several parameters.


    ghci> f2 (+2) 9
    13
    ghci> f2 (*5) 4
    100
    ghci> f2 (^2) 3
    81
    ghci> f2 ("a" ++) "b"
    "aab"
    ghci> f2 (++ "a") "b"
    "baa"
    ghci> f2 ('a':) "b"
    "aab"

So let's recap what's going on here, because it's important. f2 looks like this:

    f2 :: (a -> a) -> a -> a
    f2 f x = f (f x)

Basically it applies the function f (of type a -> a) to x (a value of type a) twice. We can create a function to apply it three times, or even four:

    f3 :: (a -> a) -> a -> a
    f3 f x = f (f (f x))

    f4 :: (a -> a) -> a -> a
    f4 f x = f (f (f (f x)))


The type remains the same because we still have only two parameters: the function and the value to apply it to.
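Rather than writing a new function for every count, we can combine this with the recursion from the previous chapter and make the count a parameter. A sketch, with applyN being a name invented for this example:

```haskell
-- Apply f to x the given number of times; zero (or fewer) applications
-- leave x untouched, which doubles as the base case.
applyN :: Int -> (a -> a) -> a -> a
applyN n f x
    | n <= 0    = x
    | otherwise = f (applyN (n - 1) f x)
```

With this, applyN 2 behaves like f2, applyN 3 like f3, and so on: applyN 2 succ 3 is 5. The type gains one parameter, the counter, and otherwise stays the same.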

6.2.2. Flipping the Parameters


Sometimes we want to call a function with the parameters in another order. For instance, maybe we want to call our drink calculator (4.2.2, reproduced here for our convenience) in the order n sex.

    drink :: (Fractional a, Ord a) => String -> a -> String
    drink sex n
        | bac < 0.03 = "You're as sober as can be expected."
        | bac < 0.08 = "You can drive, but it's a bad idea."
        | bac < 0.10 = "Your reasoning is out the window."
        | otherwise = "Stop drinking."
        where bac = n * if sex == "male" then 0.025 else 0.035

We can define an additional function like below, but since we're talking about higher-order functions, there is another way.

    flipDrink :: (Fractional a, Ord a) => a -> String -> String
    flipDrink n sex = drink sex n

In this case, we shall use flip. flip is a nice built-in function that reverses the parameters of a two-parameter function. We can define our own version of it[4]:

[4] Quick reminder: despite what the syntax highlighter may imply [FIXME-I'm working on it, but it seems to be a particularly thorny problem], the quote doesn't do anything. It's just another character in the function name so that flip' won't overlap with the predefined flip.


    flip' :: (a -> b -> c) -> b -> a -> c
    flip' f y x = f x y

The reasoning is pretty intuitive but can still be confusing: we want to feed the parameters in reverse order, but the function will only accept them in the right one. So we give the parameters in the wrong order (what we want) and flip' will pass them on in the right order (what the compiler wants), just like flipDrink above.

Some examples:

    ghci> drink "female" 2
    "You can drive, but it's a bad idea."
    ghci> flip' drink 2 "female"
    "You can drive, but it's a bad idea."
    ghci> (-) 3 2
    1
    ghci> flip' (-) 2 3
    1
    ghci> (++) "hello" "world"
    "helloworld"
    ghci> flip' (++) "hello" "world"
    "worldhello"
    ghci> zip [1,2,3] [4,5,6]
    [(1,4),(2,5),(3,6)]
    ghci> flip' zip [1,2,3] [4,5,6]
    [(4,1),(5,2),(6,3)]
    ghci> flip' (zip [1,2,3] [4,5,6]) -- nope, error

What happens if we partially apply flip'? If we only give it the function, we get that function with its parameters reversed.

    ghci> :t zip
    zip :: [a] -> [b] -> [(a, b)]
    ghci> :t flip' zip
    flip' zip :: [b] -> [a] -> [(a, b)]
    ghci> let oddDivision = flip (/)
    ghci> 2 `oddDivision` 3
    1.5

If we give it a function and a parameter, we've essentially partially applied that function on its second parameter:

    ghci> let compare2With = compare 2
    ghci> let compareWith2 = flip compare 2
    ghci> compareWith2 3
    GT
    ghci> compare2With 3
    LT

6.3. More Useful Functions


6.3.1. map and zipWith
Another cool (and useful) thing we can do is apply a function to every element in a list using map. Like before, we can have our own map, which we'll call map'.


    map' :: (a -> b) -> [a] -> [b]
    map' _ [] = []
    map' f (x:xs) = f x : map' f xs

This is the first time we use higher order functions and recursion simultaneously. First, as always, the type declaration: map' takes a function (that takes something of type a and returns something of type b) and a list of somethings of type a[5] and returns a list of somethings of type b.

The second line is the base case: mapping a function (any function, thus the _) over the empty list is the empty list. Recall how we learned them during the recursion chapter.

The third line: mapping a function f over a list with the first element x and the rest of the elements xs is a list with the first element f x and the rest of the elements obtained by mapping f over xs. In other words, we apply the function element by element, starting with the first one. Example:

    ghci> map' succ [6,9,3]
    [7,10,4]

1. map' succ [6,9,3] is succ 6 : map' succ [9,3], which is 7 : map' succ [9,3]
2. map' succ [9,3] is succ 9 : map' succ [3], which is 10 : map' succ [3]
3. map' succ [3] is succ 3 : map' succ [], which is 4 : map' succ []
4. map' succ [] is [], so map' succ [6,9,3] is 7 : 10 : 4 : [], which is [7,10,4]

We're gonna assume that we've gained a sufficient understanding of recursion such that elaborations like the one above aren't necessary from now on.

[FIXME] NOTE: if I haven't explained things well enough and by this point you do not fully understand recursion, especially with higher-order functions, shoot me an e-mail at questions@sthaskell.com telling me where you got lost so I know where to improve. I'd really appreciate it. Thanks!

Some more examples with map', also highlighting some more partial application uses.

    ghci> map' pred [6,9,3]
    [5,8,2]
    ghci> map' sqrt [4,9,16]
    [2.0,3.0,4.0]
    ghci> map' (+2) [10,20,30,40]
    [12,22,32,42]
    ghci> map' (==5) [2,5,3,5]
    [False,True,False,True]
    ghci> map' (4/) [4,2,1,0.5]
    [1.0,2.0,4.0,8.0]
    ghci> map' (++ "aa") ["bb", "cc"]
    ["bbaa","ccaa"]
    ghci> map' ('x':) ["b", "a", "r"]
    ["xb","xa","xr"]
[5] Shorter explanation: map' takes a function (that takes an a and returns a b) and a list of as and returns a list of bs.

Another function, zipWith, is just like map, but it operates on two lists and takes a two-parameter function. Our own version might look something like this:

    zipWith' :: (a -> b -> c) -> [a] -> [b] -> [c]
    zipWith' _ [] _          = []
    zipWith' _ _ []          = []
    zipWith' f (x:xs) (y:ys) = f x y : zipWith' f xs ys

Again, notice how extremely similar to map it is. So, zipWith applies a two-parameter function to the elements of two lists, returning a third list with the results. It finishes when one of the lists is empty. Examples:

    ghci> zipWith' (+) [2,3,4] [5,6,7]
    [7,9,11]
    ghci> zipWith' (++) ["hello ","bye "] ["world","everyone"]
    ["hello world","bye everyone"]
    ghci> zipWith' (*) [1..6] [2,2..]
    [2,4,6,8,10,12]
    ghci> zipWith' compare [5,6,7] [3,10,7]
    [GT,LT,EQ]
    ghci> zipWith' (&&) [True,True] [True,False]
    [True,False]
    ghci> zipWith' (++) ["aa", "bb"] ["xx", "yy"]
    ["aaxx","bbyy"]
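Two things are worth trying by hand here: what happens when the lists have different lengths, and how zipWith' combines with a function we already know. The following is a small sketch; dot and truncated are names invented for this example, and zipWith' is repeated so the file loads on its own.

```haskell
-- File: dot.hs
-- zipWith' as defined in the text.
zipWith' :: (a -> b -> c) -> [a] -> [b] -> [c]
zipWith' _ [] _          = []
zipWith' _ _ []          = []
zipWith' f (x:xs) (y:ys) = f x y : zipWith' f xs ys

-- Hypothetical example: the shorter list decides where we stop,
-- so the 3 in the first list is never used.
truncated :: [Int]
truncated = zipWith' (+) [1,2,3] [10,20]

-- Hypothetical example: multiply corresponding elements, then add them up.
dot :: Num a => [a] -> [a] -> a
dot xs ys = sum (zipWith' (*) xs ys)
```

Here truncated is [11,22], and dot [1,2,3] [4,5,6] is 4 + 10 + 18, that is, 32.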
Now we see another useful application of flip [6]. Not necessarily the following example, but the fact that we can pass a function with its parameters in another order.

    ghci> zipWith' (flip (++)) ["aa", "bb"] ["xx", "yy"]
    ["xxaa","yybb"]
    ghci> flip (zipWith' (++)) ["aa", "bb"] ["xx", "yy"]
    ["xxaa","yybb"]

It's interesting how both methods work. The first one passes a function with its parameters reversed. The second flips the lists around. The end result is the same, but we usually use the first one as it's more readable.

Remember the zip function back in 5.1.3? It turns out it's a specific case of zipWith, namely zipWith (,) [7].

    ghci> zip [1,2,3] "abc"
    [(1,'a'),(2,'b'),(3,'c')]
    ghci> zipWith (,) [1,2,3] "abc"
    [(1,'a'),(2,'b'),(3,'c')]
Additionally, we can continue with the map and zipWith idea and provide something that works on three lists. There actually is such a function, zipWith3. It looks like this:

    zipWith3 :: (a -> b -> c -> d) -> [a] -> [b] -> [c] -> [d]
    zipWith3 _ [] _ _ = []
    zipWith3 _ _ [] _ = []
    zipWith3 _ _ _ [] = []
    zipWith3 f (x:xs) (y:ys) (z:zs) = f x y z : zipWith3 f xs ys zs

It's fairly easy to create such functions for 4, 5 or even more lists, but extremely difficult to make one to work for an arbitrary number of them. We'll look into this much later on, in [XREF].
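To see how "fairly easy" the four-list case is, here is one possible version. The name zipWith4' is ours (the prime keeps it apart from the zipWith4 that Data.List already exports), and the catch-all second equation is a shortcut we chose instead of writing four separate empty-list cases.

```haskell
-- File: zipwith4.hs
-- A sketch of a four-list zipWith; the name zipWith4' is hypothetical.
zipWith4' :: (a -> b -> c -> d -> e) -> [a] -> [b] -> [c] -> [d] -> [e]
zipWith4' f (w:ws) (x:xs) (y:ys) (z:zs) = f w x y z : zipWith4' f ws xs ys zs
-- If any of the four lists has run out, we are done.
zipWith4' _ _ _ _ _ = []

-- Hypothetical helper used only to try the function out.
add4 :: Int -> Int -> Int -> Int -> Int
add4 a b c d = a + b + c + d
```

For instance, zipWith4' add4 [1,2] [10,20] [100,200] [1000,2000] gives [1111,2222]. The pattern is the same as in zipWith3; only the number of lists changes, which is exactly why a version for an arbitrary number of lists needs a different idea.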

[6] No pun intended.
[7] We've met (,) in 3.3.3, when discussing tuples.

6.3.2. Working with Predicates

A predicate is a function that takes a single parameter and returns a boolean (it essentially tells us if something is true). For instance, null, (>3), even, (==2), or, elem 'a', and isInfinite are all predicates (notice how some of them are partially applied functions). They can be used as such, like below, or can be passed to a higher-order function.

    ghci> null [2,3]
    False
    ghci> (>3) 6
    True
    ghci> even 5
    False
    ghci> (==2) 2
    True
    ghci> or [True,True,False]
    True
    ghci> (elem 'a') "hello world"
    False
    ghci> isInfinite (1/0)
    True

Using them as parameters for other functions can be extremely useful, but first we need to know a couple of functions that accept predicates. filter is one of them: it takes a predicate and a list and returns a list containing only the elements that satisfy the predicate.

    filter :: (a -> Bool) -> [a] -> [a]
    filter _ [] = []
    filter p (x:xs) = if p x then x : filter p xs
                             else filter p xs

We immediately notice the predicate: it's the first parameter, p [8], of type a -> Bool. The function traverses the list, element by element, keeping those that satisfy the predicate and excluding those that don't (if p x then include else exclude).

    ghci> filter even [5,4,2,1,3,6,8,2]
    [4,2,6,8,2]
    ghci> filter (>3) [4,3,2,1,5,0]
    [4,5]
    ghci> filter (/= 5) [4,5,6,7]
    [4,6,7]
    ghci> filter (elem 'a') ["hello", "abstract", "gemini"]
    ["abstract"]
    ghci> filter null [[5,6],[7],[],[8,9],[]]
    [[],[]]

[8] While we call our functions f, g and so on, we usually name predicates p and q.

Even better, we can incorporate filter into bigger functions that do useful things, like quicksort.

    quicksort :: Ord a => [a] -> [a]
    quicksort [] = []
    quicksort (x:xs) = lesserSorted ++ [x] ++ greaterSorted
      where
        lesserSorted  = quicksort (filter (<= x) xs)
        greaterSorted = quicksort (filter (> x) xs)

We've recycled the example from 5.3.2, but instead of using list comprehensions, we used filters. In fact, more of the stuff we've discussed so far (like map) has a list comprehension equivalent. We'll talk more about this in 6.3.3. Before we discuss applications, let's look at two functions which are very similar to filter: takeWhile and dropWhile.

takeWhile takes a predicate and a list. Like filter, it takes elements which satisfy the predicate. Unlike filter, it stops entirely when it encounters an element that doesn't satisfy. dropWhile is similar to takeWhile, but it returns the rest of the list, starting with the first element that doesn't satisfy.

    ghci> filter (>3) [4,6,2,1,8,7]
    [4,6,8,7]
    ghci> takeWhile (>3) [4,6,2,1,8,7]
    [4,6]
    ghci> dropWhile (>3) [4,6,2,1,8,7]
    [2,1,8,7]
    ghci> filter (/= ' ') "hello dear world"
    "hellodearworld"
    ghci> takeWhile (/= ' ') "hello dear world"
    "hello"
    ghci> dropWhile (/= ' ') "hello dear world"
    " dear world"
We'll let the source code speak for itself (this time we're using guards instead of explicit if..else, and we're showing another indentation style [9]):

    takeWhile :: (a -> Bool) -> [a] -> [a]
    takeWhile _ []     = []
    takeWhile p (x:xs)
        | p x          = x : takeWhile p xs
        | otherwise    = []

    dropWhile :: (a -> Bool) -> [a] -> [a]
    dropWhile _ []     = []
    dropWhile p xs@(x:xs')
        | p x          = dropWhile p xs'
        | otherwise    = xs

Recall the as patterns (4.1.4): name@pattern allows us to reference pattern by using name; in our case, name is xs and pattern is (x:xs').
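Since takeWhile and dropWhile cut a list at the same spot, they are often used as a pair. Below is a small sketch of that idea. The names span' and splitFirstWord are invented for this example (the Prelude has a function called span that behaves like span' here), and only the Prelude's takeWhile and dropWhile are used.

```haskell
-- File: split.hs
-- Hypothetical helper: the part before the cut and the part after it.
span' :: (a -> Bool) -> [a] -> ([a], [a])
span' p xs = (takeWhile p xs, dropWhile p xs)

-- Hypothetical helper: the first word of a string and whatever follows it,
-- with the separating spaces thrown away.
splitFirstWord :: String -> (String, String)
splitFirstWord s = (firstWord, rest)
  where
    firstWord = takeWhile (/= ' ') s
    rest      = dropWhile (== ' ') (dropWhile (/= ' ') s)
```

With these definitions, span' (>3) [4,6,2,1,8,7] is ([4,6],[2,1,8,7]) and splitFirstWord "hello dear world" is ("hello","dear world"). Note how the second dropWhile uses the opposite predicate to eat the leading space that the first one leaves behind.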

[9] These examples are identical to those in the official source code. It's not a coincidence; that's where I took them from.

6.3.3. Comparison with List Comprehensions

Some higher-order functions that operate on lists, namely map and filter, are equivalent to using list comprehensions. We can even define them this way:

    map f xs    = [f x | x <- xs]
    filter p xs = [x | x <- xs, p x]

Should we use list comprehensions or higher-order functions? Usually we use the former when we have multiple operations to perform and the latter otherwise. For instance, [ 2*x | x <- xs, even x, x >= 2 ] can be expressed by nesting maps and filters, like map (2*) (filter even (filter (>=2) xs)), but the nested version is extremely unreadable. Conversely, map (+2) xs is much more concise than [ x + 2 | x <- xs ].

One extremely cool thing that can be done with map is creating a list of functions, by passing a two- (or more-) parameter function, such as *. This means that the resulting list will contain partially applied functions: (5*), (4*) etc. We can extract elements from it and fully apply them: [FIXME-elaborate on this]

    ghci> let functions = map (*) [5,4,3,2,6]
    ghci> :t functions
    functions :: [Integer -> Integer]
    ghci> (head functions) 8
    40

We can totally do it with list comprehensions, as well:

    ghci> let functions = [ (x*) | x <- [5,4,3,2,6] ]
    ghci> (functions !! 4) 2
    12


Psst! The !! function begins numbering at 0. So while 6 is the fifth element, we need to use !! 4. Performing !! 5 will result in an index too large error.
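Picking one function out of the list is not the only thing we can do with it. As a sketch of where this can go, the file below applies the whole list at once. The names applyAll and applyEach are made up for the example, and the list of functions is the same one as in the session above.

```haskell
-- File: functions.hs
-- The list of partially applied functions: (5*), (4*), (3*), (2*), (6*).
functions :: [Integer -> Integer]
functions = map (*) [5,4,3,2,6]

-- Hypothetical helper: give the same argument to every function in the list.
applyAll :: [a -> b] -> a -> [b]
applyAll fs x = map applyToX fs
  where applyToX f = f x

-- Hypothetical helper: give each function its own argument.
applyEach :: [a -> b] -> [a] -> [b]
applyEach fs xs = zipWith apply fs xs
  where apply f x = f x
```

applyAll functions 10 gives [50,40,30,20,60], while applyEach functions [1,2,3,4,5] gives [5,8,9,8,30]. In both cases map or zipWith does the extracting for us, so there is no index to get wrong.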

takeWhile and dropWhile don't have an easy list comprehension equivalent, so we won't talk about them here. Instead, we'll discuss the difference between the following:

    zipWith (+) [1,2,3] [10,20,30]
    [ x + y | x <- [1,2,3], y <- [10,20,30] ]

While zipWith combines corresponding elements of the list (1 with 10, 2 with 20 and 3 with 30), the list comprehension matches all possible combinations (1 with 10, 1 with 20, 1 with 30, 2 with 10 and so on).

    ghci> zipWith (+) [1,2,3] [10,20,30]
    [11,22,33]
    ghci> [ x + y | x <- [1,2,3], y <- [10,20,30] ]
    [11,21,31,12,22,32,13,23,33]

It's a fundamental difference, but also one easily overlooked.

Warning! Do not confuse zipWith with similar list comprehensions.
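If we do want a list comprehension that behaves like zipWith, one way (sketched below) is to draw from a single zipped list instead of two separate lists. The names pairwiseSum and allSums are ours, chosen just for this comparison.

```haskell
-- File: pairwise.hs
-- Hypothetical example: one generator over zipped pairs,
-- so each x meets only the y at the same position.
pairwiseSum :: [Int] -> [Int] -> [Int]
pairwiseSum xs ys = [ x + y | (x, y) <- zip xs ys ]

-- Hypothetical example: two generators, so every x meets every y.
allSums :: [Int] -> [Int] -> [Int]
allSums xs ys = [ x + y | x <- xs, y <- ys ]
```

pairwiseSum [1,2,3] [10,20,30] gives [11,22,33], the same as zipWith (+), whereas allSums on the same lists gives the nine-element result shown above. The difference is entirely in the number of generators.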

6.3.4. Anonymous Functions (Lambdas)


We've already encountered some functions which needed to be used only once. Initially we separately defined them. Afterwards, we defined them inside a let or a where. But what if our function is so trivial that we'd rather not name it at all? Introducing anonymous functions, or lambdas for short. What better way to show them than to give a few examples?

    -- Syntax: lambdas

    \x -> x + 2

    \xs -> length xs > 100

    \x y z -> x + y + z

Defining anonymous functions is similar to defining regular functions, but instead of the function's name we use \ [10], and instead of = we write ->.

Additionally, a lambda lets us specify the function at the very spot where it is used. This is a really nice timesaver, because we usually create anonymous functions to pass them to higher-order functions, where they will be called anyway [11].

[10] Because not everyone has λ on their keyboards (I do!).
[11] It's a cheesy explanation; we should look at the examples instead.

Compare the two:

    ghci> let f x = 2*x + 3
    ghci> f 5
    13
    ghci> (\x -> 2*x + 3) 5
    13

Notice how we put the lambda in parentheses. Without parentheses, lambdas extend all the way to the right. Let's see some lambdas in use. They are, technically speaking, expressions, so we can fit them anywhere (where a function is needed):

    ghci> map (\x -> 2*x + 3) [1..5]
    [5,7,9,11,13]
    ghci> filter (\x -> x^2 > 16) [10,20,5,4,1,6]
    [10,20,5,6]
    ghci> zipWith (\x y -> x + 2*y) [1,2,3] [4,5,6]
    [9,12,15]
Don't become overzealous with lambdas, though. We might be tempted to use them when it's not necessary:

    ghci> map (\x -> x + 2) [1,2,3]
    [3,4,5]
    ghci> map (\x -> sqrt x) [4,9,25]
    [2.0,3.0,5.0]

Here, we're better off using the functions directly:

    ghci> map (+2) [1,2,3]
    [3,4,5]
    ghci> map sqrt [4,9,25]
    [2.0,3.0,5.0]

One great thing about anonymous functions is that, like regular (named) functions, we can use pattern matching in them. Unlike regular functions, though, we have only one body so we can use only one pattern. If that fails, crash!

    ghci> map (\(x,y) -> compare x y) [(3,4), (5,6), (7,7), (9,8)]
    [LT,LT,EQ,GT]
    ghci> map (\(x:xs) -> (x,xs)) [[2,3,4], [8,10,20]]
    [(2,[3,4]),(8,[10,20])]
    ghci> map (\('a':xs) -> xs) ["animal", "anonymous"]
    ["nimal","nonymous"]
    ghci> map (\(3:xs) -> xs) [[4,5]]
    [*** Exception: <interactive>:74:6-18: Non-exhaustive patterns in lambda
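If we want several patterns inside a lambda after all, one option (a sketch, not the only way) is to take a plain parameter and inspect it with a case expression, which may have as many alternatives as we like. The name dropLeading3 is made up for this example, and leaving non-matching lists untouched is simply the fallback we chose.

```haskell
-- File: safelambda.hs
-- Hypothetical example: remove a leading 3 where there is one,
-- and leave every other list exactly as it is.
dropLeading3 :: [[Int]] -> [[Int]]
dropLeading3 = map (\ys -> case ys of { (3:xs) -> xs; _ -> ys })
```

Now dropLeading3 [[3,4,5],[4,5],[]] gives [[4,5],[4,5],[]] instead of an exception. Whether a fallback like this is the right behavior depends on the program; sometimes a crash is the more honest answer.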
One final cool thing before we finish with lambdas: because of currying (and the fact that lambdas extend all the way to the right if we don't put them in parentheses), the following two are equivalent:

    \x y -> x + y
    \x -> \y -> x + y

One additional consequence of currying is that we can also define functions using lambdas, but it's usually not as readable. Notice how the parameters can be moved to the right, after the =:

    f2 :: (a -> a) -> a -> a
    f2 f x = f (f x)

    g2 :: (a -> a) -> a -> a
    g2 f = \x -> f (f x)

    h2 :: (a -> a) -> a -> a
    h2 = \f x -> f (f x)

We won't focus as much on anonymous functions here because we'll use them extensively in the chapters that follow.

6.4. Folds and Scans


6.4.1. Eating a List
Remember the discussion in 5.3.3 about common patterns in recursion? Here are the examples again:

    sum [] = 0
    sum (x:xs) = x + sum xs

    product [] = 1
    product (x:xs) = x * product xs

    and [] = True
    and (x:xs) = x && and xs

    or [] = False
    or (x:xs) = x || or xs

The common pattern is:

    ourFunction [] = startingValue
    ourFunction (x:xs) = x `someFunction` ourFunction xs

Or, if we don't call it infix [12]:

    ourFunction [] = startingValue
    ourFunction (x:xs) = someFunction x (ourFunction xs)


[12] Refresher: A prefix function comes before its parameters: f x y. An infix function is between them: x `f` y. The notations are equivalent. Notice the backquotes.

Because we now know higher-order functions, let's make one that covers all these cases. This is going to be difficult, but we'll manage. What would be its parameters?

1. someFunction, which takes two parameters and returns a result. This could be any of the following: +, *, && etc.
2. startingValue, which can be 0, 1, True or any other value
3. xs, the list on which to perform the operations.

Let's call our function eat, because it kinda looks like we're eating the list. Okey dokey, here we go. The type definition. It's got to be something like eat :: (a -> a -> a) -> a -> [a] -> a, but maybe making everything the same type is too specific. Let's skip this one and let GHCi infer the type when we're finished. Let's call someFunction f and startingValue x0 for conciseness. The edge condition is easy: eating the empty list gives us the starting value.

    eat f x0 [] = x0

We know that we have to split the list into a head and a tail.

    eat f x0 [] = x0
    eat f x0 (x:xs) =

Looking back at ourFunction, we know we need to apply f to x and something else. Let's keep the prefix (not infix) format we used above.

    eat f x0 [] = x0
    eat f x0 (x:xs) = f x

Now, what is f's second parameter? The whole thing should be recursive, so the logical choice would be to call eat with xs. We shouldn't forget the parentheses to group eat xs.

    eat f x0 [] = x0
    eat f x0 (x:xs) = f x (eat xs)

Wait! We forgot the other parameters. There:

    eat f x0 [] = x0
    eat f x0 (x:xs) = f x (eat f x0 xs)


Let's put it in a file (say, eat.hs) and load it.

    ghci> :l eat.hs
    [1 of 1] Compiling Main             ( eat.hs, interpreted )
    Ok, modules loaded: Main.

It compiled! This usually means we did it right. Let's check its type.

    ghci> :t eat
    eat :: (t -> t1 -> t1) -> t1 -> [t] -> t1

It seems that our eat :: (a -> a -> a) -> a -> [a] -> a guess was indeed too specific. But it was close! Let's write the type declaration (using a and b instead of the ugly t and t1), align things a little and marvel at our handiwork:

    -- File: eat.hs
    eat :: (a -> b -> b) -> b -> [a] -> b
    eat f x0 []     = x0
    eat f x0 (x:xs) = f x (eat f x0 xs)
Now let's go ahead and try to define sum in terms of eat. Our function is addition, +. The starting value is 0. Let's go!

    sum' xs = eat (+) 0 xs

The xs is redundant [13], so we can remove it.

    sum' = eat (+) 0

[13] It's because of the fundamentals of currying and partial application (6.1.1).

We can try it out (the definition of sum' has to be included in eat.hs for it to load) and see if it works properly.

    ghci> sum' [1,2,3,4,5]
    15
    ghci> sum' []
    0


Excellent! We can go ahead and define the other functions in here just as easily:

    product' = eat (*) 1
    and'     = eat (&&) True
    or'      = eat (||) False

Let's test them as well.

    ghci> product' [4,5,6]
    120
    ghci> product' [0]
    0
    ghci> and' [True,True,False]
    False
    ghci> or' [False,True,False]
    True

This eat function is really useful. How come it's not predefined? Let's do a Hoogle search for its type. Whoops! It found something: foldr. It looks like we've just reinvented the wheel.

    ghci> let sum' = foldr (+) 0
    ghci> sum' [1,2,3,4,5]
    15

This is actually a big problem when writing Haskell programs. Because most functions are so abstract, there's almost always one that covers our particular need: in this case, foldr. Our time was not lost, however, as we now know how foldr works under the hood and we've got a pretty good idea on how to design and test new functions. Too bad our function name, eat, was not so inspired.
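The four functions above all combine elements of the same type, but the inferred type (a -> b -> b) says the result may be of a different type than the elements. As a sketch of what that buys us, here are three more functions written with eat. The primed names are ours, and the lambdas play the role of someFunction.

```haskell
-- File: eat-more.hs
-- eat as defined in the text (it behaves like the Prelude's foldr).
eat :: (a -> b -> b) -> b -> [a] -> b
eat _ x0 []     = x0
eat f x0 (x:xs) = f x (eat f x0 xs)

-- Hypothetical example: ignore each element and add 1 to the count so far.
length' :: [a] -> Int
length' = eat (\_ n -> n + 1) 0

-- Hypothetical example: apply f to each element and put it in front of the rest.
map'' :: (a -> b) -> [a] -> [b]
map'' f = eat (\x acc -> f x : acc) []

-- Hypothetical example: glue a list of lists together.
concat' :: [[a]] -> [a]
concat' = eat (++) []
```

length' "hello" is 5, map'' succ [6,9,3] is [7,10,4] and concat' ["ab","cd"] is "abcd". In length' the elements can be anything while the result is an Int, which is exactly the case our first, too specific, type guess would have ruled out.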

6.4.2. Introducing Folds Proper


Now that we've played around with eat, which turned out to be our own implementation of foldr, it's time to understand how to make the most of it. We should mention that there are six different folds: foldr, foldr', foldr1, foldl, foldl', and foldl1. That's a lot of functions that do the same thing [14]! Don't worry, we rarely use them all. [FIXME]

[14] You may be wondering why we need six different functions for this. I'll be happy to point out eq, eql, equal, equalp, and = in Lisp.

Part III. Appendices

A. Miscellaneous
A.1. Functions
A.1.1. Fixity
The following table [1] shows the precedence and fixity (left-, non-, and right-associativity) of the operators in Prelude.

Precedence  Left-associative                    Non-associative                           Right-associative
9           !!                                                                            .
8                                                                                         ^, ^^, **
7           *, /, `div`, `mod`, `rem`, `quot`
6           +, -
5                                                                                         :, ++
4                                               ==, /=, <, <=, >, >=, `elem`, `notElem`
3                                                                                         &&
2                                                                                         ||
1           >>, >>=
0                                                                                         $, $!, `seq`

Below are some examples of precedence and fixity declarations (if an operator definition lacks a fixity declaration it is assumed to be infixl 9).

    -- File: fixity.hs
    x ++++ y = x + y + x*y
    infixl 3 ++++   -- left-associative

    x -.- y = x^3 + y^3
    infixr 5 -.-    -- right-associative

    func a b = a + b + b
    infix 2 `func`  -- non-associative
[1] Taken from the Haskell 98 Report

In many cases the correct fixity declaration carries a great deal of importance; let's take -.- (the one declared above) as an example.

    ghci> (3 -.- 4) -.- 5
    753696
    ghci> 3 -.- (4 -.- 5)
    6751296

So, even though -.- is right-associative in Haskell-speak, it is non-associative in the mathematical sense: (a -.- b) -.- c is not the same as a -.- (b -.- c).
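What the fixity declaration really decides is how an expression without parentheses is grouped. The sketch below makes that visible by defining the same function twice with different fixities. The operator -+- is a made-up left-associative twin of -.-, and the type signatures are our addition.

```haskell
-- File: fixity-demo.hs
-- The operator from fixity.hs, declared right-associative.
infixr 5 -.-
(-.-) :: Integer -> Integer -> Integer
x -.- y = x^3 + y^3

-- Hypothetical twin: the same body, but declared left-associative.
infixl 5 -+-
(-+-) :: Integer -> Integer -> Integer
x -+- y = x^3 + y^3

-- No parentheses: the fixity declaration picks the grouping for us.
groupedRight :: Integer
groupedRight = 3 -.- 4 -.- 5   -- read as 3 -.- (4 -.- 5)

groupedLeft :: Integer
groupedLeft = 3 -+- 4 -+- 5    -- read as (3 -+- 4) -+- 5
```

groupedRight is 6751296 and groupedLeft is 753696, the two results from the session above. Same arithmetic, same operands, different answer, purely because of the declared associativity.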

A.1.2. Laziness Explained


Haskell has a very strange property when compared to your usual programming languages: it's lazy. This means that the compiler or interpreter will evaluate an expression only when it's needed. It's very tricky on many levels, mainly because laziness introduces important differences between superficially similar expressions. Let's take the simplest function imaginable and move on from there: &&. As a reminder, && is defined like so:

    (&&) :: Bool -> Bool -> Bool
    True  && x = x
    False && _ = False
Note how && won't even evaluate its second argument if the first is False: && is strict in the first argument, but lazy in the second. In other words, && must always evaluate the first argument, but not necessarily the second one. We have a better perspective when we look at expressions in light of thunks. Thunks are unevaluated values (with instructions on how to evaluate them) [2]. Let's take the following piece of code as an example:

    -- File: thunks.hs

    a      = (length "hello", [1, 2, 3, 4])
    (b, c) = a
    1:d    = c

Line-by-line:

1. Haskell matches (length "hello", [1, 2, 3, 4]) to a. Because we do nothing to a, Haskell doesn't care what it is. It doesn't actually evaluate it, so a is just a thunk.
2. a will need to be matched to a pair. In order to make sure the match succeeds and to assign the necessary variables, a is evaluated to something like (thunk, thunk). b and c become thunks.
3. c, previously a thunk, is now evaluated to make sure it conforms to 1:d. c now becomes 1:thunk.

Let's take another example: Let's try to fully evaluate ("hi", [4, 5]) [3]. The steps are as follows:

    -- Evaluation steps
    thunk -- unevaluated
    (thunk, thunk)
    ('h':thunk, thunk)
    ('h':'i':thunk, thunk)
    ('h':'i':[], thunk)
    ('h':'i':[], 4:thunk)
    ('h':'i':[], 4:5:thunk)
    ('h':'i':[], 4:5:[])

Partially evaluated values, whose outermost constructor is known but whose contents may still be thunks, are in something called weak head normal form. Fully evaluated things are in normal form.

[2] In fact, if Haskell weren't lazy, there would be no such thing as a thunk: all expressions and values would always be fully evaluated.
[3] For example, by printing it: printing forces evaluation.

We don't always know which functions are strict and which are lazy, but we can check by calling them with undefined. If undefined is not evaluated (i.e. remains a thunk), nothing happens. If it is, it throws an error.

    ghci> False && undefined
    False
    ghci> undefined && False
    *** Exception: Prelude.undefined


This is our confirmation of the above: && is indeed lazy in the second argument, but strict in the first.

There are some catches however. We might expect 0 * x to not need to evaluate x. After all, 0 multiplied by anything is 0 [4]. So we can put undefined, can't we?

    ghci> 0 * undefined
    *** Exception: Prelude.undefined

Surprise surprise! It seems that multiplication is strict in both parameters, even when supplied 0.

What isn't surprising is that laziness is a touchy subject; the best way to learn it is through experience.
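If we really wanted a multiplication that skips its second argument, we could write one ourselves using the same trick && uses: pattern match on the first argument only. This is just a sketch for Integer (where 0 times anything really is 0); lazyMul is a made-up name.

```haskell
-- File: lazymul.hs
-- Hypothetical example: multiplication that is lazy in its second argument
-- whenever the first one is 0.
lazyMul :: Integer -> Integer -> Integer
lazyMul 0 _ = 0        -- the second argument is never looked at
lazyMul x y = x * y    -- otherwise, ordinary (strict) multiplication
```

With this definition, lazyMul 0 undefined is 0, while lazyMul undefined 0 still throws an exception: just like &&, the function has to evaluate its first argument to decide which equation applies.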

A.2. Constants (A.K.A. Variables)


A.2.1. Local Variables
Let's look at the following example:

    f x y
      | g x y < 5  = "Less than 5"
      | g x y == 5 = "Equal to 5"
      | otherwise  = "Greater than 5"
      where g x y = 2*x + 3*y

The names (variables) x and y appear 5 times each. Let's count them:

1. f x y: the parameters in f's function definition
2. g x y < 5: the parameters used in a call to g
3. g x y == 5: the parameters used in a call to the same g.
4. g x y = ...: the parameters in g's function definition (before the =)
5. 2*x + 3*y: the parameters in g's body (after the =)

While we've used the same names in all instances, they are logically different. We can separate them as follows:

Pertaining to f: 1, 2, and 3
Pertaining to g: 4 and 5

The names pertaining to f are logically different than those pertaining to g: they seem to have the same name (to us) but internally they are different. In other languages, those pertaining to g would be called local variables because they are logically different and the difference occurs only in a limited area: g x y = 2*x + 3*y.

We can reflect the difference in meaning ourselves, by renaming them:

    f x y
      | g x y < 5  = "Less than 5"
      | g x y == 5 = "Equal to 5"
      | otherwise  = "Greater than 5"
      where g a b = 2*a + 3*b
[4] Not actually true: 0 * ∞ is undefined, and 0 * (-1) is negative zero.

There are more examples of local variables, even in the interactive prompt:

    ghci> let x = 2; y = 3
    ghci> let f x y = (x,y)
    ghci> f x y
    (2,3)
    ghci> f 4 5
    (4,5)
    ghci> let x = 4; y = 5 in f x y
    (4,5)
    ghci> x
    2
    ghci> y
    3
    ghci> let x = 100; y = 200
    ghci> f x y
    (100,200)

What's going on here? As we know, in Haskell, no variables can change.

1. let x = 2; y = 3 defines the names x and y to be 2 and 3.
2. let f x y = (x,y) defines a two-parameter function that pairs the parameters (essentially, (,)).
3. f x y calls f with the parameters x and y, which are 2 and 3.
4. f 4 5 calls f with 4 and 5. Because the x and y from (x,y) are logically different than those from x = 2 and y = 3, the function behaves as expected.
5. let x = 4; y = 5 in f x y temporarily binds 4 and 5 to x and y respectively, then calls the function with those values.
6. x and y have not changed outside the previous expression: they are still 2 and 3.
7. let x = 100; y = 200 binds the new values of 100 and 200 to the names x and y
8. f x y proves that the new values remain.

What happens is that when we call let in 1 and 7, we don't permanently change their values: if we exit GHCi and enter it again, the defined values are gone. What the let in 1 and 7 does is temporarily bind the values (2 and 3 and then 100 and 200) to x and y until the end of the interactive session. The let in 5 temporarily binds 4 and 5 to x and y until f x y is evaluated, after which it reverts to the previous values.

What's going on here may be confusing, but hopefully it is somewhat intuitive. The point is that we're not talking of the same x and y with different values, we're talking about different xs and ys. We can illustrate that:

    ghci> let x1 = 2; y1 = 3
    ghci> let f x2 y2 = (x2,y2)
    ghci> f x1 y1
    (2,3)
    ghci> f 4 5
    (4,5)
    ghci> let x3 = 4; y3 = 5 in f x3 y3
    (4,5)
    ghci> x1
    2
    ghci> y1
    3
    ghci> let x4 = 100; y4 = 200
    ghci> f x4 y4
    (100,200)

We're going to get better at understanding the when and how of local variables as our experience increases.
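The same thing can be tried outside the interactive prompt. The file below is a sketch of the first half of the session; the names inner and outer are made up, and the type signatures are our addition. (A second top-level definition of x, like the let in step 7, is not allowed in a file, so that part is left out.)

```haskell
-- File: shadow.hs
-- Top-level names, like the first let in the session.
x, y :: Int
x = 2
y = 3

-- These x and y are f's own parameters; they hide the ones above
-- inside this definition only.
f :: a -> b -> (a, b)
f x y = (x, y)

-- Hypothetical example: a local x and y that exist only inside this let.
inner :: (Int, Int)
inner = let x = 4; y = 5 in f x y

-- Hypothetical example: outside the let, the top-level x and y are untouched.
outer :: (Int, Int)
outer = f x y
```

inner is (4,5) and outer is (2,3). Nothing was changed along the way; there were simply three different pairs of names that happened to be spelled the same.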

B. Types and Typeclasses


B.1. Typeclasses in Depth
Typeclasses are the bread and butter of Haskell [1]. Some of the most common (and useful) typeclasses, roughly presented from general to specific, are:

B.1.1. Show and Read

These two typeclasses are, for the most part, invisible to the user. Although almost every type out there belongs to both of them, Show and Read are handled by the computer [2]: we only need to tell the compiler "hey, that type is part of Show" and it does the rest.

Show contains all types which can be converted to strings.

- Includes: almost all types (Int, [Bool], [[Char]] etc.)
- Does not include: functions (Int -> Int etc.)
- Prerequisites: none
- Built-in functions:
  * show converts a value to a string

    ghci> show 5
    "5"
    ghci> show 203
    "203"
    ghci> show False
    "False"
    ghci> show [1, 2, 5]
    "[1,2,5]"
    ghci> show ["hi", "hello", "blah"] -- result looks funky
    "[\"hi\",\"hello\",\"blah\"]"
Read is the converse of Show.

- Includes: almost everything that can also be shown.
- Does not include: functions
- Prerequisites: none
- Built-in functions:
  * read converts a string to a specific value [3].

    ghci> read "True" && False
    False
    ghci> read "True" :: Bool
    True
    ghci> read "67" + 89
    156
    ghci> read "67" -- ambiguous type variable error

[1] Author's note: in retrospect, I don't know what I meant by saying this.
[2] They can also, however, be manually specified, but that's rare. The computer does a really good job.
[3] The type has to be specified, either by performing an operation and letting Haskell infer, or by explicitly declaring it. Otherwise, an ambiguous type variable error is thrown (details in B.2.2).
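In a source file, the usual way to tell read what we want is a type signature on the definition that uses it. Below is a short sketch; the names asInt, asList and roundTrip are made up for the example.

```haskell
-- File: read-demo.hs
-- The signature tells read which type to produce.
asInt :: Int
asInt = read "67"

-- The same string format that show produces for lists can be read back.
asList :: [Int]
asList = read "[1,2,3]"

-- Hypothetical check: showing a value and reading it back gives the
-- original. The comparison with a list of Bools fixes read's type.
roundTrip :: Bool
roundTrip = read (show [True, False]) == [True, False]
```

asInt is 67, asList is [1,2,3] and roundTrip is True. In each case there is exactly one type read can pick, so the ambiguous type variable error never shows up.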

B.1.2. Eq, Ord, Enum

Many useful functions require membership in at least one of these typeclasses. After all, there is no function that can order unsortable items, and you can't list that which cannot be enumerated.

Eq contains all types that can be equated.

- Includes: almost all types
- Does not include: functions
- Prerequisites: none
- Built-in functions:
  * == tests for equality
  * /= tests for inequality

    ghci> 5 == 6
    False
    ghci> "hello" == "hello"
    True
    ghci> (+) == (*) -- type error
Ord contains types which have a logical ordering.

- Includes: almost all types
- Does not include: functions
- Prerequisites: Eq
- Built-in functions:
  * > and >=
  * < and <=
  * compare returns an ordering
  * max and min

    ghci> 4 > 5
    False
    ghci> "abcd" >= "abcc"
    True
    ghci> True < False
    False
    ghci> max 10 3
    10
    ghci> compare 4 5
    LT
    ghci> compare 4 4
    EQ
    ghci> compare 5 4
    GT


Enum contains types which can be enumerated.

- Includes: almost all types
- Does not include: functions, strings
- Prerequisites: none
- Built-in functions:
  * succ returns the logical successor
  * pred returns the logical predecessor
  * Other functions synonymous to using ranges

    ghci> succ 6
    7
    ghci> succ 'y'
    'z'
    ghci> succ 'z'
    '{'
    ghci> succ "abcde" -- type error
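The "other functions synonymous to using ranges" are easiest to see on a type of our own. The sketch below assumes we are comfortable with data declarations and deriving; the type Day and the names weekend and workdays are made up for the example, while enumFrom and enumFromTo are the actual Enum functions behind the range syntax.

```haskell
-- File: days.hs
-- Hypothetical type: the compiler writes the Enum instance for us.
data Day = Mon | Tue | Wed | Thu | Fri | Sat | Sun
  deriving (Show, Eq, Enum)

-- [Sat ..] is another way of writing enumFrom Sat.
weekend :: [Day]
weekend = [Sat ..]

-- [Mon .. Fri] is another way of writing enumFromTo Mon Fri.
workdays :: [Day]
workdays = [Mon .. Fri]
```

weekend is [Sat,Sun], workdays has five elements, and succ Mon is Tue. Asking for succ Sun is an error, because there is nothing after the last constructor.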

B.1.3. Numeric Typeclasses


All numbers have a common set of operations. They can, for example, be added or subtracted, even multiplied. They are grouped in many different classes, however, because some of them lack specific behavior. For instance, complex numbers [4] cannot be ordered [5].

Num is the most general numeric typeclass.

- Includes: Int, Integer, Rational, Float, Double etc.
- Does not include: non-numbers
- Prerequisites: Eq, Show (as in the Haskell 98 Report; GHC 7.4 and later no longer require them)
- Built-in functions:
  * +, -, and *
  * negate returns the opposite of a number
  * abs returns the absolute value
  * signum is the sign function [6]

    ghci> 5 + 4 * 3 - 2
    15
    ghci> negate 10
    -10
    ghci> abs (-5)
    5
    ghci> signum 23
    1
    ghci> signum (-23)
    -1

[4] The issue is multifaceted: complex numbers have the type RealFloat a => Complex a as opposed to other numbers (for example, Fractional a => a).
[5] The previous footnote was about complex numbers, not their ordering. Just a clarification.
[6] Returns 1 on a positive number, 0 on zero, and -1 on a negative number.
Integral is the typeclass of integers.

- Includes: Int, Integer and other size integers (Int8, Int16, Int32 etc.)
- Does not include: anything else
- Prerequisites: Num, Ord, Enum
- Built-in functions:
  * quot, the quotient in division
  * div, integer division
  * rem, the remainder
  * mod, modulo function

    ghci> 17 `quot` 3
    5
    ghci> 17 `div` 3
    5
    ghci> 17 `rem` 3
    2
    ghci> 17 `mod` 3
    2
    ghci> 17 `quot` (-3)
    -5
    ghci> 17 `div` (-3)
    -6
    ghci> 17 `rem` (-3)
    2
    ghci> 17 `mod` (-3)
    -1
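The results above look arbitrary until we notice that the functions come in pairs: quot goes with rem, and div goes with mod. Within each pair, multiplying back and adding the remainder recovers the original number. Here is a sketch that checks this; the names quotRemLaw, divModLaw and bothPairs are ours, while quotRem and divMod are the Prelude functions that return both halves of a pair at once.

```haskell
-- File: division.hs
-- Hypothetical check: quot rounds towards zero, rem makes up the difference.
quotRemLaw :: Int -> Int -> Bool
quotRemLaw x y = (x `quot` y) * y + (x `rem` y) == x

-- Hypothetical check: div rounds down, mod makes up the difference.
divModLaw :: Int -> Int -> Bool
divModLaw x y = (x `div` y) * y + (x `mod` y) == x

-- Both pairs for 17 and -3, computed in one go.
bothPairs :: ((Int, Int), (Int, Int))
bothPairs = (17 `quotRem` (-3), 17 `divMod` (-3))
```

bothPairs is ((-5,2),(-6,-1)), matching the session, and both laws hold for 17 and -3 (and for any other nonzero divisor). Mixing the pairs, say quot with mod, is what breaks things.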

Warning! Do not confuse quot with div and rem with mod: they behave differently on negatives.

Fractional contains fractions, both common (1/4, 2/3) and decimal (2.5, 8.53)

- Includes: Rational, Float, Double etc.
- Does not include: integers, non-numbers
- Prerequisites: Num
- Built-in functions:
  * /, the division function
  * recip, the inverse of a number (1/x, where x is the number) [7]

    ghci> 5.2 / 3.2
    1.625
    ghci> recip 0.25
    4.0
    ghci> 1 / 0.25
    4.0
    ghci> recip 0
    Infinity
    ghci> 1 / Infinity -- doesn't work

[7] recip 0 gives Infinity. However, Infinity is not a number per se, it's just a way to display ∞. If we really want to use ∞ in our calculations (which, by the way, is an extremely bad idea), we must use recip 0 or 1/0 or whatever.
Floating contains decimal numbers

- Includes: Float, Double etc.
- Does not include: common fractions, integers, non-numbers
- Prerequisites: Fractional
- Built-in functions:
  * pi, a function of zero parameters (a constant) [8]
  * exp, sqrt, log
  * logBase, which takes two parameters
  * **, the fractional power function
  * sin, cos, tan and friends (sinh, acos, asinh etc.)

    ghci> pi :: Float
    3.1415927
    ghci> pi :: Double
    3.141592653589793
    ghci> log 10
    2.302585092994046
    ghci> 5 ** 2.3
    40.51641491731905
    ghci> sin (pi / 3)
    0.8660254037844386
    ghci> cos (pi / 3)
    0.5000000000000001
    ghci> logBase 10 1000
    2.9999999999999996

Warning! Watch out for rounding errors: they're a pain in the brain.
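One common way of living with rounding errors is to stop asking whether two floating-point numbers are equal and ask whether they are close enough. The helper below is only a sketch: approxEq is a made-up name, and the tolerance 1e-9 is an arbitrary choice that would need adjusting for very large or very small numbers.

```haskell
-- File: approx.hs
-- Hypothetical helper: True when the two numbers differ by less than
-- a small, fixed tolerance.
approxEq :: Double -> Double -> Bool
approxEq a b = abs (a - b) < 1e-9
```

With it, approxEq (logBase 10 1000) 3 and approxEq (cos (pi / 3)) 0.5 are both True, even though the session above shows that neither value is exactly what we would expect on paper.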

[8] It's a very interesting case: because functions can be polymorphic and constants are (zero-parameter) functions, constants can also be polymorphic.

B.2. Type Errors

B.2.1. General Type Errors

We'll analyze the following type error in detail, line by line. Intimate knowledge of the structure of type errors should help us fix them much faster.

    ghci> 1 * False

    <interactive>:1:1:
        No instance for (Num Bool)
          arising from the literal `1'
        Possible fix: add an instance declaration for (Num Bool)
        In the first argument of `(*)', namely `1'
        In the expression: 1 * False
        In an equation for `it': it = 1 * False
The analysis, as promised:

1. ghci> 1 * False is the (incorrect) expression we ran.
2. Line 2 is a blank line. It doesn't really do anything.
3. <interactive>:1:1: is the location in the program that gives the error ([line]:[character]).
4. No instance for (Num Bool) means that False, which is a Bool, can't be a number (Num).
5. arising from the literal `1' tells us that it is through our use of 1, which is a number, that GHCi inferred that False must also be a number so it can multiply them. But False is a Bool, and Bools aren't numbers. Contradiction.
6. Possible fix: add an instance declaration for (Num Bool) suggests that it is possible to fix the error by defining how Bools can be numbers. For example, if we tell GHCi that False is the same as 0 and True is really 1, then the expression would compile [9]. Adding instance declarations is explained in [XREF].
7. In the first argument of `(*)', namely `1' gives specific context for the error: the first argument of *.
8. In the expression: 1 * False gives more general context.
9. In an equation for `it': it = 1 * False gives the most general context of the error. In GHCi, it is an internal variable that stores the result of the previous computation.

Basically all type errors in GHCi follow the above format [10]. It's important to understand them, as they're the fastest way of identifying the problem, especially in very complex cases.
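Item 6 mentions teaching GHCi that False is 0 and True is 1. A lighter alternative, sketched below, is to leave Bool alone and make the conversion explicit at the point of use. boolToInt is a hypothetical helper introduced only for this example; fromEnum is the standard Prelude function, which already maps False to 0 and True to 1.

```haskell
-- Sketch: making the Bool-to-number conversion explicit instead of
-- declaring a Num Bool instance. boolToInt is a hypothetical helper.

boolToInt :: Bool -> Int
boolToInt False = 0
boolToInt True  = 1

main :: IO ()
main = do
  print (1 * boolToInt False)  -- 0
  print (1 * boolToInt True)   -- 1
  print (1 * fromEnum True)    -- 1, via the standard Enum instance of Bool
```

With the conversion spelled out, both arguments of * are Ints and the "No instance for (Num Bool)" error does not arise.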

B.2.2. Ambiguous Type Variable Errors


Sometimes Haskell cannot successfully infer the types of the expressions involved. In that case, we are presented with the following:

1  ghci> read "5"
2
3  <interactive>:1:1:
4      Ambiguous type variable `a0' in the constraint:
5        (Read a0) arising from a use of `read'
6      Probable fix: add a type signature that fixes these type variable(s)
7      In the expression: read "5"
8      In an equation for `it': it = read "5"
We shall, yet again, dissect the error. The line-by-line analysis shows that:

1. ghci> read "5" is our ambiguous expression.
2. Line 2 is an empty line. GHCi has the tendency to put that before long errors.
3. <interactive>:1:1: is the position of the ambiguous statement, [line]:[character]. Here, it's at the very beginning of our interactive statement.
4. Ambiguous type variable `a0' in the constraint: tells us that GHCi cannot infer the type because it has multiple solutions.
5. (Read a0) arising from a use of `read' indicates that the typeclass Read contains multiple types. What it doesn't say, but we know, is that Haskell must know the specific type to read. For example, the input can be read as:
   a) A character ('5')
   b) A number (5)
   c) A string ("5")
   d) Many, many others
6. Probable fix: add a type signature that fixes these type variable(s) recommends fixing the error by adding an explicit type signature [11]. GHCi implies ("probable") that in most cases this would be the desirable action.
7. In the expression: read "5" is the context of the ambiguity.
8. In an equation for `it': it = read "5" gives even more context.

With all this info, it's hard not to identify and fix the problem immediately!

[9] In this case it's not recommended, seeing how multiplying a Bool and a number doesn't make much sense.
[10] Other interpreters may display differently.
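The fix recommended in item 6 can be sketched as follows. The helper names parseInt and parseDouble are hypothetical and exist only to show that a signature on the enclosing function is enough to settle the type. Note that the standard Read instances for Char and String expect the input text to carry its own quotes.

```haskell
-- Sketch: resolving the ambiguity of read with explicit type signatures.
-- parseInt and parseDouble are hypothetical helper names.

-- The signature alone tells read which type to produce.
parseInt :: String -> Int
parseInt = read

parseDouble :: String -> Double
parseDouble = read

main :: IO ()
main = do
  print (read "5" :: Int)         -- 5
  print (parseInt "5")            -- 5
  print (parseDouble "5")         -- 5.0
  print (read "'5'" :: Char)      -- '5'
  print (read "\"5\"" :: String)  -- "5"
  -- The type can also be fixed by the surrounding context:
  print (read "5" + (1 :: Int))   -- 6
```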

B.2.3. Making Custom Errors


A more expressive way of correcting a program without actually suppressing the error is to write our own error message. We might want this if it's the user's fault for incorrect input, and we want to halt the program, as well as help the user in fixing the input. We will use the 4.1.1 base example, reproduced below for convenience.

    -- File: patterns-wrong.hs

    intToString :: Int -> [Char]
    intToString 1 = "one"
    intToString 2 = "two"
    intToString 3 = "three"

The error function takes a string and throws an error with that message.

    -- File: patterns-wrong.hs (FIXED)

    intToString :: Int -> [Char]
    intToString 1 = "one"
    intToString 2 = "two"
    intToString 3 = "three"
    intToString _ = error "intToString: Number too large"

Notice that the error handler doesn't know the name of the function beforehand, so we might want to include it in the error message, like above.

    ghci> intToString 20
    *** Exception: intToString: Number too large


[11] Such as :: Int or :: Char.

Because error is an ordinary function, we can also do some magic to make it more expressive.

    -- File: patterns-wrong.hs (FIXED)

    intToString :: Int -> [Char]
    intToString 1 = "one"
    intToString 2 = "two"
    intToString 3 = "three"
    intToString n = error ("intToString: Number " ++ show n ++ " too large")

    ghci> intToString 20
    *** Exception: intToString: Number 20 too large


Customizing error messages is not mandatory, but it's a very good idea, especially in long and complicated programs. Of course, the real solution is never to crash expressively, but to actually aid the user without blowing the program to smithereens: graceful failure. We learn such methods late in the book, in [XREF] and [XREF].
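As a small preview of what graceful failure can look like, the sketch below returns a Maybe instead of calling error. It is an illustration only, assuming that Nothing is an acceptable way to signal unsupported input; the names intToStringSafe and describe are hypothetical and are not taken from the book.

```haskell
-- Sketch: a non-crashing variant of intToString using Maybe.
-- intToStringSafe and describe are hypothetical names.

intToStringSafe :: Int -> Maybe String
intToStringSafe 1 = Just "one"
intToStringSafe 2 = Just "two"
intToStringSafe 3 = Just "three"
intToStringSafe _ = Nothing

-- Turn the result into a message for the user without halting the program.
describe :: Int -> String
describe n = case intToStringSafe n of
  Just s  -> s
  Nothing -> "intToString: no name known for " ++ show n

main :: IO ()
main = do
  putStrLn (describe 2)   -- two
  putStrLn (describe 20)  -- intToString: no name known for 20
```

The caller is forced to handle the Nothing case, so an unexpected number produces a message rather than an exception.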


C. Modules
C.1. Data.List
The Data.List module is the one-stop shop for all our list goodies. It supports many functions, detailed below. The trickier ones have example code. [FIXME]
