Errata - Mathematical Introduction To Data Science
Errata - Mathematical Introduction To Data Science
(x(i) (j)
1 − minj=1,...,n x1 )(b − a)
n Page 41, Line 5: x̃(i) = a+ (j) (j)
, . . .
maxj=1,...,n x1 − minj=1,...,n x1
x(i) − x(·)
n Page 41, Line 7: x̃(i) = 1
σ1
1
,...
(1) (4)
n Page 42, Line 14: ρ(x , x ) = 3.681
n Page 46, Line 1: We discuss some of these methods in Exercise 3.10.
n Page 45, Line 28: We thus see that texts no. 1 and text no. 2 are significantly more
cosine similar than text no. 1 and text no. 3 or text no. 2 and text no. 3.
n Page 48, Line 26: The cosine distance, on the other hand, may appear here more natural,
as the scalar product increases if the frequency of the fixed word increases in the second
text.
n Page 52, Line 11: For finite subsets A, B ⊆ X we define . . .
n Page 53, Line 20:
1: function Linkage-based Clustering (X, ρ, D, δ)
2: k ← #D
3: for i ← 1 to k do
4: Ci ← {xi }
5: while mini6=j ρ(Ci , Cj ) 6 δ and k > 2 do
6: m←0
7: (i∗ , j ∗ ) ← argmini6=j ρ(Ci , Cj )
8: for ` ← 1 to k − 1 do
9: if ` = min(i∗ , j ∗ ) then
10: C` ← Ci∗ ∪ Cj ∗
11: if ` = max(i∗ , j ∗ ) then
12: m←1
13: C` ← C`+m
14: else
15: C` ← C`+m
16: k ← k−1
17: return C1 , . . . , Ck
n Page 54, Line −6: K : Ck → R
n Page 56, Line −16: The following pseudocode approximates a minimizer of the k-means
cost function.
n Page 56, Pseudocode:
1: function k-means (D, k, X, ρ)
2: µ1 , . . . , µk ← pairwise different points from X
3: for i ← 1 to k do
4: Ci ← {x ∈ D | i ∈ argminj=1,...,k ρ(x, µj )}
5: U ← True
6: while U = True do
7: U ← False
8: for i ← 1 to k do
9: µi ← µ(Ci )
10: for i ← 1 to k do
11: Ci0 ← {x ∈ D | i ∈ argminj=1,...,k ρ(x, µj )}
12: if Ci0 6= Ci
13: Ci ← Ci0
14: U ←True
15: return C1 , . . . , Ck
In the lines 4 and 9 of the pseudocode we pick as a single i in the case that the armin is
not unique.
n Page 57, Line 7:
ρ(x, µ)2 ,
P P
µ(A) ∈ argmin respectively µ(A) ∈ argmin ρ(x, µ).
µ∈A x∈A µ∈X x∈A
n Page 57, Line −7: For j > 1 denote by (C1(j) , . . . , Ck(j) ) that clustering which the algorithm
produces in the j-th round. For j > 2 we have
k
ρ(x, µi )2
P P
K(C1(j) , . . . , Ck(j) ) = min
µ1 ,...,µk ∈X i=1 (j)
x∈C i
hx, M xi
maxn min 6 λk .
U ⊆R x∈U
x6=0
hx, xi
dim U =n−k+1
{i,j}∈E
λ2 (L) = min n .
x6=0
x2i di
X
hDx,1i=0
i=1
n Page 73, Line 9: . . . and our goal in the following will be to show λ2 > α2 /2, . . .
n Page 73, Line 12 (Equation (5.2)):
n
P
· · · and hDx, 1i = di xi = 0
i=1
Z
=: min N , . . . .
0.13 90.02
−→
→
−→
−
Darlene −
→
→ 0.68 90.11 h ih i
12.4 0.56 0.09 0.56 0.09 0.59
Ǎ = 0.15 0.59
9.5 90.12 0.69 90.12 0.69 0.02
0.41 90.07
Elena −→
0.07 0.73
Fatima −→
Gladys −→
0.55 90.09