Tutorial Note 9 (Week 11) Phylogenetic Tree
Tutorial Note 9 (Week 11) Phylogenetic Tree
Phylogenetic Tree
g:G
1 1
e:G f:G
1 1 1 1
Node Likelihood
g:G
a 0.97 (no mutation, a:G == e:G)
1 1
b 0.97 (no mutation, b:G == e:G)
c 0.01 (mutation, c:T != f:G) e:G f:G
d 0.97 (no mutation, d:G == f:G)
1 1 1 1
e (0.97 a:G)(0.97 b:G)(0.97 e:G == g:G) = 0.912673
f (0.97 d:G)(0.01 c:T)(0.97 f:G == g:G) = 0.009409 a:G b:G c:T d:G
g (0.25 g:G)(0.912673 e:G)(0.009409 f:G)
= 0.00214683506
A Particular
Assume the probability of having A, C, G and T are equal (0.25) for the sequence g. Case Probability
1 mutation 0.01
1 no change 0.97
Mutations of G include GA, GC, GT
Thus, 3*P(Mutation) + P(No Change) = 1
e:? f:?
1 1 2 2
2 mutations (0.1)(0.1)
2 no changes (0.7)(0.7)
Likelihood = (0.25)(0.0868)(0.0371) +(0.25)(0.0868)(0.0371)
+(0.25)(0.2596)(0.0717) +(0.25)(0.0868)(0.0717) = 0.0078
A B C D
A 0 11 4 11
B 11 0 13 4
C 4 13 0 13
D 11 4 13 0
A B C D 2 2 2 2 2 2
A C B D A C B D
4 4
Newick:
((A:1,C:3):4,(B:2,D:2):4); 1
A 2 2
3 B D
C
CSCI3220 Algorithms for Bioinformatics Tutorial Notes 12
UPGMA – Ultrametric distances
1. d(x, y) 0 1. d(x, y) + d(y, z) d(x, z)
2. d(x, y) = 0 if x = y 2. d(x, y) max{d(x, z), d(y,
3. d(x, y) = d(y, x) z)}
All leaf nodes have equal distance from root(1) Satisfy all
additive
Branch lengths represents sequences distance (2) Satisfy 1-4 only
Branch lengths represents cluster distance only (3) Satisfy 1-3 only non-
additive
d A B C d A B C d A B C
A 0 10 20 A 0 13 19 A 0 4 13
B 10 0 20 B 13 0 22 B 4 0 19
C 20 20 0 C 19 22 0 C 13 19 0
Q i , j r 2 d Ci , C j u C i u C j u C x d (C , C
y
x y)
B 14 0 31 19 64 B -112 -94
D C C 37 31 0 42 110 C -94
D 7 19 42 0 68 D
7 58 68
1 B d AD B C u Q AD B C C AC AD 1
A 2 2 4 2
AD AD 0 13 36 49 AD -80 -80 7 68 58
C D C AD 6
2 2 4 2
6 B 13 0 31 44 B -80
D 13 49 44
C C AD C ABD 9
C 36 31 0 67 C 2 2 3 2
13 44 49
C B C ABD 4
2 2 3 2
A 1 4 B d ABD C A 1
9 ABD 4 B
AD ABD 0 27 AD 9 ABD
6 C 27 0 6 27
D C D C
C 37 31 0 42 110 C -94
D 7 19 42 0 68 D
C C
A 0 4 5 11 16 36 A 0 4 5 8
B 4 0 7 13 18 42 B 4 0 7 10
C 5 7 0 8 13 33 C 5 7 0 5
D 11 13 8 0 11 43 DE 8 10 5 0
E 16 18 13 11 0 58
d C A , C DE
d (C A , C D ) d (C A , C E ) d (C D , C E )
A E 2
8 11 16 11
2
B
DE D
6 8
3
B 4 0 7 10 21 B -24 -19
C 5 7 0 5 17 C -25
DE 8 10 5 0 18 DE
A
E
1 8
3 AB DE D
B 3 6
C 4 0 5 9 C -16 DE 4 0
DE 7 5 0 12 DE
A A
1 E 1 E
8 8
3 3 3
AB 3 B
AB
AB 4
B AB DE D
6 C DE D
C
3 3
1 1
C C