Sequence Alignment (Chapter 6) : The Biological Problem
• Close evolutionary relationship => expect a high number of homologs
ctgactgtttgtggttc
• What about sequences that differ in length?
[Figure: Gene A (gA) is copied within organism A, giving gA and gA'; genes gB and gC are shown in Organism B and Organism C.]
S(WHAT/WH-Y) = 1 + 1 − δ − μ
(match = +1, gap = −δ, mismatch = −μ)

Alignment:
    WHAT
    ||
    WH-Y

Scoring matrix (cells on the alignment path):
          -     W     H     A      T
    -     0
    W           1
    H                 2     2−δ
    Y                              2−δ−μ

Global alignment score S_{3,4} = 2 − δ − μ
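The scoring above can be sketched in Python. This is a minimal illustration, not the course's reference code: the function names `score_alignment` and `global_score` are hypothetical, and the numeric defaults μ = 1 and δ = 2 are assumed values, since the slide keeps both penalties symbolic. The first function scores a given gapped alignment column by column; the second fills the matrix S with the standard Needleman-Wunsch recurrence and returns the bottom-right cell.

```python
def score_alignment(top, bottom, match=1.0, mu=1.0, delta=2.0):
    """Score two equal-length gapped strings column by column.

    mu (mismatch penalty) and delta (gap penalty) are assumed example values.
    """
    if len(top) != len(bottom):
        raise ValueError("aligned strings must have equal length")
    total = 0.0
    for x, y in zip(top, bottom):
        if x == "-" or y == "-":
            total -= delta      # column containing a gap
        elif x == y:
            total += match      # identical letters
        else:
            total -= mu         # mismatched letters
    return total


def global_score(a, b, match=1.0, mu=1.0, delta=2.0):
    """Global alignment score; S[i][j] is the best score for a[:i] vs b[:j]."""
    n, m = len(a), len(b)
    S = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        S[i][0] = -i * delta    # a[:i] aligned against gaps only
    for j in range(1, m + 1):
        S[0][j] = -j * delta    # b[:j] aligned against gaps only
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else -mu
            S[i][j] = max(S[i - 1][j - 1] + sub,   # align a[i-1] with b[j-1]
                          S[i - 1][j] - delta,     # a[i-1] against a gap
                          S[i][j - 1] - delta)     # b[j-1] against a gap
    return S[n][m]


# Rows follow WHY and columns follow WHAT, so the result is the cell S[3][4].
print(score_alignment("WHAT", "WH-Y"))   # 1 + 1 - delta - mu
print(global_score("WHY", "WHAT"))
```

With the assumed penalties both calls give 2 − δ − μ = −1, matching the score read off the matrix.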
Human bone morphogenic protein receptor type II precursor (left) has a 300 aa region that resembles a 291 aa region in TGF-β receptor (right). The shared function here is protein kinase.
[Figure, panel B: regions of similarity]
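[Figure: initialization of the local alignment matrix for a1 a2 a3 (rows 1 to 3) against b1 b2 b3 (columns); every entry in row 0 and in column 0 is set to 0.]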
• Best local alignment score = max_{i,j} M_{i,j}
• The best local alignment can be found by backtracking from the highest value in M

[Figure: rows of the example matrix M]
    G   0  0  1  1  0  1
    T   0  1  0  0  2  0
    A   0  0  0  0  0  0
    T   0  1  0  0  1  0
    C   0  0  0  0  0  0
    G   0  0  1  1  0  1
    T   0  1  0  0  2  0
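The two points about local alignment (take the maximum entry of M, then backtrack from it) can be sketched as follows. This is a Smith-Waterman style illustration under assumed parameters: the function name `local_alignment` is hypothetical, and match = 1, μ = 1, δ = 2 are example values not given on the slides.

```python
def local_alignment(a, b, match=1, mu=1, delta=2):
    """Return (best score, aligned piece of a, aligned piece of b).

    match, mu (mismatch penalty) and delta (gap penalty) are assumed values.
    """
    n, m = len(a), len(b)
    # Row 0 and column 0 stay at 0, the local-alignment initialization.
    M = [[0] * (m + 1) for _ in range(n + 1)]
    best, best_pos = 0, (0, 0)
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else -mu
            M[i][j] = max(0,                       # start a fresh alignment
                          M[i - 1][j - 1] + sub,   # extend along the diagonal
                          M[i - 1][j] - delta,     # gap in b
                          M[i][j - 1] - delta)     # gap in a
            if M[i][j] > best:
                best, best_pos = M[i][j], (i, j)   # track max over all i, j

    # Backtrack from the highest value until a cell holding 0 is reached.
    i, j = best_pos
    top, bottom = [], []
    while i > 0 and j > 0 and M[i][j] > 0:
        sub = match if a[i - 1] == b[j - 1] else -mu
        if M[i][j] == M[i - 1][j - 1] + sub:
            top.append(a[i - 1]); bottom.append(b[j - 1])
            i, j = i - 1, j - 1
        elif M[i][j] == M[i - 1][j] - delta:
            top.append(a[i - 1]); bottom.append("-")
            i -= 1
        else:
            top.append("-"); bottom.append(b[j - 1])
            j -= 1
    return best, "".join(reversed(top)), "".join(reversed(bottom))


print(local_alignment("TTGACT", "CCGACG"))   # → (3, 'GAC', 'GAC')
```

Clamping each cell at 0 is what lets the alignment start anywhere, and stopping the backtrack at the first 0 is what lets it end anywhere; dropping the `0` from the `max` and initializing the borders with multiples of −δ would turn this into a global alignment.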
        -  G  G  C  T  C  A  A  T  C  A
0  -    0  0  0  0  0  0  0  0  0  0  0
1  A    0
2  C    0
3  C    0
4  T    0
5  A    0
6  A    0
7  G    0
8  G    0
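The initialized matrix for this example can be built and printed with a few lines of Python. The helper names `init_local_matrix` and `format_matrix` are hypothetical, chosen only for this sketch; the sequences are the ones labelling the rows and columns above.

```python
def init_local_matrix(rows_seq, cols_seq):
    """Zero matrix with one extra row and column for the empty prefix '-'."""
    return [[0] * (len(cols_seq) + 1) for _ in range(len(rows_seq) + 1)]


def format_matrix(M, rows_seq, cols_seq):
    """Render M with row indices and sequence letters as labels."""
    lines = ["      " + "  ".join("-" + cols_seq)]   # column header
    for i, row in enumerate(M):
        label = "-" if i == 0 else rows_seq[i - 1]
        lines.append(f"{i:>2} {label}  " + "  ".join(str(v) for v in row))
    return "\n".join(lines)


rows_seq = "ACCTAAGG"      # letters down the side
cols_seq = "GGCTCAATCA"    # letters across the top
M = init_local_matrix(rows_seq, cols_seq)
print(format_matrix(M, rows_seq, cols_seq))
```

Each row is created as a separate list, so filling one cell during the later dynamic-programming pass does not alter any other row.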