lecture1-2

Uploaded by

eshas283

Part 2: Sequence search and comparison
Some books

! D. Gusfield, Algorithms on Strings, Trees and Sequences: Computer Science and Computational Biology, Cambridge University Press, 1997
! E. Ohlebusch, Bioinformatics Algorithms, 2013, www.oldenbusch-verlag.de
! V. Mäkinen et al., Genome-Scale Algorithm Design, Cambridge University Press, 2015
Sequence alignment
Sequence comparison

! Sequence comparison: the most ubiquitous task in bioinformatics
─ genome analysis: gene prediction, phylogeny reconstruction, repeats, …
─ RNA analysis
─ protein analysis

! Main assumptions:
─ similar sequences correspond to similar biological functions
─ similar sequences witness phylogenetic proximity
─ similar sequences fold to similar structures

Example: insulin

[Figure: pairwise alignments of insulin sequences: elephant vs. hamster, elephant vs. whale, elephant vs. alligator]

Another example

[Figure; image from: http://www.ncbi.nlm.nih.gov/]

Sequence alignment

! Given two sequences RDISLVKNAGI and RNILVSDAKNVGI
! 3 types of columns corresponding to 3 elementary evolutionary events:
─ match
─ substitution (mismatch)
─ insertion, deletion (indel)
! Assign a score (positive or negative) to each event. Alignment score = sum of scores over all columns. Optimal alignment = one that maximizes the score
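The "sum of scores over all columns" rule can be sketched in a few lines of Python. The function name `alignment_score`, the "-" gap symbol and the default values (match 2, mismatch -1, indel -2) are illustrative assumptions, not something fixed by the slide.

```python
def alignment_score(row_s, row_t, match=2, mismatch=-1, indel=-2):
    """Sum the column scores of an alignment given as two gapped rows."""
    if len(row_s) != len(row_t):
        raise ValueError("both rows of an alignment must have the same length")
    total = 0
    for x, y in zip(row_s, row_t):
        if x == "-" and y == "-":
            raise ValueError("a column cannot consist of two gaps")
        if x == "-" or y == "-":
            total += indel      # insertion or deletion column
        elif x == y:
            total += match      # match column
        else:
            total += mismatch   # substitution column
    return total


# 6 matches, 1 mismatch, 1 indel: 6*2 - 1 - 2
print(alignment_score("ACGGCTAT", "ACTG-TAT"))  # 9
```

An optimal alignment is then any pair of gapped rows that maximizes this value over all possible ways of inserting gaps.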
Sequence alignment: scoring

! scoring function:

[Figure: example alignments with Score=19, Score=-11 and Score=25]
Sequence alignment: scoring

! BLOSUM62 matrix for protein sequences

LCS: Longest Common Subsequence

! consider score match: 1, indel: 0, mismatch: -1

-AGGCTCACCTGACT-CCAGGC-CGA--TGCC---
 || |||||  |||  |  ||  |||  ||||
TAG-CTCAC--GAC-GC--GG-TCGATTTGCCGAC

! optimal alignment ~ longest common subsequence (LCS)
! LCS(AGCGA,CAGATAGAG)=4
! Score(S,T)=LCS(S,T)
! d(S,T)=|S|+|T|-2·LCS(S,T): minimal number of indels required to transform S into T
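As a minimal sketch of the two quantities above, the following Python computes the LCS length by dynamic programming and derives the indel distance from it. The names `lcs_length` and `indel_distance` are hypothetical helpers chosen for this illustration.

```python
def lcs_length(s, t):
    """Length of a longest common subsequence of s and t (row-by-row DP)."""
    prev = [0] * (len(t) + 1)
    for x in s:
        cur = [0]
        for j, y in enumerate(t, start=1):
            if x == y:
                cur.append(prev[j - 1] + 1)            # extend by a match
            else:
                cur.append(max(prev[j], cur[j - 1]))   # skip a letter of s or of t
        prev = cur
    return prev[-1]


def indel_distance(s, t):
    """d(S,T) = |S| + |T| - 2*LCS(S,T): every letter outside the LCS is an indel."""
    return len(s) + len(t) - 2 * lcs_length(s, t)


print(lcs_length("AGCGA", "CAGATAGAG"))      # 4
print(indel_distance("AGCGA", "CAGATAGAG"))  # 5 + 9 - 2*4 = 6
```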
Levenshtein distance

! consider score match: 0, indel: -1, mismatch: -1

-AGGCTCACCTGACTCCAGGCCGA--TGCC---
 || |||||  ||| |  || |||  ||||
TAG-CTCAC--GACGC--GGTCGATTTGCCGAC

! optimal alignment ~ Levenshtein (edit) distance
! minimal number of indels and substitutions required to transform S into T
! edit(S,T) = -Score(S,T)
! edit(ACAGT,CCGA)=3

ACAGT
 | |
CC-GA
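The edit distance can be computed directly as a minimization, which is equivalent to maximizing the score with match 0 and indel/mismatch -1. A minimal sketch follows; the function name `edit_distance` is an assumption of this example.

```python
def edit_distance(s, t):
    """Levenshtein distance: minimal number of indels and substitutions."""
    prev = list(range(len(t) + 1))          # distance from the empty prefix of s
    for i, x in enumerate(s, start=1):
        cur = [i]                           # delete the first i letters of s
        for j, y in enumerate(t, start=1):
            cur.append(min(prev[j - 1] + (x != y),  # match (0) or substitution (1)
                           prev[j] + 1,             # deletion of x
                           cur[j - 1] + 1))         # insertion of y
        prev = cur
    return prev[-1]


print(edit_distance("ACAGT", "CCGA"))  # 3
```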
Bioinformatics: "CIGAR strings"

part of SAM format

RefPos:    1  2  3  4  5  6  7  8  9 10 11 12 13 14 15 16 17 18 19
Reference: C  C  A  T  A  C  T  G  A  A  C  T  G  A  C  T  A  A  C
Read:      A C T A G A A T G G C T

POS: 5
CIGAR: 3M1I3M1D5M
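To make the example concrete, here is a small sketch that parses a CIGAR string and expands it into the two rows of the alignment. The helper names `parse_cigar` and `expand_alignment` are hypothetical, and the sketch only handles the M, I, D, = and X operations (clipping, skipping and padding operations are deliberately left out).

```python
import re

CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")


def parse_cigar(cigar):
    """Split a CIGAR string into (length, operation) pairs."""
    ops = [(int(n), op) for n, op in CIGAR_OP.findall(cigar)]
    if "".join(f"{n}{op}" for n, op in ops) != cigar:
        raise ValueError(f"malformed CIGAR string: {cigar!r}")
    return ops


def expand_alignment(reference, read, pos, cigar):
    """Return (reference_row, read_row); pos is 1-based as in SAM."""
    r, q = pos - 1, 0
    ref_row, read_row = [], []
    for n, op in parse_cigar(cigar):
        if op in "M=X":            # aligned letters (match or mismatch)
            ref_row.append(reference[r:r + n])
            read_row.append(read[q:q + n])
            r, q = r + n, q + n
        elif op == "I":            # letters present only in the read
            ref_row.append("-" * n)
            read_row.append(read[q:q + n])
            q += n
        elif op == "D":            # letters present only in the reference
            ref_row.append(reference[r:r + n])
            read_row.append("-" * n)
            r += n
        else:
            raise ValueError(f"operation {op!r} is not handled in this sketch")
    return "".join(ref_row), "".join(read_row)


ref_row, read_row = expand_alignment("CCATACTGAACTGACTAAC", "ACTAGAATGGCT", 5, "3M1I3M1D5M")
print(ref_row)   # ACT-GAACTGACT
print(read_row)  # ACTAGAA-TGGCT
```

Note that M covers both matches and mismatches: the last block 5M aligns TGGCT of the read with TGACT of the reference.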
Computing Score(S,T)

! assume -d is the indel score (d>0 is the indel penalty), s(x,y) is the score of aligning x and y (match or mismatch), S[0..n-1] and T[0..m-1] are the input strings
! Idea: compute Score[i,j]: optimal score between S[0..i-1] and T[0..j-1]

Score[i,j] = max { Score[i-1,j-1] + s(S[i-1],T[j-1]),
                   Score[i-1,j] - d,
                   Score[i,j-1] - d }

! initialization: Score[0,0]=0, Score[0,j]=-jd, Score[i,0]=-id
! resulting score: Score[n,m]
! Dynamic Programming!
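The recurrence translates almost literally into code. The sketch below fills the whole (n+1)×(m+1) matrix; the function name `score_matrix` and the default scoring values (match 2, mismatch -1, d=2) are assumptions made for illustration.

```python
def score_matrix(s, t, match=2, mismatch=-1, d=2):
    """Fill Score[i][j] = optimal global score between s[:i] and t[:j]."""
    n, m = len(s), len(t)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = -i * d                 # s[:i] against gaps only
    for j in range(1, m + 1):
        score[0][j] = -j * d                 # t[:j] against gaps only
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if s[i - 1] == t[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,  # match / mismatch
                              score[i - 1][j] - d,        # s[i-1] against a gap
                              score[i][j - 1] - d)        # t[j-1] against a gap
    return score


table = score_matrix("ACTGTAT", "ACGGCTAT")
print(table[7][8])  # 9
```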
Example
s(x,x)=2, s(x,y)=-1 for x≠y, d=2 (each indel scores -d=-2)

The first row and the first column are initialized, then the table is filled row by row:

          A   C   G   G   C   T   A   T
      0   1   2   3   4   5   6   7   8
  0   0  -2  -4  -6  -8 -10 -12 -14 -16
A 1  -2   2   0  -2  -4  -6  -8 -10 -12
C 2  -4   0   4   2   0  -2  -4  -6  -8
T 3  -6  -2   2   3   1  -1   0  -2  -4
G 4  -8  -4   0   4   5   3   1  -1  -3
T 5 -10  -6  -2   2   3   4   5   3   1
A 6 -12  -8  -4   0   1   2   3   7   5
T 7 -14 -10  -6  -2  -1   0   4   5   9   ← Score(S,T)
How to recover the alignment?
s(x,x)=2, s(x,y)=-1 for x≠y, d=2

! trace back from Score[n,m] to Score[0,0], moving at each cell to a neighbour (diagonal, up or left) from which the maximum was obtained

ACGGCTAT
ACTG-TAT
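A minimal traceback sketch: the matrix is filled as before, then the path is followed backwards from Score[n,m]. The function name `global_alignment` and the tie-breaking order (diagonal first, then up, then left) are assumptions of this example; a different order may return a different but equally optimal alignment.

```python
def global_alignment(s, t, match=2, mismatch=-1, d=2):
    """Return (score, row_s, row_t) for one optimal global alignment."""
    n, m = len(s), len(t)

    def sub(x, y):
        return match if x == y else mismatch

    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = -i * d
    for j in range(1, m + 1):
        score[0][j] = -j * d
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(score[i - 1][j - 1] + sub(s[i - 1], t[j - 1]),
                              score[i - 1][j] - d,
                              score[i][j - 1] - d)

    # Traceback: at each cell, move to a neighbour that explains its value.
    i, j = n, m
    row_s, row_t = [], []
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub(s[i - 1], t[j - 1]):
            row_s.append(s[i - 1]); row_t.append(t[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] - d:
            row_s.append(s[i - 1]); row_t.append("-"); i -= 1
        else:
            row_s.append("-"); row_t.append(t[j - 1]); j -= 1
    return score[n][m], "".join(reversed(row_s)), "".join(reversed(row_t))


print(global_alignment("ACTGTAT", "ACGGCTAT"))  # (9, 'ACTG-TAT', 'ACGGCTAT')
```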
Alignment: graph formulation

[Figure: grid graph with columns labelled by ACGGCTAT and rows labelled by ACTGTAT. Edge weights: 2 for a match, -1 for a mismatch, -2 for an indel. An optimal alignment corresponds to a max-cost path through the grid; here the path has cost=9.]
Exercise

! Give all optimal alignments between ACCGTTG and CGAATGAA if the match score is 2, the mismatch score is -1 and the indel (gap) score is -2
Comments

! the algorithm is known as the Needleman-Wunsch algorithm (1970)
! note that the optimal alignment is generally not unique
! the problem considered is called global alignment
! both time and space complexity are O(n²)
! space complexity is O(n) if only the optimal score has to be computed (e.g. line by line, keeping two lines at a time)
! time can be reduced to O(n²/log² n) (assuming the RAM model) [Masek, Paterson 80] using the "Four Russians" technique (another solution in [Crochemore, Landau, Ziv-Ukelson 03])
! proved to be unlikely solvable in time O(n^(2-ε)) [Abboud, Williams, Weimann 14] (by reduction from 3SUM to some versions of the alignment problem)
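The "two lines at a time" remark can be sketched as follows. The function name `global_score_linear_space` and the default scores are assumptions; the swap of the two strings relies on the scoring being symmetric, which holds for the match/mismatch/indel scheme used here.

```python
def global_score_linear_space(s, t, match=2, mismatch=-1, d=2):
    """Optimal global score using only two rows of the DP matrix."""
    if len(t) > len(s):
        s, t = t, s                      # rows then have length min(n, m) + 1
    prev = [-j * d for j in range(len(t) + 1)]
    for i, x in enumerate(s, start=1):
        cur = [-i * d]
        for j, y in enumerate(t, start=1):
            sub = match if x == y else mismatch
            cur.append(max(prev[j - 1] + sub, prev[j] - d, cur[j - 1] - d))
        prev = cur                       # the older row is no longer needed
    return prev[-1]


print(global_score_linear_space("ACTGTAT", "ACGGCTAT"))  # 9
```

Only the score survives this way: the traceback needs cells that have already been discarded, which is the problem Hirschberg's technique addresses.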
Exercises (1)

! End-space free alignment of S and T
─ compute the best alignment of S and T such that spaces at string borders contribute 0

[Figure: T and S shifted against each other, illustrating a suffix-prefix overlap]
Exercises (2)

! Approximate occurrences of P in T
─ compute all alignments such that Score(P,T[i..j])>δ

[Figure: occurrences of P marked inside T]

! Particular cases
─ edit distance (<k): O(kn) [Landau&Vishkin 85, Galil&Park 89, …]
─ Hamming distance: O(n·log(m)) [Fischer&Paterson 73], O(nk) [Galil&Giancarlo 86], O(n·√(k·log k)) [Amir&Lewenstein&Porat 04], …
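For the edit-distance case, a simple baseline is the standard O(nm) dynamic program in which an occurrence may start anywhere in T at no cost. This is not the O(kn) method of the cited papers, only a sketch of the problem statement; the function name `approximate_occurrences` and the choice to report 0-based end positions with at most k errors are assumptions.

```python
def approximate_occurrences(p, t, k):
    """End positions j (0-based) such that some substring of t ending at j
    is within edit distance k of p."""
    col = list(range(len(p) + 1))        # column for the empty prefix of t
    ends = []
    for j, y in enumerate(t):
        new = [0]                        # an occurrence may start at any position of t
        for i, x in enumerate(p, start=1):
            new.append(min(col[i - 1] + (x != y),  # match or substitution
                           col[i] + 1,             # extra letter y in t
                           new[i - 1] + 1))        # letter x of p missing in t
        col = new
        if col[-1] <= k:
            ends.append(j)
    return ends


print(approximate_occurrences("ACT", "GGACTGG", 0))  # [4]
print(approximate_occurrences("ACT", "GGACTGG", 1))  # [3, 4, 5]
```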
Computing alignment in linear space

! Hirschberg (1975) proposed a nice trick to compute the optimal alignment in linear space (at the price of doubling the time)
! Key observation: some optimal alignment aligns S[0..n/2-1] with T[0..k*-1] and S[n/2..n-1] with T[k*..m-1], where

k* = argmax_k (Score[n/2, k] + ScoreR[n/2, m-k])

and ScoreR is the score matrix computed on the reversed strings
! compute Score[n/2,k] for all k
! compute ScoreR[n/2,m-k] for all k
! compute k* = argmax_k (Score[n/2,k] + ScoreR[n/2,m-k])
! recurse on the two subproblems defined by k*

[Figure: the matrix is cut at (n/2, k*) and the two resulting submatrices are cut recursively in the same way]
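The divide-and-conquer scheme can be sketched as below. The names `last_row` and `hirschberg`, the default scores and the closed-form base case for a single letter are assumptions of this illustration; the base case additionally assumes that a match scores at least as much as a mismatch.

```python
def last_row(s, t, match, mismatch, d):
    """Last row of the global score matrix of s vs t, in O(len(t)) memory."""
    prev = [-j * d for j in range(len(t) + 1)]
    for i, x in enumerate(s, start=1):
        cur = [-i * d]
        for j, y in enumerate(t, start=1):
            sub = match if x == y else mismatch
            cur.append(max(prev[j - 1] + sub, prev[j] - d, cur[j - 1] - d))
        prev = cur
    return prev


def hirschberg(s, t, match=2, mismatch=-1, d=2):
    """Return one optimal global alignment of s and t as two gapped rows."""
    n, m = len(s), len(t)
    if n == 0:
        return "-" * m, t
    if m == 0:
        return s, "-" * n
    if n == 1:
        # Pair the single letter with a matching letter of t if there is one
        # (otherwise with t[0]), unless two indels are cheaper.
        j = max(t.find(s), 0)
        sub = match if t[j] == s else mismatch
        if sub >= -2 * d:
            return "-" * j + s + "-" * (m - j - 1), t
        return s + "-" * m, "-" + t
    mid = n // 2
    # Score[mid, k] for all k, and ScoreR[n - mid, m - k] on reversed strings.
    forward = last_row(s[:mid], t, match, mismatch, d)
    backward = last_row(s[mid:][::-1], t[::-1], match, mismatch, d)
    k_star = max(range(m + 1), key=lambda k: forward[k] + backward[m - k])
    top_s, top_t = hirschberg(s[:mid], t[:k_star], match, mismatch, d)
    bot_s, bot_t = hirschberg(s[mid:], t[k_star:], match, mismatch, d)
    return top_s + bot_s, top_t + bot_t


row_s, row_t = hirschberg("ACTGTAT", "ACGGCTAT")
print(row_s)
print(row_t)
```

At any time only a constant number of rows of length m+1 are alive, and the recursion depth is logarithmic in n, so the working memory stays linear in the input size.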
Resulting complexity

! if the Score computation on a p×q matrix takes time c·pq, then computing the first "cut" takes 2·c·(n/2)·m = c·nm
! the first halving results in time c·(n/2)·k* + c·(n/2)·(m-k*) = 1/2·c·nm
! all recursive calls take time c·nm + 1/2·c·nm + 1/4·c·nm + … ≤ 2c·nm
Local alignment

! Biologists are mostly interested in local alignments that may ignore arbitrary prefixes and suffixes of input sequences
! Problem: compute all significant local alignments, i.e. all alignments with a score above a threshold
Smith-Waterman algorithm (1981)

! Assume matches are scored positively and mismatches/indels are scored negatively
! Score[i,j]: maximal score over all alignments of a suffix of S[0..i-1] with a suffix of T[0..j-1] (either may be empty)
! initialization: Score[0,j]=Score[i,0]=0

Score[i,j] = max { 0,
                   Score[i-1,j-1] + s(S[i-1],T[j-1]),
                   Score[i-1,j] - d,
                   Score[i,j-1] - d }
Smith-Waterman: example

EAWACQGKL vs ERDAWCQPGKWY
s(x,x)=1, s(x,y)=-3 for x≠y, d=1 (each indel scores -1)

resulting local alignment (score 4):

AWACQ-GK
AW-CQPGK
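A minimal sketch of the local alignment computation on this example: the matrix is filled with the extra 0 option, and the traceback starts at the best cell and stops at the first cell with value 0. The function name `smith_waterman` and the tie-breaking order in the traceback are assumptions; only one best-scoring local alignment is returned, not all alignments above a threshold.

```python
def smith_waterman(s, t, match=1, mismatch=-3, d=1):
    """Return (score, row_s, row_t) for one best-scoring local alignment."""
    n, m = len(s), len(t)

    def sub(x, y):
        return match if x == y else mismatch

    score = [[0] * (m + 1) for _ in range(n + 1)]
    best, best_pos = 0, (0, 0)
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(0,                                   # start a new alignment
                              score[i - 1][j - 1] + sub(s[i - 1], t[j - 1]),
                              score[i - 1][j] - d,
                              score[i][j - 1] - d)
            if score[i][j] > best:
                best, best_pos = score[i][j], (i, j)

    # Traceback from the best cell until a cell with value 0 is reached.
    i, j = best_pos
    row_s, row_t = [], []
    while score[i][j] > 0:
        if score[i][j] == score[i - 1][j - 1] + sub(s[i - 1], t[j - 1]):
            row_s.append(s[i - 1]); row_t.append(t[j - 1]); i -= 1; j -= 1
        elif score[i][j] == score[i - 1][j] - d:
            row_s.append(s[i - 1]); row_t.append("-"); i -= 1
        else:
            row_s.append("-"); row_t.append(t[j - 1]); j -= 1
    return best, "".join(reversed(row_s)), "".join(reversed(row_t))


print(smith_waterman("EAWACQGKL", "ERDAWCQPGKWY"))  # (4, 'AWACQ-GK', 'AW-CQPGK')
```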

Comments

! The choice of the score matrix is important
! The average value of the score matrix should be negative (otherwise local alignments tend to extend over the whole sequences)
! There exists a statistical model (Karlin&Altschul 90) that relates the score of a local alignment to the probability of this alignment appearing in random sequences (p-value)
More complex gap penalty systems

! Affine gap penalty: h+q·i for a gap of length i
─ h: gap opening penalty
─ q: gap extension penalty
─ O(mn) algorithm [Gotoh 82]
! Convex gap penalty
─ O(mn·log n)
! Arbitrary gap penalty
─ O(mn²+nm²)
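For the affine case, the usual O(mn) approach keeps three matrices, one per type of last column. The sketch below computes the global score only; the function name `affine_gap_score`, the matrix names M, X, Y and the default parameter values are assumptions chosen for illustration.

```python
NEG = float("-inf")


def affine_gap_score(s, t, match=2, mismatch=-1, h=3, q=1):
    """Optimal global score when a gap of length i costs h + q*i."""
    n, m = len(s), len(t)
    # M: last column aligns s[i-1] with t[j-1]
    # X: last column aligns s[i-1] with a gap
    # Y: last column aligns a gap with t[j-1]
    M = [[NEG] * (m + 1) for _ in range(n + 1)]
    X = [[NEG] * (m + 1) for _ in range(n + 1)]
    Y = [[NEG] * (m + 1) for _ in range(n + 1)]
    M[0][0] = 0
    for i in range(1, n + 1):
        X[i][0] = -h - q * i
    for j in range(1, m + 1):
        Y[0][j] = -h - q * j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if s[i - 1] == t[j - 1] else mismatch
            M[i][j] = max(M[i - 1][j - 1], X[i - 1][j - 1], Y[i - 1][j - 1]) + sub
            # Opening a gap pays h + q, extending an existing one pays only q.
            X[i][j] = max(M[i - 1][j] - h, Y[i - 1][j] - h, X[i - 1][j]) - q
            Y[i][j] = max(M[i][j - 1] - h, X[i][j - 1] - h, Y[i][j - 1]) - q
    return max(M[n][m], X[n][m], Y[n][m])


print(affine_gap_score("AAA", "A"))                       # 2 - (3 + 2*1) = -3
print(affine_gap_score("ACTGTAT", "ACGGCTAT", h=0, q=2))  # 9, same as the linear gap penalty
```

With h=0 the penalty is linear in the gap length and the result coincides with the basic global alignment score.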
