0% found this document useful (0 votes)

22 views13 pages

Bioinformatics Lesson 05

This document discusses multiple sequence alignment (MSA). MSA can reveal subtle similarities between sequences that pairwise alignment cannot detect. Global MSA approaches use dynamic programming to find the optimal alignment that maximizes a score function, but this becomes computationally expensive with many sequences. Progressive and iterative methods are alternatives. MSA is useful for studying correspondence between related genes, predicting protein structure, creating profiles for protein families, and phylogenetic analysis.

Uploaded by

mahedi hasan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

22 views13 pages

Bioinformatics Lesson 05

Uploaded by

mahedi hasan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 13

Introduction to Bioinformatics

Lecture 7
Why Multiple Sequence Alignment?

• Up until now we have only

tried to align two sequences.
• What about more than two?
And what for?
• A faint similarity between two
sequences becomes significant
if present in many VTISCTGSSSNIG
• Multiple alignments can V T LT C T G S S S N I G
reveal subtle similarities that V T LS C S S S G F I F S
pairwise alignments do not V T LT C T V S G T S F D
reveal VTITCVVSDVSHE
V T LV C L I S D F Y P G
V T LV C L I S D F Y P G
V T LV C L VS D Y F P E
Multiple Sequence Alignment (msa)
VTISCTGSSSNIGAGNHVKWYQQLPG
VTISCTGTSSNIGSITVNWYQQLPG
LRLSCSSSGFIFSSYAMYWVRQAPG
LSLTCTVSGTSFDDYYSTWVRQPPG
PEVTCVVVDVSHEDPQVKFNWYVDG
ATLVCLISDFYPGAVTVAWKADS
ATLVCLISDFYPGAVTVAWKADS
AALGCLVKDYFPEPVTVSWNSG-
VSLTCLVKGFYPSDIAVEWESNG-

• Goal: Bring the greatest number of similar

characters into the same column of the alignment
• Similar to alignment of two sequences.
Multiple Sequence Alignment: Motivation
• Correspondence. Find out which parts “do the same thing”
– Similar genes are conserved across widely divergent species,
often performing similar functions
• Structure prediction
– Use knowledge of structure of one or more members of a
protein MSA to predict structure of other members
– Structure is more conserved than sequence
• Create “profiles” for protein families
– Allow us to search for other members of the family
• Genome assembly: Automated reconstruction of “contig”
maps of genomic fragments such as ESTs
• msa is the starting point for phylogenetic analysis
• msa often allows to detect weakly conserved regions which
pairwise alignment can’t
Multiple Sequence Alignment: Approaches
• Optimal Global Alignments -
– Generalization of Dynamic programming
– Find alignment that maximizes a score function
– Computationally expensive: Time grows as
product of sequence lengths
• Global Progressive Alignments - Match closely-
related sequences first using a guide tree
• Global Iterative Alignments - Multiple re-building
attempts to find best alignment
• Local alignments
– Profile analysis,
– Block analysis
– Patterns searching and/or Statistical methods
Global msa: Challenges
• Computationally Expensive
– If msa includes matches, mismatches and gaps and also
accounts the degree of variation then global msa can be
applied to only a few sequences
• Difficult to score
– Multiple comparison necessary in each column of the msa for
a cumulative score
– Placement of gaps and scoring of substitution is more difficult
• Difficulty increases with diversity
– Relatively easy for a set of closely related sequences
– Identifying the correct ancestry relationships for a set of
distantly related sequences is more challenging
– Even difficult if some members are more alike compared
to others
Global msa: Dynamic Programming
• The two-sequence alignment algorithm (Needleman-
Wunsch) can be generalized to any number of
sequences.
• E.g., for three sequences X, Y, W
define C[i,j,k] = score of optimum
alignment
among
X[1..i], Y[1..j], W[1..k]
• As for two sequences, divide possible alignments
into different classes, depending on how they end.
– Devise recurrence relations for C[i,j,k]
– C[i,j,k] is the maximum out of all possibilities
msa for 3 sequences: alignment can end in 7 ways

X1 . . . Xi
Xi-1 Xi Yj
Y1 . . . Yj Wk
W1 . . . Yj-1 Wk -
Yj
Wk-1
Wk
Xi
-
Xi Wk
- Xi
- - Yj
Yj
- - -
- Wk
Aligning Three Sequences
V
V

W
W

2-D edit graph

• Same strategy as
aligning two sequences
• Use a 3-D “Manhattan
X
Cube”, with each axis
representing a sequence 3-D edit graph
to align
Dynamic programming for 3 sequences

Each alignment is a path through the

dynamic programming matrix

S
A
V S N —S
A —S N A —
———A
N
S
S
Start V S N S
2-D cell versus 2-D Alignment Cell
C(i-1,j-1,k-1) C(i-1,j,k- C (i-1,j-1) C (i-1,j)
1)
C (i-1,j,k)
C(i-1,j-1,k)

C (i,j-1)

In 2-D, 3 edges
in each unit
C(i,j-1,k-1)
square
C(i,j,k-1)

In 3-D, 7 edges
in each unit cube
C(i,j-1,k) C(i,j,k)

Enumerate all possibilities and choose the best one

Multiple Alignment: Dynamic Programming

si-1,j-1,k-1 + (vi, wj, uk) cube diagonal:

no in/dels
si-1,j-1,k +  (vi, wj, _ )
• si,j,k = max
si-1,j,k-1 +  (vi , _, uk) face diagonal:
si,j-1,k-1 +  (_, wj, uk) one in/del
+  (vi, _ , _)
si-1,j,k
+  (_, wj, _) edge diagonal:
two in/dels
si,j-1,k +  (_, _, uk)
si,j,k-1
• (x, y, z) is an entry in the 3-D scoring
matrix
• Reading Materials
– Chapter 5: Bioinformatics Sequence and Genome
analysis – David W. Mount
• 2nd Edition: Page 170~194
• 1st Edition: Page 140~165
– Cédric Notredame, Desmond G. Higgins and Jaap Heringa “T-
coffee: a novel method for fast and accurate multiple
sequence alignment”, Journal of Molecular Biology, Volume
302, Issue 1, 8 September 2000, Pages 205-217

– Christopher Lee, Catherine Grasso and Mark F. Sharlow,

“Multiple sequence alignment using partial order graphs”
Bioinformatics Vol. 18 no. 3 2002, Pages 452-464

– Cédric Notredame and Desmond G. Higgins “SAGA: sequence

alignment by genetic algorithm”, Nucleic Acids Res. 1996 Apr
15;24(8):1515-24.

Multiple Sequence Alignment
No ratings yet
Multiple Sequence Alignment
19 pages
Second - Done - w15 - 16 - A - Multiple Sequence Alignment
No ratings yet
Second - Done - w15 - 16 - A - Multiple Sequence Alignment
36 pages
Bioinformatic Material
No ratings yet
Bioinformatic Material
26 pages
Multiple Alignment
No ratings yet
Multiple Alignment
28 pages
Unit 3 Sequence Alignment and Phylogenetic Tree
No ratings yet
Unit 3 Sequence Alignment and Phylogenetic Tree
70 pages
Computational Biology (3) Alignment Algorithms: by Dr. Safynaz Abdel-Fattah Computer Science Department
No ratings yet
Computational Biology (3) Alignment Algorithms: by Dr. Safynaz Abdel-Fattah Computer Science Department
107 pages
L8 Msa
No ratings yet
L8 Msa
52 pages
Lecture 4
No ratings yet
Lecture 4
21 pages
Msa Notes
No ratings yet
Msa Notes
10 pages
Multiple Sequence Alignment: Some Slides From Cuong Dang and Others
No ratings yet
Multiple Sequence Alignment: Some Slides From Cuong Dang and Others
27 pages
Multiple Sequence Alignment
No ratings yet
Multiple Sequence Alignment
89 pages
Multiple Sequence Alignment
No ratings yet
Multiple Sequence Alignment
18 pages
Multiple Sequence Alignment Part 1
No ratings yet
Multiple Sequence Alignment Part 1
64 pages
5 Sequence Alignment
No ratings yet
5 Sequence Alignment
21 pages
Lec4 - Multiple Sequence Alignment
No ratings yet
Lec4 - Multiple Sequence Alignment
22 pages
Align 2
No ratings yet
Align 2
29 pages
Introduction-To-Computational Biology
No ratings yet
Introduction-To-Computational Biology
61 pages
Lec7 - Multiple Sequence Alignment
No ratings yet
Lec7 - Multiple Sequence Alignment
22 pages
Module 3 CSE3069 (Bioinformatics)
No ratings yet
Module 3 CSE3069 (Bioinformatics)
57 pages
BLAST (Basic Local Alignment Search Tool)
100% (1)
BLAST (Basic Local Alignment Search Tool)
23 pages
Sequence Alignment
No ratings yet
Sequence Alignment
24 pages
Chapter 6 Multiple Sequence Alignment 2022 Bioinformatics For Everyone
No ratings yet
Chapter 6 Multiple Sequence Alignment 2022 Bioinformatics For Everyone
7 pages
Multiple Sequence Alignment (MSA)
No ratings yet
Multiple Sequence Alignment (MSA)
78 pages
04-Alinemiento Múltiple de Secuencias
No ratings yet
04-Alinemiento Múltiple de Secuencias
14 pages
Lecture 3
No ratings yet
Lecture 3
39 pages
W03 Pairwise
No ratings yet
W03 Pairwise
55 pages
Sequence Alignment
No ratings yet
Sequence Alignment
25 pages
MultipleSequenceAlignment 2021 PDF
No ratings yet
MultipleSequenceAlignment 2021 PDF
5 pages
Importance and Significance of Sequence Alignment - pptx12
No ratings yet
Importance and Significance of Sequence Alignment - pptx12
15 pages
Lecture 6
No ratings yet
Lecture 6
31 pages
L3.4 Alignment
No ratings yet
L3.4 Alignment
90 pages
Sequence Allignment
No ratings yet
Sequence Allignment
5 pages
Sequence Alignment Methods
No ratings yet
Sequence Alignment Methods
32 pages
Sequence Alignment
No ratings yet
Sequence Alignment
9 pages
Note 7 - Group 7 Scribbing
No ratings yet
Note 7 - Group 7 Scribbing
7 pages
Chap 03 BioInfo
No ratings yet
Chap 03 BioInfo
15 pages
Notes Bioinformatics
No ratings yet
Notes Bioinformatics
14 pages
Sequence Alignment Presentation
No ratings yet
Sequence Alignment Presentation
27 pages
Alignment Methods
No ratings yet
Alignment Methods
33 pages
Multiple Alignment PDF
No ratings yet
Multiple Alignment PDF
45 pages
Unit 3 Bioinformatics
No ratings yet
Unit 3 Bioinformatics
11 pages
Lecture 5: Multiple Sequence Alignment: Introduction To Computational Biology
No ratings yet
Lecture 5: Multiple Sequence Alignment: Introduction To Computational Biology
34 pages
Dr. Zoya Khalid Zoya - Khalid@nu - Edu.pk
No ratings yet
Dr. Zoya Khalid Zoya - Khalid@nu - Edu.pk
51 pages
Multiple Sequence Alignment 3
No ratings yet
Multiple Sequence Alignment 3
22 pages
Sequence Analysis - Pairwise Alignment
No ratings yet
Sequence Analysis - Pairwise Alignment
26 pages
Bio Medical Tics - Sequence Analysis - Alignment - 2011
No ratings yet
Bio Medical Tics - Sequence Analysis - Alignment - 2011
96 pages
Msa
No ratings yet
Msa
28 pages
Sequence Alignment Methods and Algorithms
75% (4)
Sequence Alignment Methods and Algorithms
37 pages
Sequence Alignment Methods and Algorithms
No ratings yet
Sequence Alignment Methods and Algorithms
37 pages
Multiple Sequence Alignment
No ratings yet
Multiple Sequence Alignment
7 pages
Sequence Analysis in Bioinformatics
No ratings yet
Sequence Analysis in Bioinformatics
18 pages
Sequence Alignments: Felix Sappelt Irina Wagner
100% (1)
Sequence Alignments: Felix Sappelt Irina Wagner
34 pages
Multiple Seq Alignment
No ratings yet
Multiple Seq Alignment
36 pages
Bioinformatics: Sequence Alignment Methods
No ratings yet
Bioinformatics: Sequence Alignment Methods
32 pages
Multiple Sequence Alignment Black and White
No ratings yet
Multiple Sequence Alignment Black and White
2 pages
Data Mining-Mining Sequence Patterns in Biological Data
No ratings yet
Data Mining-Mining Sequence Patterns in Biological Data
6 pages
Analysis of Incidence Rates - 1st Edition Full Version Download
100% (10)
Analysis of Incidence Rates - 1st Edition Full Version Download
15 pages
Forecasting
No ratings yet
Forecasting
30 pages
Regression Models As A Tool in Medical Research - 1st Edition No-Wait Download
100% (19)
Regression Models As A Tool in Medical Research - 1st Edition No-Wait Download
15 pages
(Ebook) Econometrics by Example by Damodar Gujarati ISBN 9781137375018, 1137375019 Download
No ratings yet
(Ebook) Econometrics by Example by Damodar Gujarati ISBN 9781137375018, 1137375019 Download
43 pages
CahanBaiNg-Factor-Based Imputation of Missing Values and Covariances in Panel Data of Large Dimensions
No ratings yet
CahanBaiNg-Factor-Based Imputation of Missing Values and Covariances in Panel Data of Large Dimensions
34 pages
MBS 7e PPT 15
No ratings yet
MBS 7e PPT 15
51 pages
Logistic Regression in Machine Learning
No ratings yet
Logistic Regression in Machine Learning
7 pages
Levine Bsfc7ge Ch12 1
No ratings yet
Levine Bsfc7ge Ch12 1
93 pages
Econometrics by Abdul Waheed
No ratings yet
Econometrics by Abdul Waheed
21 pages
Hasil Uji Daya Beda Aitem Dan Reliabilitas SPSS Nasywa
No ratings yet
Hasil Uji Daya Beda Aitem Dan Reliabilitas SPSS Nasywa
3 pages
Managerial Economic
No ratings yet
Managerial Economic
9 pages
MPhil Econometrics Question Final Exam 2022
No ratings yet
MPhil Econometrics Question Final Exam 2022
2 pages
Ecta - Higher Order Properties of GMM and Generalized - 2004
No ratings yet
Ecta - Higher Order Properties of GMM and Generalized - 2004
37 pages
Limited Dependent Variable Models Example
No ratings yet
Limited Dependent Variable Models Example
5 pages
Linear Regression
No ratings yet
Linear Regression
4 pages
09.the Gauss-Markov Theorem and BLUE OLS Coefficient Estimates
No ratings yet
09.the Gauss-Markov Theorem and BLUE OLS Coefficient Estimates
10 pages
Probability and Statistics ch7
No ratings yet
Probability and Statistics ch7
19 pages
Machine Learning Model Evaluation
No ratings yet
Machine Learning Model Evaluation
11 pages
Forecasting - Solutions
No ratings yet
Forecasting - Solutions
46 pages
Presentación Modelo 3
No ratings yet
Presentación Modelo 3
8 pages
Polygenic Scoring Accuracy Varies Across The Genetic Ancestry Continuum
No ratings yet
Polygenic Scoring Accuracy Varies Across The Genetic Ancestry Continuum
25 pages
Population Math Practice
No ratings yet
Population Math Practice
2 pages
Population Evolution, Genetic Drift, Hardy-Weinberg Webquest
No ratings yet
Population Evolution, Genetic Drift, Hardy-Weinberg Webquest
8 pages
ANNEXURE (M&A Report)
No ratings yet
ANNEXURE (M&A Report)
15 pages
DH 301 - Basic Epidemiology - Department Academic Mentorship Program
No ratings yet
DH 301 - Basic Epidemiology - Department Academic Mentorship Program
5 pages
Act 1 PIRC
No ratings yet
Act 1 PIRC
2 pages
SMS 3355 Design Analysis of Sample Surveys
No ratings yet
SMS 3355 Design Analysis of Sample Surveys
3 pages
Midterm Exam 1 - Specimen Paper - v3
No ratings yet
Midterm Exam 1 - Specimen Paper - v3
4 pages
Pr. 12 Regression
No ratings yet
Pr. 12 Regression
4 pages
EOS Van - Der - Waals - Equation Nov 2 2011
No ratings yet
EOS Van - Der - Waals - Equation Nov 2 2011
7 pages
Even Distribution and Spherical Ball-Packing
From Everand
Even Distribution and Spherical Ball-Packing
Ying-chien Chang
No ratings yet
Useful Formulae: Mathematical & Physical
From Everand
Useful Formulae: Mathematical & Physical
Matthew Watkins
No ratings yet
Shortcuts to College Calculus Refreshment Kit
From Everand
Shortcuts to College Calculus Refreshment Kit
Juan Acevedo
No ratings yet
Mathematical Functions
From Everand
Mathematical Functions
Oliver Linton
No ratings yet

Bioinformatics Lesson 05

Uploaded by

Bioinformatics Lesson 05

Uploaded by

Introduction to Bioinformatics

• Up until now we have only

• Goal: Bring the greatest number of similar

2-D edit graph

Each alignment is a path through the

Enumerate all possibilities and choose the best one

si-1,j-1,k-1 + (vi, wj, uk) cube diagonal:

– Christopher Lee, Catherine Grasso and Mark F. Sharlow,

– Cédric Notredame and Desmond G. Higgins “SAGA: sequence

You might also like