
Bioinformatics Module 2 Notes

Uploaded by

crizjames1096


5/19/2024

• The PAM matrices for amino acids, along with the single letter
abbreviations used for genetically encoded amino acids, were
developed by Margaret Dayhoff.

PAM
🞂 A point accepted mutation (PAM), also called a percent
accepted mutation, is the replacement of a single amino
acid in the primary structure of a protein with another
single amino acid, which is accepted by the processes of
natural selection.
🞂 These mutations were identified by comparing highly
similar sequences with at least 85% identity.
🞂 A PAM matrix is a matrix where each column and row
represents one of the twenty standard amino acids.


• PAM also defines a unit of evolutionary time: 1 PAM is the time in which,
on average, 1 in 100 amino acids (1%) is expected to undergo an accepted mutation.
• The PAM1 probability matrix gives the probability that the amino acid
at row i is replaced by the amino acid at column j over 1 PAM.
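
The two properties above (each row is a probability distribution, and 1% of positions are expected to change) can be checked numerically. The sketch below uses a made-up 3-letter alphabet with invented frequencies and probabilities, not Dayhoff's real 20x20 data; the names `freq`, `pam1`, `row_sums` and `expected_change` are illustrative only.

```python
# Toy 3-letter alphabet; every number here is invented for illustration.
freq = {"A": 0.5, "B": 0.3, "C": 0.2}  # background frequencies, sum to 1

# pam1[i][j]: probability that letter i is replaced by letter j in 1 PAM.
pam1 = {
    "A": {"A": 0.9924, "B": 0.0048, "C": 0.0028},
    "B": {"A": 0.0080, "B": 0.9880, "C": 0.0040},
    "C": {"A": 0.0070, "B": 0.0060, "C": 0.9870},
}


def row_sums(matrix):
    """Each row should sum to 1: a letter is either kept or replaced."""
    return {i: sum(row.values()) for i, row in matrix.items()}


def expected_change(matrix, freq):
    """Average fraction of positions that change: sum_i f_i * (1 - M_ii)."""
    return sum(freq[i] * (1.0 - matrix[i][i]) for i in matrix)


print(row_sums(pam1))               # every value should be close to 1
print(expected_change(pam1, freq))  # should be close to 0.01, i.e. 1 PAM
```

The off-diagonal entries are small because at 1 PAM almost every position is still unchanged.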


• PAM250 probability matrix, describing the replacement probabilities
given 250 PAM units of time

PAM
🞂 Each entry indicates the likelihood of the amino acid
of that row being replaced with the amino acid of
that column through a series of one or more point
accepted mutations during a specified evolutionary
interval, rather than these two amino acids being
aligned due to chance.
🞂 Different PAM matrices correspond to different
lengths of time in the evolution of the protein
sequence.


PAM Matrices
🞂 PAM matrices are amino acid substitution matrices that
encode the expected evolutionary change at the amino
acid level.
🞂 Each PAM matrix is designed to compare two sequences
which are a specific number of PAM units apart.
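🞂 One PAM unit is defined as the amount of evolution in which, on
average, 1% of the amino acid positions have changed.
🞂 Two sequences S1 and S2 are at an evolutionary distance of 1
PAM unit if S1 has been converted to S2 with an average of one
accepted amino acid substitution per 100 amino acids.
🞂 250 PAM = 250 mutations per 100 amino acids, i.e. an average of 2.5
accepted mutations per amino acid. Because the same position can
mutate more than once (or mutate back), the two sequences still
share some identical positions.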

PAM Matrices
🞂 When used for protein comparison, each mutation
probability is divided by the background frequency of
the replacement amino acid (giving an odds ratio) and the
logarithm is taken. (Taking logs lets us add the scores along
a protein instead of multiplying the probabilities.)
The resulting matrix is the "log-odds" matrix, known
as the PAM matrix.
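
A minimal sketch of the log-odds step, again on an invented 3-letter alphabet. The scaling `10 * log10(...)` with rounding to integers follows the convention commonly quoted for Dayhoff's matrices and should be read as an assumption here; `prob`, `freq`, `log_odds` and `alignment_score` are hypothetical names.

```python
import math

# Invented background frequencies and replacement probabilities
# (row i -> column j) for some fixed evolutionary distance.
freq = {"A": 0.5, "B": 0.3, "C": 0.2}
prob = {
    "A": {"A": 0.70, "B": 0.18, "C": 0.12},
    "B": {"A": 0.30, "B": 0.60, "C": 0.10},
    "C": {"A": 0.30, "B": 0.15, "C": 0.55},
}


def log_odds(prob, freq, scale=10.0):
    """Score(i, j) = round(scale * log10(prob[i][j] / freq[j]))."""
    return {
        i: {j: round(scale * math.log10(prob[i][j] / freq[j])) for j in prob[i]}
        for i in prob
    }


def alignment_score(seq1, seq2, scores):
    """Add the per-position scores of two ungapped, equal-length sequences."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must have the same length")
    return sum(scores[a][b] for a, b in zip(seq1, seq2))


scores = log_odds(prob, freq)
print(scores)
print(alignment_score("ABCA", "ABBA", scores))
```

Positive scores mark pairs seen more often than chance predicts and negative scores mark pairs seen less often. Adding the scores corresponds to multiplying the underlying odds ratios.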


PAM Series
🞂 There is a whole series of matrices: PAM10 ... PAM250.
🞂 These matrices are extrapolated from the PAM1 matrix
by repeated matrix multiplication (PAMn is PAM1 raised
to the power n).
🞂 The PAM120 score matrix is designed to compare
sequences that are 120 PAM units apart:
the score it gives a pair of sequences is the (log of
the) probabilities of such sequences evolving during
120 PAM units of evolution.
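
The extrapolation by matrix multiplication can be sketched in plain Python. The 3x3 matrix and frequencies are invented toy values (rows and columns ordered A, B, C), so the numbers do not reproduce the real PAM250 matrix; `mat_mul`, `mat_pow` and `identity_fraction` are illustrative helper names.

```python
def mat_mul(x, y):
    """Multiply two square matrices given as lists of rows."""
    n = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def mat_pow(m, n):
    """Raise matrix m to the integer power n (n >= 1) by repeated squaring."""
    if n < 1:
        raise ValueError("n must be at least 1")
    result, base = None, m
    while n > 0:
        if n & 1:
            result = base if result is None else mat_mul(result, base)
        base = mat_mul(base, base)
        n >>= 1
    return result


def identity_fraction(matrix, freq):
    """Expected fraction of positions left unchanged: sum_i f_i * M_ii."""
    return sum(f * matrix[i][i] for i, f in enumerate(freq))


# Toy PAM1-like matrix (invented): row i -> column j.
freq = [0.5, 0.3, 0.2]
pam1 = [
    [0.9924, 0.0048, 0.0028],
    [0.0080, 0.9880, 0.0040],
    [0.0070, 0.0060, 0.9870],
]

pam250 = mat_pow(pam1, 250)
print(identity_fraction(pam1, freq))    # close to 0.99 at 1 PAM
print(identity_fraction(pam250, freq))  # much lower after 250 PAM
```

Each row of the extrapolated matrix still sums to 1, and the diagonal shrinks as the PAM distance grows. This is why higher PAM numbers suit more distant sequences.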

PAM Series
🞂 For any specific pair (Ai, Aj) of amino acids, the (i,j)
entry in the PAM n matrix reflects the frequency at
which Ai is expected to be replaced by Aj in two
sequences that have diverged by n PAM units. These
frequencies are estimated by gathering
statistics on replaced amino acids.


[Figure: PAM100 matrix]

Creation of a PAM matrix

1. Construct a multiple sequence alignment.
2. From the alignment, construct a phylogenetic tree.
3. For each amino acid type, calculate the frequency with
which it is substituted by every other amino acid (Fij).
4. Compute the relative mutability, mi, of each
amino acid.
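
Steps 3 and 4 can be sketched as follows. This is a simplified illustration, assuming the tree has already been reduced to pairs of aligned, ungapped sequences; Dayhoff's actual procedure works with inferred ancestral sequences. Mutability is taken here as "number of changes involving a residue divided by its number of occurrences". The sequences and the names `count_substitutions` and `mutability` are invented.

```python
from collections import Counter


def count_substitutions(pairs):
    """Return symmetric substitution counts F[(a, b)] and residue occurrences."""
    subs = Counter()
    occurrences = Counter()
    for s1, s2 in pairs:
        for a, b in zip(s1, s2):
            occurrences[a] += 1
            occurrences[b] += 1
            if a != b:
                # Direction is unknown, so count the change both ways.
                subs[(a, b)] += 1
                subs[(b, a)] += 1
    return subs, occurrences


def mutability(subs, occurrences):
    """m_i = (changes involving residue i) / (occurrences of residue i)."""
    changes = Counter()
    for (a, _b), n in subs.items():
        changes[a] += n
    return {a: changes[a] / occurrences[a] for a in occurrences}


# Invented aligned sequence pairs over a toy alphabet.
pairs = [("ABCA", "ABBA"), ("AACB", "ABCB")]
subs, occurrences = count_substitutions(pairs)
m = mutability(subs, occurrences)
print(dict(subs))
print(m)
```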


Problems with PAM

🞂 Not all positions are the same.
🞂 Evolutionary rates vary greatly within a sequence.
🞂 The environment changes over evolutionary time.
🞂 Ancestral relationships among sequences are difficult
to determine.

BLOSUM
🞂 BLOcks SUbstitution Matrix.
🞂 BLOSUM matrices were first introduced by Steven Henikoff and
Jorja Henikoff.
🞂 Only blocks of amino acid sequences with small
change between them are considered. These blocks
are called conserved blocks.
🞂 Based on local alignments.


BLOSUM
🞂 The Blocks database contains multiply aligned
ungapped segments corresponding to the most
highly conserved regions of proteins (local alignment
versus global alignment).
🞂 Blocks contains sequences at all different
evolutionary distances.

BLOSUM
🞂 In each alignment, the sequences whose percent identity exceeds
a threshold value are clustered into groups and averaged.
🞂 Different BLOSUM matrices differ in the % sequence identity
used in clustering.
🞂 Therefore, BLOSUM62 means that sequences more than 62% identical
were clustered, so the matrix reflects sequences with
approximately 62% identity or less.
🞂 The BLOSUM-matrix number corresponds to the clustering
threshold of the blocks from which the matrix is derived.
🞂 BLOSUM62 represents closer sequences than BLOSUM45.

Construction of BLOSUM
Step 0: Eliminating the sequences that are more than r%
identical
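
Step 0 can be illustrated with a simple greedy filter. This is only a sketch: it assumes ungapped sequences of equal length from one block, and it drops a sequence as soon as it is more than r% identical to one already kept. Clustering and averaging, as described earlier, is the more faithful procedure. The sequences and the names `percent_identity` and `filter_redundant` are invented.

```python
def percent_identity(s1, s2):
    """Percentage of positions at which two equal-length sequences agree."""
    if len(s1) != len(s2) or not s1:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(s1, s2))
    return 100.0 * matches / len(s1)


def filter_redundant(seqs, r):
    """Keep a sequence only if it is at most r% identical to every kept one."""
    kept = []
    for s in seqs:
        if all(percent_identity(s, k) <= r for k in kept):
            kept.append(s)
    return kept


# Invented block over a toy alphabet.
block = ["AABCA", "AABCB", "ABBCC", "CCBAA"]
print(filter_redundant(block, r=62))
```

With r = 62, the second sequence is removed because it is 80% identical to the first. The others are 60% identical or less to everything kept.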


Construction of BLOSUM
🞂 Step 3: Compute the observed frequency of each amino acid
pair.
◦ ABobs = 8/60
🞂 Step 4: Compute the expected frequency of each amino acid
pair.
◦ ABexp = (14/24 × 4/24) × 2
= 112/576
● The factor 2: since ancestral states are not known, both
substitutions AB and BA are considered equiprobable.
🞂 Step 5: Calculate the log-odds ratio.
◦ 2log2(ABobs/ABexp) = 2log2(O/E) = 2log2((8/60)/(112/576))
= -1.09

Construction of BLOSUM
Pair   Observed (O)   Expected (E)   2log2(O/E)
AA     26/60          196/576         0.70
AB     8/60           112/576        -1.09
AC     10/60          168/576        -1.61
BB     3/60           16/576          1.70
BC     6/60           48/576          0.53
CC     7/60           36/576          1.80
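
The table can be reproduced from the pair counts alone. The sketch below assumes the usual way of getting each letter's frequency from the pair frequencies (identical pairs count fully, mixed pairs contribute half to each letter), which yields the 14/24, 4/24 and 6/24 used above. The function name `blosum_scores` is illustrative.

```python
import math

# Pair counts implied by the observed column (out of 60 pairs).
pair_counts = {
    ("A", "A"): 26, ("A", "B"): 8, ("A", "C"): 10,
    ("B", "B"): 3, ("B", "C"): 6, ("C", "C"): 7,
}


def blosum_scores(pair_counts):
    """Return letter frequencies and 2*log2(observed/expected) per pair."""
    total = sum(pair_counts.values())
    observed = {pair: n / total for pair, n in pair_counts.items()}

    # Letter frequency: identical pairs count fully, mixed pairs count half.
    p = {}
    for (a, b), q in observed.items():
        if a == b:
            p[a] = p.get(a, 0.0) + q
        else:
            p[a] = p.get(a, 0.0) + q / 2
            p[b] = p.get(b, 0.0) + q / 2

    scores = {}
    for (a, b), q in observed.items():
        # Mixed pairs get the factor 2 because AB and BA are not distinguished.
        expected = p[a] * p[b] if a == b else 2 * p[a] * p[b]
        scores[(a, b)] = 2 * math.log2(q / expected)
    return p, scores


p, scores = blosum_scores(pair_counts)
for pair, s in scores.items():
    print("".join(pair), round(s, 2))
```

A pair that is never observed would need special handling, since the logarithm of zero is undefined. The toy data avoids that case.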


BLOSUM Matrices
🞂 No extrapolations are made in going to higher
evolutionary distances.
🞂 High number - closely related sequences
🞂 Low number - distant sequences.
🞂 BLOSUM62 is the most popular: best for general
alignment.

PAM VS BLOSUM
PAM                                             BLOSUM

PAM matrices are used to score alignments       BLOSUM matrices are used to score alignments
between closely related protein sequences.      between evolutionarily divergent protein
                                                sequences.

Based on global alignments.                     Based on local alignments.

Alignments have higher similarity than          Alignments have lower similarity than PAM
BLOSUM alignments.                              alignments.

Mutations in global alignments are very         Based on highly conserved stretches of
significant.                                    alignments.

Higher numbers in the PAM matrix name           Higher numbers in the BLOSUM matrix name
denote greater evolutionary distance.           denote higher sequence similarity and
                                                smaller evolutionary distance.

Useful at short evolutionary distances          At long evolutionary distances, for example
(PAM10 - PAM120).                               PAM250 or 20% identity, BLOSUM matrices are
                                                more effective.

Example: PAM250 is used for more distant        Example: BLOSUM80 is used for more closely
sequences than PAM120.                          related sequences than BLOSUM62.
