PSI Blast and Position Specific Scoring Matrix

PSI-BLAST uses position-specific scoring matrices (PSSMs) to iteratively search sequence databases and build multiple sequence alignments of protein families. A PSSM captures the frequency of amino acids occurring in each position of an alignment. PSI-BLAST starts with a BLAST search to generate an initial PSSM, then runs additional searches with the PSSM to recruit more distant homologs into the alignment. Through multiple iterations, PSI-BLAST can identify more members of a protein family than a single BLAST search by leveraging information about conserved residues in each position.

Uploaded by

filymascolo


PSI-BLAST AND POSITION-SPECIFIC SCORING MATRIX

A multiple alignment is useful for grouping sequences, provided they are actually related. Take the whole globin family: they bind a heme group and oxygen, and the alpha and beta subunits have features in common. Just by looking at the alignment we can spot conserved positions, or notice that a hydrophobic amino acid always occupies a given position, and so on. But what can't be spotted by eye? A rare amino acid may be an important signal. Many methods let us transform a multiple alignment into a description of the whole protein family. They also let us remove unnecessary information, since some of the signal may be masked by noise.
One way to extract these features is PSI-BLAST, which we have already studied. It uses PSSMs, position-specific scoring matrices; it is the PSSM version of BLAST. BLAST uses predefined matrices such as BLOSUM62, while PSI-BLAST uses these dynamic matrices. A PSSM does not score an amino acid according to how often it has been substituted, but according to the position in which it is found. I must know which protein I am considering, because position 3 of a globin is different from position 3 of another protein. I build a matrix recording how often each amino acid is represented at each position. At this stage it is not a score but a FREQUENCY. At one position A may appear 34 times and L 36 times, which is about the same, while at another position F may appear 78 times, which is significant. For each position I count the occurrences of each amino acid and divide by the number of aligned sequences.
In the example matrix, C at position 2 is a stronger signal than A at position 1, because C is present in 100 out of 100 sequences.
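
To make the counting step concrete, here is a minimal sketch in Python. The function name position_frequencies and the four-sequence toy alignment are invented for illustration (this is not real globin data), and the sketch assumes an ungapped alignment where every sequence has the same length.

```python
from collections import Counter

def position_frequencies(alignment):
    """Return one dict per column: amino acid -> fraction of sequences carrying it."""
    n_seqs = len(alignment)
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all aligned sequences must have the same length")
    columns = []
    for i in range(length):
        # Count each amino acid in column i, then divide by the number of sequences.
        counts = Counter(seq[i] for seq in alignment)
        columns.append({aa: c / n_seqs for aa, c in counts.items()})
    return columns

# Hypothetical toy alignment, for illustration only.
alignment = ["ACDF", "LCDF", "ACEF", "LCDW"]
freqs = position_frequencies(alignment)

for i, column in enumerate(freqs, start=1):
    print(i, column)
```

In this toy case position 1 is split evenly between A and L, while position 2 is C in every sequence, which is the kind of fully conserved column the notes call a strong signal.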

A weight matrix is not a PSSM. For example, if we compare a test sequence with the consensus we might say it does not belong to the group, but if we compare it with the matrix we would say it is part of the family. For each residue of the test sequence we can look up a frequency (a percentage): how many times that amino acid occurs in the family at that position. Log-odds score: S_{i,a} = log10(q_{i,a} / p_a), the ratio of observed to expected, where q_{i,a} is the observed frequency of amino acid a at position i and p_a is its expected frequency. The larger S_{i,a} is, the more over-represented that amino acid is at that position. PAM and BLOSUM are not so different, because they are also built from observed/expected frequency ratios; the difference is the shape of the matrix.
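
A small worked example of the log-odds formula, as a sketch only. The uniform background of 1/20 is an assumption made here to keep the arithmetic simple (real background frequencies differ by amino acid), and the function name log_odds_score is hypothetical.

```python
import math

def log_odds_score(q, p):
    """S = log10(q / p): q is the observed frequency at a position, p the expected one."""
    if q <= 0 or p <= 0:
        raise ValueError("frequencies must be positive to take a logarithm")
    return math.log10(q / p)

BACKGROUND = 0.05  # assumption: every amino acid is expected 1 time in 20

s_conserved = log_odds_score(1.00, BACKGROUND)  # observed in every sequence
s_neutral = log_odds_score(0.05, BACKGROUND)    # observed exactly as often as expected
s_rare = log_odds_score(0.01, BACKGROUND)       # observed less often than expected

print(round(s_conserved, 3), round(s_neutral, 3), round(s_rare, 3))
```

A positive score means the amino acid is seen more often than expected at that position, zero means no preference, and a negative score means it is avoided. An amino acid never observed in a column has q = 0, for which the logarithm is undefined; this is why practical tools add pseudocounts before taking the log.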
Given a PSSM, we can use it in place of BLOSUM, with some differences. The scores will not be the same: one matrix scores amino acid against amino acid, the other scores amino acid against position.
A PSSM is a matrix with the 20 amino acids on the Y axis and the position numbers on the X axis. For each amino acid of the query, I look up how much it scores at position n.
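
The lookup can be sketched as follows. The three-column PSSM, its scores, the default penalty and the function names are all made up for illustration; only four of the 20 amino acid rows are listed to keep the example short.

```python
# Hypothetical PSSM: rows are amino acids, columns are positions 1..3.
pssm = {
    "A": [0.9, -1.0, -0.5],
    "C": [-1.0, 1.3, -1.0],
    "F": [-0.5, -1.0, 1.1],
    "L": [0.8, -1.0, 0.2],
}
DEFAULT = -1.0  # assumption: penalty for amino acids not listed in the toy matrix

def score_window(window, pssm, default=DEFAULT):
    """Sum the score of each residue at the position where it sits."""
    total = 0.0
    for position, aa in enumerate(window):
        row = pssm.get(aa)
        total += row[position] if row is not None else default
    return total

def best_window(sequence, pssm, width):
    """Slide the PSSM along the sequence and return (start, score) of the best window."""
    starts = range(len(sequence) - width + 1)
    best = max(starts, key=lambda s: score_window(sequence[s:s + width], pssm))
    return best, score_window(sequence[best:best + width], pssm)

start, score = best_window("GGLCFGG", pssm, width=3)
print(start, round(score, 2))
```

Note that the same amino acid gets a different score depending on the column it falls in, which is exactly the amino acid vs position idea; a BLOSUM lookup would instead need two amino acids and no position.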
PSI-BLAST calculates a PSSM starting from the results of a normal BLAST search. I give it a standard, generic matrix. Using the BLAST algorithm, it then builds a PSSM from the hits found in a database of sequences, over several iterations. Each round produces a new PSSM, and PSI-BLAST runs that new matrix against the same starting database. I don't repeat this many, many times, not only because it is time-expensive but because it becomes useless: once the hits stop changing, the same database gives the same result. PSI-BLAST is not a good choice for searching a whole database: the larger the database, the less precise it is. It is best used within families. The idea is to converge on sequences that were not part of the starting set of hits. Every time I run PSI-BLAST the PSSM changes. At each cycle the number of aligned family members increases. We start from a query sequence and want to catch the whole family from a database. BLAST finds only a few members, but PSI-BLAST is more sensitive because, over several cycles, it finds a larger family or nearly the whole family. Maybe 7-8 iterations are enough. There is no need to repeat 20-30 times, because the results will have converged and we would only waste time.
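
The iterate-until-convergence idea can be illustrated with a toy loop. This is not the PSI-BLAST algorithm: it assumes ungapped sequences of equal length, a uniform background, a pseudocount of 1 and a fixed score threshold, whereas the real program starts from a standard matrix such as BLOSUM62 and selects hits by E-value. All names and sequences are hypothetical.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
BACKGROUND = 1 / len(AMINO_ACIDS)  # assumption: uniform background frequency

def build_pssm(members, pseudocount=1.0):
    """Log-odds PSSM (one dict per column) from equal-length, ungapped sequences."""
    pssm = []
    for i in range(len(members[0])):
        column = [seq[i] for seq in members]
        total = len(column) + pseudocount * len(AMINO_ACIDS)
        pssm.append({
            aa: math.log10(((column.count(aa) + pseudocount) / total) / BACKGROUND)
            for aa in AMINO_ACIDS
        })
    return pssm

def score(seq, pssm):
    """Sum of the position-specific scores of each residue of seq."""
    return sum(column[aa] for column, aa in zip(pssm, seq))

def iterate_search(query, database, threshold, max_iterations=8):
    """Rebuild the PSSM from the current hits until the set of hits stops changing."""
    members = {query}
    for iteration in range(1, max_iterations + 1):
        pssm = build_pssm(sorted(members))
        hits = {s for s in database if score(s, pssm) >= threshold} | {query}
        if hits == members:
            return members, iteration  # converged: another round would change nothing
        members = hits
    return members, max_iterations

# Hypothetical toy database: close relatives, distant relatives, and an unrelated sequence.
database = ["ACDEFA", "ACDKFG", "ACNKFG", "SCNKFA", "WWWWWW"]
family, rounds = iterate_search("ACDEFG", database, threshold=1.2)
print(sorted(family), rounds)
```

With these toy values the first round only picks up the two closest sequences; the PSSM rebuilt from them then recruits the more distant ones, and the next round finds nothing new, so the loop stops well before the cap of 8 iterations.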

PSCAN
Going forward, we want a strategy for grouping alignments. We want to put proteins in relation to one another, and for proteins this often means alignment. We want to find similarities and patterns in proteins: a specific amino acid followed by another, a group of 3 amino acids, and so on. PRINTS is a database containing motifs and small patterns, so we can check whether any of them is present in our proteins.
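
As a rough illustration of checking whether a small pattern is present in a set of proteins, here is a sketch using a regular expression. The pattern, the protein names and the sequences are invented, and this does not reproduce how PRINTS or its scanning tools actually score motifs; it only shows the idea of "a specific amino acid followed by another".

```python
import re

# Hypothetical pattern: C, any two residues, C, then one hydrophobic residue.
MOTIF = re.compile(r"C..C[AILMFVW]")

def find_motif(sequence, motif=MOTIF):
    """Return (start, matched_text) for each non-overlapping occurrence of the motif."""
    return [(m.start(), m.group()) for m in motif.finditer(sequence)]

# Hypothetical sequences, for illustration only.
proteins = {
    "protA": "MKCAACLTT",
    "protB": "MKTTTTAAA",
}
hits = {name: find_motif(seq) for name, seq in proteins.items()}
print(hits)
```

A regular expression either matches or it does not, so unlike a PSSM it cannot say that a sequence is a near miss.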
