Lecture 7
Lecture 7
STRUCTURES
Comparing Protein Structures: Why?
1
Chain/Domain Library
Goals:
Predict structure from sequence
Predict function based on sequence
Predict function based on structure
2
Fig. 1. Examples of structural alignments obtained with MAMMOTH
(B) Structural alignment of 1pgb with 5tss_A. The score in this case is 6.29.
3
4
• Recognizing Structural Similarity
5
»Example: Superposition to minimize RMSD
• 1. Define measure of similarityRMSD = {Σ|x-xj|2)/N}1/2
• 2. Determine correspondence between residues of each protein
(e.g. by sequence alignment, or a guess)
• 3. Align centers of mass
• 4. Use matrix methods to solve for the rotation that gives minimal
RMSD (variety of methods available)
• 5. Evaluate the resulting number
• 6. Refine the alignment
• 7. iterate
Distance Matrix
-identify contact patterns of groups that are close together-compare
these for different structures-fast, insensitive to insertions-example:
Distance ALIgnment Tool (DALI)
6
Structural Classification of Proteins
Classification of structures
SCOP: http://scop.mrc-lmb.cam.ac.uk/scop/
(domains, good annotation)
CATH: http://www.biochem.ucl.ac.uk/bsm/cath/
CE: http://cl.sdsc.edu/ce.html
FSSP: http://www2.ebi.ac.uk/dali/fssp/
(chains, updated weekly)
HOMSTRAD: http://www-cryst.bioc.cam.ac.uk/~homstrad/
HSSP: http://swift.embl-heidelberg.de/hssp/
7
SCOP Hierarchy of Structures
947 superfamily
1557 family
12794 protein
8
Statistics from July 2005
Æ 945 FOLDS
Æ 1539 SUPERFAMILIES
Æ 2845 FAMILIES
Æ 70859 DOMAINS
9
10
11
12
13
14
15
16
Classification of Protein Structure: SCOP
http://scop.mrc-lmb.cam.ac.uk/scop/
http://scop.berkeley.edu/
17
Classification of Protein Structure: SCOP
SCOP is organized into 4 hierarchical layers:
(1) Classes:
18
Classification of Protein Structure: SCOP
http://www.biochem.ucl.ac.uk/bsm/cath/
19
Classification of Protein Structure: CATH
Mixed Alpha
Alpha Beta
Beta
C
Tim Barrel
Other Barrel
20
Classification of Protein Structure: CATH
21
22
The DALI Domain Dictionary
http://www.ebi.ac.uk/dali/domain/
23
Summary
• Classification is an important part of biology; protein structures are not
exempt
• While all structural biologists agree that proteins are usually a collection of
domains, there is no consensus on how to delineate the domains
24