SCOP and CATH Database

SCOP and CATH are secondary protein structure databases that provide hierarchical classifications of protein domains derived from protein structures in the PDB. SCOP classifies domains based on structural similarities and evolutionary relationships, while CATH classifies based on class, architecture, topology, and homology. Both databases aim to determine evolutionary relationships between proteins to study protein structure and function.

Uploaded by

Aishwarya Dharan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

100% found this document useful (5 votes)

5K views22 pages

SCOP and CATH Database

Uploaded by

Aishwarya Dharan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 22

SCOP AND CATH DATABASE

By Aishwarya Dharan
MSc. Life Science(Bioinformatics)
19mslsbf02
SCOP
Structural Classification Of Proteins

CATH
Class Architecture Topology
Homologous
SCOP AND CATH

• Secondary databases to study protein structure.

• Secondary databases contain information derived from primary databases.
Secondary databases store information such as conserved sequences, active
site residues, and signature sequences. Protein Databank data is stored in
secondary databases.
• The Structural Classification of Proteins (SCOP) database is free and a publicly available database, which
manually classifies protein structural domains based on similarities of their structures and amino acid sequences.
(http://scop2.mrc-lmb.cam.ac.uk/)
• The SCOP protein classification is essentially a manual process using visual inspection and comparison of
structures, some automation is used for the most routine tasks such as clustering protein chains on the basis of
sequence similarity.
• SCOP was created in 1994 in the Centre for Protein Engineering and the Laboratory of Molecular Biology. It was
maintained by Alexey G. Murzin and his colleagues in the Centre for Protein Engineering at Cambridge
University until its closure in 2010 and subsequently at the Laboratory of Molecular Biology in Cambridge,
England.
• The main motivation for this classification is to determine the evolutionary relationship between proteins.
• SCOP has been discontinued due to accelerating pace of protein structure publications, the limited automation of
classification could not keep up, leading to a non-comprehensive dataset. The last official version of SCOP is
1.75. SCOP1.75 is also known as SCOP2.
• SCOP2 offers two different ways for accessing data: SCOP2-browser, and SCOP2-graph.
SCOP2-browser allows navigation in a traditional way by browsing pages displaying the node
information. SCOP2-graph is a graph-based web tool for display and navigation.

• Since SCOP and SCOP2 are not up-to-date with the latest version of the PDB, an extended
version of SCOP, SCOPe, was recently established by the Chandonia group.

• Structural Classification of Proteins extended (SCOPe) database was released in 2012 with
far greater automation of the same hierarchical system and is full backwards compatible with
SCOP. In 2014, manual curation was reintroduced into SCOPe to maintain accurate structure
assignment. SCOPe 2.05 has classified 71,000 of the 110,000 total PDB entries. SCOPe also
corrects some errors in SCOP.
CLASSIFICATION OF SCOP
SCOP is organized into 4 hierarchical layers:
1. Class—It is the general structural architecture of the protein domains. Proteins are usually (but
not always) separated into domains, and most of these domains are classified into one of the first
five classes:
a) all-α：those whose structure is essentially formed by α-helices
b) all-β：those whose structure is essentially formed by β -sheets
c) α/β ： those with α-helices and β-strands
d) α+β：mainly antiparallel beta sheets (segregated alpha
and beta regions)
e) multi-domain：those with domains of different fold and
for which no homologues are known at present.
2. Fold—It represents similar arrangement of regular secondary
structures but without evidence of evolutionary relatedness.
• Includes different shapes of domains within a class, e.g.,
2 helices; antiparallel hairpin, left-handed twist, etc.
Source: https://www.ebi.ac.uk/training/online/sites/ebi.ac.uk.training.online/files/resize/user/511/documents/slide1_5-457x343.jpg
3. Superfamily—The domains in a fold are grouped into superfamilies, which have at least a distant
common ancestor.
4. Family—Domains belonging to the same family:
• share some sequence similarity.
• evolutionarily related.
• pairwise residue identities between them are 30% and greater.
5. Protein domain: The domains in families are grouped into protein domains, which are essentially the same protein.
6. Species: The domains in "protein domains" are grouped according to species.

Source: https://image1.slideserve.com/1868737/structural-classification-of-proteins-n.jpg
• The CATH Protein Structure Classification database (http://www.cathdb.info/ ) is a free,
publicly available online resource that provides hierarchical domain classification of protein
structures in the Protein Data Bank. Protein structures are classified using a combination of
automatic structural alignment program (SSAP) as well as manual comparison. Although the
protocol used is mostly automatic, manual inspection is used to check assignments at some
critical stages, such as the detection of very distantly related homologues and analogues and
the assignment of novel architectures.
• It was created in the mid-1990s by Professor Christine Orengo and colleagues including Janet
Thornton and David Jones, and continues to be developed by the Orengo group at University
College London.
• Experimentally-determined protein three-dimensional structures are obtained from the PDB
and split into their consecutive polypeptide chains, where applicable.
CLASSIFICATION OF CATH
• The four main levels of the CATH hierarchy are as follows:
1. Class: the overall secondary-structure content of the domain. e.g., all α, all β, α/β, α+β, α&β, etc.
2. Architecture: Structures are classified according to their overall shape as determined by the
orientations of the secondary structures in 3D space but ignores the connectivity between them.
3. Topology: consists of structures with the same number, arrangement and connectivity of secondary
structure based on structural superposition.
4. Homologous superfamily: Functional and structural similarities are determined by sequence
comparison and then by structure comparison using SSAP. Two structures are in the same homologous
superfamily if any of the following hold:
• Sequence identity > 35%
• SSAP score > 80 and sequence identity > 20%
• SSAP score > 80 and 60% of larger structure is equivalent to the smaller structure; the domains have
related functions
• To illustrate the types of domains that one can observe at the architecture level,
let us look at some of the mixed alpha–beta class. These are the following 10
entries at the architecture level:

ALPHA-BETA (αβ) 2-Layer Sandwich (αβ1)

3-Layer(aba) Sandwich (αβ2)
Alpha-Beta Barrel (αβ3)
Alpha-Beta Complex (αβ4)
Roll (αβ5)
MAINLY ALPHA (α) Orthogonal Bundle (α1)
Up-down Bundle (α2)
MAINLY BETA (β) Beta Barrel (β1)
Roll (β2)
Sandwich (β3)
Illustration of 10 different CATH architectures (A) 2-layer sandwich (αβ1). (B) 3-layer(αβα)
sandwich (αβ2). (C) alpha-beta barrel (αβ3). (D) alpha-beta complex (αβ4). (E) roll (αβ5). (F)
orthogonal bundle (α1). (G) up-down bundle (α2). (H) beta barrel (β1). (I) roll (β2). (J) sandwich
(β3)

Source:
https://www.researchgate.net/profile/Senthilnathan_Rajendran2/publication/325182475/figure/fig2/AS:628569227153409@1526873990276/Illustration
-of-10-different-CATH-architectures-subfolds-in-our-data-set-A-2-Layer.png
Source: https://images.slideplayer.com/25/7639286/slides/slide_3.jpg
APPLICATIONS OF SCOP
1) To study viral fold specificity - SCOP classification was used by Cheng and Brooks
to study fold diversity in viral capsid proteins. Cheng and Brooks concluded that
viral capsids evolved under distinct evolutionary constraints from non‐capsid
proteins, and may provide valuable templates for protein engineering.
2) Study evolution of oligomer geometries. In a study of evolution of different
oligomeric states by Perica, Chothia, and Teichmann, structures were collected from
10 SCOP families that have “at least one dimer and one homologous tetramer or
hexamer with the same dimeric binding mode.” The study detected locations of
mutations that were correlated with different oligomerization states and found that
“such indirect, or allosteric mutations affecting intersubunit geometry via indirect
mechanisms are as important as interface sequence changes for evolution of
oligomeric states.”
APPLICATIONS OF CATH
• There was one study in which the authors used information that was available only in CATH
and not in SCOP. In the study by Bukhari and Caetano‐Anollés, phylogenetic data were used
to study the emergence of different CATH domain architectures.
• The focus of the study was on the CATH architecture level, which does not have an analogous
level in SCOP.
• The study found ancient architectures such as the CATH 3‐layer (αβα) sandwich (3.40) or
the orthogonal bundle (1.10) are involved in basic cellular functions, but more recently evolved
architectures such as prism, propeller, 2‐solenoid, super‐roll, clam, trefoil, and box are not
widely distributed.
• That study also benchmarked the phylogenetic analysis of CATH domains compared with
SCOP domains, measuring the distribution of CATH architectures, topologies, and homologies,
and SCOP folds, superfamilies, and families in Bacteria, Eukarya, and Archaea
superkingdoms.
COMPARATIVE DISCREPANCIES
BETWEEN SCOP & CATH
CATH assigns more domains than SCOP, due to the
fact that CATH defines domains purely structurally,
whereas SCOP takes into account whether or not a
domain is observed as recurring in another
superfamily, or observed as a separate single-
domain fold.
(a) Structure of papain (1ppo), a cysteine proteinase from papaya, with catalytic histidine, asparagine
and cystine shown as ball-and-stick residues. SCOP classifies the structure as one domain (SCOP code:
4.3.1), leaving the catalytic cysteine, histidine, and asparagine together to form the active site, whereas
CATH splits the structure into two, as shown by blue (CATH code: 1.10.190.10) and yellow (3.10.160.10)
colouring, rendering each domain effectively functionless. After this study by Haldane & Jones, Papain is
now treated as a single domain in CATH.
Source: Fig. 4, https://doi.org/10.1016/S0969-2126(99)80177-4
COMPARATIVE DISCREPANCIES BETWEEN SCOP &
CATH

Examples of class assignment disagreements between CATH and SCOP. (a) SCOP ignores the
small helical elements in the haemagglutinin structure and classifies the domain as mainly β,
whereas CATH takes the helices into account and considers the structure αβ. (b) In case of
lysozyme superfamily (e.g. 1lys), CATH disregards the presence of small β strands and
considers the protein mainly α, whereas SCOP takes into account the functional and
evolutionary importance of these strands, and calls the lysozymes α/β.
Source: https://ars.els-cdn.com/content/image/1-s2.0-S0969212699801774-
gr6_lrg.jpg
DIFFERENCE BETWEEN SCOP & CATH

Fold (F)

Source: https://www.researchgate.net/profile/Syed_Abbas_Bukhari/publication/235993836/figure/fig1/AS:299889826254851@1448510714283/Hierarchy-of-
the-CATH-structural-classification-system-compared-to-corresponding-SCOP.png
DIFFERENCE BETWEEN SCOP & CATH

• In CATH, there is only one class to represent mixed

alpha-beta.
• In SCOP there are two:
• α/β: beta structure is largely parallel, made of β α β
motifs
• α + β : alpha and beta structure segregated to different
parts of structure
CONCLUSION

SCOP is a valuable resource for detailed

evolutionary information, and CATH is a valuable
source of geometric information.
REFERENCES
• Hadley C., Jones D. T. (1999). A systematic comparison of protein structure
classifications: SCOP, CATH and FSSP. Structure. 7:1099–1112.
https://doi.org/10.1016/S0969-2126(99)80177-4
• Burkowski F. J. (2008). Structural Bioinformatics: An algorithmic approach.
Florida, FL: CRC Press, Taylor & Francis Group.
• Murzin A. G., Brenner S. E., Hubbard T., Chothia C. (1995). SCOP: a
structural classification of proteins database for the investigation of
sequences and structures. J. Mol. Biol. 247, 536-540. [PDF]
THANK YOU

Bill Nye Simple Machines Video Worksheet
No ratings yet
Bill Nye Simple Machines Video Worksheet
1 page
Satyanarayan - Biotechnology
No ratings yet
Satyanarayan - Biotechnology
880 pages
University of Engineering & Technology, Lahore: Unofficial Transcript
No ratings yet
University of Engineering & Technology, Lahore: Unofficial Transcript
2 pages
Shuttle Vectors and Expression Vectors
100% (2)
Shuttle Vectors and Expression Vectors
2 pages
Blotting Techniques
96% (46)
Blotting Techniques
36 pages
Types of Fermenter
100% (3)
Types of Fermenter
24 pages
Isolation, Preservation and Improvement of Industrially Important Microorganisms
100% (1)
Isolation, Preservation and Improvement of Industrially Important Microorganisms
41 pages
Tetrad Analysis - Sample Problems: TRP + Produces The Following Tetrads. Determine The Genetic Map
100% (2)
Tetrad Analysis - Sample Problems: TRP + Produces The Following Tetrads. Determine The Genetic Map
9 pages
The Chronological Development of The Fermentation Industry
80% (10)
The Chronological Development of The Fermentation Industry
23 pages
Molecular Biology and Genetics Book 3 PDF
75% (4)
Molecular Biology and Genetics Book 3 PDF
68 pages
Carrier Recovery Using A Second Order Costas Loop
No ratings yet
Carrier Recovery Using A Second Order Costas Loop
25 pages
PAM Blosum: Assignment 1 Bioinformatics (DSE 1)
100% (3)
PAM Blosum: Assignment 1 Bioinformatics (DSE 1)
9 pages
Blast (Basic Local Alignment Search Tool)
No ratings yet
Blast (Basic Local Alignment Search Tool)
28 pages
Bootstrapping PRESENTATION BY GROUP 4
100% (4)
Bootstrapping PRESENTATION BY GROUP 4
31 pages
Multiple Sequence Alignment 3
No ratings yet
Multiple Sequence Alignment 3
22 pages
Biotechnology by U Satyanarayan Z Lib or
100% (2)
Biotechnology by U Satyanarayan Z Lib or
880 pages
Linker, Adaptor, Homopolymer Tailing
56% (9)
Linker, Adaptor, Homopolymer Tailing
15 pages
MBOE-201 07. Fermentation Economics PDF
100% (1)
MBOE-201 07. Fermentation Economics PDF
26 pages
Bioinformatics in PAM AND BLOSUM
100% (15)
Bioinformatics in PAM AND BLOSUM
17 pages
Recovery and Purification of Intracellular and Extra Cellular Products
100% (1)
Recovery and Purification of Intracellular and Extra Cellular Products
25 pages
L3.2 Immobilized Enzyme Kinetics
100% (2)
L3.2 Immobilized Enzyme Kinetics
98 pages
Selection of Recombinant Clones
100% (2)
Selection of Recombinant Clones
2 pages
Gene Prediction
25% (4)
Gene Prediction
36 pages
Bioinformatics. CH 3 Databases (Summarized Notes)
50% (2)
Bioinformatics. CH 3 Databases (Summarized Notes)
5 pages
Abzymes
100% (1)
Abzymes
17 pages
Split Genes
No ratings yet
Split Genes
56 pages
Industrial Production of Protease
100% (1)
Industrial Production of Protease
56 pages
Multi Enzyme Complex: Sudhanshu Shekhar M.Tech (Biotech) III Sem A7110709009
84% (25)
Multi Enzyme Complex: Sudhanshu Shekhar M.Tech (Biotech) III Sem A7110709009
20 pages
Application of Spectrophotometer
No ratings yet
Application of Spectrophotometer
16 pages
Sequence File Formats
No ratings yet
Sequence File Formats
22 pages
Strain Improvement
No ratings yet
Strain Improvement
15 pages
Genetic Mapping and Interference and Coincidence
100% (1)
Genetic Mapping and Interference and Coincidence
17 pages
Analysis of Film and Pore Diffusion Effects On Kinetics of Immobilized Enzyme Reactions
100% (1)
Analysis of Film and Pore Diffusion Effects On Kinetics of Immobilized Enzyme Reactions
7 pages
Genei Teaching Kit Manuals
100% (4)
Genei Teaching Kit Manuals
352 pages
Fermentation Technology-1
75% (4)
Fermentation Technology-1
42 pages
Unit 1 Molecules Their Interaction Relevant To Biology CSIR UGC NET Life Sciences
100% (3)
Unit 1 Molecules Their Interaction Relevant To Biology CSIR UGC NET Life Sciences
5 pages
Screening of Microorganisms: Primary and Secondary Techniques - Industrial Biotechnology
No ratings yet
Screening of Microorganisms: Primary and Secondary Techniques - Industrial Biotechnology
10 pages
Animal Cell Culture PRINT
67% (3)
Animal Cell Culture PRINT
22 pages
Bioinformatics Biological Database
No ratings yet
Bioinformatics Biological Database
31 pages
DNA Packaging
100% (2)
DNA Packaging
25 pages
Gel Electrophoresis
No ratings yet
Gel Electrophoresis
4 pages
Cloning Vector FINAL
100% (1)
Cloning Vector FINAL
20 pages
Cell Synchronization
100% (4)
Cell Synchronization
21 pages
Surface Fermentation
50% (2)
Surface Fermentation
15 pages
Restriction Digestion Teaching Kit
0% (1)
Restriction Digestion Teaching Kit
4 pages
Scope of Immunology
100% (6)
Scope of Immunology
6 pages
Complexity of EUKARYOTic Genome
No ratings yet
Complexity of EUKARYOTic Genome
27 pages
Microbial Fermentation and Production of Small and Macro Molecules
60% (5)
Microbial Fermentation and Production of Small and Macro Molecules
5 pages
Lab Manual For Down Stream Process Lab
No ratings yet
Lab Manual For Down Stream Process Lab
31 pages
Lecture 1B - Benzer's Work and Complementation Test
100% (2)
Lecture 1B - Benzer's Work and Complementation Test
5 pages
MCQ Bio
No ratings yet
MCQ Bio
6 pages
Computer Applications in Fermentation
No ratings yet
Computer Applications in Fermentation
29 pages
Ramachandran Plot
100% (5)
Ramachandran Plot
18 pages
Microscopy-Principles and Types
0% (1)
Microscopy-Principles and Types
82 pages
Cell Culture Based Vaccine
No ratings yet
Cell Culture Based Vaccine
11 pages
Allosteric Enzyme
100% (1)
Allosteric Enzyme
22 pages
Filter Feeding in Mpolychatete, Molluscs and Echinodermata
100% (1)
Filter Feeding in Mpolychatete, Molluscs and Echinodermata
14 pages
5 Mitochondrial DNA and Chloroplast DNA
No ratings yet
5 Mitochondrial DNA and Chloroplast DNA
16 pages
Sequence Retrieval System
No ratings yet
Sequence Retrieval System
2 pages
12 Biotechnology - Applications PPT STUDENTS
100% (1)
12 Biotechnology - Applications PPT STUDENTS
47 pages
Indicators of Food Microbial Quality and Safety
100% (2)
Indicators of Food Microbial Quality and Safety
12 pages
13-SCOP - Structural Classification of Proteins-06-09-2024
No ratings yet
13-SCOP - Structural Classification of Proteins-06-09-2024
21 pages
Protein Structure Classification/domain Prediction: SCOP and CATH (Bioinformatics) .
100% (4)
Protein Structure Classification/domain Prediction: SCOP and CATH (Bioinformatics) .
23 pages
Processing of Polymers
100% (1)
Processing of Polymers
36 pages
24 Aspirin
No ratings yet
24 Aspirin
4 pages
Tds Bopa 15 STD
No ratings yet
Tds Bopa 15 STD
1 page
White Noise &amp Properties
No ratings yet
White Noise &amp Properties
26 pages
NABL Scope 2018 PDF
No ratings yet
NABL Scope 2018 PDF
27 pages
A Level Mathematics Practice Paper Q - Pure Mathematics
No ratings yet
A Level Mathematics Practice Paper Q - Pure Mathematics
5 pages
Thermo Lab Report
No ratings yet
Thermo Lab Report
8 pages
Power System State Estimation
100% (1)
Power System State Estimation
13 pages
Test No.26
No ratings yet
Test No.26
2 pages
Math 9
No ratings yet
Math 9
3 pages
Physics Manual XI
No ratings yet
Physics Manual XI
23 pages
Numerical Evaluation of Damage Distribution Over A Slat Track Using Flight Test Data
No ratings yet
Numerical Evaluation of Damage Distribution Over A Slat Track Using Flight Test Data
9 pages
Optimization of Sacrificial Anodes For One Offshore Jacket: February 2016
100% (1)
Optimization of Sacrificial Anodes For One Offshore Jacket: February 2016
7 pages
Module 1 - Engagement Questions - Istec Academy
No ratings yet
Module 1 - Engagement Questions - Istec Academy
14 pages
Metals From Ores. An Introduction To Ext
No ratings yet
Metals From Ores. An Introduction To Ext
17 pages
Chem Notes PDF
No ratings yet
Chem Notes PDF
8 pages
Aim of Project
No ratings yet
Aim of Project
15 pages
HGH-15-CA-1R140-Z0-C 140179 Drawing
No ratings yet
HGH-15-CA-1R140-Z0-C 140179 Drawing
1 page
DP1 Phy P2 SL MS
No ratings yet
DP1 Phy P2 SL MS
9 pages
Chapter 4 Shell and Tube Heat Exchangers
No ratings yet
Chapter 4 Shell and Tube Heat Exchangers
45 pages
AA HL-Sequences-Exp-log WS MS
No ratings yet
AA HL-Sequences-Exp-log WS MS
46 pages
Algebra 1 Practice Problems
No ratings yet
Algebra 1 Practice Problems
5 pages
Problem Set 1 Properties of Material
No ratings yet
Problem Set 1 Properties of Material
9 pages
Title Defense Windstrip
No ratings yet
Title Defense Windstrip
5 pages
CalTrans Trenching Shoring Manual
No ratings yet
CalTrans Trenching Shoring Manual
409 pages
Glosario de Términos de Fundición + Electrolisis PDF
No ratings yet
Glosario de Términos de Fundición + Electrolisis PDF
13 pages

SCOP and CATH Database

Uploaded by

SCOP and CATH Database

Uploaded by

SCOP AND CATH DATABASE

• Secondary databases to study protein structure.

ALPHA-BETA (αβ) 2-Layer Sandwich (αβ1)

• In CATH, there is only one class to represent mixed

SCOP is a valuable resource for detailed

You might also like