0% found this document useful (0 votes)

11 views46 pages

Protein 3D Structure Database

The document provides an overview of protein 3D structures, including primary, secondary, tertiary, and quaternary classifications. It discusses methods for determining 3D structures, such as X-ray crystallography and NMR, and introduces protein structural databases like PDB, SCOP, and CATH. Additionally, it outlines the hierarchical classification of protein domains and the significance of structural and evolutionary relationships in protein classification.

Uploaded by

ssetbtalumnifeedback2024

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views46 pages

Protein 3D Structure Database

Uploaded by

ssetbtalumnifeedback2024

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 46

Protein 3D structure database

PDB, CATH, SCOP

Three-dimensional (3D) structures
• The three-dimensional (3D) structure is also
called the tertiary structure.
• If a protein molecule consists of more than
one polypeptide, it also has the quaternary
structure, which specifies the relative
positions among the polypeptides (subunits)
in a protein.
Protein Structures
Primary Secondary Tertiary Quaternary

Amino acid Alpha helices & Arrangement Packing of several

sequence. Beta sheets, of secondary polypeptide chains.
Loops. elements in
3D space.
Given an amino acid sequence, we are interested in its
secondary structures, and how they are arranged in higher
structures.
How is a 3D structure determined ?

1. Experimental methods (Best approach):

• X-rays crystallography - stable fold, good quality crystals.
• NMR - stable fold, not suitable for large molecule.

2. In-silico methods (partial solutions -

based on similarity):
• Sequence or profile alignment - uses similar sequences,
limited use of 3D information.
• Threading - needs 3D structure
• Ab-initio structure prediction - not always successful.
Protein Structural Databases
• PDB-Protein Data Bank

• SCOP

• CATH
PDB
• The Protein Data Bank is a repository for 3-D
structural data of proteins and nucleic acids.
• These data, typically obtained by X-ray
crystallography or NMR spectroscopy and submitted
by biologists and biochemists from around the world.
• The PDB was established in 1971 at Brookhaven
National Laboratory and originally contained just 7
protein structures
• In 1998, the Research Collaboratory for Structural
Bioinformatics (RCSB) became responsible for the
management of the PDB.
PDB Statistics: 142000 Biological Macromolecular Structures
Protein Structure in PDB
• Text files
• Each entry is specified by a unique 4-letter
code (PDB code): say 1HUY for a variant of
GFP; 1BGK for a 37-residue toxin protein
isolated from sea anemone
• 1HUY and 1BGK
– Header information
– Atomic coordinates in Å
Header Details

• Identifies the molecule, modifications, date of

release

• Host organism, keywords, method of study

• Authors, reference, resolution for X-ray structure

– Smaller the number, better the structure.

• Sequence, reference
The Atomic Coordinates
• XYZ Coordinates for each atom (starting with ATOM, only heavy atom
for X-ray structure) from the first residue to the last

• XYZ coordinates for any ligands (starting with HETATM) complexed to

the bio-macromolecule

• O atoms of water molecules (starting with HETATM, normally at the last

part of the xyz coordinate section)

• Usually, for X-ray structure, resolution is not high enough to locate H

atoms: hence only heavy atoms are shown in the PDB file.

• For NMR structure, all atoms (including hydrogen atoms) are specified
in the PDB file.
X-ray structure 1HUY
NMR structure 1BGK
2. Free Software for Protein Structure
Visualization

• RASMOL: available for all platforms

http://www.openrasmol.org
• Swiss PDB Viewer: from Swiss-Prot
http://www.expasy.ch/spdbv/
• Chemscape Chime Plug-in: for PC and Mac
http://www.mdl.com/downloads/downloadable/index.jsp
• YASARA: http://www.yasara.org/
• MOLMOL: MOLecule analysis and MOLecule display
http://129.132.45.141/wuthrich/software/molmol/index.html
Ribbon representation by RasMol

1HUY An Improved Yellow

Variant Of Green
Fluorescent Protein

From Tsien’s group

J.Biol.Chem. 276 29188
(2001)
Ribbon representation by YASARA
Ribbon representation by YASARA
Ribbon representation by MOLMOL
An ensemble of 15 structures (NMR, toxin Bgk);
Proton atoms also included

15 backbone structures of the

sea anemone toxin Bgk
15 all-atom structures of the
sea anemone toxin Bgk

Line representation
Ribbon representation
Space-filling representation
3. Hierarchical classification of protein
domains: SCOP & CATH

• SCOP:Structural Classification of Proteins

University of Cambridge, UK
http://scop.mrc-lmb.cam.ac.uk/scop/

• CATH: Class—Architecture—Topology
--Homologous Superfamily
Sequence family
University College London, UK
http://www.biochem.ucl.ac.uk/bsm/cath/
Basis for protein classification
Proteins adopt a limited number of topologies
More than 50,000 sequences fold into ~1000 unique folds.

Homologous sequences have similar structures

Usually, when sequence identity>30%, proteins adopt the
same fold. Even in the absence of sequence homology, some
folds are preferred by vastly different sequences.

The “active site” is highly conserved

A subset of functionally critical residues are found to be
conserved even the folds are varied.
How many unique folds do organisms
use to express functions?

Sequence space
> 50,000

Conformational
Many sequences to form space
one unique fold
~1,000 ???????
SCOP-Introduction
• SCOP-Structural Classification Of Protein

• URL - http://scop.mrc-lmp.cam.ac.uk/scop/

• Maintained-MRC laboratory of molecular biology and centre for protein

engineering , Cambridge, UK.

• Authors : Alexei.G.Murzin, 1995

• CO-WORKERS- L.Lo conte, B.G.Ailey,S.E.Brenner, T.J.P.Hubbard,

C.Chothia
Features:
- Its purpose is to classify protein 3D structures in a
hierarchical scheme of structural classes.

- All protein structures are classified and it is updated

as new structures, are deposited in the PDB.

-It adds information through analysis and

organization into hierarchical scheme of Folds, Super
Families and Families.
Definition
• Description- the structural and evolutionary
relationships between all proteins whose structure is
known.
• Proteins are classified to reflect both structural and
evolutionary relatedness.
• Hierarchy Level
• SCOP has been constructed using a combination of
manual inspection and automated methods
• The four classification levels are:

• Class - A very broad description of the

structural content of the protein

• Fold - Indicative of a broad structural

similarity but with no evidence of a
homologous relationship
• Super family - Sufficient structural similarity
to infer a divergent evolutionary
relationship but no
detectable sequence similarity
• Family - Significant sequence similarity which
can be detected either directly or through a
transitive search.
SCOP2
• SCOP2 is a successor to the Structural Classification of
Proteins (SCOP) database.
• Similarly to SCOP, the main focus of SCOP2 is to organize
structurally characterized proteins according to their
structural and evolutionary relationships.
• The relationships in SCOP2 fall into four major categories:
– Protein types,
– Evolutionary events,
– Structural classes and
– Protein relationships. The first two categories do not have counterparts in
SCOP.
`
CATH
• CATH- Class, Architecture,Topology and
Homologous

• URL- www.cathdb.info

• Maintained by PDB

• 1997 by Christine Orengo, Janet Thornton and their

colleagues , University college of London.
Features:
• The CATH database ( Class, Architecture, Topology,
Homologous super family) is a hierarchical classification
of protein domain structures, which clusters proteins at
four major structural levels.
• The aim of the databases is similar to that of SCOP but the
scheme is different , the philosophy and practical details of
producing the classification are also different.
• Four main levels
Class C-level
Architecture, A-level
Topology (Fold family), T-level
Homologous Superfamily, H-level
Class
• Class is determined according to the
secondary structure composition and packing
within the structure.
• Three major classes :
mainly-alpha,
mainly-beta and
alpha-beta (α/β,α+β)
Architecture, A-level
• This describes the overall shape of the domain
structure as determined by the orientations
of the secondary structures
• but ignores the connectivity between the
secondary structures
• e.g. barrel or 3-layer sandwich
Topology (Fold family), T-level
• Structures are grouped into fold groups at this level
depending on both the overall shape and
connectivity of the secondary structures.
• This is done using the structure comparison
algorithm SSAP (sequential structure alignment
program) and CATHEDRAL (a fast and effective
algorithm to predict folds and domain boundaries
from multidomain protein structures).
• Equivalent to a fold in SCOP
Homologous Superfamily, H-level
• This level groups together protein domains
which are thought to share a common
ancestor and can therefore be described as
homologous

• Similarities are identified either by high

sequence identity or structure comparison
using SSAP.

K Vijaya Ramesh
100% (3)
K Vijaya Ramesh
406 pages
Biology Project On Dna Fingerprinting
75% (73)
Biology Project On Dna Fingerprinting
21 pages
SCOP and CATH Database
100% (5)
SCOP and CATH Database
22 pages
Scop Database
No ratings yet
Scop Database
29 pages
Pearson IIT Foundation Series - Biology Class 10 7th Edition Trishna Knowledge Systems Instant Download
100% (1)
Pearson IIT Foundation Series - Biology Class 10 7th Edition Trishna Knowledge Systems Instant Download
51 pages
Protein Structure Classification/domain Prediction: SCOP and CATH (Bioinformatics) .
100% (4)
Protein Structure Classification/domain Prediction: SCOP and CATH (Bioinformatics) .
23 pages
Protein Database Overview
No ratings yet
Protein Database Overview
13 pages
05 Structural Databases
No ratings yet
05 Structural Databases
23 pages
Basic Concepts and Laws: Biology Pointers
100% (1)
Basic Concepts and Laws: Biology Pointers
22 pages
Fold Lib
100% (1)
Fold Lib
24 pages
Structural Classification of Proteins Database
100% (1)
Structural Classification of Proteins Database
8 pages
CSEC BIOLOGY - Ecological Studies
88% (8)
CSEC BIOLOGY - Ecological Studies
36 pages
Veterinary Medicine: 1 History
No ratings yet
Veterinary Medicine: 1 History
8 pages
Protein Structure
No ratings yet
Protein Structure
52 pages
Protein Folds and Structure
No ratings yet
Protein Folds and Structure
19 pages
CSD
No ratings yet
CSD
14 pages
Protein Structure: Daisuke Kihara
No ratings yet
Protein Structure: Daisuke Kihara
19 pages
Scop & Cath: Dr. M.I. Hassan
No ratings yet
Scop & Cath: Dr. M.I. Hassan
50 pages
Protein 3d
No ratings yet
Protein 3d
86 pages
PDBefold Tutorial
No ratings yet
PDBefold Tutorial
14 pages
SCOP Database 2020 1603872986557
No ratings yet
SCOP Database 2020 1603872986557
7 pages
Proteins DR Wurie
No ratings yet
Proteins DR Wurie
70 pages
Bioinformatics Unit I
No ratings yet
Bioinformatics Unit I
6 pages
Lecture 5' - Introduction To Protein Struct II Spr08
No ratings yet
Lecture 5' - Introduction To Protein Struct II Spr08
29 pages
Pi Is 0969212699801774
No ratings yet
Pi Is 0969212699801774
14 pages
Anwesha Mazumder
No ratings yet
Anwesha Mazumder
12 pages
Protein Structural Motifs: Doug Brutlag Professor Emeritus Biochemistry & Medicine (By Courtesy)
No ratings yet
Protein Structural Motifs: Doug Brutlag Professor Emeritus Biochemistry & Medicine (By Courtesy)
100 pages
Article
No ratings yet
Article
11 pages
Proclust:: Improved Clustering of Protein Sequences With An Extended Graph-Based Approach
No ratings yet
Proclust:: Improved Clustering of Protein Sequences With An Extended Graph-Based Approach
58 pages
Introduction To Structural Databases
No ratings yet
Introduction To Structural Databases
10 pages
Proteins 76 418 2009
No ratings yet
Proteins 76 418 2009
21 pages
Bioinformatic Databases 2
No ratings yet
Bioinformatic Databases 2
28 pages
Sanchez CurrOpinStructBiol 1997
No ratings yet
Sanchez CurrOpinStructBiol 1997
9 pages
Lecture 7
No ratings yet
Lecture 7
24 pages
Protein Sequence
No ratings yet
Protein Sequence
36 pages
The Role of Protein Structure in Genomics: Minireview
No ratings yet
The Role of Protein Structure in Genomics: Minireview
5 pages
Lecture3-Structural Bioinformatics-Secondary Resources
No ratings yet
Lecture3-Structural Bioinformatics-Secondary Resources
26 pages
Lecture4-Protein Data Analysis
No ratings yet
Lecture4-Protein Data Analysis
26 pages
Bioinformatics TM6
No ratings yet
Bioinformatics TM6
30 pages
FALLSEM2024-25 BBIT418L TH VL2024250104339 2024-09-11 Reference-Material-I
No ratings yet
FALLSEM2024-25 BBIT418L TH VL2024250104339 2024-09-11 Reference-Material-I
34 pages
Overview of Protein Structure
No ratings yet
Overview of Protein Structure
3 pages
FALLSEM2024-25 BBIT418L TH VL2024250104339 2024-09-12 Reference-Material-I
No ratings yet
FALLSEM2024-25 BBIT418L TH VL2024250104339 2024-09-12 Reference-Material-I
20 pages
Hope - 3 Grade 12: Energy System
No ratings yet
Hope - 3 Grade 12: Energy System
9 pages
Fat Noews
No ratings yet
Fat Noews
20 pages
Bioinfo - S1 2021 - L9 - Protein Structure - 1 Slide
No ratings yet
Bioinfo - S1 2021 - L9 - Protein Structure - 1 Slide
87 pages
Iii. Protein Classification Scop
No ratings yet
Iii. Protein Classification Scop
11 pages
Protein Structure: Predictive Methods and Experimental Methodologies
No ratings yet
Protein Structure: Predictive Methods and Experimental Methodologies
33 pages
CATH, Bilogical Data Bases, Bioinformatics Data Base
No ratings yet
CATH, Bilogical Data Bases, Bioinformatics Data Base
3 pages
Structural Bioinformatics
No ratings yet
Structural Bioinformatics
75 pages
13-SCOP - Structural Classification of Proteins-06-09-2024
No ratings yet
13-SCOP - Structural Classification of Proteins-06-09-2024
21 pages
Structural Bioinformatics
No ratings yet
Structural Bioinformatics
4 pages
Protein Databases
No ratings yet
Protein Databases
23 pages
Scop 2008
No ratings yet
Scop 2008
7 pages
Classification Database
No ratings yet
Classification Database
5 pages
Main
No ratings yet
Main
15 pages
CS273 - Protein Structure Prediction
No ratings yet
CS273 - Protein Structure Prediction
39 pages
Lecture 12 (Structural Bioinformatics)
No ratings yet
Lecture 12 (Structural Bioinformatics)
30 pages
Bioinformatics
No ratings yet
Bioinformatics
10 pages
Structural Bioinformatics
No ratings yet
Structural Bioinformatics
37 pages
Fold Recognition (Threading) : Lecture-02
No ratings yet
Fold Recognition (Threading) : Lecture-02
5 pages
Xenobiotic Metabolism PDF
No ratings yet
Xenobiotic Metabolism PDF
50 pages
Protein Family
No ratings yet
Protein Family
5 pages
Test Series For Neet-2020
No ratings yet
Test Series For Neet-2020
10 pages
Protein Structure Similarity: Mlesnick@stanford - Edu
No ratings yet
Protein Structure Similarity: Mlesnick@stanford - Edu
8 pages
Cath Database
No ratings yet
Cath Database
16 pages
Template Recognition and Initial Alignment
No ratings yet
Template Recognition and Initial Alignment
12 pages
Careers in Biotechnology Booklet
No ratings yet
Careers in Biotechnology Booklet
20 pages
Structural Databases
No ratings yet
Structural Databases
5 pages
Leukemia Panel Sample Report
No ratings yet
Leukemia Panel Sample Report
2 pages
TR G3C4
No ratings yet
TR G3C4
9 pages
Plant Animal Reproduction
No ratings yet
Plant Animal Reproduction
7 pages
Grade 12 Bio Unit 3 Short Notes Oda Sbs
No ratings yet
Grade 12 Bio Unit 3 Short Notes Oda Sbs
127 pages
ECOSYSTEM
No ratings yet
ECOSYSTEM
43 pages
2016 Impact Factor (JCR)
No ratings yet
2016 Impact Factor (JCR)
210 pages
Notes Plant Function and Structure
No ratings yet
Notes Plant Function and Structure
5 pages
Slo Review
No ratings yet
Slo Review
5 pages
Pengembangan BOD Sensor
No ratings yet
Pengembangan BOD Sensor
8 pages
Ecosystems Review Activities - Answers
No ratings yet
Ecosystems Review Activities - Answers
4 pages
Biokimia Hormon 2018
No ratings yet
Biokimia Hormon 2018
35 pages
Testbank & Ebook Brock Biology of Microorganisms 16th Edition Madigan Solution Manual Instant
No ratings yet
Testbank & Ebook Brock Biology of Microorganisms 16th Edition Madigan Solution Manual Instant
18 pages
Sex Linked Traits HW.
No ratings yet
Sex Linked Traits HW.
2 pages
Skibidi Re by Mememandir
No ratings yet
Skibidi Re by Mememandir
12 pages
S Line Solution
No ratings yet
S Line Solution
28 pages
Nutrition (HOTS Question)
No ratings yet
Nutrition (HOTS Question)
4 pages
Alphonce K.N Muscle Tissue MCQs
No ratings yet
Alphonce K.N Muscle Tissue MCQs
3 pages
Death of Dolly Marks Cloning Milestone: News Focus
No ratings yet
Death of Dolly Marks Cloning Milestone: News Focus
2 pages
M.1 OVERVIEW OF INFLAMMATION Overview of Inflammatory Response and Immunologic Functions
No ratings yet
M.1 OVERVIEW OF INFLAMMATION Overview of Inflammatory Response and Immunologic Functions
2 pages
Gene Technology
No ratings yet
Gene Technology
1 page
Utilizing Web-Based Search Engines for Analyzing Biological Macromolecules
From Everand
Utilizing Web-Based Search Engines for Analyzing Biological Macromolecules
Natalie Roberts
No ratings yet
Success Topical Guidebook For GCE O Level Biology 1 5158
From Everand
Success Topical Guidebook For GCE O Level Biology 1 5158
Esther Chen
No ratings yet

Protein 3D Structure Database

Uploaded by

Protein 3D Structure Database

Uploaded by

Protein 3D structure database

PDB, CATH, SCOP

Amino acid Alpha helices & Arrangement Packing of several

1. Experimental methods (Best approach):

2. In-silico methods (partial solutions -

• Identifies the molecule, modifications, date of

• Host organism, keywords, method of study

• Authors, reference, resolution for X-ray structure

• XYZ coordinates for any ligands (starting with HETATM) complexed to

• O atoms of water molecules (starting with HETATM, normally at the last

• Usually, for X-ray structure, resolution is not high enough to locate H

• RASMOL: available for all platforms

1HUY An Improved Yellow

From Tsien’s group

15 backbone structures of the

• SCOP:Structural Classification of Proteins

Homologous sequences have similar structures

The “active site” is highly conserved

• Maintained-MRC laboratory of molecular biology and centre for protein

engineering , Cambridge, UK.

• Authors : Alexei.G.Murzin, 1995

• CO-WORKERS- L.Lo conte, B.G.Ailey,S.E.Brenner, T.J.P.Hubbard,

- All protein structures are classified and it is updated

-It adds information through analysis and

• Class - A very broad description of the

• Fold - Indicative of a broad structural

• 1997 by Christine Orengo, Janet Thornton and their

• Similarities are identified either by high

You might also like