0% found this document useful (0 votes)

31 views23 pages

Protein Databases

Uploaded by

Awais Faizy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

31 views23 pages

Protein Databases

Uploaded by

Awais Faizy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 23

Protein databases

• Protein databases are a type of

biological database that are collections of
information about proteins.
• The information contained in protein databases
includes the amino acid sequence, the domain
structure, the biological function of the protein, its
three-dimensional structure, and its interactions
with other proteins.
• Several protein databases are publicly available.
Based on the type of information stored, protein
databases can be classified into several categories.
• Some of the most common categories of protein
databases are as follows
Protein Sequence Databases

• The protein sequence database contains amino acid

sequences of proteins and related information. The
amino acid sequence of a protein is important
because it determines the protein’s three-
dimensional structure and function, as well as its
identity.
• Some of the most popular protein sequence
databases are:
PIR

• PIR (Protein Information Resource) is a popular

protein sequence database that provides
information on functionally annotated protein
sequences.
• PIR maintains three databases, the Protein
Sequence Database (PSD), the Non-redundant
Reference (NREF) sequence database, and the
integrated Protein Classification (iProClass)
database, which contains annotated protein
sequences, classification information, and protein
family, function, and structure information.
SWISS-PROT

• SWISS-PROT is a protein sequence database that

provides high levels of annotations, including
information on the protein’s function, domain
structure, post-translational modifications, and
variants.
• Swiss-Prot is jointly managed by the SIB (Swiss
Institute of Bioinformatics) and the EBI (European
Bioinformatics Institute).
PDB

• PDB (Protein Data Bank) is a worldwide repository

of 3D structure data on large molecules such as
proteins, nucleic acids, and other biological
macromolecules.
• It stores three-dimensional structural models of
macromolecules obtained through three frequently
used experimental methods: X-ray crystallography,
nuclear magnetic resonance spectroscopy (NMR),
and electron microscopy (3DEM).
SCOP

• SCOP (Structural Classification of Proteins) is a

protein structure database that organizes proteins
based on their secondary structure properties.
• SCOP categorizes proteins into different levels
based on their evolutionary relationships and
structural similarities.
• Proteins with high sequence identity or similar
structure and function are grouped into families,
and families with similar structures but low
sequence identity are placed into superfamilies.
CATH

• CATH is a database that categorizes protein

domains into hierarchical levels based on their
folding patterns.
• Protein domains are classified into the CATH
hierarchy, which consists of four levels of increasing
specificity: Class, Architecture, Topology, and
Homologous Superfamily. Domains that have
similar folding patterns are grouped together at
higher levels of the hierarchy.
Protein-Protein Interaction Databases

• Protein-protein interaction databases are

collections of information on the interactions
between proteins.
• These databases provide valuable information on
the relationships between different proteins and
their functions in biological systems.
BIND

• BIND (Biomolecular Interaction Network Database)

is a database that stores detailed descriptions of
interactions, molecular complexes, and pathways
between various biomolecules, including proteins,
nucleic acids, and small molecules.
• The database is designed to be used for data
mining and can be used to study networks of
interactions and map pathways across different
species. The database can also provide information
for kinetic simulations.
DIP

• DIP (Database of Interacting Proteins) is a database

that contains protein-protein interaction
information that has been compiled through both
manual curations and computational methods.
• It is useful for understanding protein functions, and
their relationships with other proteins. It can also
be used to study the properties of networks of
interacting proteins, evaluate predictions of
protein-protein interactions, and explore the
evolution of these interactions.
MINT

• MINT (Molecular Interaction) is a database that

stores information on functional interactions
between biological molecules such as proteins,
RNA, and DNA.
• It also stores information on enzymatic
modifications of partner molecules.
• The database primarily focuses on experimentally
verified protein-protein interactions and considers
both direct and indirect relationships.
Protein Pattern and Profile Databases

• Protein pattern and profile databases contain

information on motifs found in sequences.
Sequence motifs correspond to structural or
functional features in proteins.
• So, the use of protein sequence patterns or profiles
is a valuable tool in determining the function of
proteins.
InterPro

• InterPro is a database that contains information on

protein families, domains, and functional sites.
• It was created by combining several major protein
signature databases, including PROSITE, Pfam,
PRINTS, ProDom, and SMART into a single
comprehensive resource.
PROSITE

• PROSITE is a collection of signatures that identify

patterns or profiles in proteins, which can provide
information on their biological functions.
• The signatures in the database are linked to
annotation documents that provide information on
the protein family or domain detected, including its
name, function, 3D structure, and references
Metabolic Pathway Databases

• Metabolic pathway databases contain information

about enzymes, biochemical reactions, and
metabolic pathways.
ENZYME

• ENZYME is a database that stores information on

enzyme nomenclature.
• It is used as the nomenclature source for enzyme
names and reactions by most metabolic databases
as well as by other biomolecular databases.
KEGG

• KEGG (Kyoto Encyclopedia of Genes and Genomes)

is a comprehensive database that maps out
molecular and cellular pathways involving
interactions between genes and molecules.
• It is composed of pathway maps, molecule tables,
gene tables, and genome maps, and is used to build
functional maps of metabolic and regulatory
pathways.
Applications of protein
databases
• Protein databases have numerous applications.
Some of the applications are:
• Protein databases can be used in sequence analysis
to identify homologous sequences and predict
protein functions based on sequence similarity.
• Protein databases can also be used for predicting
protein structure by comparing the amino acid
sequence of a protein with known structures in the
database.
• Protein databases also include tools to study
protein-protein interactions.
• Protein pattern and profile databases can be used
for protein family identification by identifying
conserved motifs.
• Protein databases such as metabolic pathway
databases can be used in drug discovery and
disease research by studying the metabolic
pathways involved in diseases.
Secondary databases

• By contrast, secondary databases comprise data

derived from the results of analysing primary data.
• They are often referred to as curated databases but
this is a bit of a misnomer because primary
databases are also curated to ensure that the data
in them is consistent and accurate.
• Secondary databases often draw upon information
from numerous sources,
• including other databases (primary and secondary),
controlled vocabularies and the scientific literature.
• They are highly curated, often using a complex
combination of computational algorithms and
manual analysis and interpretation to derive new
knowledge from the public record of science.

Biological - Databases Class Work 60
No ratings yet
Biological - Databases Class Work 60
60 pages
Databases Class Work
No ratings yet
Databases Class Work
48 pages
Book Amit Kumar, G. K. Goswami, Edwin Huffine Handbook of DNA Forensic Applications and Interpretation
No ratings yet
Book Amit Kumar, G. K. Goswami, Edwin Huffine Handbook of DNA Forensic Applications and Interpretation
190 pages
Note 2
No ratings yet
Note 2
54 pages
BCH 505 Bioinformatics 3 (2 2) Databases
No ratings yet
BCH 505 Bioinformatics 3 (2 2) Databases
17 pages
Introduction To Bioinformatics (Databases)
No ratings yet
Introduction To Bioinformatics (Databases)
28 pages
Biological Databases
No ratings yet
Biological Databases
41 pages
Bioinformatics. CH 3 Databases (Summarized Notes)
50% (2)
Bioinformatics. CH 3 Databases (Summarized Notes)
5 pages
Unit 2
No ratings yet
Unit 2
36 pages
Bioinformatics (Final)
No ratings yet
Bioinformatics (Final)
41 pages
Molecular Evolution and Phylogenetics - Nei and Kumar
100% (3)
Molecular Evolution and Phylogenetics - Nei and Kumar
350 pages
Introduction To Databases
No ratings yet
Introduction To Databases
21 pages
Database 2
No ratings yet
Database 2
15 pages
Protein Database Overview
No ratings yet
Protein Database Overview
13 pages
Multi-Omics Playbook Dec 2024
No ratings yet
Multi-Omics Playbook Dec 2024
96 pages
Bioinformatics Biological Database
No ratings yet
Bioinformatics Biological Database
31 pages
Data Base in Bioinformatics
No ratings yet
Data Base in Bioinformatics
30 pages
Kebt 110
No ratings yet
Kebt 110
14 pages
Biological Database ODL
No ratings yet
Biological Database ODL
21 pages
Zoya Bioinformatics Assignment
No ratings yet
Zoya Bioinformatics Assignment
36 pages
Biological Data Bases
No ratings yet
Biological Data Bases
36 pages
Lec2 Databases
No ratings yet
Lec2 Databases
135 pages
BIOINFORMATICS - eNOTES
No ratings yet
BIOINFORMATICS - eNOTES
23 pages
Biologicaldatabase 190402034501
No ratings yet
Biologicaldatabase 190402034501
26 pages
(Methods in Molecular Biology, 2231) Kazutaka Katoh - Multiple Sequence Alignment - Methods and Protocols-Humana (2020)
No ratings yet
(Methods in Molecular Biology, 2231) Kazutaka Katoh - Multiple Sequence Alignment - Methods and Protocols-Humana (2020)
322 pages
Computational Biology
No ratings yet
Computational Biology
19 pages
Ajol File Journals - 314 - Articles - 242956 - Submission - Proof - 242956 3745 584187 1 10 20230306
No ratings yet
Ajol File Journals - 314 - Articles - 242956 - Submission - Proof - 242956 3745 584187 1 10 20230306
17 pages
CCIR Academy - Cambridge Future Scholar (Summer 2024)
No ratings yet
CCIR Academy - Cambridge Future Scholar (Summer 2024)
260 pages
Bioinformatics PPT Section B Data Storage and Retrival Group 3
No ratings yet
Bioinformatics PPT Section B Data Storage and Retrival Group 3
36 pages
Biological Databases PDF
No ratings yet
Biological Databases PDF
13 pages
Bioinformatics Databases
No ratings yet
Bioinformatics Databases
7 pages
Peace BMCB Seminar
No ratings yet
Peace BMCB Seminar
13 pages
Module 2 Biodata
No ratings yet
Module 2 Biodata
36 pages
BIOINFORMATICS
No ratings yet
BIOINFORMATICS
22 pages
Database
No ratings yet
Database
16 pages
Bioinformatics Lecture Notes Database
No ratings yet
Bioinformatics Lecture Notes Database
28 pages
Contrast Data Mining - Concepts, Algorithms, and Applications (Dong & Bailey 2012-09-07)
No ratings yet
Contrast Data Mining - Concepts, Algorithms, and Applications (Dong & Bailey 2012-09-07)
428 pages
Bioinformatics
No ratings yet
Bioinformatics
47 pages
Bioinformatics (STH Sir)
No ratings yet
Bioinformatics (STH Sir)
13 pages
Biological Databases BDB
No ratings yet
Biological Databases BDB
5 pages
Bioinformatic Databases 2
No ratings yet
Bioinformatic Databases 2
28 pages
Supercomputing & Computational Biology: Presented by
100% (2)
Supercomputing & Computational Biology: Presented by
26 pages
Unit I
No ratings yet
Unit I
28 pages
Dissertation Pcs Sujets
100% (2)
Dissertation Pcs Sujets
4 pages
Lecture Topic: Protein Databases: Topics Covered
No ratings yet
Lecture Topic: Protein Databases: Topics Covered
67 pages
Bioinformatics: Merging Biology and Technology
From Everand
Bioinformatics: Merging Biology and Technology
Mani Devar
No ratings yet
161 Vansh Sharma
No ratings yet
161 Vansh Sharma
4 pages
Protein Family
No ratings yet
Protein Family
5 pages
Naas Score 2014
No ratings yet
Naas Score 2014
40 pages
Bioinformatics Overview
100% (1)
Bioinformatics Overview
18 pages
Databases - Final
No ratings yet
Databases - Final
50 pages
DATAbases 1 KD
No ratings yet
DATAbases 1 KD
5 pages
Bio in For Ma Tics
No ratings yet
Bio in For Ma Tics
52 pages
Presentation 11
No ratings yet
Presentation 11
20 pages
CH12
No ratings yet
CH12
8 pages
Protein Databases
No ratings yet
Protein Databases
12 pages
Bioinformatics & Gene Banks
No ratings yet
Bioinformatics & Gene Banks
2 pages
Protein Database
No ratings yet
Protein Database
3 pages
Lab Report 05
No ratings yet
Lab Report 05
20 pages
Abasyn University Peshawar: Name: Ihsan Ullah Depart: BS Medical Lab Technology
No ratings yet
Abasyn University Peshawar: Name: Ihsan Ullah Depart: BS Medical Lab Technology
8 pages
Bioinformatics Lab 2
No ratings yet
Bioinformatics Lab 2
9 pages
Presented By: Anand Sagar Tiwari M.Pharm (Second Semester)
No ratings yet
Presented By: Anand Sagar Tiwari M.Pharm (Second Semester)
56 pages
Rese Rach
No ratings yet
Rese Rach
37 pages
Biological Data and Database Biological Data
No ratings yet
Biological Data and Database Biological Data
10 pages
Bioinformatics Day2
No ratings yet
Bioinformatics Day2
3 pages
Metagenomics An Overview and Its Application in Nematology
No ratings yet
Metagenomics An Overview and Its Application in Nematology
6 pages
Basics of Bioinformatics in Biological Research
No ratings yet
Basics of Bioinformatics in Biological Research
5 pages
Toxicology
No ratings yet
Toxicology
78 pages
Biological Databases
No ratings yet
Biological Databases
13 pages
Introduction To NCBI Resources
No ratings yet
Introduction To NCBI Resources
39 pages
Quality Control (QC)
No ratings yet
Quality Control (QC)
25 pages
2014 AGTA Conference Handbook
No ratings yet
2014 AGTA Conference Handbook
139 pages
10 1 1 98 698-cnk3
No ratings yet
10 1 1 98 698-cnk3
190 pages
Bioinformatics and Omics Topic: Database and Biological Database With Examples Assignment-3
No ratings yet
Bioinformatics and Omics Topic: Database and Biological Database With Examples Assignment-3
5 pages
Generating Structural Data Analysis
No ratings yet
Generating Structural Data Analysis
8 pages
Isolater 1.0.1
No ratings yet
Isolater 1.0.1
28 pages
Bioinformatics Note
No ratings yet
Bioinformatics Note
7 pages
Lab Manual: Birla Institute of Technology and Science, Pilani Pilani Campus
No ratings yet
Lab Manual: Birla Institute of Technology and Science, Pilani Pilani Campus
62 pages
List of Online Bioinformatics Tools and Software - Final
No ratings yet
List of Online Bioinformatics Tools and Software - Final
23 pages
Botany: U.G.C. Choice Based Credit System (CBCS) Annual Pattern UG Courses Model Curriculum
No ratings yet
Botany: U.G.C. Choice Based Credit System (CBCS) Annual Pattern UG Courses Model Curriculum
37 pages
Experimental and Bioinformatic Approaches For Interrogating Protein-Protein Interactions To Determine Protein Function
No ratings yet
Experimental and Bioinformatic Approaches For Interrogating Protein-Protein Interactions To Determine Protein Function
18 pages
Tics - A Brief Introduction
No ratings yet
Tics - A Brief Introduction
4 pages
NC Panpop Merge SV 刘建全
No ratings yet
NC Panpop Merge SV 刘建全
9 pages
Biological Databases
No ratings yet
Biological Databases
12 pages
1 s2.0 S0010482523009630 Main
No ratings yet
1 s2.0 S0010482523009630 Main
10 pages
rapid-pcr-barcoding-RPB 9059 v1 Revl 14aug2019-Minion
No ratings yet
rapid-pcr-barcoding-RPB 9059 v1 Revl 14aug2019-Minion
20 pages
Nikhil Pandey
No ratings yet
Nikhil Pandey
16 pages
Top 200 High Impact Journal - 20.02.09 - Natural Sciences
No ratings yet
Top 200 High Impact Journal - 20.02.09 - Natural Sciences
9 pages
IBDP Biology HL FE2016 Kognity
No ratings yet
IBDP Biology HL FE2016 Kognity
1 page
C Accts Easyweb Chettinad Chettinadadmin Homework 050722 12BGENMSG050722025610 PM
No ratings yet
C Accts Easyweb Chettinad Chettinadadmin Homework 050722 12BGENMSG050722025610 PM
5 pages
Understanding Biochemistry Bio Molecule
No ratings yet
Understanding Biochemistry Bio Molecule
7 pages
Morphology 2
No ratings yet
Morphology 2
1 page
The Nature of Systems Biology: Frank J. Bruggeman and Hans V. Westerhoff
No ratings yet
The Nature of Systems Biology: Frank J. Bruggeman and Hans V. Westerhoff
6 pages

Protein Databases

Uploaded by

Protein Databases

Uploaded by

Protein databases

• Protein databases are a type of

• The protein sequence database contains amino acid

• PIR (Protein Information Resource) is a popular

• SWISS-PROT is a protein sequence database that

• PDB (Protein Data Bank) is a worldwide repository

• SCOP (Structural Classification of Proteins) is a

• CATH is a database that categorizes protein

• Protein-protein interaction databases are

• BIND (Biomolecular Interaction Network Database)

• DIP (Database of Interacting Proteins) is a database

• MINT (Molecular Interaction) is a database that

• Protein pattern and profile databases contain

• InterPro is a database that contains information on

• PROSITE is a collection of signatures that identify

• Metabolic pathway databases contain information

• ENZYME is a database that stores information on

• KEGG (Kyoto Encyclopedia of Genes and Genomes)

• By contrast, secondary databases comprise data

You might also like