0% found this document useful (0 votes)

93 views5 pages

Bioinformatics and Omics Topic: Database and Biological Database With Examples Assignment-3

This document discusses biological databases, including their purpose and types. Biological databases store biological data like DNA sequences, protein structures, etc. in an organized way. There are three main types of biological databases: primary databases which archive experimental data with minimal annotation; secondary databases which apply computational analysis to primary data to derive more knowledge; and composite databases which merge and filter data from primary databases to make searches more efficient. Examples of prominent biological databases discussed include GenBank, SWISS-PROT, Pfam, and BLOCKS.

Uploaded by

vidushi srivastava

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

93 views5 pages

Bioinformatics and Omics Topic: Database and Biological Database With Examples Assignment-3

Uploaded by

vidushi srivastava

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 5

Bioinformatics and Omics

Topic: Database and Biological database with examples

Assignment-3

Submitted to: Submitted by:

Dr. Shazia Haider Diksha gupta(20915008)
Msc micro 3rd sem
Database and Biological database with examples

Assignment-3

A database is a collection of inter-related data which helps in efficient retrieval, insertion and deletion lf
data from database and organizes data in the form of tables, schemas, reports etc. Databases are
effectively electronic filing cabinets, a convenient and efficient method of storing vast amount of
information.

It is a computerized archive used to store and organize data in such a way that information can be
retrieved easily via a variety of search criteria. These are composed of computer hardware and software
for data management. The chief objective of the development of a database is to organize data in a set of
structured records to enable easy retrieval of information. Each record, also called an entry,
should contain a number of fields that hold the actual data items, for example, fields for names,
phone numbers, addresses, dates. To retrieve a particular record from the database, a user can
specify a particular piece of information, called value, to be found in a particular field and expect
the computer to retrieve the whole data record. This process is called making a query.

Although data retrieval is the main purpose of all databases, biological databases often have a
higher level of requirement, known as knowledge discovery, which refers to the identification of
connections between pieces of information that were not known when the information was first
entered. For example, databases containing raw sequence information can perform extra
computational tasks to identify sequence homology or conserved motifs. These features facilitate
the discovery of new biological insights from raw data.

Software which is used to manage database is called Database Management System (DBMS).
They are sophisticated computer software programs for organizing, searching, and accessing
data.

Types

There are many different databases types, depending both on nature of the information being
stored (eg. Sequences or structure, 2D gel or 3D structure images) and on the manner of data
storage
 Flat-files database
 Relational database
 Object-oriented database

Biological database

These are the databases consisting of biological data like protein sequencing, molecular
structure, DNA sequences, etc. in an organized form.

They are free to use and contain a huge collection of a variety of biological data.These are
libraries of biological sciences, collected from scientific experiments, published literature, high-
throughput experiment technology, and computational analysis. They contain information from
research areas including genomics, proteomics, metabolomics, microarray gene expression,
and phylogenetic.

Information contained in biological databases includes gene function, structure, localization

(both cellular and chromosomal), clinical effects of mutations as well as similarities of biological
sequences and structures. Current biological databases use all three types of database structures:
flat files, relational, and object oriented.

Biological database can be divided into 3 categories:

 Primary Databases
o Most of the data in the databases are contributed directly by authors with a
minimal level of annotation.
o It can also be called an archival database since it archives the
experimental results submitted by the scientists.
o The primary database is populated with experimentally derived data like

genome sequence, macromolecular structure, etc. The data entered here

remains uncurated (no modifications are performed over the data)

There are three major public sequence databases that store raw nucleic acid sequence data
produced and submitted by researchers worldwide: GenBank, the European Molecular Biology
Laboratory (EMBL) database and the DNA Data Bank of Japan (DDBJ). They together
constitute the International Nucleotide Sequence Database Collaboration for the three-
dimensional structures of biological macromolecules, there is only one centralized database, the
PDB. This database archives atomic coordinates of macromolecules (both proteins and nucleic
acids) determined by x-ray crystallography and NMR.

GenBank is the most complete collection of annotated nucleic acid sequence data for almost
every organism. The content includes genomic DNA, mRNA, cDNA, ESTs, high throughput raw
sequence data, and sequence polymorphisms. There is also a GenPept database for protein
sequences, the majority of which are conceptual translations from DNA sequences, although a
small number of the amino acid sequences are derived using peptide sequencing techniques.

There are two ways to search for sequences in GenBank. One is using text-based keywords
similar to a PubMed search. The other is using molecular sequences to search by sequence
similarity using BLAST.

 Secondary Databases
o To turn the raw sequence information into more sophisticated biological
knowledge, much post processing of the sequence information is needed. This
begs the need for secondary databases, which contain computationally processed
sequence information derived from the primary databases.
o Computational algorithms are applied to the primary database and meaningful and
informative data is stored inside the secondary database.
o A secondary database is better and contains more valuable knowledge compared
to the primary database.

A prominent example of secondary databases is SWISS-PROT, which provides detailed

sequence annotation that includes structure, function, and protein family assignment. The
sequence data are mainly derived from TrEMBL, a database of translated nucleic acid sequences
stored in the EMBL database. PIR is also an example of this databases.

The Pfam and Blocks databases contain aligned protein sequence information as well as derived
motifs and patterns, which can be used for classification of protein families and inference of
protein functions. The DALI database is vital for protein structure classification and threading
analysis to identify distant evolutionary relationships among proteins.

In Blocks database, the motifs (here called Blocks) are created automatically by highlighting and
detecting the most conserved regions of each family of proteins. This databases are fully
automated. Keyword and sequence searching are the two important features of this type of
database. Blocks are ungapped Multiple Sequence Alignment representing conserved protein
regions.

 Composite Databases

o The data entered in these types of databases are first compared and then filtered
based on desired criteria.

o The initial data are taken from the primary database, and then they are merged
together based on certain conditions.

o It helps in searching sequences rapidly. Composite Databases contain non-

redundant data.

o They render sequence searching more efficient, because they obviate the need to
interrogate multiple resources.

o Because they are often curated by experts in the field, they may have unique
organizations and additional annotations associated with the sequences.

OWL, NRDB and SWISS-PROT+ TrEMBL are the examples of these databases.

NRDB (Non-Redundant Database) is built locally at the NCBI. The database is a composite of
GenPept, PDB sequences, SWISS-PROT, SPupdate, PIR, and GenPeptupdate. The database is
thus comprehensive and contains up-to-date information. It is non-redundant, but non- identical,
i.e., only identical sequence copies are removed from the resource. But the contents of NRDB
are both error-prone and, in spite of its name, redundant. NRDB is the default database of the
NCBI BLAST service.

80 Mock Questions - Aws Certified Data Engineer Associate
100% (1)
80 Mock Questions - Aws Certified Data Engineer Associate
33 pages
Biological Databases ODL
No ratings yet
Biological Databases ODL
31 pages
HA250 EN Col18
100% (1)
HA250 EN Col18
117 pages
BCH 505 Bioinformatics 3 (2 2) Databases
No ratings yet
BCH 505 Bioinformatics 3 (2 2) Databases
17 pages
Bioinformatics (Final)
No ratings yet
Bioinformatics (Final)
41 pages
Biological Databases
No ratings yet
Biological Databases
3 pages
Bioinformatics PPT Section B Data Storage and Retrival Group 3
No ratings yet
Bioinformatics PPT Section B Data Storage and Retrival Group 3
36 pages
Unit II Bioinformatics
No ratings yet
Unit II Bioinformatics
25 pages
Introduction To Bioinformatics (Databases)
No ratings yet
Introduction To Bioinformatics (Databases)
28 pages
Dbms Gate Notes
No ratings yet
Dbms Gate Notes
574 pages
M Lec 01 & 02 Biological Database
No ratings yet
M Lec 01 & 02 Biological Database
50 pages
BIOINFORMATICS - eNOTES
No ratings yet
BIOINFORMATICS - eNOTES
23 pages
Databases Class Work
No ratings yet
Databases Class Work
48 pages
Biological Databases Lec 2,3
No ratings yet
Biological Databases Lec 2,3
49 pages
Bioinformatics Biological Database
No ratings yet
Bioinformatics Biological Database
31 pages
Bioinformatics. CH 3 Databases (Summarized Notes)
50% (2)
Bioinformatics. CH 3 Databases (Summarized Notes)
5 pages
Biological Databases: DR Z Chikwambi Biotechnology
No ratings yet
Biological Databases: DR Z Chikwambi Biotechnology
47 pages
Database
No ratings yet
Database
40 pages
Lecture 4 Biological Databases
No ratings yet
Lecture 4 Biological Databases
29 pages
Biological - Databases Class Work 60
No ratings yet
Biological - Databases Class Work 60
60 pages
Biological Databases PDF
No ratings yet
Biological Databases PDF
13 pages
Lec2 Databases
No ratings yet
Lec2 Databases
135 pages
02-A-Introduction To Biological Databases
No ratings yet
02-A-Introduction To Biological Databases
52 pages
Biol BDs Singapore
No ratings yet
Biol BDs Singapore
24 pages
Biological Databases
No ratings yet
Biological Databases
41 pages
Unit 2
No ratings yet
Unit 2
36 pages
NoSQL MongoDB HBase Cassandra
100% (1)
NoSQL MongoDB HBase Cassandra
142 pages
Internship Report
No ratings yet
Internship Report
26 pages
Capture D'écran . 2023-03-14 À 00.15.22
No ratings yet
Capture D'écran . 2023-03-14 À 00.15.22
54 pages
CMSC 838T - Lecture 9: Bioinformatics Databases
No ratings yet
CMSC 838T - Lecture 9: Bioinformatics Databases
65 pages
Introduction To Databases
No ratings yet
Introduction To Databases
7 pages
FALLSEM2019-20 BIT2001 ETH VL2019201000690 Reference Material I 11-Jul-2019 Unit I New
No ratings yet
FALLSEM2019-20 BIT2001 ETH VL2019201000690 Reference Material I 11-Jul-2019 Unit I New
48 pages
Bioinformatics
No ratings yet
Bioinformatics
47 pages
Bioinformatics Lecture Notes Database
No ratings yet
Bioinformatics Lecture Notes Database
28 pages
Day 1
No ratings yet
Day 1
38 pages
Biological Database ODL
No ratings yet
Biological Database ODL
21 pages
Sec1 Introduction To Bioinformatics
No ratings yet
Sec1 Introduction To Bioinformatics
20 pages
Introduction To Databases
No ratings yet
Introduction To Databases
21 pages
Biological Databases: - Bio-Informatics
No ratings yet
Biological Databases: - Bio-Informatics
16 pages
Database
No ratings yet
Database
16 pages
WINSEM2021-22 BIY1012 ETH VL2021220501045 Reference Material I 11-01-2022 Ntroduction To Databases
No ratings yet
WINSEM2021-22 BIY1012 ETH VL2021220501045 Reference Material I 11-01-2022 Ntroduction To Databases
42 pages
Bio in For Ma Tics
No ratings yet
Bio in For Ma Tics
52 pages
Biological Databases
No ratings yet
Biological Databases
17 pages
Ajol File Journals - 314 - Articles - 242956 - Submission - Proof - 242956 3745 584187 1 10 20230306
No ratings yet
Ajol File Journals - 314 - Articles - 242956 - Submission - Proof - 242956 3745 584187 1 10 20230306
17 pages
Unit I
No ratings yet
Unit I
28 pages
Unit Ii
No ratings yet
Unit Ii
23 pages
Database 2
No ratings yet
Database 2
15 pages
المحاضرة 2
No ratings yet
المحاضرة 2
16 pages
Peace BMCB Seminar
No ratings yet
Peace BMCB Seminar
13 pages
Databases - Final
No ratings yet
Databases - Final
50 pages
Presentation 11
No ratings yet
Presentation 11
20 pages
Tea Plantation: Submitted To Dr. Smriti Gaur
No ratings yet
Tea Plantation: Submitted To Dr. Smriti Gaur
19 pages
Lecture 5 - DataBase
No ratings yet
Lecture 5 - DataBase
18 pages
PDF Fee Receipt
No ratings yet
PDF Fee Receipt
1 page
Bioinformatics Database Resources: Icxa Khandelwal Pavan Kumar Agrawal Rahul Shrivastava
No ratings yet
Bioinformatics Database Resources: Icxa Khandelwal Pavan Kumar Agrawal Rahul Shrivastava
46 pages
2024.HF BioInformatics Lec3p
No ratings yet
2024.HF BioInformatics Lec3p
11 pages
Rese Rach
No ratings yet
Rese Rach
37 pages
Biological Databases BDB
No ratings yet
Biological Databases BDB
5 pages
CH12
No ratings yet
CH12
8 pages
161 Vansh Sharma
No ratings yet
161 Vansh Sharma
4 pages
Biological Data and Database Biological Data
No ratings yet
Biological Data and Database Biological Data
10 pages
Basics of Bioinformatics in Biological Research
No ratings yet
Basics of Bioinformatics in Biological Research
5 pages
Biological Database
No ratings yet
Biological Database
3 pages
Bioinfo U2 KD 2
No ratings yet
Bioinfo U2 KD 2
3 pages
Biological Databases
No ratings yet
Biological Databases
13 pages
Bioinformatics Day2
No ratings yet
Bioinformatics Day2
3 pages
FAQ in IDMS
No ratings yet
FAQ in IDMS
5 pages
Guide To Threat Modeling
No ratings yet
Guide To Threat Modeling
30 pages
SWRL Tutorial 01
100% (4)
SWRL Tutorial 01
80 pages
Topic: - Vermicomposting
No ratings yet
Topic: - Vermicomposting
9 pages
Event Management System Web-Based Application Management System
No ratings yet
Event Management System Web-Based Application Management System
83 pages
Symantec Reporter 10.5.x Administrator's Guide: Revision - Wednesday, March 11, 2020
No ratings yet
Symantec Reporter 10.5.x Administrator's Guide: Revision - Wednesday, March 11, 2020
132 pages
NaveenKumarArjunan LogisticsStint Report PDF
No ratings yet
NaveenKumarArjunan LogisticsStint Report PDF
17 pages
Semantic Tags
No ratings yet
Semantic Tags
6 pages
Informatica Training - Presentation Transcript
No ratings yet
Informatica Training - Presentation Transcript
10 pages
Lesson 1 Overview of IT Audit
No ratings yet
Lesson 1 Overview of IT Audit
42 pages
DataMasking Using DataStage
No ratings yet
DataMasking Using DataStage
60 pages
Reponse Test PHP
No ratings yet
Reponse Test PHP
19 pages
Du Migration Certificate Form
No ratings yet
Du Migration Certificate Form
3 pages
Law Enforcement Analytic Standards
100% (1)
Law Enforcement Analytic Standards
46 pages
International GCSE: Computer Science
No ratings yet
International GCSE: Computer Science
8 pages
Summation Load File Guide
No ratings yet
Summation Load File Guide
87 pages
How I Cracked The AWS Solution Architect Cloud Quest. - DEV Community
No ratings yet
How I Cracked The AWS Solution Architect Cloud Quest. - DEV Community
8 pages
Class X Pre Board-1
No ratings yet
Class X Pre Board-1
6 pages
Biosensor Technology As Diagnostic Tool For Detection of Pathogen
No ratings yet
Biosensor Technology As Diagnostic Tool For Detection of Pathogen
9 pages
Dr. H.S. Gaur University Sagar, M.P. B.C.A. 1 Year (I Semester) Paper Code: Bca101 Computer Fundamentals
No ratings yet
Dr. H.S. Gaur University Sagar, M.P. B.C.A. 1 Year (I Semester) Paper Code: Bca101 Computer Fundamentals
22 pages
CF Flasharray Architect Professional Exam Guide 5
No ratings yet
CF Flasharray Architect Professional Exam Guide 5
18 pages
Survey Report: Department of Humanities and Social Sciences
No ratings yet
Survey Report: Department of Humanities and Social Sciences
7 pages
Lab-1 File Record Piyush Kumar
No ratings yet
Lab-1 File Record Piyush Kumar
48 pages
Beverages: Serving Time - 25 Minutes
No ratings yet
Beverages: Serving Time - 25 Minutes
8 pages
M.SC (Microbiology) 2 Year Program 1 Sem Microbiology Lab - I (19M25Bt111) Vikas Sharma 20915002
No ratings yet
M.SC (Microbiology) 2 Year Program 1 Sem Microbiology Lab - I (19M25Bt111) Vikas Sharma 20915002
7 pages
Blast Command Line Applications User Manual: Last Updated: June 28, 2021
No ratings yet
Blast Command Line Applications User Manual: Last Updated: June 28, 2021
101 pages
Review Article: Importance of IL-10 Modulation by Probiotic Microorganisms in Gastrointestinal Inflammatory Diseases
No ratings yet
Review Article: Importance of IL-10 Modulation by Probiotic Microorganisms in Gastrointestinal Inflammatory Diseases
11 pages
JavaTextbook Chapter 21 JDBC-2020
No ratings yet
JavaTextbook Chapter 21 JDBC-2020
29 pages
Manual Topocad 14 ENU
No ratings yet
Manual Topocad 14 ENU
445 pages
CURD
No ratings yet
CURD
26 pages
Guidelines - Orientation - Procedure 2021 Odd
No ratings yet
Guidelines - Orientation - Procedure 2021 Odd
2 pages
Admission Notification For Ph.D. Program - 2021-22
No ratings yet
Admission Notification For Ph.D. Program - 2021-22
18 pages
DB Lec Week-1 Introduction
No ratings yet
DB Lec Week-1 Introduction
14 pages
Screenshot 20210508 155202 Com - Notebloc.app
No ratings yet
Screenshot 20210508 155202 Com - Notebloc.app
1 page
Earth Day Eco-Warrior (Open Positions - 2500) - Internship - Certificate
No ratings yet
Earth Day Eco-Warrior (Open Positions - 2500) - Internship - Certificate
1 page
Developing An Ontology For Cyber Security Knowledge Graphs
No ratings yet
Developing An Ontology For Cyber Security Knowledge Graphs
4 pages
20180308012211are Farms Becomin Digital Firms
No ratings yet
20180308012211are Farms Becomin Digital Firms
4 pages
Enzymes
No ratings yet
Enzymes
2 pages
VermiComposting
No ratings yet
VermiComposting
6 pages
Student Choicereport
No ratings yet
Student Choicereport
1 page
Information Sheet:: Eucalyptus Trees As Honey Bee Forage
No ratings yet
Information Sheet:: Eucalyptus Trees As Honey Bee Forage
1 page
Introduction to Bioinformatics Using Action Labs
From Everand
Introduction to Bioinformatics Using Action Labs
Jean-Louis Lassez
5/5 (1)

Bioinformatics and Omics Topic: Database and Biological Database With Examples Assignment-3

Uploaded by

Bioinformatics and Omics Topic: Database and Biological Database With Examples Assignment-3

Uploaded by

Bioinformatics and Omics

Topic: Database and Biological database with examples

Submitted to: Submitted by:

Information contained in biological databases includes gene function, structure, localization

Biological database can be divided into 3 categories:

genome sequence, macromolecular structure, etc. The data entered here

A prominent example of secondary databases is SWISS-PROT, which provides detailed

o It helps in searching sequences rapidly. Composite Databases contain non-

You might also like