SQH7001 Bioinformatics Task - Velda Rifka Almira
SQH7001 Bioinformatics Task - Velda Rifka Almira
SQH7001 Bioinformatics Task - Velda Rifka Almira
Student ID : 22098713
Course ID : SQH7001
Course name : Research Methodology in Environmental Management Technology
Assignment on Bioinformatics
1. Using the primary databases from NCBI website, search for mRNA and protein sequence of
insulin. The table below contains the results of required information list:
Type of
Insulin mRNA Sequence Insulin Protein Sequence
Data
Name of NCBI Nucleotide Database NCBI Protein Database
Database & (https://www.ncbi.nlm.nih.gov/nuccore/?ter (https://www.ncbi.nlm.nih.gov/protein/?ter
URL m=mRNA+of+insulin ) m=insulin+protein)
54607 72218
Number of
Search Hits
1. MN57671.1 1. AAA40590.1
2. NM_001204686.1 2. NP_001191615.1
3. M61153.1 3. KAB1251309.1
4. U03610.1 4. NP_001035835.1
5. JF909299.1 5. NP_571131.1
Top Five
Search
Result
Accession
Number
Type of
Insulin mRNA Sequence Insulin Protein Sequence
Data
DDBJ (374) PDB (2925)
EMBL (170) RefSeq (32346)
GenBank (15737) UniProtKB / Swiss-Prot (3109)
INSDC [GenBank] (16281) DDBJ (551)
RefSeq (38322) EMBL (1149)
TPA (4) GenBank (31926)
PIR (53)
Source
Databases
6499 5484
Number of
Sequence
Search Hits
Specifically
for Humans
384 603
Number of
Search Hits
for
Sequences
Released
from year
2016 to 2017
for Humans
37
Number of Sequence
that may be related to
disease(s)
Type of Data Hemoglobin Protein Sequence
Number of hb protein
sequence for Danio
Rerio that are
computationally and
manually reviewed
Answers :
1. a. Nucleotide is the monomer of nucleic acids, and nucleic acid is the polymer of nucleotides.
Nucleotide is composed of phosphate group and a nitrogenous base, which are attached to
pentose sugar. As for nucleic acid is composed of a chain of nucleotides, which are linked by
phosphodiester bonds.
They also have different function, nucleotides are polymerized to form DNA or RNA, they
serve as an energy source and signal transducer. While nucleic acids are also involved in
gene expression, as the storage of genetic information. Below are the examples of
nucleotides and nucleic acids:
• Nucleotides → ATP, ADP, CMP, dGTP, ddATP
• Nucleic Acids → DNA, RNA
b. Nucleic acid is a complex organic molecule that made up of nucleotides (pentose sugars,
nitrogenous bases, and phosplinked in a long chain. As for amino acid is a simple organic
molecule, which contains both carboxyl and amino groups. Nucleic acid is a polymer and the
monomer of nucleic acid are nucleotides, while the amino acid is a monomer and the polymer
of amino acids is a protein.
Both of them also serve different roles. Nucleic acids store genetic information of the cell and
are involved in the synthesis of functional proteins. On the other hand, Amino acids are used
in the translation of mRNA as building blocks of proteins.
As for the differences between the primary and secondary biological databases, mainly as
follows:
Aspect of Differences Primary Databases Secondary Databases
When utilizing the sequences, we need to adhere to the terms and conditions set by the
database, and follow the ethical guidelines.
1. Obtain the PDB structure file from the PDB with the PDB ID of 1A6M. By following the
instructions, below are the answers to the questions:
a. The required information for the given structure obtained from PDB: