0% found this document useful (0 votes)

104 views29 pages

Lab Work

This document contains a summary of 5 practical experiments conducted by a student named Zainab Sohail in their 5th semester of studying Bioinformatics. The experiments include: 1) Retrieving gene sequences from databases; 2) Performing multiple sequence alignments; 3) Conducting phylogenetic analysis; 4) Retrieving protein sequences; 5) Predicting protein secondary and tertiary structures. For each experiment, the document provides details on the objectives, procedures, and results.

Uploaded by

Aleena Khan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

104 views29 pages

Lab Work

Uploaded by

Aleena Khan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 29

Name:

Zainab Sohail
Arid No #:
16-ARID-2582
Semester:
5th
Subject:
Bioinformatics

DEPARTMENT OF BIOCHEMISTRY
INDEX
S.NO Experiment Date Signature
1. Databases or 6-12-2018
Software
2. Retrieval of gene 13-12-2018
sequence
3. Multiple Sequence 20-12-2018
Alignment
4. Phylogenetic 27-12-2018
Analysis

5. Retrieval of 3-1-2018
protein sequence
6. Secondary 10-1-2018
Structure
Prediction
7. Tertiary Structure 17-1-2018
Prediction
8. Structure 17-1-2018
Visualization
PRACTICAL# 1:
DATABASES OR SOFTWARES
Databases:
A database is an organized collection of data, generally stored
and accessed electronically from a computer system. Where databases
are more complex, they are often developed using formal design and
modelling techniques.

Biological databases:
A biological database is a large, organized body of persistent
data, usually associated with computerized software designed to update,
query, and retrieve components of the data stored within the system.
A simple database might be a single file containing many
records, each of which includes the same set of information.

Popular databases:
A few popular databases are GenBank from NCBI (National
Centre for Biotechnology Information), Swissport from the Swiss Institute
of Bioinformatics and PIR from the Protein Information Resource.
GenBank:
GenBank (Genetic Sequence Databank) is one of the fastest
growing repositories of known genetic sequences.
EMBL:
The EMBL Nucleotide Sequence Database is a comprehensive
database of DNA and RNA sequences collected from the scientific
literature and patent applications and directly submitted from
researchers and sequencing groups.
SwissPort:
This is a protein sequence database that provides a high level of
integration with other databases and has a very low level of redundancy
(means less identical sequences are present in the database).
PRACTICAL#2:
RETRIEVAL OF GENE SEQUENCE
Procedure:
In order to retrieve a gene sequence, follow the steps given below
1. Go to NCBI database search.

2. Select “National Centre for Biotechnology Information”, this screen

will appear
3. In all Databases, enter “Gene”, search your gene name also for
example cytochrome b.

4. Your device will display all the records for this gene. Select the one
whose gene sequence you want to retrieve e.g. I have selected
CYBA. (The gene record will contain gene locus graphical
representation, gene sequence, transcript, product and related
literature information).
5. To see the gene sequence information, click on FASTA.
6. By clicking FASTA, the gene sequence will appear. By copying this
sequence, the sequence will be retrieved and can be used for further
processing.

PRACTICAL#3:
MULTIPLE SEQUENCE ALIGNMENT
Definition:
A multiple sequence alignment (MSA) is a sequence
alignment of three or more biological sequences, generally protein,
DNA, or RNA. In many cases, the input set of query sequences are
assumed to have an evolutionary relationship by which they share a
linkage and are descended from a common ancestor. From the resulting
MSA, sequence homology can be inferred, and phylogenetic analysis
can be conducted to assess the sequences' shared evolutionary origins.
Visual depictions of the alignment illustrate mutation events
such as point mutations (single amino acid or nucleotide changes) that
appear as differing characters in a single alignment column, and
insertion or deletion mutations (indels or gaps) that appear as hyphens
in one or more of the sequences in the alignment.
Multiple sequence alignment is often used to assess sequence
conservation of protein domains, tertiary and secondary structures, and
even individual amino acids or nucleotides.

Explanation:
Multiple sequence alignment also refers to the process of
aligning such a sequence set. Because three or more sequences of
biologically relevant length can be difficult and are almost always time-
consuming to align by hand, computational algorithms are used to
produce and analyse the alignments.
MSAs require more sophisticated methodologies than pairwise
alignment because they are more computationally complex. Most
multiple sequence alignment programs use heuristic methods rather
than global optimization because identifying the optimal alignment
between more than a few sequences of moderate length is prohibitively
computationally expensive.

Multiple Sequence Alignment Tools:

Some of the multiple sequence alignment tools are:

1. Kalign:
Very fast MSA tool that concentrates on local regions.
Suitable for large alignments.

2. T-Coffee:
Consistency-based MSA tool that attempts to mitigate the
pitfalls of progressive alignment methods. Suitable for small alignments.

3. WebPRANK:
The EBI has a new phylogeny-aware multiple sequence
alignment program which makes use of evolutionary information to help
place insertions and deletions.

4. Clustal Omega
New MSA tool that uses seeded guide trees and HMM profile-
profile techniques to generate alignments. Suitable for medium-large
alignments.

Procedure:
1. Search Kalign. This is the tool of multiple sequence alignment.

2. Select “ Kalign < multiple sequence alignment < EMBL-EBI”. This

screen will appear.

3. Now, select Nucleic Acid in place of protein in step 1 in above figure.

4. Now retrieve a sequence of gene from NCBI (as discussed in
previous practical).i.e. I have retrieved a sequence of cytochrome b.
Also select reference sequence line.
5. Now paste this sequence on kalign page.

6. Now select one more sequence of gene from NCBI and paste in the
same block from the next line where the first sequence pasted. (we
are selecting two and more than two sequences because this is a
multiple sequence alignment.)
7. After pasting your sequences, click “submit”.

8. Your result will appear on the screen.

PRACTICAL#4:
PHYLOGENETIC ANALYSIS
Phylogenetics:
Phylogenetics is the study of the evolutionary history and
relationships among individuals or groups of organisms. These
relationships are discovered through phylogenetic inference methods
that evaluate observed heritable traits, such as DNA sequences or
morphology under a model of evolution of these traits. The result of
these analyses is a phylogeny--a diagrammatic hypothesis about the
history of the evolutionary relationships of a group of organisms.
The tips of a phylogenetic tree can be living organisms or fossils, and
represent the "end", or the present, in an evolutionary lineage.
Phylogenetic analyses have become central to understanding
biodiversity, evolution, ecology, and genomes.

Phylogenetic Analysis:
Phylogenetic methods can be used for many purposes, including
analysis of morphological and several kinds of molecular data. These
can be used for
 Comparisons of more than two sequences
 Analysis of gene families, including functional predictions
 Estimation of evolutionary relationships among organisms

Steps for analysis:

1. Choosing the sequence type
2. Alignment of sequence data
3. Search for the best tree
4. Evaluation of tree reproducibility

Phylogenetic Methods:
Phylogenetic methods can be divided into three general categories
1. Parsimony
2. Minimum Distance
3. likelihood

1. Parsimony:
 Finds the optimum tree by minimizing the number of evolutionary
changes
 No assumption on the evolutionary pattern
 May oversimplify evolution
 May produce several equally good trees

2. Minimum distance:
 Pairwise distances can be aggregated into a phylogenetic tree
 Search for the tree that minimizes discrepancies among pairwise
distances
 May or may not use an explicit model of sequence evolution
 How the distances are calculated and how the tree is found can be
mixed and matched
 To know what method is being used, you have to know both how the
distance matrix was constructed, and how the tree was determined

3. Likelihood:
 A model of sequence evolution can be used to relate the data to a
hypothesis (typically a tree topology).
 Maximum likelihood
 Search for the tree that maximizes the likelihood function
 The idea is to find the tree that is most likely given the data and the
model.

Properties of analytical methods

1. Consistency
A method is consistent if it is more likely to find the correct
answer with more data.
2. Power
A method is powerful if it can find the correct answer with
very few data.
3. Accuracy
A method is accurate if in multiple trials it produces answers
that follow a normal distribution centered on the correct answer.
4. Precision
A method is precise if in multiple trials it finds answers that
are very close to each other.

Procedure:

1. In the results of kalign, at the top you will see the option of
phylogenetic tree. Select it.

2. Your result will appear on the screen.

PRACTICAL#5:
RETRIEVAL OF PROTEIN SEQUENCE
Procedure:
In order to retrieve a protein sequence, follow the steps given below
1. Go to NCBI database search.

2. Select “National Centre for Biotechnology Information”, this screen

will appear.

3. In all Databases, enter “Protein”, search your protein name also for
example haemoglobin homo sapiens.
4. Your device will display all the records for this protein. Select the one
whose protein sequence you want to retrieve e.g. I have selected
beta-globin.

5. To find protein sequence, click FASTA.

6. By clicking FASTA, the gene sequence will appear. By copying this
sequence, the sequence will be retrieved and can be used for further
processing.

PRACTICAL#6:
SECONDARY STRUCTURE PREDICTION
Introduction:
Secondary structure prediction is a set of techniques in
bioinformatics that aim to predict the secondary structures of proteins
and nucleic acid sequences based only on knowledge of their primary
structure. For proteins, this means predicting the formation of protein
structures such as alpha helices and beta strands, while for nucleic
acids it means predicting the formation of nucleic acid structures like
helixes and stem-loop structures through base pairing and base stacking
interactions.

Procedure:
In order to predict secondary structure of a protein, follow the
following steps:
1. Go to ScanProsite search.

2. Select ScanProsite, the following screen will appear.

3. In step 1, Retrieve a protein sequence from NCBI and paste it in a
box shown in figure above.

4. In step 2, select option according to your demand and start the scan.
5. The result will appear on the screen.

PRACTICAL#7:
TERTIARY STRUCTURE PREDICTION
Introduction:
Protein tertiary structure refers to the 3-dimentional form of the
protein, presented as a polypeptide chain backbone with one or more
protein secondary structures, the protein domains.
Determining the tertiary structure of a protein can be achieved
by x-ray crystallography, nuclear magnetic resonance, and dual
polarization interferometry.
Alternatively, protein tertiary structure can be predicted using
specific algorithm and software tools based on amino acid sequence.

Procedure:
1. Go to Swiss Model search.

2. Enter start modelling, the following screen will appear.

3. Retrieve a protein sequence from NCBI and paste it in a box shown in
figure above. Select “Build Model”.
4. The Model results are given below.

PRACTICAL#8:
STRUCTURE VISUALIZATION TOOLS

Visualization:
Visualization tools allow us to
• see 3D structure data.
• communicate features about 3-D structures to colleagues.
• illustrate biological processes (catalytic/binding).
• educate laypersons about structural biology.

Tools:
1. RasMol
2. JMol
3. CHIME
4. PyMol
5. Swiss 3D viewer
1. RasMol:
This tool was developed by Roger Sayle. It is an
Open source, binaries available. RasMol is widely used,
simple to use (menus) for simple operations. The Complex
operations require command-line interface.
2. JMol:
Jmol can connect to certain databases in order to directly
retrieve structures. This applies to the Jmol application, to the JSmol
HTML5 object and to the Jmol signed Java applet. (The unsigned applet
is not allowed connection to external servers and so does not support
this method.
3. CHIME:
CHIME stands for “Chemical mIME”. It is a free molecular
viewer web browser plugin based on Rasmol and was developed by
MDL Information Systems.
4. PyMOL:
It is the set of structure tools built on top of Python and
supports all Standard Features. PyMOL is extensible, scriptable native
ray tracer. It is a freely available tool.
5. Swiss PDB Viewer:
This tool is for superimposition to compare proteins and
their components such as active/binding sites. Some of the functions of
this tool are measure angles, distances between atoms, Manual or
automated (Swiss-Model) homology modelling including loop modelling,
Threading (Fold recognition), Mutations and Energy minimization,
Electron density map reading and model building (crystallography data)
and Interface to POV-Ray rendering software.

Types of Vectors
100% (1)
Types of Vectors
28 pages
(Methods in Molecular Biology 1525) Jonathan M. Keith (Eds.) - Bioinformatics - Volume I - Data, Sequence Analysis, and Evolution-Humana Press (2017)
100% (3)
(Methods in Molecular Biology 1525) Jonathan M. Keith (Eds.) - Bioinformatics - Volume I - Data, Sequence Analysis, and Evolution-Humana Press (2017)
489 pages
MCQ A
No ratings yet
MCQ A
11,493 pages
Phylogenetic Tree
No ratings yet
Phylogenetic Tree
25 pages
(Methods in Molecular Biology, 2231) Kazutaka Katoh - Multiple Sequence Alignment - Methods and Protocols-Humana (2020)
No ratings yet
(Methods in Molecular Biology, 2231) Kazutaka Katoh - Multiple Sequence Alignment - Methods and Protocols-Humana (2020)
322 pages
BIOINFORMATICS
No ratings yet
BIOINFORMATICS
85 pages
Blast
100% (1)
Blast
21 pages
Bioinformatics Database and Applications
100% (3)
Bioinformatics Database and Applications
82 pages
Swami
No ratings yet
Swami
12 pages
Databases Bioinformatics
No ratings yet
Databases Bioinformatics
42 pages
A2 Gene Technology 1 N
No ratings yet
A2 Gene Technology 1 N
6 pages
GTGF GGCF
No ratings yet
GTGF GGCF
19 pages
Phylogenetic Trees
No ratings yet
Phylogenetic Trees
11 pages
PAM Blosum: Assignment 1 Bioinformatics (DSE 1)
100% (3)
PAM Blosum: Assignment 1 Bioinformatics (DSE 1)
9 pages
Exercise 7 Bioinformatics
No ratings yet
Exercise 7 Bioinformatics
8 pages
Unit IV
No ratings yet
Unit IV
11 pages
Multiple Sequence Alignment
No ratings yet
Multiple Sequence Alignment
19 pages
Edutxt 20
No ratings yet
Edutxt 20
12 pages
Bioinformatics:: Guide To Bio-Computing and The Internet
No ratings yet
Bioinformatics:: Guide To Bio-Computing and The Internet
34 pages
Phylogenetic Analysis
100% (1)
Phylogenetic Analysis
27 pages
IBB - MB.501 Mol. Phylogeny
No ratings yet
IBB - MB.501 Mol. Phylogeny
81 pages
Exercises For Phylogeny: Exercise 1. Parsimony and Rooted Versus Unrooted Trees
No ratings yet
Exercises For Phylogeny: Exercise 1. Parsimony and Rooted Versus Unrooted Trees
11 pages
Latthika
No ratings yet
Latthika
21 pages
6.2 MEGA Workshop
No ratings yet
6.2 MEGA Workshop
3 pages
Basics of Bioinformatics
100% (7)
Basics of Bioinformatics
99 pages
Bioinfo Course Notes M1 2020 DR Mbulli
No ratings yet
Bioinfo Course Notes M1 2020 DR Mbulli
56 pages
Tentative Course List (Jan - April 2024)
No ratings yet
Tentative Course List (Jan - April 2024)
124 pages
Springer Nature Journals List 2017: Complete Catalogue Including Open Access Journals
No ratings yet
Springer Nature Journals List 2017: Complete Catalogue Including Open Access Journals
224 pages
ESP102 كتاب الهندسة - 220831 - 093711
No ratings yet
ESP102 كتاب الهندسة - 220831 - 093711
99 pages
Disclaimer
No ratings yet
Disclaimer
36 pages
Multiple Sequence Alignment
No ratings yet
Multiple Sequence Alignment
17 pages
Msa MTech
No ratings yet
Msa MTech
17 pages
BioinfoMethods-I Lab03 r2025
No ratings yet
BioinfoMethods-I Lab03 r2025
14 pages
Biotechnology
No ratings yet
Biotechnology
29 pages
Genome Editing Training Brichure - ICAR IIAB
No ratings yet
Genome Editing Training Brichure - ICAR IIAB
3 pages
Multiple Sequence Alignment Part 1
No ratings yet
Multiple Sequence Alignment Part 1
64 pages
Metagenomics
100% (1)
Metagenomics
19 pages
Biological Databases
No ratings yet
Biological Databases
15 pages
Unit 6 - Bioinformatics
No ratings yet
Unit 6 - Bioinformatics
41 pages
Phylogenetic Tree
No ratings yet
Phylogenetic Tree
12 pages
Phylogeny Notes
No ratings yet
Phylogeny Notes
14 pages
Second Semester Examinations Question Paper - Computational Genomics
No ratings yet
Second Semester Examinations Question Paper - Computational Genomics
6 pages
Module - 4 - Reference Course Content
No ratings yet
Module - 4 - Reference Course Content
25 pages
Advanced Science - 2020 - Manghwar - CRISPR Cas Systems in Genome Editing Methodologies and Tools For sgRNA Design
No ratings yet
Advanced Science - 2020 - Manghwar - CRISPR Cas Systems in Genome Editing Methodologies and Tools For sgRNA Design
16 pages
Vimal Roll No 2211022 ANALYSIS TOOL. PHYLIPpptx
No ratings yet
Vimal Roll No 2211022 ANALYSIS TOOL. PHYLIPpptx
27 pages
Engineering Disciplines: Dr. Frank B. Flanders and Katherine Hudson
No ratings yet
Engineering Disciplines: Dr. Frank B. Flanders and Katherine Hudson
32 pages
Analysis of Protein Sequence Alignment and Phylogenetic Tree Construction
No ratings yet
Analysis of Protein Sequence Alignment and Phylogenetic Tree Construction
9 pages
Activity 9
No ratings yet
Activity 9
11 pages
Multiple Sequence Alignment
No ratings yet
Multiple Sequence Alignment
18 pages
SunulanDersListesi 20241026 085040
No ratings yet
SunulanDersListesi 20241026 085040
18 pages
Bioinformatics Model Exam
No ratings yet
Bioinformatics Model Exam
18 pages
Balamurugan
No ratings yet
Balamurugan
17 pages
Module - 6 - Reference Course Content
No ratings yet
Module - 6 - Reference Course Content
16 pages
Phylogeny
No ratings yet
Phylogeny
21 pages
Experiment 9 Bioinformatics Tools For Cell and Molecular Biology
No ratings yet
Experiment 9 Bioinformatics Tools For Cell and Molecular Biology
11 pages
Bioinformatics Lab Notebook: Comsats University, Islamabad
No ratings yet
Bioinformatics Lab Notebook: Comsats University, Islamabad
27 pages
Multiple Sequence Alignment and Phylogenetic Analysis
No ratings yet
Multiple Sequence Alignment and Phylogenetic Analysis
17 pages
Lecture 3.3 (Multiple Sequence Alignment and Phylogeny) (SEAS)
No ratings yet
Lecture 3.3 (Multiple Sequence Alignment and Phylogeny) (SEAS)
19 pages
10 1109@gucon 2018 8675069
No ratings yet
10 1109@gucon 2018 8675069
4 pages
00 Endterm Activity Building A Phylogenetic Tree
No ratings yet
00 Endterm Activity Building A Phylogenetic Tree
6 pages
Phylogenomics An Introduction Full Download
No ratings yet
Phylogenomics An Introduction Full Download
16 pages
Evolution of Myxozoan Mitochondrial Genomes: Insights From Myxobolids
No ratings yet
Evolution of Myxozoan Mitochondrial Genomes: Insights From Myxobolids
15 pages
04-Alinemiento Múltiple de Secuencias
No ratings yet
04-Alinemiento Múltiple de Secuencias
14 pages
Lab 4: Phylogenetics: Bioinformatic Methods I Lab 4
No ratings yet
Lab 4: Phylogenetics: Bioinformatic Methods I Lab 4
20 pages
9.4 Genetic Engineering
No ratings yet
9.4 Genetic Engineering
11 pages
Sequence Alignments: Felix Sappelt Irina Wagner
100% (1)
Sequence Alignments: Felix Sappelt Irina Wagner
34 pages
Lecture 5
No ratings yet
Lecture 5
8 pages
IGCSE Biology B19 Upgraded TopicGuide
No ratings yet
IGCSE Biology B19 Upgraded TopicGuide
3 pages
BIO 401 (Phylogenetics and Sequence Alignments)
No ratings yet
BIO 401 (Phylogenetics and Sequence Alignments)
3 pages
Multiple Sequence Alignment For Construction of Phylogenetic Tree
No ratings yet
Multiple Sequence Alignment For Construction of Phylogenetic Tree
5 pages
Martinez - Phylogenetics Assignment-1
No ratings yet
Martinez - Phylogenetics Assignment-1
3 pages
75tentative Course List July Dec 2024
No ratings yet
75tentative Course List July Dec 2024
5 pages
Bioinformatics Practical Part Iii
No ratings yet
Bioinformatics Practical Part Iii
4 pages
Exploring Database and Analyzing Protein Sequence
No ratings yet
Exploring Database and Analyzing Protein Sequence
70 pages
ModelQuestions MID Spring2024
No ratings yet
ModelQuestions MID Spring2024
5 pages
Bioinformatics-And-Phylogeny
No ratings yet
Bioinformatics-And-Phylogeny
14 pages
Multiple Sequence Alignment Tools: Tutorials and Comparative Analysis
No ratings yet
Multiple Sequence Alignment Tools: Tutorials and Comparative Analysis
19 pages
Bio Info Practicles
No ratings yet
Bio Info Practicles
12 pages
Sisteamtika Filogenetik Melly
No ratings yet
Sisteamtika Filogenetik Melly
11 pages
15activation of B Lymphocytes-II
No ratings yet
15activation of B Lymphocytes-II
12 pages
14activation of B Lymphocytes-I
No ratings yet
14activation of B Lymphocytes-I
11 pages
Bio Tools Booklet
No ratings yet
Bio Tools Booklet
5 pages
Mini Review 2
No ratings yet
Mini Review 2
10 pages
Spyder
No ratings yet
Spyder
1 page
Ec 94
No ratings yet
Ec 94
2 pages
BT403 QP
No ratings yet
BT403 QP
2 pages
Lecture 3 and 4 LSM2241
No ratings yet
Lecture 3 and 4 LSM2241
6 pages
Selected Topics Are Sleep Disorders and Alzheimer's Disease & Korsakoff's Syndrome. Analysis of Sleep Disorders
No ratings yet
Selected Topics Are Sleep Disorders and Alzheimer's Disease & Korsakoff's Syndrome. Analysis of Sleep Disorders
1 page
Glofish Pets
No ratings yet
Glofish Pets
2 pages
Bioinformatics Unveiled
From Everand
Bioinformatics Unveiled
Joan Melody
No ratings yet
Bioinformatics: Merging Biology and Technology
From Everand
Bioinformatics: Merging Biology and Technology
Mani Devar
No ratings yet
Introduction to Bioinformatics Using Action Labs
From Everand
Introduction to Bioinformatics Using Action Labs
Jean-Louis Lassez
5/5 (1)
Neuroevolution: Fundamentals and Applications for Surpassing Human Intelligence with Neuroevolution
From Everand
Neuroevolution: Fundamentals and Applications for Surpassing Human Intelligence with Neuroevolution
Fouad Sabry
No ratings yet

Lab Work

Uploaded by

Lab Work

Uploaded by

Name:

2. Select “National Centre for Biotechnology Information”, this screen

Multiple Sequence Alignment Tools:

2. Select “ Kalign < multiple sequence alignment < EMBL-EBI”. This

3. Now, select Nucleic Acid in place of protein in step 1 in above figure.

8. Your result will appear on the screen.

Steps for analysis:

Properties of analytical methods

2. Your result will appear on the screen.

2. Select “National Centre for Biotechnology Information”, this screen

5. To find protein sequence, click FASTA.

2. Select ScanProsite, the following screen will appear.

2. Enter start modelling, the following screen will appear.

You might also like