Bioinformatics
Bioinformatics
Data
Database
OMIM database
Online Mendelian Inheritance in Man
http://www.ncbi.nlm.nih.gov/Omim
PubMed
1.It enables user to do keyword searches, provides links to a
selection of full articles, and has text mining capabilities, e.g.
provides links to related articles, and GenBank entries,
among others.
2.It contains entries for more than 30 million abstracts of
scientific publications.
ENTREZ (NCBI, USA) Used to access literature (abstracts), sequence and structure db
DNAPLOT (EBI, UK) Sequence alignment tool
LOCUS LINK (NCBI, Assessing information on homologous genes
USA)
LIGAND (GenomNet, A chemical db, allows search for a combination of enzymes and links
Japan) to all publically accessible db.
BRITE (GenomNet, Biomolecular relations information transmission and expression db;
Japan) links to all publically accessible db.
TAXONOMY BROWSER Taxonomic classification of various species as well as genetic
(NCBI, USA) information
STRUCTURE It support Molecular Modelling Database (MMDB) and software
tools forKsusturmucYatduarv,eDaepnaratmyl enstsi of Biochemistry
BLAST
(Basic Local Alignment Search Tool)
for Homology Analyses
• BLASTn
– Nucleotide query vs nucleotide database
• BLASTp
– protein query vs protein database
• BLASTx
– automatic 6-frame translation of nucleotide query vs protein database
– If you have a DNA sequence and you want to now what protein (if any) it
encodes, you can perform BLASTx search.
• tBLASTn
– protein query vs automatic 6-frame translation of nucleotide database
– You can use this program to ask whether a DNA or ESTs db contains a
nucleotide sequence encoding a protein that matches your protein of
interest.
• tBLASTx
– automatic 6-frame translation of nucleotide query vs automatic 6-frame
translation of nucleotKiduseumdYaadtaav,bDaepsaertment of Biochemistry
BLAST
(Basic Local Alignment Search Tool)
for Homology Analyses
Program Input
Database
1
BLASTn DNA DNA
1
BLASTp protein protein
6
DNA
6
tBLASTn
BLASTx protein protein
36
tBLASTx DNA DNA
Kusum Yadav, Department of Biochemistry
SEQUENCE ALIGNMENT
What is Sequence Alignment ?
A sequence alignment is a way of arranging the sequences of DNA
or protein to identify regions of similarity that may be a
consequence of functional, structural, or evolutionary
relationships between the sequences.
Definitions
Similarity
T h e extent to which nucleotide or protein s e que nce s are
r e l a t e d . It i s b a s e d u p o n i d e n t i t y p l u s c o n s e r v a t i o n .
Identity
T h e extent to which t w o s e q u e n c e s are invariant.
Conservation
C h a n g e s at a specific position of a n a m i n o acid or (less
commonly, D N A ) s e q u e n c e that preserve the physico-
chemical properties of the original residue.
• Pairwise alignment
• Multiple Alignment
51 :LFLQDNIVAEFSVDETGQMSATAKGRVR.LLNNWD..VCADMVGTFTDTE
| | | | :: | .| . || |: || |. 97 RBP
45 ISLLDAQSAPLRV.YVEELKPTPEGDLEILLQKWENGECAQKKIIAEKTK
93 lactoglobulin
98 DPAKFKMKYWGVASFLQKGNDDHWIVDTDYDTYAV...........QYSC
94 IPAVFKIDALNENKVL........VLDTDYKKYLLFCMENSAEPEQSLAC
136 RBP 135 lactoglobulin
|| ||. | :.|||| | . .|
137 RLLNLDGTCADSYSFVFSRDPNGLPPEAQKIVRQRQ.EELCLARQYRLIV 185 RBP
. | | | : || . | || |
136 QCLVRTPEVDDEALEKFDKALKALPMHIRLSFNPTQLEEQCHI....... 178 lactoglobulin
1 MKWVWALLLLAAWAAAERDCRVSSFRVKENFDKARFSGTWYAMAKKDPEG 50 RBP
. ||| | . |. . . | : .||||.:| :
1 ...MKCLLLALALTCGAQALIVT..QTMKGLDIQKVAGTWYSLAMAASD. 44 lactoglobulin
51 LFLQDNIVAEFSVDETGQMSATAKGRVR.LLNNWD..VCADMVGTFTDTE 97 RBP
: | | | | :: | .| . || |: || |.
45 ISLLDAQSAPLRV.YVEELKPTPEGDLEILLQKWENGECAQKKIIAEKTK
93 lactoglobulin
98 DPAKFKMKYWGVASFLQKGNDDHWIVDTDYDTYAV...........QYSC
136 RBP
|| ||. | :.|||| | . .|
RQRQ.EELCLA
94 IPAVFKIDALNENKVL........VLDTDYKKYLLFCMENSAEPEQSLAC
135.lactoglobulin
| | | : || . | || | (bar)
136 QCLVRTPEVDDEALEKFDKALKALPMHIRLSF NPTQLEEQCHI ....... 178 lactoglobulin
1 MKWVWALLLLAAWAAAERDCRVSSFRVKENFDKARFSGTWYAMAKKDPEG 50 RBP
. ||| | . |. . . | : .||||.:| :
1 ...MKCLLLALALTCGAQALIVT..QTMKGLDIQKVAGTWYSLAMAASD. 44 lactoglobulin
51 LFLQDNIVAEFSVDETGQMSATAKGRVR.LLNNWD..VCADMVGTFTDTE 97 RBP
: | | | | :: | .| . || |: || |.
45 ISLLDAQSAPLRV.YVEELKPTPEGDLEILLQKWENGECAQKKIIAEKTK
93 lactoglobulin
98 DPAKFKMKYWGVASFLQKGNDDHWIVDTDYDTYAV...........QYSC
136 RBP
|| ||. | :.|||| | . .|
Very
DSYSFVFSRDPNGLP PEAQKIVRQRQ.EELC LARQYRLIV 185 RBP
94 IPAVFKIDALNENKVL........VLDTDYKKYLLFCMENSAEPEQSLAC
137 RLLNLDGTCA Somewhat
136 QCLVRTPEVD
|
similar
135.lactoglobulin
| | : |
| .
similar
HI....... 178 lactoglobulin
(one dot)
DEALEKFDKALKALP | || | (two dots)
MHIRLSFNPTQLEEQC
51 :LFLQDNIVAEFSVDETGQMSATAKGRVR.LLNNWD..VCADMVGTFTDTE
| | | | :: | .| . || |: || |. 97 RBP
45 ISLLDAQSAPLRV.YVEELKPTPEGDLEILLQKWENGECAQKKIIAEKTK
93 lactoglobulin
98 DPAKFKMKYWGVASFLQKGNDDHWIVDTDYDTYAV...........QYSC
94 IPAVFKIDALNENKVL........VLDTDYKKYLLFCMENSAEPEQSLAC
136 RBP 135 lactoglobulin
|| ||. | :.|||| | . .|
137 RLLNLDGTCADSYSFVFSRDPNGLPPEAQKIVRQRQ.EELCLARQYRLIV 185 RBP
. | | | : || . | || |
136 QCLVRTPEVDDEALEKFDKALKALPMHIRLSFNPTQLEEQCHI....... 178 lactoglobulin
Internal Termina
gap
Kusum Yadav, Department of Biochemistry
l gap
Kusum Yadav, Department of Biochemistry
Sequence Analyses for relatedness
• Homologs: similar sequences in different organisms derived
from a common ancestor sequence.
• Orthologs : homologous sequences in different related species
that arose from a common ancestral gene during speciation.
Orthologs are presumed to have similar biological function.
e.g. Human and rats myoglobins both transport oxygen in
muscle
• Paralogs: homologous genes within the same organism
e.g. human α and β globins are paralogs. Paralogs are the
result of gene duplication events
• Xenologs: similar sequences that have arisen out of horizontal
transfer events (symbiosis, viruses, etc)
Kusum Yadav, Department of Biochemistry
Multiple sequence Alignment
• Partial or complete alignment of three or
more related proteins/ nucleotide sequences
• Conserved domain analysis
• Primer Designing
• CLUSTALW
• T-Coffee
• MUSCLE
• KALIGN
• CLC & GCG WorkBench
MEGA
PHYLIP
PAUP
Treeview
ODEN
PHYLOWIN
TREECON
DENDRON