Lecture 6- Sequence Analysis
Lecture 6- Sequence Analysis
Introduction to
Bioinformatics
Lecture 6 – Sequence Analysis
Dr. Athira B
Asst. Professor, CSE
IIIT Kottayam
Suppose you have given a set of new DNA
sequences and ask to identify the
Functional/Structural/Biological features?
How you can do this analysis?
One solution is compare with already existing
known sequences- how they are similar?
How to do this similarity checking?
Sequence Analysis
Process of subjecting a DNA, RNA or peptide sequence to any of a
wide range of analytical methods to understand its features, function,
structure, or evolution.
Objectives:
To find similarity, often to infer if they are related (homologous)
To identify intrinsic features of the sequence such as active sites,
post translational modification sites, gene-structures, reading
frames, distributions of introns and exons and regulatory elements
To identify sequence differences and variations such as point
mutations and single nucleotide polymorphism (SNP) in order to
get the genetic marker.
Revealing the evolution and genetic diversity of sequences and
organisms
Identification of molecular structure from sequence alone
Methods
Needleman-Wunsch
Pairwise global alignment only.
Smith-Waterman
Pairwise, local (or global) alignment.
BLAST
Pairwise heuristic local alignment
The Needleman-Wunsch algorithm
Gap penalty = -5
Step 1
Step2
Final Table
Exercise