Lecture 6 - Sequence Analysis

Uploaded by

aletimanaswini

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

28 views28 pages

Lecture 6 - Sequence Analysis

Uploaded by

aletimanaswini

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 28

ISC 211

Introduction to
Bioinformatics
Lecture 6 – Sequence Analysis
Dr. Athira B
Asst. Professor, CSE
IIIT Kottayam
 Suppose you have given a set of new DNA
sequences and ask to identify the
Functional/Structural/Biological features?
 How you can do this analysis?
 One solution is compare with already existing
known sequences- how they are similar?
 How to do this similarity checking?
Sequence Analysis
 Process of subjecting a DNA, RNA or peptide sequence to any of a
wide range of analytical methods to understand its features, function,
structure, or evolution.
 Objectives:
 To find similarity, often to infer if they are related (homologous)
 To identify intrinsic features of the sequence such as active sites,
post translational modification sites, gene-structures, reading
frames, distributions of introns and exons and regulatory elements
 To identify sequence differences and variations such as point
mutations and single nucleotide polymorphism (SNP) in order to
get the genetic marker.
 Revealing the evolution and genetic diversity of sequences and
organisms
 Identification of molecular structure from sequence alone
Methods

 Sequence Alignment - Pairwise and Multiple

sequence
 Comparison against large databases
Sequence Alignment

 Procedure of comparing two or more sequences by

searching for a series of individual characters or
character patterns
 Identify same characters in the same row
 Alignment can be local/global
Sequence Alignment
 Biological Problem
 Sequence alignment is a way of arranging protein (or DNA)
sequences to identify regions of similarity that may be a
consequence of evolutionary relationships between the sequences.
 Genome sequencing allows comparison of organisms at DNA and
protein levels
 Comparisons can be used to
 Find evolutionary relationships between organisms
 Identify functionally conserved sequences
 Identify corresponding genes in human and model organisms:
develop models for human diseases
Sequence Homology

 Homology: genes that derive from a common ancestor-gene are

called homologs
 Orthologous genes are homologous genes in different organisms
 Paralogous genes are homologous genes in one organism that
derive from gene duplication
 Gene duplication: one gene is duplicated in multiple copies that
therefore free to evolve and assume new functions
Sequence similarity

 Intuitively, similarity of two sequences refers to the

degree of match between corresponding positions in
sequence
 Sequence similarity is not sequence homology
 Homology is more difficult to detect over greater
evolutionary distances
Causes of Gene (dis) similarity

 Mutation: a nucleotide at a certain location is replaced by

another
nucleotide ATA → AGA
 Insertion: at a certain location one new nucleotide is inserted
in
between two existing nucleotides (e.g.: AA → AGA)
 Deletion: at a certain location one existing nucleotide is
deleted (e.g.: ACTG → AC-G)
 Indel: an insertion or a deletion
Sequence Alignment

 Find the similarity between two (or more) DNA-sequences by

finding
a good alignment between them
 Alignment specifies which positions in two sequences match
Sequence Alignment
 Sequence alignment is an arrangement of two or more
sequences,
highlighting their similarity.
 The sequences are padded with gaps (dashes) so that
wherever
possible, columns contain identical characters from the
sequences
involved
Sequence Alignment

 Pairwise Sequence Alignment: methods are concerned with

finding
the best-matching piece-wise local or global alignments of protein
(amino acid) or DNA (nucleic acid) sequences.
 Global Alignment: an alignment in which all the characters in
both
sequences participate in the alignment.
 Local Alignment: a matching two sequence from regions which
have
more similar with each other
Algorithms

 Needleman-Wunsch
Pairwise global alignment only.
 Smith-Waterman
Pairwise, local (or global) alignment.
 BLAST
Pairwise heuristic local alignment
The Needleman-Wunsch algorithm

 The Needleman-Wunsch algorithm (1970, J Mol Biol. 48(3):443-

53)
performs a global alignment on two sequences (s and t) and is
applied to align protein or nucleotide sequences.
 The Needleman-Wunsch algorithm is an example of dynamic
programming, and is guaranteed to find the alignment with the
maximum score.
 Eg: sequences
where s(xi , yj ) is the substitution cost and d is the gap penalty
Dynamic Programming-steps

1. Initialization of the score matrix

2. Calculation of scores and filling the traceback matrix
3. Deducing the alignment from the traceback matrix
Let’s work on this simple example

 Input: AAG (sequence #1) , AGC (sequence #2)

 Gap penalty = -5
 Step 1
 Step2

 Final Table
Exercise

 Seq 1: GAATTC Seq 2 : GATAC

 Match = 2 , mismatch = -1, gap = -2
Smith-Waterman local (or global) alignment.
Example 1 :
Seq 1: GAATTC Seq 2 : GATAC
Match = 2 , mismatch = -1, gap = -2
Example 2

 Seq 1: GAATTCAT Seq 2 : CCTCATG

 Starting score: 0, match = 2, mismatch = -1, gap = -2

Morse Fall Risk Assessment Tool
100% (7)
Morse Fall Risk Assessment Tool
2 pages
PM PFC Matrix
100% (1)
PM PFC Matrix
4 pages
Curiculum Vitae
No ratings yet
Curiculum Vitae
17 pages
Critical Care Nursing Assignment
50% (2)
Critical Care Nursing Assignment
13 pages
The Estrogen Question
No ratings yet
The Estrogen Question
8 pages
Sequence Alignment
No ratings yet
Sequence Alignment
25 pages
5 Sequence Alignment
No ratings yet
5 Sequence Alignment
21 pages
Unit 3 Sequence Alignment and Phylogenetic Tree
No ratings yet
Unit 3 Sequence Alignment and Phylogenetic Tree
70 pages
Sequence Analysis - Alignment
No ratings yet
Sequence Analysis - Alignment
57 pages
Introduction-To-Computational Biology
No ratings yet
Introduction-To-Computational Biology
61 pages
Chap 03 BioInfo
No ratings yet
Chap 03 BioInfo
15 pages
Unit 2.1
No ratings yet
Unit 2.1
77 pages
Local and Global Sequence Alignment 12 by DR Sheikh Arslan Sehgal
No ratings yet
Local and Global Sequence Alignment 12 by DR Sheikh Arslan Sehgal
59 pages
Computational Biology (3) Alignment Algorithms: by Dr. Safynaz Abdel-Fattah Computer Science Department
No ratings yet
Computational Biology (3) Alignment Algorithms: by Dr. Safynaz Abdel-Fattah Computer Science Department
107 pages
Sequence Alignment
No ratings yet
Sequence Alignment
36 pages
Need & Emergence of The Field: Speaker Shashi Shekhar Head of Computational Section Biowits Life Sciences
No ratings yet
Need & Emergence of The Field: Speaker Shashi Shekhar Head of Computational Section Biowits Life Sciences
59 pages
Module II
No ratings yet
Module II
51 pages
Sequence Alignment Methods
No ratings yet
Sequence Alignment Methods
32 pages
Msa MTech
No ratings yet
Msa MTech
17 pages
Sequence Alignment: Sequence Alignment Is The Most Important Task in Bioinformatics!
No ratings yet
Sequence Alignment: Sequence Alignment Is The Most Important Task in Bioinformatics!
13 pages
Unit3 Final
No ratings yet
Unit3 Final
114 pages
Sequence Analysis in Bioinformatics
No ratings yet
Sequence Analysis in Bioinformatics
18 pages
Sequence Alignment Presentation
No ratings yet
Sequence Alignment Presentation
27 pages
Module 3 CSE3069 (Bioinformatics)
No ratings yet
Module 3 CSE3069 (Bioinformatics)
57 pages
Chapter 2 Bioinformatics
No ratings yet
Chapter 2 Bioinformatics
9 pages
Lecture 4
No ratings yet
Lecture 4
22 pages
Sequence Alignment
No ratings yet
Sequence Alignment
27 pages
Bioinfo Notes 2
No ratings yet
Bioinfo Notes 2
9 pages
Alignment Methods
No ratings yet
Alignment Methods
33 pages
Disclaimer
No ratings yet
Disclaimer
22 pages
Genomic Sequence Alignment
No ratings yet
Genomic Sequence Alignment
25 pages
Sequence Alignment
No ratings yet
Sequence Alignment
18 pages
Unit - Ii Sequence Analysis: Pair-Wise Sequence Comparison
No ratings yet
Unit - Ii Sequence Analysis: Pair-Wise Sequence Comparison
17 pages
Dynamic Programming Methods in Pairwise Alignment
No ratings yet
Dynamic Programming Methods in Pairwise Alignment
41 pages
Sequence Alignment
No ratings yet
Sequence Alignment
9 pages
Sequence Alignment
No ratings yet
Sequence Alignment
22 pages
Sequence Alignment
No ratings yet
Sequence Alignment
24 pages
W03 Pairwise
No ratings yet
W03 Pairwise
55 pages
Importance and Significance of Sequence Alignment - pptx12
No ratings yet
Importance and Significance of Sequence Alignment - pptx12
15 pages
Bioinformatics Pairwise Alignment
No ratings yet
Bioinformatics Pairwise Alignment
128 pages
Sequence Alignment Methods Final
No ratings yet
Sequence Alignment Methods Final
69 pages
Blast 2 Sequences, A New Tool For Comparing Protein and Nucleotide Sequences
No ratings yet
Blast 2 Sequences, A New Tool For Comparing Protein and Nucleotide Sequences
17 pages
Introduction To Bioinformatics Presentation
No ratings yet
Introduction To Bioinformatics Presentation
13 pages
Sequence Alignments and Its Types, Applications
No ratings yet
Sequence Alignments and Its Types, Applications
27 pages
Sequence Alignment Methods and Algorithms
No ratings yet
Sequence Alignment Methods and Algorithms
37 pages
Sequence Alignment Methods and Algorithms
75% (4)
Sequence Alignment Methods and Algorithms
37 pages
BLAST and Sequence Alignment
No ratings yet
BLAST and Sequence Alignment
36 pages
BT302 L3 Psa
No ratings yet
BT302 L3 Psa
47 pages
Data Mining-Mining Sequence Patterns in Biological Data
No ratings yet
Data Mining-Mining Sequence Patterns in Biological Data
6 pages
Sequences Alignments (Similarity & Homology)
No ratings yet
Sequences Alignments (Similarity & Homology)
32 pages
Sequence Alignment Write
No ratings yet
Sequence Alignment Write
17 pages
B.I Sec 4.
No ratings yet
B.I Sec 4.
18 pages
Genomics and Similarity Search
No ratings yet
Genomics and Similarity Search
43 pages
Bioinformatics Alignment
No ratings yet
Bioinformatics Alignment
128 pages
Bio Medical Tics - Sequence Analysis - Alignment - 2011
No ratings yet
Bio Medical Tics - Sequence Analysis - Alignment - 2011
96 pages
Bioinformatics_2
No ratings yet
Bioinformatics_2
26 pages
Sequence Alingment
No ratings yet
Sequence Alingment
10 pages
Bioinformatics Seminar3rdOct18
No ratings yet
Bioinformatics Seminar3rdOct18
25 pages
Sequence Analysis - Pairwise Alignment
No ratings yet
Sequence Analysis - Pairwise Alignment
26 pages
Pairwise Alignment Prelab PDF
No ratings yet
Pairwise Alignment Prelab PDF
87 pages
03 - Sequence Alignment
No ratings yet
03 - Sequence Alignment
4 pages
CE6068 Lecture 5
No ratings yet
CE6068 Lecture 5
83 pages
Sequence Alignment: "Continuing.." (5th Week)
No ratings yet
Sequence Alignment: "Continuing.." (5th Week)
61 pages
Neuroevolution: Fundamentals and Applications for Surpassing Human Intelligence with Neuroevolution
From Everand
Neuroevolution: Fundamentals and Applications for Surpassing Human Intelligence with Neuroevolution
Fouad Sabry
No ratings yet
Reordering Life: Knowledge and Control in the Genomics Revolution
From Everand
Reordering Life: Knowledge and Control in the Genomics Revolution
Stephen Hilgartner
No ratings yet
Antenatal Assessment and Care
No ratings yet
Antenatal Assessment and Care
12 pages
Dr. Diana Hylton, MD: Neurology - Female - Age 69
No ratings yet
Dr. Diana Hylton, MD: Neurology - Female - Age 69
13 pages
Principles of Endodontic Surgery: Chapter Outline
No ratings yet
Principles of Endodontic Surgery: Chapter Outline
42 pages
Christine Mikstas (RDN) - Health Benefits of Coffee and Tea
No ratings yet
Christine Mikstas (RDN) - Health Benefits of Coffee and Tea
11 pages
Atrial Septal Defect - 7 Year Old
No ratings yet
Atrial Septal Defect - 7 Year Old
1 page
Antenatal Principles of Antenatal Care 2017
100% (3)
Antenatal Principles of Antenatal Care 2017
60 pages
Ej 0536854
No ratings yet
Ej 0536854
1 page
Temporomandibular Joint Comorbidities
No ratings yet
Temporomandibular Joint Comorbidities
14 pages
NCP 2 - Pedia
No ratings yet
NCP 2 - Pedia
7 pages
Pediatric Asthma
No ratings yet
Pediatric Asthma
8 pages
CABALLERO - BIO 4 - Qtr3 - Quiz3
No ratings yet
CABALLERO - BIO 4 - Qtr3 - Quiz3
2 pages
Nursing Diagnosis2
No ratings yet
Nursing Diagnosis2
14 pages
Qi Presentation
No ratings yet
Qi Presentation
10 pages
Pankaj Shah, Professor, Department of Community Medicine
No ratings yet
Pankaj Shah, Professor, Department of Community Medicine
1 page
Oxford English For Careers - Medicine1 .Unit 2
No ratings yet
Oxford English For Careers - Medicine1 .Unit 2
5 pages
James J. Jasper, D.D.S.: Education
No ratings yet
James J. Jasper, D.D.S.: Education
2 pages
Cpdprogram Pharmacy 82318
No ratings yet
Cpdprogram Pharmacy 82318
81 pages
Rational Use of Antibiotics
No ratings yet
Rational Use of Antibiotics
27 pages
2 Placenta - Libre Pathology
No ratings yet
2 Placenta - Libre Pathology
26 pages
Continuous Improvement Project
No ratings yet
Continuous Improvement Project
15 pages
Orthodontic Consent Form 21
No ratings yet
Orthodontic Consent Form 21
2 pages
Vydehi Hospitals Contacts
100% (1)
Vydehi Hospitals Contacts
5 pages
Denominators For Intensive Care Unit (ICU) /other Locations (Not NICU or SCA)
No ratings yet
Denominators For Intensive Care Unit (ICU) /other Locations (Not NICU or SCA)
1 page
APACT 2021 Metadata
No ratings yet
APACT 2021 Metadata
126 pages
Gastroenteritis
No ratings yet
Gastroenteritis
3 pages