Lecture-7 Gene Duplication and Read Mapping
Lecture-7 Gene Duplication and Read Mapping
and Read
Mapping
Week 7
1. Mutation
2. Gene Duplication
3. Read Mapping
- Keyword Tree
- Suffix Tree
- Suffix Array
- Burrows Wheeler Transform
1. DNA Mutation
What and how mutation occurs, common forms
Mutation
DNA Mutation refers to sudden, ATCCGA
random changes in DNA sequences ATGCCGA
which leads to different phenotypic
expressions.
Insertion
Common Mutation
Types
Substitution Duplication
AATTCGCA AATCGCA
AATGCGCA Inversion AATCATCGCA
AATCGCA
AACGGCA Insertion
Deletion
AGCATCG AATCGCA
AATTCGCA
ACTATCG AATTCGCA
AATCGCA
2. Gene Duplication
Duplication of Genes, Homolog, Ortholog, Paralogs
Gene
Duplication
Gene duplication (or chromosomal
duplication or gene amplification) is
a major mechanism through which
new genetic material is generated
during molecular evolution. It can be
defined as any duplication of a
region of DNA that contains a gene.
Homolog, Ortholog, Paralog and
Speciation
• Homolog - A gene related to a
second gene by descent from
a common ancestral DNA
sequence
▹ Add $ as ending
notation – abaaba$
▹ By Shifting each
alphabet to the right
once, generate all the
rotations
▹ Lexicographically Sort
all the rotations
▹ Given Sequence –
abaaba
▹ Add $ as ending
notation – abaaba$
▹ Lexicographically sorted
all rotations will
generate BWT Matrix
which will be denoted as
BWM (T)
LF (Last to First)
Mapping
▹ Generate Burrows
Wheeler Matrix for a
given sequence
▹ Assign numbers to
distinguish same
characters
4. If MATCH, then
- Find b1 in First Column
- Print row number
- Terminate
5. If No MATCH, then
- Find the row with that element in the
Genome Indexing (Burrows Wheeler Transform)
Dinosaurs
And they both descended from Reptiles