Sequence Alignment
Sequence Alignment
Nilakshi Samaranayake
Terminology
Homolog
– A gene related to a second gene by descent
Paralog
– Paralogs are genes related by duplication within a
genome
Ortholog
– Orthologs are genes in different species that evolved
from a common ancestral gene by speciation
Global alignment
– create an end-to-end alignment of the
sequences to be aligned.
– alignment is carried out from beginning till end
of the sequence to find out the best possible
alignment
Alignment methods
Local alignment
– find one, or more, alignments describing the most
similar region(s) within the sequences to be
aligned
– alignment tends to stop at the ends of regions of
identity or strong similarity
Alignment methods
Dynamic programming
– Needleman-Wunsch algorithm
– Smith-Waterman algorithm
Eg: EMBOSS
Alternative methods (eg: probabilistic
methods)
– EMBL FASTA
(http://fasta.bioch.virginia.edu/fasta_www2/fasta_l
ist2.shtml)
– NCBI BLAST
Pairwise sequence alignment
https://www.ncbi.nlm.nih.gov/search/