Module 3 Session.2 Practical Assignment-Lucy Nakabazzi
Module 3 Session.2 Practical Assignment-Lucy Nakabazzi
Practical Assignment
Module name: Sequence alignment theory and applications
Session name: Pairwise Sequence Alignment
Trainer: Jonathan Kayondo, Sonal Henson
Participant: <Lucy Nakabazzi>
Date: <29th August, 2022>
This exercise demonstrates the usefulness of the local alignment in bioinformatics, and also
what can go wrong if you are not careful in choosing your parameters. We also look at the
dynamic programming applications in the Needleman-Wunsch algorithm.
Uniprot - www.uniprot.org
EMBOSS Water – www.ebi.ac.uk/Tools/psa/emboss_water/
EMBOSS Needle - www.ebi.ac.uk/Tools/psa/emboss_needle/
TeachEnG - http://teacheng.illinois.edu/
Please note
Hand-in information If you are formally enrolled in the IBT course, please upload
your completed assignment to the Vula ‘Assignments’ tab. Take note of the final
hand-in date for each assignment, which will be indicated on Vula.
Task 1: instructions
The local alignment finds regions of local similarity. It is implemented trough Smith–
Waterman (but other also) algorithm. It has many purposes, and just some of them are
finding motifs, finding one sequence within another, search sequence databases, etc.
Introduction to Bioinformatics online course: IBT
11. Now use the Needleman-Wunsch algorithm for global pairwise alignment to align
the two sequences from steps 1 and 2. You can use the EBI tool Needle for that
(http://www.ebi.ac.uk/Tools/psa/emboss_needle/)
12. Follow steps 4-6 to submit the alignment.
13. How does your alignment compare with that from step 6?
14. Repeat steps 8 and 9, this time on the Needle program.
15. How does the alignment compare to that from step 12 (using default parameters)?
16. This exercise demonstrates the usefulness of the local alignment in bioinformatics,
and also what can go wrong if you are not careful in choosing your parameters.
13.
Length: 3757
Identity: 517/3757 (13.8%)
Similarity: 628/3757 (16.7%)
Gaps: 2940/3757 (78.3%)
Score: 2587.5
Positions of alignment: 2
14.Length: 3776
Identity: 675/3776 (17.9%)
Similarity: 760/3776 (20.1%)
Gaps: 2978/3776 (78.9%)
Score: 3337.0
15.
The % identity, % similarity, gaps, and the extent (positions) of alignment of the two
sequences are quite similar and relatable, although the overall alignment score is not the
same.
16.
>
Task 2: instructions
>