BLAST and Multiple Sequence Alignment of NCBI 1
>Unknown sequence 1
AAATGAGTTAATAGAATCTTTACAAATAAGAATATACACTTCTGCTTAGGATGATA
ATTGGAGGCAAGTG
AATCCTGAGCGTGATTTGATAATGACCTAATAATGATGGGTTTTATTTCCAGACTTC
ACTTCTAATGGTG
ATTATGGGAGAACTGGAGCCTTCAGAGGGTAAAATTAAGCACAGTGGAAGAATTT
CATTCTGTTCTCAGT
TTTCCTGGATTATGCCTGGCACCATTAAAGAAAATATCATCTTTGGTGTTTCCTATG
ATGAATATAGATA
CAGAAGCGTCATCAAAGCATGCCAACTAGAAGAGGTAAGAAACTATGTGAAAACT
TTTTGATTATGCATA
TGAACCCTTCACACTACCCAAATTATATATTTGGCTCCATATTCAATCGGTTAGTCT
ACATATATTTATG
TTTCCTCTATGGGTAAGCTACTGTGAATGGATCAATTAATAAAACACATGACCTAT
GCTTTAAGAAGCTTGCAAACACATGAA
>Unknown sequence 2
AAATGAGTTAATAGAATCTTTACAAATAAGAATATACACTTCTGCTTAGGATGATA
ATTGGAGGCAAGTG
AATCCTGAGCGTGATTTGATAATGACCTAATAATGATGGGTTTTATTTCCAGACTTC
ACTTCTAATGGTG
ATTATGGGAGAACTGGAGCCTTCAGAGGGTAAAATTAAGCACAGTGGAAGAATTT
CATTCTGTTCTCAGT
TTTCCTGGATTATGCCTGGCACCATTAAAGAAAATATCATTGGTGTTTCCTATGATG
AATATAGATACAG
AAGCGTCATCAAAGCATGCCAACTAGAAGAGGTAAGAAACTATGTGAAAACTTTTT
GATTATGCATATGA
ACCCTTCACACTACCCAAATTATATATTTGGCTCCATATTCAATCGGTTAGTCTACA
TATATTTATGTTT
CCTCTATGGGTAAGCTACTGTGAATGGATCAATTAATAAAACACATGACCTATGCT
TTAAGAAGCTTGCAAACACATGAA
Part 1: Questions
1. In the Descriptions section, look at the top result, which should be the result with the highest
score. Write down information about the best match:
Sequence 1
Sequence 2
2. Now scroll down to the Alignments heading. Look at the top result, which should be the same
one. Look at the alignment between your query and the reference. Do you see any mismatches?
- None
- There are no differences between the query and the reference, and the given
sequence is a good match because the genes found in the query are similar to those in the
subject.
4. What is this gene? Google the name of the gene and write down something significant you
learned about it.
- The CFTR gene provides instructions for making a protein called the cystic fibrosis
transmembrane conductance regulator (CFTR). The protein sits in the membranes of
epithelial cells that produce mucus, sweat, saliva, tears and digestive enzymes, so it
affects multiple organ systems in the human body. The CFTR gene therefore plays a vital
role: it helps maintain the balance of salt and water on many surfaces in the body, such
as the surface of the lung.
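For question 2 in Part 1, a direct comparison of the two unknown sequences can complement the BLAST alignment view. The following is a minimal Python sketch and not part of the original exercise: `read_fasta` and `differences` are hypothetical helper names, and the two strings in the usage example are short excerpts taken from the two unknown sequences above. It uses only the standard library (`difflib`), which is a general-purpose text differ and not a biological aligner, so the reported positions are only a rough guide.

```python
from difflib import SequenceMatcher


def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped lines
    and dropping any whitespace inside sequence lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].strip()
            records[name] = []
        elif name is not None:
            records[name].append("".join(line.split()).upper())
    return {header: "".join(parts) for header, parts in records.items()}


def differences(a, b):
    """Return (tag, position_in_a, text_in_a, text_in_b) for every
    non-identical stretch between sequences a and b."""
    # autojunk=False: the default heuristic treats very common characters
    # as junk in long inputs, which is unsuitable for a 4-letter alphabet.
    matcher = SequenceMatcher(None, a, b, autojunk=False)
    return [
        (tag, i1, a[i1:i2], b[j1:j2])
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"
    ]


if __name__ == "__main__":
    # Short excerpts from Unknown sequence 1 and Unknown sequence 2.
    excerpt_1 = "AAAATATCATCTTTGGTGTTTCC"
    excerpt_2 = "AAAATATCATTGGTGTTTCC"
    print(len(excerpt_1), len(excerpt_2))
    for diff in differences(excerpt_1, excerpt_2):
        print(diff)
```

To compare the full sequences, paste both FASTA records into one string, pass it to `read_fasta`, and hand the two resulting sequences to `differences`.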
Part 2: Investigating sets of sequences
Each of the following sets of sequences was obtained from a sequencing experiment.
For each experiment (Set 1, Set 2 and Set 3), answer these questions:
1. What do these sequences have in common?
2. What is your best guess about the original purpose of this experiment?
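Alongside the BLAST results, one quick way to explore what a set of sequences has in common is to look for short subsequences (k-mers) shared by every member of the set and to compare base composition. The sketch below is only an illustration under stated assumptions: `kmers`, `shared_kmers` and `gc_content` are hypothetical helper names, the three example strings are made-up toy sequences (not the worksheet data), and the choice of k is arbitrary.

```python
def kmers(seq, k):
    """Return the set of all length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def shared_kmers(seqs, k):
    """Return the k-mers that occur in every sequence in seqs."""
    sets = [kmers(s, k) for s in seqs]
    return set.intersection(*sets)


def gc_content(seq):
    """Return the fraction of G and C bases in seq."""
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # Toy sequences for illustration only (not taken from Set 1, 2 or 3).
    toy = ["ACGTACGGA", "TTACGGAC", "GGACGGAT"]
    print(sorted(shared_kmers(toy, 4)))
    for s in toy:
        print(s, round(gc_content(s), 2))
```

Shared k-mers only hint at a conserved region; the BLAST hits for each sequence remain the basis for answering the two questions.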
Set 1
>Sequence1a
GTAATGTACATAACATTAATGTAATAAAGA
>Sequence1b
ATCACGAGCTTAATTACCATGCCGCGTGAAACCAGCAACC
>Sequence1c
ATGGACTAATGGCTAATCAGCCCATGCTCACACATA
Set 2
>Sequence2a
TTTGGTTGTTCGACGACGGATGCAGAGCTCAGGGAAGTGGGGACGTGTTTTGGCT
ATCCT
>Sequence2b
GCGATGCATCAGGATGCATCCTCTGATCTTAGGGTGGTACGAGAAAAATTGAAG
AATGTA
>Sequence2c
GCGGTTCCACAAGACCCTGAGGCGCCTGGTGCCTGACTCGGACGTCCGGTTCCTCCT
CTC
2. What is your best guess about the original purpose of this experiment?
- My best guess is that the purpose of this experiment is to investigate the
role of genes in plant growth.
Set 3
>Sequence3a
TAACCTACGGGTGGCCGCAGTGGGGAATATTGCACAATGGACACAAGTCTGATGCA
GCGACGCCG
CGTGGGGGATGAAGGCTTTCGGGTTGTAAACTCCTTTCAGTACAGAAGAAGCATTT
TTGTGACGG
TATGTGCAGAAGAAGCGCCGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGG
GCGCGAGCG
TTGTCCGGAATTATTGGGCGTAAAGAGCTCGTAGGCGGTTTGTTGCGCCTGCTGTG
>Sequence3b
TGTCCTACGGGGGGCTGCAGTGAGGAATATTGGTCAATGGGCGAGAGCCTGAACC
AGCCAAGTCG
CGTGAAGGATGACTGTCTTATGGATTGTAAACTTCTTTTATACGGGAATAACAAGA
GTCACGTGT
GGCTCCCTGCATGTACCGTATGAATAAGCATCGGCTAACTCCGTGCCAGCAGCCGC
GGTAATACG
GAGGATGCGAGCGTTATCCGGATTTATTGGGTTTAAAGGGTGCGTAGGCGGC
>Sequence3c
GGCCTACGGGGGGCTGCAGTGGGTACGGGCAGACTAGAGTGTGGTAGGGGTAATTG
GAATTCCTG
GTGTAGCGGTGGAATGCGCAGATATCAGGAGGAACACCGATGGCGAAGGCAGGTT
ACTGGGCCAT
TACTGACGCTGAGGAGCGAAAGCGTGGGTAGCGAACAGGATTAGATACCCTAGTA
GTCT
2. What is your best guess about the original purpose of this experiment?
- The purpose of this experiment is to identify the genes that cause
infections in organisms and to check whether the given sequences
best match a clone.