Bioinformatics Lab # 6
Bioinformatics Lab # 6
1. Given a protein sequence, find out which protein it is and from which organism it
belongs using BLAST Program.
MERGVRRGAALVAAWRSLWERGGLALFRPQCRTGCGACRVQGTRPFSLSAAASAVLG
LGSWGGDSGKQKLTLQDVAELIRKKECRRVVVMAGAGISTPSGIPDFRSPGSGLYSNLE
QYNIPYPEAIFELAYFFINPKPFFTLAKELYPGNYRPNYAHYFLRLLHDKGLLLRLYTQNI
DGLERVAGIPPDRLVEAHGTFATATCTVCRRKFPGEDFRGDVMADKVPHCRVCTGIVK
PDIVFFGEELPQRFFLHMTDFPMADLLFVIGTSLEVEPFASLAGAVRNSVPRVLINR
DLVGPFAWQQRYNDIAQLGDVVTGVEKMVELLDWNEEMQTLIQKEKEKLDAKDK
From the given details, find
Name the protein of that sequence.
The E-value (Expect Value) is a statistical measure used in sequence alignment (such as
BLAST) to indicate the number of matches (or alignments) you might expect to see by chance
when searching a database.
A low E-value (close to 0) indicates a highly significant match, meaning the alignment
is unlikely to have occurred by chance.
A high E-value suggests the match could be due to random chance and is less significant.
In summary, the E-value helps assess the quality and reliability of the alignment result:
E-value ≈ 0: Strong, significant match.
E-value > 0: Weaker, less significant match.
The lower the E-value, the more reliable the match between the sequences.
2. Given a nucleotide sequence, find the details regarding the sequence like which
organism, name, accession id etc. using BLAST
Nucleotide Sequence
AGTGCCGCGCGTCGAGCGGAGCAGAGGAGGCGAGGGCGGAGGGCCAGAGAGGCAGTTGGAAGATGGCGGAC
GAGGTGGCGCTCGCCCTTCAGGCCGCCGGCTCCCCTTCCGCGGCGGCCGCCATGGAGGCCGCGTCGCAGCCGGC
GGACGAGCCGCTCCGCAAGAGGCCCCGCCGAGACGGGCCTGGCCTCGGGCGCAGCCCGGGCGAGCCGAGCGCA
GCAGTGGCGCCGGCGGCCGCGGGGTGTGAGGCGGCGAGCGCCGCGGCCCCGGCGGCGCTGTGGCGGGAGGCG
GCAGGGGCGGCGGCGAGCGCGGAGCGGGAGGCCCCGGCGACGGCCGTGGCCGGGGACGGAGACAATGGGTCC
GGCCTGCGGCGGGAGCCGAGGGCGGCTGACGACTTCGACGACGACGAGGGCGAGGAGGAGGACGAGGCGGCG
GCGGCAGCGGCGGCGGCAGCGATCGGCTACCGAGACAACCTCCTGTTGACCGATGGACTCCTCACTAATGGCTTT
CATTCCTGTGAAAGTGATGACGATGACAGAACGTCACACGCCAGCTCTAGTGACTGGACTCCGCGGCCGCGGATA
GGTCCATATACTTTTGTTCAGCAACATCTCATGATTGGCACCGATCCTCGAACAATTCTTAAAGATTTATTACCAGA
AACAATTCCTCCACCTGAGCTGGATGATATGACGCTGTGGCAGATTGTTATTAATATCCTTTCAGAACCACCAAAGC
GGAAAAAAAGAAAAGATATCAATACAATTGAAGATGCTGTGAAGTTACTGCAGGAGTGTAAAAAGATAATAGTTC
TGACTGGAGCTGGGGTTTCTGTCTCCTGTGGGATTCCTGACTTCAGATCAAGAGACGGTATCTATGCTCGCCTTGC
GGTGGACTTCCCAGACCTCCCAGACCCTCAAGCCATGTTTGATATTGAGTATTTTAGAAAAGACCCAAGACCATTCT
TCAAGTTTGCAAAGGAAATATATCCCGGACAGTTCCAGCCGTCTCTGTGTCACAAATTCATAGCTTTGTCAGATAA
GGAAGGAAAACTACTTCGAAATTATACTCAAAATATAGATACCTTGGAGCAGGTTGCAGGAATCCAAAGGATCCT
TCAGTGTCATGGTTCCTTTGCAACAGCATCTTGCCTGATTTGTAAATACAAAGTTGATTGTGAAGCTGTTCGTGGAG
ACATTTTTAATCAGGTAGTTCCTCGGTGCCCTAGGTGCCCAGCTGATGAGCCACTTGCCATCATGAAGCCAGAGAT
TGTCTTCTTTGGTGAAAACTTACCAGAACAGTTTCATAGAGCCATGAAGTATGACAAAGATGAAGTTGACCTCCTC
ATTGTTATTGGATCTTCTCTGAAAGTGAGACCAGTAGCACTAATTCCAAGTTCTATACCCCATGAAGTGCCTCAAAT
ATTAATAAATAGGGAACCTTTGCCTCATCTACATTTTGATGTAGAGCTCCTTGGAGACTGCGATGTTATAATTAATG
AGTTGTGTCATAGGCTAGGTGGTGAATATGCCAAACTTTGTTGTAACCCTGTAAAGCTTTCAGAAATTACTGAAAA
ACCTCCACGCCCACAAAAGGAATTGGTTCATTTATCAGAGTTGCCACCAACACCTCTTCATATTTCGGAAGACTCAA
GTTCACCTGAAAGAACTGTACCACAAGACTCTTCTGTGATTGCTACACTTGTAGACCAAGCAACAAACAACAATGT
TAATGATTTAGAAGTATCTGAATCAAGTTGTGTGGAAGAAAAACCACAAGAAGTACAGACTAGTAGGAATGTTGA
GAACATTAATGTGGAAAATCCAGATTTTAAGGCTGTTGGTTCCAGTACTGCAGACAAAAATGAAAGAACTTCAGTT
GCAGAAACAGTGAGAAAATGCTGGCCTAATAGACTTGCAAAGGAGCAGATTAGTAAGCGGCTTGAGGGTAATCA
ATACCTGTTTGTACCACCAAATCGTTACATATTCCACGGTGCTGAGGTATACTCAGACTCTGAAGATGACGTCTTGT
CCTCTAGTTCCTGTGGCAGTAACAGTGACAGTGGCACATGCCAGAGTCCAAGTTTAGAAGAACCCTTGGAAGATG
AAAGTGAAATTGAAGAATTCTACAATGGCTTGGAAGATGATACGGAGAGGCCCGAATGTGCTGGAGGATCTGGA
TTTGGAGCTGATGGAGGGGATCAAGAGGTTGTTAATGAAGCTATAGCTACAAGACAGGAATTGACAGATGTAAA
CTATCCATCAGACAAATCATAACACTATTGAAGCTGTCCGGATTCAGGAATTGCTCCACCAGCATTGGGAACTTTA
GCATGTCAAAAAATGAATGTTTACTTGTGAACTTGAACAAGGAAATCTGAAAGATGTATTATTTATAGACTGGAAA
ATAGATTGTCTTCTTGGATAATTTCTAAAGTTCCATCATTTCTGTTTGTACTTGTACATTCAACACTGTTGGTTGACTT
CATCTTCCTTTCAAGGTTCATTTGTATGATACATTCGTATGTATGTATAATTTTGTTTTTTGCCTAATGAGTTTCAACC
TTTTAAAGTTTTCAAAAGCCATTGGAATGTTAATGTAAAGGGAACAGCTTATCTAGACCAAAGAATGGTATTTCAC
ACTTTTTTGTTTGTAACATTGAATAGTTTAAAGCCCTCAATTTCTGTTCTGCTGAACTTTTATTTTTAGGACAGTTAAC
TTTTTAAACACTGGCATTTTCCAAAACTTGTGGCAGCTAACTTTTTAAAATCACAGATGACTTGTAATGTGAGGAGT
CAGCACCGTGTCTGGAGCACTCAAAACTTGGTGCTCAGTGTGTGAAGCGTACTTACTGCATCGTTTTTGTACTTGCT
GCAGACGTGGTAATGTCCAAACAGGCCCCTGAGACTAATCTGATAAATGATTTGGAAATGTGTTTCAGTTGTTCTA
GAAACAATAGTGCCTGTCTATATAGGTCCCCTTAGTTTGAATATTTGCCATTGTTTAATTAAATACCTATCACTGTGG
TAGAGCCTGCATAGATCTTCACCACAAATACTGCCAAGATGTGAATATGCAAAGCCTTTCTGAATCTAATAATGGT
ACTTCTACTGGGGAGAGTGTAATATTTTGGACTGCTGTTTTTCCATTAATGAGGAAAGCAATAGGCCTCTTAATTAA
AGTCCCAAAGTCATAAGATAAATTGTAGCTCAACCAGAAAGTACACTGTTGCCTGTTGAGGATTTGGTGTAATGTA
TCCCAAGGTGTTAGCCTTGTATTATGGAGATGAATACAGATCCAATAGTCAAATGAAACTAGTTCTTAGTTATTTAA
AAGCTTAGCTTGCCTTAAAACTAGGGATCAATTTTCTCAACTGCAGAAACTTTTAGCCTTTCAAACAGTTCACACCT
CAGAAAGTCAGTATTTATTTTACAGACTTCTTTGGAACATTGCCCCCAAATTTAAATATTCATGTGGGTTTAGTATTT
ATTACAAAAAAATGATTTGAAATATAGCTGTTCTTTATGCATAAAATACCCAGTTAGGACCATTACTGCCAGAGGA
GAAAAGTATTAAGTAGCTCATTTCCCTACCTAAAAGATAACTGAATTTATTTGGCTACACTAAAGAATGCAGTATAT
TTAGTTTTCCATTTGCATGATGTGTTTGTGCTATAGACAATATTTTAAATTGAAAAATTTGTTTTAAATTATTTTTACA
GTGAAGACTGTTTTCAGCTCTTTTTATATTGTACATAGACTTTTATGTAATCTGGCATATGTTTTGTAGACCGTTTAA
TGACTGGATTATCTTCCTCCAACTTTTGAAATACAAAAACAGTGTTTTATACTTGTATCTTGTTTTAAAGTCTTATATT
AAAATTGTCATTTGACTTTTTTCCCGTTAAAAAAAAAAAAAAA
http://vlab.amrita.edu/?sub=3&brch=274&sim=1434&cnt=5