0% found this document useful (0 votes)

64 views9 pages

University of Mauritius

This document summarizes an assignment for the course AGRI 2081Y - Computational Biology offered at the University of Mauritius, Faculty of Agriculture. The assignment was completed by Marie Natacha Meunier with student ID 1712892 and submitted to the lecturer Dr Shakuntala Baichoo on 25th May 2020. The assignment contains code snippets and answers to computational biology questions involving string manipulation of DNA, RNA and protein sequences.

Uploaded by

grace meunier

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

64 views9 pages

University of Mauritius

Uploaded by

grace meunier

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 9

UNIVERSITY OF MAURITIUS

FACULTY OF AGRICULTURE
BSc (Hons) Biotechnology

AGRI 2081Y (3) - COMPUTATIONAL BIOLOGY

Name of Student: Marie Natacha Meunier

Student I.D: 1712892

Date: 25th May 2020

Lecturer Name: Dr Shakuntala Baichoo

chain_a = """SSSVPSQKTYQGSYGFRLGFLHSGTAKSVTCTYSPALNKM
FCQLAKTCPVQLWVDSTPPPGTRVRAMAIYKQSQHMTEVV
RRCPHHERCSDSDGLAPPQHLIRVEGNLRVEYLDDRNTFR
HSVVVPYEPPEVGSDCTTIHYNYMCNSSCMGGMNRRPILT
IITLEDSSGNLLGRNSFEVRVCACPGRDRRTEEENLRKKG
EPHHELPPGSTKRALPNNT"""

#Question 1 a

num_lines = chain_a.count ("\n")

print (num_lines)

#Question 1 b
length sequence = len (chain_a) - chain_a.count ("\n")
print (length sequence: ", length)

#Question 1 c
new_chain = chain_a.replace("\n", "")
print("New Chain:",new_chain)

#Question 1 d

count = 0
result=0
for i in chain_a:
if i == 'C':
count = count + 1
print ("Number of C:",count)

#Question 1 e
if "NLRVEYLDDRN" in chain_a:
print("yes found");

pos= chain_a.find("NLRVEYLDDRN")
print("Starting position :",pos);
Question 2

dna_seq = """GGGCTTGTGGCGCGAGCTTCTGAAACTAGGCGGCAGAGGCGGAGCCGCT
GTGGCACTGCTGCGCCTCTGCTGCGCCTCGGGTGTCTTTT
GCGGCGGTGGGTCGCCGCCGGGAGAAGCGTGAGGGGACAG
ATTTGTGACCGGCGCGGTTTTTGTCAGCTTACTCCGGCCA AAAAAGAACTGCACCTCTGGAGCGG""

#Question 2 a

# Count the number of C’s in DNA sequence

no_c = dna_seq.count ("C")

# Count the number of G’s in DNA sequence

no_g = dna_seq.count ("G")

#determine the length of the DNA sequence

dna_length = len(dna_seq)

#compute the GC content

gc_cont = (no_g + no_c)

#Question 2 b

rna_seq = dna_seq.replace("T","U")
#Question 2 c

intron = dna_seq[50:156]
exon1 = dna_seq[0:50]
exon2 = dna_seq[156:]
spliced = exon1+exon2

Question 3
#Question 3 a

clusters = """\
>Cluster 0
0 >YLR106C at 100.00%
>Cluster 50
0 >YPL082C at 100.00%
>Cluster 54
0 >YHL009W-A at 90.80%
1 >YHL009W-B at 100.00%
2 >YJL113W at 98.77%
3 >YJL114W at 97.35%
>Cluster 52
0 >YBR208C at 100.00%
"""

#Question a
result = re.findall(r">Cluster?([ \d.]+)", clusters, re.IGNORECASE |
re.MULTILINE)
#print("ID :",str(result))

#Question b
r = clusters.replace('>Cluster', 'Test')
#print("New :",r)
result = re.findall(r"> ?([A-Za-z0-9-]+)", r, re.IGNORECASE |
re.MULTILINE)
#print("sd :",str(result))

per=re.findall(r"> ?([A-Za-z0-9-]+)", r, re.IGNORECASE | re.MULTILINE)

+ re.findall(r"at ?([\d.]+)", clusters, re.IGNORECASE | re.MULTILINE)
#print("sd :",str(per))

lines = r.split('\n')
#print(lines)
for line in lines:
print(re.findall(r"> ?([A-Za-z0-9-]+)", line, re.IGNORECASE |
re.MULTILINE) + re.findall(r"at ?([\d.]+)", line, re.IGNORECASE |
re.MULTILINE))
#Question 4

("A", "T"): 10.0 / 5.0,

("A", "C"): 10.0 / 7.0,
("A", "G"): 10.0 / 6.0,
("T", "C"): 5.0 / 7.0,
("T", "G"): 5.0 / 6.0,
("C", "G"): 7.0 / 6.0 .
#Question 4 a

#There is no difference between the len(ratios), len(ratios.keys()),

len(ratios.values()) and len(ratios.items()) since all the commands
measure the key values
print len(ratios.keys())
print len(ratios.values())
print len(ratios.items())

#Question 4 b

ratio= ("A", "T"): 10.0 / 5.0, ("C", "G"): 7.0 / 6.0 .

If ("A", "T") in ratios:

print ("yes 'A, T' is found in ratios")
or:
print ("No 'T, A' is not found in ratios")

If ("C", "G") in ratios:

print ("yes 'C, G' is found in ratios")
or:
print ("No 'C, G' is not found in ratios")
#Question 4 c

contains_2 = 2 in ratios.values()
print contains_2

contains_3 = 3 in ratios.values()
print contains_3

#Question 4 d

2 in ("A", "T"):
print (("A", "T"), 2) in ratios.items()

1000 in ("C", "G"):

print (("C", "G"), 1000) in ratios.items()

#Question 4 e

keys = [key_value[0]
for key_value in ratios.items()]
values = [key_value[-1]
for key_value in ratios.items()]
#Question 5

#translate the list:

list = ["A", "T", "T", "A", "G", "T", "C"]

translation=

String="ade tym tym ade gua tym cyt"

str = " ade tym tym ade gua tym cyt "

s = ['A, T, T, A, G, T, C ', 'for', ' ade, tym, tym, ade, gua, tym, cyt ']

print(listToString(s))
#Question 6

A python program to read the file data.fasta

text=""">2HMI:A|PDBID|CHAIN|SEQUENCE

PISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISKI

>2HMI:B|PDBID|CHAIN|SEQUENCE

PISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISKI

>2HMI:C|PDBID|CHAIN|SEQUENCE

DIQMTQTTSSLSASLGDRVTISCSASQDISSYLNWYQQKPEGTVKLLIYY

>2HMI:D|PDBID|CHAIN|SEQUENCE

QITLKESGPGIVQPSQPFRLTCTFSGFSLSTSGIGVTWIRQPSGKGLEWL

>2HMI:E|PDBID|CHAIN|SEQUENCE

ATGGCGCCCGAACAGGGAC

>2HMI:F|PDBID|CHAIN|SEQUENCE

GTCCCTGTTCGGGCGCCA"""

fastaFile = open('fasta_file.txt')

Nintendo: HR Strategies
No ratings yet
Nintendo: HR Strategies
9 pages
Biostatistics and Research Methodology
From Everand
Biostatistics and Research Methodology
Dr. G. Nageswara Rao
5/5 (5)
Shogun Method Derek Rake
13% (8)
Shogun Method Derek Rake
33 pages
Fundamentals of Artificial Intelligence - Lab 1: Expert Fundamentals of Artificial Intelligence - Lab 1: Expert Systems Systems
No ratings yet
Fundamentals of Artificial Intelligence - Lab 1: Expert Fundamentals of Artificial Intelligence - Lab 1: Expert Systems Systems
8 pages
Function Solutions
No ratings yet
Function Solutions
10 pages
p3 Python Project
No ratings yet
p3 Python Project
4 pages
BINP16 Programming Exam 2016-10-25 Solutions
No ratings yet
BINP16 Programming Exam 2016-10-25 Solutions
5 pages
IDC306 Assignment 5 MS21009
No ratings yet
IDC306 Assignment 5 MS21009
4 pages
solutionsExerciseMaster11 23
No ratings yet
solutionsExerciseMaster11 23
13 pages
BT3040 - BIOINFORMATICS - Assignment 4: Question 1
No ratings yet
BT3040 - BIOINFORMATICS - Assignment 4: Question 1
9 pages
Lösungen Zu Den Exercises AI Python
No ratings yet
Lösungen Zu Den Exercises AI Python
26 pages
2nd Year
No ratings yet
2nd Year
83 pages
Untitled Document
No ratings yet
Untitled Document
15 pages
p2 Python Project
No ratings yet
p2 Python Project
3 pages
Py 1679789071
No ratings yet
Py 1679789071
2 pages
AI - Programs KP Print
No ratings yet
AI - Programs KP Print
14 pages
ENEL2CM Assignment 2 (2025)
No ratings yet
ENEL2CM Assignment 2 (2025)
15 pages
15CSL76 Students
No ratings yet
15CSL76 Students
18 pages
Aiml Sample Programs
No ratings yet
Aiml Sample Programs
20 pages
J.K. Institute of Applied Physics and Technology: Natural Language Processing Assignment
No ratings yet
J.K. Institute of Applied Physics and Technology: Natural Language Processing Assignment
22 pages
Group17 2
No ratings yet
Group17 2
9 pages
AIML Manual V1-6-83
No ratings yet
AIML Manual V1-6-83
78 pages
PY Exam
No ratings yet
PY Exam
11 pages
Ai SRK
No ratings yet
Ai SRK
19 pages
Python Solutions
No ratings yet
Python Solutions
4 pages
Dy Ai Rec
No ratings yet
Dy Ai Rec
24 pages
Artificial Intelligence Lab File
No ratings yet
Artificial Intelligence Lab File
10 pages
DWM Final Exps
No ratings yet
DWM Final Exps
14 pages
Python
No ratings yet
Python
9 pages
Ex 06 LL
No ratings yet
Ex 06 LL
9 pages
CS3491-AIML Lab Manual
No ratings yet
CS3491-AIML Lab Manual
20 pages
Exam Sample Questions
No ratings yet
Exam Sample Questions
6 pages
31-40 Prolog Program
No ratings yet
31-40 Prolog Program
23 pages
PRGM Aiml
No ratings yet
PRGM Aiml
27 pages
Homework 4
No ratings yet
Homework 4
7 pages
Answers Etc Sip Class 12
No ratings yet
Answers Etc Sip Class 12
9 pages
AIML Lab Manual
No ratings yet
AIML Lab Manual
39 pages
Bioinf575 hw07 Dmeghana
No ratings yet
Bioinf575 hw07 Dmeghana
34 pages
2024 Final Exams Rev Worksheet 1
No ratings yet
2024 Final Exams Rev Worksheet 1
9 pages
Aiml Lab
No ratings yet
Aiml Lab
10 pages
Comp Sci Prac
No ratings yet
Comp Sci Prac
8 pages
0 Aimlfinal
No ratings yet
0 Aimlfinal
24 pages
CSE160 Final 23wi Key
No ratings yet
CSE160 Final 23wi Key
10 pages
Aiml Programs
No ratings yet
Aiml Programs
24 pages
Algo
No ratings yet
Algo
10 pages
Aiml Lab Manual
No ratings yet
Aiml Lab Manual
44 pages
Arduino DLL
No ratings yet
Arduino DLL
13 pages
Shashidhar-18csl76 Final
No ratings yet
Shashidhar-18csl76 Final
19 pages
AI & ML Lab Manual
No ratings yet
AI & ML Lab Manual
25 pages
Aiml Lab Manual New Ucev
No ratings yet
Aiml Lab Manual New Ucev
37 pages
Lab Manual: Spring 2021
No ratings yet
Lab Manual: Spring 2021
33 pages
AI&ML Lab Manual
No ratings yet
AI&ML Lab Manual
50 pages
Aiml Lab Programs PDF
No ratings yet
Aiml Lab Programs PDF
25 pages
Machine Learning Through Python Lab Mannual
No ratings yet
Machine Learning Through Python Lab Mannual
33 pages
Prac1 23bme053
No ratings yet
Prac1 23bme053
32 pages
Aiml Lab Manual 2023
No ratings yet
Aiml Lab Manual 2023
17 pages
Ai Myh
No ratings yet
Ai Myh
8 pages
Quizlet Py
No ratings yet
Quizlet Py
13 pages
Project
No ratings yet
Project
29 pages
CODE
No ratings yet
CODE
16 pages
Calculus and Statistics
From Everand
Calculus and Statistics
Michael C. Gemignani
4/5 (1)
University of Mauritius
No ratings yet
University of Mauritius
4 pages
Radial Immuno
No ratings yet
Radial Immuno
8 pages
Revision questions-AGRI 2042
No ratings yet
Revision questions-AGRI 2042
1 page
Intracellular Signal Transduction
No ratings yet
Intracellular Signal Transduction
11 pages
SBSC Agribiotechnology Year Iii: Gmo Test
No ratings yet
SBSC Agribiotechnology Year Iii: Gmo Test
1 page
KD N Covid To Upload
No ratings yet
KD N Covid To Upload
12 pages
BT Cotton
No ratings yet
BT Cotton
3 pages
Agarose Gel Electrophoresis
No ratings yet
Agarose Gel Electrophoresis
1 page
Saliva Report
No ratings yet
Saliva Report
11 pages
Nusae - Company Profile - 2019 PDF
No ratings yet
Nusae - Company Profile - 2019 PDF
122 pages
Vet Strategy and Action Plan - en PDF
No ratings yet
Vet Strategy and Action Plan - en PDF
93 pages
American Dream Essay
No ratings yet
American Dream Essay
4 pages
Regular Expression Question Solution
100% (2)
Regular Expression Question Solution
68 pages
Youngs Modulus by Cantilever Method
No ratings yet
Youngs Modulus by Cantilever Method
3 pages
Xperiment O: A: To Compute Fourier Transform of A Continuous Time Signal
No ratings yet
Xperiment O: A: To Compute Fourier Transform of A Continuous Time Signal
6 pages
Vision Technique
No ratings yet
Vision Technique
14 pages
La Soledad, M.Denevi
No ratings yet
La Soledad, M.Denevi
4 pages
Aug 5-9
No ratings yet
Aug 5-9
6 pages
(LEARNING TASKS 6) Proper Etiquette and Safety in The Use of Facilities and Equipment
No ratings yet
(LEARNING TASKS 6) Proper Etiquette and Safety in The Use of Facilities and Equipment
3 pages
Argument Essay Rubric
No ratings yet
Argument Essay Rubric
3 pages
Report Rubrics
No ratings yet
Report Rubrics
2 pages
Ch.2 CRD
No ratings yet
Ch.2 CRD
10 pages
Torsion of Multi-Cell Cross-Section - Hw7 - B
No ratings yet
Torsion of Multi-Cell Cross-Section - Hw7 - B
12 pages
Enhancing and Scalability in Big Data and Cloud Computing: Future Opportunities and Security
No ratings yet
Enhancing and Scalability in Big Data and Cloud Computing: Future Opportunities and Security
7 pages
Natural and Artificial Tracers in Ground Water
100% (1)
Natural and Artificial Tracers in Ground Water
23 pages
Joseph Henry
No ratings yet
Joseph Henry
11 pages
Procom - Interview Tips and Common Questions
No ratings yet
Procom - Interview Tips and Common Questions
4 pages
OMEGA AIR - Process and Sterile Filtration - English
No ratings yet
OMEGA AIR - Process and Sterile Filtration - English
12 pages
Bassey Curriculum Vitae
No ratings yet
Bassey Curriculum Vitae
3 pages
The Master of Animals in Old World Iconography
No ratings yet
The Master of Animals in Old World Iconography
21 pages
International MKT Case Study 2 IKEA
No ratings yet
International MKT Case Study 2 IKEA
3 pages
How To Optimize Human Biology: Where Genome Editing and Artificial Intelligence Collide
No ratings yet
How To Optimize Human Biology: Where Genome Editing and Artificial Intelligence Collide
27 pages
MSDS - Sulphur 90%: Section 1. Product Information
No ratings yet
MSDS - Sulphur 90%: Section 1. Product Information
3 pages
190-ECDIS JRC JAN-7201-9201 Instruct Manual Function 1-4-2019
100% (7)
190-ECDIS JRC JAN-7201-9201 Instruct Manual Function 1-4-2019
558 pages
Venn Diagram
No ratings yet
Venn Diagram
2 pages
Induction by Contradiction PDF
No ratings yet
Induction by Contradiction PDF
2 pages
Building Lighting Automation Through The Integration of DALI With Wireless Sensor Networks
No ratings yet
Building Lighting Automation Through The Integration of DALI With Wireless Sensor Networks
6 pages
Project in Math24-A1
No ratings yet
Project in Math24-A1
10 pages

University of Mauritius

Uploaded by

University of Mauritius

Uploaded by

UNIVERSITY OF MAURITIUS

AGRI 2081Y (3) - COMPUTATIONAL BIOLOGY

Name of Student: Marie Natacha Meunier

Student I.D: 1712892

Date: 25th May 2020

Lecturer Name: Dr Shakuntala Baichoo

num_lines = chain_a.count ("\n")

# Count the number of C’s in DNA sequence

# Count the number of G’s in DNA sequence

#determine the length of the DNA sequence

#compute the GC content

gc_cont = (no_g + no_c)

per=re.findall(r"> ?([A-Za-z0-9-]+)", r, re.IGNORECASE | re.MULTILINE)

("A", "T"): 10.0 / 5.0,

#There is no difference between the len(ratios), len(ratios.keys()),

ratio= ("A", "T"): 10.0 / 5.0, ("C", "G"): 7.0 / 6.0 .

If ("A", "T") in ratios:

If ("C", "G") in ratios:

1000 in ("C", "G"):

#translate the list:

list = ["A", "T", "T", "A", "G", "T", "C"]

String="ade tym tym ade gua tym cyt"

A python program to read the file data.fasta

You might also like