0% found this document useful (0 votes)

18 views10 pages

Function Solutions

This document contains solutions to exercises involving Python functions. It includes 21 exercises on topics like defining functions, passing arguments, returning values, iterating over dictionaries, parsing FASTA files, and working with genomic data. Functions are defined to perform tasks like calculating GC content, extracting sequences from genomes, and writing FASTA files. The exercises demonstrate common bioinformatics analysis patterns using Python functions.

Uploaded by

Huy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views10 pages

Function Solutions

Uploaded by

Huy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

function_solutions

October 18, 2019

1 Table of Contents
1 Functions
1.1 Exercise 1
1.2 Exercise 2
1.3 Exercise 3
1.4 Exercise 4
1.5 Exercise 5
1.6 Exercise 6
1.7 Exercise 7
1.8 Exercise 8
1.9 Exercise 9
1.10 Exercise 10
1.11 Exercise 11
1.12 Exercise 12
1.13 Exercise 13
1.14 Exercise 14
2 Case study
2.1 Exercise 15
2.2 Exercise 16
2.3 Exercise 17
2.4 Exercise 18
2.5 Exercise 19
2.6 Exercise 20
2.7 Exercise 21
2.8 A less readable way to write it

2 Functions
2.1 Exercise 1
In [3]: def print_line():
print("This is a function!")

print_line()
This is a function!

1
2.2 Exercise 2
In [4]: def print_line():
return "This is a function!"

result = print_line()
print(result)
print(result[0])
This is a function!
T

2.3 Exercise 3
In [5]: def greet(name1, name2):
return "Hello {} and {}!".format(name1, name2)

print(greet("Björn", "Dag"))
Hello Björn and Dag!

2.4 Exercise 4
In [6]: def greet(name1="?", name2="?"):
return "Hello {} and {}!".format(name1, name2)

print(greet())
print(greet("Petr"))
print(greet("Björn", "Dag"))
Hello ? and ?!
Hello Petr and ?!
Hello Björn and Dag!

2.5 Exercise 5
In [7]: def multiply(nbr1, nbr2):
return nbr1 * nbr2

def add(nbr1, nbr2):

return nbr1 + nbr2

print(multiply(2,3))
print(add(2,3))
6
5

2
2.6 Exercise 6
In [8]: def multiply(nbr1, nbr2):
return nbr1 * nbr2

def add(nbr1, nbr2):

return nbr1 + nbr2

def calculate(nbr1, nbr2, operation):

if operation == "add":
return add(nbr1, nbr2)
elif operation == "multiply":
return multiply(nbr1, nbr2)
else:
raise Exception("Unknown operation option: {}".format(operation))

print(calculate(2, 3, operation="add"))
print(calculate(2, 3, operation="multiply"))

5
6

2.7 Exercise 7
In [9]: def get_gc(seq_raw):
seq = seq_raw.upper() # Handle both lower- and upper- case
gc_count = seq.count("C") + seq.count("G")
at_count = seq.count("A") + seq.count("T")
return gc_count / (at_count + gc_count)

print(get_gc("CACAGGTT"))
print(get_gc("CAG"))

0.5
0.6666666666666666

2.8 Exercise 8
In [10]: def many_hi(number):
for _ in range(number):
print("Hi!")

many_hi(4)

Hi!
Hi!
Hi!

3
Hi!

2.9 Exercise 9
In [11]: def many_hi(number):
return ["Hi!"] * number

hi_list = many_hi(6)
print(hi_list)

['Hi!', 'Hi!', 'Hi!', 'Hi!', 'Hi!', 'Hi!']

2.10 Exercise 10
In [12]: def many_hi(number, word="Hi"):
return ["{}!".format(word)] * number

hi_list = many_hi(3)
print(hi_list)
word_list = many_hi(4, word="Halloa")
print(word_list)

['Hi!', 'Hi!', 'Hi!']

['Halloa!', 'Halloa!', 'Halloa!', 'Halloa!']

2.11 Exercise 11
In [13]: seq_dict = {
"header1.1": "ATGCTAGCTAGCTAGCTACG",
"header1.2": "ACGTAGCTAGCTAGCAC",
"header2.1": "AGCTAGCTAGCTATTATCTACT"
}
seq_dict

Out[13]: {'header1.1': 'ATGCTAGCTAGCTAGCTACG',

'header1.2': 'ACGTAGCTAGCTAGCAC',
'header2.1': 'AGCTAGCTAGCTATTATCTACT'}

2.12 Exercise 12
In [14]: n = 1
for key in seq_dict.keys():
value = seq_dict[key]
print("Entry {} has header: {} and sequence: {}".format(n, key, value))
n += 1

4
Entry 1 has header: header1.1 and sequence: ATGCTAGCTAGCTAGCTACG
Entry 2 has header: header1.2 and sequence: ACGTAGCTAGCTAGCAC
Entry 3 has header: header2.1 and sequence: AGCTAGCTAGCTATTATCTACT

2.13 Exercise 13
In [15]: def load_fasta(filepath):

seqs = dict()
with open(filepath, 'r') as in_fh:
key = None
for line in in_fh:
line = line.rstrip()
if line.startswith('>'):
key = line[1:]
else:
seq = line
seqs[key] = seq
return seqs

fasta_dict = load_fasta("test.fa")
print(fasta_dict)

{'header1.1': 'ATGCTAGCTAGCTAGCTACG', 'header1.2': 'ACGTAGCTAGCTAGCAC', 'header2.1': 'AGCTAGCTA

2.14 Exercise 14
In [16]: def calcGC(seq_raw):
seq = seq_raw.upper() # Handle both lower- and upper- case
gc_count = seq.count("C") + seq.count("G")
at_count = seq.count("A") + seq.count("T")
return gc_count / (at_count + gc_count)

sequenceList = list(fasta_dict.values())

for sequence in sequenceList:

gc = calcGC(sequence)
print('{}: {}'.format(sequence, gc))

ATGCTAGCTAGCTAGCTACG: 0.5
ACGTAGCTAGCTAGCAC: 0.5294117647058824
AGCTAGCTAGCTATTATCTACT: 0.36363636363636365

5
3 Case study
3.1 Exercise 15
In [ ]:

3.2 Exercise 16
In [18]: in_fp = 'data/fungus.gff'
out_fp = 'data/fungus_renamed.gff'

with open(in_fp, 'r') as in_fh, open(out_fp, 'w') as out_fh:

lines = 0
for line in in_fh:
line = line.rstrip()
lines += 1
if line.startswith('#'):
print(line, file=out_fh)
else:
fields = line.split('\t')
fields[0] = 'scaffold_{}'.format(fields[0][3:])
merged_line = '\t'.join(fields)
print(merged_line, file=out_fh)

print('{} lines written to file {}'.format(lines, out_fp))

208425 lines written to file data/fungus_renamed.gff

3.3 Exercise 17
In [19]: def parse_fasta_to_dict(filepath):

seqs = dict()
with open(filepath, 'r') as in_fh:
key = None
for line in in_fh:
line = line.rstrip()
if line.startswith('>'):
key = line[1:]
else:
seq = line
seqs[key] = seq
return seqs

fasta_dict = parse_fasta_to_dict("data/fungusAssembly.single.fna")
print(len(fasta_dict.keys()))
print(len(fasta_dict['scaffold_10']))

6
2681
1262682

3.4 Exercise 18
In [20]: def get_feature_coordinates(annot_gff, target_pattern):

feature_coords_list = list()
with open(annot_gff, 'r') as in_fh:
for line in in_fh:
if not line.startswith('##'):
line = line.rstrip()
fields = line.split('\t')
chrom_nbr = fields[0]
id_type = fields[2]
start_pos = int(fields[3])
end_pos = int(fields[4])

if id_type == target_pattern:
feature_coords_list.append((chrom_nbr, start_pos, end_pos))
return feature_coords_list

feature_coords = get_feature_coordinates('data/fungus_renamed.gff', 'CDS')

print(feature_coords[0])
print(feature_coords[1])

('scaffold_1', 1335, 1582)

('scaffold_1', 1640, 1947)

3.5 Exercise 19
In [21]: def extract_features(genome_dict, feature_coords):

feature_dict = dict()
for coord_tuple in feature_coords:
scaffold = coord_tuple[0]
start_pos = coord_tuple[1]
end_pos = coord_tuple[2]

seq = genome_dict[scaffold][start_pos-1:end_pos]
feature_dict['>{} {}-{}'.format(scaffold, start_pos, end_pos)] = seq

return feature_dict

genome_path = 'data/fungusAssembly.single.fna'
annot_path = 'data/fungus_renamed.gff'

7
genome_dict = parse_fasta_to_dict(genome_path)
feature_coords = get_feature_coordinates(annot_path, 'CDS')
feature_fasta_dict = extract_features(genome_dict, feature_coords)

print(feature_fasta_dict[">scaffold_1 1335-1582"])
print(len(feature_fasta_dict[">scaffold_1 1335-1582"]))

ATGTGTAccccatcaatatcctcgcaaaatattcccagcgatgacaagagccgttcgagttcgcagggctccgaacgatgtggagagacctcgct
248

3.6 Exercise 20
In [22]: def write_fasta_dict(fasta_dict, out_fp):
with open(out_fp, 'w') as out_fh:
for header in sorted(fasta_dict.keys()):
seq = fasta_dict[header]
print('{}\n{}'.format(header, seq), file=out_fh)
print('Written {} entries to {}'.format(len(fasta_dict), out_fp))

write_fasta_dict(feature_fasta_dict, 'data/out_dict.fa')

Written 88016 entries to data/out_dict.fa

3.7 Exercise 21
In [23]: fasta_dict = dict()
with open('data/fungusAssembly.single.fna', 'r') as in_fh:
key = None
for line in in_fh:
line = line.rstrip()
if line.startswith('>'):
key = line[1:]
else:
seq = line
fasta_dict[key] = seq

feature_coords = list()
with open('data/fungus_renamed.gff', 'r') as in_fh:
for line in in_fh:
if not line.startswith('##'):
line = line.rstrip()
fields = line.split('\t')
chrom_nbr = fields[0]
id_type = fields[2]
start_pos = int(fields[3])

8
end_pos = int(fields[4])

if id_type == 'CDS':
feature_coords.append((chrom_nbr, start_pos, end_pos))

feature_dict = dict()
for coord_tuple in feature_coords:
scaffold = coord_tuple[0]
start_pos = coord_tuple[1]
end_pos = coord_tuple[2]

seq = genome_dict[scaffold][start_pos-1:end_pos]
feature_dict['>{} {}-{}'.format(scaffold, start_pos, end_pos)] = seq

with open('out_dict.fa', 'w') as out_fh:

for header in sorted(fasta_dict.keys()):
seq = fasta_dict[header]
print('{}\n{}'.format(header, seq), file=out_fh)

print(feature_fasta_dict[">scaffold_1 1335-1582"])
print(len(feature_fasta_dict[">scaffold_1 1335-1582"]))

ATGTGTAccccatcaatatcctcgcaaaatattcccagcgatgacaagagccgttcgagttcgcagggctccgaacgatgtggagagacctcgct
248

3.8 A less readable way to write it

Everyone to their own, but for me this is not a recommended way, as it becomes very difficult to
follow the logic for other people, or yourself two weeks later, or even yourself while writing it (I
struggled while writing this!)

In [24]: d = dict()
with open('data/fungusAssembly.single.fna', 'r') as file:
key = None
for x in file:
x = x.rstrip()
if x.startswith('>'):
key = x[1:]
else:
x1 = x
d[key] = x1

mylist = list()
with open('data/fungus_renamed.gff', 'r') as file:
for l in file:
if not l.startswith('##'):
l = l.rstrip()

9
f = l.split('\t')
c = f[0]
t = f[2]
p1 = int(f[3])
p2 = int(f[4])

if t == 'CDS':
mylist.append((c, p1, p2))

fd = dict()
for ct in mylist:
s = ct[0]
p1 = ct[1]
p2 = ct[2]

s2 = d[s][p1-1:p2]
fd['>{} {}-{}'.format(s, p1, p2)] = s2

with open('out_dict.fa', 'w') as file2:

for h in sorted(fd.keys()):
s = fd[h]
print('{}\n{}'.format(h, s), file=file2)

print(fd[">scaffold_1 1335-1582"])
print(len(fd[">scaffold_1 1335-1582"]))

ATGTGTAccccatcaatatcctcgcaaaatattcccagcgatgacaagagccgttcgagttcgcagggctccgaacgatgtggagagacctcgct
248

Lesson Plan COT 1 MIL
100% (2)
Lesson Plan COT 1 MIL
10 pages
The Holy Quran Russian
No ratings yet
The Holy Quran Russian
1,090 pages
Reflection Rubric
No ratings yet
Reflection Rubric
1 page
solutionsExerciseMaster11 23
No ratings yet
solutionsExerciseMaster11 23
13 pages
University of Mauritius
No ratings yet
University of Mauritius
9 pages
Lab 2
No ratings yet
Lab 2
7 pages
AI Practical MCA
No ratings yet
AI Practical MCA
21 pages
C:/Users/Rafe/Appdata/Local/Programs/Python/Python35-32/Scripts Object and Data Structures Basics
No ratings yet
C:/Users/Rafe/Appdata/Local/Programs/Python/Python35-32/Scripts Object and Data Structures Basics
16 pages
Group17 2
No ratings yet
Group17 2
9 pages
Artificial Intelligence Lab File
No ratings yet
Artificial Intelligence Lab File
10 pages
CSE160 Final 23wi Key
No ratings yet
CSE160 Final 23wi Key
10 pages
Collections
No ratings yet
Collections
7 pages
Lösungen Zu Den Exercises AI Python
No ratings yet
Lösungen Zu Den Exercises AI Python
26 pages
15CSL76 Students
No ratings yet
15CSL76 Students
18 pages
3final ML Lab Manual
No ratings yet
3final ML Lab Manual
17 pages
Aiml Sample Programs
No ratings yet
Aiml Sample Programs
20 pages
Aiml Programs
No ratings yet
Aiml Programs
24 pages
2nd Year
No ratings yet
2nd Year
83 pages
IDC306 Assignment 5 MS21009
No ratings yet
IDC306 Assignment 5 MS21009
4 pages
Computational and Systems Biology Assignment Help
100% (1)
Computational and Systems Biology Assignment Help
15 pages
AIML Manual - Merged
No ratings yet
AIML Manual - Merged
41 pages
p3 Python Project
No ratings yet
p3 Python Project
4 pages
Python
No ratings yet
Python
17 pages
AIML Lab Manual
No ratings yet
AIML Lab Manual
39 pages
ENEL2CM Assignment 2 (2025)
No ratings yet
ENEL2CM Assignment 2 (2025)
15 pages
DWM EXP 1 To 14 C - Merged - Compressed
No ratings yet
DWM EXP 1 To 14 C - Merged - Compressed
104 pages
Lab Manual: Spring 2021
No ratings yet
Lab Manual: Spring 2021
33 pages
Programs
No ratings yet
Programs
8 pages
Ada File
No ratings yet
Ada File
26 pages
Exam Programming Exercises
No ratings yet
Exam Programming Exercises
7 pages
BFS
No ratings yet
BFS
16 pages
CS Practical File
No ratings yet
CS Practical File
28 pages
Artificial Intelligence Lab Manual
No ratings yet
Artificial Intelligence Lab Manual
35 pages
Aimlf Lab Manual
No ratings yet
Aimlf Lab Manual
50 pages
13 Object Oriented Programming - Python Solutions 1.5 Documentation
No ratings yet
13 Object Oriented Programming - Python Solutions 1.5 Documentation
5 pages
AI - Programs KP Print
No ratings yet
AI - Programs KP Print
14 pages
AI and ML Lab Programs To Print
No ratings yet
AI and ML Lab Programs To Print
22 pages
AIML LAB - Merged
No ratings yet
AIML LAB - Merged
52 pages
AIML Lab - To Print
No ratings yet
AIML Lab - To Print
45 pages
Ref 1
No ratings yet
Ref 1
4 pages
Aiml Lab Exps
No ratings yet
Aiml Lab Exps
16 pages
01 07 FrequentWordsWithMismatchesSolution
No ratings yet
01 07 FrequentWordsWithMismatchesSolution
2 pages
CS3491-AIML Lab Manual
No ratings yet
CS3491-AIML Lab Manual
20 pages
Aiml Exp 2 Adarsh Pandey
No ratings yet
Aiml Exp 2 Adarsh Pandey
3 pages
Python Answers
No ratings yet
Python Answers
6 pages
2023 Data Analysis and Visualization Using Python
100% (2)
2023 Data Analysis and Visualization Using Python
9 pages
AI Print
No ratings yet
AI Print
14 pages
AIML Manual V1-6-83
No ratings yet
AIML Manual V1-6-83
78 pages
Iksha ' Nusandhan: Admission Batch: 2019
No ratings yet
Iksha ' Nusandhan: Admission Batch: 2019
17 pages
MATH Lab Final Code
No ratings yet
MATH Lab Final Code
10 pages
Agri Lab Manual Merged
No ratings yet
Agri Lab Manual Merged
25 pages
Python Cheatsheet 2
No ratings yet
Python Cheatsheet 2
4 pages
Q1: Conference Reviewing (20 PTS, 5 Pts Each) : M M M (I) (J) I J J I M (I) (J) - 1
No ratings yet
Q1: Conference Reviewing (20 PTS, 5 Pts Each) : M M M (I) (J) I J J I M (I) (J) - 1
9 pages
MIT6 006F11 ps4
No ratings yet
MIT6 006F11 ps4
5 pages
Python Code Examples
100% (1)
Python Code Examples
30 pages
CS Practical File
No ratings yet
CS Practical File
21 pages
Hints and Answers
No ratings yet
Hints and Answers
13 pages
Agri Lab Manual
No ratings yet
Agri Lab Manual
22 pages
Dy Ai Rec
No ratings yet
Dy Ai Rec
24 pages
IDP Lab Report (Saswat Mohanty - 1941012407 - CSE-D)
No ratings yet
IDP Lab Report (Saswat Mohanty - 1941012407 - CSE-D)
47 pages
Ge Sem II Dav Upc 2344001201 Sl. No. Qp. 2012 July 2023
No ratings yet
Ge Sem II Dav Upc 2344001201 Sl. No. Qp. 2012 July 2023
16 pages
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
150+ C Pattern Programs
From Everand
150+ C Pattern Programs
Hernando Abella
No ratings yet
Ca3 Es-Cs-201 Cse 2nd Semester
No ratings yet
Ca3 Es-Cs-201 Cse 2nd Semester
1 page
OpenEdge 12 Product Availability Guide
No ratings yet
OpenEdge 12 Product Availability Guide
20 pages
Discussions About Vandanam and Vanakkam
100% (1)
Discussions About Vandanam and Vanakkam
7 pages
Sample Paper
No ratings yet
Sample Paper
6 pages
FLED Manuscript 1
No ratings yet
FLED Manuscript 1
13 pages
Circular e Resource2024
No ratings yet
Circular e Resource2024
2 pages
Williams PDF
No ratings yet
Williams PDF
14 pages
Five Design Principles
No ratings yet
Five Design Principles
6 pages
A Linux Command Line Primer
No ratings yet
A Linux Command Line Primer
20 pages
B, Inggris Ukk.
No ratings yet
B, Inggris Ukk.
8 pages
Grade 2 Cause Effect B
No ratings yet
Grade 2 Cause Effect B
3 pages
Keller Protocol
No ratings yet
Keller Protocol
37 pages
p7 English Paper
No ratings yet
p7 English Paper
16 pages
Step by Step Guide LyncDebugTools - Snooper 2013
No ratings yet
Step by Step Guide LyncDebugTools - Snooper 2013
13 pages
Graduate Admissions Essays
67% (3)
Graduate Admissions Essays
6 pages
Thierens - Astrology in Mesopotamian Culture
100% (4)
Thierens - Astrology in Mesopotamian Culture
78 pages
Quanta G31a Dag31amb6d0 Y61x-6l Rev 1a
No ratings yet
Quanta G31a Dag31amb6d0 Y61x-6l Rev 1a
49 pages
Oracle DBA Syllabus
No ratings yet
Oracle DBA Syllabus
7 pages
גדל אשר באך - 701-800
No ratings yet
גדל אשר באך - 701-800
100 pages
IB - INS3283. Digital Marketing.2024
No ratings yet
IB - INS3283. Digital Marketing.2024
7 pages
Class 11 Winter Holidays Homework 202425
No ratings yet
Class 11 Winter Holidays Homework 202425
3 pages
Gbio 55 Lec Lesson 4
No ratings yet
Gbio 55 Lec Lesson 4
8 pages
0547 - s03 - RP - 3 SPEAKING2
No ratings yet
0547 - s03 - RP - 3 SPEAKING2
18 pages
Techniques in Selecting & Organizing Information
100% (2)
Techniques in Selecting & Organizing Information
18 pages
AxiDraw Guide v512
No ratings yet
AxiDraw Guide v512
86 pages
S.P.I.T.Polytechnic, Kurund: Classtest-1
No ratings yet
S.P.I.T.Polytechnic, Kurund: Classtest-1
2 pages
Eaton Guidespec Busway Low Voltage 26 25 00
No ratings yet
Eaton Guidespec Busway Low Voltage 26 25 00
7 pages

Function Solutions

Uploaded by

Function Solutions

Uploaded by

function_solutions

October 18, 2019

def add(nbr1, nbr2):

def add(nbr1, nbr2):

def calculate(nbr1, nbr2, operation):

['Hi!', 'Hi!', 'Hi!', 'Hi!', 'Hi!', 'Hi!']

['Hi!', 'Hi!', 'Hi!']

Out[13]: {'header1.1': 'ATGCTAGCTAGCTAGCTACG',

{'header1.1': 'ATGCTAGCTAGCTAGCTACG', 'header1.2': 'ACGTAGCTAGCTAGCAC', 'header2.1': 'AGCTAGCTA

for sequence in sequenceList:

with open(in_fp, 'r') as in_fh, open(out_fp, 'w') as out_fh:

print('{} lines written to file {}'.format(lines, out_fp))

208425 lines written to file data/fungus_renamed.gff

feature_coords = get_feature_coordinates('data/fungus_renamed.gff', 'CDS')

('scaffold_1', 1335, 1582)

Written 88016 entries to data/out_dict.fa

with open('out_dict.fa', 'w') as out_fh:

3.8 A less readable way to write it

with open('out_dict.fa', 'w') as file2:

You might also like