Koç University

COMP 125: Programming with Python

Homework #4
Deadline: December 27, 2020 at 23:59
Submission through: Blackboard
Make sure you read and understand every part of this document

This homework assignment contains 3 programming questions. Each question may contain
multiple parts.

Download Hw4.zip from Blackboard and unzip the contents to a convenient location on your
computer. The Python files (.py extension) contain starter code for the programming
questions. The remaining files are sample text/data files for the File I/O questions.

Solve each question in its own Python file. Do not change the names of the files. Do not
change the headers of the given functions (function names, function parameters).
When you are finished, see the end of this document for submission instructions.
● Your submitted code should run as is. It should not yield syntax errors.
● We will not edit or comment/uncomment parts of your code in order to fix syntax
errors.

Q1: Sparse Matrix Operations - 30 pts

A sparse matrix is a matrix where most of the elements are zero (i.e. it has very few
non-zero elements). They occur frequently in scientific computation and engineering
applications. A 5x5 example is given below:

These matrices can get quite large. To save on space, only their non-zero elements are
stored. There are multiple ways of storing these matrices. In this homework, you are going to
use dictionaries to store these matrices. Let $A_{N \times M}$ be a sparse matrix with N rows and M
columns. Let $A_{ij}$ be the element of A in the ith row and the jth column (i=0...N-1, j=0...M-1). Then
this sparse matrix should be represented as follows:
● Let sp_A be a Python dictionary that will store A
● The keys are tuples of the row (i) and column (j) indices of non-zero elements.
(sp_A[(i,j)] = $A_{ij}$, $A_{ij} \neq 0$)
● A special key (-1) stores the matrix dimensions (sp_A[-1]=[N,M])
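As a concrete illustration of this representation, the sketch below stores a small 3x4 matrix in the dictionary format. The matrix values and the helper name get_element are made up for this example and are not part of the starter code.

```python
# Hypothetical 3x4 matrix used only for illustration:
# [[0, 0, 7, 0],
#  [0, 0, 0, 0],
#  [2, 0, 0, 5]]
sp_A = {
    -1: [3, 4],   # special key: matrix dimensions [N, M]
    (0, 2): 7,    # A[0][2] = 7
    (2, 0): 2,    # A[2][0] = 2
    (2, 3): 5,    # A[2][3] = 5
}


def get_element(sp, i, j):
    """Return element (i, j); a missing key means the element is zero."""
    return sp.get((i, j), 0)


print(get_element(sp_A, 0, 2))  # stored non-zero element
print(get_element(sp_A, 1, 1))  # not stored, so it is zero
```

Only three of the twelve elements are stored, plus the dimensions entry.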
The starter code for this question is given to you in sparse_matrices.py. This question has
three parts:

Part 1: You will be given the matrices in a list of lists format. You need to convert them to the
aforementioned sparse representation. Fill in the dense_to_sparse function to complete
this part.
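One possible shape for this conversion is sketched below. The parameter name dense is an assumption, since the actual header is defined in sparse_matrices.py and must be kept as given there. The sketch also assumes every row has the same length.

```python
def dense_to_sparse(dense):
    """Convert a list-of-lists matrix to the dictionary representation."""
    n_rows = len(dense)
    n_cols = len(dense[0]) if n_rows > 0 else 0
    sp = {-1: [n_rows, n_cols]}  # special key holds the dimensions
    for i, row in enumerate(dense):
        for j, value in enumerate(row):
            if value != 0:       # store non-zero elements only
                sp[(i, j)] = value
    return sp


print(dense_to_sparse([[0, 0, 7], [2, 0, 0]]))
```

Reading the dense input necessarily visits every element once; the efficiency concern applies to the operations on the sparse form.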

Part 2: You are going to implement the matrix transpose operation (switching the row and
column indices of non-zero elements, $A_{ij} \leftarrow A_{ji}$) for the dictionary based sparse matrix
representation described in this question. Fill in the sparse_transpose function to
complete this part.
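A minimal sketch of the transpose on the dictionary form follows, assuming the function takes a single sparse matrix and returns a new dictionary (the real header is in the starter file). It loops over the stored entries only, never over full rows or columns.

```python
def sparse_transpose(sp_A):
    """Return the transpose of a dictionary-based sparse matrix."""
    n_rows, n_cols = sp_A[-1]
    sp_T = {-1: [n_cols, n_rows]}  # dimensions are swapped as well
    for key, value in sp_A.items():
        if key == -1:              # skip the dimensions entry
            continue
        i, j = key
        sp_T[(j, i)] = value       # swap row and column indices
    return sp_T


print(sparse_transpose({-1: [2, 3], (0, 2): 7, (1, 0): 2}))
```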

Part 3: The code for matrix multiplication for list of lists implementation is given to you. You
are going to implement matrix multiplication for the dictionary based sparse matrix
representation described in this question. Fill in the sparse_mat_mult function to
complete this part.

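Let $A_{N \times K}$, $B_{K \times M}$ be two matrices and $C_{N \times M} = AB$ their multiplication. The elements of
C are calculated as:

$$C_{ij} = \sum_{k=0}^{K-1} A_{ik} B_{kj}$$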

For all of the parts, follow the comments. Do not iterate over the entire rows and columns
for any of the parts! This will be inefficient and you will lose points. The function
sparse_mat_add is provided as an example for you.
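One way to multiply without scanning full rows and columns is sketched below. It assumes a two-argument header and raises ValueError on a dimension mismatch; both are assumptions, not taken from the starter code. The non-zeros of the second matrix are grouped by row index first, so each non-zero of the first matrix only meets the entries it can actually combine with.

```python
def sparse_mat_mult(sp_A, sp_B):
    """Multiply two dictionary-based sparse matrices (sketch)."""
    n, k_a = sp_A[-1]
    k_b, m = sp_B[-1]
    if k_a != k_b:
        raise ValueError("inner dimensions do not match")

    # Group the non-zero elements of B by their row index k.
    rows_of_B = {}
    for key, b_val in sp_B.items():
        if key == -1:
            continue
        k, j = key
        rows_of_B.setdefault(k, []).append((j, b_val))

    # Each non-zero A[i][k] contributes A[i][k] * B[k][j] to C[i][j].
    sp_C = {-1: [n, m]}
    for key, a_val in sp_A.items():
        if key == -1:
            continue
        i, k = key
        for j, b_val in rows_of_B.get(k, []):
            sp_C[(i, j)] = sp_C.get((i, j), 0) + a_val * b_val

    # Remove entries whose contributions cancelled out to zero.
    for key in [k for k, v in sp_C.items() if k != -1 and v == 0]:
        del sp_C[key]
    return sp_C


A = {-1: [2, 2], (0, 0): 1, (1, 1): 2}
B = {-1: [2, 2], (0, 1): 3, (1, 0): 4}
print(sparse_mat_mult(A, B))
```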

There are also other functions that may help you debug or that you can draw inspiration from.
Make sure to go over the sparse_matrices.py file, both comments and code, for your own benefit.

Q2: Protein Center of Mass Calculation - 35 pts
The Protein Data Bank archive serves as a single repository of information about the 3D
structures of proteins, nucleic acids and complex assemblies. Each PDB file contains
various kinds of information. The type of information is indicated in the first six characters of
each line, such as HEADER, SOURCE, COMPND, AUTHOR, REMARK, etc. For this
homework, you just need to concentrate on lines beginning with the word “ATOM”. An
example is given below, where the columns represent the atom record, atom number, atom
identifier, amino acid type, chain identifier, residue sequence number, x-coordinate (in Å),
y-coordinate (in Å), z-coordinate (in Å), occupancy, B-factor and element symbol,
respectively. The symbol Å denotes Angstrom (1 Å = $10^{-10}$ m).

[Figure: example ATOM records, with the X, Y, Z and ELEMENT columns labeled]

Implement the following functions:
A. pdb_parser(pdb_filename):
Receives one argument, pdb_filename. Opens the PDB file, reads all lines starting
with ATOM, and stores the X, Y, Z coordinates and the ELEMENT type of each atom in
a list, atoms.
e.g.
atoms = [[27.340, 24.430, 2.614, 'N'], [26.266, 25.413, 2.842, 'C'], ...]

The atoms list is of length N, the number of atoms that make up the protein in the PDB file.
The function should return atoms.
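A possible sketch of the parser is given below. It relies on the fixed-column layout of ATOM records in the PDB format (x in columns 31-38, y in 39-46, z in 47-54, element symbol in 77-78) rather than on splitting by whitespace; that choice is an assumption of this sketch, and splitting on whitespace may work equally well for 1ubq.pdb.

```python
def pdb_parser(pdb_filename):
    """Return [[x, y, z, element], ...] for every ATOM line in the file."""
    atoms = []
    with open(pdb_filename) as pdb_file:
        for line in pdb_file:
            if line.startswith("ATOM"):
                # Python slices are 0-based and end-exclusive, so
                # PDB columns 31-38 correspond to line[30:38], etc.
                x = float(line[30:38])
                y = float(line[38:46])
                z = float(line[46:54])
                element = line[76:78].strip()
                atoms.append([x, y, z, element])
    return atoms
```

Slicing by column keeps working even when two numeric fields touch each other without a separating space, which can happen with large coordinate values.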

B. center_of_mass(atoms):
This function calculates the center of mass of the protein, as follows:

$$r_{cm} = \frac{\sum_{i=1}^{N} m_i r_i}{\sum_{i=1}^{N} m_i}$$

$r_{cm}$ is the coordinates of the center of mass of the protein, $r_i$ is a list containing the [x, y, z]
coordinates of the ith atom, $m_i$ is the mass of the ith atom and N is the total number of
atoms. $m_i$ should be obtained from the dictionary mass={'C':12.01, 'O':16.00,
'H':1.008, ...} by using the element type of the ith atom as key. The mass dictionary is
already provided in the pdb.py template.
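The mass-weighted average can be sketched as follows. The mass dictionary here contains only the three entries quoted in the text, so it is a stand-in for the full dictionary in pdb.py; returning the result as a list [x, y, z] is also an assumption.

```python
# Stand-in for the mass dictionary from pdb.py (only the entries
# quoted in the assignment text are reproduced here).
mass = {'C': 12.01, 'O': 16.00, 'H': 1.008}


def center_of_mass(atoms):
    """Return the mass-weighted average position as [x, y, z]."""
    total_mass = 0.0
    weighted = [0.0, 0.0, 0.0]       # running sums of m_i * r_i
    for x, y, z, element in atoms:
        m = mass[element]            # look up m_i by element symbol
        total_mass += m
        weighted[0] += m * x
        weighted[1] += m * y
        weighted[2] += m * z
    return [w / total_mass for w in weighted]


# Two carbon atoms: the center of mass lies halfway between them.
print(center_of_mass([[0.0, 0.0, 0.0, 'C'], [2.0, 0.0, 0.0, 'C']]))
```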

C. shift(atoms, vec):
This function translates the protein by vec and returns the updated coordinates of the
protein, atomsnew. vec is a list of size 3.
E.g. vec=[a, b, c], atoms=[[x, y, z, 'C']] -> atomsnew=[[x+a, y+b, z+c, 'C']]
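A short sketch of the translation is shown below. Building a new list instead of modifying atoms in place is a design choice of this sketch, not something the assignment states.

```python
def shift(atoms, vec):
    """Return a new atoms list with every coordinate translated by vec."""
    a, b, c = vec
    return [[x + a, y + b, z + c, element] for x, y, z, element in atoms]


atoms = [[1.0, 2.0, 3.0, 'C']]
print(shift(atoms, [1.0, 1.0, 1.0]))
```

To move the center of mass to the origin, vec would be the negated center of mass, for example [-cx, -cy, -cz] for a center of mass [cx, cy, cz].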

After implementing these functions, demonstrate them as follows:

i) Read the 1ubq.pdb file and parse the information.
ii) Calculate the center of mass.
iii) Shift the coordinates of the protein such that the molecule’s center of mass is at
the origin.

The sample output from the program should be:

Q3: Course Scheduling - 35 pts
In this question, you will practice File I/O with multiple input files. In particular, you are given
three files related to the scheduling of classes at Koç University in Spring 2020:
● instructors.txt: contains the names, IDs, and instructors of courses
● locations.txt: contains the physical location (room) of each course
● times.txt: contains the timeslot (day, start time, end time) of each course

The starter code for this question is given in CourseScheduling.py. Implement the empty
functions in the starter code to solve the following parts.

PART A: Write a function called read_schedules() that reads course schedules from the 3
given files and stores them in a dictionary of dictionaries:
data = { "COMP110": { "Name": "Introduction to Programming with Matlab",
                      "Instructor": "Emre Kutukoglu",
                      "Location": "SCI103",
                      "Days": "TuTh",
                      "Start Time": "8:30",
                      "End Time": "9:45"},
         "COMP125": { "Name": "Programming with Python",
                      "Instructor": "Ayca Tuzmen",
                      "Location": "ENGZ50",
                      "Days": "TuTh",
                      "Start Time": "13:00",
                      "End Time": "14:15"},
         ...
       }

For the outer dictionary data, the key should be the course ID (e.g.: “COMP110”,
“COMP125”, “ELEC204”) and the value should be a dictionary. For the inner dictionary, the
keys should be “Name”, “Instructor”, “Location”, “Days”, “Start Time”, “End Time”; and the
values should be the corresponding information for that course as shown above.

The return value of read_schedules() should be the main dictionary data.

Hint: You should first read instructors.txt when creating the outer dictionary data, and later
populate the location and time details using locations.txt and times.txt.
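The merging order described in the hint can be sketched as below. The assignment text does not show the layout of the three files, so this sketch ASSUMES comma-separated lines of the form "ID,Name,Instructor", "ID,Location" and "ID,Days,Start,End"; check the actual files before relying on it. The helper name merge_schedule_lines is hypothetical; it takes any iterable of lines (a list or an open file) so that it can be tried without the real files.

```python
def merge_schedule_lines(instructor_lines, location_lines, time_lines):
    """Build the dictionary of dictionaries from three line sources.

    ASSUMED formats (not confirmed by the assignment text):
      instructors: ID,Name,Instructor
      locations:   ID,Location
      times:       ID,Days,Start,End
    """
    data = {}
    # First pass: create one inner dictionary per course.
    for line in instructor_lines:
        if not line.strip():
            continue
        course_id, name, instructor = line.strip().split(",")
        data[course_id] = {"Name": name, "Instructor": instructor}
    # Second pass: add locations for courses that have one.
    for line in location_lines:
        if not line.strip():
            continue
        course_id, location = line.strip().split(",")
        if course_id in data:
            data[course_id]["Location"] = location
    # Third pass: add timeslots for courses that have one.
    for line in time_lines:
        if not line.strip():
            continue
        course_id, days, start, end = line.strip().split(",")
        if course_id in data:
            data[course_id]["Days"] = days
            data[course_id]["Start Time"] = start
            data[course_id]["End Time"] = end
    return data


demo = merge_schedule_lines(
    ["COMP125,Programming with Python,Ayca Tuzmen\n"],
    ["COMP125,ENGZ50\n"],
    ["COMP125,TuTh,13:00,14:15\n"],
)
print(demo)
```

In this sketch a course without a location or timeslot simply lacks those keys; how missing values should be represented is not specified in the text.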

PART B: Some courses included in instructors.txt are missing location assignments or time
assignments, i.e., they are included in instructors.txt but not in locations.txt or times.txt. Write
a function find_unscheduled(data) that:
● Takes as input the main dictionary data constructed above
● Has two return values:
○ First return value is the list of courses that have no location
■ Example: [“COMP306”, “INDR460”]
○ Second return value is the list of courses that have no timeslot
■ Example: [“ELEC422”, “MECH435”, …]
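A sketch of this check follows. How a missing location or timeslot is represented in data depends on how Part A is written, so the sketch treats an absent key, None and an empty string all as "missing"; that convention is an assumption.

```python
def find_unscheduled(data):
    """Return (courses without a location, courses without a timeslot)."""
    no_location = []
    no_time = []
    for course_id, info in data.items():
        # .get() returns None for an absent key; "" is also falsy.
        if not info.get("Location"):
            no_location.append(course_id)
        if not (info.get("Days") and info.get("Start Time")
                and info.get("End Time")):
            no_time.append(course_id)
    return no_location, no_time


sample = {
    "COMP125": {"Location": "ENGZ50", "Days": "TuTh",
                "Start Time": "13:00", "End Time": "14:15"},
    "COMP306": {"Days": "MoWe", "Start Time": "10:00", "End Time": "11:15"},
    "ELEC422": {"Location": "SCI103"},
}
print(find_unscheduled(sample))
```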

PART C: Write a function clean_schedule(data, courses_to_remove) such that:
● data is the main dictionary constructed in Part A
● courses_to_remove is the list of courses that should be removed from data
○ Example: [“ELEC422”, “MECH435”, …]
● Your function should return the resulting dictionary after the courses are removed
from it

Caution: This function must return the resulting dictionary; do not just modify the dictionary
in-place.
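One way to satisfy the caution is to build and return a new dictionary, as in this sketch. Note that the copy is shallow: the inner course dictionaries are shared with the original data.

```python
def clean_schedule(data, courses_to_remove):
    """Return a new dictionary without the listed courses."""
    to_remove = set(courses_to_remove)  # fast membership tests
    return {course_id: info
            for course_id, info in data.items()
            if course_id not in to_remove}


original = {"COMP125": {"Instructor": "Ayca Tuzmen"},
            "ELEC422": {"Instructor": "(hypothetical)"}}
cleaned = clean_schedule(original, ["ELEC422"])
print(cleaned)
print(len(original))  # the original still has both courses
```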

PART D: Write a function find_instructor(data, courseID) such that:

● data is the main dictionary constructed in Part A
● courseID is a string containing the ID of one course
○ Example: courseID = “COMP125” or courseID = “MECH433”
● Your function should return the instructor who is teaching that course
○ If the given courseID does not exist in data, your function should return the
following string: “NA”
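The lookup with its fallback value can be sketched as:

```python
def find_instructor(data, courseID):
    """Return the instructor of courseID, or "NA" if it is not in data."""
    if courseID in data:
        return data[courseID]["Instructor"]
    return "NA"


data = {"COMP125": {"Instructor": "Ayca Tuzmen"}}
print(find_instructor(data, "COMP125"))
print(find_instructor(data, "MECH433"))
```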

PART E: Write a function find_subj_courses(data, subject) such that:

● data is the main dictionary constructed in Part A
● subject is the 4-letter subject area
○ Example: subject = “COMP” or subject = “ELEC” or subject = “MECH”
● Your function should return the list of all courses from that subject area
○ Example: [“COMP110”, “COMP125”, “COMP131”, …, “COMP437”]
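Since the subject is the first four letters of the course ID, the filter can be sketched as below. The result follows the insertion order of data; whether a particular order is required is not stated in the text.

```python
def find_subj_courses(data, subject):
    """Return the IDs of all courses whose first 4 letters match subject."""
    return [course_id for course_id in data if course_id[:4] == subject]


data = {"COMP110": {}, "ELEC204": {}, "COMP125": {}}
print(find_subj_courses(data, "COMP"))
```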

PART F: Write a function build_schedule(data, courses) such that:

● data is the main dictionary constructed in Part A
● courses is a list containing course IDs that a student is interested in taking
○ Example: courses = [“COMP125”, “MECH301”, “COMP305”, “ELEC301”]
● File output: Your function should create and write to a file called “student.txt” the
schedule of this student. For the above example, contents of student.txt will be:
COMP125,ENGZ50,TuTh,13:00,14:15
MECH301,SOSZ27,MoWe,10:00,11:15
COMP305,SOSB07,MoWe,8:30,9:45
ELEC301,SNA159,MoWe,13:00,14:15
● If the given courses have a time conflict, i.e., they overlap, then the student cannot
take the given list of courses simultaneously. In this case, your program should write
only the following to student.txt (course IDs, locations, times should not be written):
** Time conflict **
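The conflict check and file output can be sketched as follows. The helper names to_minutes, split_days and has_conflict are hypothetical. The sketch assumes that day strings are made of two-letter codes (as in "TuTh" and "MoWe"), that every requested course exists in data with complete information, and that a course ending exactly when another starts does not count as a conflict.

```python
def to_minutes(clock):
    """Convert 'H:MM' or 'HH:MM' to minutes since midnight."""
    hours, minutes = clock.split(":")
    return int(hours) * 60 + int(minutes)


def split_days(days):
    """Split e.g. 'TuTh' into {'Tu', 'Th'} (assumes two-letter codes)."""
    return {days[i:i + 2] for i in range(0, len(days), 2)}


def has_conflict(a, b):
    """True if two course records share a day and their times overlap."""
    if not split_days(a["Days"]) & split_days(b["Days"]):
        return False
    return (to_minutes(a["Start Time"]) < to_minutes(b["End Time"])
            and to_minutes(b["Start Time"]) < to_minutes(a["End Time"]))


def build_schedule(data, courses):
    """Write the student's schedule, or the conflict message, to student.txt."""
    # Compare every pair of requested courses once.
    conflict = any(
        has_conflict(data[courses[i]], data[courses[j]])
        for i in range(len(courses))
        for j in range(i + 1, len(courses))
    )
    with open("student.txt", "w") as out:
        if conflict:
            out.write("** Time conflict **\n")
        else:
            for course_id in courses:
                c = data[course_id]
                out.write(",".join([course_id, c["Location"], c["Days"],
                                    c["Start Time"], c["End Time"]]) + "\n")
```

Times are converted to minutes before comparing because comparing the raw strings would order "8:30" after "13:00".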

Submission and Grading
Solve each question in its own file. Do not change the names of the files. Do not change
the headers of the given functions (function names, function parameters).

When you are finished, compress your Hw4 folder containing all of your answers. The result
should be a SINGLE compressed file (file extension: .zip, .rar, .tar, .tar.gz, or .7z). Upload
this compressed file to Blackboard.

Follow the instructions, input/output formats, and return values closely. Your code may be
graded by an autograder, which means any inconsistency will be automatically penalized.

You may receive 0 if one or more of the following is true:

● You do not follow the submission instructions, i.e., you submit something other than a
single compressed file.
● Your compressed file does not contain your Python files or the file names are wrong.
● Your compressed file is corrupted.
○ After you submit, you should download your submission from
Blackboard to make sure it is not corrupted and it has the latest version
of your code.
● Your code contains syntax errors.
● Your code does not run without errors the way that it is submitted.
○ We should not have to open your file and comment/uncomment parts of it in
order to run your code.
● Your code does not terminate, e.g., it contains an infinite loop.

You are only going to be graded based on your Blackboard submission. We will not accept
homework via e-mail or other means.

Best of luck and happy coding!
