0% found this document useful (0 votes)

18 views4 pages

p3 Python Project

This program allows a user to input DNA sequences from a file and either find the consensus sequence or transcribe the sequences to RNA. It contains functions to load the sequences, count nucleotide frequencies, find the consensus, convert DNA to RNA, and output the results to a new file. The main function handles user input to select the option and calls the appropriate functions to analyze the sequences and write the output.

Uploaded by

Daniella Vargas

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views4 pages

p3 Python Project

Uploaded by

Daniella Vargas

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

You are on page 1/ 4

"""This program ask the user to enter a number of DNA sequences and finds the

consensus sequence. The ouput is the consensus.

Add the corresponding code to accomplish the requested tasks
"""

##### ADD YOUR NAME, Student ID, and Section number #######
# NAME: DANIELLA VARGAS FIGUEROA
# STUDENT ID:802228453
# SECTION:096
###########################################################

# The function load_data, it take as an argument, it input the DNA sequences, save
in the list and return the list
# a: is a number of sequences to be input

#Auxiliar functions

def valid_seq(seq):
isvalid = False
for s in list(seq):
if (s == 'A') or (s == 'C') or (s == 'T') or (s == 'G'):
isvalid = True
else:
isvalid = False
break
return isvalid

#the max_nuc() takes four inputs: the nucleotide frequencey in a colum, and returns
a list of two elements containing the nucleotide
#and its frequency in a column
def max_nuc(freq_a, freq_g, freq_c, freq_t):
if freq_a > freq_g and freq_a > freq_c and freq_a > freq_t:
return ["A", freq_a]
elif freq_g > freq_a and freq_g > freq_c and freq_g > freq_t:
return ["G", freq_g]
elif freq_c > freq_a and freq_c > freq_g and freq_c > freq_t:
return ["C", freq_c]
elif freq_t > freq_a and freq_t > freq_c and freq_t > freq_g:
return ["T", freq_t]

#########################
#the load_data() takes two inputs: the file name and returns one tuple (firts one
list of elements, and option (consesus or transcription)
def load_data(filename, option):
#assign variable and open file
lst = []
infile = open(filename, "r")
#read file
valid_length = None
for line in infile:
seq = line.rstrip("\n")
#Check if the sequence is valid and is the same length as the first one to
continue with program.
if valid_seq(seq) == True and (valid_length == len(seq)
or valid_length == None):
lst.append(seq)
if len(lst) == 1:
valid_length = len(lst[0])
result = (lst, option)
#Return result.
return result

# The function count_nucl_freq, it take arguments the load_data, contains the

frecuencies of the nucleotides for each column
# a: is a list of DNA sequences
def count_nucl_freq(a):
#create an empty list to store each letter's frequency
frequencies = []
#Use for loops to look for the frequency of each letter in each column.
for i in range(0, len(a[0])):
columnfrec = [0, 0, 0, 0]
for j in range(0, len(a)):
let = a[j][i]
if let == "A":
columnfrec[0] = columnfrec[0] + 1
elif let == "G":
columnfrec[1] = columnfrec[1] + 1
elif let == "C":
columnfrec[2] = columnfrec[2] + 1
else:
columnfrec[3] = columnfrec[3] + 1
#Append each Maximum frequency by column to the list frequencies.
frequencies.append(
max_nuc(columnfrec[0], columnfrec[1], columnfrec[2], columnfrec[3]))
#return list
return frequencies
# analyze the list by columns
# find nucleotide frecuencies
# you will decide what data type, from the ones already explained, works best for
your implementation
# return frecuencies

# The function find_consensus, it take arguments the count_nucl_freq and return a

consensus sequence
# a: is a you return in count_nucl_freq
def find_consensus(a):
#Open a new file to store the consesus string.
f = open("answer.txt", "w")
# Create an empty string to store the consensus.
consensusString = ""
#For loop to access each element in index 0 in the frequency list done before and
add it to the consensous string.
for element in a:
#print(element)
x = element[0]

consensusString = consensusString + x
#Write the Consensus inside the file.
f.write(consensusString)

# function convert_seqn it take one argument the dna sequences

def convert_seq(a):
#Create empty string to store converted DNA to RNA results
result = ""
#Iterate throught each DNA sequences and convert each letter.
for let in a:
if let == "A":
result += "U"
elif let == "T":
result += "A"
elif let == "C":
result += "G"
elif let == "G":
result += "C"
#Return string with converted RNA sequences.
return result

# convert dna to rna sequences

# return rna sequences

#function transcript_seq, it take one argument the list of sequences

def transcript_seq(a):
#Create an empty list to store converted RNA sequences.
rnaseq = []
file = open("answer.txt", "w")
#Iterate through DNA sequences and convert each sequence to RNA.
for seq in a:
rna = convert_seq(seq)
file.write(rna + "\n")
#Append converted RNA sequences to empty list.
rnaseq.append(rna)
#Return RNA sequences list.
return rnaseq

# Read list DNA sequences

# return list RNA Sequences

# The function main, your program to start and function calls and write new file
with consensus or transcription
def main():
filename = input("Write the name of the file: ")
print('Select option:')
print('1. Consensus Sequences')
print('2. Transcriptions Sequences')
option = int(input(""))
#Create while loop to only accept option one or two.
while option != 1 and option != 2:
print("Incorrect input. Only enter 1 or 2.")
option = int(input(""))
data = load_data(filename, option)
#Create the function calls according to the option the user inputs.
if data[1] == 1:
freq = count_nucl_freq(data[0])
cons = find_consensus(freq)
elif data[1] == 2:
# conv=convert_seq(data[0])
transcript = transcript_seq(data[0])

#ask the number DNA sequence

# contains the functions call
# function doesn't return anyting

if __name__ == "__main__":
main()

Bio Python 202111
No ratings yet
Bio Python 202111
63 pages
Techcorp iam platform implementation plan
No ratings yet
Techcorp iam platform implementation plan
14 pages
DWM EXP 1 to 14 C_merged_compressed
No ratings yet
DWM EXP 1 to 14 C_merged_compressed
104 pages
vertopal.com_bioinf575_hw07_dmeghana (1)
No ratings yet
vertopal.com_bioinf575_hw07_dmeghana (1)
34 pages
Computational and Systems Biology Assignment Help
100% (1)
Computational and Systems Biology Assignment Help
15 pages
MOOC Project Work - Sequence Analysis - Data Analysis With Python 2021
No ratings yet
MOOC Project Work - Sequence Analysis - Data Analysis With Python 2021
29 pages
AI and ML Lab Program
No ratings yet
AI and ML Lab Program
24 pages
CS Practical File
No ratings yet
CS Practical File
28 pages
SRS For Student Attendance System
100% (8)
SRS For Student Attendance System
15 pages
Lösungen Zu Den Exercises AI Python
No ratings yet
Lösungen Zu Den Exercises AI Python
26 pages
BECOB236 Code
No ratings yet
BECOB236 Code
10 pages
BT3040 - BIOINFORMATICS - Assignment 4: Question 1
No ratings yet
BT3040 - BIOINFORMATICS - Assignment 4: Question 1
9 pages
BIO Code Report
No ratings yet
BIO Code Report
6 pages
RIP-Tutorials-bioinformatics
No ratings yet
RIP-Tutorials-bioinformatics
19 pages
solutionsExerciseMaster11 23
No ratings yet
solutionsExerciseMaster11 23
13 pages
Function Solutions
No ratings yet
Function Solutions
10 pages
Assignment 1
No ratings yet
Assignment 1
5 pages
BINP16 Programming Exam 2016-10-25 Solutions
No ratings yet
BINP16 Programming Exam 2016-10-25 Solutions
5 pages
3. Sequence Comparison1
No ratings yet
3. Sequence Comparison1
25 pages
Python
No ratings yet
Python
9 pages
INFO390C DNDS Pset05
No ratings yet
INFO390C DNDS Pset05
9 pages
Group17 2
No ratings yet
Group17 2
9 pages
DSA Assignment 1-5
No ratings yet
DSA Assignment 1-5
20 pages
CSE 5370: Bioinformatics Homework 2: Due Thursday, February 24th, 2022 at 4:59PM CST
No ratings yet
CSE 5370: Bioinformatics Homework 2: Due Thursday, February 24th, 2022 at 4:59PM CST
3 pages
In-Linear-Time: Check This Web Site
No ratings yet
In-Linear-Time: Check This Web Site
4 pages
programs (1)
No ratings yet
programs (1)
8 pages
Code2pdf 6564f797c624e
No ratings yet
Code2pdf 6564f797c624e
2 pages
exam_programming_exercises
No ratings yet
exam_programming_exercises
7 pages
Shogun Method Derek Rake
13% (8)
Shogun Method Derek Rake
33 pages
University of Mauritius
No ratings yet
University of Mauritius
9 pages
CSE160-Final-23wi-key
No ratings yet
CSE160-Final-23wi-key
10 pages
Python_Basics_Exercises
No ratings yet
Python_Basics_Exercises
4 pages
with open
No ratings yet
with open
6 pages
EX-9 EXCEPTION
No ratings yet
EX-9 EXCEPTION
3 pages
p2 Python Project
No ratings yet
p2 Python Project
3 pages
IDC306_Assignment_5_MS21009
No ratings yet
IDC306_Assignment_5_MS21009
4 pages
Lab2
No ratings yet
Lab2
7 pages
02-11-22-Lab-5-MS21212.ipynb - Colaboratory
No ratings yet
02-11-22-Lab-5-MS21212.ipynb - Colaboratory
8 pages
HW 13
No ratings yet
HW 13
6 pages
Faculty of Engineering Ain Shams University Name: Ahmed Nashaat Hassanen Department: CESS Bioinformatics ID: 14P6016 Ass1
No ratings yet
Faculty of Engineering Ain Shams University Name: Ahmed Nashaat Hassanen Department: CESS Bioinformatics ID: 14P6016 Ass1
3 pages
Exam Sample Questions (1)
No ratings yet
Exam Sample Questions (1)
6 pages
PS1
No ratings yet
PS1
2 pages
Ass 2 Bioinformatics
No ratings yet
Ass 2 Bioinformatics
8 pages
Bio Lab 1 Set A
No ratings yet
Bio Lab 1 Set A
2 pages
solutionsExerciseMaster1 10
No ratings yet
solutionsExerciseMaster1 10
9 pages
python assignment
No ratings yet
python assignment
8 pages
Manual de Ejercicios de Python
No ratings yet
Manual de Ejercicios de Python
1 page
OBE Action Plan
No ratings yet
OBE Action Plan
2 pages
BioInfo2 Assignment - Python
No ratings yet
BioInfo2 Assignment - Python
11 pages
01 07 FrequentWordsWithMismatchesSolution
No ratings yet
01 07 FrequentWordsWithMismatchesSolution
2 pages
(Kay A. Robbins, Steve Robbins) UNIX Systems Progr Pratica
0% (1)
(Kay A. Robbins, Steve Robbins) UNIX Systems Progr Pratica
1,008 pages
Multi Head and Multi Tape Turing Machines-21-03-2024
No ratings yet
Multi Head and Multi Tape Turing Machines-21-03-2024
67 pages
AD Rooms and Challenges
No ratings yet
AD Rooms and Challenges
3 pages
Cucumber MCQ-3
No ratings yet
Cucumber MCQ-3
5 pages
Thesis Theme Wordpress Examples
100% (2)
Thesis Theme Wordpress Examples
8 pages
SOS Inventory User Guide PDF
No ratings yet
SOS Inventory User Guide PDF
190 pages
Vijaya Vittala Institute of Technology-1
No ratings yet
Vijaya Vittala Institute of Technology-1
4 pages
Rotary Ac Buyers Guide.2023.07
No ratings yet
Rotary Ac Buyers Guide.2023.07
17 pages
AWS Azure Google Cloud 22052023 053615pm
No ratings yet
AWS Azure Google Cloud 22052023 053615pm
26 pages
Specific Issues in Science, Technology, and Society: The Information Age
100% (1)
Specific Issues in Science, Technology, and Society: The Information Age
26 pages
MIDAS Heritage 2012 Version 1.1
No ratings yet
MIDAS Heritage 2012 Version 1.1
122 pages
RCE10!10!31 Extension Module
No ratings yet
RCE10!10!31 Extension Module
2 pages
Core Banking Solution
No ratings yet
Core Banking Solution
30 pages
Telenor B2B Generic Portfolio
No ratings yet
Telenor B2B Generic Portfolio
42 pages
DCW
No ratings yet
DCW
2 pages
Mathematics Written in Sand - : The hp-15C, Intel 8087, Etc
No ratings yet
Mathematics Written in Sand - : The hp-15C, Intel 8087, Etc
49 pages
Android Platform
No ratings yet
Android Platform
44 pages
15-441 Computer Networking: Lecture 5 - Ethernet
No ratings yet
15-441 Computer Networking: Lecture 5 - Ethernet
41 pages
Implementation Repeaters: Hardware of An Echo-Canceller For On-Channel
No ratings yet
Implementation Repeaters: Hardware of An Echo-Canceller For On-Channel
3 pages
Bulk Encryption On GPUs - AMD
No ratings yet
Bulk Encryption On GPUs - AMD
25 pages
Hostel Administration in Common Portal Abstract
No ratings yet
Hostel Administration in Common Portal Abstract
6 pages
Deploy Bots To Generate Leads and Sales
No ratings yet
Deploy Bots To Generate Leads and Sales
19 pages
POI Basic Networking
No ratings yet
POI Basic Networking
3 pages
MURATA Design Solutions
No ratings yet
MURATA Design Solutions
20 pages
Bitcoin Design
No ratings yet
Bitcoin Design
2 pages
Assignment 2 - OO Implementation
No ratings yet
Assignment 2 - OO Implementation
5 pages
Erp
No ratings yet
Erp
16 pages
Perl One-Liners: 130 Programs That Get Things Done
From Everand
Perl One-Liners: 130 Programs That Get Things Done
Peteris Krumins
4/5 (3)
Simplifying Data Science With Python
From Everand
Simplifying Data Science With Python
Billy David millican
No ratings yet
C Programming
From Everand
C Programming
Netra
No ratings yet
Lisp Interpreter in Rust
From Everand
Lisp Interpreter in Rust
Vishal Patil
1/5 (1)
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Oracle Certified Professional Java Programmer OCPJP 1Z0 809
From Everand
Oracle Certified Professional Java Programmer OCPJP 1Z0 809
Manish Soni
No ratings yet
PHP programming
From Everand
PHP programming
Nino Paiotta
No ratings yet
Introduction to PHP, Part 2, Second Edition
From Everand
Introduction to PHP, Part 2, Second Edition
Adam Majczak
No ratings yet
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Python: Advanced Guide to Programming Code with Python: Python Computer Programming, #4
From Everand
Python: Advanced Guide to Programming Code with Python: Python Computer Programming, #4
Charlie Masterson
No ratings yet
Python: Advanced Guide to Programming Code with Python
From Everand
Python: Advanced Guide to Programming Code with Python
Charlie Masterson
No ratings yet
Computer Engineering Laboratory Solution Primer
From Everand
Computer Engineering Laboratory Solution Primer
Karan Bhandari
No ratings yet
UNIX Shell Programming Interview Questions You'll Most Likely Be Asked
From Everand
UNIX Shell Programming Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet

p3 Python Project

Uploaded by

p3 Python Project

Uploaded by

"""This program ask the user to enter a number of DNA sequences and finds the

consensus sequence. The ouput is the consensus.

# The function count_nucl_freq, it take arguments the load_data, contains the

# The function find_consensus, it take arguments the count_nucl_freq and return a

# function convert_seqn it take one argument the dna sequences

# convert dna to rna sequences

#function transcript_seq, it take one argument the list of sequences

# Read list DNA sequences

#ask the number DNA sequence

You might also like