0% found this document useful (0 votes)

16 views9 pages

Bio 9

Uploaded by

Canada ??

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views9 pages

Bio 9

Uploaded by

Canada ??

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 9

Bioinformatics (week 1)

what is programming?
a way to instruct the computer to perform certain tasks. The instructions are what we define as
programs. In summary, we want the computer to run specific tasks and we need to learn how to
generate such instructions.
Why do we want the computer to run the tasks?
Because computers are:
1. fast,
2. cheaper than the time it will take a human to perform certain tasks,
3. can work 24 hours.
Writing a paragraph of instructions, it was renamed coding. We generate a to-do list for the
computer.

Why R?
-R was assigned originally by Ross Ihaka and Robert Gentleman.
-R was assigned originally for a statistical analysis.
-R was assigned as an interpret language.
That means we can run the code without a compiler. Interpret means we can run the code, our
orders line by line. Compile means we need to write the entire program and run the entire
program.

In R, what we will need is a common line interpreter. We write the code, we write the order in a
line, someone has to interpret and transfer the orders to the computer. Few last things. R is a very
popular sourceful environment. It's open source, it's under a general public license, and it's
written primarily in C and Fortran.

What is RStudio
Rstudio is an Integrated Development Environment, IDE, for R. An IDE is a software
application that provides comprehensive facilities for software development to compute the
parameters. Provide an environment
Data type:
1-Vector: concatenating values of the same type.
Syntax:
Vec1 <- c(1,2,3)

2-List: If we want to have different types, then we will be using what we call lists.
Syntax:
L1 <- list(‘a’,’b’)

3-matrix, what do we need to provide? (Same type)

We need to provide: the name of the matrix.
the number of rows.
the number of columns and what are the values
Syntax:
M1 <- matrix(fill, nrows, ncolums)
M1 <- matrix(0, 2, 3)

4- DataFrames.
You can think of DataFrames as a collection of columns, where every column can be a vector of
different types.
df1 <- data.frame(col1 = v1, col2 = v2, col3 = v3, name = v4)

5-Factors are list of predefined set of values

# Initial data vector
data.vec <- c("small", "small", "medium", "large", "medium")
class(data.vec)

# Convert data to factor, all levels are considered equal, i.e. no order
data.factor <- factor(data.vec)

another function:
class ()
head()
dim()
nrow()
ncol()
length()
control flow:
# Basic syntax

if (x1 < 10) {print("A")}

# Basic syntax: else

if (x1 < 10) {print("A")}

else {print("B")}

## Switch-statements

```{r}

colorMapper <- function(x) {

switch(x,
red = "#FF0000",
green = "#00FF00",
blue = "#0000FF",
stop("Invalid color name")
)
}

colorMapper('red')
colorMapper('tree')

## For-loops
# Create dummy matrix
mat <- matrix(
data = rnorm(20),
nrow = 5,
ncol = 4,
dimnames = list(NULL, c('col1', 'col2', 'col3', 'col4')))

# Initialize result vector. We know how large the result is.

means <- vector("list", ncol(mat))

# Iterate over matrix columns and populate result

for (i in 1:ncol(mat)) {
means[[i]] <- mean(mat[,i])
}

```

## While-loops

```{r}

# Initialize a vector of 0
items <- vector('numeric', length = 3)

# Add a vector of random numbers to the initial vector, until the total sum is
larger 10
# the total number of iterations is not known beforehand

iter <- 0
while(sum(items) < 10) {
iter <- iter + 1
items <- items + rnorm(length(items))
}
iter
items

Week 3
Python
Lists:
- Lists are ordered collections of elements in Python.
- Elements in a list can be of different types, such as numbers, strings, Boolean values, or even
other lists.
- Lists can be modified by adding, removing, or changing elements.
- List elements can be accessed using indexing, starting from 0.
- Negative indexing can be used to access elements from the end of the list.
- Slicing can be used to access a subset of elements in a list.
- Functions like `len()` and methods like `reverse()` and `sort()` can be used to manipulate lists.

Tuples:
- Tuples are similar to lists, but they are immutable, meaning they cannot be modified once
created.
- Tuples are created using round brackets instead of square brackets.
- Tuples can contain elements of different types, similar to lists.
- Tuple elements can be accessed using indexing, starting from 0.

Dictionaries:
- Dictionaries are unordered collections of key-value pairs.
- Each element in a dictionary consists of a key and its corresponding value, separated by a colon
(:).
- Dictionaries are defined using curly braces ({}) and can be empty or contain elements.
- Keys in a dictionary must be unique, but values can be duplicated.
- Dictionary elements are not accessed by position, but rather by their keys.
- The `keys()` method can be used to retrieve all the keys in a dictionary.
- Dictionaries can be modified by adding, updating, or deleting key-value pairs.
- Adding a new element involves assigning a value to a new key.
- Updating an element involves assigning a new value to an existing key.
- Deleting an element can be done using the `del` keyword followed by the key.

Sets:
- Sets are unordered collections of unique elements.
- Sets are defined using curly braces ({}) or the `set()` function.
- Sets automatically remove duplicate values, so each element appears only once.
- Sets are useful for operations such as intersection, union, and difference.
- The `union()` method or the `|` operator can be used to find the union of two sets.
- The `intersection()` method or the `&` operator can be used to find the intersection of two sets.
- The `difference()` method or the `-` operator can be used to find the difference between two
sets.
- The `symmetric_difference()` method or the `^` operator can be used to find elements that are
in either set, but not in both.

Week 4

Representing scRNA-seq experiments in Python

- Single-cell RNA-seq data in Python is typically stored within an annotated data object.
- The main component of the data object is the X matrix, which contains the gene expression
values. Cells are represented as rows, and genes as columns. Multiple layers are possible within
X, such as raw counts and unnormalized data.
- The obs data frame contains information specific to each cell, such as donor name,
experimental group, and other cell-related metadata.
- The var data frame contains information specific to each gene, such as the chromosome where
the gene is located or other gene-related metadata.
- The obsm data frame typically contains additional encodings for the gene expression values,
such as the new coordinates of each cell in a principal component analysis (PCA) space.
- The obsp data frame contains information about pairwise similarities between cells, indicating
how similar each cell is to all the others.
- The varp data frame is similar to obsp but contains information about pairwise similarities
between genes.

What is programming?
a) A way to instruct the computer to perform certain tasks.
b) A method of organizing data.
c) A process of analyzing biological sequences.
d) A technique for visualizing complex data.

Why do we want the computer to run tasks?

a) Computers are fast.
b) Computers are cheaper than human labor.
c) Computers can work 24 hours.
d) All of the above.

Why was R originally assigned as an interpreted language?

a) It allows running the code without a compiler.
b) It has better performance compared to compiled languages.
c) It requires less memory to execute programs.
d) It supports parallel processing.

What is RStudio?
a) An integrated development environment (IDE) for R.
b) A programming language used for statistical analysis.
c) A database management system.
d) A web-based server for running R scripts.

Which data type in R is used for concatenating values of the same type?
a) Vector.
b) List.
c) Matrix.
d) DataFrame.

What is a list in Python?

a) An ordered collection of elements.
b) An immutable data structure.
c) A key-value pair.
d) A set of unique elements.

How are elements accessed in a list?

a) Using indexing, starting from 1.
b) Using indexing, starting from 0.
c) Using negative indexing.
d) Using slicing.

What is a dictionary in Python?

a) An ordered collection of elements.
b) A mutable data structure.
c) A key-value pair.
d) A set of unique elements.

Which set operation can be used to find the union of two sets in Python?
a) |
b) &
c) -
d) ==

How are single-cell RNA-seq experiments represented in Python?

a) Using a list of lists.
b) Using a dictionary of lists.
c) Using an annotated data object.
d) Using a matrix of values.

What is the purpose of the X matrix in single-cell RNA-seq data representation?

a) It contains gene expression values.
b) It stores cell-related metadata.
c) It represents pairwise similarities between cells.
d) It encodes gene expression values in a PCA space.

What information is typically stored in the obs and var data frames in single-cell RNA-seq data
representation?
a) Cell-related metadata and gene-related metadata, respectively.
b) Gene-related metadata and cell-related metadata, respectively.
c) Pairwise similarities between cells and genes, respectively.
d) Gene expression values and cell expression values, respectively.

What is the role of the obsm data frame in single-cell RNA-seq data representation?
a) Storing pairwise similarities between cells.
b) Storing pairwise similarities between genes.
c) Storing additional encodings for gene expression values.
d) Storing additional encodings for cell expression values.
Which R function is used to determine the class or data type of an object?
a) class()
b) type()
c) typeof()
d) dtype ()

In R, which function is used to display the first few rows of a data frame?
a) head()
b) tail()
c) first()
d) top()

What is the purpose of an if-else statement in R?

a) To perform a specific action based on a condition.
b) To iterate over a sequence of values.
c) To define a custom function.
d) To handle exceptions and errors.

How are elements added to a list in Python?

a) Using the add() function.
b) Using the insert() function.
c) Using the append() function.
d) Using the extend() function.

What is the key difference between a list and a tuple in Python?

a) Lists are mutable, while tuples are immutable.
b) Lists can store elements of different data types, while tuples cannot.
c) Lists have a fixed length, while tuples can have a variable length.
d) Lists are ordered, while tuples are unordered.

Which method is used to remove an element from a dictionary in Python?

a) remove()
b) pop()
c) delete()
d) discard()

What is the purpose of the intersection operation on sets in Python?

a) To find the common elements between two sets.
b) To combine the elements of two sets.
c) To find the unique elements in two sets.
d) To remove the common elements between two sets.
How is gene expression data typically represented in a single-cell RNA-seq experiment?
a) As a matrix with cells as rows and genes as columns.
b) As a matrix with genes as rows and cells as columns.
c) As a list of gene expression values.
d) As a dictionary with cells as keys and gene expression values as values.

What does the var data frame represent in single-cell RNA-seq data representation?
a) Cell-related metadata.
b) Gene-related metadata.
c) Pairwise similarities between cells.
d) Pairwise similarities between genes.

What is the significance of the obsp data frame in single-cell RNA-seq data representation?
a) It represents gene expression values in a PCA space.
b) It stores cell-related metadata.
c) It stores pairwise similarities between cells.
d) It stores pairwise similarities between genes.

BS en 50171-2001
100% (5)
BS en 50171-2001
24 pages
ISACA - CISA.v2022-03-15.q144: Show Answer
No ratings yet
ISACA - CISA.v2022-03-15.q144: Show Answer
35 pages
Lisp Interpreter in Rust
From Everand
Lisp Interpreter in Rust
Vishal Patil
1/5 (1)
W2 Advanced Data Structures, IO & Control
No ratings yet
W2 Advanced Data Structures, IO & Control
44 pages
DataScience Unit 2
No ratings yet
DataScience Unit 2
45 pages
R Programming Lab Manual
0% (1)
R Programming Lab Manual
16 pages
Cheat Sheet
No ratings yet
Cheat Sheet
2 pages
CS605 Da
No ratings yet
CS605 Da
21 pages
MDA File
No ratings yet
MDA File
37 pages
R Programming Lab
No ratings yet
R Programming Lab
33 pages
Practical Programs
No ratings yet
Practical Programs
29 pages
Chapter 2 Introduction To R and Python
No ratings yet
Chapter 2 Introduction To R and Python
35 pages
302 SM and Da (Unit 3 4 5)
No ratings yet
302 SM and Da (Unit 3 4 5)
47 pages
Unit 2 Notes - Data Analysis Using R
No ratings yet
Unit 2 Notes - Data Analysis Using R
19 pages
R Programming-Chapiter 4
No ratings yet
R Programming-Chapiter 4
16 pages
Sds 02
No ratings yet
Sds 02
39 pages
MA214 Lab 1 Part 1
No ratings yet
MA214 Lab 1 Part 1
44 pages
R
No ratings yet
R
13 pages
Cheat Sheet
No ratings yet
Cheat Sheet
2 pages
Data Mining Lab 2
No ratings yet
Data Mining Lab 2
15 pages
STATS LAB Basics of R PDF
No ratings yet
STATS LAB Basics of R PDF
77 pages
R Basics PDF
No ratings yet
R Basics PDF
10 pages
R Reference Card
No ratings yet
R Reference Card
6 pages
R Reference Card
No ratings yet
R Reference Card
6 pages
Introduction To R
No ratings yet
Introduction To R
23 pages
Basics of R
No ratings yet
Basics of R
12 pages
Rprograms CSE
No ratings yet
Rprograms CSE
26 pages
2 Undefined
No ratings yet
2 Undefined
86 pages
R Project
0% (1)
R Project
25 pages
Python
No ratings yet
Python
20 pages
QB Samplealllllll Hemu
No ratings yet
QB Samplealllllll Hemu
19 pages
Bda. Unit. 5
No ratings yet
Bda. Unit. 5
27 pages
An Introduction To R: Biostatistics 615/815
No ratings yet
An Introduction To R: Biostatistics 615/815
59 pages
Introduction To Analytics and R File
No ratings yet
Introduction To Analytics and R File
29 pages
Summer 2024 Examination Model Answer Only For The Use of RAC Assessors Subject Name: Programming With Python Subject Code
No ratings yet
Summer 2024 Examination Model Answer Only For The Use of RAC Assessors Subject Name: Programming With Python Subject Code
130 pages
Summer 2024 Examination Model Answer Only For The Use of RAC Assessors Subject Name: Programming With Python Subject Code
No ratings yet
Summer 2024 Examination Model Answer Only For The Use of RAC Assessors Subject Name: Programming With Python Subject Code
19 pages
R Programming
No ratings yet
R Programming
22 pages
Module 4 - Writing Functions in Python
No ratings yet
Module 4 - Writing Functions in Python
20 pages
Untitled
No ratings yet
Untitled
59 pages
Part I: Introductory Materials: Introduction To R
No ratings yet
Part I: Introductory Materials: Introduction To R
25 pages
R Programming Notes
No ratings yet
R Programming Notes
23 pages
R Programming
No ratings yet
R Programming
60 pages
Data Science Lab Manual
No ratings yet
Data Science Lab Manual
40 pages
Unit 2
No ratings yet
Unit 2
47 pages
Me-I 2022 - ML Lab
No ratings yet
Me-I 2022 - ML Lab
28 pages
Introduction To R - Part1
No ratings yet
Introduction To R - Part1
34 pages
Wa0003.
No ratings yet
Wa0003.
9 pages
Basic R Programming
No ratings yet
Basic R Programming
16 pages
R-1ST Internal-Lab Notes
No ratings yet
R-1ST Internal-Lab Notes
14 pages
R Programming PDF
No ratings yet
R Programming PDF
128 pages
R Programming PDF
No ratings yet
R Programming PDF
128 pages
Matlab Tutorial
No ratings yet
Matlab Tutorial
25 pages
R Programming Language: History
No ratings yet
R Programming Language: History
20 pages
18 3 24 Upto Week 6 A B Latest 1
No ratings yet
18 3 24 Upto Week 6 A B Latest 1
25 pages
Functions Vs Scripts and Datasets
No ratings yet
Functions Vs Scripts and Datasets
25 pages
20222MBA0121 - Isha Sahu
No ratings yet
20222MBA0121 - Isha Sahu
2 pages
Basic R Tutorial
No ratings yet
Basic R Tutorial
56 pages
Learn C++
From Everand
Learn C++
Durgesh
4.5/5 (9)
Mastering Data Structures and Algorithms in C and C++
From Everand
Mastering Data Structures and Algorithms in C and C++
Sachin Naha
No ratings yet
Algorithms and Data Structures: An Easy Guide to Programming Skills
From Everand
Algorithms and Data Structures: An Easy Guide to Programming Skills
Rigdon Jonathan
No ratings yet
Introduction to PHP, Part 2, Second Edition
From Everand
Introduction to PHP, Part 2, Second Edition
Adam Majczak
No ratings yet
Coding Interview Questions and Answers
From Everand
Coding Interview Questions and Answers
Chinmoy Mukherjee
No ratings yet
DS243 Full Mark
No ratings yet
DS243 Full Mark
4 pages
Summary Sample
No ratings yet
Summary Sample
1 page
Step 2 Document Outlining (3 Points)
No ratings yet
Step 2 Document Outlining (3 Points)
1 page
ENG 103 Assignment 2.
No ratings yet
ENG 103 Assignment 2.
3 pages
Reflection
No ratings yet
Reflection
1 page
ENG 103 Assignment 2 Summary
No ratings yet
ENG 103 Assignment 2 Summary
5 pages
Transmission Diagnostics 6T30-40
No ratings yet
Transmission Diagnostics 6T30-40
276 pages
Dranetz HDPQ Plus Brochure US v10
No ratings yet
Dranetz HDPQ Plus Brochure US v10
6 pages
RS232
No ratings yet
RS232
4 pages
HSV5 TB
No ratings yet
HSV5 TB
15 pages
27.2.16 Lab - Investigating An Attack On A Windows Host
No ratings yet
27.2.16 Lab - Investigating An Attack On A Windows Host
8 pages
FOX 615 Teleprotection
No ratings yet
FOX 615 Teleprotection
8 pages
La Gard Combogard Pro 39e Electronic Lock Software Installation Instructions 730 018 Rev D Web PDF
No ratings yet
La Gard Combogard Pro 39e Electronic Lock Software Installation Instructions 730 018 Rev D Web PDF
12 pages
IC Electronic English Catalogue 2010
No ratings yet
IC Electronic English Catalogue 2010
48 pages
JBL Live 650btnc Manual
No ratings yet
JBL Live 650btnc Manual
11 pages
PAT DS150H 2 (Consolas Service Manual)
No ratings yet
PAT DS150H 2 (Consolas Service Manual)
13 pages
Soumen Dikpati C.V
No ratings yet
Soumen Dikpati C.V
2 pages
Manual de Usuario Volkswagen Sharan (1996) (513 Páginas)
No ratings yet
Manual de Usuario Volkswagen Sharan (1996) (513 Páginas)
3 pages
Parkinson Disease Detection
No ratings yet
Parkinson Disease Detection
5 pages
Influence of New Media in Achieving Communication Efficiency Mass Media in Nigeria
No ratings yet
Influence of New Media in Achieving Communication Efficiency Mass Media in Nigeria
60 pages
Brand Fashion Project 2
No ratings yet
Brand Fashion Project 2
16 pages
FINAS
No ratings yet
FINAS
24 pages
Valtek Beta Positioners: For Control Valves
No ratings yet
Valtek Beta Positioners: For Control Valves
8 pages
Angular 8 Tutorial & Crash Course
No ratings yet
Angular 8 Tutorial & Crash Course
29 pages
CPU Scheduling
No ratings yet
CPU Scheduling
63 pages
GS-26 English
No ratings yet
GS-26 English
20 pages
Manual RCM 12
No ratings yet
Manual RCM 12
24 pages
Esp8266 Commands
No ratings yet
Esp8266 Commands
12 pages
VLT5000 5000flux 6000 8000 Profibus DP V1 MG90G102
No ratings yet
VLT5000 5000flux 6000 8000 Profibus DP V1 MG90G102
63 pages
C-Language Syllabus
No ratings yet
C-Language Syllabus
3 pages
CH 32 Security in The Internet IPSec SSLTLS PGP VPN and Firewalls Multiple Choice Questions and Answers PDF
No ratings yet
CH 32 Security in The Internet IPSec SSLTLS PGP VPN and Firewalls Multiple Choice Questions and Answers PDF
9 pages
Operations and Service Manual 69NT40-561-300 To 399: Container Refrigeration
100% (1)
Operations and Service Manual 69NT40-561-300 To 399: Container Refrigeration
154 pages
Microsoft Azure Ai Fundamentals Certification Companion Guide To Prepare For The Ai900 Exam 1st Edition Krunal S Trivedi Download
No ratings yet
Microsoft Azure Ai Fundamentals Certification Companion Guide To Prepare For The Ai900 Exam 1st Edition Krunal S Trivedi Download
82 pages
CV Abdul Saboor Khan
No ratings yet
CV Abdul Saboor Khan
2 pages