Tba L09
Application
(CSE-896)
Spring 2025
Lecture 09
Middle Integration:
• The most commonly used integration strategy
Major Challenges in Integration:
• Some of the major challenges that must be tackled for reliable
integration of omics datasets include:
1. Data heterogeneity
3. High dimensionality
4. Computational performance
ML Methods for Multiomic Integration:
• mixOmics:
mixOmics is an R package for exploring and integrating omics data, including transcriptomics,
proteomics, lipidomics, microbiome, metagenomics and beyond. The mixOmics package includes tools
for data integration, biomarker discovery, and data visualization.
Setting up R/RStudio:
ML Methods for Multiomic Integration:
• Reproducibility means …
• If two groups analyzed the same data, would they reach the same conclusions?
ML Methods for Multiomic Integration:
• Ensuring reproducibility … Data Organization
Adapted from Goldman, June 2020 and Goldman and Obrycki, December 2020
Setting up R/RStudio:
https://phdcomics.com/comics.php?f=1689
ML Methods for Multiomic Integration:
• Ensuring reproducibility … How to?
• Document everything
• What were the exact commands that you ran throughout the analysis?
• Containerization, e.g. with Docker
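One way to act on "document everything" from inside R is to save the software environment next to the results. The sketch below uses only base R. The seed value, the `logs` directory, and the `session_info.txt` file name are assumptions made for illustration, not part of the lecture.

```r
# Fix the random seed so that stochastic steps give the same result on every run.
set.seed(2025)  # arbitrary value chosen for this example

# Hypothetical log location; tempdir() is used so the example runs anywhere.
log_dir <- file.path(tempdir(), "logs")
dir.create(log_dir, recursive = TRUE, showWarnings = FALSE)

# sessionInfo() reports the R version, platform, and attached package versions.
session_file <- file.path(log_dir, "session_info.txt")
writeLines(capture.output(sessionInfo()), session_file)

# Append a timestamp so the record can be matched to a specific analysis run.
cat("Analysis run on:", format(Sys.time(), "%Y-%m-%d %H:%M:%S"), "\n",
    file = session_file, append = TRUE)
```

Keeping the commands themselves in a saved script (rather than typing them into the console) covers the "exact commands" question, while the session record covers the package versions they ran against.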
Basic R and RStudio:
• Let’s create our first project directory using RStudio.
• Open RStudio.
• Go to the File menu and select New Project.
• In the New Project window, choose New Directory, then choose New Project. Name your new directory Intro-to-R and then “Create the project as subdirectory of:” the Desktop (or a location of your choice).
• Click Create Project.
Basic R and RStudio:
• Go to the File menu, select New File, and then select R Script.
• Go to the File menu and select Save As..., type Intro-to-R.R as the file name, and select Save.
Basic R and RStudio:
1. Code Editor
2. Console
3. History/Environment
4. Plots/Help/Packages
Basic R and RStudio:
• An RStudio project is a directory that contains everything related to your analyses for a specific project.
Basic R and RStudio:
• When a project is (re)opened within RStudio, the following actions are taken:
• A new R session (process) is started.
• The .RData file in the project’s main directory is loaded, populating the environment with any objects that were present when the project was closed.
• The .Rhistory file in the project’s main directory is loaded into the RStudio History pane (and used for Console Up/Down arrow command history).
• Other RStudio settings (e.g. active tabs, splitter positions, etc.) are restored to where they were the last time the project was closed.
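The .RData step can be imitated by hand with base R's `save()` and `load()`, which may help make "populating the environment" concrete. This is only a simulation of what RStudio does automatically. The temporary directory and the `gene_counts` object with its values are made up for the example.

```r
# A temporary directory stands in for the project's main directory.
project_dir <- file.path(tempdir(), "Intro-to-R")
dir.create(project_dir, showWarnings = FALSE)
rdata_file <- file.path(project_dir, ".RData")

# "Closing" the project: write an object to .RData.
gene_counts <- c(geneA = 10, geneB = 25)  # made-up example values
save(gene_counts, file = rdata_file)

# Simulate a fresh R session by removing the object from the environment.
rm(gene_counts)

# "Reopening" the project: load() restores the saved objects and
# returns the names of the objects it created.
restored <- load(rdata_file)
print(restored)
print(gene_counts)
```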
Basic R and RStudio:
• Setting Up Working Directory:
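From the console, the working directory can be inspected and changed with the base R functions `getwd()` and `setwd()`. The following is a minimal sketch. The `Intro-to-R` directory under `tempdir()` is a stand-in chosen so the example runs anywhere; in practice the path would point at your own project directory.

```r
# Remember where we started so the change can be undone at the end.
original_wd <- getwd()
print(original_wd)

# Hypothetical project directory used only for this example.
project_dir <- file.path(tempdir(), "Intro-to-R")
dir.create(project_dir, showWarnings = FALSE)

# Change the working directory and confirm the change.
setwd(project_dir)
print(getwd())

# Relative paths are now resolved against the project directory.
print(file.exists("Intro-to-R.R"))

# Restore the original working directory.
setwd(original_wd)
```

Opening an RStudio project sets the working directory to the project's main directory, so explicit `setwd()` calls are often unnecessary inside a project.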
Basic R and RStudio:
• Structuring your working directory: for instance, keep separate directories for raw and processed data.
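A layout like this can be created with a few lines of base R. In the sketch below, only the split between raw and processed data comes from the slide. The `scripts` and `results` directories, and the use of `tempdir()` as the parent location, are assumptions added for illustration.

```r
# Hypothetical project root; replace with your own project directory.
project_dir <- file.path(tempdir(), "Intro-to-R")

# Raw and processed data are kept apart so raw files are never overwritten.
# "scripts" and "results" are illustrative additions, not from the lecture.
sub_dirs <- c(
  file.path("data", "raw"),
  file.path("data", "processed"),
  "scripts",
  "results"
)

# recursive = TRUE also creates missing parent directories such as "data".
for (d in sub_dirs) {
  dir.create(file.path(project_dir, d), recursive = TRUE, showWarnings = FALSE)
}

# List the resulting directory tree relative to the project root.
print(list.dirs(project_dir, full.names = FALSE, recursive = TRUE))
```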