0% found this document useful (0 votes)

44 views8 pages

Tutorial RNA-Seq Analysis Part 1

This document is the first part of a tutorial on RNA-Seq analysis. It introduces RNA-Seq and guides the user through importing sample data, running an initial analysis to map reads to genes and examine expression values, and interpreting the results by looking at coverage across exons and evidence for different isoforms. The tutorial data is from mouse tissue samples and focuses on illustrating how parameter choices affect the analysis and interpretation.

Uploaded by

Lee LI PIN

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

44 views8 pages

Tutorial RNA-Seq Analysis Part 1

Uploaded by

Lee LI PIN

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

Tutorial

Tutorial: RNA-Seq analysis part I: Getting started

June 21, 2011

CLC bio
Finlandsgade 10-12 8200 Aarhus N Denmark
Telephone: +45 70 22 55 09 Fax: +45 70 22 55 19
www.clcbio.com [email protected]
Tutorial: RNA-Seq analysis part I: Getting started

Tutorial: RNA-Seq analysis part I: Getting started

This tutorial is the first part of a series of tutorials about RNA-Seq. The aim of the tutorials is
to take you from start to end of an RNA-Seq analysis including mapping of reads, interpreting
results, checking quality and finally doing statistical analysis. Along the way, we will focus on
illustrating the effect of the parameters and choices made during the analysis.
The data used is from a study reported in [Mortazavi et al., 2008]. The data set consists of
RNA-Seq data from three types of Mouse tissue: Brain, Liver and Skeletal muscle. Each of the
tissues has been sampled twice, so there are 6 samples all in all.
Tutorial

Downloading and importing the data

At http://www.clcbio.com/ngsexampledata you find the following data:

Subset of the full data set This file can be imported using the standard import and includes a
subset of the full data set including a region of chromosome 16 for use as a reference.
When running the full data set, we extracted all the reads that matched the genes of this
part of chromosome 16. Download and import this data set (using the normal import) for
use in these tutorials.

Experiments with the full data set Later on, we will work on experiments generated from the full
data set. Download and import this data set (using the normal import) for use in these
tutorials.

Once downloaded and imported, you should have the following folders and data in the Navigation
Area (see figure 1).

Figure 1: The subset of the full data set has been imported together with the experiments generated
from the full data set.

Running the RNA-Seq analysis

Now, you can start the actual analysis. The first step is to transform the list of reads into what
we call an RNA-Seq sample. This is basically a list of genes with expression values. To do this,
go to:

P. 2
Tutorial: RNA-Seq analysis part I: Getting started

Toolbox | High-throughput Sequencing ( ) | RNA-Seq Analysis ( )

This opens a dialog where you select the sequencing reads from the Brain spike sample, as
shown in figure 2.
Tutorial

Figure 2: Selecting the Brain spikes sample for RNA-Seq analysis.

Click Next when the data is listed in the right-hand side of the dialog.
You are now presented with the dialog shown in figure 3.

Figure 3: Choosing the annotated reference sequence.

Since we are using (part of) the ref-seq annotated mouse genome, choose Use reference with
annotations. Click ( ) to select the reference sequence NC_000082 subset.
Click Next where you can set parameters for the mapping. Leave these settings at their default
- we will focus on these later on. (You can set the parameters to default by clicking the button
( ) at the bottom of the dialog, but then you will have to define the reference sequence again).

P. 3
Tutorial: RNA-Seq analysis part I: Getting started

Clicking Next will show the dialog in figure 4.

Tutorial

Figure 4: Exon discovery.

The choice between Prokaryote and Eukaryote is basically a matter of telling the Workbench
whether you have introns in your reference. In order to select Eukaryote, you need to have
reference sequences with annotations of the type mRNA (this is the way the Workbench expects
exons to be defined). The reference sequence provided with this tutorial includes mRNA
annotations (they are the green annotations), so you select Eukaryote in this wizard.
Below you can specify settings for discovering novel exons. We will investigate this in detail later
on.
Clicking Next will allow you to specify the output options as shown in figure 5.

Figure 5: Selecting the output of the RNA-Seq analysis.

P. 4
Tutorial: RNA-Seq analysis part I: Getting started

Uncheck the Create list of un-mapped reads, Create report and Make log and click Finish.
The standard output is a table showing mapping statistics on each gene.

Interpreting the brain spikes analysis result

The result of the RNA-Seq analysis is shown in figure 6.
Tutorial

Figure 6: A table with expression values for all genes.

The Expression values column is per default based on the RPKM. Change the measure to use
Total exon reads instead by clicking at the bottom of the view (we will go into more details with
expression measures in part II). Now sort the table on the new expression value by clicking the
column header twice. Find the Ahsg gene (4th from the top of the list) and double-click.
When the result is open, you need to do a few customizations to make the view better suited
for interpretation. In the Side Panel, under Text format, set the font size to small or tiny. To
save these customizations so that they take effect next time you open a mapping, click the
Save/Restore Settings button ( ) at the top of the Side Panel and click Save Settings. Give
your settings a name and make sure the check box to Always apply these settings is checked.
Double-click the tab of the view (or press Ctrl + M) to Maximize the view and click Fit Width ( )
in the tool bar to zoom out to see the full gene. You should now have a view similar to figure 7.

You can now see distinct peaks of coverage below the exons which are marked in green. Scroll
slowly down on the scroll bar at the right hand side of the view. You will begin to see reads that
have been mapped across exon-exon boundaries.
Click Zoom in ( ) and click-and-drag a rectangle around one of the exons. In this way you can
zoom in to see more details of a particular exon. If you zoom all the way in, you will be able to
see the nucleotide level and the alignment of the reads.

P. 5
Tutorial: RNA-Seq analysis part I: Getting started
Tutorial

Figure 7: The reads mapped to the Ahsg gene.

Close the view and go back to the RNA-seq sample. In the 'Transcripts' column you can see that
the Ahsg gene only has one transcript annotated. Use the Advanced filter ( ) at the upper
right hand part of the RNA-seq sample table view) to identify genes with more than one transcript
annotated (set the filter to Transcripts > 1 and press Apply as shown in figure 8).

Figure 8: Using the advanced filter to only show genes with more than one annotated transcript.

The Fetub gene has three transcripts annotated. Open the mapping for this gene and press Fit
width ( ) to zoom out completely and get an overview of the mapping to this gene.
One of the three transcripts annotated for Fetub uses a different first exon from the other two
transcripts. There is no coverage in this exon at all, and thus no evidence for expression of the
alternative first exon isoform. The other two transcripts have the same first exon but one skips

P. 6
Tutorial: RNA-Seq analysis part I: Getting started

the second exon of the other. You can see both reads that span from exon 2 to exon 3 and reads
that span from exon 2 to exon 4. Thus, there is evidence for both of these splice variants (see
figure 9).
Tutorial

Figure 9: Reads showing evidence for expression of two isoforms.

Close the view and you are ready for part II: Non-specific matches and expression values.

P. 7
Bibliography
Tutorial

[Mortazavi et al., 2008] Mortazavi, A., Williams, B. A., McCue, K., Schaeffer, L., and Wold,
B. (2008). Mapping and quantifying mammalian transcriptomes by rna-seq. Nat Methods,
5(7):621--628.

12th STD Bio-Zoology EM Book Back One Marks
50% (2)
12th STD Bio-Zoology EM Book Back One Marks
17 pages
Rnaseq by Example
No ratings yet
Rnaseq by Example
163 pages
Jupeb 2019 Biology Syllabus
No ratings yet
Jupeb 2019 Biology Syllabus
20 pages
Genetic Engineering
No ratings yet
Genetic Engineering
17 pages
Combined
No ratings yet
Combined
417 pages
RNA Seq Data Analysis
No ratings yet
RNA Seq Data Analysis
90 pages
RNA-Seq Workflow: Gene-Level Exploratory Analysis and Differential Expression
No ratings yet
RNA-Seq Workflow: Gene-Level Exploratory Analysis and Differential Expression
42 pages
Intro To RNA-seq Concepts
No ratings yet
Intro To RNA-seq Concepts
85 pages
RNA Sequencing (RNA-seq) - Comprehensive Notes
No ratings yet
RNA Sequencing (RNA-seq) - Comprehensive Notes
5 pages
The RNA World 11th Lect High-Throughput Methods GH AY16 2017
No ratings yet
The RNA World 11th Lect High-Throughput Methods GH AY16 2017
59 pages
3 RNAseq Background
No ratings yet
3 RNAseq Background
42 pages
RNA Seq R - Final Decode
No ratings yet
RNA Seq R - Final Decode
76 pages
2023-GenomicaFuncional y Biocomputacion-Day1
No ratings yet
2023-GenomicaFuncional y Biocomputacion-Day1
92 pages
A Guide To Basic RNA Sequencing Data
No ratings yet
A Guide To Basic RNA Sequencing Data
30 pages
Analysis of RNA-Seq Data
No ratings yet
Analysis of RNA-Seq Data
71 pages
Intro 2 RNAseq
No ratings yet
Intro 2 RNAseq
98 pages
Biopython Org DIST Docs Tutorial Tutorial HTML
No ratings yet
Biopython Org DIST Docs Tutorial Tutorial HTML
267 pages
Gene Expression RNA Sequence
No ratings yet
Gene Expression RNA Sequence
120 pages
Nazarov QC-Statistics
No ratings yet
Nazarov QC-Statistics
50 pages
IBB - MB.501 RNA-seq + Introduction To Galaxy
No ratings yet
IBB - MB.501 RNA-seq + Introduction To Galaxy
34 pages
Lecture4 Expression - Analysis 2019
No ratings yet
Lecture4 Expression - Analysis 2019
79 pages
Factors Affecting Growth and Development
100% (1)
Factors Affecting Growth and Development
9 pages
Module 7 8 Lecture Slides
No ratings yet
Module 7 8 Lecture Slides
59 pages
Mendel Gregor
No ratings yet
Mendel Gregor
5 pages
Statquest Gentle Introduction To Rna Seq
100% (1)
Statquest Gentle Introduction To Rna Seq
188 pages
The Basics of Biology
No ratings yet
The Basics of Biology
127 pages
RNASeq Command Line 25march2021 0
No ratings yet
RNASeq Command Line 25march2021 0
33 pages
Nihms 977214
No ratings yet
Nihms 977214
21 pages
Survey RNA-Seq Data Analysis (2016)
No ratings yet
Survey RNA-Seq Data Analysis (2016)
19 pages
Rnaseq Workshop Slides
No ratings yet
Rnaseq Workshop Slides
110 pages
RNA Seq - Applications and Best Practices
No ratings yet
RNA Seq - Applications and Best Practices
34 pages
Module8 RNASeq Pathogen Practical Manual
No ratings yet
Module8 RNASeq Pathogen Practical Manual
23 pages
Impact of Gene Annotation On RNA-seq Data Analysis Shanrong Zhao and Baohong Zhang
No ratings yet
Impact of Gene Annotation On RNA-seq Data Analysis Shanrong Zhao and Baohong Zhang
23 pages
CLC Genomics Workbench User Manual Subset
No ratings yet
CLC Genomics Workbench User Manual Subset
222 pages
Introduction To Differential Gene Expression Analysis Using RNA-seq
No ratings yet
Introduction To Differential Gene Expression Analysis Using RNA-seq
97 pages
RNA Seq Tutorial
0% (1)
RNA Seq Tutorial
139 pages
BGi RNA-Seq Analysis
No ratings yet
BGi RNA-Seq Analysis
19 pages
Advanced RNASeq With Upload To IPA
No ratings yet
Advanced RNASeq With Upload To IPA
18 pages
RNA-Seq and Transcriptome Analysis: Jessica Holmes
No ratings yet
RNA-Seq and Transcriptome Analysis: Jessica Holmes
98 pages
Biology Project - DNA Finger Printing
No ratings yet
Biology Project - DNA Finger Printing
12 pages
RNA-seq With NOISeq R-Bioc Package
No ratings yet
RNA-seq With NOISeq R-Bioc Package
15 pages
Kurukshetra University Kurukshetra: Galaxy Global Group of Institutions, Ambala
No ratings yet
Kurukshetra University Kurukshetra: Galaxy Global Group of Institutions, Ambala
54 pages
From RNA-seq Reads To Gene Expression
No ratings yet
From RNA-seq Reads To Gene Expression
27 pages
Tutorial: Expression Analysis Using RNA-Seq
No ratings yet
Tutorial: Expression Analysis Using RNA-Seq
19 pages
Genetics Activities 1 240520 175744
No ratings yet
Genetics Activities 1 240520 175744
78 pages
Beginner's Guide To Using The DESeq2 Package
No ratings yet
Beginner's Guide To Using The DESeq2 Package
32 pages
Unit 2 BI
No ratings yet
Unit 2 BI
10 pages
7th Grade Science Curriculum Map Final Draft
No ratings yet
7th Grade Science Curriculum Map Final Draft
5 pages
Analysis of SARS-CoV-2
No ratings yet
Analysis of SARS-CoV-2
11 pages
NPG Nature Vol 405 Issue 6790 Jun
No ratings yet
NPG Nature Vol 405 Issue 6790 Jun
95 pages
Curso Rnaseq Saebb Utfpr
No ratings yet
Curso Rnaseq Saebb Utfpr
18 pages
RNA-Seq Module 1
No ratings yet
RNA-Seq Module 1
54 pages
A Tutorial: Genome - Based RNA - Seq Analysis Using The TUXEDO Package (Updated: 2014 - 10 - 21)
No ratings yet
A Tutorial: Genome - Based RNA - Seq Analysis Using The TUXEDO Package (Updated: 2014 - 10 - 21)
17 pages
BN335 L6 Transcriptomics JH
No ratings yet
BN335 L6 Transcriptomics JH
9 pages
Day1 Laros RNASeq Galaxy 2012
No ratings yet
Day1 Laros RNASeq Galaxy 2012
40 pages
Tutorial: Molecular Biology Basics
No ratings yet
Tutorial: Molecular Biology Basics
21 pages
Measuring Transcriptomes With RNA-Seq
No ratings yet
Measuring Transcriptomes With RNA-Seq
48 pages
Tutorial RNA-Seq Analysis Part 2
No ratings yet
Tutorial RNA-Seq Analysis Part 2
9 pages
Yogesh Pradhan Final Synopsis
No ratings yet
Yogesh Pradhan Final Synopsis
18 pages
Mathano Dukhavo
No ratings yet
Mathano Dukhavo
105 pages
Blank en Berg Pittsburgh 2011 Ngs
No ratings yet
Blank en Berg Pittsburgh 2011 Ngs
59 pages
Intro To Using Galaxy - For Bioinformatics: Carrie Ganote
No ratings yet
Intro To Using Galaxy - For Bioinformatics: Carrie Ganote
26 pages
Tutorial RNA-Seq Analysis Part 3
No ratings yet
Tutorial RNA-Seq Analysis Part 3
4 pages
Alignment
No ratings yet
Alignment
3 pages
RNA-Seq Analysis Course
No ratings yet
RNA-Seq Analysis Course
40 pages
A
No ratings yet
A
11 pages
Transcriptome Software Paper
No ratings yet
Transcriptome Software Paper
7 pages
The Bench Scientist's Guide To Statistical Analysis of RNA-Seq Data
No ratings yet
The Bench Scientist's Guide To Statistical Analysis of RNA-Seq Data
10 pages
Biol 321
100% (1)
Biol 321
7 pages
Problems of Identity and Individuality
No ratings yet
Problems of Identity and Individuality
1 page
Lab 8 Homepage
No ratings yet
Lab 8 Homepage
4 pages
Telomeres, Lifestyle, Cancer, and Aging: Masood A. Shammas
No ratings yet
Telomeres, Lifestyle, Cancer, and Aging: Masood A. Shammas
7 pages
Hidden Markov Model (HMM) Architecture
No ratings yet
Hidden Markov Model (HMM) Architecture
15 pages
Ec 94
No ratings yet
Ec 94
2 pages
08-Genes in Populations
No ratings yet
08-Genes in Populations
5 pages
Bio Psych Chapter 2 - Evolution
No ratings yet
Bio Psych Chapter 2 - Evolution
6 pages
Inspire Grade6 U2
No ratings yet
Inspire Grade6 U2
13 pages
Interpreting DNA SequenceREV
No ratings yet
Interpreting DNA SequenceREV
12 pages
s11250 023 03688 Z - Laurence
No ratings yet
s11250 023 03688 Z - Laurence
13 pages
Journal of Virology-2018-Zou-e01881-17.full
No ratings yet
Journal of Virology-2018-Zou-e01881-17.full
16 pages
Lesson Notes: Biology 4G Chapter 5: Cell Division
No ratings yet
Lesson Notes: Biology 4G Chapter 5: Cell Division
26 pages
M SC Zoology III Sem Paper II DR S K Thakur
No ratings yet
M SC Zoology III Sem Paper II DR S K Thakur
6 pages
13 Test Bank - Wheatley Biology Chapter 13 Test
No ratings yet
13 Test Bank - Wheatley Biology Chapter 13 Test
15 pages
04 Application of Genomic Tools - One Technology Takes It All
No ratings yet
04 Application of Genomic Tools - One Technology Takes It All
14 pages
Molecular Biology: The Central Dogma: Patricia J Pukkila
No ratings yet
Molecular Biology: The Central Dogma: Patricia J Pukkila
5 pages
Gant, Isaac - Copy of Hayflick Limit Individual Student
No ratings yet
Gant, Isaac - Copy of Hayflick Limit Individual Student
4 pages
Mendelian Inheritance
No ratings yet
Mendelian Inheritance
3 pages
DNA Replication and Repair
No ratings yet
DNA Replication and Repair
4 pages
Lessons in Bioinformatics - Dot Plots: Lessons in Bioinformatics, #1
From Everand
Lessons in Bioinformatics - Dot Plots: Lessons in Bioinformatics, #1
Björn Olsson
No ratings yet
Introduction to Bioinformatics Using Action Labs
From Everand
Introduction to Bioinformatics Using Action Labs
Jean-Louis Lassez
5/5 (1)