0% found this document useful (0 votes)

9 views42 pages

3 RNAseq Background

The document provides an overview of RNA sequencing (RNA-seq), detailing its methodology, including RNA isolation, cDNA conversion, sequencing, and downstream analysis. It discusses the significance of RNA-seq in functional studies, challenges faced during the process, and different mapping strategies for read alignment. Additionally, it introduces key metrics like RPKM and TPM for quantifying gene expression levels.

Uploaded by

johngeralt

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views42 pages

3 RNAseq Background

Uploaded by

johngeralt

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 42

Computational Biology Associate professor: Tingwen Chen ( 陳亭妏 )

Lab Office: Room 420 bioICT building

Email: [email protected]
What is RNASeq
General analysis flowchart
RPKM, TPM
DEGs
Functional analysis
scRNA Seq
Demo data we will use
Several slides are adapted from 2013 Canadian bioinformatics workshops
4
Gene
expression

5
RNA sequencing
Isolate RNAs Generate cDNA, fragment,
Samples of interest size select, add linkers

Condition 1 Condition 2
(normal colon) (colon tumor) Sequence ends

Map to genome,
transcriptome, and
predicted exon
junctions

100s of millions of paired reads

10s of billions bases of sequence
Downstream analysis

6
Why sequence RNA (versus DNA)?
• Functional studies
• Genome may be constant but an experimental condition has a
pronounced effect on gene expression
• e.g. Drug treated vs. untreated cell line
• e.g. Wild type versus knock out mice

• Some molecular features can only be observed at the

RNA level
• Alternative isoforms, fusion transcripts, RNA editing
• Predicting transcript sequence from genome
sequence is difficult
• Alternative splicing, RNA editing, etc.

7
Why sequence RNA (versus DNA)?
• Interpreting mutations that do not have an obvious effect on protein
sequence
• ‘Regulatory’ mutations that affect what mRNA isoform is expressed and how much
• e.g. splice sites, promoters, exonic/intronic splicing motifs, etc.

• Prioritizing protein coding somatic mutations (often heterozygous)

• If the gene is not expressed, a mutation in that gene would be less interesting
• If the gene is expressed but only from the wild type allele, this might suggest loss-of-function
(haploinsufficiency)
• If the mutant allele itself is expressed, this might suggest a candidate drug target

8
Introduction to RNA-seq
https://www.youtube.com/watch?v=tlf6wYJrwKY&t=414s
main steps in RNA-seq

1. RNA is isolated from a sample,

2. RNA is converted to cDNA fragments via reverse-transcription and
fragmentation,
3. a high-throughput sequencer is used to generate millions of reads
from the cDNA fragments,
4. …

10
Challenges
• Sample
• Purity?, quantity?, quality?
• RNAs consist of small exons that may be separated by large introns
• Mapping reads to genome is challenging
• The relative abundance of RNAs vary wildly
• 105 – 107 orders of magnitude
• Since RNA sequencing works by random sampling, a small fraction of highly expressed genes may
consume the majority of reads
• Ribosomal and mitochondrial genes
• RNAs come in a wide range of sizes
• Small RNAs must be captured separately
• PolyA selection of large RNAs may result in 3’ end bias
• RNA is fragile compared to DNA (easily degraded)
11
Replicates
• Technical Replicate
• Multiple instances of
sequence generation
• Flow Cells, Lanes, Indexes
• Biological Replicate
• Multiple isolations of cells
showing the same
phenotype, stage or other
experimental condition
• Some example
concerns/challenges:
• Environmental Factors,
Growth Conditions, Time
• Correlation Coefficient 0.92-
0.98

12
main steps in RNA-seq

1. RNA is isolated from a sample,

14
15
Which read aligner should I use?
https://www.ebi.ac.uk/~nf/hts_mappers/

16
Features comparison of aligners
https://www.ebi.ac.uk/~nf/hts_mappers/
18
19
https://cole-trapnell-lab.github.io/team/cole-trapnell/
Spliced mappers
• Exon-first • Seed-and-extend
• Exon-first methods map reads first to the • Seed-and-extend methods generally start by mapping part of
genome using an unspliced approach to the reads as kmers or substrings; candidate matches are then
find read-clusters; unmapped reads are extended using different algorithms and potential splice-sites
then used to find connections between are located.
these read-clusters.
• Include:
• Include: • MapNext (Bao et al. 2009),
• TopHat (Trapnell et al. 2009), • PALMapper (Jean et al. 2010),
• MapSplice (Wang et al. 2010a), • SplitSeek (Ameur et al. 2010),
• SpliceMap (Au et al. 2010), • GSNAP (Wu et al. 2010),
• HMMsplicer (Dimon et al. 2010), • Supersplat (Bryant et al. 2010),
• SOAPsplice (Huang et al. 2011), • SeqSaw (Wang et al. 2011),
• PASSion (Zhang et al. 2012), • and STAR (Dobin et al. 2012).
• TrueSight (Li et al. 2012b),
• and GEM (Marco-Sola et al. 2012).

21
22
main steps in RNA-seq

1. RNA is isolated from a sample,

24
25
Expectation–maximization algorithm
https://en.wikipedia.org/wiki/Expectation%E2%80%93maximization_algorithm
Expectation-maximization
https://www.youtube.com/watch?v=REypj2sy_5U
Expectation-maximization
Aim: Gaussian means and variances
(prior: uniform)

https://www.youtube.com/watch?v=iQoXFmbXRJA
EM algorithm
30
main steps in RNA-seq

1. RNA is isolated from a sample,

2. RNA is converted to cDNA fragments via reverse-transcription and
fragmentation,
3. a high-throughput sequencer is used to generate millions of reads
from the cDNA fragments,
4. reads are mapped to a reference genome or transcript set with an
alignment tool
5. transcriptome reconstruction
6. and counts of reads mapped to each gene are used to estimate
expression levels.
31
http://yourgene.pixnet.net/blog/post/99023045-%E8%BD%89%E9%8C%84%E9%AB%94%E9%87%8D%E5%BB%BA%E8%88%87%E5%9F%BA%E5%9B%A0%E9%AB%94%E5%BA%8F%E5%88%97%E5%B7%B2%E7
%9F%A5%E7%89%A9%E7%A8%AE%E7%9A%84rna%E5%AE%9A%E5%BA%8F
33
main steps in RNA-seq

1. RNA is isolated from a sample,

Mortazavi, A. et.al. (2008). Mapping and quantifying mammalian transcriptomes by rna-seq. Nat Methods,
5(7):621--628.

35
RPKM example
million million

Total exon reads=18 Total exon reads=2

Mapped reads=18+2=20 million Mapped reads=18+2=20 million
Exon length=9 KB Exon length=1 KB
RPKM=18/(20*9)=0.1 RPKM=2/(20*1)=0.1

http://yourgene.pixnet.net/blog/post/69572975-rpkm-%E7%B0%A1%E4%BB%8B
36
TPM
https://www.youtube.com/watch?v=TTUrtCY2k-w&t=3s
Expression profile
PCA
https://www.youtube.com/watch?v=HMOI_lkzW08

https://www.youtube.com/watch?v=FgakZw6K1QQ
MDS & PCoA

https://www.youtube.com/watch?v=GEn-_dAyYME
Homework
• What’s TPM?
• What’s FPKM?
• What’s the difference between TPM and FPKM?
Any questions?

Rnaseq by Example
No ratings yet
Rnaseq by Example
163 pages
Rna Seq Dissertation
100% (1)
Rna Seq Dissertation
6 pages
Rna Sequencing Methods Review Web
No ratings yet
Rna Sequencing Methods Review Web
122 pages
Lecture4 Expression - Analysis 2019
No ratings yet
Lecture4 Expression - Analysis 2019
79 pages
3 Rna-Seq
No ratings yet
3 Rna-Seq
59 pages
Combined
No ratings yet
Combined
417 pages
Measuring Transcriptomes With RNA-Seq
No ratings yet
Measuring Transcriptomes With RNA-Seq
48 pages
2023-GenomicaFuncional y Biocomputacion-Day1
No ratings yet
2023-GenomicaFuncional y Biocomputacion-Day1
92 pages
RNA Seq - Applications and Best Practices
No ratings yet
RNA Seq - Applications and Best Practices
34 pages
Curso Rnaseq Saebb Utfpr
No ratings yet
Curso Rnaseq Saebb Utfpr
18 pages
A Tutorial: Genome - Based RNA - Seq Analysis Using The TUXEDO Package (Updated: 2014 - 10 - 21)
No ratings yet
A Tutorial: Genome - Based RNA - Seq Analysis Using The TUXEDO Package (Updated: 2014 - 10 - 21)
17 pages
The RNA World 11th Lect High-Throughput Methods GH AY16 2017
No ratings yet
The RNA World 11th Lect High-Throughput Methods GH AY16 2017
59 pages
RNA Seq Data Analysis
No ratings yet
RNA Seq Data Analysis
90 pages
Artigo Bioinformática
No ratings yet
Artigo Bioinformática
19 pages
BGi RNA-Seq Analysis
No ratings yet
BGi RNA-Seq Analysis
19 pages
Large-Scale Analysis of Gene Expression
No ratings yet
Large-Scale Analysis of Gene Expression
27 pages
RNA Seq R - Final Decode
No ratings yet
RNA Seq R - Final Decode
76 pages
Count-Based Differential Expression Analysis of RNA Sequencing Data Using R and Bioconductor
No ratings yet
Count-Based Differential Expression Analysis of RNA Sequencing Data Using R and Bioconductor
22 pages
Perspectives: Rna-Seq: A Revolutionary Tool For Transcriptomics
No ratings yet
Perspectives: Rna-Seq: A Revolutionary Tool For Transcriptomics
7 pages
Analysis of RNA-Seq Data
No ratings yet
Analysis of RNA-Seq Data
71 pages
Tutorial RNA-Seq Analysis Part 1
No ratings yet
Tutorial RNA-Seq Analysis Part 1
8 pages
HISAT, StringTie and Ballgown
No ratings yet
HISAT, StringTie and Ballgown
18 pages
Statquest Gentle Introduction To Rna Seq
100% (1)
Statquest Gentle Introduction To Rna Seq
188 pages
ExSeq Presentation With Background
No ratings yet
ExSeq Presentation With Background
40 pages
BN335 L6 Transcriptomics JH
No ratings yet
BN335 L6 Transcriptomics JH
9 pages
RNA Sequencing: An Introduction To Efficient Planning and Execution of RNA Sequencing (RNA-Seq) Experiments
No ratings yet
RNA Sequencing: An Introduction To Efficient Planning and Execution of RNA Sequencing (RNA-Seq) Experiments
6 pages
RNA-seq With NOISeq R-Bioc Package
No ratings yet
RNA-seq With NOISeq R-Bioc Package
15 pages
Nihms 977214
No ratings yet
Nihms 977214
21 pages
Trapnell 2009
No ratings yet
Trapnell 2009
7 pages
Module8 RNASeq Pathogen Practical Manual
No ratings yet
Module8 RNASeq Pathogen Practical Manual
23 pages
Systematic Comparison and Assessment of RNA Seq Procedures For Gene Expression Quantitative Analysis
No ratings yet
Systematic Comparison and Assessment of RNA Seq Procedures For Gene Expression Quantitative Analysis
15 pages
RNA-Seq Analysis Course
No ratings yet
RNA-Seq Analysis Course
40 pages
Intro To RNA-seq Concepts
No ratings yet
Intro To RNA-seq Concepts
85 pages
Assays For Mutation Rate
No ratings yet
Assays For Mutation Rate
8 pages
Survey RNA-Seq Data Analysis (2016)
No ratings yet
Survey RNA-Seq Data Analysis (2016)
19 pages
RNA Sequencing Process and Applications-F19960606001
No ratings yet
RNA Sequencing Process and Applications-F19960606001
7 pages
Concepts of Transcriptomics - 20-8-2024
No ratings yet
Concepts of Transcriptomics - 20-8-2024
6 pages
RNA-Seq and Transcriptome Analysis: Jessica Holmes
No ratings yet
RNA-Seq and Transcriptome Analysis: Jessica Holmes
98 pages
Complete Bulk RNA Sequencing Presentation
No ratings yet
Complete Bulk RNA Sequencing Presentation
10 pages
Highly Parallel Direct RNA Sequencing On An Array of Nanopores
No ratings yet
Highly Parallel Direct RNA Sequencing On An Array of Nanopores
21 pages
Brown Goecks 2015 Sample NextGenDNASequencingInformatics2ed
No ratings yet
Brown Goecks 2015 Sample NextGenDNASequencingInformatics2ed
8 pages
Gene Expression RNA Sequence
No ratings yet
Gene Expression RNA Sequence
120 pages
Day1 Laros RNASeq Galaxy 2012
No ratings yet
Day1 Laros RNASeq Galaxy 2012
40 pages
The Bench Scientist's Guide To Statistical Analysis of RNA-Seq Data
No ratings yet
The Bench Scientist's Guide To Statistical Analysis of RNA-Seq Data
10 pages
Transcriptome Software Paper
No ratings yet
Transcriptome Software Paper
7 pages
Trapnell 2024 TopHat Discovering Splice Junction Wiht RNaSeq
No ratings yet
Trapnell 2024 TopHat Discovering Splice Junction Wiht RNaSeq
7 pages
RNA Sequnecing and Analysis - 2015 Nihms768779
No ratings yet
RNA Sequnecing and Analysis - 2015 Nihms768779
29 pages
A Guide To Basic RNA Sequencing Data
No ratings yet
A Guide To Basic RNA Sequencing Data
30 pages
RNA Seq Tutorial
0% (1)
RNA Seq Tutorial
139 pages
RNA Seq
No ratings yet
RNA Seq
3 pages
Alignment
No ratings yet
Alignment
3 pages
RNA-Seq Module 1
No ratings yet
RNA-Seq Module 1
54 pages
8987 - Gordon Smyth v2
No ratings yet
8987 - Gordon Smyth v2
51 pages
Chapter On Transcriptomics
No ratings yet
Chapter On Transcriptomics
13 pages
RNA-Seq Workflow: Gene-Level Exploratory Analysis and Differential Expression
No ratings yet
RNA-Seq Workflow: Gene-Level Exploratory Analysis and Differential Expression
42 pages
Rna Seq Workflows Guide M GL 00034
No ratings yet
Rna Seq Workflows Guide M GL 00034
24 pages
RNA Sequencing (RNA-seq) - Comprehensive Notes
No ratings yet
RNA Sequencing (RNA-seq) - Comprehensive Notes
5 pages
Blank en Berg Pittsburgh 2011 Ngs
No ratings yet
Blank en Berg Pittsburgh 2011 Ngs
59 pages
Module 3 5mark.
No ratings yet
Module 3 5mark.
23 pages
Ribozyme Technology
100% (1)
Ribozyme Technology
16 pages
Checkpoint Questions
No ratings yet
Checkpoint Questions
13 pages
Genome Organization in Prokaryote
No ratings yet
Genome Organization in Prokaryote
21 pages
Central Dogma of Molecular Biology
No ratings yet
Central Dogma of Molecular Biology
7 pages
1nu23 - Nucleic Acids Lab Manual - Group 2
No ratings yet
1nu23 - Nucleic Acids Lab Manual - Group 2
7 pages
Extra DPP-03 - Molecular Basis of Inheritance
No ratings yet
Extra DPP-03 - Molecular Basis of Inheritance
6 pages
tRNA Structure
No ratings yet
tRNA Structure
5 pages
M
No ratings yet
M
9 pages
Genetic Code
No ratings yet
Genetic Code
10 pages
12 Biology Notes Ch06 Molecular Basis of Inheritance
No ratings yet
12 Biology Notes Ch06 Molecular Basis of Inheritance
6 pages
Chapter 5 Biochemistry and Clinical Pathology Complete Notes by Noteskarts Acc To ER20
No ratings yet
Chapter 5 Biochemistry and Clinical Pathology Complete Notes by Noteskarts Acc To ER20
5 pages
Tarifa Neb 2011-1
No ratings yet
Tarifa Neb 2011-1
19 pages
Optimized Protocol Human Whole Exome Sequencing App Note
No ratings yet
Optimized Protocol Human Whole Exome Sequencing App Note
5 pages
1 introToR 2
No ratings yet
1 introToR 2
32 pages
5 RNAseq DEGs
No ratings yet
5 RNAseq DEGs
32 pages
Illustrated DIY CRISPR - Freeze Dried DH5a Liquid Plasmids
No ratings yet
Illustrated DIY CRISPR - Freeze Dried DH5a Liquid Plasmids
19 pages
Final Project
No ratings yet
Final Project
10 pages
2 DNA Replication IP
No ratings yet
2 DNA Replication IP
3 pages
Chapter-11 Nucleic Acid
No ratings yet
Chapter-11 Nucleic Acid
31 pages
分生實驗四
No ratings yet
分生實驗四
2 pages
Packaging of DNA Into Chromosome
No ratings yet
Packaging of DNA Into Chromosome
13 pages
Primer3 Output (Primer3 - Resghults - Cgi Release 4.1.0)
No ratings yet
Primer3 Output (Primer3 - Resghults - Cgi Release 4.1.0)
2 pages
BioChem Map
No ratings yet
BioChem Map
3 pages
Genetic Engineering and Recombinant DNA Technology
No ratings yet
Genetic Engineering and Recombinant DNA Technology
38 pages
KOD Xtreme™ Hot Start DNA Polymerase
No ratings yet
KOD Xtreme™ Hot Start DNA Polymerase
8 pages
The Case of The Druid Dracula - PCR Lab
No ratings yet
The Case of The Druid Dracula - PCR Lab
12 pages
Mcqs Nucleic Acids
No ratings yet
Mcqs Nucleic Acids
27 pages
Analysis of Gene Expression
No ratings yet
Analysis of Gene Expression
28 pages
Regulation of Translation in Developmental Process
No ratings yet
Regulation of Translation in Developmental Process
5 pages
Ibi Viral-Nucleic-Acid-Extraction Kit Protocol Web
No ratings yet
Ibi Viral-Nucleic-Acid-Extraction Kit Protocol Web
5 pages
Lab Manual 3
No ratings yet
Lab Manual 3
7 pages
Sat Ii Biology E/M DR Haitham Abdallah 0100 36 777 19 DNA
No ratings yet
Sat Ii Biology E/M DR Haitham Abdallah 0100 36 777 19 DNA
11 pages
CM NV Diagnosis
No ratings yet
CM NV Diagnosis
13 pages
DNA Timeline
No ratings yet
DNA Timeline
2 pages
RNA Regulation
From Everand
RNA Regulation
Robert A. Meyers
No ratings yet

3 RNAseq Background

Uploaded by

3 RNAseq Background

Uploaded by

Computational Biology Associate professor: Tingwen Chen ( 陳亭妏 )

Lab Office: Room 420 bioICT building

100s of millions of paired reads

• Some molecular features can only be observed at the

• Prioritizing protein coding somatic mutations (often heterozygous)

1. RNA is isolated from a sample,

1. RNA is isolated from a sample,

1. RNA is isolated from a sample,

1. RNA is isolated from a sample,

1. RNA is isolated from a sample,

Total exon reads=18 Total exon reads=2

You might also like