From RNA-seq Reads To Gene Expression

The document summarizes how to analyze RNA-seq data to identify genes that are differentially expressed between normal and mutated cell samples. Key steps include: 1) Mapping RNA sequencing reads to a reference genome; 2) Counting the reads mapped to each gene; 3) Normalizing read counts to account for differences in sequencing depth; 4) Using statistical tools such as edgeR or DESeq2 to identify differentially expressed genes. The output is a list of differentially expressed genes, which can then be used to validate hypotheses or identify enriched biological pathways.

Uploaded by

HoangHai

From RNA-seq reads to Gene Expression
Hoang Thanh Hai
Introduction
[Figure: a bunch of normal cells and a bunch of mutated cells]

The mutated cells behave differently than the normal cells

What genetic mechanism is causing the difference?

Answer: look at differences in gene expression

[Figure: a bunch of cells contains a bunch of chromosomes, which contain a bunch of genes. Active genes have an mRNA level; non-active genes do not.]
RNA-seq measures gene expression in normal cells.
RNA-seq measures gene expression in mutated cells.
Compare the two.

High-throughput sequencing tells us which genes are active and how much they are transcribed.
1. No difference in gene 1 between normal and mutated cells
2. Gene 2 is up-regulated in mutated cells
3. Gene 3 is suppressed in mutated cells
Three main steps for RNA-seq
Transcriptome profiling using NGS
Step 1: Library preparation
Step 2: Sequencing
Step 3: Data analysis
Step 1: Preparing an RNA-seq library
Step 3: Data analysis
From reads to differential expression
1. Raw-data quality: raw sequence data (FASTQ files); QC by FastQC/R
2. Reads mapping: unspliced mapping (BWA, Bowtie) or spliced mapping (TopHat, MapSplice); output: mapped reads (SAM/BAM files); QC by RNA-SeQC
3. Expression quantification: summarize read counts, or FPKM/RPKM (Cufflinks)
4. DE testing: DESeq, edgeR, etc., or Cuffdiff; output: list of DE genes
5. Functional interpretation: function enrichment, infer networks, integrate with other data; output: biological insights & hypothesis


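Raw data for a sample: FASTQ files
Each read takes four lines:
Line 1: Sequence identifier (starts with "@")
Line 2: Raw sequence
Line 3: Separator line (starts with "+", optionally followed by the identifier again)
Line 4: Quality values for the sequence (one encoded character per base)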
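
The four-line layout can be read with a few lines of Python. This is a minimal sketch, assuming uncompressed text input and the common Phred+33 quality encoding (ASCII code minus 33); the names `FastqRecord` and `read_fastq` are made up for illustration and are not part of any tool mentioned in the slides.

```python
import io
from typing import Iterator, List, NamedTuple, TextIO


class FastqRecord(NamedTuple):
    identifier: str
    sequence: str
    qualities: List[int]


def read_fastq(handle: TextIO, phred_offset: int = 33) -> Iterator[FastqRecord]:
    """Yield one record per four-line FASTQ entry."""
    while True:
        header = handle.readline().rstrip("\n")
        if not header:  # end of file
            return
        sequence = handle.readline().rstrip("\n")
        separator = handle.readline().rstrip("\n")
        quality = handle.readline().rstrip("\n")
        if not header.startswith("@") or not separator.startswith("+"):
            raise ValueError("malformed FASTQ record near: " + header)
        if len(sequence) != len(quality):
            raise ValueError("sequence and quality lengths differ: " + header)
        # Assumption: Phred+33 encoding, so quality = ASCII code - 33.
        scores = [ord(char) - phred_offset for char in quality]
        yield FastqRecord(header[1:], sequence, scores)


# A single hypothetical read, held in memory instead of a file.
example = io.StringIO("@read_1\nGATTACA\n+\nIIII#5I\n")
records = list(read_fastq(example))
print(records[0].identifier, records[0].qualities)
```

In this example the "#" character decodes to a quality of 2, which is the kind of low-quality base the QC step is meant to reveal.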
Step 1: Low-quality reads check

Tools:
• FastQC: reports
– Total reads, sequence length
– Per base sequence quality
– Overrepresented sequences
– GC content
– Duplication level

• MultiQC: summarizes FastQC results

[Figure: FastQC per base sequence quality plot]

Command (the FastQC_Report directory must already exist): fastqc -o FastQC_Report *fastq.gz

Step 2: Mapping RNA-seq reads to the genome
Using STAR to align

Visualize mapping results with Artemis

Step 3: Count the number of reads
• Count the reads per gene -> a matrix of numbers

First column -> gene names
Remaining columns -> number of counts for each sample
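
The matrix layout described above can be sketched as follows. This assumes each read has already been assigned to a gene by a counting tool; the function name `build_count_matrix` and the sample and gene names are hypothetical.

```python
from collections import Counter


def build_count_matrix(assignments):
    """Build a gene-by-sample count table.

    assignments maps a sample name to a list of gene names,
    one entry per read assigned to that gene.
    """
    samples = list(assignments)
    per_sample = {sample: Counter(assignments[sample]) for sample in samples}
    genes = sorted({gene for counts in per_sample.values() for gene in counts})
    header = ["gene"] + samples
    # First column: gene name. Remaining columns: counts for each sample.
    rows = [[gene] + [per_sample[sample][gene] for sample in samples]
            for gene in genes]
    return header, rows


# Hypothetical read assignments for one normal and one mutant sample.
assignments = {
    "normal_1": ["gene1", "gene1", "gene2"],
    "mutant_1": ["gene1", "gene2", "gene2", "gene3"],
}
header, rows = build_count_matrix(assignments)
print(header)
for row in rows:
    print(row)
```

A gene with no reads in a sample gets a count of 0, so every row has one value per sample.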
Could we state which genes are up-regulated or down-regulated based on the raw counts alone? -> No
Normalize data before comparison

Sample #1 has 635 reads assigned to it. Sample #2 has 1270 reads assigned to it, twice as many as Sample #1.
As a result, the raw read counts make it look like the genes in Sample #2 were transcribed twice as much as in Sample #1, even though the difference may only reflect how deeply each sample was sequenced.
Normalize data before comparison

Adjust the read counts per gene to reflect differences in how many reads were assigned to each sample.

There are many sophisticated methods. The simplest is to divide the read counts per gene by the total number of reads mapped to each sample and multiply by one million (counts per million, CPM).
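
The CPM calculation can be sketched with the two library sizes from the slide. The per-gene counts below are invented so that they sum to 635 and 1270; only the totals come from the original.

```python
def counts_per_million(counts):
    """Scale raw per-gene counts by library size.

    counts maps gene name to raw read count for one sample.
    """
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sample has no assigned reads")
    return {gene: count / total * 1_000_000 for gene, count in counts.items()}


# Hypothetical per-gene counts; Sample #2 has every count doubled.
sample_1 = {"gene1": 100, "gene2": 200, "gene3": 335}   # total 635
sample_2 = {"gene1": 200, "gene2": 400, "gene3": 670}   # total 1270

cpm_1 = counts_per_million(sample_1)
cpm_2 = counts_per_million(sample_2)
for gene in sample_1:
    print(gene, round(cpm_1[gene], 1), round(cpm_2[gene], 1))
```

After scaling, the two samples give the same value for every gene, so the apparent two-fold difference in the raw counts disappears. CPM corrects only for library size; edgeR and DESeq2 use more robust normalization methods.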
Step 4: Differential expression testing
The first step in any DE testing is always the same:
Plot the data

The data is a huge matrix…

Plot the sample data
PCA plot

But we have thousands of genes…

So we would need a graph with thousands of axes (one per gene) to plot the raw data…

PCA reduces the number of axes you need to display the important aspects of the data.
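
To make the idea concrete, here is a minimal pure-Python sketch that finds only the first principal component by power iteration. It assumes rows are samples and columns are genes on a log scale; the expression values are made up, and a real analysis would normally use a library PCA routine instead.

```python
import math
import random


def first_principal_component(matrix, iterations=200, seed=0):
    """Return (loadings, scores) for PC1 of a samples-by-genes matrix."""
    n_samples = len(matrix)
    means = [sum(column) / n_samples for column in zip(*matrix)]
    # Center each gene so PC1 describes variation, not average level.
    centered = [[value - mean for value, mean in zip(row, means)]
                for row in matrix]
    rng = random.Random(seed)
    loadings = [rng.gauss(0.0, 1.0) for _ in means]  # random start direction
    for _ in range(iterations):
        scores = [sum(v * w for v, w in zip(row, loadings)) for row in centered]
        # Multiply by X^T X without ever forming the gene-by-gene matrix.
        loadings = [sum(s * row[j] for s, row in zip(scores, centered))
                    for j in range(len(means))]
        norm = math.sqrt(sum(w * w for w in loadings))
        if norm == 0.0:
            raise ValueError("the data has no variance")
        loadings = [w / norm for w in loadings]
    scores = [sum(v * w for v, w in zip(row, loadings)) for row in centered]
    return loadings, scores


# Hypothetical log-expression values: 2 wild-type and 2 mutant samples, 3 genes.
samples = ["wt_1", "wt_2", "mt_1", "mt_2"]
expression = [
    [5.0, 2.0, 7.0],
    [5.1, 2.2, 6.9],
    [5.0, 6.0, 3.0],
    [4.9, 6.1, 3.2],
]
loadings, scores = first_principal_component(expression)
for name, score in zip(samples, scores):
    print(name, round(score, 2))
```

Each sample is reduced from three numbers to one PC1 score. The wild-type scores share one sign and the mutant scores the other, which is the one-axis version of the clusters seen in a PCA plot. The sign of a principal component is arbitrary, so which group lands on the left can flip.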
[Figure: PCA plot with one sample marked "Exclude"]

The wild-type samples make a nice cluster on the left side.
The mutated samples make a nice cluster on the right side.

• PC1 captures the most important differences -> this means the biggest differences are between the WT and the MT samples.
An Example
In summary, plotting the data…
• Tells us if we can expect to find interesting
differences.
• Tells us if we should exclude some samples
from any downstream analysis.
Identify differentially expressed genes between
the “normal” and “mutant” samples.
• This is typically done using R with either edgeR or DESeq2,
and the results are generally displayed as a plot of relative difference against expression level.
A red dot is a gene that is significantly different between “normal” and “mutant” samples.
Black dots are genes that are not significantly different.
The X-axis tells you how much each gene is transcribed.
The Y-axis tells you how big the relative difference is
between “normal” and “mutant”.
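
The two axes of such a plot can be computed as below. This is only a sketch of the plotted quantities, assuming normalized values (for example CPM) per gene and a pseudocount of 1 to avoid taking the log of zero. It performs no statistical test; deciding which dots are red is the job of edgeR or DESeq2. The function name `ma_values` and all numbers are illustrative.

```python
import math


def ma_values(normal, mutant, pseudocount=1.0):
    """Return {gene: (average log2 expression, log2 fold change)}."""
    result = {}
    for gene in normal:
        log_normal = math.log2(normal[gene] + pseudocount)
        log_mutant = math.log2(mutant[gene] + pseudocount)
        average = (log_normal + log_mutant) / 2.0   # X-axis: expression level
        fold_change = log_mutant - log_normal       # Y-axis: relative difference
        result[gene] = (average, fold_change)
    return result


# Hypothetical normalized expression values for three genes.
normal = {"gene1": 7.0, "gene2": 3.0, "gene3": 15.0}
mutant = {"gene1": 7.0, "gene2": 15.0, "gene3": 3.0}

points = ma_values(normal, mutant)
for gene, (average, fold_change) in points.items():
    print(gene, average, fold_change)
```

Here gene1 sits at 0 on the Y-axis (no difference), gene2 at +2 (up-regulated in the mutant), and gene3 at -2 (suppressed in the mutant).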
We’ve identified interesting genes, now what?

• If you know what you’re looking for, you can see if the experiment
validated your hypothesis.
• If you don’t know what you’re looking for, you can see if certain
pathways are enriched in either the normal or mutant gene sets.
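
One common way to ask whether a pathway is enriched is a hypergeometric (one-sided Fisher) test on the overlap between the DE gene list and the pathway's gene set. The slides do not say which method they have in mind, so treat this as one possible sketch; the function name and all numbers are made up.

```python
from math import comb


def enrichment_p_value(n_background, n_pathway, n_selected, n_overlap):
    """Probability of seeing at least n_overlap pathway genes by chance.

    n_background: genes tested in total
    n_pathway:    background genes that belong to the pathway
    n_selected:   genes called differentially expressed
    n_overlap:    DE genes that belong to the pathway
    """
    total = comb(n_background, n_selected)
    upper = min(n_pathway, n_selected)
    tail = sum(
        comb(n_pathway, k) * comb(n_background - n_pathway, n_selected - k)
        for k in range(n_overlap, upper + 1)
    )
    return tail / total


# Hypothetical numbers: 20 genes tested, 5 in the pathway,
# 4 genes called DE, 3 of which are in the pathway.
p_value = enrichment_p_value(20, 5, 4, 3)
print(round(p_value, 4))
```

A small p-value suggests the pathway is over-represented among the DE genes. When many pathways are tested, the p-values also need a multiple-testing correction.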
