Salmon

Uploaded by

carucast

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views3 pages

Salmon

Uploaded by

carucast

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

RNA-SEQ QUANTIFICATION

RSEM and Salmon

1. Expression index

• RPKM (reads per kilobase per million):

§ (Total reads/ 1M) / gene length in KB.
§ Corrects for coverage, gene length.
§ Methods that used RPKM TopHat and Cuﬄinks.
• TPM (transcripts per million):
§ (Read count/gene length)/scaling factor (summatory of RPK across all genes/1M).
§ ProporGon of reads mapped to a gene in each sample à is comparable.
§ It is used by RSEM algorithm.
• CPM (count per million): it`s used for diﬀerenGal expression assays.

2. RSEM for quan6ﬁca6on

• Input:
§ FASTQ (is going to be slower due to the reads must be mapped) or BAM files.
§ Reference transcript annotaGon files: gene readout, or transcript readout, or the coding genes, or even
non-coding regions).
• Output: transcript-level gene expression (read count, TPM, FPKM) calculated on effecGve transcript length.
• EffecGve transcript length:
§ This is not the full gene length. It`s the coding part of them, the cDNA or exons.
§ Due to degradaGon of the ends, the reads have good coverage in the middle but worse in the ends these
is why you have to apply this correcGon.
§ Given the sequence composiGon of these transcripts, you would expect a priori to sample more reads
from them.
3. Isoform inference

• Depending on the transcript reference ﬁle that you give to RSEM, if you only give exons without considering
the isoforms of a gene, it can give you a general level expression esGmate.
• Given known set of isoforms:
§ EsGmate x (abundance of the isoform) by observing the n (number of reads exon) and knowing the
length of the exons and the exons that are part of the isoforms you can get the relaGve abundance of the
isoforms on a sample.
4. Pseudoalignment

• RSEM is considered the best quanGficaGon approach, but it could be a liYle bit slow.
• Pseudoalignment algorithms such as Kallisto and Salmon are faster because instead of doing a full alignment of
the reads across the whole genome they use the coding transfer (2% of the genome).
• They need reference transcript annotaGon files.
• Find all the transcripts and posiGons that a read is compaGble with (not useful to detect novel transcripts or gene
fusions).
• Salmon also corrects for sequence-specific GC biases.
• Can run either FASTQ files or BAM files.
• Can map 10 million reads in a few minutes, sacrificing accuracy for speed.

5. Output

• Kallisto (abundance.tsv ﬁle):

§ Taget ID: coding region of the genome where reads where mapped.
§ Length: length of the coding region.
§ Eff_length: length correcGon due to possible end degradaGon of the reads.
§ TPM (transcripts per million): proporGon of reads mapped to a gene in each sample.
§ Est_counts: number of reads per million mapped in this coding region of the genome.
• Salmon (quant.sf file):
§ Name: coding region of the genome where reads where mapped.
§ Length: length of the coding region.
§ EffecGve length: length correcGon due to possible end degradaGon of the reads.
§ TPM (transcripts per million): proporGon of reads mapped to a gene in each sample.
§ NumReads: number of reads per million mapped in this coding region of the genome.

Genomics
No ratings yet
Genomics
43 pages
Anatomy and Pathophysiology of Anemia
88% (8)
Anatomy and Pathophysiology of Anemia
9 pages
Rnaseq by Example
No ratings yet
Rnaseq by Example
163 pages
Purchase Order: Po No. Dated
No ratings yet
Purchase Order: Po No. Dated
3 pages
441 2653 2 PB
No ratings yet
441 2653 2 PB
1 page
Lecture4 Expression - Analysis 2019
No ratings yet
Lecture4 Expression - Analysis 2019
79 pages
HW5e Int Tests Guide
50% (2)
HW5e Int Tests Guide
1 page
2023-GenomicaFuncional y Biocomputacion-Day1
No ratings yet
2023-GenomicaFuncional y Biocomputacion-Day1
92 pages
Beginner's Guide To Using The DESeq2 Package
No ratings yet
Beginner's Guide To Using The DESeq2 Package
32 pages
Grammar of The Yucatecan Language
75% (4)
Grammar of The Yucatecan Language
412 pages
Nazarov QC-Statistics
No ratings yet
Nazarov QC-Statistics
50 pages
Impact of Gene Annotation On RNA-seq Data Analysis Shanrong Zhao and Baohong Zhang
No ratings yet
Impact of Gene Annotation On RNA-seq Data Analysis Shanrong Zhao and Baohong Zhang
23 pages
Measuring Transcriptomes With RNA-Seq
No ratings yet
Measuring Transcriptomes With RNA-Seq
48 pages
4 RNAseq-Quantification LO
No ratings yet
4 RNAseq-Quantification LO
30 pages
4 RNAseq Datapreprocessing
No ratings yet
4 RNAseq Datapreprocessing
43 pages
Week13
No ratings yet
Week13
43 pages
Intro To Pneumatics Modified
No ratings yet
Intro To Pneumatics Modified
35 pages
WES Shivangi
No ratings yet
WES Shivangi
43 pages
Trinity
No ratings yet
Trinity
25 pages
Rsamtools Overview
No ratings yet
Rsamtools Overview
13 pages
Artigo Transcriptoma
No ratings yet
Artigo Transcriptoma
11 pages
Analysis of RNA-Seq Data
No ratings yet
Analysis of RNA-Seq Data
71 pages
RNA Seq R - Final Decode
No ratings yet
RNA Seq R - Final Decode
76 pages
Epp
100% (1)
Epp
2 pages
NOISeq
No ratings yet
NOISeq
26 pages
Classes of Molecular Markers
No ratings yet
Classes of Molecular Markers
39 pages
GKV 1157
No ratings yet
GKV 1157
7 pages
Anotacion de Genomas
No ratings yet
Anotacion de Genomas
84 pages
Introduction To Quantitative Real-Time PCR
No ratings yet
Introduction To Quantitative Real-Time PCR
41 pages
Module8 RNASeq Pathogen Practical Manual
No ratings yet
Module8 RNASeq Pathogen Practical Manual
23 pages
Gmapr: Use The Gmap Suite of Tools in R: Michael Lawrence, Cory Barr October 26, 2021
No ratings yet
Gmapr: Use The Gmap Suite of Tools in R: Michael Lawrence, Cory Barr October 26, 2021
8 pages
3 RNAseq-Mapping LO
No ratings yet
3 RNAseq-Mapping LO
98 pages
Article3 Voom LIMMA
No ratings yet
Article3 Voom LIMMA
17 pages
Trinity Workshop Activities
No ratings yet
Trinity Workshop Activities
11 pages
Bray, 2016
No ratings yet
Bray, 2016
5 pages
Whole Exome Seq Data Analysis 1742774815
No ratings yet
Whole Exome Seq Data Analysis 1742774815
58 pages
M.SC Transcriptome Analysis 2025
No ratings yet
M.SC Transcriptome Analysis 2025
21 pages
HHS Public Access: Ballgown Bridges The Gap Between Transcriptome Assembly and Expression Analysis
No ratings yet
HHS Public Access: Ballgown Bridges The Gap Between Transcriptome Assembly and Expression Analysis
9 pages
Lab03 - Lab Manual
No ratings yet
Lab03 - Lab Manual
16 pages
Assignment CB 1
No ratings yet
Assignment CB 1
69 pages
CLC Genomics Workbench User Manual Subset
No ratings yet
CLC Genomics Workbench User Manual Subset
222 pages
FreeBayes Variant Calling Workflow For DNA-Seq - Bioinformatics Workbook
No ratings yet
FreeBayes Variant Calling Workflow For DNA-Seq - Bioinformatics Workbook
9 pages
Alignment
No ratings yet
Alignment
3 pages
J. H. Wells, L. R. Williams Auth. Embeddings and Extensions in Analysis PDF
100% (1)
J. H. Wells, L. R. Williams Auth. Embeddings and Extensions in Analysis PDF
116 pages
List of Online Bioinformatics Tools and Software - Final
No ratings yet
List of Online Bioinformatics Tools and Software - Final
23 pages
12 Blossum
No ratings yet
12 Blossum
10 pages
Edger: Differential Expression Analysis of Digital Gene Expression Data
No ratings yet
Edger: Differential Expression Analysis of Digital Gene Expression Data
69 pages
RNA-Seq Analysis Course
No ratings yet
RNA-Seq Analysis Course
40 pages
Understanding QPCR Results
No ratings yet
Understanding QPCR Results
3 pages
Chua Yuen Chong, Gerrard - BIO61604 - Pract 3 and 4
No ratings yet
Chua Yuen Chong, Gerrard - BIO61604 - Pract 3 and 4
20 pages
Quality Control & Normalization of RNA SEQ Data: Shivangi Agarwal, PHD
No ratings yet
Quality Control & Normalization of RNA SEQ Data: Shivangi Agarwal, PHD
35 pages
Mileidy W. Gonzalez and William R. Pearson
No ratings yet
Mileidy W. Gonzalez and William R. Pearson
23 pages
Module 3 5mark.
No ratings yet
Module 3 5mark.
23 pages
R8 Waray BoSY CRLA 11.24.2021 v4
No ratings yet
R8 Waray BoSY CRLA 11.24.2021 v4
10 pages
Freedman 2024
No ratings yet
Freedman 2024
9 pages
RNA-Seq Module 1
No ratings yet
RNA-Seq Module 1
54 pages
Transcriptome Software Paper
No ratings yet
Transcriptome Software Paper
7 pages
Riborex
No ratings yet
Riborex
9 pages
The Bench Scientist's Guide To Statistical Analysis of RNA-Seq Data
No ratings yet
The Bench Scientist's Guide To Statistical Analysis of RNA-Seq Data
10 pages
Poster PPT Portrait
No ratings yet
Poster PPT Portrait
1 page
Sequencing Genomes
No ratings yet
Sequencing Genomes
7 pages
Genomics For Beginner
No ratings yet
Genomics For Beginner
9 pages
Bio Tools Booklet
No ratings yet
Bio Tools Booklet
5 pages
Microreads ALLPATHS: de Novo Assembly of Whole-Genome Shotgun
No ratings yet
Microreads ALLPATHS: de Novo Assembly of Whole-Genome Shotgun
12 pages
Blank en Berg Pittsburgh 2011 Ngs
No ratings yet
Blank en Berg Pittsburgh 2011 Ngs
59 pages
MCA 301 Data Mining Notes
No ratings yet
MCA 301 Data Mining Notes
6 pages
ENM Installation Guide
No ratings yet
ENM Installation Guide
19 pages
Nike - Final Report
No ratings yet
Nike - Final Report
13 pages
Medical Technology Laws and Bioethics: Allyson F. Higoy, RMT
No ratings yet
Medical Technology Laws and Bioethics: Allyson F. Higoy, RMT
62 pages
Qawaid Fiqhiyyah
No ratings yet
Qawaid Fiqhiyyah
411 pages
Test Accessories Main Catalog: Test & Measureline - Test & Measurement
No ratings yet
Test Accessories Main Catalog: Test & Measureline - Test & Measurement
188 pages
External Environment
No ratings yet
External Environment
54 pages
Contraception Today A Pocketbook For General Practitioners and Practice Nurses 7th Edition John Guillebaud
No ratings yet
Contraception Today A Pocketbook For General Practitioners and Practice Nurses 7th Edition John Guillebaud
55 pages
Tube Stube Settlers
No ratings yet
Tube Stube Settlers
9 pages
Early Sequence Aligment
No ratings yet
Early Sequence Aligment
14 pages
Tema 10 Leukocyte Migration
No ratings yet
Tema 10 Leukocyte Migration
36 pages
Resume Real Estate Finance
No ratings yet
Resume Real Estate Finance
31 pages
The Philosophy of Fear and Freedom
No ratings yet
The Philosophy of Fear and Freedom
2 pages
w9 - L2 - Review For Lecture Midterm 2
No ratings yet
w9 - L2 - Review For Lecture Midterm 2
14 pages
DKA NICE Guidelines
No ratings yet
DKA NICE Guidelines
6 pages
Lab Report Quantitative Determination of Protease Activity
No ratings yet
Lab Report Quantitative Determination of Protease Activity
6 pages
Distribution
No ratings yet
Distribution
7 pages
GSEA
No ratings yet
GSEA
7 pages
Examen Innovation I
No ratings yet
Examen Innovation I
6 pages
Stridhana A Critical Approach Research M
No ratings yet
Stridhana A Critical Approach Research M
13 pages
Motion To Disqualify Allen Baddour
No ratings yet
Motion To Disqualify Allen Baddour
12 pages
Ajp12. Minu
No ratings yet
Ajp12. Minu
9 pages
Brochure Antech Type C
No ratings yet
Brochure Antech Type C
2 pages
IPE 4715 Material Handling and Maintenance
No ratings yet
IPE 4715 Material Handling and Maintenance
2 pages
Eaton Sure Lites Sel25 50 60 Spec PDF
No ratings yet
Eaton Sure Lites Sel25 50 60 Spec PDF
7 pages
Reading Passage 1
No ratings yet
Reading Passage 1
13 pages
Winklers Disease
No ratings yet
Winklers Disease
2 pages
24/07/08 TP-Link W8920G 108M ADSL and ADSL2+ Set Up Guide
No ratings yet
24/07/08 TP-Link W8920G 108M ADSL and ADSL2+ Set Up Guide
7 pages
Basic Information About C language PDF
From Everand
Basic Information About C language PDF
Suraj Das
No ratings yet
Gene Expression Programming: Fundamentals and Applications
From Everand
Gene Expression Programming: Fundamentals and Applications
Fouad Sabry
No ratings yet

Salmon

Uploaded by

Salmon

Uploaded by

RNA-SEQ QUANTIFICATION

RSEM and Salmon

• RPKM (reads per kilobase per million):

2. RSEM for quan6ﬁca6on

• Kallisto (abundance.tsv ﬁle):

You might also like