rna-seq: from a to t - michigan state universityrna-seq: from a to t . nick beckloff director,...
TRANSCRIPT
![Page 1: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/1.jpg)
RNA-Seq: From A to T
Nick Beckloff Director, Genomics Core
Research Technology Support Facilities
Tracy Teal BEACON, MMG
Michigan State University 7/30/2014
![Page 2: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/2.jpg)
Outline
• RNA-Seq basics • Sample input • Choosing a method • Sequencing • Library Preparation • Validation • RNA-Seq vs Arrays • Future
![Page 3: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/3.jpg)
Upcoming Events
• Wafegen seminar September* • Methylation Boot Camp September* • BiCEP Launch September • iCER 10th Anniversary October
* Tentative due to availability
![Page 4: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/4.jpg)
RNA-Seq Applications
RNA-Seq
Transcriptome Profiling
Biomarkers
sRNA Variants
Isoforms
Novel transcripts
Contents in this presentation may change at any time**
![Page 5: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/5.jpg)
RNA-Seq Basics
RNA Isolation RNA QC RNA-Seq Library prep
Fragmentation cDNA synthesis
Adapter/Barcode Amplification
Analysis Library QC Sequencing
![Page 6: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/6.jpg)
RNA-seq Project Checklist
• Budget • Repetitions
Analysis
• Quality, quantity • Species
Library
• Read Length • Depth
Sequencing
![Page 7: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/7.jpg)
RNA-seq Guidelines
Reps
Length Coverage
Length Differential Expression = 50 vs 100 bp Coverage: Euk = 20-30M reads/sample Prok = 10M reads/sample Repetitions = 3+ Reps > Depth “when moving from 10 – 30 MM reads with 2 or 3 replicates, one will pull in approximately 25% more differentially expressed (DE) genes.” Bioinformatics Volume 30, Issue 3. 301-304.
![Page 8: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/8.jpg)
RNA-seq Guidelines Depth vs Reps: A real world scenario 12 total samples (1 control and 5 conditions in duplicate) 1×50 bp sequncing for differential expression 10 MM reads per samples (12 samples in one lane of ILMN HiSeq) OR 30 MM reads (4 samples in one lane of ILMN HiSeq). What is the best scenario?
Reads from 10 to 30M gives 25% more reads at 1.5x cost
Increase is reps provides 35% more DE genes at 1.6x cost Bioinformatics Volume 30, Issue 3. 301-304.
![Page 9: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/9.jpg)
Project Planning High Quality
Partially Degraded
Degraded
RNA Input is one of the most important drivers in
selecting library prep method
• What Species? • Rna quality? • Quantity?
- 1-200ng, 1-5 ug, etc
• What RNA species? - mRNA, small RNA, etc
Choosing a Method
![Page 10: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/10.jpg)
Library Preps (PolyA selection)
• Up to 1 ug input • RIN >7.0 • Clean output • Only PolyA RNA • Stranded
Features/Limitations
![Page 11: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/11.jpg)
Library Preps (rRNA removal)
• Not 100% efficient • Check species compatibility • 100 ng-5 ug input* • Excludes RNAs < 200bp • Can use with degraded RNA
Features/Limitations Epicentre: Ribo-Zero TM
![Page 12: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/12.jpg)
Library Preps (rRNA removal)
http://www.epibio.com/rnamatchmaker
Check non-model organisms
BLAST rRNA seqs New Ribozero Plant
Leaf/Seed/Root!!
![Page 13: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/13.jpg)
Library Preps (rRNA removal)
• Not 100% efficient • Some junk carry over • Find sweet spot • May need extra reads
Limitations/Tips
Sweet Spot
![Page 14: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/14.jpg)
Library Preps (Quantitative RNA)
• 96 distinct, 8 nt Molecular Index
• Large number of combinations (96x96=9216)
• 96 Barcodes • 10-100 ng input • PE Sequencing
Features/Limitations Bioo Scientific qRNA-Seq kit
![Page 15: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/15.jpg)
Library Preps (Low Input RNA)
Nugen Ovation System V2
• Oligo DT and random priming amplifies both polyA AND non-poly A
• 500 pg input • Stranded • Prokaryotic version • RNA < 200 bp lost
Features/Limitations
![Page 16: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/16.jpg)
Library Preps (Low Input RNA)
SMARTer Universal Low Input
• Recommended for single cell
- Fluidgm C1 • Input 100 pg • Only for polyA samples • Yields best data for high
quality RNA*
*allegedly
Features/Limitations
![Page 17: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/17.jpg)
Library Preps (sRNA)
Netxflex small RNA Kit
• Sequencing of small RNAs and miRNAs
• Gel free adapter depletion • No PAGE gel cuts • Total RNA input >1 ug • Adapters are ligated to
samples
Features/Limitations
![Page 18: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/18.jpg)
Library Preps (Targeted RNA)
SureSelect RNA capture Kit
• Targeted capture of RNA • Total RNA input • KB to 10 MB size • Post-library capture
methods • Nugen uses single
primer method • No cost to design
Features/Limitations
![Page 19: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/19.jpg)
Library Preps (Targeted Depletion)
Nugen InDA-C (Insert Dependent Adapter Cleavage)
• Customized probes for target exclusion
• Eliminates targets by cleaving sequencing adapter
• Post-library selection • Used in Ovation Prokaryotic
system • Stranded
Features/Limitations
Bead purification
InDA-C Fragmentation Enrichment PCR
Strand selection
Adaptor ligation
2nd strand synthesis
1st strand synthesis
![Page 20: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/20.jpg)
Library Preps (Targeted Depletion)
Targeted Depletion of rRNA in (Vitis vinifera, cultivar pinot noir)
Percentage of Mapped Transcripts Cyto rRNA Chloroplast RNA Mito rRNA Informative
Library 1
Library 2
0% 10% 20% 30% 40% 50% 60% 70% 80% 90%
100%
% informative reads increased from 22% to 56% with InDA-C
15.4
28.2 0.
5 55.9
18.9
24.3 0.
6 56.2
![Page 21: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/21.jpg)
Mouse 18S Coverage with and without InDA-C
No InDA-C
With InDA-C
Probe location
Library Preps (Targeted Depletion)
![Page 22: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/22.jpg)
Library Preps (Single Cell RNA)
Ovation Single Cell System
• 1-5 pg input • Converts to cDNA
amplifies library
Features/Limitations
![Page 23: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/23.jpg)
Company Conf
idential
• Bioanalyzer traces of Ovation Single Cell RNA-Seq libraries from CD8+ resting sorted T-cells
• Functional libraries created from a single cell
• Estimated total RNA content per cell is 0.5 pg
% Non- rRNA
Single site
RefSeq Strand
Retention Total
Reads %
Aligned Input % rRNA Good alignment to genome – less wasted reads 1 cell 3,175,287 35 1.3 67 95.2
1 cell 2,928,789 39 1.4 69 87.9 Ribosomal reads less than 5% – more informative sequence
10 cells 3,089,680 54 4.1 69 97.9
Excellent strand retention – improved transcriptional value
10 cells 2,904,477 52 3.6 69 98.1
100 cells 3,687,320 81 2.9 66 98.0
100 cells 3,179,519 83 2.9 65 98.4
Library Preps (Single Cell RNA)
![Page 24: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/24.jpg)
RNA-Seq Validation
![Page 25: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/25.jpg)
RNA-Seq Validation (qPCR)
• qPCR validation of RNA-seq • Wafegen SmartChip System • >5,000 rxns/chip • 100 nl volume • Supports Taqman and Sybr
green • Custom targets for NGS
Features/Limitations
![Page 26: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/26.jpg)
RNA-seq vs Arrays
• RNA-seq vs Microarrays
- Cost is comparable - Microarrays only detect what is spotted - RNA-seq > arrays for isoforms, novel transcripts - Complimentary to one another • Intangible
• Reviewers prefer RNA-seq in grants
Take Home Message
Genomics researchers astonished to learn microarrays still exist!! – The Science Web
![Page 27: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/27.jpg)
Future of RNA-seq
• Transcriptomes of Everything • Lower Inputs • Single cell vs multiple profiles • Longer reads • More novel isoforms • More RNA subspecies
Take Home Message
![Page 28: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/28.jpg)
General considerations for RNA-seq quantification for differential expression
Tracy K. Teal
Assistant Professor Microbiology & Molecular Genetics
July 30, 2014
Adapted from NGS RNASeq slides Author: Ian Dworkin
![Page 29: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/29.jpg)
What are the goals of your research? Why did you generate all of the RNAseq data in the first place?
§ Transcriptome assembly (& SNP discovery) § Transcript discovery (variants for Transcription
start site, alternative splicing, etc..) § Quantification of (alternative transcripts) § Differential expression analysis across
treatments.
RNA-‐seq is generated for a number of reasons
![Page 30: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/30.jpg)
What was once thought to be separate goals are now clearly recognized as
intertwined.
§ Early work for RNA-seq tried to “mirror” the type of gene level analysis used in microarrays.
§ However, RNA-seq has demonstrated how important it is to take into account alternative transcripts, even when attempting to get “gene level” measures.
![Page 31: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/31.jpg)
How do we put together a useful pipeline for RNAseq
What are the steps we need to consider?
![Page 32: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/32.jpg)
How do we put together a useful pipeline for RNAseq?
What are the steps we need to consider? § Quality filtering § Genome/transcriptome assembly. § Mapping reads to genome/transcriptome. § Deal with alternative transcripts (new
transcriptome)? § Remap & count reads. § Differential expression
![Page 33: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/33.jpg)
Quality filtering Your analysis is only as good as your data
§ Quality control and removal of poor-quality reads (FASTQC, RNASeQC, fastx, …)
§ Remove adapters and linkers (FASTQC, Trimmomatic, …)
![Page 34: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/34.jpg)
Mapping reads Ultimately all analyses require read mapping
Image credit: Nir Friedman lab
![Page 35: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/35.jpg)
Challenge: alternative splicing
![Page 36: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/36.jpg)
Overview of RNA-Seq analysis pipeline for detecting differential expression
Oshlack et al., From RNA-‐seq reads to differen3al expression results, Genome Biology 2010.
Quality filter
![Page 37: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/37.jpg)
RNA-‐seq Workflows and Tools. Stephen Turner. Figshare. hJp://dx.doi.org/10.6084/m9.figshare.662782
![Page 38: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/38.jpg)
Pipelines for RNA-seq (geared towards splicing)
Alamancos et al. Methods to Study Splicing from RNA-‐Seq hJp://arxiv.org/abs/1304.5952 Figshare. hJp://dx.doi.org/10.6084/m9.figshare.679993
![Page 39: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/39.jpg)
The “tuxedo” protocol for RNA-seq
Trapnell C et al Differen[al gene and transcript expression analysis of RNA-‐seq experiments with TopHat and Cufflinks Nature Protocols 7, 562–578 (2012)
![Page 40: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/40.jpg)
Nookaew et al 2102 NAR
![Page 41: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/41.jpg)
How should we map reads
§ Do we want to map to a reference genome (with a “splice aware” aligner)?
§ Or do we want to map to a transcriptome directly?
![Page 42: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/42.jpg)
Mapping to the genome
How do we deal with alternative transcripts or paralogs during mapping?
§ "splicing aware" aligners: § Exon First: (Tophat, MapSplice, SpliceMap) Fig1A Garber § Step 1 - map reads to genome § Step 2 -unmapped reads are split, and aligned.
§ Seed & extend (Fig1B Garber) (GSNAP, QPALMA) § kmers from reads are mapped (the seeds), and then
extended
![Page 43: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/43.jpg)
Garber et al. 2011
![Page 44: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/44.jpg)
Mapping to a transcriptome
§ What might be the downside to mapping to the transcriptome? Incomplete transcriptomes can lead to errors in inferred expression levels. Potentially less well annotated. § For this Burrows-Wheeler is faster than seed
based approaches (shrimb & stampy), but the latter may be preferred if mapping to "distant" transcriptomes.
![Page 45: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/45.jpg)
Which to use
§ If a (close to?) perfect match transcriptome assembly is available for mapping. Burrows-wheeler based aligners can be much faster than seed based methods (upto 15x faster)
§ BW based aligners have reduced performance once mismatches are considered. § Exponential decrease in performance with each additional
mismatch (iteratively performs perfect searches). § Seed methods may be more sensitive when mapping to
transcriptomes of distantly related species (or high polymorphism rates).
From Garber et al. 2011
![Page 46: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/46.jpg)
Counting
§ One of the most difficult issues has been how to count reads.
§ What are some of the issues that we need to account for during counting of reads?
![Page 47: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/47.jpg)
Counting
§ We are interested in transcript abundance. § But we need to take into account a number of
things. § How many reads in the sample. § Length of transcripts § GC content and sequencing bias
![Page 48: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/48.jpg)
Counting
§ RPKM (Reads Per Kilobase of transcript per Million mapped reads) – Mortazavi et al 2008
§ FPKM (Fragments Per Kilobase of transcript per Million mapped reads). Avoids double counting in paired-end sequencing.
Normalizing a transcript's read count by both its length and the total number of mapped reads in the sample
![Page 49: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/49.jpg)
Garber et al. Computa[onal methods for transcriptome annota[on and quan[fica[on using RNA-‐seq. Nat Methods, Jun 2011
![Page 50: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/50.jpg)
Accounting for multiple isoforms
§ - Only count reads that map uniquely to an isoform. Can be very problematic, when isoforms do not have unique exons.
§ - so called "isoform-expression" methods (cufflinks, MISO) model the uncertainty parametrically (often using MLE). The model with the best mix of isoforms that models the data (highest joint probability) is the best estimate. How this is handled differs a great deal by the different model.
![Page 51: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/51.jpg)
Garber et al. 2011
![Page 52: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/52.jpg)
Trapnell C et al Differen[al analysis of gene regula[on at transcript resolu[on with RNA-‐seq Nat Biotechnol. 2013 Jan;31(1):46-‐53
![Page 53: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/53.jpg)
Differential expression
§ DEseq (http://www.ncbi.nlm.nih.gov/pubmed/20979621) § EDGE-R § EBseq (RSEM/EBseq) § RSEM (http://deweylab.biostat.wisc.edu/rsem/) § eXpress (http://bio.math.berkeley.edu/eXpress/overview.html) § Beers simulation pipeline(http://www.cbil.upenn.edu/BEERS/) § DEXseq (http://bioconductor.org/packages/release/bioc/html/DEXSeq.html) § Limma (voom) § Htseq (python library) works with DEseq
![Page 54: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/54.jpg)
Nookaew et al 2102 NAR
Differen[ally expressed genes based on sofware for quan[fica[on Differen[ally expressed genes based on sofware for mapping
![Page 55: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/55.jpg)
Problems with cufflink and cuffdiff? Reproducibility… § http://seqanswers.com/forums/showthread.php?t=20702 § http://seqanswers.com/forums/showthread.php?t=17662 § http://seqanswers.com/forums/showthread.php?t=23962 § http://seqanswers.com/forums/showthread.php?t=21020 § http://seqanswers.com/forums/showthread.php?t=21708 § http://www.biostars.org/p/6317/
![Page 56: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/56.jpg)
So, what to do?
![Page 57: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/57.jpg)
Example workflows and tutorials § Ian Dworkin’s NGS course protocols http://ged.msu.edu/angus/tutorials-2013/index.html
§ Bacterial RNA-Seq workflow from Ben Johnson & Rob Abramovitch http://www.abramovitchlab.com/#/rna-seq-computational-methods/ § Canadian Bioinformatics workshops http://bioinformatics.ca/workshops/2013/informatics-rna-sequence-analysis-2013 § Trinity and Tuxedo tutorials http://trinityrnaseq.sourceforge.net/rnaseq_workshop.html § Samtools for variant calling
![Page 58: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/58.jpg)
The “tuxedo” protocol for RNA-seq
Trapnell et al 2012
![Page 59: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/59.jpg)
Overviews of RNA-Seq
§ Graber et al, Computational methods for transcriptome annotation and quantification using RNA-seq, Nat Methods, Jun 2011
§ http://jura.wi.mit.edu/bio/education/hot_topics/RNAseq/RNAseqDE_Dec2011.pdf
![Page 60: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/60.jpg)
![Page 61: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/61.jpg)
Aligning to a transcriptome or a genome
§ Aligning to a genome, you have to account for the different splice variations
§ Aligning to a transcriptome, you have the different isoforms, so the mapping is more straightforward
§ However, you might have to assemble your own transcriptome
![Page 62: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/62.jpg)
How to assemble multiple alternative spliced transcripts?
1 2 3
In the presence of AS, conven[onal assembly may be erroneous, ambiguous, or truncated.
Overlapping
truncated truncated
correct truncated
![Page 63: RNA-Seq: From A to T - Michigan State UniversityRNA-Seq: From A to T . Nick Beckloff Director, Genomics Core Research Technology Support Facilities Tracy Teal ... RNA-seq Project Checklist](https://reader030.vdocuments.us/reader030/viewer/2022041008/5eb31a27bd2b1903ac60c05e/html5/thumbnails/63.jpg)
Need to use splice-aware assemblers
• Cufflinks (most commonly used) • Scripture • Trinity • Trans-‐ABySS • GRIT