ngs: the basics · next generation sequencing massively parallel sequencing ... immobilized pcr on...

29
NGS: the basics

Upload: others

Post on 18-Jun-2020

14 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

NGS: the basics

Page 2: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

Human genome sequence

June 26th 2000: official announcement of the completion of the draft of the human genome sequence (truly finished in 2004)

Costs:HGP:

3 billion $15 years

Celera:200 million $

2 years

Craig VenterFrancis Collins

Page 3: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

Next-Generation Sequencing (NGS): Slashing costs

Page 4: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

Next generation sequencing Massively parallel sequencing

Key: direct sequencing of DNA without the bacterial cloning step:

1. From colonies to poloniesImmobilized PCR on solid support

Flow cell or beads (emPCR)

2. Single molecule sequencingVoir Claude Thermes

Page 5: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

Library preparation

LM-PCR to allow single molecule amplification

Page 6: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

Clonal amplification of single molecules

Emulsion PCR on beads(454, Ion Torrent)

Page 7: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

Ion Torrent: Natural Chemistry

Page 8: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

Fast Direct Detection

Nucleotides flow sequentially over Ion semiconductor chipDirect detection of natural DNA extensionA few seconds per incorporation

Sensor Plate

Silicon SubstrateDrain SourceBulk

dNTP

To column receiver

∆ pH

∆ Q

∆ V

Sensing Layer

H+

Rothberg J.M. et al Nature doi:10.1038/nature10242

Page 9: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

Scalable Semiconductor Technology

WaferSemiconductor Manufacturing

ChipSemiconductor Packaging

Chip Cross Section

Semiconductor Design

Page 10: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

Illumina amplification step on a flow cell

Page 11: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

Sequencing by synthesis

CRT: cyclic reversible termination

Page 12: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

Illumina

Direct in situ sequencing of polonies

A C

TG

Page 13: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

MiSeq : 300 nt reads (15x106 per run)

NextSeq : 150 nt reads (400x106 per run)

HiSeq2500/3000/4000 : 100 – 150 nt reads (≈2x109 per run)

Illumina sequencers

Page 14: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

The 2016 winning technologies

IlluminaPoloniesSeveral 100 million readsA few 100 bp longError rate ~0.1%

Oxford nanoporesSingle moleculesA few 10,000 readsSeveral 10,000 bp longError rate ~10%

Page 15: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

Impact of costs decreases

Collecte des échantillons et design de l’expérience

SéquençageGestion DonnéesRéduction Données Analyses des données

100%

0%

Pre-NGS (2000) 2010 2020

Plan d’expérienceStratégie de construction des banques

Page 16: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

Grands types d’applications

Séquençage de novo de génomes Biologie de l’évolution Ouverture de l’éventail des modèles biologiques Diversité du vivant devient accessible à la biologie

moléculaire Caractérisation de la variabilité dans une

population Caractérisation de la diversité des espèces

dans l’environnement Caractérisation des mécanismes

d’interprétation de l’information génomique

Page 17: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

DNA-seq Libraries

Genomic DNA

Size selection

Sonication

Illumina TruSeq technology

End repair

Phosphorylation

A - overhang

Page 18: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

Primer 1: complementary to R

Primer 2: equivalent to R

Ligate Y-adaptors

PCR

Page 19: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

AA

AT

TA

PCRamplification

Double StrandedY-adapter method library

Y adapterligation

3’ endadenylation

endpolishing

endPolishing

P adapterligation

3’ extension and nick repair

Double StrandedBlunt-End method library

Strand denaturationend dephosphorylation

starting DNA fragment

biotinylated single strand adapterligation

primer extension

double strandedadapter ligation

strand separationby denaturation

Single Strandedmethod library

endpolishing

PCRamplification

TA

AT

PCRamplification

Page 20: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

Nextera “tagmentation”

Tagmentation

Dual barcode approach

up to 96 indexedsamples

Tagment Enzyme fragments DNA and attaches junction adapters (blueand green) to both ends of the tagmented molecule

rapid ( 2-4 hours) and requires small quantities (50 ng)

Transposomes / Tagment Enzyme

DNA-seq Libraries

Page 21: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

RNA-seq Libraries

Page 22: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

Paired end sequencing

1rst read 2d read1rst barcode 2d barcode

Page 23: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

“Classical” Illumina mate pair library

Problems :• low coverage• few fragments, over-amplified

several kilobases

Circularisation

Fragmentation, purification, adaptor ligation

Paired end sequencing

Page 24: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

A new method : Nextera Mate Pair

Tagment Enzyme fragments DNA and attaches a biotinylated junctionadapter (green) to both ends of the tagmented molecule

circularization

Fragmentation enrichment via the biotin tag

adapters ligation at both ends

Page 25: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

More than 50 NGS applications

Page 26: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

Caractérisation des mécanismes d’interprétation de l’information génomique

Conformation du chromosome, higher orderchromatin structure

Organisation nucléosomale Méthylation de l’ADN et autres modifications Liaison des facteurs de transcription Réplication de l’ADN Transcription nucléaire, conformation des ARN,

interaction ARN-protéines ARN sous toutes ses formes, petits, grands,

épissage alternatif, sens-antisens, codant-non codant, compartimentation cellulaire, transport, traduction, modification, dégradation

Une multiplicité d’approches pour analyser presque tous les niveaux d’organisation et d’expression du génome

Page 27: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

Impact of costs decreases

Collecte des échantillons et design de l’expérience

SéquençageGestion DonnéesRéduction Données Analyses des données

100%

0%

Pre-NGS (2000) 2010 2020

Enjeu majeur

Page 28: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

Quelques enjeux de l’analyse des données NGS

Va concerner des pans entiers de la biologie qui en seront transformés

Va concerner un très grand nombre de biologistes: problème de la formation et de l’interdisciplinarité

Il va falloir traiter des volumes de données dont l’expansion actuelle est énorme

Il va falloir intégrer des données hétérogènes

Page 29: NGS: the basics · Next generation sequencing Massively parallel sequencing ... Immobilized PCR on solid support. Flow cell or beads (emPCR) 2. Single molecule sequencing. ... Semiconductor

Quelques considérations clefs

Diversification et complexification des analyses bioinfo accompagnent la diversification des applications du NGS

L’analyse initiale (préliminaire) des données est plus homogène, et est maintenant bien intégrée dans des environnements conviviaux (Prêt à porter)

L’analyse plus poussée des données demandera pendant encore longtemps du « sur mesure ».

Plus vous maitriserez la compréhension des outils d’analyse, plus vous pourrez monter vos plans d’expérience de façon adaptée, et plus vous pourrez interagir de façon productive avec les bioinformaticiens pour avoir un « sur mesure » qui vous sied bien.