improving ngs library prep with an open system liquid …...improving ngs library prep with an open...

Improving NGS Library Prep with an Open

System Liquid Handler

Matthew J. Nesbitt1 and Masoud M. Toloue2 1Aurora Biomed Inc, 1001 E. Pender St., Vancouver, BC V6A1W2 2Bioo Scientific, 3913 Todd Lane Suite 312 Austin, TX 78744 Correspondence should be addressed to M.T. ([email protected]) or M.N. ([email protected])

Growing next generation sequencing (NGS) throughput and new, sophisticated multiplexing

strategies have significantly increased the number of samples users can prepare. The de-

mand for reproducible and reliable data requires consistency between libraries. To achieve

this, the VERSA Mini NGLP, liquid handling workstation has been optimized for NGS library

preparation. Using NEXTflex™ library preparation technology in conjunction with the VERSA

Mini NGLP, high throughput functionality matched with superior enzyme performance out-

paces and improves consistency when compared to manual library preparation.

INTRODUCTION The plummeting cost of sequence acquisition that has

resulted from advances in next generation sequenc-

ing (NGS) technologies has led to its use in new set-

tings such as small research groups, contract research

organizations and the clinic1. The availability of new

benchtop NGS platforms such as the MiSeq from Illu-

mina and the Ion Torrent PGM from Life Technologies

is encouraging NGS adoption by these facilities. The

offer of low per base costs and short run times with

relatively small up-front instrument investments2 aids

the justification these groups use to introduce NGS to

their labs, and drives the eventual adoption of higher

capacity sequencers. Groups that traditionally out-

sourced sequencing to large science facilities have

been empowered, and can bring their projects in-

house. Widespread adoption of NGS is quickly ap-

proaching in spite of an underdeveloped field of liq-

uid handling workstations validated to complete the

associated library preparation protocols. Because of

the high number of sample manipulations needed for

this task, consistently reliable results depend on liq-

uid handling automation3.

VERSA Mini NGLP™

The delicate sample handling offered by the VERSA

Mini NGLP makes this automated liquid handling

workstation ideal for NGS library preparation proto-

cols. A single channel, dual syringe pipette manifold

that is paired with high resolution stepper motors

provides accurate liquid handling at variable speeds

over a large volume range. Reaction setup, incuba-

tions and sample cleanups are executed in a UV lamp,

HEPA filtered hood. The dispense-only ReagentDrop

clusters provide a quick and contamination-free

method of precipitating sample nucleic acids.

Flexible software that provides the user a high degree

of control over the mechanics of liquid handling is

provided. Varying aspiration and dispense speeds can

be set for individual operations to minimize sample

perturbation during magnetic bead-based cleanups.

New protocols can easily be automated, and users

can easily update the number of samples to be proc-

essed. The VERSA Mini NGLP provides an appropriate

automation solution for groups looking to leverage

the advantages of 3rd party reagent kits while intro-

ducing consistency between library preparations.

Protocol

mailto:[email protected]

mailto:[email protected]

Protocol

NEXTflex™ LIBRARY PREP KITS

Designed to prepare single, paired-end and multi-

plexed genomic, ChIP, mRNA, small RNA and reduced

representative bisulfite sequencing libraries, NEXTflex

(Bioo Scientific) improves on current library construc-

tion by using enhanced enzymatic master mixes and

flexible multiplexing options. NEXTflex enhanced

adapter ligation technology is one example. Using

specially designed ligases and an optimized reaction

buffer, the resulting higher number of ligation events

(adapters ligating to sample) with NEXTflex ligation

mix directly correlates with a higher degree of se-

quence diversity, better read numbers, coverage and

assembly.

NEXTflex kits are designed to be compatible with high

-throughput liquid handling automation by using

master mixed enzymes and buffers and gel-free size

selection. With growing sequencing throughput, us-

ers are increasingly able to multiplex or pool several

samples into a single sequencing reaction. To accom-

modate this capacity, NEXTflex DNA Barcodes (Bioo

Scientific), can be used to multiplex or simply tag

samples to monitor cross contamination with up to

96 unique indices. This significantly reduces sequenc-

ing associated costs, allows measurement of base

error rate, cross genomic studies, time course and

drug induced cellular experiments. The NEXTflex DNA

barcodes are available in 96 well plates enabling high

throughput automation.

NEXTflex and VERSA

Low cost automation of NGS library preparation with

the use of optimized library preparation reagents

offers flexibility for groups that leverage multiple NGS

technologies with varying workflows. Laboratories

with multiple NGS technologies can automate differ-

ent library preparation protocols with a single re-

agent provider on the same automated platform.

Currently, most validated automation options for

library preparation are closed systems that do not

accommodate a broad range of reagent kits.

The University of Arizona Genetics Core validated the

use of the VERSA Mini NGLP (Aurora Biomed Inc.)

with NEXTflex (Bioo Scientific) for NGS library prepa-

ration. As an open-platform workstation, the user is

no longer tied down to one library preparation kit.

This can drastically drops the cost per sample. In ad-

dition, the accompanying VERSAware user software

enables control over aspiration/dispensing speeds

and tip positioning that can be tailored to delicately

handle several types of genomic samples.

METHODS

DNA isolated from the Pocket Mouse (Chaetodipus

intermedius) was used as a template for library

preparation. Fragmentation was completed using a

Covaris S series instrument (Woburn, MA, USA) and

split into two equivalent volumes. One was processed

using the standard Illumina manual library prepara-

tion (manual library) used by the University of Ari-

zona Genetics Core. The other was processed in an

automated fashion (automated library) with the

VERSA Mini NGLP Workstation from Aurora Biomed

(Vancouver, BC). Enzymatic modifications of this sam-

ple and necessary purification steps were completed

using the NEXTflex DNA Sequencing Kit (cat# 5140-

01) from Bioo Scientific (Austin, TX).

Reaction setup for end repair, dA tailing, and bar-

coded adapter ligation were handled by the work-

station as were incubations and magnetic bead clean-

ups. Purified, sequenceable libraries (one automated,

one manual) were size selected via agarose gel elec-

trophoresis and excision of a 400 - 500 bp target

range. Amplification was then completed.

Both libraries were assessed to determine quality and

robustness. A Bioanalyzer trace (Agilent Technolo-

gies, Santa Clara, CA) detailed the fragment size dis-

tribution. DNA concentrations were determined with

the use of the PicoGreen reagent (Life Technologies,

Carlsbad, CA). A qPCR quantitation method using a

KAPA Biosystems (Woburn, MA) Library Quant Kit

determined the levels of adapter-bound fragments.

After cluster generation and sequencing of the librar-

ies using one lane on a HiSeq 2000 run (Illumina, San

Diego, CA), the data was characterized.

To ensure the library fragments were appropriately

ligated to Illumina adapter sequences a qPCR library

quantification was completed. The Ct values (Figure

2) and concentration of adapter-bound fragments

(Table 1) for the automated and manual libraries

were similar and of sufficient quality for sequencing.

Both the automated and manually created libraries

amplified equally, passing a common QC metric used

prior to sequencing. To confirm the libraries were a)

representative of the original template and b) of nec-

essary integrity to provide usable raw data, they were

submitted to DNA cluster generation and sequencing on a single lane of a HiSeq 2000 instrument. Sequence Data QC Resulting raw sequence data for 100 bp paired-end

reads were sorted by barcode, converted to FastQ files

and processed with Trimmomatic (Usadel lab, Max

Planck Institute, Potsdam, GER). Adapter sequences

were removed, leading and trailing bases were

scanned, and a sliding window was used to trim reads

at points over which average Q scores dropped below

15.

Figure 1 | Fragment Size Comparison Post Size Selection. Bioanalyzer traces for the automated (a) and manual (b) libraries. A dark electropherogram band in (a) demonstrates the automated process efficiently recovered DNA within the range targeted by the size selection.

Protocol

RESULTS Library QC Bioanalyzer traces for the manual and automated

libraries indicated both had acceptable fragment size

distribution profiles. Both libraries had similar traces;

the manual library had an average fragment of size of

423 bp compared to 417 bp for the automated library

(Figure 1). A PicoGreen experiment indicated total

DNA concentrations of 157.16 nM and 180.66 nM for

the automated and manual libraries respectively

(Table 1).

Table 1 | Determination of Overall dsDNA Concentra-tion (PicoQuant) and Adapter-bound Fragments.

Library PicoQuant (nM)

qPCR (nM)

Automated 157.16 169.2

Manual 180.66 172.5

Differences between the sequence data sets gener-

ated with the two libraries did exist. The automated

set contained approximately 75% more paired-end

reads than the manual set (Table 2). This difference

can be attributed to either 1) highly efficient DNA

cluster generation with the automated library or 2)

enhanced adapter ligation using NEXTflex™ library

preparation technology.

Both libraries appeared to have levels of sequence

duplication that may have been introduced during the

PCR amplification process. FastQC analysis shows that

the automated NEXTflex preparation had a significantly

lower levels of duplication compared to the manual

preparation.

CONCLUSIONS:

In addition to reducing hands on time and the overall

burden of manually preparing hundreds of libraries,

Figure 2 | qPCR Quantification of Automated (a) and Manual (b) Libraries. Log (rRn) vs. cycle count for qPCR analysis of adapter-bound fragments in the automated (a) and manual (b) libraries. A common threshold of 32.7 was used for both, with Ct values of 5.3 and 5.0 in (a) and (b), respectively.

Protocol

An understanding of the sequence characteristics was

necessary to prove the automated method was vi-

able. A FastQC analysis (Andrews lab, Barbraham

Institute, Cambridge, UK) of the R1 read from the

paired-end sequences revealed similarities between

the two libraries for sequence quality and content.

The per base sequence quality and overall read qual-

ity scores (Figure 3) illustrated that both sample

preparations resulted in libraries of high integrity. In

addition, both data sets indicated overall GC content

to be 39% (data not shown). R2 sequence metrics

mirrored those found in R1 (data not shown).

Table 2| Paired-end Reads for Automated and Man-ual Datasets.

Library Reads (M)

Automated 114.1

Manual 65.4

Protocol

automated VERSA liquid handling together with

NEXTflex library preparation has several demonstra-

ble advantages:

High DNA recovery during bead based clean-up

75% greater paired-end reads

Fewer duplicate reads

High read quality

VERSA is a unique liquid handling platform in that it is

an open system, allowing the user to choose their

library preparation technology. With new and im-

proved library technologies, users of closed liquid

handling platforms are at a disadvantage. In addition

to losing flexibility, “closed” system users are de-

pendent on a single manufacturer which can lead to

supply and pricing speed bumps. By using the VERSA

system, users can take advantage of NEXTflex, one of

the most robust DNA library technologies on the mar-

ket. In addition to improved read numbers, coverage

and assembly, NEXTflex libraries have the largest array

of indexed adapter barcodes available.

ACKNOWLEDGEMENTS Sequencing was performed independently by the Uni-

versity of Arizona Genetics Core under the supervision

of Drs. Ryan Sprissler, Matthew Kaplan and Joe Still. Dr.

Michael Nachman provided the samples. Drs. Sikander

Gill and Rajwant Gill provided technical support from

Aurora Biomed’s offices.

REFERENCES 1. Biesecker LG, Mullikin JC, Facio FM, Turner C, Cherukuri PF, Blakesley RW, Bouffard GG, Chines PS, Cruz P, Hansen NF, et al. 2009. The ClinSeq Project: Piloting large-scale genome sequencing for research in genomic medicine. Genome Re- search 19: 1665– 1674. 2. Glenn TC, 2011. Field guide to next generation DNA sequencers. Molecular Ecology Research 11: 759-769. 3. Farais-Hesson E, Erikson J, Atkins A, Shen P, Davis RW, Scharfe C, and Pourmand N, 2010. Semi-automated library preparation for high-throughput DNA sequencing platforms. Journal of Biomedicine and Biotechnology 2010:617469.

-

Figure 3| Sequencing Metrics. Per base and per sequence quality scores for R1 ends of the automated (a, b re-spectively) and manual (c, d respectively) libraries. All median per base scores exceeded Q28, and the average read quality peaked at Q38 for both libraries.

improving ngs library prep with an open system liquid …...improving ngs library prep with an open...

Documents