Robust and High-throughput PCR-Free whole genome sequencing pipeline for Illumina HiSeq X and NovaSeq platforms
Abstract
Large scale whole genome sequencing (WGS) of population based cohorts and for clinical applications is now possible using Illumina’s HiSeq X and NovaSeq systems. Baylor College of Medicine’s Human Genome Sequencing Center... [ view full abstract ]
Large scale whole genome sequencing (WGS) of population based cohorts and for clinical applications is now possible using Illumina’s HiSeq X and NovaSeq systems. Baylor College of Medicine’s Human Genome Sequencing Center (HGSC) is a large-scale sequencing facility which supports NHGRI and NHLBI whole genome sequencing (WGS) projects in the Center for Common Disease Genomics (CCDG) program, Trans-Omics for Precision Medicine (TOPMed) program, Alzheimer’s disease sequencing project (ADSP).
Currently, >2000 PCR-Free libraries/month are constructed for 30X WGS on Illumina’s HiSeq X system and to support the recently installed NovaSeq sequencer. Protocols have been established to generate high quality libraries from a spectrum of DNA quality and quantities. Robust library preparation and quality control (QC) measures have also been implemented to support this pipeline. Libraries prepared using this protocol are sequenced on HiSeq X instruments generating on average about 124GB (unique/lane). The libraries are also currently being tested on S2 flow cell of the recently installed NovaSeq sequencer. PCR-Free library workflow used KAPA Hyper kit with modifications and libraries are prepared in batches of 96 on the Biomek FXP robots in 7 hours. The workflow has generated 26,054 PCR-Free libraries with <1% fail rate since December 2015. Novel QC steps using Fragment Analyzer and qPCR assay accurately quantifies libraries and screens for adapter dimer contamination. Newly synthesized adapter barcode oligos are screened in-house for cross-contamination using amplicons to detect contamination rates below current HPLC screening levels and guarantees that adapter contamination is <0.5% to allow for multiplexing of libraries. A qPCR assay has also been developed to detect and quantify rDNA of >110 bacterial species is used to screen DNA extracted from saliva. Fluidigm SNP Trace panel assay is used to confirm sample identity and VerifyBamID to detect sample contamination. Quality control metrics related to coverage, sample integrity and variant representation are generated at-scale with HgV, the HGSC analysis pipeline. To date 15,382 samples for the TOPMed project were sequenced using these PCR-Free libraries and generated an average sequence coverage of 40X, with 97% of the bases in the genome having 20X or greater coverage. These PCR-free protocols and workflow is planned for implementation in the HGSC clinical laboratory.
Authors
-
Zeineen Momin
(Baylor College of Medicine / Human Genome Sequencing Center)
-
Harsha Doddapaneni
(Baylor College of Medicine / Human Genome Sequencing Center)
-
Balaji Sundararaman
(Baylor College of Medicine / Human Genome Sequencing Center)
-
Robert Charles Glenn
(Baylor College of Medicine / Human Genome Sequencing Center)
-
Qingchang Meng
(Baylor College of Medicine / Human Genome Sequencing Center)
-
Caitlin Nessner
(Baylor College of Medicine / Human Genome Sequencing Center)
-
Sandra Lee
(Baylor College of Medicine / Human Genome Sequencing Center)
-
Kimberly Walker
(Baylor College of Medicine / Human Genome Sequencing Center)
-
Brent Gudenkauf
(Baylor College of Medicine / Human Genome Sequencing Center)
-
Kshitij Rajbhandari
(Baylor College of Medicine / Human Genome Sequencing Center)
-
Ginger Metcalf
(Baylor College of Medicine / Human Genome Sequencing Center)
-
Lora Lewis
(Baylor College of Medicine / Human Genome Sequencing Center)
-
Jianhong Hu
(Baylor College of Medicine / Human Genome Sequencing Center- CLIA Lab)
-
Jireh Santibanez
(Baylor College of Medicine / Human Genome Sequencing Center)
-
Geoffrey Okwuonu
(Baylor College of Medicine / Human Genome Sequencing Center)
-
Shannon Dugan-perez
(Baylor College of Medicine / Human Genome Sequencing Center)
-
Yi Han
(Baylor College of Medicine / Human Genome Sequencing Center)
-
Huyen Dinh
(Baylor College of Medicine / Human Genome Sequencing Center)
-
Eric Boerwinkle
(Baylor College of Medicine/ Medicine- Athero and Lipo)
-
Donna M. Muzny
(Baylor College of Medicine / Human Genome Sequencing Center)
-
Richard A. Gibbs
(Baylor College of Medicine / Human Genome Sequencing Center)
Topic Areas
Sequencing strategies and technology advancements using the various NGS platforms , De novo sequencing, re-sequencing, Human seq., RNA seq., metagenomics, etc. , Human, non-human, and infectious disease applications
Session
PS-1 » Poster Session A (19:00 - Tuesday, 16th May, Mezannine & New Mexico Room)
Presentation Files
The presenter has not uploaded any presentation files.