Profiling Complex Population Genomes with Highly Accurate Single Molecule Reads: Cow Rumen Microbiomes
Abstract
Determining compositions and functional capabilities of complex populations is often challenging, especially for sequencing technologies with short reads that do not uniquely identify organisms or genes. Long-read sequencing improves the resolution of these mixed communities, but adoption for this application has been limited due to concerns about throughput, cost and accuracy.
The recently introduced PacBio Sequel System generates hundreds of thousands of long and highly accurate single-molecule reads per SMRT Cell.
We investigated how the Sequel System might increase understanding of metagenomic communities. In the past, the focus was largely on taxonomic classification with 16S rRNA sequencing. The recent expansion to whole-genome shotgun (WGS) sequencing enables functional profiling as well, with the ultimate goal of complete genome assemblies.
Here we compare the complex microbiomes in 5 cow rumen samples, for which Illumina WGS sequence data was also available. To maximize the PacBio single-molecule sequence accuracy, libraries of 2 to 3 kb were generated, allowing many polymerase passes per molecule. The resulting reads were filtered at predicted single-molecule accuracy levels up to 99.99%.
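As a rough illustration of what filtering at a predicted accuracy level involves, the sketch below converts between accuracy and Phred quality (Q = -10 log10(1 - accuracy), so 99.99% corresponds to about Q40) and keeps reads whose mean per-base accuracy meets a threshold. The function names and the use of mean per-base error as the read-level prediction are assumptions made for illustration; the abstract does not describe how the predicted accuracy was computed.

```python
import math


def accuracy_to_phred(accuracy):
    """Convert an accuracy in [0, 1) to a Phred-scaled quality value."""
    return -10 * math.log10(1 - accuracy)


def predicted_accuracy(quals):
    """Estimate read accuracy as 1 minus the mean per-base error probability.

    Assumption: per-base Phred qualities are available for each read.
    """
    errors = [10 ** (-q / 10) for q in quals]
    return 1 - sum(errors) / len(errors)


def filter_reads(reads, min_accuracy):
    """Return names of reads whose predicted accuracy is at least min_accuracy.

    `reads` is an iterable of (name, per-base Phred qualities) pairs.
    """
    return [name for name, quals in reads
            if predicted_accuracy(quals) >= min_accuracy]


# Hypothetical reads, not data from the study.
example_reads = [
    ("read_a", [50] * 2500),  # about 99.999% predicted accuracy
    ("read_b", [30] * 2500),  # about 99.9% predicted accuracy
]
kept = filter_reads(example_reads, min_accuracy=0.9999)
```

Raising `min_accuracy` trades read count for per-read accuracy, which is the trade-off implied by filtering at several accuracy levels.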
Community compositions of the 5 samples were compared with Illumina WGS assemblies from the same set of samples, indicating that rare organisms were often missed with Illumina. Assembly from PacBio circular consensus sequencing (CCS) reads yielded a contig >100 kb in length with 6-fold coverage. Mapping of Illumina reads to this 101 kb contig verified the PacBio assembly and contig sequence.
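For readers unfamiliar with the term, fold coverage is the total number of read bases aligned to a contig divided by the contig length. A minimal sketch follows; the read count and read lengths are hypothetical values chosen only to reproduce a 6-fold figure, not numbers from the study.

```python
def fold_coverage(aligned_lengths, contig_length):
    """Mean fold coverage: total aligned bases divided by contig length."""
    if contig_length <= 0:
        raise ValueError("contig_length must be positive")
    return sum(aligned_lengths) / contig_length


# Hypothetical example: 202 reads of 3 kb aligned to a 101 kb contig.
coverage = fold_coverage([3000] * 202, 101_000)
```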
These results demonstrate ways in which long accurate reads benefit analysis of complex communities.
Authors
- Nick Sisneros (Pacific Biosciences)
- Cheryl Heiner (Pacific Biosciences)
- Itai Sharon (Tel-Hai College, Upper Galilee, and MIGAL Galilee Research Institute)
- Steven Oh (Pacific Biosciences)
- Alvaro Gonzalo Hernandez (University of Illinois Urbana-Champaign)
- Itzhak Mizrahi (Ben-Gurion University of the Negev)
- Richard Hall (Pacific Biosciences)
Topic Areas
De novo sequencing, re-sequencing, Human seq., RNA seq., metagenomics, etc.; Analysis for metagenomics, antimicrobial resistance, and forensics; AgriGenomics, livestock genomics, plant genomics
Session
PS-2 » Poster Session B (20:00 - Tuesday, 16th May, Mezzanine & New Mexico Room)