Bioinformatics Solutions Towards the Advancement of Pathogen Detection with Metagenomics
Abstract
Next-generation-sequencing (NGS) has great potential for use as an excellent tool for detecting and diagnosing infectious disease. As applied to metagenomics, NGS poses several challenges when geared toward general pathogen... [ view full abstract ]
Next-generation-sequencing (NGS) has great potential for use as an excellent tool for detecting and diagnosing infectious disease. As applied to metagenomics, NGS poses several challenges when geared toward general pathogen detection activities. Some of these challenges include the robust assignment of pathogens, given an incomplete database, short reads, and algorithms that focus only on easy use cases (e.g. pathogens comprise most of the sample). General metagenomics taxonomy classifiers have been employed to help identify organisms within clinical samples. However, before NGS can be used as a routine procedure in a clinical setting, several hurdles must be overcome: (1) an easy-to-use environment that technicians or other non-bioinformatics experts can use, including reports and visualizations that can be interpreted by clinicians in a meaningful fashion; (2) rapid bioinformatics tools which run effectively on commodity hardware; (3) levels of confidence for reported organisms that is not tied solely to abundance; (4) a database of known pathogens that allow meaningful reporting.
Here, we provide some examples of the issues surrounding the use of NGS as a method to robustly identify pathogens in complex samples. We also present a series of efforts designed to: (1) lower the barrier for non-experts to use NGS for routine bioinformatics applications by developing a user-friendly web-based suite of tools; (2) limit the number of organisms mis-identified within samples, thereby improving positive predictive value; (3) provide the ability to fine-tune parameters to better assess what defaults should be used given specific questions that require different cutoffs (e.g. pathogen discovery vs detection of known pathogens). We also present a first attempt at developing confidence scoring algorithms that are not tied to abundance of identified organisms. Funding for this project is provided by the Defense Threat Reduction Agency – Joint Science and Technology Office for Chemical and Biological Defense, contract number HDTRA1-15-C-0013.
Authors
-
Po-E Li
(Los Alamos National Laboratory)
-
Chien-Chi Lo
(Los Alamos National Laboratory)
-
Joseph Russell
(MRIGlobal)
-
Karen Davenport
(Los Alamos National Laboratory)
-
Migun Shakya
(Los Alamos National Laboratory)
-
Pavel Senin
(Los Alamos National Laboratory)
-
Jean Challacombe
(Los Alamos National Laboratory)
-
Yan Xu
(Los Alamos National Laboratory)
-
Shihai Feng
(Los Alamos National Laboratory)
-
Zoe Challacombe
(Los Alamos National Laboratory)
-
David Yarmosh
(MRIGlobal)
-
Tracy Erkkila
(Los Alamos National Laboratory)
-
Jonathan Jacobs
(MRIGlobal)
-
Patrick Chain
(Los Alamos National Laboratory)
Topic Areas
Bringing sequence to the clinic (i.e., diagnostics, cancer, inherited disorders) , Human, non-human, and infectious disease applications
Session
OS-8 » Pathogen Sequencing & Detection (10:30 - Thursday, 18th May, La Fonda Ballroom)
Presentation Files
The presenter has not uploaded any presentation files.