Clinical metagenomic sequencing
Clinical metagenomic next-generation sequencing (mNGS) is the comprehensive analysis of microbial and host genetic material (DNA or RNA) in samples from patients. It allows for identification and genomic characterization of bacteria, fungi, parasites, and viruses without the need for a prior knowledge of a specific pathogen directly from clinical specimens.[1] The capacity to detect all the potential pathogens in a sample makes metagenomic next generation sequencing a potent tool in the diagnosis of infectious disease especially when other more directed assays such as PCR fail.[2] Current limitations include clinical utility, laboratory validity, sense and sensitivity, cost and regulatory considerations.
Laboratory workflow
A typical mNGS workflow consists of the following steps:
- Sample acquisition: the most commonly used samples for metagenomic sequencing are blood, stool, cerebrospinal fluid (CSF), urine, or nasopharyngeal swabs. Among these, blood and CSF are the cleanest, having less background noise, while the others are expected to have a great amount of commensals and/or opportunistic infections and thus have more background noise.[3] Samples should be collected with much caution as surgical specimens could be contaminated during handling of the biopsy; for example, lumbar punctures to obtain CSF specimens may be contaminated during the procedure.[2]
- RNA/DNA extraction: the DNA and the RNA of the sample is extracted by using an extraction kit. If there is a strong previous suspicion of the pathogen genome composition and since the amount of pathogen nucleic acid in more noise samples is overwhelmed by the RNA/DNA of other organisms, selecting an extraction kit of only RNA or DNA would be a more specific and convenient approach. Some commerciable available kits are for example RNeasy PowerSoil Total RNA kit (Qiagen), RNeasy Minikit (Qiagen), MagMAX Viral Isolation kit (ABI), Viral RNA Minikit (Qiagen).[3]
- Optimization strategies for library preparation: because of high levels of background noise in metagenomic sequencing, several target enrichment procedures have been developed that aim to increase the probability of capturing pathogen-derived transcripts and/or genomes. Generally there are two main approaches that can be used to increase the amount of pathogen signal in a sample: negative selection and positive enrichment.[3][1]
- Negative selection (background depletion or subtraction) targets and eliminates the host and microbiome genomic background, while aiming to preserve the nucleic acid derived from the pathogens of interest. Degradation of genomic background can be performed through broad-spectrum digestion with nucleases, such as DNase I for DNA background, or by removing abundant RNA species (rRNA, mtRNA, globin mRNA) using sequence-specific RNA depletion kits. Also CRISPR-Cas9-based approaches can be performed to target and deplete human mitochrondrial RNA for example. Generally, however, subtraction approaches lead to a certain degree of loss of the targeted pathogen genome, as poor recovery may occur during the cleanup.[3]
- Positive enrichment is used to increase pathogen signal rather than reducing background noise. This is commonly done through hybridization-based target capture by probes, which are used to pull out nucleic acid of interest for downstream amplification and sequencing. Panviral probes have been shown to successfully identify diverse types of pathogens in different clinical fluid and respiratory samples, and have been used for sequencing and characterization of novel viruses. However, the probe approach includes extra hybridization and cleanup steps, requiring higher sample input, increasing the risk of losing the target, and increasing the cost and hands-on time.[3]
- High-throughput sequencing: all the nucleic acids fragments of the library are sequenced. The sequencing platform to be used is chosen depending on different factors such as laboratory's research objectives, personal experience and skill levels. So far, the Illumina MiSeq system has proven to be the most commonly used platform for infectious disease research, pathogen surveillance, and pathogen discovery in research and public health. The instrument is compact enough to fit on a laboratory bench, has a fast runtime as compared to other similar platforms, and has a strong user support community. However, with further improvements of this technology and with additional error reduction and software stabilization, the MinION may be an excellent addition to the arsenal of current sequencing technologies for routine surveillance, especially in smaller laboratories with limited resources. For instance, the MinION was successfully used in the ZiBRA project for real-time Zika virus surveillance of mosquitoes and humans in Brazil, and in Guinea to perform real-time surveillance during the ongoing Ebola outbreak.[4][5] In general, for limited resources IlluminaMiSeq, iSeq, Ion Torrent PGM, Oxford Nanopore, MinION are used. While for substantial resources Illumina NextSeq, NovaSeq, PacBio Sequel, Oxford Nanopore and PromethION are preferred. Moreover, for pathogen sequencing the use of controls is of fundamental importance ensuring mNGS assay quality and stability over time; PhiX is used as sequencing control, then the other controls include the positive control, an additional internal control (e.g., spiked DNA or other known pathogen) and a negative control (usually water sample).[3]
- Bioinformatic analysis: Whereas the sequencing itself has been made widely accessible and more user friendly, the data analysis and interpretation that follows still requires specialized bioinformatics expertise and appropriate computational resources. The raw data from a sequencing platform is usually cleaned, trimmed, and filtered to remove low-quality and duplicate reads. Removal of the host genome/transcriptome reads is performed to decrease background noise (e.g., host and environmental reads) and increase the frequency of pathogen reads. This step will also decrease downstream analysis time. Further background noise removal is achieved by mapping of sample reads to the reads from the negative control to ensure elimination of any contaminating reads, such as those associated with the reagents or sampling storage medium. The remaining reads are usually assembled de novo to produce long stretches of sequences called contigs. Taxonomic identification of the resulting contigs is performed by matching them to the genomes and sequences in nucleotide or protein databases; for this, various versions of BLAST are most commonly used. Advanced characterization of bacterial organisms can be also performed, allowing to obtain the necessary depth and breadth of coverage for genetic characterization results. Gene calling can be performed in a variety of ways, including RAST or using NCBI services at the time of full genome submission. Results of multiple annotation tools can be compared for accuracy and completeness and, if necessary, merged using BEACON. For characterization of antibiotic resistance genes, the Resistance Gene Identifier from the Comprehensive Antibiotic Resistance Database (CARD) is commonly used. To characterize virulence factor genes, ShortBRED offers analyses with a customized database from the Virulence Factor Database.[3]
Applications
Metagenomic approaches haven been used to identify infections in ancient remains, discover novel viral pathogens and characterize the human virome in both healthy and diseased states and for forensic applications.[1]
Specifically, applications of clinical metagenomics to date have included infectious disease diagnostics for variety of syndromes and sample types, microbiome analyses in both diseased and healthy states, characterization of the human host response to infection by transcriptomics and the identification of tumour-associated viruses and their genomic integration sittes.[1]
Aside from infectious disease diagnostics, adoption of mNGS (metagenomic Next Generation Sequencing) in clinical laboratories has been slow, and most applications have yet to be incorporated into routine clinical practice. Nonetheless, the breadth and potential clinical utility of these applications are likely to transform the field of diagnostic microbiology in the near future.[1]
Infectious diseases diagnosis
The capacity to detect all potential pathogens (bacteria, viruses, fungi and parasites) in a sample and simultaneously interrogate the host responses has great potential utility in the diagnosis of infectious disease. One way to detect these pathogens is detect part of their genome by metagenomics sequencing (Next Generation Sequencing-mNGS), which can be targeted or untargeted.[1]
Targeted
The targeted analysis is done by enriching individual genes or genomic regions. This approach increases significantly the number of pathogen reads in the sequence data. Because of that, the sensitivity to detect microorganisms that are being targeted usually increases, but this comes with a limitation of the breadth of potential pathogens that can be identified.[1]
Untargeted
The untargeted analysis is a metagenomic "shotgun" approach. The whole DNA and/or RNA is sequenced with this approach using universal primers. The resultant mNGS reads can be assembled into partial or complete genomes. These genome sequences allow to monitor hospital outbreaks to facilitate infection control and public health surveillance. Also, they can be used for subtyping (identificate a specific genetic variant of a microorganism).
Untargeted mNGS is the most promising approach to analyse clinical samples and provide a comprehensive diagnosis of infections. Various groups have validated mNGS in Clinical Laboratory Improvement Amendments (CLIA), such as meningitis or encephalitis, sepsis and pneumonia.[1]
The traditional method consists on formulating a differential diagnosis on the basis of the patient's history, a clinical presentation, imaging findings and laboratory testing. But here it is suggested a different way of diagnosis; metagenomic next-generation sequencing (NGS) is a promising method because a comprehensive spectrum of a potential causes (viral, bacterial; fungus and parasitic) can be identify by a single assay.[1]
Below are some examples of the metagenomic sequencing application in infectious diseases diagnosis.
Diagnosis of meningitis and encephalitis
The traditional method that is used to the diagnosis of infectious diseases has been challenged in some cases: neuroinflamatory diseases, lack of diagnostic tests for rare pathogens and the limited availability and volume of the Central Nervous System (CNS) samples, because of the requirement for invasive procedures. Owing to these problems, some assays suggest a different way of diagnosis, which is the metagenomic next-generation sequencing (NGS); this is a promising method for diagnosis because a comprehensive spectrum of potential causes (bacterial, viral, fungus and parasitic) can be identify by a single study. Summarising, NGS can identify a broad range of pathogens in a single test.[6]
In some articles they evaluate the clinical usefulness of metagenomic NGS for diagnosis neurologic infections, in parallel with conventional microbiologic testing. It has been seen that the highest diagnostic yield resulted from a combination of metagenomics NGS of CSF and conventional testing, including serologic testing and testing of sample types other than CSF.[6]
Moreover, some findings from different studies have shown that neurologic infections remain undiagnosed in a proportion of patients despite conventional testing and they demonstrate the potential usefulness of clinical metagenomic NGS testing in these patients.[6]
The results of metagenomic NGS can also be valuable even when concordant with results of conventional testing, not only providing reassurance that the conventionally obtained diagnosis is correct but also potentially detecting or ruling out coinfections, specially in immunocompromised patients.[6]
Metagenomic is fundamentally a direct-detection method and relies on the presence of nucleic acid form the causative pathogen in the CSF samples.[6]
Study of antimicrobial resistance
Antimicrobial resistance is a health problem needed to resolve.[7]
Nowadays to detect resistances of different microbes is used a technique called Antibiotic Sensitivity (AST), but several studies have discovered that bacterial resistance is in the genoma and it is transferred by horizontal way (HGT), so sequencing methods are being developed to ease the identification and characterization of those genomes and metagenomes. For the moment exist the following methods to detect antimicrobial resistances:[7]
- AST: an advantage about this method is that it gives information for patients treatment. There are also some disadvantages, one of them is that this technique is only useful in cultivable bacteria or that is needed competent personal.[7]
- Sequencing methods: some advantages of those methods over the AST technique are that it is rapid and sensible, it is useful on both bacteria that grow on artificial media and those that do not, and it permits compare studies in several organisms. One type of sequencing method can be used in preference to another depending on the type of the sample, for a genomic sample assembly-based methods is used; for a metagenomic sample it is preferable to use read-based methods.[7]
It is noted that metagenomic sequencing methods have provided better results than genomics ones, due to these present less number of negative falses. Within metagenomics sequencing, functional metagenomic is a powerful approach for characterizing resistomes; a metagenomic library is generated by cloning the total community DNA extracted from a sample into an expression vector, this library is assayed for antimicrobial resistance by plating on selective media that are lethal to the wild-type host. The selected inserts from the surviving recombinant, antimicrobial-resistant host cells are then sequenced, and resulting sequences are subsequently assembled and annotated (PARFuMS).[7]
Functional metagenomics has enabled the discovery of several new antimicrobial resistance mechanisms and their related genes, one such example is the recently discovered tetracycline resistance mechanism by tetracycline destructases.[7]
In conclusion is important to incorporate not only the antimicrobial resistance gene sequence and mechanism but also the genomic context, host bacterial species and geographic location (metagenome).[7]
Clinical microbiome analyses
Recently there has been a shift from the use of targeted sequencing of 16S rRNA gene to the use of mNGS to characterize the microbiome of an individual. This goes hand in hand with the raise of awareness on the important role of the microbiome in different diseases states. Even though, no microbiome-based tests have been approved for the diagnosis or treatment of a disease. This is mainly due to the non complete understanding of the complexity of the microbiome and how it is related to certain diseases.[1]
The use of mNGS to characterize the microbiome has made possible the development of bacterial probiotics to be administrated as pills, for example, as a treatment of Clostridium difficile-associated diseases.[1]
The analysis of bacterial diversity is another application of the microbiome. It can give information on whether a patient's illness is infectious or not. Studies of mNGS show that patients with culture-proven infection (in patients with pneumonia, for example) have less diversity in their respiratory microbiome. This is due to dysbiosis, an abnormal alteration of the microbiome, which can also be related to obesity, diabetes mellitus or inflammatory bowel disease. The identification of dysbiosis using mNGS can be a pathway to treating these pathological conditions by then manipulating the microbiome.[1]
Human host response analyses
Knowing host responses has important prospective utility in the diagnosis of infectious diseases. For this reason, one of the most important possible applications of clinical metagenomics is characterization of the human host response to infection by transcriptomics and the identification of tumor associated viruses and their genomic integration sites.[1]
The studying of genes expression allows us to characterize a lot of infections, for example infections due to Staphylococcus aureus, Lyme disease, candidiasis, tuberculosis and influenza. Also, this approach can be used for cancer classification.
mNGS of RNA libraries used for the detection of pathogens such as RNA viruses in clinical samples incidentally produces host gene expression data for transcriptome (RNAseq) analyses. There are no-RNAseq based assays that are clinically validated to date for use in patients, but RNAseq analysis give better results. The reason is that RNAseq assays can detect active microbial gene expression and might enable the discrimination between infection versus colonization and live versus dead organism.[1]
RNAseq analysis have a lot of other purposes and applications such as to identify novel or under appreciated host–microbial interactions directly from clinical samples, to make indirect diagnosis on the basis of a pathogen specific human host response and to discriminate infectious versus noninfectious causes of acute illness.[1]
Applications in oncology
In oncology, whole-genome or directed NGS approaches to identify mutated genes can be used to simultaneously detect viruses associated with cancer (for example, herpesviruses, papillomaviruses and polyomaviruses) and/or to collect information about the interaction between the virus and its host.[1]
Until now, the US Food and Drug Administration (FDA) has approved the clinical use of two NGS panels testing for actionable genomic aberrations in tumor samples.[1] The addition of specific viral probes allows to detect reads corresponding to both integrated and exogenous viruses. The data gathered about integrated or active viral infections in cancers are important for preventive or therapeutic treatments with target antiviral and chemotherapeutic drugs.[1]
In future, mNGS of cell-free DNA from liquid biopsy sample, such as plasma, might be useful for the identification of early cancer and diagnosis of infection in immunocompromised patients.[1]
Challenges
Despite the potential and recent successes of metagenomics, clinical diagnostic applications have lagged behind research advances owing to a number of factors. A complex interplay of microbial and host factors influences human health, as exemplified by the role of the microbiome in modulating host immune responses, and it is often unclear whether a detected microorganism is contaminant, colonizer or pathogen. Additionally, universal reference standards and proven approaches to demonstrate test validation, reproducibility and quality assurance for clinical metagenomic assays are lacking. Considerations of cost, reimbursement, turnaround time, regulatory considerations and, perhaps most importantly, clinical utility also remain major hurdles for the routine implementation of clinical mNGS in patient care settings.[1]
Clinical utility
Molecular diagnostic assays provide a fairly cost-effective and rapid (about <2hours of turnaround time) means to diagnose the most common infections. However, nearly all conventional microbiological tests in current use detect only one or a limited panel of pathogens at a time or require that a microorganism be successfully cultured from a clinical sample. By contrast, while NGS assays in current use cannot compare with conventional tests with respect to speed (the sequencing run alone on a standard Illumina instrument takes >18hours) mNGS could enable a broad range of pathogens (virus, bacteria, fungi and/or parasites) to be identified from culture or directly from clinical samples on the basis of uniquely identifiable DNA and/or RNA sequences.[1]
To date, several studies have provided a glimpse into the promise of NGS in clinical and public health settings.[1] Even though, most of the metagenomics outcomes data generated consist of case reports which belie the increasing interest on diagnostic metagenomics.[8] For example, NGS was used for the clinical diagnosis of neuroleptospirosis in a 14-year-old critically ill boy with meningoencephalitis; this case was the first to demonstrate the possible utility of metagenomic NGS (mNGS) in providing clinically actionable information, a successful diagnosis prompted appropriate targeted antibiotic treatment and eventual recovery of the patient.[1]
Thus, the argument for the clinical utility of metagenomic is eventually only based in the most difficult- to- diagnose cases or for immunocompromised patients, in whom the spectrum of potential pathogens is greater.[1] Accordingly, there is an overall lack of penetration of this approach into the clinical microbiology laboratory, as making a diagnosis with metagenomics is still basically only useful in the context of case report but not for a true daily diagnostic purpose.[8]
Recent cost-effectiveness modelling of metagenomics in the diagnosis of fever of unknown origin concluded that, even after limiting the cost of diagnostic metagenomics to $100–1000 per test, it would require 2.5-4 times the diagnostic yield of computed tomography of the abdomen/pelvis in order to be cost neutral and cautioned against ‘widespread rush’ to deploy metagenomic testing.[8] Eventually, mNGS may become cost competitive with multiplexed assays or used as an upfront ‘rule out’ assay to exclude infectious aetiologies. But, for the moment, there is a lack of actionability in clinical metagenomics.[1]
Furthermore, in the case of the discovery of potential novel infectious agents, usually only the positives results are published event though the vast majority of sequenced cases are negative, thus resulting in a very biased information. Besides, most of the discovery work based in metagenomic that precede the current diagnostic-based work even mentioned the known agents detected while screening unsolved cases for completely novel causes.[8]
Of course, detection of nucleic acids, either by NGS or multiplexed assays, does not by itself prove that an identified microorganism is the cause of the illness, and findings have to be interpreted in the clinical context. In particular, discovery of an atypical or novel infectious agent in clinical samples should be followed up with confirmatory investigations such as orthogonal testing of tissue biopsy samples and demonstration of seroconversion or via the use of cell culture or animal models, as appropriate, to ascertain its true pathogenic potential.[1]
For all of this, the field suffers from a lack of understanding of true clinical utility.
Laboratory validity
To date, most published testing has been run in an unvalidated, unreportable manner. The ‘standard microbiological testing’ that samples are subjected to prior to metagenomics is variable and has not included reverse transcription-polymerase chain reaction (RT-PCR) testing for common respiratory viruses or, routinely 16S/ITS PCR testing.[8]
Given the relative costs of validating and performing metagenomic versus 16S/ITS PCR testing, the second one is considered an easier and more efficient option. A potential exception to the 16S/ITS testing is blood, given the huge amount of 16S sequence available, making clean cutoffs for diagnostic purposes problematic.[8]
Furthermore, almost all of the organisms detected by metagenomics for which there is an associated treatment and thus would be truly actionable are also detectable by 16S/ITS testing (or 16S/ITS-NGS). This makes questionable the utility of metagenomics in many diagnostic cases.[8]
One of the main points to accomplish laboratory validity is the presence of reference standards and controls when performing mNGS assays. They are needed to ensure the quality and stability of this technique over time.[1]
Most available metagenomic reference materials are dedicated to specific applications (for example, ZymoBIOMICS Microbial Community Standard for microbiome analyses and bacterial and fungal metagenomics) and/or focused on a more limited spectrum of organisms (for example, the National Institute of Standards and Technology (NIST) reference materials for mixed microbial DNA detection, which contain only bacteria) Thus, these materials may not be applicable to untargeted mNGS analyses.
Custom mixtures consisting of a pool of microorganisms (mock microbial communities) or their nucleic acids can be developed as external controls to establish limits of detection for mNGS testing. Internal spike-in control standards are available for other Next Generation Sequencing applications such as transcriptome analysis by RNA-seq.[1]
Nonetheless, the lack of universally accepted reference standards for mNGS makes it difficult to compare assay performances between different laboratories. There is a critical need for standardized reference organisms and genomic materials to facilitate such comparisons and to define optimal analysis methods.[1]
Sense and sensitivity
In clinical microbiology labs, the quantitation of microbial burden is considered a routine function as it is associated with the severity and progression of the disease. To achieve a good quantitation a high sensitivity of the technique is needed.[8]
A key limitation of metagenomic next-generation sequencing (mNGS) is its decreased sensitivity with high background as these can be clinically relevant as the pathogen load in infections can be very low.[1] Whereas interfering substances represent a common problem to clinical chemistry or to PCR diagnostics, the degree of interference from host (for example, in tissue biopsies) or nonpathogen nucleic acids (for example, in stool) in metagenomics is a new twist.[1][8] In addition, due to the relative size of the human genome in comparison with microbial genomes the interference can occur at low levels of contaminating material.[8]
Another challenge for clinical metagenomics in regards to sensitivity is the diagnosis of coinfections where there are present high-titer pathogens that can generate biased results as they may disproportionately soak up reads and make difficult to distinguish the less predominant pathogens.[8]
In addition to issues with interfering substances, specially in the diagnosis area, accurate quantitation and sensitivities are essential as a confusion in the results can affect to a third person, the patient. For these reason, practitioners currently have to be keenly aware of the index-swapping issues associated with Illumina sequencing which can lead to trace incorrectly barcoded samples.[8]
Since metagenomics has typically been used on patients for whom every other test to date has been negative, questions surrounding analytical sensitivity haven been less germane. But, for ruling out infections causes being one of the more important roles for clinical metagenomics it is essential to be capable to perform a deep enough sequencing to achieve adequate sensitivities. One way could be developing novel library preparation techniques.[8]
Cost considerations
Although there have been substantial cost reductions in the generation of sequence data, the overall persample reagent cost for sequencing remains fairly high. In fact the Illumina monopoly on high-quality next-generation sequencing reagents and the need for accurate and deep sequencing in metagenomics mean that the sequencing reagents alone cost more than FDA-approved syndromic testing panels. Also additional direct costs of metagenomics such as extraction, library preparation, and computational analysis have to be considered.[8]
This leads to an overall cost of several hundreds to thousands of dollars per sample analysed, which is higher than that for many other clinical tests.[1]
In general, metagenomic sequencing is most useful and cost efficient for pathogen discovery when at least one of the following criteria are met:
- the identification of the organism is not sufficient (one desires to go beyond discovery to produce data for genomic characterization),
- a coinfection is suspected,
- other simpler assays are ineffective or will take an inordinate amount of time,
- screening of environmental samples for previously undescribed or divergent pathogens.[3]
Regulatory considerations
Every clinical laboratory should be highly regulated and general laboratory and testing requirements applied to all molecular diagnostic assays reported for patient care. An ongoing monitoring is essential especially for mNGS assays in order to verify acceptable performance over time and to investigate atypical findings. Examples of important quality steps are: the initial sample quality checks, the library parameters (concentration and size distribution), the sequence data generation (cluster density and Q- score), the recovery of internal controls and performance of external controls. Validation data coming from the assay development and implementation should be recorded and made available to laboratory inspectors or submitted to regulatory agencies, such as the FDA in the USA or the European Medicines Agency (EMA) in Europe, for approval.
During mNGS assays the monitoring is accomplished using sample internal controls, intrarun control samples, swipe tests for contamination and periodic proficiency testing. Further studies on unexpected or unusual results are always needed and the identification of microorganisms that have not been identified before in the laboratory should be independently confirmed, usually through clinical reference or public health laboratory testing. Furthermore, it is important to determine the clinical significance of these novel or atypical organisms and these findings should be reported and discussed with health care providers, with consideration for their potential pathogenicity and for further testing and treatment options.[1]
Future perspectives
Technological advances in library preparation methods, sequence generation and computational bioinformatics are carrying to quicker and more comprehensive metagenomic analyses at lower cost. While current limitations, like the decrease of the sensitivity for pathogen detection in clinical samples with a high nucleic acid background or with exceedingly low pathogen titres, suggest that mNGS is unlikely to replace conventional diagnostics in the short term, perhaps it could be a complementary or an essential test in certain clinical situations.[1]
Although the use of mNGS for informing clinical care has been demonstrated in multiple small case series, nearly all studies have been retrospective, and clinical utility has yet to be established in a large scale prospective clinical trial. So prospective clinical studies will be critical to understand when to perform mNGS and how the diagnostic yield compares with that of other methods. It could be possible that, over the next 5 years, prospective clinical trial data evaluating the clinical utility and cost effectiveness of mNGS will become available and that overall costs and turnaround time for mNGS will continue to drop.[1]
Furthermore, in a world with constantly emerging pathogens, it's probable that mNGS based testing will have an essential role in monitoring and tracking new disease outbreaks. As surveillance networks and rapid diagnostic platforms such as nanopore sequencing are deployed globally, it will be possible to detect and contain infectious outbreaks at a much earlier stage, saving lives and lowering costs.[1]
References
- Chiu, Charles Y.; Miller, Steven A. (June 2019). "Clinical metagenomics". Nature Reviews Genetics. 20 (6): 341–355. doi:10.1038/s41576-019-0113-7. ISSN 1471-0064. PMC 6858796. PMID 30918369.
- Simner, Patricia J; Miller, Steven; Carroll, Karen C (2017-10-12). "Understanding the Promises and Hurdles of Metagenomic Next-Generation Sequencing as a Diagnostic Tool for Infectious Diseases". Clinical Infectious Diseases. 66 (5): 778–788. doi:10.1093/cid/cix881. ISSN 1058-4838. PMID 29040428.
- Maljkovic Berry, Irina; Melendrez, Melanie C; Bishop-Lilly, Kimberly A; Rutvisuttinunt, Wiriya; Pollett, Simon; Talundzic, Eldin; Morton, Lindsay; Jarman, Richard G (2019-10-14). "Next Generation Sequencing and Bioinformatics Methodologies for Infectious Disease Research and Public Health: Approaches, Applications, and Considerations for Development of Laboratory Capacity". The Journal of Infectious Diseases: jiz286. doi:10.1093/infdis/jiz286. ISSN 0022-1899. PMID 31612214.
- Faria, Nuno Sabino, Ester Nunes, Marcio Alcantara, Luiz Loman, Nicholas Pybus, Oliver (2016-09-29). "Mobile real-time surveillance of Zika virus in Brazil". Genome Medicine. BioMed Central Ltd. 8 (1): 97. doi:10.1186/s13073-016-0356-2. OCLC 963985741. PMC 5041528. PMID 27683027.CS1 maint: multiple names: authors list (link)
- Quick, Joshua; Loman, Nicholas J.; Duraffour, Sophie; Simpson, Jared T.; Severi, Ettore; Cowley, Lauren; Bore, Joseph Akoi; Koundouno, Raymond; Dudas, Gytis; Mikhail, Amy; Ouédraogo, Nobila (February 2016). "Real-time, portable genome sequencing for Ebola surveillance". Nature. 530 (7589): 228–232. Bibcode:2016Natur.530..228Q. doi:10.1038/nature16996. ISSN 0028-0836. PMC 4817224. PMID 26840485.
- Wilson, Michael R.; Sample, Hannah A.; Zorn, Kelsey C.; Arevalo, Shaun; Yu, Guixia; Neuhaus, John; Federman, Scot; Stryke, Doug; Briggs, Benjamin; Langelier, Charles; Berger, Amy (2019-06-13). "Clinical Metagenomic Sequencing for Diagnosis of Meningitis and Encephalitis". New England Journal of Medicine. 380 (24): 2327–2340. doi:10.1056/NEJMoa1803396. ISSN 0028-4793. PMC 6764751. PMID 31189036.
- Boolchandani, Manish; D’Souza, Alaric W.; Dantas, Gautam (June 2019). "Sequencing-based methods and resources to study antimicrobial resistance". Nature Reviews Genetics. 20 (6): 356–370. doi:10.1038/s41576-019-0108-4. ISSN 1471-0064. PMC 6525649. PMID 30886350.
- Greninger, Alexander L. (2018-07-03). "The challenge of diagnostic metagenomics". Expert Review of Molecular Diagnostics. 18 (7): 605–615. doi:10.1080/14737159.2018.1487292. ISSN 1473-7159. PMID 29898605.