Ultra-accurate microbial amplicon sequencing with synthetic long reads

Abstract

Background: Of the many known pathogenic bacterial species, only a fraction are readily identifiable directly from a complex microbial community using standard next-generation DNA sequencing. Long-read sequencing offers the potential to identify a wider range of species and to differentiate between strains within a species, but attaining sufficient accuracy in complex metagenomes remains a challenge.

Methods: Here, we describe and analytically validate LoopSeq, a commercially available synthetic long-read (SLR) sequencing technology that generates highly accurate long reads from standard short reads.

Results: LoopSeq reads are sufficiently long and accurate to identify microbial genes and species directly from complex samples. LoopSeq perfectly recovered the full diversity of 16S rRNA genes from known strains in a synthetic microbial community. Full-length LoopSeq reads had a per-base error rate of 0.005%, an accuracy that exceeds what has been reported for other long-read sequencing technologies. 18S-ITS and genomic sequencing of fungal and bacterial isolates confirmed that LoopSeq maintains that accuracy for reads up to 6 kb in length. LoopSeq full-length 16S rRNA reads could accurately classify organisms down to the species level in rinsate from retail meat samples, and could differentiate strains within species identified by the CDC as potential foodborne pathogens.

Conclusions: The order-of-magnitude improvement in length and accuracy over standard Illumina amplicon sequencing achieved with LoopSeq enables accurate species-level and strain identification from complex- to low-biomass microbiome samples. The ability to generate accurate, long microbiome sequencing reads using standard short-read sequencers will accelerate the building of quality microbial sequence databases and remove a significant hurdle on the path to precision microbial genomics.
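To put the reported per-base error rate of 0.005% in more familiar terms, the short sketch below converts it to a Phred-scaled quality score and estimates how many reads of a given length would contain no errors at all. It is an illustration only, not part of the published analysis: it assumes errors are independent and uniformly distributed along the read, and it assumes a full-length 16S rRNA gene of roughly 1500 bp. The 6 kb length is taken from the abstract; the function names are hypothetical.

```python
import math


def phred_quality(error_rate: float) -> float:
    """Convert a per-base error probability to a Phred-scaled quality score."""
    return -10.0 * math.log10(error_rate)


def expected_errors(error_rate: float, read_length: int) -> float:
    """Expected number of erroneous bases in a read of the given length."""
    return error_rate * read_length


def prob_error_free(error_rate: float, read_length: int) -> float:
    """Probability that a read has zero errors, assuming independent errors."""
    return (1.0 - error_rate) ** read_length


# Per-base error rate reported in the abstract: 0.005%.
ERROR_RATE = 0.005 / 100

# Read lengths in bases. The 16S length is an assumed typical value;
# 6 kb is the upper read length mentioned in the abstract.
READ_LENGTHS = {
    "full-length 16S (assumed ~1500 bp)": 1500,
    "6 kb read": 6000,
}

if __name__ == "__main__":
    print(f"Phred quality: Q{phred_quality(ERROR_RATE):.1f}")
    for label, length in READ_LENGTHS.items():
        errors = expected_errors(ERROR_RATE, length)
        clean = prob_error_free(ERROR_RATE, length)
        print(f"{label}: {errors:.3f} expected errors, "
              f"{clean:.1%} of reads error-free")
```

Under these assumptions the reported rate corresponds to a quality score of about Q43, and most full-length 16S reads would be expected to match their template exactly. That property is what makes strain-level discrimination plausible, since strains can differ by only a few bases across the gene.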

Cite

Callahan, B. J., Grinevich, D., Thakur, S., Balamotis, M. A., & Yehezkel, T. B. (2021). Ultra-accurate microbial amplicon sequencing with synthetic long reads. Microbiome, 9(1). https://doi.org/10.1186/s40168-021-01072-3
