A Daily-Updated Database and Tools for Comprehensive SARS-CoV-2 Mutation-Annotated Trees

This article is free to access.

Abstract

The vast scale of SARS-CoV-2 sequencing data has made it increasingly challenging to comprehensively analyze all available data using existing tools and file formats. To address this, we present a database of SARS-CoV-2 phylogenetic trees inferred from unrestricted public sequences, which we update daily to incorporate new sequences. Our database uses the recently proposed mutation-annotated tree (MAT) format to efficiently encode the tree with branches labeled with parsimony-inferred mutations, as well as Nextstrain clade and Pango lineage labels at clade roots. As of June 9, 2021, our SARS-CoV-2 MAT consists of 834,521 sequences and provides a comprehensive view of the virus's evolutionary history using public data. We also present matUtils, a command-line utility for rapidly querying, interpreting, and manipulating the MATs. Our daily-updated SARS-CoV-2 MAT database and matUtils software are available at http://hgdownload.soe.ucsc.edu/goldenPath/wuhCor1/UShER_SARS-CoV-2/ and https://github.com/yatisht/usher, respectively.
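To make the idea of a mutation-annotated tree concrete, the following is a minimal Python sketch of a tree whose branches carry mutations and whose clade roots may carry a label. It is not the MAT file format and does not use matUtils; the `Node` class, the `sample_variants` function, the sample names, the positions, and the clade label are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass
class Node:
    """One node of a toy mutation-annotated tree (hypothetical structure)."""
    name: str
    # Mutations on the branch leading to this node, as (position, new_base).
    mutations: List[Tuple[int, str]] = field(default_factory=list)
    # Optional label stored at the root of a clade (made-up value below).
    clade: Optional[str] = None
    children: List["Node"] = field(default_factory=list)


def sample_variants(root: Node, sample: str) -> Optional[Dict[int, str]]:
    """Return {position: base} for positions mutated on the root-to-sample path.

    Branch mutations are applied in root-to-leaf order. Returns None if the
    sample name is not found in the tree.
    """
    def walk(node: Node, inherited: Dict[int, str]) -> Optional[Dict[int, str]]:
        acc = dict(inherited)            # copy so sibling subtrees stay independent
        for pos, base in node.mutations:
            acc[pos] = base
        if node.name == sample:
            return acc
        for child in node.children:
            found = walk(child, acc)
            if found is not None:
                return found
        return None

    return walk(root, {})


# Toy tree: positions and the clade label are invented for this example.
root = Node("root", children=[
    Node("n1", mutations=[(100, "T")], clade="toy_clade_A", children=[
        Node("s1", mutations=[(200, "G")]),
        Node("s2", mutations=[(300, "A")]),
    ]),
    Node("s3"),
])

print(sample_variants(root, "s1"))  # {100: 'T', 200: 'G'}
```

In this sketch the mutation shared by `s1` and `s2` is stored once, on the branch leading to `n1`, rather than once per sample. That sharing is the intuition behind the compact encoding the abstract describes, although the real format and tools differ in their details.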

McBroome, J., Thornlow, B., Hinrichs, A. S., Kramer, A., De Maio, N., Goldman, N., … Turakhia, Y. (2021). A Daily-Updated Database and Tools for Comprehensive SARS-CoV-2 Mutation-Annotated Trees. Molecular Biology and Evolution, 38(12), 5819–5824. https://doi.org/10.1093/molbev/msab264
