A divide-and-conquer method for scalable phylogenetic network inference from multilocus data

14Citations
Citations of this article
29Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Motivation: Reticulate evolutionary histories, such as those arising in the presence of hybridization, are best modeled as phylogenetic networks. Recently developed methods allow for statistical inference of phylogenetic networks while also accounting for other processes, such as incomplete lineage sorting. However, these methods can only handle a small number of loci from a handful of genomes. Results: In this article, we introduce a novel two-step method for scalable inference of phylogenetic networks from the sequence alignments of multiple, unlinked loci. The method infers networks on subproblems and then merges them into a network on the full set of taxa. To reduce the number of trinets to infer, we formulate a Hitting Set version of the problem of finding a small number of subsets, and implement a simple heuristic to solve it. We studied their performance, in terms of both running time and accuracy, on simulated as well as on biological datasets. The two-step method accurately infers phylogenetic networks at a scale that is infeasible with existing methods. The results are a significant and promising step towards accurate, large-scale phylogenetic network inference.

References Powered by Scopus

IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies

17338Citations
N/AReaders
Get full text

Evolutionary trees from DNA sequences: A maximum likelihood approach

12283Citations
N/AReaders
Get full text

Generating samples under a Wright-Fisher neutral model of genetic variation

1907Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Disentangling Sources of Gene Tree Discordance in Phylogenomic Data Sets: Testing Ancient Hybridizations in Amaranthaceae s.l

134Citations
N/AReaders
Get full text

The Perfect Storm: Gene Tree Estimation Error, Incomplete Lineage Sorting, and Ancient Gene Flow Explain the Most Recalcitrant Ancient Angiosperm Clade, Malpighiales

77Citations
N/AReaders
Get full text

Classes of explicit phylogenetic networks and their biological and mathematical significance

32Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Zhu, J., Liu, X., Ogilvie, H. A., & Nakhleh, L. K. (2019). A divide-and-conquer method for scalable phylogenetic network inference from multilocus data. In Bioinformatics (Vol. 35, pp. i370–i378). Oxford University Press. https://doi.org/10.1093/bioinformatics/btz359

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 15

88%

Professor / Associate Prof. 1

6%

Researcher 1

6%

Readers' Discipline

Tooltip

Agricultural and Biological Sciences 8

40%

Computer Science 5

25%

Biochemistry, Genetics and Molecular Bi... 4

20%

Engineering 3

15%

Save time finding and organizing research with Mendeley

Sign up for free