MUSCLE: A multiple sequence alignment method with reduced time and space complexity

7.1kCitations
Citations of this article
3.7kReaders
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Background: In a previous paper, we introduced MUSCLE, a new program for creating multiple alignments of protein sequences, giving a brief summary of the algorithm and showing MUSCLE to achieve the highest scores reported to date on four alignment accuracy benchmarks. Here we present a more complete discussion of the algorithm, describing several previously unpublished techniques that improve biological accuracy and / or computational complexity. We introduce a new option, MUSCLE-fast, designed for high-throughput applications. We also describe a new protocol for evaluating objective functions that align two profiles. Results: We compare the speed and accuracy of MUSCLE with CLUSTALW, Progressive POA and the MAFFT script FFTNS1, the fastest previously published program known to the author. Accuracy is measured using four benchmarks: BAliBASE, PREFAB, SABmark and SMART. We test three variants that offer highest accuracy (MUSCLE with default settings), highest speed (MUSCLEfast), and a carefully chosen compromise between the two (MUSCLE-prog). We find MUSCLE-fast to be the fastest algorithm on all test sets, achieving average alignment accuracy similar to CLUSTALW in times that are typically two to three orders of magnitude less. MUSCLE-fast is able to align 1,000 sequences of average length 282 in 21 seconds on a current desktop computer. Conclusions: MUSCLE offers a range of options that provide improved speed and / or alignment accuracy compared with currently available programs. MUSCLE is freely available at http:// www.drive5.com/muscle. © 2004 Edgar; licensee BioMed Central Ltd.

References Powered by Scopus

Gapped BLAST and PSI-BLAST: A new generation of protein database search programs

63315Citations
N/AReaders
Get full text

CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice

58546Citations
N/AReaders
Get full text

MUSCLE: Multiple sequence alignment with high accuracy and high throughput

35887Citations
N/AReaders
Get full text

Cited by Powered by Scopus

MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets

37491Citations
N/AReaders
Get full text

MEGA5: Molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods

36481Citations
N/AReaders
Get full text

MAFFT version 5: Improvement in accuracy of multiple sequence alignment

4096Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Edgar, R. C. (2004). MUSCLE: A multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics, 5. https://doi.org/10.1186/1471-2105-5-113

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 1542

62%

Researcher 619

25%

Professor / Associate Prof. 249

10%

Lecturer / Post doc 66

3%

Readers' Discipline

Tooltip

Agricultural and Biological Sciences 1496

60%

Biochemistry, Genetics and Molecular Bi... 734

30%

Computer Science 144

6%

Immunology and Microbiology 101

4%

Article Metrics

Tooltip
Mentions
News Mentions: 10
References: 4
Social Media
Shares, Likes & Comments: 6

Save time finding and organizing research with Mendeley

Sign up for free