Improvement in speed and accuracy of multiple sequence alignment program prime

3Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.

Abstract

Multiple sequence alignment (MSA) is a useful tool in bioinformatics. Although many MSA algorithms have been developed, there is still room for improvement in accuracy and speed. We have developed an MSA program PRIME, whose crucial feature is the use of a group-to-group sequence alignment algorithm with a piecewise linear gap cost. We have shown that PRIME is one of the most accurate MSA programs currently available. However, PRIME is slower than other leading MSA programs. To improve computational performance, we newly incorporate anchoring and grouping heuristics into PRIME. An anchoring method is to locate well-conserved regions in a given MSA as anchor points to reduce the region of DP matrix to be examined, while a grouping method detects conserved subfamily alignments specified by phylogenetic tree in a given MSA to reduce the number of iterative refinement steps. The results of BAliBASE 3.0 and PREFAB 4 benchmark tests indicated that these heuristics contributed to reduction in the computational time of PRIME by more than 60% while the average alignment accuracy measures decreased by at most 2%. Additionally, we evaluated the effectiveness of iterative refinement algorithm based on maximal expected accuracy (MEA). Our experiments revealed that when many sequences are aligned, the MEA-based algorithm significantly improves alignment accuracy compared with the standard version of PRIME at the expense of a considerable increase in computation time. © 2008 Information Processing Society of Japan.

References Powered by Scopus

CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice

58468Citations
N/AReaders
Get full text

MUSCLE: Multiple sequence alignment with high accuracy and high throughput

35724Citations
N/AReaders
Get full text

MAFFT: A novel method for rapid multiple sequence alignment based on fast Fourier transform

12121Citations
N/AReaders
Get full text

Cited by Powered by Scopus

A classification of bioinformatics algorithms from the viewpoint of maximizing expected accuracy (MEA)

14Citations
N/AReaders
Get full text

RBT-L: A location based approach for solving the multiple sequence alignment problem

7Citations
N/AReaders
Get full text

RBT-Km: K-means clustering for multiple sequence alignment

1Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Yamada, S., Gotoh, O., & Yamana, H. (2008). Improvement in speed and accuracy of multiple sequence alignment program prime. IPSJ Transactions on Bioinformatics, 1, 2–12. https://doi.org/10.2197/ipsjtbio.1.2

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 3

50%

Professor / Associate Prof. 2

33%

Lecturer / Post doc 1

17%

Readers' Discipline

Tooltip

Computer Science 3

50%

Agricultural and Biological Sciences 2

33%

Mathematics 1

17%

Save time finding and organizing research with Mendeley

Sign up for free