Code developments to improve the efficiency of automated MS/MS spectra interpretation

184Citations
Citations of this article
58Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We report the results of our work to facilitate protein identification using tandem mass spectra and protein sequence databases. We describe a parallel version of SEQUEST (SEQUEST-PVM) that is tolerant toward arithmetic exceptions. The changes we report effectively separate search processes on slave nodes from each other. Therefore, if one of the slave nodes drops out of the cluster due to an error, the rest of the cluster will carry the search process to the end. SEQUEST has been widely used for protein identifications. The modifications made to the code improve its stability and effectiveness in a high-throughput production environment. We evaluate the overhead associated with the parallelization of SEQUEST. A prior version of software to preprocess LC/MS/MS data attempted to differentiate the charge states of ions. Singly charged ions can be accurately identified, but the software was unable to reliably differentiate tandem mass spectra of +2 and +3 charge states. We have designed and implemented a computational approach to narrow charge states of precursor ions from nominal resolution ion-trap tandem mass spectra. The preprocessing code, 2to3, determines the charge state of the precursor ion using its mass-to-charge ratio (m/z) and fragment ions contained in the tandem mass spectrum. For each possible charge state the program calculates the expected fragment ions that account for precursor ion m/z vlues. If any one of the numbers is less than an empirically determined threshold value then the spectrum corresponding to that charge state is removed. If both numbers are higher than the threshold value then +2 and +3 copies of the spectrum are kept. We present the comparison of results from protein identification experiments with and without using 2to3. It is shown that by determining the charge state and eliminating poor quality spectra 2to3 decreases the number of spectral files to be searched without affecting the search results. The decrease reduces computer requirements and researcher efforts for analysis of the results.

References Powered by Scopus

Probability-based protein identification by searching sequence databases using mass spectrometry data

7143Citations
N/AReaders
Get full text

An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database

5777Citations
N/AReaders
Get full text

Interpreting Mass Spectra of Multiply Charged Ions

580Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search

4129Citations
N/AReaders
Get full text

Protein analysis by shotgun/bottom-up proteomics

1160Citations
N/AReaders
Get full text

Histone H3 methylation by Set2 directs deacetylation of coding regions by Rpd3S to suppress spurious intragenic transcription

1054Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Sadygov, R. G., Eng, J., Durr, E., Saraf, A., McDonald, H., MacCoss, M. J., & Yates, J. R. (2002). Code developments to improve the efficiency of automated MS/MS spectra interpretation. Journal of Proteome Research, 1(3), 211–215. https://doi.org/10.1021/pr015514r

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 19

42%

Researcher 16

36%

Professor / Associate Prof. 10

22%

Readers' Discipline

Tooltip

Agricultural and Biological Sciences 21

50%

Biochemistry, Genetics and Molecular Bi... 10

24%

Computer Science 8

19%

Chemistry 3

7%

Save time finding and organizing research with Mendeley

Sign up for free