μProteInS- A proteogenomics pipeline for finding novel bacterial microproteins encoded by small ORFs

7Citations
Citations of this article
18Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Genome annotation pipelines traditionally exclude open reading frames (ORFs) shorter than 100 codons to avoid false identifications. However, studies have been showing that these may encode functional microproteins with meaningful biological roles. We developed μProteInS, a proteogenomics pipeline that combines genomics, transcriptomics and proteomics to identify novel microproteins in bacteria. Our pipeline employs a model to filter out low confidence spectra, to avoid the need for manually inspecting Mass Spectrometry data. It also overcomes the shortcomings of traditional approaches that usually exclude overlapping genes, leaderless transcripts and non-conserved sequences, characteristics that are common among small ORFs (smORFs) and hamper their identification.

References Powered by Scopus

StringTie enables improved reconstruction of a transcriptome from RNA-seq reads

8212Citations
N/AReaders
Get full text

Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype

7161Citations
N/AReaders
Get full text

Semi-supervised learning for peptide identification from shotgun proteomics datasets

1702Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Small proteins in bacteria – Big challenges in prediction and identification

9Citations
N/AReaders
Get full text

ProsmORF-pred: a machine learning-based method for the identification of small ORFs in prokaryotic genomes

6Citations
N/AReaders
Get full text

Small proteins in Gram-positive bacteria

1Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

De Souza, E. V., Dalberto, P. F., Machado, V. P., Canedo, A., Saghatelian, A., Machado, P., … Bizarro, C. V. (2022). μProteInS- A proteogenomics pipeline for finding novel bacterial microproteins encoded by small ORFs. Bioinformatics, 38(9), 2612–2614. https://doi.org/10.1093/bioinformatics/btac115

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 5

50%

Researcher 4

40%

Professor / Associate Prof. 1

10%

Readers' Discipline

Tooltip

Biochemistry, Genetics and Molecular Bi... 5

45%

Agricultural and Biological Sciences 3

27%

Immunology and Microbiology 2

18%

Neuroscience 1

9%

Save time finding and organizing research with Mendeley

Sign up for free