Shape analysis of high-throughput transcriptomics experiment data

3Citations
Citations of this article
28Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

The recent growth of high-throughput transcriptome technology has been paralleled by the development of statistical methodologies to analyze the data they produce. Some of these newly developed methods are based on the assumption that the data observed or a transformation of the data are relatively symmetric with light tails, usually summarized by assuming a Gaussian random component. It is indeed very difficult to assess this assumption for small sample sizes. In this article, we utilize L-moments statistics as the basis of exploratory data analysis, the assessment of distributional assumptions, and the hypothesis testing of high-throughput transcriptomic data. In particular, we use L-moments ratios for assessing the shape (skewness and kurtosis) of high-throughput transcriptome data. Based on these statistics, we propose an algorithm for identifying genes with distributions that are markedly different from the majority in the data. In addition, we also illustrate the utility of this framework to characterize the robustness of distributional assumptions. We apply it to RNA-seq data and find that methods based on the simple $t$-test for differential expression analysis using L-moments as weights are robust.

References Powered by Scopus

Differential expression analysis for sequence count data

11970Citations
N/AReaders
Get full text

Mapping and quantifying mammalian transcriptomes by RNA-Seq

10998Citations
N/AReaders
Get full text

Linear models and empirical bayes methods for assessing differential expression in microarray experiments

9662Citations
N/AReaders
Get full text

Cited by Powered by Scopus

BatchQC: Interactive software for evaluating sample and batch effects in genomic data

45Citations
N/AReaders
Get full text

Batch effect correction methods for NASA GeneLab transcriptomic datasets

3Citations
N/AReaders
Get full text

Per-sample standardization and asymmetric winsorization lead to accurate clustering of RNA-seq expression profiles

3Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Okrah, K., & Bravo, H. C. (2015). Shape analysis of high-throughput transcriptomics experiment data. Biostatistics, 16(4), 627–640. https://doi.org/10.1093/biostatistics/kxv018

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 14

70%

Researcher 4

20%

Professor / Associate Prof. 2

10%

Readers' Discipline

Tooltip

Agricultural and Biological Sciences 12

57%

Computer Science 4

19%

Biochemistry, Genetics and Molecular Bi... 3

14%

Medicine and Dentistry 2

10%

Save time finding and organizing research with Mendeley

Sign up for free