The classification of HLA supertypes by GRID/CPCA and hierarchical clustering methods.

2Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Biological experiments often produce enormous amount of data, which are usually analyzed by data clustering. Cluster analysis refers to statistical methods that are used to assign data with similar properties into several smaller, more meaningful groups. Two commonly used clustering techniques are introduced in the following section: principal component analysis (PCA) and hierarchical clustering. PCA calculates the variance between variables and groups them into a few uncorrelated groups or principal components (PCs) that are orthogonal to each other. Hierarchical clustering is carried out by separating data into many clusters and merging similar clusters together. Here, we use an example of human leukocyte antigen (HLA) supertype classification to demonstrate the usage of the two methods. Two programs, Generating Optimal Linear Partial Least Square Estimations (GOLPE) and Sybyl, are used for PCA and hierarchical clustering, respectively. However, the reader should bear in mind that the methods have been incorporated into other software as well, such as SIMCA, statistiXL, and R.

Cite

CITATION STYLE

APA

Guan, P., Doytchinova, I. A., & Flower, D. R. (2007). The classification of HLA supertypes by GRID/CPCA and hierarchical clustering methods. Methods in Molecular Biology (Clifton, N.J.), 409, 143–154. https://doi.org/10.1007/978-1-60327-118-9_9

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free