The differential feature detection and the clustering analysis to breast cancers

2Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Breast cancer is one of the most common and fatal cancers in women today. To detect the differential features for correctly diagnosing breast cancer patients, a 2D feature selection algorithm is proposed in this paper. It is named as SDP for short because it adopts the Standard Deviation and Pearson correlation coefficient to define the feature discernibility and independence respectively. The feature importance is defined as the product of its discernibility and independence. A 2D space is constructed using discernibility as x-axis and independence as y-axis. The area of the rectangular enclosed by a feature coordinate lines and axes implies the importance of the feature. Those features in the upper right corner of the 2D space are with much higher importance than the rest ones. They comprise the differential feature subset. The spectral clustering algorithms SC_SD and SC_MD are adopted for clustering analysis, so the two algorithms SDP+SC_SD and SDP+SC_MD were developed for clustering analysis to the breast cancer data WBCD (Wisconsin Breast Cancer Database), WDBC (Wisconsin Diagnostic Breast Cancer), and WPBC (Wisconsin Prognostic Breast Cancer). The experimental results demonstrate that the proposed SDP can detect much better differential features of breast cancers than other compared feature selection algorithms, and the SDP+SC_SD and SDP+SC_MD algorithms outperform the algorithms without feature selection process embedded, such as SC_SD, SC_MD, DPC, SD_DPC, K-means and SD_K-medoids in terms of clustering accuracy, AMI (Adjusted Mutual Information), ARI (Adjusted Rand Index), sensitivity, specificity and precision.

Cite

CITATION STYLE

APA

Xie, J., Wu, Z., Xia, Q., Ding, L., & Fujita, H. (2020). The differential feature detection and the clustering analysis to breast cancers. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12144 LNAI, pp. 457–469). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-55789-8_40

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free