A novel feature selection-based sequential ensemble learning method for class noise detection in high-dimensional data

4Citations
Citations of this article
1Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Most of the irrelevant or noise features in high-dimensional data present significant challenges to high-dimensional mislabeled instances detection methods based on feature selection. Traditional methods often perform the two dependent step: The first step, searching for the relevant subspace, and the second step, using the feature subspace which obtained in the previous step training model. However, Feature subspace that are not related to noise scores and influence detection performance. In this paper, we propose a novel sequential ensemble method SENF that aggregate the above two phases, our method learns the sequential ensembles to obtain refine feature subspace and improve detection accuracy by iterative sparse modeling with noise scores as the regression target attribute. Through extensive experiments on 8 real-world high-dimensional datasets from the UCI machine learning repository [3], we show that SENF performs significantly better or at least similar to the individual baselines as well as the existing state-of-the-art label noise detection method.

Cite

CITATION STYLE

APA

Chen, K., Guan, D., Yuan, W., Li, B., Khattak, A. M., & Alfandi, O. (2018). A novel feature selection-based sequential ensemble learning method for class noise detection in high-dimensional data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11323 LNAI, pp. 55–65). Springer Verlag. https://doi.org/10.1007/978-3-030-05090-0_5

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free