A blocking strategy for ranking features according to probabilistic relevance

Gianluca Bontempi

Conference Proceedings

A blocking strategy for ranking features according to probabilistic relevance

Bontempi G

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 10122 LNCS 59-69

DOI: 10.1007/978-3-319-51469-7_5

0Citations

2Readers

Get full text

Abstract

The paper presents an algorithm to rank features in “small number of samples, large dimensionality” problems according to probabilistic feature relevance, a novel definition of feature relevance. Probabilistic feature relevance, intended as expected weak relevance, is introduced in order to address the problem of estimating conventional feature relevance in data settings where the number of samples is much smaller than the number of features. The resulting ranking algorithm relies on a blocking approach for estimation and consists in creating a large number of identical configurations to measure the conditional information of each feature in a paired manner. Its implementation can be made embarrassingly parallel in the case of very large n. A number of experiments on simulated and real data confirms the interest of the approach.

Cite

CITATION STYLE

APA

Bontempi, G. (2016). A blocking strategy for ranking features according to probabilistic relevance. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10122 LNCS, pp. 59–69). Springer Verlag. https://doi.org/10.1007/978-3-319-51469-7_5

A blocking strategy for ranking features according to probabilistic relevance

Abstract

Cite

Register to see more suggestions