Evaluating Machine Learning Models and Their Diagnostic Value

Abstract

This chapter describes model validation, a crucial part of machine learning, whether the goal is to select the best model or to assess the performance of a given model. We start by detailing the main performance metrics for different tasks (classification, regression) and how they may be interpreted, including in the face of class imbalance, varying prevalence, or asymmetric cost–benefit trade-offs. We then explain how to estimate these metrics in an unbiased manner using training, validation, and test sets. We describe cross-validation procedures—which use a larger part of the data for both training and testing—and the dangers of data leakage: optimism bias due to training data contaminating the test set. Finally, we discuss how to obtain confidence intervals for performance metrics, distinguishing two situations: internal validation (evaluation of learning algorithms) and external validation (evaluation of the resulting prediction models).
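The cross-validation procedure summarized above can be sketched in a few lines. The following is a minimal, self-contained illustration (not the chapter's own code): the data are shuffled into k folds, and each fold serves once as the test set for a model trained only on the remaining folds, so no test sample contaminates its own training set. The toy nearest-centroid classifier is an assumption chosen purely to keep the example dependency-free.

```python
import random
from statistics import mean

def k_fold_indices(n, k, seed=0):
    """Shuffle sample indices and split them into k roughly equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]

def nearest_centroid_fit(X, y):
    """Toy 1-D nearest-centroid classifier: store the mean of each class."""
    return {label: mean(x for x, t in zip(X, y) if t == label)
            for label in set(y)}

def nearest_centroid_predict(centroids, X):
    """Assign each point to the class with the closest centroid."""
    return [min(centroids, key=lambda c: abs(x - centroids[c])) for x in X]

def cross_val_accuracy(X, y, k=5):
    """Average test-fold accuracy over k folds: every sample is tested
    exactly once, by a model that never saw it during training."""
    folds = k_fold_indices(len(X), k)
    scores = []
    for test_idx in folds:
        held_out = set(test_idx)
        train_idx = [i for i in range(len(X)) if i not in held_out]
        model = nearest_centroid_fit([X[i] for i in train_idx],
                                     [y[i] for i in train_idx])
        preds = nearest_centroid_predict(model, [X[i] for i in test_idx])
        scores.append(mean(p == y[i] for p, i in zip(preds, test_idx)))
    return mean(scores)
```

Averaging over folds uses more of the data for both training and testing than a single held-out split, at the cost of training k models; the per-fold scores also hint at the variance of the estimate, which the chapter's discussion of confidence intervals addresses more rigorously.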

Citation (APA)
Varoquaux, G., & Colliot, O. (2023). Evaluating Machine Learning Models and Their Diagnostic Value. In Neuromethods (Vol. 197, pp. 601–630). Humana Press Inc. https://doi.org/10.1007/978-1-0716-3195-9_20
