Survey on deep learning with class imbalance

2.0kCitations
Citations of this article
2.4kReaders
Mendeley users who have this article in their library.

This article is free to access.

Abstract

The purpose of this study is to examine existing deep learning techniques for addressing class imbalanced data. Effective classification with imbalanced data is an important area of research, as high class imbalance is naturally inherent in many real-world applications, e.g., fraud detection and cancer detection. Moreover, highly imbalanced data poses added difficulty, as most learners will exhibit bias towards the majority class, and in extreme cases, may ignore the minority class altogether. Class imbalance has been studied thoroughly over the last two decades using traditional machine learning models, i.e. non-deep learning. Despite recent advances in deep learning, along with its increasing popularity, very little empirical work in the area of deep learning with class imbalance exists. Having achieved record-breaking performance results in several complex domains, investigating the use of deep neural networks for problems containing high levels of class imbalance is of great interest. Available studies regarding class imbalance and deep learning are surveyed in order to better understand the efficacy of deep learning when applied to class imbalanced data. This survey discusses the implementation details and experimental results for each study, and offers additional insight into their strengths and weaknesses. Several areas of focus include: data complexity, architectures tested, performance interpretation, ease of use, big data application, and generalization to other domains. We have found that research in this area is very limited, that most existing work focuses on computer vision tasks with convolutional neural networks, and that the effects of big data are rarely considered. Several traditional methods for class imbalance, e.g. data sampling and cost-sensitive learning, prove to be applicable in deep learning, while more advanced methods that exploit neural network feature learning abilities show promising results. The survey concludes with a discussion that highlights various gaps in deep learning from class imbalanced data for the purpose of guiding future research.

References Powered by Scopus

Deep residual learning for image recognition

178632Citations
N/AReaders
Get full text

Deep learning

64791Citations
N/AReaders
Get full text

Gradient-based learning applied to document recognition

44907Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Review of deep learning: concepts, CNN architectures, challenges, applications, future directions

4546Citations
N/AReaders
Get full text

CatBoost for big data: an interdisciplinary review

790Citations
N/AReaders
Get full text

Data imbalance in classification: Experimental evaluation

562Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Johnson, J. M., & Khoshgoftaar, T. M. (2019). Survey on deep learning with class imbalance. Journal of Big Data, 6(1). https://doi.org/10.1186/s40537-019-0192-5

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 731

66%

Researcher 182

17%

Lecturer / Post doc 132

12%

Professor / Associate Prof. 55

5%

Readers' Discipline

Tooltip

Computer Science 607

65%

Engineering 246

26%

Mathematics 44

5%

Medicine and Dentistry 39

4%

Article Metrics

Tooltip
Mentions
News Mentions: 2
Social Media
Shares, Likes & Comments: 11

Save time finding and organizing research with Mendeley

Sign up for free