EfficientDet: Scalable and efficient object detection

5.9kCitations
Citations of this article
4.2kReaders
Mendeley users who have this article in their library.
Get full text

Abstract

Model efficiency has become increasingly important in computer vision. In this paper, we systematically study neural network architecture design choices for object detection and propose several key optimizations to improve efficiency. First, we propose a weighted bi-directional feature pyramid network (BiFPN), which allows easy and fast multi-scale feature fusion; Second, we propose a compound scaling method that uniformly scales the resolution, depth, and width for all backbone, feature network, and box/class prediction networks at the same time. Based on these optimizations and EfficientNet backbones, we have developed a new family of object detectors, called EfficientDet, which consistently achieve much better efficiency than prior art across a wide spectrum of resource constraints. In particular, with single-model and single-scale, our EfficientDetD7 achieves state-of-the-art 52.2 AP on COCO test-dev with 52M parameters and 325B FLOPs1, being 4x - 9x smaller and using 13x - 42x fewer FLOPs than previous detector. Code is available at https://github.com/google/automl/tree/master/efficientdet.

References Powered by Scopus

Deep residual learning for image recognition

177516Citations
N/AReaders
Get full text

Mask R-CNN

20642Citations
N/AReaders
Get full text

Aggregated residual transformations for deep neural networks

7825Citations
N/AReaders
Get full text

Cited by Powered by Scopus

YOLOv7: Trainable Bag-of-Freebies Sets New State-of-the-Art for Real-Time Object Detectors

5149Citations
N/AReaders
Get full text

TPH-YOLOv5: Improved YOLOv5 Based on Transformer Prediction Head for Object Detection on Drone-captured Scenarios

1617Citations
N/AReaders
Get full text

Masked-attention Mask Transformer for Universal Image Segmentation

1479Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Tan, M., Pang, R., & Le, Q. V. (2020). EfficientDet: Scalable and efficient object detection. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 10778–10787). IEEE Computer Society. https://doi.org/10.1109/CVPR42600.2020.01079

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 1317

72%

Researcher 382

21%

Lecturer / Post doc 72

4%

Professor / Associate Prof. 55

3%

Readers' Discipline

Tooltip

Computer Science 1455

71%

Engineering 517

25%

Agricultural and Biological Sciences 43

2%

Mathematics 35

2%

Article Metrics

Tooltip
Mentions
References: 1

Save time finding and organizing research with Mendeley

Sign up for free