Curricular Object Manipulation in LiDAR-based Object Detection


Abstract

This paper explores the potential of curriculum learning in LiDAR-based 3D object detection by proposing a curricular object manipulation (COM) framework. The framework embeds the curricular training strategy into both the loss design and the augmentation process. For the loss design, we propose COMLoss, which dynamically predicts object-level difficulties and emphasizes objects of different difficulties according to the training stage. On top of GT-Aug, an augmentation technique widely used in LiDAR detection tasks, we propose a novel COMAug strategy that first clusters objects in the ground-truth database based on well-designed heuristics. Group-level difficulties, rather than individual ones, are then predicted and updated during training for stable results. Sampling and augmenting progressively more difficult objects into the training samples can improve model performance and generalization. Extensive experiments and ablation studies reveal the superiority and generality of the proposed framework.
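
The abstract does not give the exact formulation, so the following is only a minimal sketch of the general idea behind group-level curriculum sampling, not the COMAug procedure itself. It assumes each object group carries a difficulty score in [0, 1] that is smoothed with an exponential moving average, and that sampling prefers groups whose difficulty is close to a target that grows with training progress. The class name `GroupCurriculumSampler`, the Gaussian weighting, and the parameters `momentum` and `sigma` are all hypothetical choices made for illustration.

```python
import math
import random


class GroupCurriculumSampler:
    """Toy curriculum sampler over object groups (illustrative assumption, not the paper's method)."""

    def __init__(self, num_groups, momentum=0.9, sigma=0.2, seed=0):
        # One difficulty score per group, assumed to lie in [0, 1].
        self.difficulty = [0.5] * num_groups
        self.momentum = momentum  # EMA smoothing factor (hypothetical)
        self.sigma = sigma        # width of the difficulty preference (hypothetical)
        self.rng = random.Random(seed)

    def update(self, group_id, score):
        """Blend a newly observed difficulty score in [0, 1] into the group's running estimate."""
        m = self.momentum
        self.difficulty[group_id] = m * self.difficulty[group_id] + (1.0 - m) * score

    def weights(self, progress):
        """Gaussian preference centred on `progress`, the fraction of training completed in [0, 1]."""
        two_var = 2.0 * self.sigma ** 2
        return [math.exp(-((d - progress) ** 2) / two_var) for d in self.difficulty]

    def sample(self, progress, k):
        """Draw k group indices, favouring groups whose difficulty matches the current stage."""
        indices = range(len(self.difficulty))
        return self.rng.choices(indices, weights=self.weights(progress), k=k)
```

With three groups of difficulty 0.1, 0.5, and 0.9, `weights(0.0)` ranks the easiest group highest and `weights(1.0)` ranks the hardest group highest, which mirrors the "progressively more difficult objects" schedule described above. Updating whole groups rather than single objects means each estimate averages over many observations, which is one plausible reading of why group-level difficulties give more stable results.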

Citation (APA)

Zhu, Z., Meng, Q., Wang, X., Wang, K., Yan, L., & Yang, J. (2023). Curricular Object Manipulation in LiDAR-based Object Detection. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Vol. 2023-June, pp. 1125–1135). IEEE Computer Society. https://doi.org/10.1109/CVPR52729.2023.00115
