Curricular Object Manipulation in LiDAR-based Object Detection

Ziyue Zhu; Qiang Meng; Xiao Wang; Ke Wang; Liujiang Yan; Jian Yang

Conference Proceedings

Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2023), Vol. 2023-June, pp. 1125–1135

DOI: 10.1109/CVPR52729.2023.00115

17 Citations

37 Readers

Abstract

This paper explores the potential of curriculum learning in LiDAR-based 3D object detection by proposing a curricular object manipulation (COM) framework. The framework embeds the curricular training strategy into both the loss design and the augmentation process. For the loss design, we propose the COMLoss to dynamically predict object-level difficulties and emphasize objects of different difficulties based on training stages. On top of GT-Aug, an augmentation technique widely used in LiDAR detection tasks, we propose a novel COMAug strategy, which first clusters objects in the ground-truth database based on well-designed heuristics. Group-level difficulties, rather than individual ones, are then predicted and updated during training for stable results. Model performance and generalization capabilities can be improved by sampling and augmenting progressively more difficult objects into the training samples. Extensive experiments and ablation studies reveal the superiority and generality of the proposed framework.
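
The abstract does not give the update or sampling rules, so the following is only an illustrative sketch of the general idea of group-level curricular sampling, not the authors' COMAug implementation. It assumes (hypothetically) that each group of ground-truth objects carries a difficulty score in [0, 1], that scores are smoothed with an exponential moving average, and that groups are sampled with a Gaussian weight centred on a target difficulty that rises with training progress. The function names, the `momentum` and `width` parameters, and the weighting scheme are all assumptions made for illustration.

```python
import math
import random


def update_group_difficulty(difficulty, group_id, observed, momentum=0.9):
    """Smooth one group's difficulty score with an exponential moving average.

    Assumed update rule: the abstract only says that group-level
    difficulties are predicted and updated during training.
    """
    difficulty[group_id] = (
        momentum * difficulty[group_id] + (1.0 - momentum) * observed
    )


def sampling_weights(difficulty, progress, width=0.25):
    """Weight each group by how close its difficulty is to a moving target.

    `progress` runs from 0 (start of training) to 1 (end). The target
    difficulty is set equal to `progress` (an assumption), so easy groups
    dominate early and hard groups dominate late.
    """
    return [
        math.exp(-((d - progress) ** 2) / (2.0 * width ** 2))
        for d in difficulty
    ]


def sample_groups(difficulty, progress, k, rng):
    """Draw k group indices (with replacement) using the curricular weights."""
    weights = sampling_weights(difficulty, progress)
    return rng.choices(range(len(difficulty)), weights=weights, k=k)


if __name__ == "__main__":
    rng = random.Random(0)
    scores = [0.1, 0.5, 0.9]  # hypothetical easy, medium, hard groups
    for progress in (0.0, 0.5, 1.0):
        print(progress, sample_groups(scores, progress, k=8, rng=rng))
```

In this sketch, objects would then be drawn from the sampled groups and pasted into training scenes as in ordinary ground-truth augmentation; that step is omitted here because it depends on details the abstract does not state.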

Author supplied keywords

3D from single images

Cite

CITATION STYLE

APA

Zhu, Z., Meng, Q., Wang, X., Wang, K., Yan, L., & Yang, J. (2023). Curricular Object Manipulation in LiDAR-based Object Detection. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Vol. 2023-June, pp. 1125–1135). IEEE Computer Society. https://doi.org/10.1109/CVPR52729.2023.00115
