Point Primitive Transformer for Long-Term 4D Point Cloud Video Understanding

Hao Wen; Yunze Liu; Jingwei Huang; Bo Duan; Li Yi

Conference Proceedings

Point Primitive Transformer for Long-Term 4D Point Cloud Video Understanding

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2022) 13689 LNCS 19-35

DOI: 10.1007/978-3-031-19818-2_2

4Citations

24Readers

Get full text

Abstract

This paper proposes a 4D backbone for long-term point cloud video understanding. A typical way to capture spatial-temporal context is using 4Dconv or transformer without hierarchy. However, those methods are neither effective nor efficient enough due to camera motion, scene changes, sampling patterns, and complexity of 4D data. To address those issues, we leverage the primitive plane as mid-level representation to capture the long-term spatial-temporal context in 4D point cloud videos, and propose a novel hierarchical backbone named Point Primitive Transformer (PPTr), which is mainly composed of intra-primitive point transformers and primitive transformers. Extensive experiments show that PPTr outperforms the previous state of the arts on different tasks.

Author supplied keywords

Cite

CITATION STYLE

APA

Wen, H., Liu, Y., Huang, J., Duan, B., & Yi, L. (2022). Point Primitive Transformer for Long-Term 4D Point Cloud Video Understanding. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 13689 LNCS, pp. 19–35). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-19818-2_2

Point Primitive Transformer for Long-Term 4D Point Cloud Video Understanding

Abstract

Author supplied keywords

Cite

Register to see more suggestions