earticle

논문검색

Session Ⅰ: Computer Vision and Image Analysis

Sparse Convolution for 3D Object Detection

초록

영어

3D object detection is widely applied in robotics and autonomous driving; since 3D scenes in autonomous driving are typically outdoor environments, current methods exhibit substantial computational wastage and significant time delays when using convolution directly in the backbone network. This paper proposes a backbone network based on sparse convolutional spatial-semantic fusion modules to solve this problem. High-level semantic features and low-level spatial features extracted through sub-manifold sparse convolution and sparse convolution are fused to enhance feature representation capabilities. Our proposed backbone network achieves excellent performance on the KITTI dataset.

목차

Abstract
I. INTRODUCTION
II. PROBLEM FORMULATION
A. Definition
B. Problem
III. THE PROPOSED MODEL
A. Pillar Encoding module
B. Backbone Part
C. Neck module, detection head and loss function
IV. EXPERIMENTAL RESULT
A. KITTI
B. Experiment details
C. KITTI evaluation results
D. Efficiency analysis
V. CONCLUSION
REFERENCES

저자정보

  • Xingjian Pei Department of Computer Science and Technology Chongqing University of Posts and Telecommunications Chongqing, China
  • Zhangyu Xia Department of Computer Science and Technology Chongqing University of Posts and Telecommunications Chongqing, China

참고문헌

자료제공 : 네이버학술정보

    함께 이용한 논문

      0개의 논문이 장바구니에 담겼습니다.