earticle

논문검색

Oral Session B-1: Vision Applications

Contrastive Encoders for 3D Point Cloud Understanding : A Comparative Survey

초록

영어

Three-dimensional (3D) point clouds provide detailed geometric understanding of real-world environments but remain challenging to process due to their sparse and unordered nature. Contrastive learning has emerged as a powerful self-supervised approach for learning representations from unlabeled 3D point cloud data. At the core of these methods lie encoder architectures that project raw points into discriminative latent spaces. This brief survey highlights major encoder families used in 3D contrastive learning and analyzes their design principles, strengths, and limitations. We further discuss how encoder choice influences downstream performance and outline research trends toward efficient, multimodal, and real-time contrastive frameworks.

목차

Abstract
I. INTRODUCTION
II. ENCODER ARCHITECTURES
A. Point-based Encoder
B. Voxel-based Encoders
C. Graph-based Encoders
D. Transformer-based Encoders
E. Multi-modal Encoders
III. CHALLENGES AND FUTURE DIRECTIONS
IV. CONCLUSION
REFERENCES

저자정보

  • Rehenuma Tasnim Rodoshi Department of Computer Science George Mason University Fairfax, VA, USA
  • Rezoan Ahmed Nazib Department of Computer Science George Mason University Fairfax, VA, USA
  • Wooyeol Choi School of Computer Science and Engineering Chung-Ang University Seoul, Republic of Korea

참고문헌

자료제공 : 네이버학술정보

    함께 이용한 논문

      0개의 논문이 장바구니에 담겼습니다.