Bichen Wu, Research Scientist at Facebook Reality Labs, presents the “Efficient Deep Learning for 3D Point Cloud Understanding” tutorial at the May 2021 Embedded Vision Summit.
Understanding the 3D environment is a crucial computer vision capability required by a growing set of applications such as autonomous driving, AR/VR and AIoT. 3D visual information, captured by LiDAR and other sensors, is typically represented by a point cloud consisting of thousands of unstructured points.
Developing computer vision solutions that understand 3D point clouds requires addressing several challenges: how to represent and process point clouds efficiently, how to design neural networks efficient enough to run on-device, and how to easily obtain training data for 3D models while improving data efficiency. In this talk, Wu shows how his company addresses these challenges as part of its “SqueezeSeg” research and presents a highly efficient, accurate, and data-efficient solution for on-device 3D point-cloud understanding.
See here for a PDF of the slides.