PCPNet: An Efficient and Semantic-Enhanced Transformer Network for Point Cloud Prediction

April 16, 2023 ยท Entered Twilight ยท ๐Ÿ› IEEE Robotics and Automation Letters

๐Ÿ’ค TWILIGHT: Eternal Rest
Repo abandoned since publication

Repo contents: LICENSE, README.md, config, figs, pcpnet, poetry.lock, pyTorchChamferDistance, pyproject.toml, semantic_net, test.py, train.py, visualize.py

Authors Zhen Luo, Junyi Ma, Zijie Zhou, Guangming Xiong arXiv ID 2304.07773 Category cs.CV: Computer Vision Cross-listed cs.RO Citations 19 Venue IEEE Robotics and Automation Letters Repository https://github.com/Blurryface0814/PCPNet โญ 33 Last Checked 2 months ago
Abstract
The ability to predict future structure features of environments based on past perception information is extremely needed by autonomous vehicles, which helps to make the following decision-making and path planning more reasonable. Recently, point cloud prediction (PCP) is utilized to predict and describe future environmental structures by the point cloud form. In this letter, we propose a novel efficient Transformer-based network to predict the future LiDAR point clouds exploiting the past point cloud sequences. We also design a semantic auxiliary training strategy to make the predicted LiDAR point cloud sequence semantically similar to the ground truth and thus improves the significance of the deployment for more tasks in real-vehicle applications. Our approach is completely self-supervised, which means it does not require any manual labeling and has a solid generalization ability toward different environments. The experimental results show that our method outperforms the state-of-the-art PCP methods on the prediction results and semantic similarity, and has a good real-time performance. Our open-source code and pre-trained models are available at https://github.com/Blurryface0814/PCPNet.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Computer Vision