Extend Your Own Correspondences: Unsupervised Distant Point Cloud Registration by Progressive Distance Extension
Abstract: Registration of point clouds collected from a pair of distant vehicles provides a comprehensive and accurate 3D view of the driving scenario, which is vital for driving safety related applications, yet existing literature suffers from the expensive pose label acquisition and the deficiency to generalize to new data distributions. In this paper, we propose EYOC, an unsupervised distant point cloud registration method that adapts to new point cloud distributions on the fly, requiring no global pose labels. The core idea of EYOC is to train a feature extractor in a progressive fashion, where in each round, the feature extractor, trained with near point cloud pairs, can label slightly farther point cloud pairs, enabling self-supervision on such far point cloud pairs. This process continues until the derived extractor can be used to register distant point clouds. Particularly, to enable high-fidelity correspondence label generation, we devise an effective spatial filtering scheme to select the most representative correspondences to register a point cloud pair, and then utilize the aligned point clouds to discover more correct correspondences. Experiments show that EYOC can achieve comparable performance with state-of-the-art supervised methods at a lower training cost. Moreover, it outwits supervised methods regarding generalization performance on new data distributions.
- RPSRNet: End-to-end trainable rigid point set registration network using Barnes-Hut 2D-Tree representation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 13100–13110, 2021.
- SpinNet: Learning a general surface descriptor for 3D point cloud registration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 11753–11762, 2021.
- BUFFER: Balancing accuracy, efficiency, and generalizability in point cloud registration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1255–1264, 2023.
- D3Feat: Joint learning of dense detection and description of 3D local features. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 6359–6367, 2020.
- PointDSC: Robust point cloud registration using deep spatial consistency. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 15859–15869, 2021.
- nuScenes: A multimodal dataset for autonomous driving. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.
- SC2-PCR: A second order spatial compatibility for efficient and robust point cloud registration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 13221–13231, 2022.
- 4d spatio-temporal convnets: Minkowski convolutional neural networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3075–3084, 2019a.
- Fully convolutional geometric features. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019b.
- Deep global registration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 2514–2523, 2020.
- PPFNet: Global context aware local features for robust 3D point matching. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 195–205, 2018.
- Bootstrap your own correspondences. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 6433–6442, 2021.
- UnsupervisedR&R: Unsupervised point cloud registration via differentiable rendering. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 7129–7139, 2021.
- Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Communications of the ACM, 24(6):381–395, 1981.
- Robust point cloud registration framework based on deep graph matching. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 8893–8902, 2021.
- Are we ready for autonomous driving? the KITTI vision benchmark suite. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2012.
- SuperPatchMatch: An algorithm for robust correspondences using superpixel patches. IEEE Transactions on Image Processing, 26(8):4068–4078, 2017.
- Learned compact local feature descriptor for tls-based geodetic monitoring of natural outdoor scenes. International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, 4:113–120, 2018.
- The perfect match: 3D point cloud matching with smoothed densities. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5545–5554, 2019.
- PREDATOR: Registration of 3D point clouds with low overlap. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 4267–4276, 2021.
- SemanticKITTI: A dataset for semantic scene understanding of LiDAR sequences. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019.
- Using spin images for efficient object recognition in cluttered 3D scenes. IEEE Transactions on Pattern Analysis and Machine Intelligence, 21(5):433–449, 1999.
- Density adaptive point set registration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3829–3837, 2018.
- DeepPRO: Deep partial point cloud registration of objects. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 5683–5692, 2021a.
- Deep hough voting for robust global registration. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 15994–16003, 2021b.
- Lepard: Learning partial point cloud matching in rigid and deformable scenes. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5554–5564, 2022.
- BEVFormer: Learning bird’s-eye-view representation from multi-camera images via spatiotemporal transformers. In European Conference on Computer Vision, pages 1–18. Springer, 2022.
- APR: Online distant point cloud registration through aggregated point cloud reconstruction. In Proceedings of the International Joint Conference on Artificial Intelligence, pages 1204–1212. International Joint Conferences on Artificial Intelligence Organization, 2023a. Main Track.
- Density-invariant features for distant point cloud registration. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 18215–18225, 2023b.
- H Christopher Longuet-Higgins. A computer algorithm for reconstructing a scene from two projections. Nature, 293(5828):133–135, 1981.
- HRegNet: A hierarchical network for large-scale outdoor LiDAR point cloud registration. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 16014–16023, 2021.
- Unsupervised deep probabilistic approach for partial point cloud registration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 13611–13620, 2023.
- FastSLAM: A factored solution to the simultaneous localization and mapping problem. Association for the Advancement of Artificial Intelligence / Innovative Applications of Artificial Intelligence Conference, 593598, 2002.
- Orb-slam2: An open-source slam system for monocular, stereo, and RGB-D cameras. IEEE Transactions on Robotics, 33(5):1255–1262, 2017.
- Distinctive 3D local deep descriptors. In Proceedings of the International Conference on Pattern Recognition, pages 5720–5727. IEEE, 2021.
- PointNet: Deep learning on point sets for 3D classification and segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 652–660, 2017.
- Geometric transformer for fast and robust point cloud registration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 11143–11152, 2022.
- UniFusion: Unified multi-view fusion transformer for spatial-temporal representation in bird’s-eye-view. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 8690–8699, 2023.
- Accelerating 3D deep learning with PyTorch3D. arXiv:2007.08501, 2020.
- Intriguing properties of randomly weighted networks: Generalizing while learning next to nothing. In Proceedings of the Conference on Computer and Robot Vision, pages 9–16. IEEE, 2019.
- Fast point feature histograms (FPFH) for 3D registration. In Proceedings of the IEEE International Conference on Robotics and Automation, pages 3212–3217. IEEE, 2009.
- Complexer-YOLO: Real-time 3D object detection and tracking on semantic point clouds. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pages 0–0, 2019.
- Scalability in perception for autonomous driving: Waymo Open Dataset. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
- RGCNN: Regularized graph cnn for point cloud segmentation. In Proceedings of the ACM international conference on Multimedia, pages 746–754, 2018.
- KPConv: Flexible and deformable convolution for point clouds. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 6411–6420, 2019.
- Unique signatures of histograms for local surface description. In Proceedings of the European Conference on Computer Vision, pages 356–369. Springer, 2010.
- Deep image prior. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 9446–9454, 2018.
- PRNet: Self-supervised learning for partial-to-partial registration. Advances in Neural Information Processing Systems, 32, 2019.
- PointConv: Deep convolutional networks on 3D point clouds. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 9621–9630, 2019.
- SqueezeSegV3: Spatially-adaptive convolution for efficient point-cloud segmentation. In Proceedings of the European Conference on Computer Vision, pages 1–19. Springer, 2020.
- 3DFeat-Net: Weakly supervised local 3D features for point cloud registration. In Proceedings of the European Conference on Computer Vision, pages 607–623, 2018.
- RPM-Net: Robust point matching using learned features. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 11824–11833, 2020.
- REGTR: End-to-end point cloud correspondences with transformers. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 6677–6686, 2022.
- CoFiNet: Reliable coarse-to-fine correspondences for robust pointcloud registration. Advances in Neural Information Processing Systems, 34:23872–23884, 2021.
- DAIR-V2X: A large-scale dataset for vehicle-infrastructure cooperative 3D object detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 21361–21370, 2022.
- PEAL: Prior-embedded explicit attention learning for low-overlap point cloud registration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 17702–17711, 2023.
- 3DMatch: Learning local geometric descriptors from RGB-D reconstructions. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1802–1811, 2017.
- EMP: Edge-assisted multi-vehicle perception. In Proceedings of the 27th Annual International Conference on Mobile Computing and Networking, pages 545–558, 2021.
- 3D registration with maximal cliques. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 17745–17754, 2023.
- Fast global registration. In Proceedings of the European Conference on Computer Vision, pages 766–782. Springer, 2016.
- VPFNet: Improving 3D object detection with virtual point based LiDAR and stereo data fusion. IEEE Transactions on Multimedia, 2022.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.