2000 character limit reached
Learning a Depth Covariance Function
Published 21 Mar 2023 in cs.CV, cs.LG, and cs.RO | (2303.12157v2)
Abstract: We propose learning a depth covariance function with applications to geometric vision tasks. Given RGB images as input, the covariance function can be flexibly used to define priors over depth functions, predictive distributions given observations, and methods for active point selection. We leverage these techniques for a selection of downstream tasks: depth completion, bundle adjustment, and monocular dense visual odometry.
- IronDepth: Iterative refinement of single-view depth using surface normal and its uncertainty. In Proceedings of the British Machine Vision Conference (BMVC), 2022.
- Understanding probabilistic sparse Gaussian process approximations. In Neural Information Processing Systems (NeurIPS), 2016.
- CodeSLAM - learning a compact, optimisable representation for dense visual SLAM. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018.
- Learning meshes for dense visual SLAM. In Proceedings of the International Conference on Computer Vision (ICCV), 2019.
- AK: Attentive kernel for information gathering. In Proceedings of Robotics: Science and Systems (RSS), 2022.
- Depth estimation via affinity learned with convolutional spatial propagation network. In Proceedings of the European Conference on Computer Vision (ECCV), 2018.
- Inverse depth parametrization for monocular SLAM. IEEE Transactions on Robotics (T-RO), 2008.
- Sparsity agnostic depth completion. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2023.
- DeepFactors: Real-time probabilistic dense monocular SLAM. IEEE Robotics and Automation Letters (RA-L), 2020.
- ScanNet: Richly-annotated 3D reconstructions of indoor scenes. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017.
- Structured uncertainty prediction networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018.
- Depth map prediction from a single image using a multi-scale deep network. In Neural Information Processing Systems (NeurIPS), 2014.
- LSD-SLAM: Large-scale direct monocular SLAM. In Proceedings of the European Conference on Computer Vision (ECCV), 2014.
- borglab/gtsam, May 2022.
- Digging into self-supervised monocular depth estimation. In Proceedings of the International Conference on Computer Vision (ICCV), 2019.
- RidgeSfM: Structure from motion via robust pairwise matching under depth uncertainty. In Proceedings of the International Conference on 3D Vision (3DV), 2020.
- Near-optimal sensor placements in Gaussian processes. In Proceedings of the International Conference on Machine Learning (ICML), 2005.
- Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
- Probability product kernels. The Journal of Machine Learning Research, 2004.
- What uncertainties do we need in Bayesian deep learning for computer vision? In Neural Information Processing Systems (NeurIPS), 2017.
- Adam: A method for stochastic optimization. In Proceedings of the International Conference on Learning Representations (ICLR), 2015.
- Auto-encoding variational Bayes. In Proceedings of the International Conference on Learning Representations (ICLR), 2014.
- Accurate uncertainties for deep learning using calibrated regression. In Proceedings of the International Conference on Machine Learning (ICML), 2018.
- Adaptive non-stationary kernel regression for terrain modeling. In Proceedings of Robotics: Science and Systems (RSS), 2007.
- Learning steering kernels for guided depth completion. IEEE Transactions on Image Processing, 2021.
- An iterative image registration technique with an application to stereo vision. In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI), 1981.
- Sparse-to-dense: Depth prediction from sparse depth samples and a single image. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), 2018.
- ORB-SLAM2: An open-source SLAM system for monocular, stereo, and RGB-D cameras. IEEE Transactions on Robotics (T-RO), 2017.
- DTAM: Dense tracking and mapping in real-time. In Proceedings of the International Conference on Computer Vision (ICCV), 2011.
- The promises and pitfalls of deep kernel learning. In Conference on Uncertainty in Artificial Intelligence, 2021.
- Nonstationary covariance functions for Gaussian process regression. In Neural Information Processing Systems (NeurIPS), 2003.
- Non-local spatial propagation network for depth completion. In Proceedings of the European Conference on Computer Vision (ECCV), 2020.
- Bayesian meta-learning for the few-shot setting via deep kernels. In Neural Information Processing Systems (NeurIPS), 2020.
- Towards robust monocular depth estimation: Mixing datasets for zero-shot cross-dataset transfer. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2022.
- Online sparse Gaussian process regression and its applications. IEEE Transactions on Image Processing, 2011.
- Carl Edward Rasmussen and Christopher K. I. Williams. Gaussian Processes for Machine Learning. MIT Press, 2005.
- Jean-Michel M. Rendu. Normal and lognormal estimation. Journal of the International Association for Mathematical Geology, 1979.
- U-Net: Convolutional networks for biomedical image segmentation. In Proceedings of the International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), 2015.
- Structure-from-motion revisited. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
- Good features to track. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 1994.
- Indoor segmentation and support inference from RGBD images. In Proceedings of the European Conference on Computer Vision (ECCV), 2012.
- Learning structured Gaussians to approximate deep ensembles. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022.
- A benchmark for the evaluation of RGB-D SLAM systems. In Proceedings of the IEEE/RSJ Conference on Intelligent Robots and Systems (IROS), 2012.
- BA-Net: Dense bundle adjustment networks. In Proceedings of the International Conference on Learning Representations (ICLR), 2019.
- DeepV2D: Video to depth with differentiable structure from motion. In Proceedings of the International Conference on Learning Representations (ICLR), 2020.
- DROID-SLAM: Deep visual SLAM for monocular, stereo, and RGB-D cameras. In Neural Information Processing Systems (NeurIPS), 2021.
- Michalis Titsias. Variational learning of inducing variables in sparse Gaussian processes. In International Conference on Artificial Intelligence and Statistics (AISTATS), 2009.
- TartanVO: A generalizable learning-based VO. In Conference on Robot Learning (CoRL), 2021.
- Deep kernel learning. In International Conference on Artificial Intelligence and Statistics (AISTATS), 2016.
- Group normalization. In Proceedings of the European Conference on Computer Vision (ECCV), September 2018.
- RigNet: Repetitive image guided network for depth completion. In Proceedings of the European Conference on Computer Vision (ECCV), 2022.
- Balanced depth completion between dense depth inference and sparse range measurements via KISS-GP. In Proceedings of the IEEE/RSJ Conference on Intelligent Robots and Systems (IROS), 2020.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.