Harnessing Meta-Learning for Improving Full-Frame Video Stabilization
Abstract: Video stabilization is a longstanding computer vision problem, particularly pixel-level synthesis solutions for video stabilization which synthesize full frames add to the complexity of this task. These techniques aim to stabilize videos by synthesizing full frames while enhancing the stability of the considered video. This intensifies the complexity of the task due to the distinct mix of unique motion profiles and visual content present in each video sequence, making robust generalization with fixed parameters difficult. In our study, we introduce a novel approach to enhance the performance of pixel-level synthesis solutions for video stabilization by adapting these models to individual input video sequences. The proposed adaptation exploits low-level visual cues accessible during test-time to improve both the stability and quality of resulting videos. We highlight the efficacy of our methodology of "test-time adaptation" through simple fine-tuning of one of these models, followed by significant stability gain via the integration of meta-learning techniques. Notably, significant improvement is achieved with only a single adaptation step. The versatility of the proposed algorithm is demonstrated by consistently improving the performance of various pixel-level synthesis models for video stabilization in real-world scenarios.
- Deep motion blind video stabilization. arXiv preprint arXiv:2011.09697, 2020.
- Learning task agnostic temporal consistency correction. arXiv preprint arXiv:2206.03753, 2022.
- Meta-learning deep visual words for fast video object segmentation. In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2020.
- Non-metric image-based rendering for video stabilization. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2001.
- Meta-learning-based incremental few-shot object detection. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2021.
- Camera distortion-aware 3d human pose estimation in video with optimization-based meta-learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021.
- Deep iterative frame interpolation for full-frame video stabilization. ACM TOG, 2020.
- Deep meta learning for real-time target-aware visual tracking. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2019.
- Self-supervised real-time video stabilization. arXiv preprint arXiv:2111.05980, 2021a.
- Scene-adaptive video frame interpolation via meta-learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
- Test-time adaptation for video frame interpolation via meta-learning. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2021b.
- Imagenet: A large-scale hierarchical image database. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2009.
- Minet: Meta-learning instance identifiers for video object detection. IEEE Transactions on Image Processing (TIP), 2021.
- Model-agnostic meta-learning for fast adaptation of deep networks. In International Conference on Machine Learning (ICML), 2017.
- Embodied one-shot video recognition: Learning from actions of a virtual embodied agent. In ACM International Conference on Multimedia (MM), 2019.
- Globalflownet: Video stabilization using deep distilled global motion estimates. In Winter Conference on Applications of Computer Vision (WACV), 2023.
- Video stabilization using epipolar geometry. ACM TOG, 2012.
- Auto-directed video stabilization with robust l1 optimal camera paths. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2011.
- Ada-vsr: Adaptive video super-resolution with meta-learning. In ACM International Conference on Multimedia (MM), 2021.
- Few-shot personality-specific image captioning via meta-learning. In Conference on Robots and Vision, 2023.
- Perceptual losses for real-time style transfer and super-resolution. In European conference on computer vision, pages 694–711. Springer, 2016.
- Digital video stabilization and rolling shutter correction using gyroscopes. CSTR, 2011.
- Metapix: Few-shot video retargeting. arXiv preprint arXiv:1910.04742, 2019.
- Video stabilization using robust feature trajectories. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2009.
- Dynavsr: Dynamic adaptive blind video super-resolution. In Winter Conference on Applications of Computer Vision (WACV), 2021.
- Self-supervised video representation learning with meta-contrastive network. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2021.
- Content-preserving warps for 3d video stabilization. ACM Transactions on Graphics (SIGGRAPH), 2009.
- Subspace video stabilization. ACM TOG, 2011.
- Video stabilization with a depth camera. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2012.
- Bundled camera paths for video stabilization. ACM TOG, 2013.
- Hybrid neural fusion for full-frame video stabilization. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2021.
- A deep meta-learning neural network for single image rain removal. In International Congress on Image and Signal Processing, BioMedical Engineering and Informatics, 2020.
- Bilevel fast scene adaptation for low-light image enhancement. arXiv preprint arXiv:2306.01343, 2023.
- Full-frame video stabilization with motion inpainting. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2006.
- The contextual loss for image transformation with non-aligned data. In Proceedings of the European Conference on Computer Vision (ECCV), 2018.
- Video deblurring by fitting to test data. arXiv preprint arXiv:2012.05228, 2020.
- Light field video stabilization. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2009.
- Tracking by instance detection: A meta-learning approach. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
- Deep online video stabilization with multi-grid warping transformation learning. IEEE Transactions on Image Processing (TIP), 2018.
- Meta-learning based siamese network with channel-wise self-attention for visual tracking. In International Conference on Image, Video and Signal Processing, 2021.
- Video stabilization: A comprehensive survey. Neurocomputing, 2022.
- Spatially and temporally optimized video stabilization. IEEE transactions on visualization and computer graphics, 2013.
- Out-of-boundary view synthesis towards full-frame video stabilization. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2021.
- Dut: Learning video stabilization by simply watching unstable videos. IEEE Transactions on Image Processing (TIP), 2022.
- Toward human perception-centric video thumbnail generation. In ACM International Conference on Multimedia (MM), 2023.
- Robust video stabilization by optimization in cnn weight space. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
- Learning video stabilization using optical flow. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
- Minimum latency deep online video stabilization. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2023.
- One to many: Adaptive instrument segmentation via meta learning and dynamic online adaptation in robotic surgical video. In IEEE International Conference on Robotics and Automation (ICRA), 2021.
- Plane-based content preserving warps for video stabilization. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2013.
- L 2 c–learning to learn to compress. In IEEE 22nd International Workshop on Multimedia Signal Processing, 2020.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.