Joint Learning of Blind Super-Resolution and Crack Segmentation for Realistic Degraded Images
Abstract: This paper proposes crack segmentation augmented by super resolution (SR) with deep neural networks. In the proposed method, a SR network is jointly trained with a binary segmentation network in an end-to-end manner. This joint learning allows the SR network to be optimized for improving segmentation results. For realistic scenarios, the SR network is extended from non-blind to blind for processing a low-resolution image degraded by unknown blurs. The joint network is improved by our proposed two extra paths that further encourage the mutual optimization between SR and segmentation. Comparative experiments with State of The Art (SoTA) segmentation methods demonstrate the superiority of our joint learning, and various ablation studies prove the effects of our contributions.
- F. Elghaish, S. Talebi, E. Abdellatef, S. T. Matarneh, M. R. Hosseini, S. Wu, M. Mayouf, A. Hajirasouli et al., “Developing a new deep learning cnn model to detect and classify highway cracks,” Journal of Engineering, Design and Technology, 2021.
- E. Shelhamer, J. Long, and T. Darrell, “Fully convolutional networks for semantic segmentation,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 39, no. 4, pp. 640–651, 2017.
- K. He, G. Gkioxari, P. Dollár, and R. B. Girshick, “Mask R-CNN,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 42, no. 2, pp. 386–397, 2020.
- A. Kirillov, K. He, R. B. Girshick, C. Rother, and P. Dollár, “Panoptic segmentation,” in CVPR, 2019.
- S. Stent, R. Gherardi, B. Stenger, K. Soga, and R. Cipolla, “An image-based system for change detection on tunnel linings,” in MVA, 2013.
- Y. Fei, K. C. P. Wang, A. Zhang, C. Chen, J. Q. Li, Y. Liu, G. Yang, and B. Li, “Pixel-level cracking detection on 3d asphalt pavement images through deep-learning- based cracknet-v,” IEEE Trans. Intell. Transp. Syst., vol. 21, no. 1, pp. 273–284, 2020.
- Y. Kondo and N. Ukita, “Crack segmentation for low-resolution images using joint learning with super-resolution,” in MVA, 2021.
- H. Bae, K. Jang, and Y.-K. An, “Deep super resolution crack network (srcnet) for improving computer vision–based automated crack detectability in in situ bridges,” Structural Health Monitoring, vol. 20, no. 4, pp. 1428–1442, 2021.
- L. Wang, D. Li, Y. Zhu, L. Tian, and Y. Shan, “Dual super-resolution learning for semantic segmentation,” in CVPR, 2020, https://github.com/Dootmaan/DSRL.
- S. Minaee, Y. Boykov, F. Porikli, A. Plaza, N. Kehtarnavaz, and D. Terzopoulos, “Image segmentation using deep learning: A survey,” arXiv, 2020.
- D. Dais, İhsan Engin Bal, E. Smyrou, and V. Sarhosis, “Automatic crack classification and segmentation on masonry surfaces using convolutional neural networks and transfer learning,” Automation in Construction, vol. 125, p. 103606, 2021.
- T. Lin, P. Goyal, R. B. Girshick, K. He, and P. Dollár, “Focal loss for dense object detection,” in ICCV, 2017.
- Z. Li, K. Kamnitsas, and B. Glocker, “Analyzing overfitting under class imbalance in neural networks for image segmentation,” IEEE Trans. Medical Imaging, vol. 40, no. 3, pp. 1065–1077, 2021.
- M. S. Hossain, J. M. Betts, and A. P. Paplinski, “Dual focal loss to address class imbalance in semantic segmentation,” Neurocomputing, vol. 462, pp. 69–87, 2021.
- S. R. Bulò, G. Neuhold, and P. Kontschieder, “Loss max-pooling for semantic image segmentation,” in CVPR, 2017.
- L. Gong, Y. Zhang, Y. Zhang, Y. Yang, and W. Xu, “Erroneous pixel prediction for semantic image segmentation,” Comput. Vis. Media, vol. 8, no. 1, pp. 165–175, 2022.
- F. Milletari, N. Navab, and S. Ahmadi, “V-net: Fully convolutional neural networks for volumetric medical image segmentation,” in 3DV, 2016.
- C. H. Sudre, W. Li, T. Vercauteren, S. Ourselin, and M. J. Cardoso, “Generalised dice overlap as a deep learning loss function for highly unbalanced segmentations,” in MICCAI, 2017.
- S. A. Taghanaki, Y. Zheng, S. K. Zhou, B. Georgescu, P. Sharma, D. Xu, D. Comaniciu, and G. Hamarneh, “Combo loss: Handling input and output imbalance in multi-organ segmentation,” Comput Med Imaging Graph, vol. 75, pp. 24–33, 2019.
- D. Karimi and S. E. Salcudean, “Reducing the hausdorff distance in medical image segmentation with convolutional neural networks,” IEEE Trans. Medical Imaging, vol. 39, no. 2, pp. 499–513, 2020.
- H. Kervadec, J. Bouchtiba, C. Desrosiers, E. Granger, J. Dolz, and I. B. Ayed, “Boundary loss for highly unbalanced segmentation,” Medical Image Anal., vol. 67, p. 101851, 2021.
- K. Zhang, Y. Zhang, and H.-D. Cheng, “Crackgan: Pavement crack detection using partially accurate ground truths based on generative adversarial learning,” TITS, vol. 22, no. 2, pp. 1306–1319, 2020.
- A. Rezaie, R. Achanta, M. Godio, and K. Beyer, “Comparison of crack segmentation using digital image correlation measurements and deep learning,” Construction and Building Materials, vol. 261, no. 20, p. 120474, 2020.
- H. Chen, Y. Su, and W. He, “Automatic crack segmentation using deep high-resolution representation learning,” Applied Optics, vol. 60, no. 21, pp. 6080–6090, 2021.
- Y. Liu, J. Yao, X. Lu, R. Xie, and L. Li, “Deepcrack: A deep hierarchical feature learning architecture for crack segmentation,” Neurocomputing, vol. 338, pp. 139–153, 2019.
- H. Liu, X. Miao, C. Mertz, C. Xu, and H. Kong, “Crackformer: Transformer network for fine-grained crack detection,” in ICCV, 2021.
- L. Chen, G. Papandreou, I. Kokkinos, K. Murphy, and A. L. Yuille, “Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 40, no. 4, pp. 834–848, 2018.
- O. Ronneberger, P. Fischer, and T. Brox, “U-net: Convolutional networks for biomedical image segmentation,” in MICCAI, 2015.
- F. Yang, L. Zhang, S. Yu, D. Prokhorov, X. Mei, and H. Ling, “Feature pyramid and hierarchical boosting network for pavement crack detection,” TITS, vol. 21, no. 4, pp. 1525–1535, 2020.
- Z. Wang, J. Chen, and S. C. H. Hoi, “Deep learning for image super-resolution: A survey,” TPAMI, vol. 43, no. 10, pp. 3365–3387, 2021.
- K. Zhang, L. V. Gool, and R. Timofte, “Deep unfolding network for image super-resolution,” in CVPR, 2020.
- J. Lee, J. Park, K. Lee, J. Min, G. Kim, B. Lee, B. Ku, D. K. Han, and H. Ko, “FBRNN: feedback recurrent neural network for extreme image super-resolution,” in CVPR Workshop, 2020.
- L. Lu, W. Li, X. Tao, J. Lu, and J. Jia, “MASA-SR: matching acceleration and spatial adaptation for reference-based image super-resolution,” in CVPR, 2021.
- J. W. Soh, S. Cho, and N. I. Cho, “Meta-transfer learning for zero-shot super-resolution,” in CVPR, 2020.
- K. Zhang, J. L. L. V. Gool, and R. Timofte, “Designing a practical degradation model for deep blind image super-resolution,” in ICCV, 2021.
- S. A. Hussein, T. Tirer, and R. Giryes, “Correction filter for single image super-resolution: Robustifying off-the-shelf deep super-resolvers,” in CVPR, 2020.
- J. Gu, H. Lu, W. Zuo, and C. Dong, “Blind super-resolution with iterative kernel correction,” in CVPR, 2019.
- L. Wang, Y. Wang, X. Dong, Q. Xu, J. Yang, W. An, and Y. Guo, “Unsupervised degradation representation learning for blind super-resolution,” in CVPR, 2021.
- Y. Guo, J. Chen, J. Wang, Q. Chen, J. Cao, Z. Deng, Y. Xu, and M. Tan, “Closed-loop matters: Dual regression networks for single image super-resolution,” in CVPR, 2020.
- S. Y. Kim, H. Sim, and M. Kim, “Koalanet: Blind super-resolution using kernel-oriented adaptive local adjustment,” in CVPR, 2021.
- T. Yoshida, Y. Kondo, T. Maeda, K. Akita, and N. Ukita, “Kernelized back-projection networks for blind super resolution,” arXiv, 2023.
- Z. Hao, Y. Liu, H. Qin, J. Yan, X. Li, and X. Hu, “Scale-aware face detection,” in CVPR, 2017.
- Z. Guo, G. Wu, X. Song, W. Yuan, Q. Chen, H. Zhang, X. Shi, M. Xu, Y. Xu, R. Shibasaki, and X. Shao, “Super-resolution integrated building semantic segmentation for multi-source remote sensing imagery,” IEEE Access, vol. 7, pp. 99 381–99 397, 2019.
- D. Choi, J. H. Choi, J. W. Choi, and B. C. Song, “Sharpness enhancement and super-resolution of around-view monitor images,” IEEE Trans. Intell. Transp. Syst., vol. 19, no. 8, pp. 2650–2662, 2018.
- M. Singh, S. Nagpal, R. Singh, and M. Vatsa, “Dual directed capsule network for very low resolution image recognition,” in ICCV, 2019.
- Y. Zhang, Y. Bai, M. Ding, S. Xu, and B. Ghanem, “Kgsnet: Key-point-guided super-resolution network for pedestrian detection in the wild,” IEEE Trans. Neural Networks Learn. Syst., vol. 32, no. 5, pp. 2251–2265, 2021.
- K. Akita, M. Haris, and N. Ukita, “Region-dependent scale proposals for super-resolution in object detection,” in IPAS, 2020.
- M. Haris, G. Shakhnarovich, and N. Ukita, “Task-driven super resolution: Object detection in low-resolution images,” in ICONIP, 2021.
- ——, “Deep back-projection networks for super-resolution,” in CVPR, 2018.
- H. Zhao, J. Shi, X. Qi, X. Wang, and J. Jia, “Pyramid scene parsing network,” in CVPR, 2017.
- Y. Yuan, X. Chen, and J. Wang, “Object-contextual representations for semantic segmentation,” in ECCV, 2020, https://github.com/openseg-group/openseg.pytorch/blob/master/MODEL_ZOO.md.
- J. Ma, J. Chen, M. Ng, R. Huang, Y. Li, C. Li, X. Yang, and A. L. Martel, “Loss odyssey in medical image segmentation,” Medical Image Anal., vol. 71, p. 102035, 2021.
- S. Ryou, S. Jeong, and P. Perona, “Anchor loss: Modulating loss scale based on prediction difficulty,” in ICCV, 2019.
- DeepMind, “Surface distance metrics,” https://github.com/deepmind/surface-distance.
- X. Wang, K. Yu, C. Dong, and C. C. Loy, “Recovering realistic texture in image super-resolution by deep spatial feature transform,” in CVPR, 2018.
- E. Agustsson and R. Timofte, “Ntire 2017 challenge on single image super-resolution: Dataset and study,” in CVPRW, 2017.
- R. Timofte, E. Agustsson, L. Van Gool, M.-H. Yang, and L. Zhang, “Ntire 2017 challenge on single image super-resolution: Methods and results,” in CVPRW, 2017.
- J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei, “Imagenet: A large-scale hierarchical image database,” in CVPR, 2009.
- K. Simonyan and A. Zisserman, “Very deep convolutional networks for large-scale image recognition,” in ICLR, 2015.
- “Torchvision.models,” https://pytorch.org/vision/stable/models.html.
- D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” in ICLR, 2015.
- Khanhha, “Crack segmentation,” 2020, https://github.com/khanhha/crack_segmentation.
- L. Zhang, F. Yang, Y. D. Zhang, and Y. J. Zhu, “Road crack detection using deep convolutional neural network,” in ICIP, 2016.
- M. Eisenbach, R. Stricker, D. Seichter, K. Amende, K. Debes, M. Sesselmann, D. Ebersbach, U. Stoeckert, and H.-M. Gross, “How to get pavement distress detection ready for deep learning? a systematic approach.” in IJCNN, 2017.
- Y. Shi, L. Cui, Z. Qi, F. Meng, and Z. Chen, “Automatic road crack detection using random structured forests,” TITS, vol. 17, no. 12, pp. 3434–3445, 2016.
- R. Amhazand, S. Chambon, J. Idier, and V. Baltazart, “Automatic crack detection on two-dimensional pavement images: An algorithm based on minimal path selection,” TITS, vol. 17, no. 10, pp. 2718–2729, 2016.
- Q. Zou, Y. Cao, Q. Li, Q. Mao, and S. Wang, “Cracktree: Automatic crack detection from pavement images,” Pattern Recognition Letters, vol. 33, no. 3, pp. 227–238, 2012.
- L. Yang, B. Li, W. Li, L. Zhaoming, G. Yang, and J. Xiao, “Deep concrete inspection using unmanned aerial vehicle towards cssc database,” in IROS, 2017.
- Z. Wang, A. C. Bovik, H. R. Sheikh, and E. P. Simoncelli, “Image quality assessment: from error visibility to structural similarity,” IEEE Transactions on image processing, vol. 13, no. 4, pp. 600–612, 2004.
- M. Haris, G. Shakhnarovich, and N. Ukita, “Space-time-aware multi-resolution video enhancement,” in CVPR, 2020.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.