Building Damage Detection in Satellite Imagery Using Convolutional Neural Networks

Published 14 Oct 2019 in cs.CV, cs.LG, eess.IV, and stat.ML | (1910.06444v1)

Abstract: In all types of disasters, from earthquakes to armed conflicts, aid workers need accurate and timely data such as damage to buildings and population displacement to mount an effective response. Remote sensing provides this data at an unprecedented scale, but extracting operationalizable information from satellite images is slow and labor-intensive. In this work, we use machine learning to automate the detection of building damage in satellite imagery. We compare the performance of four different convolutional neural network models in detecting damaged buildings in the 2010 Haiti earthquake. We also quantify how well the models will generalize to future disasters by training and testing models on different disaster events.

Abstract PDF Upgrade to Chat

Citations (121)

View on Semantic Scholar

Summary

The paper demonstrates that twin-tower CNN architectures, especially the TTS variant, achieve a high AUC of 0.8302 for detecting building damage.
It employs a novel data generation pipeline using manual UNOSAT assessments and ML filtering to process high-resolution pre- and post-disaster images.
The study underscores the importance of diverse training data for cross-region generalization, enhancing model performance across various disaster events.

Building Damage Detection in Satellite Imagery Using CNNs

This paper focuses on automating the detection of building damage in satellite imagery using convolutional neural networks (CNNs). The research explores various CNN architectures to assess their efficacy in identifying damaged buildings, particularly following disasters like earthquakes.

Data Generation Pipeline

The study begins by constructing a novel data set comprising satellite imagery from three significant disaster events: Haiti, Mexico City, and Indonesia earthquakes. The data generation process includes obtaining high-resolution pre- and post-disaster images primarily sourced from DigitalGlobe's satellites. Building detection involves leveraging manual damage assessments, where damaged buildings are identified and labeled using a Faster-RCNN model trained on extensive datasets (Figure 1).

Figure 1: The data generation pipeline: pre- and post-disaster images are processed to identify damaged buildings through UNOSAT assessments.

Notably, the generation of undamaged building examples employs an ML model to detect structures, filtering out those tagged as damaged. These examples undergo standard normalization and data augmentation techniques to enhance model training.

Model Architecture Comparisons

The research evaluates four distinct CNN architectures: Concatenated Channel (CC), Post-image Only (PO), Twin-tower Concatenate (TTC), and Twin-tower Subtract (TTS). Each architecture processes satellite imagery differently to detect building damage.

Figure 2: Model architecture variants demonstrate different approaches to processing satellite imagery input.

The Twin-tower Subtract (TTS) model emerges as the most effective, achieving the highest AUC of 0.8302. This indicates superior capability in capturing differences between pre- and post-disaster images, which are indicative of damage. The twin-tower models generally outperform single tower models, emphasizing the importance of comparing pre- and post-disaster imagery at a feature level.

Cross-Region Generalization

A critical component of the research is assessing model generalization across different disasters and regions. Experiments reveal that models trained exclusively on one dataset perform poorly when tested on another due to insufficient diversity in training data. However, training models on multiple datasets enhances generalization. Furthermore, introducing manual data annotations from novel disaster conditions significantly boosts model accuracy.

The results underline the necessity for diverse training data to mitigate overfitting risks and improve cross-region performance. They also suggest strategic deployment involving modest manual annotations in new disaster zones to optimize model accuracy.

Conclusion

The paper successfully demonstrates the potential of CNNs in automating building damage detection from satellite imagery. It highlights the effectiveness of twin-tower architectures, particularly the TTS variant, and emphasizes the importance of data diversity for model generalizability. Future work aims to extend model applicability to new disaster types and refine robustness against common satellite image imperfections.

Overall, this research provides valuable insights into using automated methodologies for rapid and accurate damage assessment in disaster response scenarios, potentially transforming operational capabilities in humanitarian aid logistics.

The advancement of this research challenges existing paradigms in damage assessment, showcasing CNNs as viable tools for enhancing accuracy and efficiency in disaster response operations. Future endeavors may focus on augmenting training data repositories and refining algorithmic robustness to support real-time, scalable deployment in various disaster scenarios.

Markdown Report Issue