AID: A Benchmark Dataset for Performance Evaluation of Aerial Scene Classification

Published 18 Aug 2016 in cs.CV | (1608.05167v1)

Abstract: Aerial scene classification, which aims to automatically label an aerial image with a specific semantic category, is a fundamental problem for understanding high-resolution remote sensing imagery. In recent years, it has become an active task in remote sensing area and numerous algorithms have been proposed for this task, including many machine learning and data-driven approaches. However, the existing datasets for aerial scene classification like UC-Merced dataset and WHU-RS19 are with relatively small sizes, and the results on them are already saturated. This largely limits the development of scene classification algorithms. This paper describes the Aerial Image Dataset (AID): a large-scale dataset for aerial scene classification. The goal of AID is to advance the state-of-the-arts in scene classification of remote sensing images. For creating AID, we collect and annotate more than ten thousands aerial scene images. In addition, a comprehensive review of the existing aerial scene classification techniques as well as recent widely-used deep learning methods is given. Finally, we provide a performance analysis of typical aerial scene classification and deep learning approaches on AID, which can be served as the baseline results on this benchmark.

Abstract PDF Upgrade to Chat

Citations (1,566)

View on Semantic Scholar

Summary

The paper introduces AID, a benchmark dataset with over 10,000 aerial images in 30 categories for comprehensive evaluation of scene classification methods.
It offers a detailed baseline analysis comparing traditional and deep learning techniques using standard low-, mid-, and high-level features.
The dataset’s public availability and robust experimental protocols enhance reproducibility and drive future research in remote sensing.

Overview of "AID: A Benchmark Dataset for Performance Evaluation of Aerial Scene Classification"

The paper "AID: A Benchmark Dataset for Performance Evaluation of Aerial Scene Classification" introduces a comprehensive dataset designed to support research in aerial scene classification tasks. Authored by Gui-Song Xia, Jingwen Hu, Fan Hu, Baoguang Shi, Xiang Bai, Yanfei Zhong, and Liangpei Zhang, the paper outlines the construction and significance of the Aerial Image Dataset (AID) and provides a comparative analysis of existing methods in the field.

Introduction to Aerial Scene Classification

Aerial scene classification involves labeling aerial images with specific semantic categories. This task is fundamental for interpreting high-resolution remote sensing imagery and has grown in importance due to the increasing volume and complexity of aerial image data. Traditional datasets such as the UC-Merced and WHU-RS19 have been extensively used but are limited by their size, making it challenging to evaluate the performance of new classification algorithms comprehensively.

The Aerial Image Dataset (AID)

The AID dataset is introduced to address the limitations of existing datasets. It is a large-scale dataset composed of more than 10,000 annotated aerial scene images, spanning 30 diverse categories such as airport, forest, industrial, park, and residential areas. The dataset images are collected from different regions worldwide and vary in resolution, ensuring high intra-class diversity and low inter-class dissimilarity.

Key Contributions

Comprehensive Review: The paper presents an extensive review of aerial scene classification techniques, including traditional machine learning and modern deep learning approaches.
Large-Scale Dataset: AID is one of the largest datasets available for aerial scene classification, significantly advancing the potential for algorithm evaluation and development.
Baseline Evaluation: The authors provide extensive performance analyses of standard classification methods on the AID dataset, offering baseline results to aid future research efforts.
Public Availability: The dataset and the implementation code used for baseline evaluations are made publicly accessible, promoting reproducibility and further innovation in the field.

Evaluation of Existing Methods

The paper categorizes existing aerial scene classification methods into three groups:

Low-Level Features: Methods using descriptors such as SIFT, LBP, Color Histogram, and GIST, which capture basic visual features from images.
Mid-Level Features: Approaches like Bag of Visual Words (BoVW), Spatial Pyramid Matching (SPM), Locality-constrained Linear Coding (LLC), and various probabilistic models that encode local features into holistic representations.
High-Level Features: Deep learning-based methods, including architectures like CaffeNet, VGG-VD-16, and GoogLeNet, which automatically learn hierarchical features from large datasets.

Implications and Future Directions

The introduction of AID paves the way for more robust evaluation of aerial scene classification methods, highlighting the need for higher intra-class variance and lower inter-class dissimilarity in benchmark datasets. The comprehensive experiments and evaluation protocols presented in the paper suggest that high-level deep learning methods outperform traditional methods, demonstrating the efficacy of learned features over hand-crafted ones.

Practical Applications

Remote Sensing and Earth Observation: Enhanced classification accuracy can improve applications such as urban planning, land use monitoring, and environmental change detection.
Military and Surveillance: More accurate aerial scene classification aids in better reconnaissance and strategic planning.

Theoretical Implications

The findings underline the importance of dataset quality and size in training and evaluating machine learning models, specifically in the context of remote sensing imagery. Furthermore, the demonstrated superior performance of deep learning techniques suggests a shift towards data-driven, automated feature extraction methods in future research endeavors.

Conclusion

The AID dataset represents a significant step forward for aerial scene classification research. By providing a large and diverse set of annotated images, the dataset enables more accurate and generalizable evaluations of classification algorithms. The accompanying comprehensive review and baseline results serve as valuable resources for researchers aiming to push the boundaries of remote sensing image analysis. Future developments in AI for remote sensing will likely build upon the foundations laid by resources such as the AID dataset.