The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions

Published 28 Mar 2018 in cs.CV | (1803.10417v3)

Abstract: Training of neural networks for automated diagnosis of pigmented skin lesions is hampered by the small size and lack of diversity of available datasets of dermatoscopic images. We tackle this problem by releasing the HAM10000 ("Human Against Machine with 10000 training images") dataset. We collected dermatoscopic images from different populations acquired and stored by different modalities. Given this diversity we had to apply different acquisition and cleaning methods and developed semi-automatic workflows utilizing specifically trained neural networks. The final dataset consists of 10015 dermatoscopic images which are released as a training set for academic machine learning purposes and are publicly available through the ISIC archive. This benchmark dataset can be used for machine learning and for comparisons with human experts. Cases include a representative collection of all important diagnostic categories in the realm of pigmented lesions. More than 50% of lesions have been confirmed by pathology, while the ground truth for the rest of the cases was either follow-up, expert consensus, or confirmation by in-vivo confocal microscopy.

Abstract PDF Upgrade to Chat

Citations (2,396)

View on Semantic Scholar

Summary

The paper introduces a comprehensive dataset that unifies 10,015 multi-source dermatoscopic images for training neural networks.
It employs automated and manual cleaning techniques, achieving a top-1 accuracy of 98.68% to ensure high-quality imaging.
The dataset supports diagnosing diverse pigmented skin lesions and advances multi-class computational dermatology research.

Analyzing the HAM10000 Dataset: A Comprehensive Compilation of Dermatoscopic Images for Machine Learning Applications

The paper "The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions" authored by Philipp Tschandl, Cliff Rosendahl, and Harald Kittler, provides an extensive dataset designed to support the development and benchmarking of neural networks for the automated diagnosis of pigmented skin lesions. This HAM10000 ("Human Against Machine with 10000 training images") dataset addresses several limitations found in pre-existing datasets by providing a large, diverse, and well-annotated collection of dermatoscopic images.

Motivation and Background

The automated diagnosis of pigmented skin lesions using neural networks has faced challenges, primarily due to the limited size and diversity of available datasets. Previous datasets, such as PH2 and others available through the ISIC archive, have typically consisted of a restricted set of diseases, notably overly biased towards melanocytic lesions like melanomas and nevi. The lack of a diverse, comprehensive dataset has hampered the ability of machine learning algorithms to perform reliably in clinical settings.

Overview of HAM10000

The HAM10000 dataset consists of 10,015 dermatoscopic images sourced from two primary locations: the Department of Dermatology at the Medical University of Vienna, Austria, and a skin cancer practice in Queensland, Australia. The dataset includes images acquired through a variety of methodologies over a period of 20 years, making it a rich resource for developing robust diagnostic algorithms.

Methodological Rigor

The development of the HAM10000 dataset included several key methodological steps:

Image Collection and Standardization: Images were compiled from various sources and formats, including PowerPoint files, digital dermatoscopy systems, and digitized diapositives. These were standardized to ensure uniformity in terms of image format and resolution.
Automated and Manual Data Cleaning: Extraction of relevant images and metadata was followed by a semi-automated categorization process. A fine-tuned InceptionV3 network was employed to filter out non-dermatoscopic images, achieving a top-1 accuracy of 98.68%. Final manual reviews ensured the quality by removing irrelevant or suboptimal images.
Unification of Diagnoses: Variability in histopathological diagnoses was addressed by unifying diagnosis terminology and forming seven distinct diagnostic categories (akiec, bcc, bkl, df, mel, nv, and vasc).
Ground Truth Validation: The dataset's ground truth was established through multiple methodologies, including histopathology, reflectance confocal microscopy, follow-up assessments, and expert consensus.

Implications of the HAM10000 Dataset

Practical Implications:

The availability of the HAM10000 dataset is a significant step forward in training neural networks to diagnose a wide range of pigmented skin lesions. The large and diverse nature of the dataset increases the potential for developing algorithms that are more generalizable and capable of performing reliably in real-world clinical settings.

Theoretical Implications:

By encompassing diverse diagnostic categories and a balanced representation of both melanocytic and non-melanocytic lesions, HAM10000 facilitates the research community to move beyond binary classification tasks towards more complex, multi-class classification problems. This shift is critical for advancing the field of computational dermatology.

Future Directions in AI Dermatology

The aforementioned dataset can inspire several future research avenues:

Improving Diagnostic Algorithms: Use the HAM10000 dataset to refine existing machine learning models and explore novel architectures that could achieve even higher diagnostic accuracy.
Human-Machine Collaboration: Benchmark the performance of neural networks against human experts to evaluate and enhance collaborative diagnostic techniques.
Expanding Dataset Applications: Explore the use of the dataset for ancillary applications such as automatic feature extraction, lesion segmentation, and personalized treatment suggestions.

Conclusion

The HAM10000 dataset represents a valuable resource for advancing the field of automated dermatoscopic diagnosis. By addressing the limitations of previous datasets and providing a comprehensive, well-curated collection of images, it lays a robust foundation for future research. This dataset has the potential to significantly contribute to the development of more effective and reliable diagnostic tools in dermatology, ultimately improving patient outcomes through enhanced diagnostic accuracy.

Markdown Report Issue