A Visual Quality Assessment Method for Raster Images in Scanned Document

Published 25 Jul 2023 in eess.IV | (2307.13241v1)

Abstract: Image quality assessment (IQA) is an active research area in the field of image processing. Most prior works focus on visual quality of natural images captured by cameras. In this paper, we explore visual quality of scanned documents, focusing on raster image areas. Different from many existing works which aim to estimate a visual quality score, we propose a machine learning based classification method to determine whether the visual quality of a scanned raster image at a given resolution setting is acceptable. We conduct a psychophysical study to determine the acceptability at different image resolutions based on human subject ratings and use them as the ground truth to train our machine learning model. However, this dataset is unbalanced as most images were rated as visually acceptable. To address the data imbalance problem, we introduce several noise models to simulate the degradation of image quality during the scanning process. Our results show that by including augmented data in training, we can significantly improve the performance of the classifier to determine whether the visual quality of raster images in a scanned document is acceptable or not for a given resolution setting.

Abstract PDF Upgrade to Chat

Authors (7)

Summary

The paper introduces a machine learning-based classification method for assessing scanned raster image quality using psychophysical benchmarks.
It employs data augmentation with noise models to address dataset imbalance and simulate realistic scanning artifacts, enhancing classification performance.
The study demonstrates significant potential for improving digital document quality assurance in archiving and digital printing industries.

A Visual Quality Assessment Method for Raster Images in Scanned Documents

Introduction

The paper "A Visual Quality Assessment Method for Raster Images in Scanned Documents" (2307.13241) addresses the niche yet significant problem of assessing the visual quality of raster images in scanned documents. Unlike traditional Image Quality Assessment (IQA) research, which primarily focuses on natural images captured via digital cameras, this work zeroes in on scanned imagery. This distinction is critical given the unique artifacts and quality degradations introduced through the scanning process.

Methodology

The proposed methodology diverges from classical IQA techniques that typically aim to quantify image fidelity through a visual quality score. Instead, the authors introduce a machine learning-based classification approach to assess whether the quality of scanned raster images is considered acceptable at specific resolution settings. Leveraging a psychophysical study, the authors use human subject ratings to establish ground truth benchmarks for image quality acceptability, subsequently guiding the training of their classification model.

A notable challenge identified is the imbalance in the dataset, where a disproportionate number of images were deemed visually acceptable. To mitigate this, the authors employ several noise models that emulate potential degradation effects encountered during scanning. This data augmentation strategy significantly enhances the classifier's ability to discern acceptable from unacceptable quality images across varying resolution settings.

Data Collection and Psychophysical Study

The dataset collection, as detailed, involves curating a corpus of scanned documents with specific focus on raster image areas. Human subjects participated in a psychophysical experiment to rate these images and determine thresholds of quality acceptability. These ratings serve as a foundational ground truth for training purposes. The paper outlines the meticulous process of dataset creation and the experimental setup used to ensure robust and reliable human-derived quality assessments.

Experimental Results

The experimental results underscore the efficacy of the proposed method. By incorporating data augmentation via artificially degraded image examples based on realistic noise models, the classification performance improved markedly. This improvement empirically demonstrates the strength of the data-driven approach in learning complex quality determining features beyond traditional image quality metrics.

Conclusion and Implications

In conclusion, this research contributes a novel path for visual quality assessment among scanned documents, a domain often overshadowed by broader IQA research. The intersection of machine learning classification methods with psychophysically grounded assessments offers a promising avenue for improving automatic quality evaluation systems for scanned images. The implications of this work are particularly relevant for industries reliant on document digitization and quality assurance, such as archiving and digital printing enterprises.

Future Directions

This approach opens potential future work in extending the model's applicability across a wider spectrum of scanning devices and contexts. Additionally, developments could focus on refining noise models to cover an even broader range of quality degradation scenarios. The integration of advanced machine learning models, possibly leveraging deep learning architectures, may also further enhance the robustness and scalability of such quality assessment systems.

Overall, the paper enriches the discourse in image processing by advocating for tailored methodologies that recognize the distinct characteristics and requirements inherent in the scanning artifact landscape.

Markdown Report Issue