Papers
Topics
Authors
Recent
Search
2000 character limit reached

Stroke Controllable Fast Style Transfer with Adaptive Receptive Fields

Published 20 Feb 2018 in cs.CV and eess.IV | (1802.07101v4)

Abstract: The Fast Style Transfer methods have been recently proposed to transfer a photograph to an artistic style in real-time. This task involves controlling the stroke size in the stylized results, which remains an open challenge. In this paper, we present a stroke controllable style transfer network that can achieve continuous and spatial stroke size control. By analyzing the factors that influence the stroke size, we propose to explicitly account for the receptive field and the style image scales. We propose a StrokePyramid module to endow the network with adaptive receptive fields, and two training strategies to achieve faster convergence and augment new stroke sizes upon a trained model respectively. By combining the proposed runtime control strategies, our network can achieve continuous changes in stroke sizes and produce distinct stroke sizes in different spatial regions within the same output image.

Citations (114)

Summary

  • The paper introduces a stroke-controllable style transfer network featuring the innovative StrokePyramid module for adaptive receptive field control.
  • It employs both progressive and incremental training strategies to ensure faster convergence and the incorporation of diverse stroke sizes.
  • Experimental results demonstrate superior visual quality and consistent control over continuous and spatial stroke variations compared to traditional methods.

Stroke Controllable Fast Style Transfer with Adaptive Receptive Fields

The paper "Stroke Controllable Fast Style Transfer with Adaptive Receptive Fields" addresses a significant challenge in the domain of neural style transfer which involves providing control over stroke size in stylized images. Conventional neural style transfer methods have focused extensively on converting content photographs into artistic imagery; however, controlling the stroke size has remained an unrefined aspect.

Key Contributions

The researchers propose a stroke controllable style transfer network that integrates continuous and spatial stroke size controls. This innovative approach focuses on two primary factors affecting stroke size: the receptive field of the network and the scale of the style image. To address these factors, the authors introduce the StrokePyramid module. This module enables the network to adaptively alter receptive fields and facilitates the learning of various stroke sizes with different receptive fields.

Three notable contributions are highlighted:

  1. Identification of Stroke Size Influences: The paper analyzes factors influencing stroke size in stylized images and suggests considering receptive field and style image scale for effective stroke size control.
  2. Network Architecture & Training Strategies: The authors propose a novel network architecture and outline two distinctive training strategies—progressive and incremental training—to achieve both faster convergence and the inclusion of new stroke sizes in a trained model.
  3. Runtime Control Strategies: The implementation of continuous stroke size control and spatial stroke size control strategies equips the network to produce varied stroke sizes in single outputs and across different spatial regions.

Results and Comparison

The paper provides substantial numerical outcomes through comparative analysis with existing methods. The StrokePyramid approach allows for flexible manipulation of stroke size without compromising visual quality or efficiency. The demonstrated results consistently achieve superior quality and finer stroke preservation, compared to prominent style transfer algorithms. The experimental evaluations indicate the network's ability to achieve flexible continuous and spatial control over stroke sizes, thus providing artists and practitioners enhanced control over the aesthetic dimensions of their work.

Implications and Future Directions

Though primarily focused on artistic style transfer, the implications of adaptive receptive fields extend beyond this domain. This approach may contribute to advancing understanding of human visual systems, leading to applications in aesthetic assessment, image compression, and colorization tasks. Future work could explore integrating semantic segmentation tools directly within the network architecture to automate spatial stroke size adjustments further.

The paper does not overstate the results but presents a systematic method to achieve more refined control in neural style transfer, opening avenues for extensive research on dynamic receptive fields and their influence on other perceptual parameters in deep learning models. The proposition and validation of stroke-size control within a singular model demonstrate a persistent step towards unifying artistic expression with computational efficiency, a vital interest for both researchers and creative professionals.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Collections

Sign up for free to add this paper to one or more collections.