- The paper introduces Houdini, an adversarial attack method that directly optimizes task-specific loss functions for deep structured prediction models.
- It integrates a stochastic margin with the task loss to generate subtle yet effective adversarial examples across speech recognition, human pose estimation, and semantic segmentation.
- Empirical results show significant drops in key performance metrics, revealing vulnerabilities and encouraging robust defenses in structured prediction systems.
Overview of "Houdini: Fooling Deep Structured Prediction Models"
The paper "Houdini: Fooling Deep Structured Prediction Models" presents a method for generating adversarial examples, which are inputs intentionally crafted to mislead machine learning models, specifically targeting deep learning structured prediction tasks. Unlike previous methods primarily focused on classification tasks, Houdini targets the direct performance measure of the task of interest—even when these measures are combinatorial and non-differentiable. This innovative approach extends adversarial attack techniques to more complex applications such as speech recognition, human pose estimation, and semantic segmentation.
The authors challenge the conventional reliance on differentiable surrogate losses for structured tasks, instead tying Houdini's attack objective directly to the task's own evaluation measure. In their experiments, attacks driven by Houdini consistently produce larger drops in the target metric, at comparable or smaller perturbation budgets, than attacks driven by the surrogate losses used for training.
Methodological Advancements
Houdini is built on the premise of generating adversarial examples aligned with the task-specific loss function, even when that loss is combinatorial, non-decomposable, or non-differentiable. The chosen case studies, speech recognition, pose estimation, and semantic segmentation, are fields with bespoke performance metrics: Word Error Rate (WER), Percentage of Correct Keypoints (PCKh), and mean Intersection over Union (mIoU), respectively.
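To make the "non-differentiable task loss" point concrete, WER, the speech-recognition metric the paper attacks, is a word-level edit distance, which has no useful gradient with respect to the model's inputs. A minimal reference implementation (not from the paper, just the standard Levenshtein dynamic program) looks like this:

```python
def wer(reference, hypothesis):
    """Word Error Rate: word-level edit distance divided by reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    # Classic Levenshtein dynamic program over words:
    # d[i][j] = min edits turning ref[:i] into hyp[:j].
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i  # i deletions
    for j in range(len(hyp) + 1):
        d[0][j] = j  # j insertions
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            sub = d[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])
            d[i][j] = min(sub, d[i - 1][j] + 1, d[i][j - 1] + 1)
    return d[len(ref)][len(hyp)] / len(ref)
```

The discrete `min` operations are exactly why gradient-based attacks cannot optimize WER directly, and why a surrogate like Houdini is needed.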
Houdini replaces the raw task loss with a probabilistic surrogate: the product of a stochastic margin term, the probability that Gaussian noise flips the score ordering between the ground-truth output and the predicted output, and the task loss itself. This construction keeps the surrogate close to the intended task loss while admitting an analytical gradient, enabling efficient generation of adversarial examples that target the evaluation metric directly.
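A minimal sketch of that surrogate, simplified to scalar scores for illustration (the paper works with a network's structured scoring function; the function name and signature here are ours, not the authors'): the stochastic margin term is the Gaussian tail probability of the score gap, and it scales the actual task loss.

```python
import math

def houdini_loss(score_true, score_pred, task_loss):
    """Illustrative Houdini surrogate (scalar simplification).

    P[ g(x, y) - g(x, y_hat) < gamma ] * loss(y_hat, y),
    with gamma ~ N(0, 1): the probability that unit Gaussian noise
    flips the score ordering, scaled by the real task loss.
    """
    delta = score_true - score_pred  # margin: ground truth vs. prediction
    # Gaussian tail P[gamma > delta], via the complementary error function.
    p_flip = 0.5 * math.erfc(delta / math.sqrt(2.0))
    return p_flip * task_loss
```

Note the intended behavior: when the model confidently prefers the ground truth (large positive margin), the surrogate vanishes; when the prediction overtakes the ground truth, it approaches the full task loss, so maximizing it pushes the model toward high-task-loss outputs.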
Experiments and Results
In practical experiments, Houdini's performance across various structured tasks demonstrated effectiveness both in targeted and untargeted adversarial attacks:
- Human Pose Estimation: Houdini sharply reduced the PCKh metric while requiring less perceptible perturbations than attacks based on the MSE loss used during training. Imperceptible changes to the input were enough to distort accurate keypoint predictions and push them toward an attacker-chosen target pose.
- Semantic Segmentation: Houdini produced a significantly larger decline in mIoU than baseline cross-entropy attacks. Attacked models could even be made to hallucinate an attacker-chosen segmentation map unrelated to the actual scene, demonstrating the reach of Houdini's task-tailored loss.
- Speech Recognition: Houdini was tested extensively on ASR with an end-to-end neural model. Measured by Word and Character Error Rates (WER and CER), Houdini consistently outperformed attacks based on the standard CTC loss, producing higher error rates with less distortion. Notably, human listeners could not reliably distinguish the adversarial audio from clean audio in ABX tests, and the attacks also transferred in a black-box setting against Google's voice-recognition system.
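Across all three tasks, the attack machinery itself is a familiar iterative gradient procedure; only the loss being ascended changes. The sketch below is a generic L-infinity iterative attack loop of the kind the paper's experiments rely on, with a hypothetical `grad_fn` standing in for the gradient of the Houdini loss with respect to the input (the step sizes and budget are illustrative, not the paper's values):

```python
import numpy as np

def iterative_attack(x, grad_fn, eps=0.03, alpha=0.005, steps=20):
    """Illustrative iterative L-inf attack (PGD-style sketch).

    grad_fn(x_adv) is assumed to return the gradient of the attack loss
    (e.g., a Houdini surrogate) with respect to the input x_adv.
    """
    x = np.asarray(x, dtype=float)
    x_adv = x.copy()
    for _ in range(steps):
        g = grad_fn(x_adv)                         # loss gradient w.r.t. input
        x_adv = x_adv + alpha * np.sign(g)         # ascend the attack loss
        x_adv = np.clip(x_adv, x - eps, x + eps)   # keep perturbation small
    return x_adv
```

The clip to an eps-ball around the original input is what keeps the perturbations perceptually subtle, which is the property the paper's ABX listening tests and perceptibility comparisons probe.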
Implications and Speculations
Houdini's loss-driven design gives the community a robust new tool for probing the reliability of structured prediction models. The findings expose concrete vulnerabilities in current systems and open avenues for both stronger adversarial strategies and defensive countermeasures, including task-specific robustness analysis and loss-aware adversarial mitigation.
Overall, the paper's theoretical insights and practical results push the community toward evaluation paradigms in which adversarial robustness is measured against task-specific performance metrics rather than classification accuracy alone. This groundwork is a prerequisite for realistic robustness assessment before deploying learning systems in sensitive, mission-critical applications.