Duality for the Adversarial Total Variation

Leon Bungert; Lucas Schmitt

✨ TL;DR

This paper establishes a mathematical duality framework for adversarial total variation, showing that adversarial training of binary classifiers can be understood through nonlocal calculus of variations. The work provides rigorous characterizations of subdifferentials using dual representations and integration by parts formulas in both metric and Euclidean spaces.

01 · Problem

Adversarial training, a technique for making machine learning classifiers robust to small input perturbations, lacks a complete mathematical foundation. While it has been recognized that adversarial training can be reformulated as regularized risk minimization with a nonlocal total variation term, the mathematical properties of this regularizer—particularly its subdifferential structure—remain poorly understood. Understanding the subdifferential is crucial for optimization algorithms and theoretical analysis of adversarial robustness, but the nonlocal nature of the adversarial total variation makes standard calculus of variations techniques inapplicable.

02 · Approach

The authors develop a duality theory for the adversarial total variation by deriving a dual representation analogous to classical total variation duality. They introduce nonlocal versions of gradient and divergence operators and establish an integration by parts formula that connects these operators. The framework is developed in two settings: first for continuous functions vanishing at infinity on proper metric spaces (providing generality), and second for essentially bounded functions on Euclidean domains (providing practical applicability). Using these dual representations, they characterize the subdifferential of the adversarial total variation through variational techniques.

03 · Key insights

What the paper shows.

01Adversarial training naturally corresponds to a nonlocal total variation regularization, connecting robust machine learning to calculus of variations

02A dual representation of adversarial total variation can be established using nonlocal gradient and divergence operators, extending classical duality theory to the nonlocal setting

03Integration by parts formulas exist for nonlocal operators, providing a foundation for variational analysis of adversarial robustness

04The subdifferential structure can be characterized in both abstract metric spaces and concrete Euclidean domains, offering both theoretical generality and computational applicability

04 · Results

The paper establishes complete dual representations of the adversarial total variation in two function space settings. For continuous functions vanishing at infinity on proper metric spaces, they prove a duality formula and derive the corresponding subdifferential characterization. For essentially bounded functions on Euclidean domains, they provide analogous results with additional technical conditions. The integration by parts formula connects the nonlocal gradient and divergence operators, mirroring classical results but in the nonlocal context. Under specified regularity conditions, they provide explicit characterizations of subdifferentials that can be used for optimization and analysis of adversarially trained classifiers.

05 · Limitations

The subdifferential characterizations require additional technical conditions beyond the basic duality results, which may not hold in all practical scenarios. The framework is developed primarily for binary classifiers, and extension to multi-class settings is not addressed. The paper is highly theoretical and does not provide computational algorithms or numerical experiments demonstrating practical application of the duality results. The proper metric space setting, while mathematically general, may be too abstract for direct implementation in machine learning applications. The conditions under which the subdifferential characterizations hold may be restrictive for some adversarial training scenarios encountered in practice.

✨ Generated by Claude · Apr 21, 2026 · Read the PDF for authoritative content.

What the paper shows.

↘ Related papers