Safe Control using Learned Safety Filters and Adaptive Conformal Inference

Sacha Huriot; Ihab Tabbara; Hussein Sibai

✨ TL;DR

This paper introduces Adaptive Conformal Filtering (ACoFi), which combines learned safety filters with adaptive conformal inference to provide soft safety guarantees for control systems. The method dynamically adjusts switching criteria between nominal and safe policies based on prediction uncertainty, achieving better safety performance than fixed-threshold approaches.

01 · Problem

Safety filters are used to ensure control systems remain safe even when nominal policies are unsafe, but traditional synthesis methods face scalability issues with high-dimensional systems. Learning-based safety filters have been proposed as alternatives, but they suffer from inevitable prediction errors that compromise reliability and safety guarantees. The key challenge is how to account for these errors while maintaining safety assurances in real-world applications where the learned models may encounter distribution shifts or make incorrect predictions about action safety.

02 · Approach

ACoFi combines Hamilton-Jacobi reachability-based safety filters with adaptive conformal inference to create a dynamic switching mechanism. The method uses conformal prediction to quantify uncertainty in the learned safety filter's predictions by computing a range of possible safety values for the nominal policy's actions. When this uncertainty range suggests potential unsafety, the filter switches from the nominal policy to a learned safe policy. The switching threshold adapts over time based on observed prediction errors, allowing the system to learn from its mistakes and adjust its conservativeness accordingly. This adaptive mechanism is grounded in conformal inference theory, which provides statistical guarantees on the miscoverage rate.
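The switching loop described above can be sketched in a few lines of Python. This is a minimal illustration, not the paper's implementation: the class name `AdaptiveConformalSwitch`, the absolute-residual nonconformity score, the step size `step_size`, the switching `threshold` of 0, and the idea that a reference safety value becomes available after each step are all assumptions made for the example. The update rule is the standard adaptive conformal inference recursion.

```python
import math


class AdaptiveConformalSwitch:
    """Hypothetical sketch: conformal range around a learned safety value."""

    def __init__(self, target_miscoverage=0.1, step_size=0.05, threshold=0.0):
        self.alpha = target_miscoverage    # user-defined miscoverage target
        self.alpha_t = target_miscoverage  # adaptive level, updated online
        self.gamma = step_size             # assumed adaptation step size
        self.threshold = threshold         # assumed: values below this are unsafe
        self.scores = []                   # past |observed - predicted| residuals

    def half_width(self):
        """Conformal quantile of past residuals at level 1 - alpha_t."""
        n = len(self.scores)
        if n == 0 or self.alpha_t <= 0.0:
            return math.inf                # no evidence yet: be fully conservative
        if self.alpha_t >= 1.0:
            return 0.0
        k = math.ceil((n + 1) * (1.0 - self.alpha_t))
        if k > n:
            return math.inf
        return sorted(self.scores)[k - 1]

    def use_safe_policy(self, predicted_value):
        """Switch when the lower end of the range suggests unsafety."""
        return predicted_value - self.half_width() < self.threshold

    def update(self, predicted_value, observed_value):
        """Adapt alpha_t from the observed prediction error."""
        score = abs(observed_value - predicted_value)
        err = 1.0 if score > self.half_width() else 0.0
        # Miscoverage (err = 1) lowers alpha_t, which widens future ranges;
        # coverage (err = 0) raises it, which narrows them.
        self.alpha_t += self.gamma * (self.alpha - err)
        self.scores.append(score)
        return err
```

At each step, a controller built on this sketch would call `use_safe_policy` with the learned safety value of the nominal action, apply the nominal or safe action accordingly, and then call `update` once a reference value is available. Returning an infinite half-width before any residuals are seen is a deliberately conservative choice for the example.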

03 · Key insights

What the paper shows.

01 Conformal inference can be applied to safety filtering to provide probabilistic guarantees on prediction uncertainty rather than hard safety constraints.

02 Dynamic adjustment of switching thresholds based on observed errors allows the system to balance safety and performance more effectively than fixed thresholds.

03 Quantifying uncertainty through ranges of possible safety values provides a principled way to make conservative decisions when predictions are unreliable.

04 The approach provides asymptotic guarantees that the rate of incorrectly quantifying safety uncertainty is bounded by a user-defined parameter, offering soft safety guarantees.
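The asymptotic guarantee in the last point can be written out using the standard adaptive conformal inference recursion (Gibbs and Candès, 2021). The symbols here are assumptions for illustration and may differ from the paper's notation: $\alpha$ is the user-defined miscoverage parameter, $\alpha_t$ the adaptive level at step $t$, $\gamma > 0$ a step size, and $\mathrm{err}_t$ an indicator of miscoverage.

```latex
\[
\alpha_{t+1} = \alpha_t + \gamma\,(\alpha - \mathrm{err}_t),
\qquad
\mathrm{err}_t =
\begin{cases}
1 & \text{if the true safety value falls outside the predicted range at step } t,\\
0 & \text{otherwise.}
\end{cases}
\]
\[
\left|\frac{1}{T}\sum_{t=1}^{T}\mathrm{err}_t - \alpha\right|
\le \frac{\max(\alpha_1,\, 1-\alpha_1) + \gamma}{\gamma\, T}
\quad\Longrightarrow\quad
\lim_{T\to\infty}\frac{1}{T}\sum_{t=1}^{T}\mathrm{err}_t = \alpha .
\]
```

The bound shrinks as $1/T$, so the long-run miscoverage rate approaches $\alpha$ without distributional assumptions on the data. It constrains how often the range is wrong, not how often the system is unsafe, which is why the guarantee is described as soft.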

04 · Results

ACoFi was evaluated in two environments: a Dubins car simulation and Safety Gymnasium. The method significantly outperformed baseline approaches using fixed switching thresholds, achieving higher learned safety values while incurring fewer safety violations. The improvements were particularly pronounced in out-of-distribution scenarios where the learned models faced conditions different from their training data. The adaptive nature of ACoFi allowed it to adjust to these challenging scenarios more effectively than non-adaptive baselines, demonstrating both better safety performance and less conservative behavior when the nominal policy was actually safe.

05 · Limitations

The paper provides soft rather than hard safety guarantees: the bound applies to the long-run rate at which the conformal range misquantifies safety, not to safety violations themselves, so violations remain possible. The guarantees are asymptotic, so they hold reliably only once enough time steps have been observed. The method's performance depends on the quality of the learned safety filter and the learned safe policy, both of which may still make errors. The user-defined miscoverage parameter must be tuned carefully to balance safety and performance. Finally, the evaluation is limited to simulation environments, and real-world deployment may face challenges not captured in these settings.

✨ Generated by Claude · Apr 21, 2026 · Read the PDF for authoritative content.
