Tokenised Flow Matching for Hierarchical Simulation Based Inference

Giovanni Charles; Cosmo Santoni; Seth Flaxman; Elizaveta Semenova

✨ TL;DR

This paper proposes Tokenised Flow Matching for Posterior Estimation (TFMPE), a method that reduces simulator evaluation costs in hierarchical simulation-based inference by learning per-site neural surrogates and assembling synthetic observations. The approach is validated on infectious disease and computational fluid dynamics models with improved computational efficiency.

01 · Problem

Simulation-based inference (SBI) is computationally expensive because it requires many simulator evaluations. In hierarchical settings with shared global parameters and site-level observations, existing SBI methods still require simulating across multiple sites per training sample, which is inefficient. There is a need to exploit the hierarchical structure to reduce the number of required simulator calls while maintaining inference quality.

02 · Approach

The paper proposes likelihood factorisation (LF) to train from single-site simulations rather than multi-site ones. The method learns a per-site neural surrogate of the simulator and then assembles synthetic multi-site observations to amortise inference for the full hierarchical posterior. Building on this foundation, TFMPE uses tokenised flow matching to support function-valued observations through likelihood factorisation, enabling efficient hierarchical posterior estimation.

03 · Key insights

What the paper shows.

01Likelihood factorisation enables training from single-site simulations, reducing the number of required simulator evaluations compared to posterior factorisation approaches

02Neural surrogates of per-site simulators can be assembled to create synthetic multi-site observations for amortised inference

03Tokenised flow matching can be extended to handle function-valued observations in hierarchical settings

04The hierarchical structure with exchangeable site-level parameters and observations is key to achieving computational efficiency gains

04 · Results

TFMPE produces well-calibrated posteriors while significantly reducing computational cost compared to existing hierarchical SBI methods. The approach is validated on a newly introduced benchmark for hierarchical SBI as well as realistic models including infectious disease simulations and computational fluid dynamics models, demonstrating practical applicability across different domains.

05 · Limitations

The paper focuses on hierarchical settings with exchangeable site-level parameters and observations, which may limit applicability to non-hierarchical or non-exchangeable problems. The quality of the learned neural surrogates depends on their training, and the approach's performance on very high-dimensional or highly nonlinear simulators is not thoroughly explored. The computational savings are relative to existing hierarchical SBI methods, but absolute computational requirements for training surrogates are not extensively discussed.

✨ Generated by Claude · Apr 25, 2026 · Read the PDF for authoritative content.

What the paper shows.

↘ Related papers