Physics-Informed Neural Networks: A Didactic Derivation of the Complete Training Cycle

Abdeladhim Tahimi

✨ TL;DR

This paper provides a complete, step-by-step manual derivation of how Physics-Informed Neural Networks (PINNs) are trained, including forward propagation, loss computation, and backpropagation with explicit numerical examples. It bridges the gap between automatic differentiation libraries and the underlying mathematical operations, making PINN training transparent and verifiable.

01 · Problem

Existing tutorials and guides on Physics-Informed Neural Networks typically rely on automatic differentiation libraries to handle the training process, treating the underlying mathematical operations as a black box. This creates a pedagogical gap where practitioners use PINNs without understanding the complete algebraic mechanics of how gradients flow through both the network parameters and the physics-based loss terms. The lack of explicit, worked-through examples makes it difficult for learners to verify their understanding or debug implementations, particularly when dealing with the product rule complications that arise when computing gradients of ODE residuals through hidden layers.

02 · Approach

The paper uses a concrete first-order initial value problem with a known analytical solution as a running example throughout. It employs a small 1-3-3-1 multilayer perceptron (one input, two hidden layers with three neurons each, one output) with 22 trainable parameters to demonstrate every calculation with explicit numerical values. The authors manually derive forward propagation for both the network output and its temporal derivative, construct a composite loss function from the ODE residual and initial condition, perform complete backpropagation including the product rule terms in hidden layers, and execute gradient descent updates. From these concrete examples, they generalize to recursive formulas (sensitivity propagation relations) that apply to networks of arbitrary depth and connect these to automatic differentiation frameworks. A companion Jupyter/PyTorch notebook validates all hand-derived calculations against machine-computed gradients.

03 · Key insights

What the paper shows.

01The gradient computation in PINNs requires careful application of the product rule when backpropagating through ODE residuals, creating additional sensitivity terms beyond standard neural network training

02Complete transparency in PINN training can be achieved by working through explicit numerical examples with small networks, making the process verifiable at every step

03Recursive sensitivity propagation relations derived from concrete examples naturally extend to arbitrary network depths and connect to automatic differentiation engines

04Physics-informed loss alone (without data from the true solution) can achieve high accuracy, demonstrated by a relative L² error of 4.290×10⁻⁴ on the test problem

04 · Results

The trained 1-3-3-1 network achieved a relative L² error of 4.290×10⁻⁴ when validated against the exact analytical solution of the first-order initial value problem. This accuracy was obtained using only the physics-informed loss function composed of the ODE residual and initial condition, without incorporating any data points from the true solution. The companion PyTorch implementation successfully reproduced all manually derived gradient calculations, providing mutual validation between hand-computed and automatically differentiated gradients across all 22 trainable parameters throughout the complete training cycle.

05 · Limitations

The paper focuses on a simple first-order ODE with a single spatial/temporal dimension and uses a small network architecture for pedagogical clarity, which may not capture the complexities that arise in higher-dimensional PDEs or deeper networks used in practical applications. The demonstration is limited to a single training iteration with explicit calculations, and does not address convergence behavior, hyperparameter selection, or common training challenges like gradient pathologies in PINNs. The approach is purely didactic and does not propose new methods or improvements to PINN training efficiency or accuracy. The validation is performed on a problem with a known analytical solution, which is not representative of real-world scenarios where PINNs are typically applied to problems without closed-form solutions.

✨ Generated by Claude · Apr 21, 2026 · Read the PDF for authoritative content.

What the paper shows.

↘ Related papers