Module IV·Article I·~4 min read

Stochastic Differential Games: Problem Statement

Stochastic Differential Games

Turn this article into a podcast

Pick voices, format, length — AI generates the audio

Uncertainty in Strategic Interaction

In deterministic games, the future is predetermined: knowing the initial state and the strategies of both players, one can accurately predict the trajectory. In real systems, there is always noise: market volatility, turbulence, thermal fluctuations. Stochastic differential games (SDG) generalize deterministic theory to the case when the dynamics includes a random perturbation in the form of Brownian motion. This opens up new analytical tools: stochastic HJI, BSDE, connections with financial mathematics.

Stochastic Dynamics

Stochastic differential equation (SDE):

dX = f(X, u, v, t) dt + σ(X, t) dW

Here:

  • X ∈ ℝⁿ — state
  • u ∈ U, v ∈ V — controls of the players
  • σ(X, t) ∈ ℝ^{n×m} — volatility matrix
  • W — m-dimensional standard Brownian process (Wiener process)

Functional: J = E[g(X(T)) + ∫₀ᵀ F(X, u, v, t) dt] — expected cost.

Goal: min_u max_v J (or vice versa).

Bellman's Principle in the Stochastic Case

For V(x,t) = min_u max_v E[...| X(t) = x]:

V(x,t) = min_u max_v E_{dW}[F dt + V(X+dX, t+dt)]

We use Ito's formula: dV = V_t dt + ∇V·dX + (1/2)tr(σσᵀ ∇²V) dt.

Substituting and taking the limit:

Stochastic HJI equation:

∂V/∂t + (1/2)tr(σσᵀ ∇²V) + H*(x, t, ∇V) = 0

The additional term (1/2)tr(σσᵀ ∇²V) — the “Itô term” (a consequence of Ito’s formula for a stochastic differential).

Comparison:

  • Deterministic HJI: ∂V/∂t + H* = 0
  • Stochastic HJI: ∂V/∂t + (σ²/2)∆V + H* = 0

Stochasticity adds the Laplacian ∆V = tr(∇²V), which “smooths” the value function.

Optimal Strategies and the Minimax Principle

Optimal game Hamiltonian: H*(x,t,p,Q) = min_{u∈U} max_{v∈V} [F + pᵀf + (1/2)tr(σσᵀQ)]

Note: (1/2)tr(σσᵀQ) does not depend on u and v → the “noise” term does not affect the optimal strategies!

Consequence: the optimal u* and v* in a stochastic game are the same as in the deterministic game (with the same H). Noise changes the value of the game V but not the optimal strategies.

Related Backward Stochastic Differential Equations

For problems with horizon T and a terminal condition, V can be represented via a BSDE (backward SDE):

Forward SDE (for X): dX = f(X, u*, v*, t)dt + σ dW, X(0) = x₀.

Backward SDE (for Y ≈ V(X,t), Z ≈ σᵀ∇V):

dY = −H(X, u*, v*, ∇V, t) dt + Zᵀ dW, Y(T) = g(X(T)).

The pair (Y, Z) — “adjoint processes”. Z is the martingale component, analogous to the “volatility” of the adjoint process.

Connection between BSDE and the Pontryagin principle: the adjoint variable p(t) in the stochastic case is not just an ODE, but an SDE for (p, q), where q is an additional martingale term.

Complete Analysis: Stochastic LQ-Game

Problem: dX = (u+v)dt + σ dW, X(0) = 1, J = E[X(T)² + ∫₀ᵀ (u² − v²)dt].

Stochastic Riccati-Isaacs equation (same form as in the deterministic case):

−Ṗ = −P² + P² = 0 → P = const = 1.

Optimal strategies: u* = −X, v* = X (the same as in the deterministic problem!).

Closed system: dX = (−X+X)dt + σ dW = σ dW → X(t) = 1 + σW(t).

Value of the game: V = E[X(T)²] + ∫₀ᵀ E[u² − v²]dt = E[(1+σW(T))²] = 1 + σ²T.

Deterministic case (σ=0): V = 1. Stochastic: V = 1 + σ²T.

Physical meaning: noise “increases” the game’s value. This is intuitive: under random walks of X, the quadratic functional is larger due to the “spread” of X.

Connection with Stochastic Control and Finance

With a single player (no v): stochastic optimal control (Fleming-Rishel). HJB equation: ∂V/∂t + (σ²/2)∆V + min_u{F + p·f} = 0.

Black-Scholes equation is a particular case! dS = μS dt + σS dW (stock price). The option price V satisfies: ∂V/∂t + (1/2)σ²S²∂²V/∂S² + rS ∂V/∂S − rV = 0. This is HJB without control (u = 0) + boundary condition. All of financial option theory is a particular case of stochastic optimal control!

Stochastic Differential Games

When the dynamics is subject to random perturbations (market noise, wind, measurement uncertainty), the classical HJI is generalized to the stochastic HJI:

V_t + min_u max_v {F + ∇V · f + (1/2) tr(σσᵀ ∇²V)} = 0

The extra term tr(σσᵀ ∇²V) is the “diffusion” of the value function due to noise. The equation is parabolic (not hyperbolic, as in the deterministic case), which simplifies analysis: viscosity solutions exist under broad conditions.

Connection with Financial Mathematics

The Black-Scholes equation for option pricing is a particular case of a stochastic differential game (with one player—the option seller hedging against market movements). Extensions—games between traders (market making), optimal execution of large orders accounting for price impact.

Numerical Methods

  • Stochastic grids: extension of finite differences
  • Symmetric splitting: decomposition into deterministic and noise parts
  • Backward SDE (BSDE): representation of V via stochastic equations, numerical solution via the Longstaff-Schwartz method
  • Deep BSDE (E-Han-Jentzen, 2017): neural network approximation for high dimensions

Modern Applications

  • Portfolio management under risk-uncertainty: robust Markowitz-Shanning optimization
  • Distributed energy systems: coordination of millions of consumers with noisy loads
  • Autonomous vehicles: accounting for uncertainty in the behavior of other participants
  • Robotics under difficult conditions: quadcopters in wind

§ Act · what next