Model Calibration and Validation

Model calibration fits parameters to market data; validation confirms the model performs adequately for its intended use. Both processes require systematic workflows, quantitative acceptance criteria, and documentation suitable for regulatory review. If your calibration process lacks any of these three elements, you have a governance gap—not just a technical one.
Why Calibration and Validation Deserve Separate Attention
Most quant teams treat calibration as a technical task and validation as a compliance exercise. That separation drives many model risk findings. Calibration without validation is curve-fitting. Validation without understanding calibration is box-checking. The two processes form a single workflow, and your governance framework should treat them that way.
The point is: a model that calibrates beautifully to today's market data but fails out-of-sample next week isn't a good model—it's an overfitting risk you haven't measured yet. The workflow below gives you a repeatable process that satisfies both the quant desk and the model risk committee.
Step 1: Data Hygiene and Input Checks (Where Most Failures Actually Start)
Before any optimizer runs, you need clean inputs. In practice, the majority of calibration failures trace back to data problems, not model limitations. This step is unglamorous but high-ROI.
Timestamp verification matters more than you think. Intraday staleness—using a 10:30 AM quote to calibrate against a 2:00 PM surface—introduces phantom errors that the optimizer will dutifully fit. Your calibration will "succeed" against stale data and fail against the live surface.
Required checks before calibration begins:
- Timestamp alignment: All quotes from the same snapshot time (tolerance: ±2 minutes for equity options, ±30 seconds for rates)
- Outlier detection: Flag any quote more than 3 standard deviations from its 20-day rolling history (don't auto-exclude—investigate first)
- Liquidity filter: Exclude instruments with bid-ask spreads wider than 2x the median for that tenor/strike bucket, or with zero volume in the prior session
- Source consistency: All calibration instruments from the same data provider (mixing Bloomberg and Reuters mid-quotes introduces systematic basis)
- Completeness: All required tenors and strikes present; no gaps in the surface that would force the optimizer to extrapolate
- Corporate action check: No pending dividends, splits, or mergers affecting calibration instruments within the calibration window
The practical point: Build this as an automated pre-calibration gate. If any check fails, the calibration doesn't run—it flags for manual review. This eliminates the most common source of "the model broke overnight" escalations.
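One way to wire these checks into an automated gate is sketched below. This is a minimal pure-Python illustration: the `Quote` fields, thresholds, and the flat trailing-history list are assumptions, not a real market-data schema. The gate blocks the run and reports every issue rather than silently excluding quotes.

```python
from dataclasses import dataclass
from statistics import mean, median, stdev

@dataclass
class Quote:
    """Illustrative quote record; field names are assumptions, not a feed schema."""
    instrument: str
    mid_iv: float              # quoted implied vol
    bid_ask_spread: float
    snapshot_offset_s: float   # seconds from the target snapshot time
    history: list              # trailing 20-day mid_iv history for this instrument

def pre_calibration_gate(quotes, max_offset_s=120.0, z_limit=3.0, spread_mult=2.0):
    """Return (passed, issues). Any issue blocks the calibration run and
    routes the batch to manual review instead of auto-excluding quotes."""
    issues = []
    med_spread = median(q.bid_ask_spread for q in quotes)
    for q in quotes:
        # Timestamp alignment: the +/-2 minute equity-options tolerance
        if abs(q.snapshot_offset_s) > max_offset_s:
            issues.append(f"{q.instrument}: stale quote ({q.snapshot_offset_s:+.0f}s)")
        # Outlier vs rolling history: flag for investigation, never auto-exclude
        if len(q.history) >= 2:
            mu, sd = mean(q.history), stdev(q.history)
            if sd > 0 and abs(q.mid_iv - mu) > z_limit * sd:
                issues.append(f"{q.instrument}: outlier vs rolling history")
        # Liquidity filter: spread wider than 2x the batch median
        if q.bid_ask_spread > spread_mult * med_spread:
            issues.append(f"{q.instrument}: illiquid (spread > {spread_mult}x median)")
    return (len(issues) == 0, issues)
```

Source-consistency, completeness, and corporate-action checks sit naturally in the same function once the quote record carries provider and surface-coordinate fields.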
Step 2: Objective Functions and Constraints (What You're Actually Optimizing)
The objective function defines what "good fit" means. Your choice of objective function is itself a modeling decision—one that should be documented and reviewed, not buried in code.
Sum of Squared Errors (Baseline)
The calculation: Objective = Σᵢ (Model_IVᵢ − Market_IVᵢ)²
This treats every calibration instrument equally. Simple, transparent, and often wrong—because a 0.5 vol error on a deep OTM put (low vega, low notional sensitivity) is not the same as a 0.5 vol error on an ATM option (high vega, high notional sensitivity).
Vega-Weighted Objective (Standard Practice)
The calculation: Objective = Σᵢ vegaᵢ × (Model_IVᵢ − Market_IVᵢ)²
This weights ATM options (high vega) more heavily than wings. For most equity derivatives desks, this is the right default because ATM options drive the majority of P&L sensitivity.
Why this matters: if you weight all strikes equally, the optimizer will burn parameter budget fitting deep OTM tails at the expense of ATM accuracy. Your traders will see the ATM skew is wrong and lose confidence in the model, even though the RMSE looks fine.
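As a minimal sketch of the two objectives above (function names and the flat-list inputs are illustrative; a production implementation would operate on the full surface structure):

```python
def sse_objective(model_iv, market_iv):
    """Unweighted sum of squared IV errors: the transparent baseline."""
    return sum((m - k) ** 2 for m, k in zip(model_iv, market_iv))

def vega_weighted_objective(model_iv, market_iv, vega):
    """Vega-weighted squared errors: high-vega ATM quotes dominate the fit,
    so the optimizer spends its parameter budget where P&L sensitivity lives."""
    return sum(v * (m - k) ** 2 for m, k, v in zip(model_iv, market_iv, vega))
```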
Regularization (The Overfitting Guard)
The calculation: Objective = Σᵢ (errorᵢ)² + λ × (parameter_penalty)
The regularization weight λ scales a penalty on extreme parameter values. Without regularization, you get parameters that perfectly fit today's surface and produce nonsense tomorrow. A vol-of-vol of 300% might minimize today's objective, but it signals a model that is memorizing noise.
Choosing λ: Start with λ = 0.01 × (mean squared error of unregularized fit). Too high and you underfit; too low and you don't prevent overfitting. Backtest different λ values over 60 trading days and pick the one that minimizes out-of-sample RMSE (not in-sample).
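One way to sketch the regularized objective and the λ selection follows. The L2 penalty toward an anchor parameter set (e.g., long-run average parameters) and the precomputed per-λ out-of-sample RMSEs from the 60-day backtest are assumptions for illustration, not the only reasonable design.

```python
def regularized_objective(errors, params, anchor, lam):
    """Squared IV errors plus an L2 penalty pulling parameters toward an
    anchor set; lam trades today's fit against day-over-day stability."""
    fit = sum(e ** 2 for e in errors)
    penalty = sum((p - a) ** 2 for p, a in zip(params, anchor))
    return fit + lam * penalty

def select_lambda(candidates, oos_rmse_by_lambda):
    """Pick the lambda that minimized *out-of-sample* RMSE over the
    backtest window -- never the in-sample fit."""
    return min(candidates, key=lambda lam: oos_rmse_by_lambda[lam])
```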
Constraints (Hard Boundaries on Parameters)
Every parameter needs explicit bounds, documented with rationale:
| Parameter | Typical Bounds | Rationale |
|---|---|---|
| κ (mean reversion) | [0.1, 10.0] | Below 0.1: variance doesn't mean-revert; above 10: implausibly fast |
| θ (long-run variance) | [0.01, 0.25] | Corresponds to 10%–50% long-run vol |
| σ_v (vol of vol) | [0.1, 1.5] | Above 1.5: model produces unrealistic dynamics |
| ρ (correlation) | [−0.95, 0.0] | Positive ρ contradicts leverage effect in equities |
| v₀ (initial variance) | [0.005, 0.25] | Must be consistent with current ATM implied vol |
The pattern that holds: constraints aren't just numerical guardrails—they encode your prior knowledge about what's economically reasonable. If the optimizer pushes a parameter to its bound, that's information. A parameter at its bound means either your bound is wrong or the model can't fit this market.
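The bounds table can be encoded directly, with a small helper that flags parameters sitting near a bound. The 1% relative tolerance below is an illustrative choice; optimizers rarely land exactly on a bound in floating point.

```python
# Heston parameter bounds from the table above.
BOUNDS = {
    "kappa": (0.1, 10.0),
    "theta": (0.01, 0.25),
    "sigma_v": (0.1, 1.5),
    "rho": (-0.95, 0.0),
    "v0": (0.005, 0.25),
}

def params_at_bounds(params, bounds=BOUNDS, rel_tol=0.01):
    """Return names of parameters within rel_tol of either bound --
    a signal to investigate, not a hard failure by itself."""
    flagged = []
    for name, value in params.items():
        lo, hi = bounds[name]
        width = hi - lo
        if value - lo <= rel_tol * width or hi - value <= rel_tol * width:
            flagged.append(name)
    return flagged
```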
Step 3: Optimization (Getting to the Minimum Reliably)
The choice of optimizer matters less than people think—convergence criteria matter more.
Common Approaches
- Levenberg-Marquardt: Standard for least-squares problems. Fast, reliable for smooth objectives. Use this as your default for Heston, SABR, and most stochastic vol models.
- Gradient descent with line search: Useful when you have analytic gradients (e.g., SABR). Faster per iteration but may need more iterations.
- Global optimization (simulated annealing, differential evolution): Use only when the objective has multiple local minima (common with multi-factor models or LMM). 10x–100x slower than local methods—don't use by default.
Convergence Criteria (Define These Explicitly)
- Maximum iterations: 1,000 (increase to 5,000 for global methods)
- Objective improvement threshold: Less than 0.01% change over 5 consecutive iterations
- Parameter change threshold: Less than 0.1% change in all parameters over 3 consecutive iterations
- Gradient norm threshold: Below 1e-8 (for gradient-based methods)
If the optimizer hits maximum iterations without converging, that's a failure—not a result. Log it, flag it, investigate it. Common causes: poor initial guess, objective function with flat regions, or a model that genuinely can't fit the current surface.
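A minimal sketch of these stopping rules, assuming the optimizer exposes per-iteration objective values and parameter vectors (most least-squares libraries can be wrapped to record both):

```python
def rel_change(a, b):
    """Relative change between two values, safe near zero."""
    return abs(a - b) / max(abs(b), 1e-12)

def check_convergence(obj_history, param_history, max_iter=1000,
                      obj_tol=1e-4, param_tol=1e-3):
    """Apply the explicit criteria above: objective change < 0.01% over 5
    consecutive iterations AND every parameter change < 0.1% over 3.
    Hitting max_iter is a failure to log and investigate, not a result."""
    if len(obj_history) >= max_iter:
        return "max_iterations"
    obj_flat = (len(obj_history) >= 6 and
                all(rel_change(obj_history[i], obj_history[i - 1]) < obj_tol
                    for i in range(-5, 0)))
    par_flat = (len(param_history) >= 4 and
                all(rel_change(p, q) < param_tol
                    for i in range(-3, 0)
                    for p, q in zip(param_history[i], param_history[i - 1])))
    return "converged" if (obj_flat and par_flat) else "running"
```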
Initial Guess Strategy
Use yesterday's calibrated parameters as today's initial guess. Day-over-day parameter continuity is expected for well-behaved models. If today's calibration converges to parameters far from yesterday's (any single parameter changing by more than 20%), investigate before accepting.
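A small helper for the 20% day-over-day screen (the dict-of-floats parameter representation is an assumption for illustration):

```python
def day_over_day_flags(today, yesterday, limit=0.20):
    """Flag parameters whose relative change from yesterday exceeds the
    limit. Yesterday's values also serve as today's initial guess."""
    return [name for name, v in today.items()
            if abs(v - yesterday[name]) / max(abs(yesterday[name]), 1e-12) > limit]
```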
Step 4: Overfitting Detection (The Step Most Teams Skip)
A model that fits the calibration set perfectly but fails out-of-sample is worse than useless—it gives you false confidence in prices that are wrong.
Out-of-Sample Testing (Non-Negotiable)
The method: Reserve 20% of calibration instruments for validation. Calibrate to the remaining 80%. Then price the held-out 20% with the calibrated model.
The test: If out-of-sample RMSE exceeds in-sample RMSE by more than 50%, overfitting is likely. If it exceeds by more than 100%, overfitting is confirmed.
Example:
- In-sample RMSE: 0.42 vols
- Out-of-sample RMSE: 0.58 vols
- Ratio: 0.58 / 0.42 = 1.38 (38% higher—acceptable)
If that ratio were 2.0 or above, you'd need to increase regularization, reduce model complexity, or expand the calibration set.
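A sketch of the 80/20 split and the overfitting verdict using the ratio thresholds above. The seeded random split is one reasonable choice; some desks instead hold out specific strikes or tenors to stress a particular region of the surface.

```python
import math
import random

def holdout_split(instruments, holdout_frac=0.20, seed=7):
    """Randomly reserve holdout_frac of instruments for out-of-sample
    pricing; returns (calibration_set, holdout_set)."""
    rng = random.Random(seed)
    shuffled = instruments[:]
    rng.shuffle(shuffled)
    k = max(1, int(len(shuffled) * holdout_frac))
    return shuffled[k:], shuffled[:k]

def rmse(errors):
    return math.sqrt(sum(e ** 2 for e in errors) / len(errors))

def overfitting_verdict(in_sample_errors, out_sample_errors):
    """Compare held-out RMSE to in-sample RMSE: ratio < 1.5 acceptable,
    1.5-2.0 likely overfitting, >= 2.0 confirmed."""
    ratio = rmse(out_sample_errors) / rmse(in_sample_errors)
    if ratio < 1.5:
        return ratio, "acceptable"
    if ratio < 2.0:
        return ratio, "likely overfitting"
    return ratio, "overfitting confirmed"
```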
Day-Over-Day Stability Checks
Parameters should be smooth functions of time. Large daily swings indicate the model is fitting noise rather than signal.
Red flags (investigate immediately):
- Any parameter changes by more than 20% day-over-day without a corresponding market move
- Vol-of-vol exceeds 200% (the model is compensating for a structural misfit)
- Correlation flips sign (from negative to positive in equity models)
- Parameters cluster at constraint boundaries for more than 3 consecutive days
The move: maintain a rolling 20-day history of calibrated parameters. Compute the standard deviation of each parameter. If today's value is more than 2σ from the 20-day mean, flag it for review before accepting.
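The 2σ screen is nearly a one-liner in practice, assuming the rolling parameter history is stored as a plain list of daily values:

```python
from statistics import mean, stdev

def stability_flag(history, today_value, n_sigma=2.0):
    """Flag today's parameter if it sits more than n_sigma standard
    deviations from its rolling (e.g., 20-day) mean; flagged values
    need review before acceptance."""
    mu, sd = mean(history), stdev(history)
    return sd > 0 and abs(today_value - mu) > n_sigma * sd
```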
Cross-Sectional Consistency
If you calibrate the same model to different underliers (e.g., Heston to SPX, NDX, and RUT), parameters should show economically sensible relationships. NDX should have higher vol-of-vol than SPX (more volatile underlier). If your calibration produces the opposite, something is wrong with the data or the calibration setup.
Step 5: Documentation and Audit Trail (What Regulators Actually Look For)
Every calibration run must produce a record that an independent reviewer can reconstruct. "The model works" is not documentation. You need to show why it works, when it was tested, and what would cause it to fail.
Required fields per calibration run:
- Timestamp (to the second)
- Data source and snapshot time
- Parameters before calibration (initial guess) and after (result)
- Objective function value (in-sample and out-of-sample)
- Convergence status (converged, hit max iterations, failed)
- Any manual overrides (with written justification and approver name)
- Comparison to prior day's parameters (absolute and percentage change)
- Any data exclusions (which instruments, why)
Retention: Maintain calibration records for a minimum of 5 years (7 years for SR 11-7 covered institutions). Store in an immutable audit log—not a spreadsheet that someone can edit.
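One lightweight way to make the log tamper-evident is to hash-chain entries, sketched below with the standard library. This is an illustration of the idea, not a recommendation to hand-roll audit storage; production systems typically rely on WORM storage or a managed audit service.

```python
import hashlib
import json

def append_calibration_record(log, record):
    """Append a calibration record to an append-only log, chaining each
    entry to the previous entry's hash so silent edits are detectable."""
    prev_hash = log[-1]["hash"] if log else "genesis"
    body = json.dumps(record, sort_keys=True)
    entry = {
        "record": record,
        "prev_hash": prev_hash,
        "hash": hashlib.sha256((prev_hash + body).encode()).hexdigest(),
    }
    log.append(entry)
    return entry

def verify_log(log):
    """Recompute the hash chain; any tampering breaks verification."""
    prev = "genesis"
    for entry in log:
        body = json.dumps(entry["record"], sort_keys=True)
        if entry["prev_hash"] != prev:
            return False
        if entry["hash"] != hashlib.sha256((prev + body).encode()).hexdigest():
            return False
        prev = entry["hash"]
    return True
```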
Acceptance Thresholds (When to Accept, Review, or Reject)
Volatility models (Heston, SABR, local vol):
| Metric | Accept | Review | Reject |
|---|---|---|---|
| In-sample RMSE | < 0.5 vols | 0.5–0.75 vols | > 0.75 vols |
| Out-of-sample RMSE | < 0.75 vols | 0.75–1.0 vols | > 1.0 vols |
| Max single-point error | < 2.0 vols | 2.0–3.0 vols | > 3.0 vols |
| Parameters at bounds | None | 1 parameter | 2+ parameters |
Interest rate models (Hull-White, LMM):
| Metric | Accept | Review | Reject |
|---|---|---|---|
| Swaption surface RMSE | < 0.3 vols | 0.3–0.5 vols | > 0.5 vols |
| Yield curve repricing | < 0.1 bps | 0.1–0.5 bps | > 0.5 bps |
| Cap/floor RMSE | < 0.4 vols | 0.4–0.7 vols | > 0.7 vols |
When thresholds are breached:
- Review input data for staleness, outliers, or missing instruments
- Check whether market conditions are genuinely unusual (e.g., post-event vol spikes) and document
- Expand calibration set or adjust vega weights
- If the model structurally cannot fit the current surface, document the limitation and escalate
- Do not adjust thresholds to make a failing calibration pass (this is the most common governance violation)
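The volatility-model table above collapses into a triage function. The metric names and the strict/inclusive boundary choices below are illustrative; any single "reject" metric rejects, and any "review" metric with no rejects routes to review.

```python
def triage(in_rmse, oos_rmse, max_err, n_at_bounds):
    """Map volatility-model calibration metrics (vols) to accept / review /
    reject per the thresholds above."""
    if in_rmse > 0.75 or oos_rmse > 1.0 or max_err > 3.0 or n_at_bounds >= 2:
        return "reject"
    if in_rmse >= 0.5 or oos_rmse >= 0.75 or max_err >= 2.0 or n_at_bounds == 1:
        return "review"
    return "accept"
```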
RMSE Threshold Reference (Cross-Model Comparison)
| Model Type | Typical RMSE | Acceptable | Needs Review | Likely Structural Misfit |
|---|---|---|---|---|
| Heston (equity) | 0.3–0.5 vols | < 0.5 | 0.5–0.75 | > 1.0 |
| SABR (rates) | 0.2–0.4 vols | < 0.5 | 0.5–0.75 | > 1.0 |
| LMM (swaptions) | 0.3–0.6 vols | < 0.75 | 0.75–1.0 | > 1.5 |
| Local vol (equity) | 0.1–0.3 vols | < 0.3 | 0.3–0.5 | > 0.75 |
| Dupire (exotic) | 0.2–0.5 vols | < 0.5 | 0.5–0.8 | > 1.0 |
The point is: these thresholds are calibrated to production experience across multiple desks. If your RMSE consistently exceeds the "Acceptable" column, the issue is model selection (not calibration technique). Consider moving to a more flexible model before tuning the optimizer further.
Example: Heston Calibration to S&P 500 Options (Full Walkthrough)
Your situation: You're calibrating a Heston stochastic volatility model to the SPX options surface for daily production use. The surface includes 8 expiries (1W to 2Y) and 15 strikes per expiry (from 80% to 120% moneyness), giving 120 calibration instruments total.
Calibrated parameters:
| Parameter | Initial Guess | Calibrated | Bound | At Bound? |
|---|---|---|---|---|
| κ (mean reversion) | 2.0 | 1.8 | [0.1, 10] | No |
| θ (long-run variance) | 0.04 | 0.052 | [0.01, 0.25] | No |
| σ_v (vol of vol) | 0.4 | 0.48 | [0.1, 1.5] | No |
| ρ (correlation) | −0.6 | −0.72 | [−0.95, 0.0] | No |
| v₀ (initial variance) | 0.04 | 0.038 | [0.005, 0.25] | No |
Validation results (80/20 split, vega-weighted objective, λ = 0.01):
| Metric | Value | Threshold | Status |
|---|---|---|---|
| In-sample RMSE | 0.42 vols | < 0.5 vols | Pass |
| Out-of-sample RMSE | 0.58 vols | < 0.75 vols | Pass |
| OOS/IS ratio | 1.38 | < 1.5 | Pass |
| Max single-point error | 1.8 vols | < 2.0 vols | Pass |
| Parameters at bounds | None | None | Pass |
| Day-over-day max change | 8% (ρ) | < 20% | Pass |
| Convergence | 87 iterations | < 1,000 | Pass |
Validation conclusion: Model calibration meets all acceptance thresholds. No parameters at bounds, out-of-sample degradation within tolerance, convergence achieved well within iteration budget. Approved for production use.
Governance Notes (SR 11-7 and Basel Alignment)
Model calibration falls squarely under SR 11-7 (the Federal Reserve's guidance on model risk management) and the Basel Committee's principles for effective risk data aggregation and reporting (BCBS 239). Your calibration framework isn't compliant if it lacks independent validation, regular backtesting, or formal change management.
Core governance requirements:
- Independent validation: The team validating calibration methodology must be independent from the team performing daily calibration (separation of duties is non-negotiable under SR 11-7)
- Backtesting cadence: Monthly comparison of model-predicted prices against realized outcomes, with formal documentation of any systematic biases
- Change management: Any modification to the calibration methodology (new objective function, changed constraints, different optimizer) requires formal approval through the model governance committee before production deployment
- Model inventory: Every calibrated model must appear in the firm's model inventory, tiered by materiality, with clear ownership and review schedule
- Annual review: Full validation including trailing-year accuracy analysis, benchmark comparison against alternative models, parameter stress testing, and documentation refresh
Escalation protocol:
- Calibration failure for 1 day: Automated alert to quant desk; use prior day's parameters with manual override documentation
- Calibration failure for 2+ consecutive days: Notify risk management; begin root cause investigation
- Threshold breach for 5+ days: Escalate to model governance committee; formal remediation plan required
- Remediation deadline: 30 days from formal finding (per SR 11-7 expectations)
Calibration Checklist (Governance-Ready)
Essential (Run Every Calibration Cycle)
- Data hygiene gate passed (timestamp, outliers, liquidity, completeness)
- Objective function and constraints documented and unchanged from approved specification
- Optimizer converged within iteration budget
- In-sample RMSE within acceptance threshold
- Out-of-sample RMSE within acceptance threshold and OOS/IS ratio below 1.5
- No parameters at constraint boundaries
- Day-over-day parameter changes below 20% for all parameters
- Calibration record logged to immutable audit trail
Periodic (Weekly or Monthly)
- 20-day rolling parameter stability analysis reviewed
- Cross-sectional consistency checked across related underliers
- Backtest of model prices against realized outcomes completed
- Any manual overrides reviewed and re-justified or removed
Annual (Governance Review)
- Full independent validation of calibration methodology
- Benchmark comparison against at least one alternative model
- Stress testing of calibrated parameters under historical crisis scenarios
- Documentation updated for any methodology changes during the year
- Model inventory entry confirmed current and accurate
Where to Go Next
For stress testing calibrated models under extreme scenarios, see Stress Testing Models for Extreme Moves. For governance frameworks that wrap around this calibration process, see Model Risk Governance Practices. To understand the models being calibrated here, review Local vs. Stochastic Volatility Models.