Walk-forward validation

A backtest optimized and reported on the same date range tells you how well a config fit that window's noise — not whether it will work tomorrow. Walk-forward validation is the honest alternative: optimize on one slice of time, measure on a later slice the optimizer never saw, and reserve a final slice nobody touches until the very end.

uv run python main.py walkforward \
  --strategy volume_spike --scanner volume --symbols NVDA,AAPL,MSFT \
  --start 2024-01-01 --end 2025-12-31 --capital 100000 \
  --mode anchored --folds 6 \
  --embargo-days 5 --holdout-days 60 \
  --method grid --objective sharpe_ratio --max-evals 50

What it does

For each fold it optimizes parameters on an in-sample (IS) window, then scores the chosen config on the following out-of-sample (OOS) window. An embargo gap separates IS from OOS so indicator warm-up can't leak across the boundary. A holdout window is carved off the end first and scored exactly once, at the very end — it never reaches any optimizer call.

The honest performance number is the OOS aggregate: metrics recomputed over the concatenation of every fold's OOS trades, not an average of per-fold numbers.

Key options

Flag	Meaning
`--mode anchored\|rolling`	Expanding IS window (anchored) or fixed-width sliding IS (rolling).
`--folds N`	Number of folds. Alternatively set `--train-days` / `--test-days`.
`--embargo-days N`	IS→OOS gap. Defaults to the strategy's required lookback in calendar days.
`--holdout-days N`	Final sacred window, scored once.
`--method grid\|random\|bayesian`	Search method used per fold (`bayesian` needs the `optimize` extra).
`--objective`	Metric to optimize in-sample (default `sharpe_ratio`).
`--pbo`	Also estimate the Probability of Backtest Overfitting (slower).
`--monte-carlo`	Block-bootstrap the OOS trades for a 5th-percentile Sharpe.
`--param-sensitivity`	Perturb the chosen params ±10% and re-test robustness.
`--leakage-probe`	Shift the data feed forward to detect future-data leakage.
`--save-config PATH`	Save the chosen config (with provenance) for a human to review.
`--results-csv PATH`	Write the per-fold table to CSV.

Reading the output

The report prints a per-fold table (IS vs OOS headline metrics, OOS trade count), the OOS aggregate block, walk-forward efficiency, IS→OOS degradation, the holdout block, and the promotion-gate verdict — a pass/fail per gate plus an overall promotable. A config is only promotable if it clears every gate (median OOS Sharpe, profit factor, efficiency, drawdown ratio, minimum OOS trades, deflated Sharpe, and — when requested — parameter sensitivity and the leakage probe).

Saving a config never changes live behaviour. It writes a JSON file to a gitignored configs/ directory; promoting it to live trading is a manual human step.

See the engineering wiki's Walk-forward validation page for the design, fold geometry, and the leakage-safety guarantees.

What it does​

Key options​

Reading the output​

What it does

Key options

Reading the output