Manuscript Figures
==================

Purpose
-------

This page tracks the target figure set for the future SPECTRAX-GK paper. A
figure is only ready for manuscript use when it has:

- one owning script,
- one reproducible artifact path,
- a declared reference,
- a declared acceptance status.

Current Readiness Snapshot
--------------------------

The current scoped manuscript stack is ready for claims about validated
quasilinear diagnostics/model selection, nonlinear-window comparison gates,
compact nonlinear startup-window finite-difference plumbing audits, mode-21
VMEC/Boozer geometry parity, reduced differentiable stellarator ITG
optimization examples, and linear/quasilinear VMEC/Boozer AD-vs-finite-
difference gradient gates on QH and Li383. The required release CI stack is the
quick-shard, docs/packaging, mypy, repo-hygiene, fast-coverage, and
wide-coverage matrix; treat the latest ``main`` run as the source of truth before
tagging. The companion ``docs/_static/manuscript_readiness_status.json`` report
currently has five active manuscript lanes closed and two lanes explicitly
deferred: W7-X zonal recurrence/damping and TEM / kinetic-electron stellarator
extension.

The broader plan is not fully closed. The current quasilinear figures are
publication-ready as diagnostics, model-selection evidence, and explicit
negative promotion gates, but they do not support a calibrated absolute-flux
predictor. The stellarator optimization figures are publication-ready for
reduced differentiable optimization/UQ plumbing and gradient validation, but
not yet for broad production nonlinear heat-flux optimization; the selected QA
optimized-equilibrium audit is a bounded positive audit, not a broad turbulent
optimization result. Those stronger claims require
converged post-transient nonlinear heat-flux windows, VMEC/Boozer nonlinear
turbulence-gradient or robust finite-difference gates, local-gradient
conditioning, and nonlinear audits of additional optimized equilibria. W7-X
zonal recurrence and TEM/kinetic-electron stellarator validation remain
deferred from the current manuscript scope.

The latest manuscript-stack additions are deliberately contract-level figures:
``docs/_static/quasilinear_holdout_gap_report.png`` with CSV/JSON/PDF
companions states exactly why absolute-flux promotion remains blocked;
``docs/_static/stellarator_objective_portfolio_gate.png``
validates the aggregate reduced-objective reducer used before expensive
VMEC/Boozer row production; and
``docs/_static/parallel_decomposition_status.png`` keeps production
independent-work parallelization claims separated from diagnostic nonlinear
whole-state/domain sharding metadata. The newer
``docs/_static/nonlinear_gradient_state_control_runbook.png`` is a claim
guardrail rather than a physics result: it shows that the QL-seeded
``Rsin_mid_surface_m1`` and ``Zcos_mid_surface_m1`` controls must be mapped to
perturbable VMEC input directions before nonlinear-gradient launches. The
newest ``LASYM=true`` ``RBS/ZBC`` response artifact provides that mapping with
rank ``2`` and condition number about ``1.02``, so the runbook now passes for
checked short-bracket launches. The older
``docs/_static/nonlinear_gradient_state_to_input_mapping_response.png`` is a
negative measured-response figure: the current stellarator-symmetric
``RBC/ZBS`` input perturbations do not move those asymmetric ``Rsin/Zcos``
state controls. The companion
``docs/_static/nonlinear_gradient_asymmetric_state_to_input_mapping_response.png``
is the positive symmetry-compatible mapping figure. The new
``docs/_static/nonlinear_gradient_state_control_short_bracket_launch_status.png``
then records that the mapped-control VMEC launch decks solved normally and that
bounded nonlinear campaign manifests are prepared, without promoting nonlinear
transport-gradient evidence. The follow-up
``docs/_static/nonlinear_gradient_state_control_short_bracket_nonlinear_audit_status.png``
records the first actual bounded nonlinear audit: all runtime and ensemble
window gates pass, but both finite-difference gradients fail closed because the
``1e-3`` bracket response is unresolved and asymmetric.

The new ``docs/_static/qa_low_turbulence_comparison.png`` panel adds the
aspect-6 QA low-turbulence optimization comparison requested for the
stellarator-design narrative. It is publication-ready for reduced
differentiable optimization plumbing, AD/finite-difference gates, and
side-by-side visualization of the control-only and transport-aware reduced
designs. It now labels the smooth heat-flux trace as the reduced
``Q_env = W_i E`` envelope and shows reduced ``|B|`` coloring on both the LCFS
surface renderings and the Boozer-LCFS maps. It is intentionally scoped away
from full VMEC/nonlinear-GK production claims. The companion
``docs/_static/qa_low_turbulence_time_horizon_audit.png`` panel checks the
reduced nonlinear-envelope horizon directly: the ``t=400`` late-window means
are unchanged at the ``1e-7`` level or better relative to a ``t=1000``
reference for the tracked designs, so the compact comparison panel does not
need to extend beyond ``t v_ti/a = 400``. The refreshed reduced objectives also
include a high-weight QA-compatible helical-shaping residual, so the displayed
LCFS surfaces are visibly non-axisymmetric rather than collapsing to an
axisymmetric boundary.
The companion solved-boundary guardrail
``docs/_static/vmec_jax_qa_transport_candidate_comparison.png`` is not a
promoted optimization result. It documents the VMEC-JAX/SPECTRAX-GK objective
assembly and WOUT-writing path, while deliberately failing closed when the
transport-weight refinement degrades the solved WOUT profile-iota and
quasisymmetry margins. The refreshed builder also treats gates reconstructed
from ``history.json`` and ``wout_final.nc`` as advisory only; paper-facing
admission requires an authoritative final ``solved_wout_gate.json``. A future
solved-boundary optimization figure must pass this full solved-candidate gate
before launching long-window nonlinear transport audits.
For VMEC-JAX replay-sensitive runs, the figure artifact must also state which
WOUT is authoritative. If ``wout_final_rerun.nc`` is chosen, require the
separate rerun-WOUT aspect/iota/QS gate and use that WOUT in the transport
audit commands; do not mix optimizer-state geometry with rerun-state
transport metrics.

Current Vs Deferred Figure Inventory
------------------------------------

.. list-table::
   :header-rows: 1
   :widths: 24 42 34

   * - Figure group
     - Current manuscript use
     - Deferred or blocked interpretation
   * - Benchmark atlas and nonlinear windows
     - Release atlas, nonlinear window statistics, eigenfunction overlays, and
       gate index support the scoped linear/nonlinear validation narrative.
     - ETG nonlinear pilots, TEM/KAW stress lanes, extra W7-X flux tubes, and
       non-indexed figures are not promoted release parity claims.
   * - Quasilinear diagnostics
     - Electrostatic spectra, shape gates, calibration provenance, negative
       simple-rule gates, and the ``spectral_envelope_ridge`` uncertainty panel
       support a model-selection result.
     - No runtime/TOML absolute-flux predictor, universal saturation law, or
       electromagnetic/KBM quasilinear calibration is promoted.
   * - Autodiff and VMEC/Boozer gradients
     - Inverse/UQ demos, zero-beta equal-arc parity, solver-ready gradients, and
       mode-21 QH/Li383 linear, quasilinear, and reduced nonlinear-window
       estimator gates are in scope.
     - Compact nonlinear FD audits are startup plumbing checks only; production
       nonlinear transport gradients and broader optimized-equilibrium audits
       beyond the selected QA candidate remain future gates.
   * - Performance and parallelization
     - Runtime/memory figures, independent ``k_y`` scan scaling,
       quasilinear/UQ ensemble scaling, and nonlinear RHS profiler artifacts are
       release-facing engineering evidence.
     - Production parallelization is the independent-work path. Whole-state
       nonlinear sharding and nonlinear domain sharding are identity/profiler
       evidence only unless the exact workload passes promotion gates.
   * - W7-X zonal and TEM guardrails
     - Open-research and TEM-status panels are useful guardrails for the paper
       plan and release notes.
     - W7-X long-window zonal recurrence, W7-X experimental fluctuation-spectrum
       validation, W7-X TEM/kinetic-electron nonlinear windows, and broad
       multi-flux-tube stellarator validation are deferred.

Core Validation Figures
-----------------------

.. list-table::
   :header-rows: 1

   * - Figure
     - Owning script
     - Status
     - Notes
   * - Linear benchmark master panel
     - ``tools/make_benchmark_atlas.py``
     - Closed for the release atlas; paper-level extensions remain scoped
     - Cyclone ITG, ETG, KBM, W7-X, HSX, and shaped tokamak coverage are represented in ``docs/_static/benchmark_core_linear_atlas.png``. TEM/kinetic-electron branch parity and additional W7-X multi-flux-tube scans remain outside the current release claim.
   * - Eigenfunction validation panel
     - ``tools/plot_eigenfunction_overlap_summary.py``, ``tools/plot_eigenfunction_reference_overlay.py``, ``tools/generate_kbm_reference_overlay.py``, and ``tools/generate_w7x_reference_overlay.py``
     - Closed for KBM and W7-X raw overlays
     - first shipped overlap artifact is ``docs/_static/kbm_eigenfunction_overlap_summary.png``; frozen raw GX bundles now exist for KBM and W7-X under ``docs/_static/reference_modes/``. The closed KBM raw overlay is ``docs/_static/kbm_eigenfunction_reference_overlay_ky0p3000.png`` with overlap ``0.999985`` and relative ``L^2`` mismatch ``0.00721``. The closed W7-X raw overlay is ``docs/_static/w7x_eigenfunction_reference_overlay_ky0p3000.png`` with overlap ``0.9999999994`` and relative ``L^2`` mismatch ``3.33e-5`` against the finite GX ``t≈2`` raw-mode bundle. Both overlay generators write JSON gate reports with ``overlap >= 0.95`` and ``relative L^2 <= 0.25`` requirements. ``tools/compare_gx_kbm.py --branch-summary-json`` writes branch-continuity gate metadata for selected KBM scans, and ``tools/generate_kbm_branch_gate_summary.py`` refreshes the no-rerun tracked artifact ``docs/_static/kbm_branch_gate_summary.json`` from ``docs/_static/kbm_gx_candidates.csv``. The current continuity-first branch summary passes the strict adjacent growth/frequency jump and successive-overlap gates.
   * - Nonlinear transport panel
     - ``tools/make_gx_summary_panel.py`` / ``tools/make_gx_publication_panel.py``
     - Closed for release-window gates; stricter manuscript tightening remains open
     - Cyclone, Miller, KBM, W7-X, HSX with matched windows. Current component artifacts: ``docs/_static/nonlinear_cyclone_diag_compare_t400.png``, ``docs/_static/nonlinear_cyclone_miller_diag_compare_t122.png``, ``docs/_static/nonlinear_kbm_diag_compare_t400_stats.png``, ``docs/_static/nonlinear_w7x_diag_compare_t200.png``, ``docs/_static/hsx_nonlinear_compare_t50_true.png``. ``tools/compare_gx_nonlinear_diagnostics.py --summary-json`` writes machine-readable mean-relative-mismatch gates for each plotted diagnostic with explicit transport-window bounds. The materialized release-window metadata are ``docs/_static/nonlinear_cyclone_gate_summary.json``, ``docs/_static/nonlinear_cyclone_miller_gate_summary.json``, ``docs/_static/nonlinear_kbm_gate_summary.json``, ``docs/_static/nonlinear_hsx_gate_summary.json``, and ``docs/_static/nonlinear_w7x_gate_summary.json``; all pass the current release gate. This is sufficient for the release validation atlas. Paper-level tightening remains open where case-specific references support narrower tolerances, and the older short Cyclone ``t=5`` diagnostic remains documented as an exploratory startup/resolved-spectrum audit, not a release gate.
   * - W7-X exact-state convention audit
     - ``tools/run_exact_state_audit.py`` and ``tools/plot_w7x_exact_state_audit.py``
     - Closed
     - current artifact base: ``docs/_static/w7x_exact_state_audit.png`` with CSV/JSON/PDF companions. It compares W7-X nonlinear VMEC startup state, late geometry/field arrays, and re-evaluated scalar diagnostics directly against GX exact-state dumps. The maximum finite pointwise relative error is ``4.62e-5`` under the explicit ``1e-4`` convention gate, while scalar diagnostics are below ``1.8e-7``. This closes the geometry/diagnostic convention layer but does not close the separate W7-X zonal-response recurrence lane.
   * - Windowed-statistics summary
     - ``tools/plot_nonlinear_window_statistics.py``
     - Closed for current release-window gates
     - current artifact base: ``docs/_static/nonlinear_window_statistics.png`` with CSV/JSON/PDF companions. It summarizes the per-diagnostic ``mean_rel_abs`` and ``max_rel_abs`` statistics from the frozen nonlinear GX comparison gate JSONs for Cyclone, Cyclone Miller, KBM, W7-X, and HSX. Exploratory/short-run diagnostics are explicitly excluded with ``gate_index_include=false``.
   * - Nonlinear startup-window finite-difference audit
     - ``tools/build_nonlinear_window_fd_audit.py``
     - Closed only as compact startup plumbing; transport-average and gradient promotion open
     - current artifact base: ``docs/_static/nonlinear_window_fd_audit.png`` with CSV/JSON/PDF companions. It runs actual compact SPECTRAX-GK nonlinear Cyclone startup windows at ``R/LTi = base +/- step`` plus a repeated base point, then checks finite outputs, repeatability, monotonic drive response, startup-window coefficient of variation, startup-window trend, and resolved central finite-difference response. The tracked response/base fraction is about ``0.111``. Its ``transport_average_gate`` is false because the run is too short for a post-transient running average; it is not a production nonlinear heat-flux, VMEC/Boozer nonlinear state-gradient, or optimized-equilibrium transport claim.
   * - VMEC/Boozer nonlinear startup finite-difference audit
     - ``tools/build_vmec_boozer_nonlinear_window_fd_audit.py``
     - Closed only as VMEC/Boozer geometry-perturbed startup plumbing; transport-average and local-gradient promotion open
     - current artifact base: ``docs/_static/vmec_boozer_nonlinear_window_fd_audit.png`` with CSV/JSON/PDF companions. It starts from the real mode-21 ``vmec_jax -> booz_xform_jax`` QH state bridge, writes perturbed sampled geometries to temporary NetCDF files, and runs compact nonlinear startup windows at ``Rcos_mid_surface_m1 = base +/- 1e-5`` plus a repeated base point. The gate checks finite outputs, deterministic repeatability, bounded startup-window coefficient of variation and trend, resolved geometry perturbation, and resolved central finite-difference response; response/base is about ``0.040``. Its ``transport_average_gate`` is false and the forward/backward response is asymmetric, so this is a startup observable-path audit rather than a promoted transport average, local nonlinear gradient, or optimized-equilibrium transport claim.
   * - VMEC-state nonlinear-gradient launch runbook
     - ``tools/design_nonlinear_gradient_ql_seed_screen.py``, ``tools/design_nonlinear_gradient_state_control_runbook.py``, ``tools/write_vmec_state_to_input_mapping_campaign.py``, ``tools/write_vmec_asymmetric_state_to_input_mapping_campaign.py``, ``tools/write_vmec_state_control_short_bracket_launch.py``, and ``tools/build_vmec_state_to_input_mapping_response.py``
     - Closed for checked short-bracket launch mapping; long-window nonlinear-gradient evidence still required
     - current artifact bases: ``docs/_static/nonlinear_gradient_state_control_runbook.png``, ``docs/_static/nonlinear_gradient_state_to_input_mapping_campaign.png``, ``docs/_static/nonlinear_gradient_state_to_input_mapping_response.png``, ``docs/_static/nonlinear_gradient_asymmetric_state_to_input_mapping_campaign.png``, ``docs/_static/nonlinear_gradient_asymmetric_state_to_input_mapping_response.png``, ``docs/_static/nonlinear_gradient_state_control_short_bracket_launch_status.png``, ``docs/_static/nonlinear_gradient_state_control_short_bracket_nonlinear_audit_status.png``, and ``docs/_static/nonlinear_gradient_state_control_bracket_sweep_status.png`` with CSV/JSON/PDF companions. The QH/Li383 QL seed screen admits ``Rsin_mid_surface_m1`` and ``Zcos_mid_surface_m1`` as sign-consistent internal VMEC-state controls. The measured ``RBC/ZBS`` response matrix is rank zero, as expected for the symmetry-forbidden branch, but the follow-up ``LASYM=true`` ``RBS/ZBC`` response has rank ``2`` and condition number about ``1.02``. The runbook now carries explicit least-squares input-control directions for both admitted controls, and the short-bracket launch status records six normally terminated VMEC solves plus two prepared bounded nonlinear campaign manifests. The first nonlinear audit completes all ``18`` runs and passes output/ensemble gates, but both central-FD gates fail closed because the ``1e-3`` bracket response is too small and asymmetric. The follow-up ``3e-3``/``1e-2`` bracket-amplitude sweep completes all ``36`` office-GPU runs, but all four central-FD gates still fail with response fractions below ``0.005``. This is launch-mapping and negative single-control bracket evidence, not yet a converged long-window nonlinear-gradient result.
   * - Nonlinear-gradient control-variate campaign
     - ``tools/build_nonlinear_gradient_variance_reduction_plan.py`` and ``tools/write_nonlinear_gradient_control_variate_campaign.py``
     - Closed for the rel7.5 independent control-mean uncertainty gate; broader nonlinear-gradient claims remain scoped
     - current artifact bases: ``docs/_static/qa_ess_zbs10_rel7p5_variance_reduction_plan.png``, ``docs/_static/qa_ess_zbs10_rel7p5_control_variate_campaign_plan.png``, and ``docs/_static/qa_ess_zbs10_rel7p5_control_mean_tmin600_t1100_gate.png`` with JSON/CSV/PDF companions. The rel7.5 ``ZBS(1,0)`` follow-up is local and response-resolved but variance limited. The midpoint common-mode control variate reduces apparent residual uncertainty to ``0.238``; the independent follow-up completes ``21`` matched plus/minus pairs and the strict late-window gate over ``t=[600,1100]`` passes with combined response uncertainty ``0.311 < 0.5``. This closes the evidence record for this specific variance-reduced nonlinear-gradient lane, not a universal nonlinear turbulent-flux optimization result.
   * - Nonlinear transport time-horizon audit
     - ``tools/build_nonlinear_transport_horizon_audit.py``
     - Closed as claim-scope guardrail; QH/CTH convergence promotion open
     - current artifact base: ``docs/_static/nonlinear_transport_time_horizon_audit.png`` with CSV/JSON/PDF companions. It audits the actual simulated time and claim scope for release nonlinear gates, startup finite-difference audits, reduced nonlinear-window estimators, and external-VMEC feasibility pilots. The new QH reduced-grid nonlinear pilot is extended from the earlier startup-scale ``t=20`` trace to ``t=150`` and reaches a meaningful late heat-flux window with mean about ``19.6``; it remains a feasibility result until a grid/window convergence gate passes. This panel prevents startup ``1e-11``-scale heat fluxes or reduced-envelope outputs from being described as post-transient nonlinear transport averages.
   * - Validation gate index
     - ``tools/make_validation_gate_index.py``
     - Closed for currently tracked gates
     - current artifact base: ``docs/_static/validation_gate_index.png`` and ``docs/_static/validation_gate_index.json``. This is not a physics result by itself; it is the audit panel for release-window gates, currently ``16/16`` passed.
   * - Open research lane status
     - ``tools/build_open_research_lane_status.py``
     - Closed as a claim-scope audit; underlying physics lanes remain scoped
     - current artifact base: ``docs/_static/open_research_lane_status.png`` with CSV/JSON/PDF companions. It reads the W7-X zonal recurrence, W7-X hypercollision probe, W7-X fluctuation/TEM extension status, quasilinear holdout, differentiable-geometry, and nonlinear-profiler artifacts and records which lanes are closed, partial, open, or blocked. The current status is intentionally conservative: nonlinear holdouts for the scoped quasilinear model-development claim and profiler-backed nonlinear hot-path localization are closed; W7-X fluctuation/TEM and differentiable geometry are partial bounded diagnostics; and W7-X long-window zonal recurrence/damping remains open. This panel is useful for the paper plan and release notes because it prevents partial diagnostics from being described as completed physics claims.
   * - Manuscript-readiness status panel
     - ``tools/build_manuscript_readiness_status.py``
     - Current manuscript scope with W7-X zonal and TEM deferred
     - current artifact base: ``docs/_static/manuscript_readiness_status.png`` with CSV/JSON/PDF companions. It records the narrower manuscript scope where W7-X zonal recurrence and TEM/kinetic-electron extensions are deferred. In that scope, quasilinear diagnostics and saturation-model selection are closed as a validated negative/model-selection result rather than as an absolute-flux predictor; VMEC/Boozer zero-beta equal-arc geometry parity is closed at ``mboz=nboz=21``; reduced differentiable stellarator ITG optimization is closed with AD/FD gates; and production solver-objective geometry gradients are closed for solver-ready arrays plus mode-21 VMEC/Boozer eigenfrequency, quasilinear heat-flux-weight, and reduced nonlinear-window estimator gates on QH and Li383. The compact nonlinear FD audits are retained only as startup plumbing checks with false transport-average gates. The production nonlinear optimization guard adds the D-shaped and circular long post-transient replicated holdout ensembles plus the selected optimized-equilibrium ``t=[350,700]`` seed/timestep replicated audit. Broader nonlinear turbulence-gradient, absolute-flux prediction, and multi-surface stellarator optimization claims remain separate gates.
   * - Aspect-6 QA low-turbulence optimization comparison
     - ``tools/build_qa_low_turbulence_comparison.py`` and ``tools/build_qa_low_turbulence_time_horizon_audit.py``
     - Closed for reduced differentiable optimization-plumbing claims
     - current artifact bases: ``docs/_static/qa_low_turbulence_comparison.png`` and ``docs/_static/qa_low_turbulence_time_horizon_audit.png`` with JSON/CSV/PDF companions. The panel compares a reduced QA constraints-only optimum against a reduced QA plus nonlinear-envelope optimum at aspect ``A = 6`` and minimum mean ``iota = 0.41``. It includes the fixed-``a/L_T`` ``Q_env`` versus ``a/L_n`` scan, fixed-gradient reduced-envelope traces, objective histories, reduced non-axisymmetric LCFS surfaces colored by ``|B|``, reduced Boozer-LCFS ``|B|`` maps, and gradient/constraint gates. The tracked artifact passes scalar, residual, and observable AD/finite-difference gates, runs the fixed-gradient reduced envelope to ``t v_ti/a = 400``, enforces the formal ``iota >= 0.41`` floor plus an operating ``iota >= 0.70`` floor, keeps a helical boundary amplitude near ``0.16``, and shows about ``10.7%`` reduced late-window heat flux at the fixed gradient. The horizon audit compares ``t=400`` against a ``t=1000`` reduced-envelope reference and finds relative late-window mean changes of ``1.2e-7`` or smaller, with CV/trend/half-window drift below ``1e-3``. The figure supports a reduced differentiable optimization and visualization claim only; it is not the upstream VMEC-JAX ``QA_optimization.py`` final WOUT, and long-window full nonlinear transport optimization remains governed by the production nonlinear audit gates.
   * - Quasilinear spectrum panel
     - ``tools/plot_quasilinear_spectrum.py``
     - Electrostatic diagnostic closed; absolute-flux prediction not promoted
     - current artifact bases: ``docs/_static/quasilinear_cyclone_spectrum.png``, ``docs/_static/quasilinear_cyclone_miller_spectrum.png``, ``docs/_static/quasilinear_hsx_spectrum.png``, and ``docs/_static/quasilinear_w7x_spectrum.png`` with CSV/JSON/PDF companions. They show electrostatic linear weights and explicitly uncalibrated mixing-length outputs from ``examples/linear/axisymmetric/runtime_cyclone_quasilinear.toml``, ``examples/linear/axisymmetric/runtime_cyclone_miller_quasilinear.toml``, ``examples/linear/non-axisymmetric/runtime_hsx_linear_quasilinear.toml``, and ``examples/linear/non-axisymmetric/runtime_w7x_linear_quasilinear_vmec.toml``. Scan spectra use requested ``ky`` for the x-axis and retain signed selected-mode coordinates as ``mode_ky`` when applicable. W7-X was generated from an external VMEC benchmark file via ``W7X_VMEC_FILE``; the equilibrium itself is not shipped. Absolute saturated-flux claims remain open until a held-out nonlinear calibration report passes.
   * - Quasilinear calibration audit
     - ``tools/build_quasilinear_calibration_report.py`` and ``tools/plot_quasilinear_calibration.py``
     - Initial train/holdout artifact closed as a failed model-transfer gate
     - current artifact bases: ``docs/_static/quasilinear_cyclone_calibration_audit.png``, ``docs/_static/quasilinear_cyclone_miller_calibration_audit.png``, ``docs/_static/quasilinear_cyclone_miller_train_holdout.png``, ``docs/_static/quasilinear_hsx_train_holdout.png``, ``docs/_static/quasilinear_w7x_train_holdout.png``, the manuscript-facing combined panel ``docs/_static/quasilinear_stellarator_train_holdout.png``, and the input-provenance audit ``docs/_static/quasilinear_validated_calibration_inputs.png`` with JSON/PDF companions. The current one-constant train/holdout report fits the heat-flux scale on Cyclone and the external-VMEC ITERModel case, then scores six held-out windows: Cyclone Miller, HSX, W7-X, D-shaped external VMEC, up-down asymmetric external VMEC, and circular external VMEC. It intentionally remains ``calibration_dataset`` with ``passed = false`` because held-out errors exceed the ``0.35`` gate; the current holdout mean relative error is about ``2.11``. HSX, W7-X, and the up-down asymmetric VMEC point are useful negative stellarator/tokamak transfer checks because the simple positive-growth mixing-length family predicts little or near-zero flux while the nonlinear windows are finite. The D-shaped external-VMEC point is a converged negative transfer constraint with finite late-window nonlinear heat flux but a grossly overpredicted Cyclone/ITERModel-scaled mixing-length estimate. The newly admitted circular external-VMEC point is a positive transfer check for the scaled one-constant diagnostic, but it does not rescue the aggregate absolute-flux gate. The input audit confirms that every current train/holdout nonlinear artifact maps to a passed nonlinear gate, while failed QH and CTH-like external-VMEC feasibility pilots remain excluded. This closes the NetCDF/CSV calibration machinery and provenance gate but not a calibrated absolute-flux claim.
   * - Quasilinear saturation-rule sweep
     - ``tools/plot_quasilinear_saturation_rule_sweep.py``
     - Initial model-development diagnostic closed as a negative result
     - current artifact base: ``docs/_static/quasilinear_saturation_rule_sweep.png`` with JSON/PDF companions. It fits one scalar on the two training cases, Cyclone and external-VMEC ITERModel, then scores the same six held-out windows for three simple rules: positive-growth mixing length, raw linear heat-flux weight, and an absolute-growth diagnostic. All three fail the held-out absolute-flux gate. The least-bad simple rule is now positive-growth mixing length with holdout mean relative error about ``2.11``; raw linear weight is about ``2.68`` and the absolute-growth diagnostic is about ``3.32``. The panel also includes a training-mean null baseline with holdout mean relative error about ``1.20``. Its JSON ``promotion_gate`` has no accepted rules, so future calibrated rules must beat both the quasilinear baselines and this null baseline before being promoted. This supports the next saturation-model lane while preventing premature absolute quasilinear transport claims.
   * - Shape-aware quasilinear saturation diagnostic
     - ``tools/plot_quasilinear_shape_aware_saturation.py``
     - Initial leave-one-geometry-out diagnostic closed as a negative result
     - current artifact base: ``docs/_static/quasilinear_shape_aware_saturation.png`` with JSON/PDF companions. It fits a shared nonlinear/quasilinear spectrum-shape exponent with per-case intercepts, uses only passed shape gates for the exponent fit, then fits the absolute heat-flux scale on training cases and scores each held-out geometry. The shape-aware model gives mean absolute relative error about ``0.664`` versus ``0.624`` for the linear-weight baseline and ``0.170`` for a deliberately simple training-mean null baseline. The JSON ``promotion_gate`` is false because the model fails the ``0.35`` transport gate and does not beat the null baseline on the current four-case dataset. This is retained as a manuscript-facing negative result because it rules out a too-simple one-exponent envelope before stellarator optimization claims.
   * - Quasilinear candidate uncertainty gate
     - ``tools/plot_quasilinear_candidate_uncertainty.py``
     - Eight-case uncertainty-aware candidate gate closed as a scoped model-selection result
     - current artifact base: ``docs/_static/quasilinear_candidate_uncertainty.png`` with JSON/PDF companions. It adds training-residual ``95%`` prediction intervals to leave-one-geometry-out candidate scoring on the current eight-case electrostatic-compatible dataset. The legacy calibrated linear-weight and one-exponent shape-power-law candidates remain rejected relative to the null/skill gates. The accepted research candidate is ``spectral_envelope_ridge``: it uses the positive-growth ``k_y`` centroid and heat-flux-weighted ``k_y`` width in a three-parameter log-linear ridge model, reaches leave-one-geometry-out mean relative error about ``0.295``, and has interval coverage ``7/8``. This is a bounded model-development result, not a runtime/TOML absolute-flux predictor or a universal saturation law.
   * - Quasilinear dataset-sufficiency gate
     - ``tools/plot_quasilinear_dataset_sufficiency.py``
     - Promotion guard closed for the scoped spectral-envelope candidate; higher-parameter and electromagnetic claims remain blocked
     - current artifact base: ``docs/_static/quasilinear_dataset_sufficiency.png`` with JSON/PDF companions. It audits the validated nonlinear windows before any richer saturation model is promoted. The current electrostatic-compatible set has eight cases, two explicit training geometries, and six held-out geometries. That is sufficient for the one-parameter linear-weight candidate, the two-parameter shape-power-law candidate, and the three-parameter ``spectral_envelope_ridge`` candidate, but not for the five-parameter ``linear_state_ridge`` model. KBM is explicitly excluded from this electrostatic quasilinear promotion lane because electromagnetic field-channel normalization and calibration remain future work. The gate therefore supports the scoped spectral-envelope model-selection claim while preventing broader absolute-flux or electromagnetic quasilinear claims.
   * - Quasilinear model-selection status
     - ``tools/plot_quasilinear_model_selection_status.py`` and :mod:`spectraxgk.quasilinear_model_selection`
     - Scoped model-selection lane closed; absolute-flux runtime promotion remains blocked
     - current artifact base: ``docs/_static/quasilinear_model_selection_status.png`` with CSV/JSON/PDF companions. It consolidates the dataset-sufficiency gate, uncertainty/skill gate, and tracked train/holdout calibration reports into one claim-boundary panel. The accepted ``spectral_envelope_ridge`` candidate reaches leave-one-geometry-out mean relative error about ``0.295`` with prediction-interval coverage ``7/8`` and beats both the training-mean null and linear-weight baselines. The same artifact also verifies that no tracked train/holdout report is promoted to ``calibrated_absolute_flux``. This is the manuscript-facing positive result for reduced candidate selection, not a runtime/TOML absolute-flux predictor.
   * - Quasilinear holdout-gap report
     - ``tools/build_quasilinear_holdout_gap_report.py``
     - Absolute-flux promotion boundary quantified; next nonlinear holdout requirements explicit
     - current artifact base: ``docs/_static/quasilinear_holdout_gap_report.png`` with CSV/JSON/PDF companions. It keeps ``absolute_flux_promoted=false`` and now records an ``absolute_flux_promotion_requirements`` block. The current absolute train/holdout error is about ``2.11`` against the ``0.35`` gate, i.e. about ``6.04`` times too large, with Cyclone Miller as the worst admitted holdout. The same block requires three additional independent passed holdouts, one additional external-VMEC holdout family, and one non-axisymmetric external-VMEC holdout family before absolute-flux promotion can even be reconsidered. The required nonlinear cases list points to the current near-miss and missing-family candidates while explicitly stating that adding those cases is evidence expansion only, not automatic promotion.
   * - External-VMEC next-holdout runbook
     - ``tools/build_external_vmec_holdout_runbook.py`` and :mod:`spectraxgk.external_holdout_plan`
     - Launch-plan artifact closed; unchanged same-family and failed-family replays blocked
     - current artifact base: ``docs/_static/external_vmec_next_holdout_runbook.png`` with CSV/JSON/PDF companions. It converts the holdout-gap report and external-VMEC linear screen into a fail-closed nonlinear launch plan. After the circular external-VMEC case closed at ``t=450`` and entered the holdout set, the shaped-tokamak pressure candidate was run to ``t=450`` at ``n48``/``n64`` but failed the high-grid convergence gate with common and least-window relative heat-flux differences about ``0.306``. The follow-on ``ITERModel_reference_nc`` same-family audit passes at ``t=450`` with common/least grid differences about ``0.056``/``0.055``, but it is scoped as reproducibility evidence because ITERModel is already consumed by the training reference. The refreshed linear screen adds ``li383`` as stable, ``QI_stel_seed_3127`` as near-marginal with ``gamma≈3.8e-3``, and ``basic_non_stellsym`` as a VMEC flux-tube contract failure. The runbook now emits no unchanged replay command for those cases and also requires ``gamma >= 0.02`` before any nonlinear launch command. This records the fail-closed admission requirements: ``split=holdout``, sufficiently unstable linear branch, passed grid/window convergence, post-transient transport window, and independence from the training reference. This is a planning artifact only; it does not promote an absolute-flux predictor.
   * - Quasilinear promotion guardrail audit
     - ``tools/check_quasilinear_promotion_guardrails.py``
     - Fast metadata gate closed; nonlinear simulation validation remains delegated to the source gates
     - current artifact: ``docs/_static/quasilinear_promotion_guardrails.json``. It scans the train/holdout calibration reports, saturation-model reports, nonlinear input-validation blocks, promotion gates, claim-scope README/docs wording, the quasilinear row in ``docs/_static/manuscript_readiness_status.json``, and the manuscript quasilinear model-development figure index. It requires finite nonlinear window means and standard deviations for train/holdout calibration points, explicit nonlinear and quasilinear artifact provenance, JSON sidecars for the tracked model-development figures, scoped non-absolute claim levels, explicit failed-baseline or blocker metadata, passed held-out gates before any ``calibrated_absolute_flux`` claim, and a manuscript-readiness quasilinear lane that remains scoped as diagnostic/model-selection evidence rather than a runtime absolute-flux predictor. This is deliberately a wording and metadata guard, not a calibrated absolute-flux claim or a replacement for nonlinear convergence simulations.
   * - Release claim-scope ledger
     - ``docs/release_scope.rst``
     - Closed as documentation guardrail
     - This documentation page centralizes the current claim boundaries across validation, quasilinear model selection, differentiable geometry, parallelization, performance, and deferred W7-X/TEM lanes. It should be updated whenever a new artifact promotes or demotes a claim.
   * - VMEC equilibrium portfolio for future quasilinear holdouts
     - ``tools/plot_vmec_jax_equilibrium_inventory.py``
     - Planning artifact closed; bounded linear smoke checks started; transport validation open
     - current artifact bases: ``docs/_static/vmec_jax_equilibrium_inventory.png``, ``docs/_static/external_vmec_candidate_linear_screen.csv``, ``docs/_static/quasilinear_vmec_qi_seed_linear_spectrum.png``, ``docs/_static/quasilinear_vmec_qi_seed_branch_refinement_gate.png``, ``docs/_static/quasilinear_vmec_dshape_linear_spectrum.png``, ``docs/_static/external_vmec_dshape_grid_convergence_gate.png``, ``docs/_static/external_vmec_dshape_t250_high_grid_convergence_gate.png``, ``docs/_static/external_vmec_dshape_replicates/dshape_replicate_t250_ensemble_gate.png``, ``docs/_static/external_vmec_circular_replicates/circular_replicate_t700_ensemble_gate.png``, ``docs/_static/quasilinear_vmec_jax_qh_linear_spectrum.png``, ``docs/_static/quasilinear_vmec_jax_cth_like_linear_spectrum.png``, ``docs/_static/external_vmec_qh_nonlinear_t150_pilot.png``, ``docs/_static/external_vmec_qh_nonlinear_t150_n48_pilot.png``, ``docs/_static/external_vmec_qh_nonlinear_t150_n64_pilot.png``, ``docs/_static/external_vmec_qh_grid_convergence_gate.png``, ``docs/_static/external_vmec_qh_high_grid_convergence_gate.png``, ``docs/_static/external_vmec_cth_like_nonlinear_t150_pilot.png``, ``docs/_static/external_vmec_cth_like_nonlinear_t150_n48_pilot.png``, and ``docs/_static/external_vmec_cth_like_grid_convergence_gate.png`` with JSON/PDF companions. The inventory scans external VMEC files from ``vmec_jax/examples/data`` without checking them into SPECTRAX-GK and now records ``11`` equilibria, including the newly detected ``wout_QI_stel_seed_3127.nc``. A broader five-point candidate screen selected DSHAPE as the strongest finite unstable branch with ``gamma≈0.096`` at ``ky≈0.476``. The refreshed local portfolio records li383 as stable, QI seed as near-marginal rather than launchable, and basic non-stellarator-symmetric geometry as a flux-tube contract failure; the QI branch-refinement gate passes finite/positive-run/Krylov-consistency subgates but fails nonlinear-launch growth because ``max(gamma)≈3.8e-3 < 0.02``. DSHAPE passes low-to-mid-grid convergence at ``t=150`` and passes the ``48x48x32`` to ``64x64x40`` high-grid gate after extension to ``t=250``: common-window and least-window symmetric relative differences are about ``0.139`` and ``0.108``, below the ``0.15`` threshold. The follow-up ``64x64x40`` DSHAPE seed/timestep replicate campaign passes the late-window ensemble gate on ``t=[170,250]`` with mean-relative spread ``0.141`` and combined SEM/mean ``0.054``. The independent circular external-VMEC holdout also has replicated evidence after extending from a non-promotable ``t=450`` terminal-window drift to a closed ``t=700`` window: ``docs/_static/external_vmec_circular_replicates/circular_replicate_t700_ensemble_gate.png`` passes with mean-relative spread ``0.035`` and combined SEM/mean ``0.043`` on ``t=[350,700]``. QH and CTH-like remain useful feasibility and negative convergence results rather than transport validation claims: QH fails both ``32->48`` and ``48->64`` gates, and CTH-like fails its first grid check. DSHAPE and circular tokamak are ready for calibration-report admission with replicated nonlinear-window uncertainty evidence; QH and CTH-like should stay excluded until production-resolution convergence gates pass.
   * - Quasilinear spectrum-shape gate
     - ``tools/plot_quasilinear_spectrum_shape_gate.py``
     - HSX, W7-X, and Cyclone Miller gates closed; Cyclone retained as a failed model gate
     - current artifact bases: ``docs/_static/quasilinear_hsx_spectrum_shape_gate.png``, ``docs/_static/quasilinear_w7x_spectrum_shape_gate.png``, ``docs/_static/quasilinear_cyclone_miller_spectrum_shape_gate.png``, and ``docs/_static/quasilinear_cyclone_spectrum_shape_gate.png`` with JSON/PDF companions. They compare normalized linear heat-flux-weight spectra against normalized nonlinear resolved ``HeatFlux_kyst`` spectra. HSX passes with ``TV≈0.112`` and cosine ``≈0.971``; W7-X passes with ``TV≈0.056`` and cosine ``≈0.992``; Cyclone Miller passes with ``TV≈0.094`` and cosine ``≈0.983``; Cyclone is kept as a failed gate with ``TV≈0.215`` and cosine ``≈0.896``. This supports spectrum-shape diagnostics while identifying a real saturation/window/model limitation before absolute saturated-flux claims. KBM is deferred from this gate because the current quasilinear diagnostic validates electrostatic channels only and the KBM lane is electromagnetic.
   * - Zonal-flow / GAM response panel
     - ``tools/plot_zonal_flow_response.py``, ``tools/plot_zonal_flow_response_from_output.py``, ``tools/generate_miller_zonal_response_pilot.py``, ``tools/generate_w7x_zonal_response_panel.py``, ``tools/digitize_w7x_zonal_reference.py``, ``tools/compare_w7x_zonal_reference.py``, ``tools/plot_w7x_zonal_contract_audit.py``, ``tools/plot_w7x_zonal_moment_tail_audit.py``, ``tools/plot_w7x_zonal_closure_ladder.py``, ``tools/plot_w7x_zonal_state_convention_audit.py``, and ``tools/plot_w7x_zonal_recurrence_sweep.py``
     - Open
     - should combine shaped-tokamak Rosenbluth-Hinton-style residuals with W7-X residual/damping envelopes using one figure convention; use signed ``Phi_zonal_mode_kxt`` or case-specific signed line averages for publication claims and keep ``Phi2_zonal_t`` only as an intermediate cross-check. The current Merlo Case-III artifact is ``docs/_static/miller_zonal_response_pilot.png`` from the initial-density setup at ``Nz=32``, ``Nl=4``, ``Nm=24``, ``dt=0.005``, and ``t≈60``. With Rosenbluth-Hinton first-sample normalization it gives ``residual≈0.192`` against the Merlo et al. Figs. 12/16 read-off of about ``0.19``; a literature-faithful common fit window ``t≈30`` with separate positive/negative-extrema damping fits gives ``γ_GAM R0 / v_i≈-0.176`` against the paper-scale read-off near ``-0.17``; and Hilbert-phase frequency extraction on that same window gives ``ω_GAM R0 / v_i≈2.20`` against the paper-scale read-off near ``2.24``. A higher-moment audit lowers the recurrence ratio but over-damps the GAM, while weak hypercollision scans are effectively inert, so the frozen Merlo artifact remains on the current ``Nm=24`` baseline. The W7-X side now uses the potential initializer, signed line-average observable, paper-facing line-first normalization, and no hidden time-axis scaling. The tracked long-window W7-X artifact is ``docs/_static/w7x_zonal_response_panel.png`` with replayable traces in ``docs/_static/w7x_zonal_response_panel.traces.csv``; it reaches the digitized Fig. 11 windows, but ``docs/_static/w7x_zonal_reference_compare.json`` remains open because residuals fail at ``k_x rho_i=0.07``, ``0.10``, and ``0.30`` and the late envelopes are much larger than the digitized stella/GENE traces. ``docs/_static/w7x_zonal_contract_audit.png`` is now the paper-facing diagnostic panel for that open mismatch and is intentionally excluded from the release gate index. ``docs/_static/w7x_zonal_state_convention_audit.png`` closes the paper-facing state convention layer: the recovered Gaussian potential has relative ``L2`` error ``1.85e-6``, off-target spectral content is zero to reported precision, and the diagnostic helpers agree with manual line/volume reductions near ``2e-16``. ``docs/_static/w7x_zonal_moment_tail_audit.png``, ``docs/_static/w7x_zonal_closure_ladder_kx070.png``, ``docs/_static/w7x_zonal_recurrence_sweep_kx070.png``, and ``docs/_static/w7x_zonal_hypercollision_probe_kx070.png`` are companion open diagnostics; together they support a recurrence / moment-closure hypothesis and show that weak or constant closure can reduce velocity-space tails without closing the paper trace. The refreshed closure ladder now covers constant Hermite, ``k_z``-weighted Hermite, mixed Laguerre-Hermite, Laguerre-only, and isotropic hypercollision families at ``0.01`` and ``0.03``. The best mean trace error is the isotropic ``nu_hyper=0.01`` row at about ``0.2755`` versus baseline ``0.2861``, but its late-window standard-deviation ratio is about ``4.25`` versus baseline ``4.10``. Thus no bounded closure family improves trace error, late-envelope recurrence, and moment-tail metrics simultaneously. The W7-X generator exposes explicit hypercollision and Gaussian-width audit overrides so future closure probes are reproducible from the tracked tool. A newer high-moment four-wavelength audit under ``tools_out/zonal_response/w7x_publication_nl16_nm64_dt005_t100`` verifies finite signed traces to ``t≈100`` after restart-continuation fixes. The tracked W7-X TOML keeps ``gaussian_width=1`` because the benchmark source writes the initializer as ``exp[-(z-z0)^2]``; wider profiles and non-unit time scales are retained only as audits. The lane remains open pending a more physical W7-X damping/closure and velocity-space recurrence fix under the paper-facing convention.
   * - W7-X fluctuation-spectrum panel
     - ``tools/plot_w7x_fluctuation_spectrum_panel.py`` and ``tools/build_w7x_tem_extension_status.py``
     - Initial simulation-spectrum diagnostic closed; TEM/multi-flux validation open
     - current artifact bases: ``docs/_static/w7x_fluctuation_spectrum_panel.png``, ``docs/_static/tem_branch_parity_audit.png``, and ``docs/_static/w7x_tem_extension_status.png`` with CSV/JSON/PDF companions. The fluctuation panel is regenerated from the gated W7-X nonlinear ``t≈200`` NetCDF artifact, requires the corresponding nonlinear gate summary to pass before plotting, and records ``gate_index_include=false`` because it is a diagnostic figure rather than an additional release gate. It shows normalized ``k_y`` spectra for ``|\phi|^2``, ``W_\phi``, and ``|Q_i|``, the time-averaged ``k_x``-``k_y`` fluctuation-power map, the signed heat-flux spectrum, and a windowed temporal spectrum for the dominant nonzonal and zonal traces. The TEM audit explicitly keeps TEM linear parity open: maximum absolute relative growth-rate mismatch is about ``4.25``, maximum absolute relative frequency mismatch is about ``3.3`` away from the near-zero reference denominator, and the frequency branch has Spearman coefficient about ``-0.986``. Because the TEM reference is a provisional literature digitization rather than a direct case dump, this artifact blocks broad W7-X/TEM validation claims without being a standalone tuning target. The extension-status panel also keeps W7-X multi-alpha/multi-surface scans and kinetic-electron nonlinear windows open. This closes the reproducible simulation-spectrum panel needed for the current manuscript stack but not broad W7-X/TEM validation.
   * - Velocity-space convergence panel
     - ``tools/generate_observed_order_gate.py`` plus dedicated full convergence refresh script to add
     - Open
     - should follow GX-style convergence evidence and write an observed-order gate report through ``spectraxgk.benchmarking.observed_order_gate_report`` so convergence rate and final-grid error are tracked explicitly. The current atlas summary already records a machine-readable high-vs-low Cyclone grid convergence gate for the tracked convergence tile. The CSV-backed Cyclone velocity-space artifact is ``docs/_static/cyclone_resolution_observed_order.png`` with metadata in ``docs/_static/cyclone_resolution_observed_order.json``; the current office/GPU ``ky=0.30`` sweep passes the strict pairwise-order and final-error gate.
   * - Stellarator validation panel
     - dedicated script to add
     - Open
     - W7-X multi-flux-tube + zonal-flow response + HSX summary as needed; add heavy-electron EM verification before realistic-electron EM claims
   * - Performance panel
     - existing performance tooling
     - Closed for release-level scoped claims
     - current artifact bases: ``docs/_static/runtime_memory_benchmark.png``, ``docs/_static/nonlinear_rhs_profile_miller.png``, ``docs/_static/nonlinear_rhs_profile_stellarator_runtime.png``, ``docs/_static/full_nonlinear_rhs_trace_summary.json``, ``docs/_static/full_nonlinear_rhs_trace_gpu_summary.json``, and ``docs/_static/nonlinear_sharding_profile_office_gpu.json``. The release claim is limited to current cold/warm runtime accounting, CPU/GPU nonlinear RHS hot-path localization, W7-X/HSX runtime-mode stellarator smoke profiles, and numerical-identity gates. It is not a production nonlinear domain-decomposition speedup claim.
   * - Parallelization identity gate
     - ``tools/generate_parallel_ky_scan_gate.py``
     - Closed for independent Cyclone ``k_y`` batching
     - current artifact base: ``docs/_static/parallel_ky_scan_gate.png`` with CSV/JSON/PDF companions. This is a real linear-solver gate: it compares serial and fixed-shape ``k_y``-batched Cyclone scans, requires numerical identity for ``gamma`` and ``omega``, and reports batch speedup separately from the acceptance criterion.

Differentiable-Physics Figures
------------------------------

.. list-table::
   :header-rows: 1

   * - Figure
     - Owning script
     - Status
     - Notes
   * - Sensitivity-analysis figure
     - ``examples/theory_and_demos/quasilinear_implicit_sensitivity.py``
     - Initial quasilinear eigenbranch gate closed
     - current artifact base: ``docs/_static/quasilinear_implicit_sensitivity.png`` with JSON/PDF companions. It differentiates a tiny Cyclone linear-RHS quasilinear objective ``[gamma, omega, kperp_eff^2, Qhat_i, Q_i^ML]`` with respect to ``[R/Ln, R/LTi]`` using the implicit left/right eigenpair system and checks the result against nearest-branch central finite differences. This is a differentiability/conditioning gate, not an absolute nonlinear-flux validation.
   * - Solver-objective geometry-gradient gate
     - ``tools/build_solver_objective_gradient_gate.py``, ``tools/build_vmec_boozer_solver_frequency_gradient_gate.py``, ``tools/build_vmec_boozer_quasilinear_gradient_gate.py``, ``tools/build_vmec_boozer_nonlinear_window_gradient_gate.py``, ``tools/build_vmec_boozer_gradient_holdout_matrix.py``, ``tools/build_vmec_boozer_multi_point_objective_gate.py``, ``tools/build_vmec_boozer_aggregate_line_search_comparison.py``, ``tools/build_vmec_boozer_aggregate_alpha_holdout_gate.py``, ``tools/build_vmec_boozer_aggregate_surface_holdout_gate.py``, ``tools/build_vmec_boozer_second_equilibrium_aggregate_gate.py``, ``tools/check_vmec_boozer_aggregate_holdout_gate.py``, ``tools/check_production_nonlinear_optimization_guard.py``, ``tools/build_nonlinear_window_fd_audit.py``, and ``tools/build_vmec_boozer_nonlinear_window_fd_audit.py``
     - Solver-ready linear-RHS gradient gate closed; mode-21 VMEC/Boozer state-to-solver eigenfrequency, quasilinear heat-flux-weight, and reduced nonlinear-window estimator gates closed for QH plus Li383; compact and VMEC/Boozer-perturbed nonlinear startup FD audits closed only as plumbing checks; multi-alpha reduced aggregate objective plumbing passes; selected optimized-equilibrium post-transient transport-window audit closed; earlier aspect-6 projected transport-gradient candidate and strict rerun-WOUT top-12 QA candidate audited as negative long-window transfer results; nonlinear turbulence-gradient promotion remains open
     - current artifact bases: ``docs/_static/solver_objective_gradient_gate.png``, ``docs/_static/vmec_boozer_solver_frequency_gradient_gate.png``, ``docs/_static/vmec_boozer_quasilinear_gradient_gate.png``, ``docs/_static/vmec_boozer_nonlinear_window_gradient_gate.png``, ``docs/_static/vmec_boozer_li383_nonlinear_window_gradient_gate.png``, ``docs/_static/vmec_boozer_gradient_holdout_matrix.png``, ``docs/_static/vmec_boozer_multi_point_objective_gate.png``, ``docs/_static/vmec_boozer_aggregate_line_search_comparison.png``, ``docs/_static/vmec_boozer_aggregate_alpha_holdout_gate.png``, ``docs/_static/vmec_boozer_aggregate_surface_holdout_gate.png``, ``docs/_static/vmec_boozer_second_equilibrium_aggregate_gate.png``, ``docs/_static/vmec_boozer_aggregate_holdout_promotion_gate.json``, ``docs/_static/production_nonlinear_optimization_guard.png``, ``docs/_static/optimized_equilibrium_replicates/optimized_equilibrium_replicate_t700_ensemble_gate.png``, ``docs/_static/qa_projected_transport_step1e3_matched_comparison.png``, ``docs/_static/qa_projected_transport_step1e3_redesign_report.json``, ``docs/_static/strict_qa_top12_edge_prelaunch_gate.json``, ``docs/_static/strict_qa_top12_edge_matched_nonlinear_transport.png``, ``docs/_static/strict_qa_top12_edge_matched_nonlinear_transport.json``, ``docs/_static/strict_qa_top12_edge_redesign_report.json``, ``docs/_static/strict_qa_rerun_baseline_ensemble_gate.json``, ``docs/_static/strict_qa_top12_step1p25em3_candidate_ensemble_gate.json``, ``docs/_static/nonlinear_window_fd_audit.png``, and ``docs/_static/vmec_boozer_nonlinear_window_fd_audit.png`` with CSV/JSON/PDF companions where available. The first differentiates actual electrostatic linear-RHS eigenpair observables with respect to solver-ready geometry arrays using the implicit left/right eigenpair system and checks ``gamma``, ``omega``, ``<k_perp^2>``, linear heat/particle-flux weights, and a mixing-length heat-flux proxy against nearest-branch central finite differences. The VMEC/Boozer frequency gate starts from a real ``vmec_jax`` state coefficient, maps through ``booz_xform_jax`` with ``mboz=nboz=21``, builds the SPECTRAX-GK linear RHS, and verifies the eigenfrequency gradient. The VMEC/Boozer quasilinear gate uses a richer ``Nl=2, Nm=3`` moment basis and checks ``gamma``, ``omega``, ``<k_perp^2>``, ``Q_i`` weight, and ``gamma Q_i/kperp^2`` against finite differences. The nonlinear-window estimator gates feed those observables into a smooth RK2 late-window envelope and check heat-flux mean, coefficient of variation, and normalized trend gradients. The multi-equilibrium matrix repeats the frequency, quasilinear, and estimator gates on the tracked QH and Li383 fixtures. The multi-alpha aggregate artifact shows reduced quasilinear objective sensitivity across two field lines and two ``k_y`` samples, while the growth-vs-quasilinear comparison shows that the two reduced objectives can select different VMEC coefficient directions. The alpha-heldout and surface-heldout splits pass reduced generalization checks, Li383 passes the second-equilibrium aggregate finite-difference plus line-search gate, and the production nonlinear optimization guard now includes D-shaped/circular replicated long-window holdouts plus the optimized-equilibrium ``t=[350,700]`` seed/timestep audit. The earlier aspect-6 projected transport-gradient candidate also has passed baseline/candidate seed/timestep ensembles, but the matched comparison gives a relative reduction of ``-0.00585`` and is not promoted. The stricter rerun-WOUT top-12 QA edge candidate improves the 18-point reduced metric by ``2.29%`` and passes both long-window ensemble gates, but its matched ``t=[350,700]`` nonlinear comparison gives only ``0.58%`` reduction with uncertainty z-score ``0.20`` and is also not promoted; the prelaunch gate now records that this reduced margin would be blocked against the calibrated ``4%`` threshold, so the next blocker is predictive transfer margin. These negative transfers require a better-conditioned multi-surface, multi-alpha transport objective before another expensive nonlinear audit. The compact nonlinear FD audits run actual nonlinear windows and verify startup conditioning/response only; their transport-average gates are false because they do not discard a long transient or demonstrate running-mean convergence. A surface-stencil path is available for memory-bounded diagnostics; nonlinear turbulence-gradient and broader multi-surface optimization claims remain separate gates.
   * - Inverse/UQ figure
     - ``examples/theory_and_demos/autodiff_inverse_growth.py``, ``examples/theory_and_demos/autodiff_inverse_twomode.py``, and ``tools/plot_stellarator_optimization_uq.py``
     - Scoped inverse and UQ validation closed; global identifiability and production optimization claims remain scoped
     - current artifact bases: ``docs/_static/autodiff_inverse_growth.png``, ``docs/_static/autodiff_inverse_twomode.png``, and ``docs/_static/stellarator_itg_optimization_uq.png`` with JSON/PDF/CSV companions. The inverse examples check AD/finite-difference consistency and conditioning for one- and two-parameter reduced objectives, while the UQ panel reports local Gauss-Newton covariance, rank, and sensitivity-map diagnostics. These figures validate the differentiable workflow and uncertainty reporting, not global uniqueness or production nonlinear stellarator optimization.
   * - Solved VMEC-JAX QA geometry figure
     - upstream ``vmec_jax`` ``QA_optimization.py`` workflow plus local panel
       stitch from solved-boundary and Boozer diagnostics
     - Solved-equilibrium geometry visual ready for README/docs baseline context
     - current artifact base:
       ``docs/_static/vmec_jax_qa_solved_boundary_boozer_panel.png``. The
       figure compares the initial and optimized solved VMEC LCFS surfaces
       colored by ``|B|`` and the corresponding Boozer-LCFS ``|B|`` contours.
       This is the manuscript-facing geometry visual for the QA baseline. It
       is not a nonlinear heat-flux optimization claim and should not be
       conflated with the reduced synthetic max-mode-1 optimization panels.
   * - Full max-mode-5 QA transport optimizer sweep
     - ``tools/build_vmec_jax_qa_full_sweep_panel.py`` from office
       ``vmec_jax_qa_full_sweep_20260605`` VMEC-JAX outputs
     - Strict QA baseline provenance closed; transport rows are scoped
       optimizer-output diagnostics with nonlinear ``Q(t)`` promotion pending
     - current artifact base:
       ``docs/_static/vmec_jax_qa_full_sweep_panel.png`` with JSON/CSV
       companions; a PDF companion is regenerated locally with ``--pdf`` when
       needed. The current campaign uses the upstream-style VMEC-JAX
       max-mode-5 simple seed and records an admitted constraints-only QA
       baseline with aspect ``4.999999``, mean iota ``0.410200``, and QS
       residual ``8.91e-6``. The growth, quasilinear, and nonlinear-window
       rows restart from that solved QA baseline and use one representative
       sample, ``s=0.64``, ``alpha=0``, ``k_y rho_i=0.30``, with
       ``mboz=nboz=21``. They produce real optimizer outputs, not synthetic
       surfaces, and their mean iota remains within the exploratory diagnostic
       acceptance band ``|iota| >= 0.39`` even though the stricter QA-admission
       gate records the small shortfall from ``0.41``. Consequently the
       nonlinear heat-flux subplot is intentionally marked pending and no
       turbulent-flux reduction is claimed from this sweep. The old projected-weight nonlinear audits remain historical
       single-point evidence tied to an earlier baseline and should not be
       conflated with this strict-baseline panel.
   * - Boundary-coefficient transport landscape
     - ``tools/build_vmec_boundary_transport_landscape.py`` and ``tools/patch_vmec_jax_wout_metadata.py``
     - Open until the refreshed true nonlinear ensemble overlay completes
     - current artifact base: ``docs/_static/vmec_boundary_transport_landscape_rbc11_full.png`` with JSON/CSV companions. The scan perturbs the strict QA ``RBC(1,1)`` coefficient over ``[-75%, +75%]`` with 31 points and evaluates growth plus all explicit electrostatic quasilinear rules on the optimizer sample set, ``s=(0.45,0.64,0.78)``, ``alpha=(0,pi/4)``, and ``k_y rho_i=(0.10,0.30,0.50)``, at ``ntheta=16``, ``mboz=nboz=21``, ``Nl=1``, and ``Nm=2``. The bottom panel is reserved for true long-window post-transient nonlinear heat-flux means; reduced/startup nonlinear-window diagnostics are deliberately excluded. Until the full ``t_max=700``, ``t=[350,700]``, ``n64:64:64:40:40`` ensemble overlay finishes, this figure is a launch/noise diagnostic and optimizer-design input, not a promoted nonlinear turbulent-flux optimization result.
   * - Development-only optimization-plumbing figure
     - ``examples/theory_and_demos/reduced_stellarator_itg/compare_stellarator_itg_optimizations.py`` and ``tools/plot_stellarator_optimization_uq.py``
     - Initial differentiable objective-reduction and weighted-residual UQ gates closed for development diagnostics; full VMEC/Boozer/GK optimization open
     - primary artifact base: ``docs/_static/stellarator_itg_optimization_uq.png`` with JSON/PDF companions and individual diagnostic panels ``docs/_static/stellarator_itg_growth_optimization.png``, ``docs/_static/stellarator_itg_quasilinear_optimization.png``, and ``docs/_static/stellarator_itg_nonlinear_optimization.png``. The supporting reduced comparison sidecar ``docs/_static/stellarator_itg_optimization_comparison.json`` records objective histories and reduction ratios, but its companion PNG is a synthetic reduced max-mode-1 surface diagnostic and is not a solved-geometry optimization figure. These files live under ``examples/theory_and_demos/reduced_stellarator_itg`` rather than ``examples/optimization``. The UQ panel computes covariance from the final weighted objective residual Jacobian and shows derivative parity, control uncertainty, covariance projection, and rank/conditioning diagnostics. The production QA optimization examples are the VMEC-JAX-style scripts in ``examples/optimization``; they remain separate from this diagnostic artifact stack and still require solved-WOUT gates plus converged nonlinear audits before transport-optimization claims.
   * - VMEC/Boozer parity matrix
     - ``tools/build_vmec_boozer_parity_matrix.py``
     - Multi-equilibrium zero-beta equal-arc geometry gate closed at ``mboz=nboz=21``
     - current artifact base: ``docs/_static/vmec_boozer_parity_matrix.png`` with JSON/CSV/PDF companions. It checks QH, QI, and shaped-tokamak fixtures against the imported VMEC/EIK runtime convention and rejects ``mboz,nboz < 21``. The current limiting matrix row is QI drift at ``7.13e-2`` against the ``8e-2`` release tolerance; evaluated QI ``ntheta=8,16`` variants pass, while input-only QI seeds without bundled ``wout`` references are explicitly artifact-limited. This should be presented as a field-line geometry convention gate, not as a finite-beta transport-gradient validation.

Caption Policy
--------------

Every manuscript-facing figure should answer these questions directly in the
caption:

1. what case and model are shown,
2. what horizon or fit window is used,
3. what the reference is,
4. what agreement was expected,
5. what agreement was found.

Open Issues Before Drafting
---------------------------

- close the W7-X zonal-flow residual and late-envelope mismatch under the
  paper-facing line-first normalization; current time coverage is sufficient,
  but residuals fail at three wavelengths and late envelopes remain too large
- either close the long-time recurrence follow-up for the shaped-tokamak
  Rosenbluth-Hinton / GAM response benchmark or explicitly scope publication
  claims to the benchmark-scale pre-recurrence metrics now frozen in the
  Merlo Case-III artifact
- extend the W7-X fluctuation-spectrum diagnostic to a Doppler-reflectometry
  transfer-function comparison if experimental-facing claims enter the paper
- extend eigenfunction-overlap metrics beyond the closed KBM and W7-X raw
  overlays where additional literature-backed mode-shape references are useful
- tighten the current windowed nonlinear statistics panel with stricter case-specific gates where reference windows justify them
- tie ETG nonlinear claims to the benchmark literature or keep them framed as a pilot
- add or refine publication-ready zonal-flow closure figures before making W7-X
  recurrence claims
- add an experimental transfer-function fluctuation-spectrum panel only if
  experimental-facing W7-X claims enter the paper