Formal Statistical Learning Theory in Lean 4

A machine-checked statistical learning theory library in Lean 4 / mathlib: one axiom-clean spine from empirical risk minimization through VC, Rademacher, and a tight change-of-measure PAC-Bayes layer. Among the Lean SLT libraries we surveyed, FormalSLT is the only one with a tight (change-of-measure) PAC-Bayes track — a machine-checked Donsker–Varadhan variational inequality wired into McAllester grid-peeling and Catoni posterior-risk bounds.

Every public theorem is checked with no sorry, no admit, no custom axiom; the full dependency chain closes against only the standard mathlib axioms [propext, Classical.choice, Quot.sound]. As of 2026-06-08: 60 modules, 31,533 lines of Lean. Zero sorry, zero admit, zero custom axioms. Axiom-clean against {propext, Classical.choice, Quot.sound}.

The architecture diagram shows the verified proof-dependency graph: core ERM definitions feed the concentration backbone and the symmetrization layer; the concentration backbone supplies sharp McDiarmid via Mathlib's conditional sub-Gaussian engine; symmetrization plus capacity feed the VC route; Donsker-Varadhan plus the iid product MGF bridge feed the PAC-Bayes track. Dark boxes are headline theorems whose axiom transcript appears below.

What is the contribution

The distinctive, verified result is the tight change-of-measure PAC-Bayes track:

PACBayesKL.donsker_varadhan: the Donsker–Varadhan variational inequality (real proof, not a stub).
PACBayesMcAllester.pacbayes_changeOfMeasure and PACBayesBoundedLoss.finiteMcAllesterGridPeeling_badEventMass_le_delta: McAllester-style bounds with grid λ-peeling.
PACBayesBoundedLoss.finiteCatoni_badEventMass_le_delta: Catoni posterior-risk bound.

Prior art for context (so the claim is precise): among Lean SLT libraries we surveyed, lean-rademacher and YuanheZ/lean-stat-learning-theory have no PAC-Bayes; the one with any PAC-Bayes (formal-learning-theory-kernel) has only the loose union-bound form and explicitly defers the tight change-of-measure version (PACBayes.lean: "TODO: Prove the tight version via change of measure (Catoni 2007)"). FormalSLT supplies that tight version, inside an end-to-end SLT development.

Comparison with adjacent formalizations

Property	FormalSLT	lean-rademacher (Sonoda et al., 2025)	Karayel-Tan AFP (2023)	mathlib4 (upstream)
Proof assistant	Lean 4 + Mathlib4	Lean 4 + Mathlib4	Isabelle / HOL	Lean 4
Sharp McDiarmid (constant 2)	yes, upper + lower + two-sided	yes, one-sided + negated form	yes, McDiarmid 1989 Lemma 1.2 form	no theorem named McDiarmid; sharp Hoeffding building blocks present
Sharp McDiarmid proof route	exposure martingale into Mathlib `measure_sum_ge_le_of_hasCondSubgaussianMGF`	direct Hoeffding-lemma argument, Hoeffding rebuilt in-tree	direct Hoeffding-lemma route (AFP `Hoeffdings_lemma_bochner` imports)	sub-Gaussian MGF API only (`Probability.Moments.SubGaussian`, R. Degenne 2025)
Tight PAC-Bayes (change of measure)	yes: Donsker-Varadhan, Catoni, McAllester grid peeling on bounded loss	no PAC-Bayes layer	no PAC-Bayes layer	no PAC-Bayes layer
Generalization spine	finite-class ERM through VC and PAC-Bayes confidence bounds	Rademacher-complexity uniform deviation pipeline	concentration toolkit, no learning spine	probability primitives only
Public axioms	`[propext, Classical.choice, Quot.sound]` with axiom transcript published	source-level scan: no `sorry` / `admit` / custom axiom across 11 `FoML/*.lean` files	standard Isabelle / HOL kernel	standard Lean / Mathlib axioms

The single most defensible claim that anchors this table: FormalSLT routes the exposure martingale through Mathlib's conditional sub-Gaussian engine, and pairs the resulting sharp McDiarmid bound with a tight change-of-measure PAC-Bayes track inside one axiom-clean library. lean-rademacher proves the same bound by a direct Hoeffding-lemma argument and has no PAC-Bayes; Karayel-Tan is Isabelle and has no learning-theory spine. Full prior-art notes are in docs/related-work.md; the table source is docs/comparison-table.md.

What's inside

1. The generalization pipeline: ERM → Rademacher → VC → PAC-Bayes

Stage	Representative theorem
ERM excess risk	`ERM.erm_excessRisk_le_two_uniformDeviation`
Symmetrization	`Rademacher.Symmetrization.expected_genGap_le_two_expected_empiricalRademacherComplexity`
Massart finite-class bound	`Rademacher.Massart.massart_finite_class`
High-probability Rademacher	`Rademacher.HighProbability.genGap_highProb_rademacher`
Sauer–Shelah	`VC.SauerShelah.sauerShelah_polynomial_bound`
VC ERM tail + sample complexity	`VC.SampleComplexity.vc_erm_excessRisk_tail`, `vc_erm_sample_complexity`
Tight PAC-Bayes	`PACBayesKL.pacbayes_changeOfMeasure`, `PACBayesBoundedLoss.finiteCatoni_badEventMass_le_delta`

2. Concentration backbone (`Azuma.*`, `Concentration.SharpMcDiarmid`)

A sharp (constant-2) McDiarmid bounded-differences inequality is included as a verified component and wired to the generalization gap:

Theorem	What it gives
`Azuma.HasBoundedDifferences`	the textbook per-coordinate predicate `∀ S k z', \|f S − f (update S k z')\| ≤ c k`
`Concentration.mcdiarmid_of_hasBoundedDifferences_sharp`	general product-measure upper tail `≤ exp(-2 ε² / Σ c_k²)`
`Concentration.mcdiarmid_twoSided_of_hasBoundedDifferences_sharp`	two-sided `≤ 2·exp(-2 ε² / Σ c_k²)`
`Azuma.ExposureMartingale.genGap_tail_bound_sharp`	the sharp inequality applied to the SLT generalization gap

The sharp constant is earned via the exposure-martingale increment's conditional fiber range (proxy (c_k/2)², vs the Azuma c_k² route, which co-exists in the same files). McDiarmid is classical and already formalized elsewhere in Lean and Isabelle — this is a verified re-formalization, not a first; see Scope and boundaries.

The 4x exponent gap between the sharp bound exp(-2 ε² / Σ c²) and the Azuma-loose exp(-ε² / (2 Σ c²)) form is plotted across representative n and c_i values below:

The math is classical (Boucheron-Lugosi-Massart, Theorem 6.2); the FormalSLT contribution is the mechanized Lean 4 / Mathlib proof and its wiring into genGap.

3. The finite Dudley chaining layer (`Covering.*`)

Discrete/finite chaining: two-scale peeling (Covering.DudleyChaining), the total-bounded discrete entropy sum (Covering.TotalBoundedDudley), finite two-point and unit-interval instances (Covering.TwoPointDudley, Covering.UnitIntervalDudley), and the finite sub-Gaussian chaining foundation (Covering.FiniteSubGaussianChaining). This layer is finite/discrete throughout; the continuous Dudley entropy integral is not part of this library.

Module map

Layer	Modules
Core	`Risk`, `ERM`, `UniformConvergence`, `GhostSample`
Probability	`Probability.{Concentration, FiniteUnionBound, FiniteExpectation, BernsteinMGF}`
Rademacher	`Rademacher.{FiniteSample, Symmetrization, Massart, HighProbability, UniformDeviation, ERMGeneralization, Contraction, LinearPredictor, …}`
Concentration / McDiarmid	`Azuma.{HasBoundedDifferences, ExposureMartingale, ExposureIncrementCondMGF, GenGapTail}`, `Concentration.SharpMcDiarmid`, `Concentration.SubGamma.*`
VC	`VC.{Dimension, SauerShelah, Rademacher, SampleComplexity, BinaryVCBridge}`
Covering (finite)	`Covering.{Rademacher, DudleyChaining, FiniteDiscreteDudley, FiniteSubGaussianChaining, TotalBoundedDudley, TwoPointDudley, UnitIntervalDudley}`
Stability / PAC-Bayes	`AlgorithmicStability`, `Stability.BousquetElisseeff`, `PACBayesKL`, `PACBayesMcAllester`, `PACBayesFiniteProductMGF`, `PACBayesBoundedLoss`

The library is 60 modules / ≈31,500 lines of Lean (mathlib excluded).

Reproducing the build and axiom audit

Install elan; it reads lean-toolchain and fetches the pinned Lean automatically.

git clone https://github.com/Robby955/FormalSLT.git
cd FormalSLT
lake exe cache get      # download the pinned Mathlib oleans
lake build FormalSLT    # builds the whole library (2951 jobs)

Axiom transcript (captured on `origin/main`)

'FormalSLT.PACBayesBoundedLoss.finiteCatoni_badEventMass_le_delta'                  depends on axioms: [propext, Classical.choice, Quot.sound]
'FormalSLT.PACBayesBoundedLoss.posteriorRisk_bound_of_priorDeviationMGF_le'         depends on axioms: [propext, Classical.choice, Quot.sound]
'FormalSLT.Concentration.mcdiarmid_of_hasBoundedDifferences_sharp'                  depends on axioms: [propext, Classical.choice, Quot.sound]
'FormalSLT.Concentration.mcdiarmid_twoSided_of_hasBoundedDifferences_sharp'         depends on axioms: [propext, Classical.choice, Quot.sound]
'FormalSLT.Rademacher.Massart.massart_finite_class'                                 depends on axioms: [propext, Classical.choice, Quot.sound]
'FormalSLT.VC.SampleComplexity.vc_erm_sample_complexity'                            depends on axioms: [propext, Classical.choice, Quot.sound]

propext, Classical.choice, and Quot.sound are the three axioms underlying ordinary classical mathematics in mathlib; there is no project-specific axiom and no sorry.

Scope frontier

The two-column frontier diagram lays out what FormalSLT covers and what remains open:

For the full scope statement and assumptions, see docs/assumptions-and-nonclaims.md.

Scope and honest boundaries

The contribution is the verified formalization and integration, not new mathematics.

The tight PAC-Bayes track is the distinctive piece. Scoped precisely: among the Lean SLT libraries surveyed, FormalSLT is the only one with the tight change-of-measure form; the one other Lean PAC-Bayes library leaves it as a TODO. This is a survey-scoped claim, not an absolute "first."
McDiarmid is not a first in any proof assistant. Karayel–Tan (AFP 2023, Isabelle/HOL) and lean-rademacher (Sonoda et al., arXiv:2503.19605) both have a McDiarmid; FormalSLT's is a verified re-formalization wired into the spine, not a priority claim.
The general sharp McDiarmid theorem is the one-sided upper tail; lower-tail and two-sided are separate proven siblings.
StandardBorelSpace Z is required (from mathlib's condExpKernel); covers ℝ, Bool, and all Polish spaces.
Homogeneous iid product for the general theorem; independent-non-identical coordinates are a separate sibling.
Finite/discrete-first throughout — finite hypothesis classes, finite grids, discrete Dudley. No continuous Dudley integral, no continuous-posterior PAC-Bayes (those are in-progress work, not in this library).

Documentation index

docs/comparison-table.md: full peer-library comparison with sources.
docs/architecture-flowchart.svg: proof-dependency architecture (shown above).
docs/sample-vs-tightness.svg: sharp vs Azuma-loose tail-bound plot.
docs/frontier-diagram.svg: in-scope and out-of-scope at a glance.
docs/diagrams.md: Mermaid theorem-dependency diagrams (indexed view).
docs/theorem-map.md: exact theorem names and statements.
docs/related-work.md: adjacent Lean / Isabelle projects.
docs/assumptions-and-nonclaims.md: scope and assumptions in detail.

How to cite

@software{formal_slt,
  title  = {FormalSLT: Formal Statistical Learning Theory in Lean 4},
  author = {Sneiderman, Robert},
  year   = {2026},
  url    = {https://github.com/Robby955/FormalSLT}
}

License

Released under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 60 Commits
.github		.github
FormalSLT		FormalSLT
docs		docs
examples		examples
scripts		scripts
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CITATION.cff		CITATION.cff
CONTRIBUTING.md		CONTRIBUTING.md
FormalSLT.lean		FormalSLT.lean
LICENSE		LICENSE
README.md		README.md
lake-manifest.json		lake-manifest.json
lakefile.lean		lakefile.lean
lean-toolchain		lean-toolchain

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Formal Statistical Learning Theory in Lean 4

What is the contribution

Comparison with adjacent formalizations

What's inside

1. The generalization pipeline: ERM → Rademacher → VC → PAC-Bayes

2. Concentration backbone (`Azuma.*`, `Concentration.SharpMcDiarmid`)

3. The finite Dudley chaining layer (`Covering.*`)

Module map

Reproducing the build and axiom audit

Axiom transcript (captured on `origin/main`)

Scope frontier

Scope and honest boundaries

Documentation index

How to cite

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Formal Statistical Learning Theory in Lean 4

What is the contribution

Comparison with adjacent formalizations

What's inside

1. The generalization pipeline: ERM → Rademacher → VC → PAC-Bayes

2. Concentration backbone (Azuma.*, Concentration.SharpMcDiarmid)

3. The finite Dudley chaining layer (Covering.*)

Module map

Reproducing the build and axiom audit

Axiom transcript (captured on origin/main)

Scope frontier

Scope and honest boundaries

Documentation index

How to cite

License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

2. Concentration backbone (`Azuma.*`, `Concentration.SharpMcDiarmid`)

3. The finite Dudley chaining layer (`Covering.*`)

Axiom transcript (captured on `origin/main`)

Packages