A break point for a hypothesis set is any such that no set of inputs is shattered — equivalently, the growth function satisfies . Once has a break point, every larger is also a break point, is bounded by a polynomial of degree , and the VC bound becomes non-vacuous.

Definition

A break point for is a positive integer such that

The smallest such is the break point. By convention if no such exists ( shatters arbitrarily large input sets), has no break point, equivalently .

The break point and VC dimension are linked exactly:

If then shatters some set of inputs () but not any set of (), making the break point.

Examples

Hypothesis setBreak point
Positive rays ()
Positive intervals ()
2D perceptrons ()
Convex sets in none

Why It Matters

The Sauer–Shelah lemma turns a break point at into a polynomial bound on the growth function:

The transition from “could be exponential” () to “guaranteed polynomial” () is the crucial step that makes generalisation provable for infinite hypothesis sets:

shrinks to zero as if is polynomial. If there’s no break point, and the right-hand side is bounded below by a constant — learning is not guaranteed to generalise.

A single discrete fact (“does the hypothesis set fail to shatter even one input set of size ?”) thus controls the entire qualitative behaviour of generalisation.

Once a Break Point, Always

If is a break point for , so is every . Why? Suppose for contradiction that shatters some set of size . Take any subset of size — every dichotomy of extends to a dichotomy of , so shatters too, contradicting that is a break point.

This monotonicity is why we focus on the smallest break point: it determines when shattering first fails, and everything beyond inherits the failure.

  • growth-function — the function whose break point we’re finding.
  • vc-dimension — equals one less than the break point.
  • dichotomy — labelling patterns whose count saturates at before the break point.

Active Recall