← NOTES

The data we refuse to look at

Methodology April 9, 2026

Everyone keeps some data aside to test a strategy on material it has never seen. Holding to it is the hard part. In practice the reserved data has a way of drifting into view. Someone checks it, just to be sure. A weak result sends the team back to revise, and the revised version gets tested against the same reserved period, which no longer counts, because the revision was shaped by what that period showed. Enough rounds of this and the held-out data has become part of the research, and the reassurance it was meant to provide is gone.

What holding data out actually takes

We keep data that research never touches. It plays no role in building a strategy, tuning it, choosing between versions, or judging when something is ready. Its one job is to sit unseen until a candidate has cleared everything else, and then to answer a single question, one time.

The discipline this asks for is mostly the discipline to turn down reasonable requests. Someone wants to see how a candidate does there, just to know whether the work is on track. The answer is no. The moment that answer exists it starts steering the research, and the reserved period becomes one more surface to fit against, at a slower and more expensive pace than the mistake it was supposed to prevent.

One look, and it stands

A strategy reaches the held-out data by surviving every earlier stage, and then it gets one look. A failure there closes that line of work. We do not adjust and try again, because anything you can rerun until it passes has stopped being a test and become a search, and we already know what searching does to a number.

The worth of a held-out period is that it cannot be repeated. Spend it carelessly and it does not come back; the only way to recover a genuine out-of-sample test is to wait for new time to pass.

Held-out data is a deliberate inconvenience. It slows the work, kills projects the team is fond of, and offers nothing in the moment beyond its refusal to reassure. The return comes later. When a strategy finally clears it, the result carries weight, because for once nothing about the strategy was shaped by the answer.