One row per evaluation that ever touched the sealed test split: when, by
which protocol (hash), and with what headline result. Entries are
appended automatically by validate_protocol() and printed in full by
coding_report(). The mechanism is visibility, not enforcement – an
in-memory object cannot stop a determined user, and does not pretend to;
archive the study's LLM call log (see the archive workflow) when tamper
evidence is needed.
Arguments
- x
A
gold_set().
