Models

One card per (model, protocol) baseline. Ranked rows are current, complete, and task-set-pinned; diagnostic rows remain visible for auditability.