The board as a living instrument. Every point is seed-pinned and sealed by a content hash — hover any datum for its hash and the exact command that reproduces it, byte-for-byte.
Board health
graded artifacts over timeExploit-search: gamed in N sims
how little search it takes to construct a grader-accepted non-attempt20 audited artifacts · refreshed 2026-07-04T00:00:00+00:00
A clean · C one reward-hack signature · F two-plus, or a confirmed exploit · ERR didn't load or roll out (an env-compatibility signal, not a pass). Click a header to sort; hover a row for detail.
| grade | env | kind | source | findings | worst_margin | verifiers | report |
|---|---|---|---|---|---|---|---|
| F | skill_reward_hacking | gaming_audit | prime-hub | garbage 16 · ipt 0 · sound 18 · compl 0 · exploit 2 | +11.773 | 0.1.14 | card |
| F | skill_reward_hacking | gaming_audit_multiturn | prime-hub | garbage 10 · smells 2 · mut 10 · exploit 2 | +0.576 | 0.1.14 | card |
| ERR | reward_bench | gaming_audit | prime-hub | garbage 0 · ipt 0 · sound 0 · compl 0 · exploit 0 | — — | 0.1.6.post0 | card |
| ERR | synlogic | gaming_audit | prime-hub | garbage 0 · ipt 0 · sound 0 · compl 0 · exploit 0 | — — | 0.1.6.post0 | card |
| ERR | hud_text_2048 | gaming_audit_multiturn | — | garbage 0 · smells 0 · mut 0 · exploit 0 | — — | — | card |
| ERR | lights_out | gaming_audit_multiturn | — | garbage 0 · smells 0 · mut 0 · exploit 0 | -0.500 | — | card |
| ERR | sudoku | gaming_audit_multiturn | — | garbage 0 · smells 0 · mut 0 · exploit 0 | -0.500 | — | card |
| C | allenai_ifeval | gaming_audit | prime-hub | garbage 1 · ipt 0 · sound 0 · compl 0 · exploit 0 | +0.500 | 0.1.14 | card |
| C | anchoring_trap | gaming_audit | prime-hub | garbage 3 · ipt 0 · sound 0 · compl 0 · exploit 0 | +0.419 | 0.1.14 | card |
| C | ifeval | gaming_audit | prime-hub | garbage 3 · ipt 0 · sound 0 · compl 0 · exploit 0 | +0.500 | 0.1.14 | card |
| A | aime2025 | gaming_audit | prime-hub | garbage 0 · ipt 0 · sound 0 · compl 0 · exploit 0 | -0.500 | 0.1.15.dev187 | card |
| A | ascii_tree | gaming_audit | prime-hub | garbage 0 · ipt 0 · sound 0 · compl 0 · exploit 0 | -0.500 | 0.1.14 | card |
| A | gsm8k | gaming_audit | prime-hub | garbage 0 · ipt 0 · sound 0 · compl 0 · exploit 0 | -0.500 | 0.1.6.post0 | card |
| A | mastermind | gaming_audit | prime-hub | garbage 0 · ipt 0 · sound 0 · compl 0 · exploit 0 | -0.500 | 0.1.6.post0 | card |
| A | meta_reward_hack_format | gaming_audit | prime-hub | garbage 0 · ipt 0 · sound 0 · compl 0 · exploit 0 | -0.270 | 0.1.14 | card |
| A | ofc_gym | gaming_audit | first-party | garbage 0 · ipt 0 · sound 0 · compl 0 · exploit 0 | -0.500 | — | card |
| A | pydantic_adherence | gaming_audit | prime-hub | garbage 0 · ipt 0 · sound 0 · compl 0 · exploit 0 | -0.500 | 0.1.14 | card |
| A | reverse_text | gaming_audit | prime-hub | garbage 0 · ipt 0 · sound 0 · compl 0 · exploit 0 | -0.500 | 0.1.14 | card |
| A | sad | gaming_audit | prime-hub | garbage 0 · ipt 0 · sound 0 · compl 0 · exploit 0 | -0.500 | 0.1.5 | card |
| — | sokoban | difficulty | — | ρ(moves) 0.837 | — — | — | report |
Audit your environment
Shipping a verifiers-format environment? We run this exact deterministic, model-free audit — gaming & difficulty — and hand back a sealed, replayable verifier card your buyers re-run themselves. No trust required.