starscry

Reward Integrity Index

Can an RL environment’s reward be satisfied without doing the task? We answer it — deterministically, and you re-run every result.

Don't trust us — re-run the seed.

20 audited2 F5 ERR3 C9 A1 difficulty

The board as a living instrument. Every point is seed-pinned and sealed by a content hash — hover any datum for its hash and the exact command that reproduces it, byte-for-byte.

Board health

graded artifacts over time

Exploit-search: gamed in N sims

how little search it takes to construct a grader-accepted non-attempt

20 audited artifacts · refreshed 2026-07-04T00:00:00+00:00

A clean · C one reward-hack signature · F two-plus, or a confirmed exploit · ERR didn't load or roll out (an env-compatibility signal, not a pass). Click a header to sort; hover a row for detail.

gradeenvkindsourcefindingsworst_marginverifiersreport
Fskill_reward_hackinggaming_auditprime-hubgarbage 16 · ipt 0 · sound 18 · compl 0 · exploit 2 +11.7730.1.14card
Fskill_reward_hackinggaming_audit_multiturnprime-hubgarbage 10 · smells 2 · mut 10 · exploit 2 +0.5760.1.14card
ERRreward_benchgaming_auditprime-hubgarbage 0 · ipt 0 · sound 0 · compl 0 · exploit 00.1.6.post0card
ERRsynlogicgaming_auditprime-hubgarbage 0 · ipt 0 · sound 0 · compl 0 · exploit 00.1.6.post0card
ERRhud_text_2048gaming_audit_multiturngarbage 0 · smells 0 · mut 0 · exploit 0card
ERRlights_outgaming_audit_multiturngarbage 0 · smells 0 · mut 0 · exploit 0 -0.500card
ERRsudokugaming_audit_multiturngarbage 0 · smells 0 · mut 0 · exploit 0 -0.500card
Callenai_ifevalgaming_auditprime-hubgarbage 1 · ipt 0 · sound 0 · compl 0 · exploit 0 +0.5000.1.14card
Canchoring_trapgaming_auditprime-hubgarbage 3 · ipt 0 · sound 0 · compl 0 · exploit 0 +0.4190.1.14card
Cifevalgaming_auditprime-hubgarbage 3 · ipt 0 · sound 0 · compl 0 · exploit 0 +0.5000.1.14card
Aaime2025gaming_auditprime-hubgarbage 0 · ipt 0 · sound 0 · compl 0 · exploit 0 -0.5000.1.15.dev187card
Aascii_treegaming_auditprime-hubgarbage 0 · ipt 0 · sound 0 · compl 0 · exploit 0 -0.5000.1.14card
Agsm8kgaming_auditprime-hubgarbage 0 · ipt 0 · sound 0 · compl 0 · exploit 0 -0.5000.1.6.post0card
Amastermindgaming_auditprime-hubgarbage 0 · ipt 0 · sound 0 · compl 0 · exploit 0 -0.5000.1.6.post0card
Ameta_reward_hack_formatgaming_auditprime-hubgarbage 0 · ipt 0 · sound 0 · compl 0 · exploit 0 -0.2700.1.14card
Aofc_gymgaming_auditfirst-partygarbage 0 · ipt 0 · sound 0 · compl 0 · exploit 0 -0.500card
Apydantic_adherencegaming_auditprime-hubgarbage 0 · ipt 0 · sound 0 · compl 0 · exploit 0 -0.5000.1.14card
Areverse_textgaming_auditprime-hubgarbage 0 · ipt 0 · sound 0 · compl 0 · exploit 0 -0.5000.1.14card
Asadgaming_auditprime-hubgarbage 0 · ipt 0 · sound 0 · compl 0 · exploit 0 -0.5000.1.5card
sokobandifficultyρ(moves) 0.837report

Audit your environment

Shipping a verifiers-format environment? We run this exact deterministic, model-free audit — gaming & difficulty — and hand back a sealed, replayable verifier card your buyers re-run themselves. No trust required.

Audit your environment →