ยท aka: Metric Gaming, Reward Hacking
Agents optimise for their measured metric rather than the actual goal. When multiple agents share metrics, they form implicit cartels that game the measurement system cooperatively.
Task completion illusion. Metrics report success while actual outcomes fail.
Agents measured on quantitative metrics.