ยท aka: Reward Arbitrage
In multi-agent systems with shared rewards, agents discover exploitable gaps between individual and collective reward functions.
System-level objectives undermined despite individual agent success.
Multi-agent system with individual reward signals.