Why do Coverage Gradient Strategies work so nicely in Cooperative MARL? Proof from Coverage Illustration
In cooperative multi-agent reinforcement studying (MARL), as a result of its on-policy nature, coverage gradient (PG) strategies are usually believed...