Skip to content

Mava-0.1.0

Pre-release
Pre-release
Compare
Choose a tag to compare
@KaleabTessera KaleabTessera released this 06 Jul 14:59
· 4361 commits to main since this release
80004d3

Highlights

Mava Core

  • Components

    • Architectures
      • Added Centralised, Decentralised, Networked and State Based Architectures.
    • Modules
      • Added Broadcast Communication, Epsilon Decay Scheduling, Additive and Monotonic Mixing and Fingerprint Stabilization.
    • Networks
      • Added Additive and Monotic Mixing Networks, Hypernetworks, Communication Networks, Epsilon Greedy and DiscreteValued head.
  • Environment Loops

    • Added Parallel and Sequential Environment Loops.
  • Adders

    • Added Parallel versions of Transition, Sequential and Episode Adders.

Systems

  • Added feedforward training for maddpg, mad4pg, madqn, mappo, vdn and qmix.
  • Added recurrent training for madqn, dial, maddpg and mad4pg.
  • Added continuous network heads for maddpg, mad4pg and mappo.
  • Added decentralised architecture training for maddpg, mad4pg, madqn, mappo, dial, vdn and qmix.
  • Added centralised architecture training for maddpg, mad4pg and mappo.
  • Added state based architecture training for maddpg and mad4pg.
  • Added networked architecture training for maddpg.

Environments/ Environment Wrappers

  • Added PettingZoo, SMAC, RoboCup, OpenSpiel, Flatland, Debug Simple Spread, Debug Switch environment and Debug Two-Step game.

Examples

  • Added quickstart notebook.
  • Added basic examples for sample systems and environments.

Minor Changes and Fixes