
Releases: instadeepai/Mava

0.1.3

15 Jun 14:29
bb1c893
Pre-release

Highlights

This is the last release of the TensorFlow systems. After this release, the TensorFlow systems will be deprecated in favour of the Jax systems and our new callback redesign (#457).

Systems

  • Updates to acme, reverb and tensorflow.
  • Working centralised and state based architectures.
  • Recurrent and Multiple Trainer PPO.

Environments

What's Changed

Full Changelog: 0.1.2...0.1.3

0.1.2

28 Mar 09:05
25d87f0
Pre-release

Highlights

Systems

  • Fixed an observation network bug in mappo and changed the implementation to use two optimisers.
  • Fixed the maddpg/mad4pg loss calculation.
  • Began work on Jax system implementations.

Environments

What's Changed

Full Changelog: 0.1.1...0.1.2

0.1.1

25 Feb 10:13
424025b
Pre-release

Highlights

Systems

  • Stable versions of all systems, notably mappo, vdn and qmix.
  • Multiple trainer implementations for maddpg and mad4pg.
  • Removed the dial system.

Environments / Environment Wrappers

What's Changed

New Contributors

Full Changelog: 0.1.0...0.1.1

Mava-0.1.0

06 Jul 14:59
80004d3
Pre-release

Highlights

Mava Core

  • Components

    • Architectures
      • Added Centralised, Decentralised, Networked and State Based Architectures.
    • Modules
      • Added Broadcast Communication, Epsilon Decay Scheduling, Additive and Monotonic Mixing and Fingerprint Stabilization.
    • Networks
      • Added Additive and Monotonic Mixing Networks, Hypernetworks, Communication Networks, Epsilon Greedy and DiscreteValued head.
  • Environment Loops

    • Added Parallel and Sequential Environment Loops.
  • Adders

    • Added Parallel versions of Transition, Sequential and Episode Adders.

Systems

  • Added feedforward training for maddpg, mad4pg, madqn, mappo, vdn and qmix.
  • Added recurrent training for madqn, dial, maddpg and mad4pg.
  • Added continuous network heads for maddpg, mad4pg and mappo.
  • Added decentralised architecture training for maddpg, mad4pg, madqn, mappo, dial, vdn and qmix.
  • Added centralised architecture training for maddpg, mad4pg and mappo.
  • Added state based architecture training for maddpg and mad4pg.
  • Added networked architecture training for maddpg.

Environments / Environment Wrappers

  • Added PettingZoo, SMAC, RoboCup, OpenSpiel, Flatland, Debug Simple Spread, Debug Switch environment and Debug Two-Step game.

Examples

  • Added quickstart notebook.
  • Added basic examples for sample systems and environments.

Minor Changes and Fixes

MAVA Pre-release

09 Jun 13:02
a5236d5
Pre-release

MAVA pre-release test before the official 0.1.0 release.