chore(release): v0.8.0
NOBLES5E committed Sep 26, 2021
1 parent 69dc482 commit 7e394b5
Showing 1 changed file with 54 additions and 4 deletions.
58 changes: 54 additions & 4 deletions CHANGELOG.md
@@ -1,29 +1,79 @@
# CHANGELOG

## [unreleased]
## [0.8.0] - 2021-09-26

### Bug Fixes

- Fix ci pypi versioning
- Remove __init__.py and python __version__, use cargo version
- Only run publish once on git tag
- Move import bagua_install_library to install library function
- Merge bagua_install_library and setup.py, remove nccl<=2.6 support
- Fix alltoall_v parameter (#17)
- Reduce and allgather python interface
- Fix decompress incorrect pointer and typo in error msg
- Fix python gil deadlock during getting data ptr
- Fix benchmark script requirements
- Fix alltoall_v parameter types (#27)
- Always mark bagua padding tensor as ready
- Make compress/decompress of BaguaTensor `method` string consistent (#33)
- Fix scatter and reduce_scatter implementation (#40)
- Subtract overflow error for decentralized op (#39)
- Fix QADAM params (#17)
- Fix assert precision (#18)
- Replace mutex with atomic bool for async op and add Aluminum submodule update (#67)
- Fix duplicated dependency downloading during installation (#77)
- Fix async algorithm aborting (#78)
- Fix async algorithm hanging on abort (#81)
- Fix qadam algorithm call (#20)
- Fix missing symbols in the zip library (#24)
- Fix random autotune-service hang
- Fix random autotune server hang (#206)
- Bagua-net library path mismatch, make `--enable_bagua_net` argument style consistent with other args (#218)
- Fix compressed buffer that cannot be scattered to an odd number of ranks
- Handle conflicts caused by sklearn upgrade (#225)

### Features

- Add native async model average algorithm (#41)
- Add async model average algorithm (#110)
- Add cached dataset wrapper (#148)
- Support sync batchnorm (#151)
- Add `--enable-bagua-net` option in launcher (#183)
- Add pytorch examples for MNIST, ImageNet, SQuAD training (#1)
- Add requirements.txt, only download dataset on local rank 0 (#2)
- Initial commit of bagua core impl
- Add python packaging related files
- Only publish pypi for master commits
- Add __version__ variable
- Install nccl deps in bagua core and add generated __version__ variable
- Add version.py placeholder to prevent file not found error
- Initial support for python op (#2)
- Add 5 min timeout for buckets' comm op (#5)
- Replace NCCL with Aluminum (#7)
- Add synthetic benchmark script (#5)
- Add elastic training example (#7)
- Support alltoall_v (vector alltoall) (#14)
- Add reduce and allgather python interface
- Support reduce and allgather op with Reduction op enum
- Support creating BaguaTensor by passing torch tensor directly (#19)
- Compatible mode for getting pytorch tensor info with Python interpreter
- Better debug log including tensor info when executing ops
- Add native low precision decentralized operator (#26)
- Add (scatter, gather, scatter_reduce) and all inplace version communication primitives (#37)
- Make full precision decentralized op stateless (#36)
- Add communication_primitives example (#12)
- Support nccl 2.10 ncclAvg (#45)
- Use nccl 2.10 avg op for all algorithms using averaging (#46)
- Add opentelemetry to report tensor ready order (#42)
- Add deterministic flag (#15)
- Add native async model average algorithm (#41)
- Add examples for async model average algorithm (#14)
- Support packet splitting and multi-stream parallel transmission (#5)
- Support sync batchnorm (#151)
- Support ncclnet v3 and remove the dependency on nccl in the installation environment (#17)
- Add sync interval param to async examples (#19)
- Support tokio backend (#21)
- Support bagua-net (#89)
- Add `--enable-bagua-net` option in launcher (#183)
- Broadcast scalars for optimizers (#202)


## [0.7.0] - 2021-08-16
